GEO Scoring Categories

Glippy evaluates websites across 16 categories, each measuring a different aspect of AI and LLM readiness.

What is GEO?

Generative Engine Optimization measures how well your website is prepared for AI crawlers, LLM-powered search engines, and AI agents. Higher GEO scores mean AI systems can better understand, cite, and interact with your content.

How Scoring Works

  • Each category produces a score from 0-100
  • The overall score is calculated from all category scores

Letter Grades

A+ 90-100 A 80-89 B 70-79 C 60-69 D 40-59 F 0-39

Category Details

1 Structured Data & Schema

Measures the presence and quality of machine-readable structured data that helps AI systems understand your content's meaning and context.

What we check

  • JSON-LD presence and validity
  • Schema.org types (FAQPage, Article, Product, HowTo, etc.)
  • Speakable markup for voice assistants
  • Schema validation and completeness

Why it matters

Structured data is the primary way AI systems extract facts and relationships from web pages. High-quality schemas dramatically improve how AI cites and represents your content.

2 Semantic HTML

Evaluates the use of meaningful HTML elements that convey document structure and content hierarchy to AI parsers.

What we check

  • Proper heading hierarchy (H1-H6)
  • Semantic elements (<article>, <nav>, <main>, <section>)
  • Content-to-markup ratio
  • Logical document outline

Why it matters

AI systems parse HTML structure to understand content relationships. Proper semantic markup helps LLMs identify main content, navigation, and content sections.

3 Accessibility for Agents

Checks accessibility features that help both human assistive technologies and AI agents understand your content.

What we check

  • Lang attribute on HTML element
  • Alt text on images
  • ARIA labels and roles
  • Descriptive link text (not "click here")

Why it matters

Accessibility features provide crucial context that AI systems use to understand images, interactive elements, and content purpose. What's good for screen readers is often good for AI.

4 Internal Linking

Analyzes how well your site's pages are connected and how clearly the link structure conveys content relationships.

What we check

  • Link density and distribution
  • Navigation structure clarity
  • Breadcrumb markup
  • Related content links

Why it matters

AI crawlers use internal links to discover content and understand topical relationships. Clear linking helps AI systems map your site's knowledge graph.

5 Meta & Discoverability

Evaluates metadata that helps AI systems understand and properly attribute your content.

What we check

  • Title tag presence and quality
  • Meta description
  • Canonical URL
  • Open Graph and Twitter Card tags
  • Hreflang for internationalization

Why it matters

Metadata provides context that AI uses when summarizing, citing, and categorizing your content. Good metadata improves how AI represents your pages.

6 Machine Readability

Measures technical factors that determine whether AI systems can access and process your content.

What we check

  • Server-side rendering (SSR) vs. client-only JS
  • Bot blocking mechanisms
  • Robots.txt rules for AI crawlers (GPTBot, ClaudeBot, etc.)
  • llms.txt file presence and contents

Why it matters

If AI crawlers can't access or render your content, nothing else matters. This category ensures your technical setup doesn't block AI systems.

Allow-list GlippyBot too

If you're auditing your own site with the MCP server, make sure your WAF and robots.txt aren't blocking GlippyBot itself. See the GlippyBot allow-list guide.

7 Entity & Authority

Checks for signals that help AI systems understand who created the content and establish trustworthiness.

What we check

  • Author information and bylines
  • Publication dates
  • Organization schema
  • About and contact information
  • E-E-A-T experience signals (first-hand usage, dates, locations)
  • Credential mentions (certifications, qualifications, titles)
  • Editorial policy page or disclosure
  • Contact completeness (email, phone, physical address)

Why it matters

AI systems increasingly consider source authority when deciding what to cite. Clear authorship, credentials, and organizational identity help establish trust.

8 Citability & Answer-Readiness

Evaluates how well your content is formatted for AI systems to extract and cite specific answers.

What we check

  • FAQ content with clear Q&A format
  • Data tables with headers
  • Well-structured lists
  • Lead paragraph quality (answering who/what/when/where)

Why it matters

LLM-powered search engines prefer content they can easily quote. FAQ sections, tables, and clear lead paragraphs make your content more likely to be featured in AI answers.

9 Performance & Crawlability

Measures factors affecting how efficiently AI systems can crawl and process your pages.

What we check

  • Image dimensions specified in HTML
  • Lazy loading implementation
  • Resource hints (preload, prefetch)
  • Page size and complexity

Why it matters

While less critical than content quality, performance affects crawl efficiency. Faster, lighter pages are more likely to be fully indexed.

10 Agent Interactivity

Evaluates support for AI agents that can take actions on behalf of users.

What we check

  • WebMCP tools and endpoints
  • Form annotations and labels
  • Agent-callable actions
  • API documentation presence

Why it matters

This is an emerging category as AI agents become more capable. Sites that support programmatic interaction will have advantages as agent-based browsing grows.

11 Content Positioning

Measures how clearly your content distinguishes you from competitors and backs claims with proof.

What we check

  • Brand differentiation language (what makes you different)
  • Proof points (numbers, case studies, evidence)
  • Social proof (testimonials, reviews, logos, press)
  • Unique value propositions beyond generic claims

Why it matters

When LLMs are asked to recommend or compare products, they pull the concrete, differentiating details they can find. Pages that read like every other page in the category do not get quoted.

12 Content Freshness

Checks for signals that tell AI systems when content was written or last updated.

What we check

  • Publication and modification dates in HTML and schema
  • Apparent age of content based on temporal references
  • Temporal language ("in 2025", "last quarter", "recently")
  • Stale-date warning signals (years-old references to "latest" things)

Why it matters

AI answer engines weight fresh content higher for time-sensitive questions. Without clear date signals, LLMs will either skip your page or attach low confidence to its claims.

13 Information Density

Evaluates how much substantive content sits between the filler, scaffolding, and promotional text.

What we check

  • Substantive-to-filler ratio across the page
  • Number of meaningful content sections
  • Claim-evidence pairing (assertions backed by specifics)
  • Foam detection (generic, content-free marketing phrases)

Why it matters

LLMs extract citable facts per token they process. Dense, specific pages yield more citations than long pages padded with hedging and generic copy.

14 Factual Verifiability

Checks whether your claims can be independently verified by an AI system.

What we check

  • Citations and source links
  • Attribution for statistics and quotes
  • Methodology disclosure for studies or data
  • Links to primary sources, not just internal pages

Why it matters

Verifiability is becoming a direct ranking signal for AI answer engines. Unsourced claims are increasingly skipped in favor of pages that show their work.

15 Content Comprehensiveness

Measures whether the page covers the topic fully enough to answer follow-up questions.

What we check

  • Word count relative to topic scope
  • Heading coverage across subtopics
  • Definitions of key terms
  • Comparisons, alternatives, and edge cases

Why it matters

LLMs prefer pages that already answer the likely next question. Thin pages get cited only for the headline; comprehensive pages become the go-to source for a whole topic.

16 Multimodal Content

Evaluates how accessible your non-text content is to AI systems that increasingly process images, video, and audio.

What we check

  • Image alt text quality and coverage
  • Figure and caption markup
  • Video and audio presence with transcripts or captions
  • Inline SVG with accessible titles
  • Schema for multimedia (VideoObject, ImageObject)

Why it matters

Multimodal LLMs can now interpret images and video directly. Pages with rich, well-described media surface in a growing set of queries that text-only pages cannot.