Structured data is not enough: why AI search needs a memory layer
Structured data is not enough: why AI search needs a memory layer
Why it matters
Schema.org JSON-LD markup - the backbone of modern SEO - barely moves the needle for AI retrieval. AI systems read pages as a continuous text stream, so hidden structured data gets truncated or diluted. The real unlock is making your knowledge graph visible on the page surface, which improved AI answer accuracy by up to 34%. This reframes SEO from “tag it and hope” to “make your knowledge reasonably navigable by machines.”
Key findings
- Schema.org JSON-LD alone provides only minimal improvements for AI retrieval - it competes with page content and gets lost in the text stream
- “Enhanced Entity Pages” - surfacing knowledge graph data directly in page content - improved AI answer accuracy significantly:
- Standard retrieval systems: +29,6%
- Multi-step autonomous systems: +30%
- Most optimised pipeline: +34%
- AI systems ingest pages as continuous text, so JSON-LD buried in
<head>or<script>tags gets truncated or diluted during chunking - The knowledge graph must become a visible “memory layer,” not hidden code
Enhanced entity pages - what they look like
- Surface entity properties in natural language prose (not just markup)
- Expose connections between entities via internal links
- Provide navigational affordances (breadcrumbs, related entities, structured sections)
- Add contextual instructions that help AI systems reason over the content
Three layers of AI visibility
- Citations (inventory) - AI can find and cite your content
- Reasoning (operability) - AI can reason over your knowledge, connect facts, answer multi-hop questions
- Actions (agentic) - AI can act on your content (book, purchase, query)
The reasoning web (SEO 3.0)
- The new question is not “can Google find this?” but “can AI reason over your knowledge?”
- Linked Data architecture + content negotiation: one URI serves multiple representations (HTML for humans, RDF/JSON-LD for machines)
- This is a shift from optimising for ranking to optimising for machine comprehension
Research reference
- Paper: arXiv:2603.10700
Actionable takeaways
- Stop relying on JSON-LD alone for AI discoverability - it is necessary but not sufficient
- Build entity pages that surface structured knowledge in readable prose
- Connect entities through internal links and navigational structure
- Think in terms of “can an AI reason over this?” not “does it have the right schema?”
- Audit existing content for the three visibility layers: citations, reasoning, actions
- Consider Linked Data / content negotiation architecture for serving both human and machine representations