llms.txt vs. JSON-LD Schema vs. Sitemap — Which AI File Does What?
All four files help AI systems find and understand your product — but they work at different layers and tell AI different things. You need all of them. Here's what each one does.
llms.txtDirect product brief for AI agents — tells LLMs what your product is, who it's for, what it does, and how much it costs.
JSON-LD Structured DataMachine-readable product facts embedded in your page.
sitemap.xmlIndex of all URLs on your site.
llm.jsonStructured JSON variant of llms.
llms.txtUp to 10 pts (AI Crawl Access + Agent Readiness)Direct product brief for AI agents — tells LLMs what your product is, who it's for, what it does, and how much it costs.
File details
What AI systems get
- Product name, tagline, and one-line description
- Target audience and key use cases
- Feature list
- Pricing summary
- Links to key pages
JSON-LD Structured DataUp to 7 pts (Structured Data & Meta category)Machine-readable product facts embedded in your page. The highest-fidelity structured signal available — tells AI systems your exact product name, category, description, URL, and pricing in a format they can parse unambiguously.
File details
What AI systems get
- Product @type (SoftwareApplication, Product, Organization)
- name, description, url fields
- applicationCategory
- offers (pricing)
- Breadcrumb hierarchy
- FAQ structured answers
sitemap.xmlUp to 7 pts (AI Crawl Access category)Index of all URLs on your site. Tells crawlers what pages exist, how often they change, and their relative priority — so AI systems don't miss important product pages.
File details
What AI systems get
- Complete list of URLs to crawl
- Last modified dates
- Change frequency hints
- Priority signals
llm.jsonSupplemental — contributes to overall AI agent readinessStructured JSON variant of llms.txt. Provides the same core product information in a machine-parseable format — useful for AI agents that prefer JSON over plain text.
File details
What AI systems get
- name, tagline, description
- category and category_claim
- target_audience array
- use_cases array
- features list
- pricing object
- links map
How they work together
sitemap.xml tells AI crawlers that your pricing page, features page, and docs exist — otherwise they might only crawl your homepage.
JSON-LD schema on your homepage and pricing page gives AI systems precise, unambiguous product facts that override any inference they might make from your content.
llms.txt gives AI agents a single-file product brief they can read before visiting any pages — a shortcut to accurate product understanding.
llm.json provides the same brief in a machine-parseable structured format for AI agents that prefer JSON over plain text.
Together, they ensure AI systems can: (1) find all your pages, (2) understand your product precisely, (3) describe it accurately in answers, and (4) recommend it confidently to users.
Frequently Asked Questions
Do I need all three — llms.txt, JSON-LD, and sitemap.xml?
Yes, ideally. Each serves a different purpose: sitemap.xml helps AI crawlers discover all your pages, JSON-LD gives AI systems precise machine-readable product facts, and llms.txt gives AI agents a direct product brief. They work together. If you can only do one, JSON-LD has the broadest impact across both SEO and AI visibility.
What does llms.txt tell AI that robots.txt doesn't?
robots.txt controls access (which URLs can be crawled). llms.txt provides content (product description, audience, features, pricing). They serve entirely different purposes and you should have both.
Does JSON-LD schema help ChatGPT and Perplexity?
Yes. JSON-LD (especially SoftwareApplication and Product schemas) gives AI systems precise, machine-readable product facts. Accurate JSON-LD reduces the chance of AI systems misrepresenting your product name, category, pricing, or description.
If I have a good sitemap, do I still need llms.txt?
Yes. A sitemap tells AI crawlers which URLs to visit — not what your product is. An AI crawler can follow every URL in your sitemap and still not know your product's name or pricing if it's buried in JavaScript-rendered content. llms.txt delivers that product information directly.
Generate all four files automatically
Scan your site and get llms.txt, llms-full.txt, llm.json, and JSON-LD schema — ready to deploy.