llms.txt is a community-proposed Markdown file at your site root (/llms.txt) that gives AI models a clean, curated map of your most important content. It was proposed by AI researcher Jeremy Howard, not by a standards body like the W3C or IETF, and it remains an unofficial convention as of 2026. Unlike robots.txt, which controls crawler access, llms.txt is purely advisory: it aims to help models comprehend your site, not to grant or deny entry. It is cheap and low-risk to publish — but there is no public, controlled evidence that adding it measurably increases how often AI engines cite you.
What is llms.txt, and what goes in it?
llms.txt is a plain-text, Markdown-formatted file placed at the root of a domain (for example, /llms.txt) that acts as a curated, human-readable guide to your site's most important content for large language models. A typical file opens with an H1 site name, a short blockquote summary, and then sections of links — each a page URL with a one-line description — so a model can quickly understand what your site covers and where the canonical pages live.
A companion file, llms-full.txt, goes further by including the full text of those key pages inline, so a model can ingest the actual content rather than just a map of links. The stated design goal is comprehension at inference time: where a sitemap lists URLs for discovery, llms.txt tries to give a model a clean structured overview of your site when it generates an answer.
The convention was proposed by Jeremy Howard in 2024 and has since been documented by tooling and SEO vendors including GitBook and Semrush. It is worth being precise about its status: it is a community proposal, not an adopted standard, and there is no protocol that compels any AI system to fetch or honor it.
- An H1 with your site name and a one- or two-sentence blockquote summary.
- Sections (for example Guides, Tools, API) each holding a short list of links.
- One line per link: the page URL followed by a brief description of what it covers.
- Optionally, a companion llms-full.txt that inlines the full text of those pages.
How is llms.txt different from robots.txt and sitemap.xml?
The three files solve different problems. robots.txt controls crawler access — which user agents may fetch which paths. sitemap.xml lists URLs to help crawlers discover and index pages. llms.txt is optimized for AI comprehension and semantic understanding of your content, not access control and not discovery.
The popular analogy that llms.txt is a robots.txt for AI is widely repeated but technically imprecise. Compliant crawlers honor robots.txt directives as a well-established convention; llms.txt has no known enforcement mechanism and is purely advisory. Publishing one does not block, allow, or require anything — it simply offers a model a tidy summary if it chooses to look.
Practically, that means llms.txt is additive: it sits alongside robots.txt and your sitemap rather than replacing either. If your robots.txt blocks the AI bots, a perfect llms.txt changes nothing, because access is the gating factor, not comprehension.
Does llms.txt actually improve AI visibility?
Be skeptical of strong claims here. As of 2026, no source provides a controlled study demonstrating a causal link between publishing llms.txt and increased AI citation frequency. Ahrefs has gone as far as calling it a solution in search of a problem, and other analysts (including SE Ranking) have voiced similar skepticism about measurable impact.
The positive anecdotes circulating online are mostly single-vendor and unverified. One frequently cited case study (Springs Apps) reports a 20% increase in search visibility and a 15% improvement in accurate AI answers after adding llms.txt, but that figure is not corroborated by any independent source and should not be treated as generalizable. Crucially, no public documentation confirms that GPTBot, ClaudeBot, or PerplexityBot actively fetch and act on llms.txt — the consumption behavior is assumed, not proven.
What is real is adoption interest: by April 2026, companies including Anthropic, Stripe, Zapier, Cloudflare, Vercel, and Hugging Face had published llms.txt on their domains. That signals the convention has momentum among technical teams — but adoption by notable companies is not the same as evidence that it changes citation outcomes.
Who benefits most from llms.txt today?
The clearest fit is documentation. GitBook notes that llms.txt is especially useful for documentation sites with frequently changing content, multiple sections, or REST and GraphQL API references, because the file can point models at canonical endpoints and versioned paths instead of leaving them to guess.
There may also be a small, indirect signal: Ahrefs data on one site (Redbus) showed the llms.txt page itself receiving search clicks, which some practitioners argue feeds additional context to AI platforms via retrieval pipelines. Treat this as a minor, second-order effect rather than a reason to expect a visibility jump.
For a typical marketing site or blog with a handful of pages, the upside is modest. The file is still cheap to maintain, but it is unlikely to be the thing that determines whether you get cited.
How do you create and deploy an llms.txt file?
Start by listing the pages you most want a model to understand — your pillar guides, key product or docs pages, and high-value references. Write an H1 with your site name, a one- or two-sentence blockquote summary of what the site is, then group the links into sections (for example, Guides, Tools, API) with a short description after each link. Keep it concise and current; a stale map is worse than none.
Place the file at your domain root so it resolves at /llms.txt, and optionally generate an llms-full.txt with the full text of those key pages for models that prefer inline content. Confirm both are reachable and return plain text. This site publishes its own llms.txt and llms-full.txt as a working example.
If you would rather not hand-write it, you can generate one from your content automatically — and you can confirm whether your site already serves one (and check the rest of your AI-crawlability signals) with our free crawlability checker.
- 1List the pages you most want a model to understand (pillar guides, key docs, references).
- 2Write the H1, blockquote summary, and sectioned links with one-line descriptions.
- 3Place the file at /llms.txt and confirm it returns plain text.
- 4Optionally publish /llms-full.txt with the full content inline.
- 5Keep it current — a stale map is worse than none.
Should you bother? A pragmatic recommendation
The reasonable stance for most sites: llms.txt is low-cost and low-risk, which means publishing one is fine, but do not expect it to be decisive and do not let it crowd out the fundamentals. The things that actually determine whether AI engines can read and cite you are access and rendering — whether your robots.txt allows the AI bots, and whether your content is in the static HTML.
Fix those first. Make sure the retrieval bots are allowed in your robots.txt, serve your content server-side, and structure each page so the answer is easy to extract. Then add llms.txt as a tidy bonus, not a magic lever.
For the full picture of what makes a site visible to AI answer engines, read the complete AI crawlability guide, and use the checker to see where your site stands today.
What are the key takeaways?
llms.txt has been a community proposal since 2024 — not a standard, and not backed by controlled citation evidence — so these five points capture what actually matters for AI visibility.
- llms.txt is a 2024 community proposal by Jeremy Howard, not a W3C or IETF standard.
- It aids comprehension; robots.txt controls access — the two are not interchangeable.
- No controlled study shows it increases AI citations, so treat it as low-risk hygiene.
- By April 2026, Anthropic, Stripe, Zapier, Cloudflare, Vercel and Hugging Face had published one.
- Documentation-heavy sites benefit most; fix robots.txt and rendering first.
Frequently asked questions
Is llms.txt an official standard?+
No. llms.txt is a community proposal introduced by Jeremy Howard, not a standard ratified by the W3C, IETF, or any other standards body. As of 2026 it remains an unofficial convention with no enforcement mechanism.
Do ChatGPT, Claude, or Perplexity actually read llms.txt?+
There is no public documentation confirming that GPTBot, ClaudeBot, or PerplexityBot fetch and act on llms.txt. The idea that these models consume it is an assumption, not a documented behavior, so treat any visibility claims with caution.
Does llms.txt improve SEO or AI citations?+
No controlled study shows that publishing llms.txt measurably increases AI citations, and some analysts are openly skeptical. Treat it as low-risk hygiene rather than a ranking or citation lever.
What is the difference between llms.txt and robots.txt?+
robots.txt controls crawler access and is honored by compliant crawlers; llms.txt is advisory and aims to help models comprehend your site. robots.txt decides who can read you, llms.txt only offers a summary if a model chooses to use it.
What is llms-full.txt?+
llms-full.txt is a companion file that includes the full text of your key pages inline, rather than just linking to them. It lets a model ingest the actual content in one fetch, which can suit documentation-heavy sites.
Sources
- What Is llms.txt, and Should You Care About It? (Ahrefs)
- What Is LLMs.txt & Should You Use It? (Semrush)
- What is llms.txt? Why it matters and how to create it (GitBook)
- What is llms.txt? A guide to better AI optimization (Wix)
- LLMs.txt: Why Brands Rely On It and Why It Doesn't Work (SE Ranking)
- llms.txt and AI Visibility: Results from a GEO Study (Otterly.ai)
- What Is LLMs.txt? & Do You Need One? (Neil Patel)
- What Is LLMs.txt? Plus, Why You Need It On Your Site (AIOSEO)