All resources

Guide

What Is llms.txt? The AI-Readability Standard, Explained (2026)

llms.txt is a proposed standard — a Markdown file at your site's root that gives AI models a clean, curated map of your most important content. Proposed by Jeremy Howard of Answer.AI in 2024, it has spread to roughly 5.6% of the top 10,000 sites by mid-2026. But its real-world value is contested: Google says it doesn't use the file and has no plans to, and the clearest evidence of genuine use today is on developer docs read by coding agents.

What Is llms.txt? The AI-Readability Standard, Explained (2026)

llms.txt is a proposed web standard: a single Markdown file placed at the root of your site (at /llms.txt) that gives large language models a clean, curated map of your most important content. Think of it as a reading guide written for machines — not a wall, but a welcome mat that points an AI to the pages you most want it to understand and cite.

It was proposed by Jeremy Howard, co-founder of Answer.AI, in September 2024 as "a proposal to provide information to help LLMs use websites." The official specification at llmstxt.org keeps it deliberately simple: the only required element is an H1 with your project or brand name, followed by an optional one-line summary in a blockquote and H2-delimited lists of links to your key pages. It is designed for inference time — the moment an AI is answering a question — not for training.

How is llms.txt different from robots.txt?

This is the most common point of confusion, and the distinction matters.

FilePurposeWhat it controls
robots.txtAccess controlWhich crawlers may or may not fetch which URLs
llms.txtContent curationA clean, prioritized map of your best content for AI to read
llms-full.txtFull-text bundleYour entire core content concatenated into one Markdown file

robots.txt tells bots where they may go. llms.txt blocks nothing — it simply hands an AI a tidy index so it doesn't have to fight through your navigation, scripts, and boilerplate to find what matters. The two are meant to coexist alongside your sitemap.xml. There is also a companion file, llms-full.txt, which Mintlify says it developed in collaboration with Anthropic, containing your full plain-text content in one document for models that want everything at once.

Is anyone actually using it?

Adoption is climbing fast, but from a small base. As of June 2026, llms.txt was present on 5.61% of the top 10,000 websites, up from just 1.04% a year earlier — roughly 5.4x growth in twelve months. Much of that surge is platform-driven rather than organic: Shopify silently pushed llms.txt to every store by default in spring 2026, taking adoption across Shopify sites to over 78%.

The critical question is whether the major AI engines read it. Here the evidence is sobering. Google's John Mueller said no AI services have confirmed using llms.txt, comparing it to the long-ignored keywords meta tag — a self-declared signal anyone can write. Gary Illyes later confirmed Google "doesn't support llms.txt and isn't planning to," though server logs suggest OpenAI's crawler may occasionally check for the file.

Where llms.txt clearly earns its keep is developer documentation. The file is consumed mainly by coding and IDE agents — Cursor, Windsurf, Claude Code, GitHub Copilot — pointed at docs, which is why the developer-doc sites of Anthropic, Perplexity, Mistral, and Cohere host one even though their consumer front doors do not.

Should you add llms.txt in 2026?

Our honest take: yes if it's cheap, but with clear eyes about why. A minimal, accurate llms.txt takes about fifteen minutes to write, the downside risk is essentially zero, and adoption momentum is real. If you publish technical documentation or an API, the case is strong — that's exactly the content coding agents read today. For a general marketing site, treat it as a low-cost hedge, not a growth lever.

What it is not is a substitute for the work that demonstrably moves AI visibility: well-structured, crawlable, genuinely authoritative content. That foundation — covered in our guide to whether your site is even letting AI crawlers in and our primer on Answer Engine Optimization — is what actually gets you cited. llms.txt is a small, sensible addition on top of it, and it's the kind of detail our SEO & AI search team and web development team handle as part of making a site genuinely machine-readable.

Sources

FAQ

Quick
answers.

No. Gary Illyes said Google doesn't support it and has no plans to, and John Mueller compared it to the long-ignored keywords meta tag — a self-declared signal anyone can write.

Keep reading

Go deeper.

Guide

AI Crawlers 101: Is Your Site Even Letting GPTBot and ClaudeBot In?

If you want to show up in ChatGPT, Claude, or Perplexity, two things have to be true: the AI crawlers must be allowed in your robots.txt, and your content must exist in the raw HTML they fetch.

Guide

What Is Generative Engine Optimization (GEO)?

Generative Engine Optimization (GEO) is the practice of structuring your content so generative AI engines (ChatGPT, Google AI Mode, Perplexity, Gemini) cite, quote, and recommend your brand inside their answers.

Guide

Why Your Brand Isn't Showing Up in ChatGPT (and How to Fix It)

If your brand never appears in ChatGPT, it usually comes down to three fixable causes: each AI engine cites differently and you are optimizing for the wrong one, your brand search volume is too low to register as a known entity, or your site is technically unreadable to AI crawlers.

Glossary

llms.txt

llms.txt is a proposed plain-text Markdown file placed at a site's root to give large language models a concise, curated guide to its most important pages.

Glossary

Structured Data

Structured data is standardized code, usually schema.org vocabulary added in JSON-LD, that describes a page's content so search engines can understand it and display rich results such as reviews, FAQs, products, events, and breadcrumbs.

Glossary

Retrieval-Augmented Generation (RAG)

Retrieval-augmented generation (RAG) is an AI technique that retrieves relevant information from an external knowledge source at query time and feeds it to a large language model as context before it generates an answer.

Glossary

AI Search / Answer Engines

AI search refers to search experiences that return a synthesized, often cited answer instead of a list of links — including Google AI Overviews and AI Mode, ChatGPT, Perplexity, and Copilot.

Your growth starts here

Let's build the
growth engine.

Tell us where growth is stuck. We'll show you what one integrated team can move — and how fast.