Do I Really Need LLMs.txt For AEO?

Large Language Models (LLMs) like ChatGPT, Google’s Bard/SGE, Perplexity, and Claude have completely changed how people discover information online. And with that shift, the SEO world is scrambling for ways to “show up” inside these AI answers.

One idea that’s been making noise lately is the llms.txt file — pitched as a kind of “robots.txt for AI.” The theory sounds tempting: drop a file at your site’s root, list out your best content in a neat format, and AI systems will magically pick it up and cite you.

But here’s the thing: I’ve dug into this, checked what’s being said, looked at server logs, and the reality doesn’t match the hype. No major AI system is actually using llms.txt to rank or cite content right now. To me, chasing this trend feels like a distraction from the things that actually move the needle in Answer Engine Optimization (AEO).

In this post, I’ll share why llms.txt isn’t the silver bullet some people want it to be, what Google’s John Mueller had to say about it, and where I think SEOs and content creators should really be focusing if they want to get noticed in AI-driven answers.

What Is LLMs.txt and Where Did the Idea Come From?

LLMs.txt is a proposed new standard (emphasis on proposed) for making web content more accessible to LLM-based AI assistants. The concept was introduced in late 2024 by Jeremy Howard (fast.ai) as a simple Markdown file at your website’s root (yourdomain.com/llms.txt) that provides an LLM-friendly guide to your content. Think of it as a hand-crafted sitemap for AI: it would highlight key pages or sections of your site in plain text, minus all the HTML navigation clutter, to help an AI “understand” the structure and find the main content.

You can read the full proposal on llmstext.org.

Despite the name similarity, LLMs.txt is not a crawler control file like robots.txt. It doesn’t block or allow bots from anything. Instead, it’s more like a menu or map of your high-value content that you want AIs to ingest and possibly cite. For example, you might list your most in-depth guides, product FAQs, or API docs in llms.txt with links to simplified (maybe Markdown) versions. The hope is that an AI agent reading your site during inference (answer generation) would see that file, go “Oh, here’s the important stuff,” and use it to provide better answers or attribute you as a source.

The idea gained traction among some technologists and SEOs in early 2025. Proponents likened it to giving AI a treasure map: “Here be gold – cite this content from my site.” Enthusiasts pointed out that the Agents to Agents (A2A) protocol from Google’s experimental projects even included an llms.txt example, hinting at future support. Some startups (like Mintlify and Profound) began advocating “Generative Engine Optimization (GEO)” and claimed that early data showed benefits to providing simplified content files (even an llms-full.txt containing full site text) for LLM consumption. On the surface, it sounds exciting – a new way to optimize for AI and get ahead of the curve.

However, it’s critical to note that LLMs.txt remains a voluntary proposal, not an official web standard. Unlike sitemaps or robots directives (which have RFCs and broad agreement), llms.txt is an idea still in flux. It was never about telling AI bots what not to do (as robots.txt does), but about giving them a helping hand to find content. And that leads to the big question: are any AI systems actually looking for that helping hand?

LLMs.txt vs Robots.txt vs Sitemap.xml

It’s easy to hear “LLMs.txt” and assume it functions like a robots.txt for chatbots. In reality, the analogy doesn’t hold. Robots.txt is an established standard used for decades by search engines (Google, Bing, etc.) to know which URLs they are allowed or disallowed to crawl. Major crawlers automatically request robots.txt from a site and obey its rules. To put a contrast on it, LLMs.txt has no such enforcement or widespread adoption. It’s not about controlling anything – it’s essentially a suggestion file with no guarantee any bot will ever read it.

To clarify the differences:

Feature / Purpose	Robots.txt	LLMs.txt	Sitemap.xml
Primary Role	Crawler exclusion protocol – controls which parts of a site bots can or cannot access	Curated content list – highlights “important” pages in a simplified format	URL index – provides a complete list of site URLs for search engines
Adoption & Standards	Long-standing web standard, universally supported by Google, Bing, etc.	Experimental proposal, no adoption by major LLMs (as of 2025)	Widely accepted standard; actively used by Google, Bing, and others
Enforcement	Mandatory: bots are expected to check and obey `Disallow/Allow` rules	Optional: no AI or LLM is required to read or follow it	Strongly encouraged: search engines routinely fetch and use it
Controls Access?	Yes – can block or allow crawling of specific paths	No – cannot block or allow access; does not prevent scraping	No – simply provides URLs for indexing; doesn’t block crawling
Format	Plain text file with rules	Markdown/text file with links to preferred content	XML file structured for search engine parsing
Current Use Cases	Managing crawl budget, preventing sensitive/irrelevant pages from being crawled	Theoretical shortcut for AI agents to “find” key content	Helping search engines discover and index all site pages
Real-World Impact	Direct and proven: determines what gets crawled or ignored	None: major AI systems don’t check or use it	High: improves indexing speed and coverage in search engines
Industry Analogy	Gatekeeper: “Don’t enter these rooms”	Pamphlet: “Here’s what I think matters” (AI can ignore it)	Directory: “Here’s the full map of the building”

In short, `llms.txt is not a magic switch for AI crawling or ranking. It doesn’t override how LLMs work internally. LLM answer engines don’t blindly follow webmaster directives the way search crawlers do – they pull information based on their training data and real-time search results, focusing on content quality and relevance. A file you host on your site telling them “here’s my best stuff” has no binding power. It’s more like leaving a pamphlet on your doorstep hoping a passing AI postman will pick it up – but no one’s sure the postman even walks that route.

Who Is Using `llms.txt`?

Let’s break it down—some organizations are rolling with llms.txt, but major AI platforms are staying on the sidelines.

Who are the early adopters?

Documentation platforms like Mintlify helped kick off adoption when they enabled llms.txt across their hosted docs (think Anthropic, Cursor), adoption jumped massively.
Developer-centric brands are showcasing real implementations:
- Anthropic – full, structured files covering API docs and prompt libraries
- Cloudflare – organizes heavy service documentation by product/service areas
- Zapier – lists API endpoint docs cleanly and hierarchically
- Stripe, Youform, Coinbase, Tiptap – all practical llms.txt users with distinct approaches (product pages, blog focus, SDKs, description pages)
Smaller names:
Pinecone, Windsurf, Svelte.dev, Rainbowkit, Wordlift, Hugging Face have rolled out llms.txt—often pushed via Mintlify integrations or custom setup.
Many others: A directory like [directory.llmstxt.cloud] lists hundreds of sites tapping into the standard—indicative of grassroots momentum rather than AI-driven demand.

Here’s where the hype hits the wall:

OpenAI (GPTBot) – still honors robots.txt, but does not support or fetch llms.txt
Anthropic (Claude) – publishes its own llms.txt, but has not confirmed that its crawlers actually use it
Google (Gemini/Bard) – manages AI crawl behavior via robots.txt (e.g. Google-Extended), and does not reference llms.txt
Meta (LLaMA) – no public crawler guidance, and no sign of llms.txt usage

But What About the Big AI Providers?

The short answer as of August 2025 is no. Despite the buzz, no major LLM provider or AI search platform has confirmed using llms.txt in their algorithms or crawlers. Google’s John Mueller stated this bluntly in June:

FWIW no AI system currently uses llms.txt.— John Mueller
(@johnmu.com) June 17, 2025 at 5:40 PM

He went on to explain that none of the consumer-facing LLMs or chatbots even bother to request the llms.txt file when they visit sites – something easily verified in server logs. If you put up an llms.txt today, you’ll likely find that GPTBot, Claude’s crawler, Perplexity’s bot – none of them ever download it. (They do routinely fetch your robots.txt, because that’s standard procedure, but they ignore llms.txt.)

It’s telling that Mueller compared llms.txt To the long-defunct keywords meta tag – a hint that it provides no real ranking value. Similarly, an SEO who hosted 20,000 domains reported that “no AI agents or bots are downloading the LLMs.txt files, apart from some niche user agents”. In other words, the only things hitting llms.txt might be curious SEO tools or web scanners, not the AI heavyweights themselves.

Proponents might argue that “support is coming, just wait.” Indeed, early articles in 2025 painted an optimistic picture that OpenAI, Anthropic, and others had “started referencing llms.txt when crawling sites”. Some companies even implemented it on their docs (Anthropic asked a vendor to set up llms.txt for their API documentation, for example) as experiments. But no public commitments have been made by OpenAI, Anthropic, Google, Meta, or Microsoft that their models will honor or use llms.txt in any official way. To date:

OpenAI – No indication that ChatGPT or GPT-4’s browsing looks for llms.txt. OpenAI’s own crawler (GPTBot) only adheres to robots.txt and similar standard tags for opt-outs.
Anthropic (Claude) – Aside from the one-off doc site test, no evidence that Claude’s retrieval system generally uses llms.txt. No public documentation on it.
Google – Google’s Search Generative Experience (SGE) uses Google’s index; it doesn’t send a bot to crawl at answer time but uses indexed content. Google has experimented conceptually (A2A protocol on GitHub includes an example file), yet Google’s official stance via Mueller is that nobody uses it and it’s not needed if you already have good sitemaps.
Bing/Microsoft – No mention of llms.txt support. Bing Chat and copilots draw from Bing’s indexed search results and trusted sources, not ad-hoc crawling of your site structure.
Perplexity – As an AI search engine, Perplexity uses Bing results and its crawling for specific queries. It does have a user-agent (PerplexityAI) that might hit sites, but there’s no indication it checks for llms.txt. Some findings suggest Perplexity’s bot doesn’t even strictly respect robots.txt in all cases, which implies it’s not looking for new, non-standard files either.

Even SEO plugin makers acknowledge the lack of uptake. Yoast and RankMath, the popular WordPress SEO plugin, added a feature to generate llms.txt for users (“to guide LLMs like ChatGPT to your most important content”), seroundtable.com – jumping on the trend. But right in the announcement of that feature, it’s noted that “no one is using it yet.” This “build it and they will come” approach might be future-proofing, or it might be putting the cart before the horse.

In short, despite the noise, none of the major LLM providers officially adopt or rely on llms.txt. Even when files exist on high-profile domains, they’re not being consumed in any meaningful way by AI systems—just servers or community trackers.

Creating a llms.txt can feel like future-proofing—but remember: there’s a world of difference between publishing a file and having an AI crawl or follow it. Today, use cases are mostly internal or documentation-driven, not AI visibility-driven. The standard is alive in code, not in practice.

So, is llms.txt actually useful?

I think, even if LLMs started reading it, llms.txt still wouldn’t be a game changer.

John Mueller said this pretty bluntly on Reddit:

“AFAIK none of the AI services have said they’re using LLMs.TXT (and you can tell when you look at your server logs that they don’t even check for it). To me, it’s comparable to the keywords meta tag – this is what a site-owner claims their site is about … (Is the site really like that? well, you can check it. At that point, why not just check the site directly?).”

And honestly, I agree with him. If none of the major AI services—OpenAI, Anthropic, Google—are even checking the file, then what’s the point? To me, llms.txt feels redundant. Why would an AI rely on a Markdown list of links when it can already crawl the actual content (and verify it’s not spam)?

Here’s my hypothesis on why it won’t deliver much even if adoption grows:

Content quality beats text files. AI assistants rank based on relevance and authority, not just because you hand them a curated list.
No enforcement. Unlike robots.txt, it’s optional. Bots can ignore it without consequence.
Easy to misuse. Anyone could stuff it with landing pages and call them “high quality.” AIs would still need to fact-check the actual site.
Not real-time. Most AI answers come from pre-indexed search data, not live crawling. A file sitting on your server won’t even be seen.
Training confusion. It doesn’t stop or allow model training—that’s still handled via robots.txt and opt-out policies.

So, in my view, llms.txt is less of a breakthrough and more of a comfort blanket for SEOs who want to feel they have some control over AI visibility. But visibility in answer engines will always come back to the same fundamentals: good content, crawlable sites, and authority.

Then, Why Is Everyone in SEO Talking About llms.txt?

Honestly, I think it comes down to hype and FOMO. Generative AI is shaking up search, and SEOs are desperate for anything that might get their content cited in ChatGPT, Gemini, Perplexity, or other Answer engines. So when llms.txt it showed up, people jumped on it “just in case” — the same way folks once scrambled for voice search or meta keywords.

It also gives a false sense of control. Traditional SEO had levers like keywords and links; AI feels random and opaque. Dropping a file at your root directory feels like doing something to influence AI, even if the bots aren’t listening.

Add in vendors hyping “LLM visibility scores” and tools like Yoast rushing to add llms.txt support, and you’ve got a perfect recipe for buzz. It sounds techy, it’s easy to set up, and it scratches that itch of wanting control.

But to me, this feels like déjà vu. John Mueller even joked that relying on it is like “maybe they’ll use it tomorrow? Maybe I’ll win the lottery tomorrow?” And he’s right. Whenever the SEO industry latches onto a supposed secret weapon without platform support, it usually turns out to be nothing more than a shiny distraction.

What do I think can help you rank in AI answers?

I don’t think chasing llms.txt is worth your time. If you really want visibility in ChatGPT, Bard, Bing Chat, Claude, or Perplexity, the playbook looks a lot like good SEO. The difference is you need to think less about “ranking in Google” and more about “being the source an AI trusts enough to quote.”

Here’s what I’d focus on instead:

Rank well in search. Most AI assistants still pull directly from Google or Bing’s top results. If you’re not ranking there, you’re not likely to show up in AI answers either.
Use structured data. Schema markup (FAQ, HowTo, Product, etc.) and clean semantic HTML help search engines (and by extension, AI systems that rely on them) understand your content faster and more accurately.
Optimize for entities. AI cares about connections. Make sure your content and brand are linked to relevant entities in knowledge graphs, Wikipedia, and authoritative sources. It’s about proving authority, not just claiming it.
Answer real questions. Users ask AIs in natural language. Write clear, direct answers to those questions on your site. Think FAQ-style, not fluff. If your content buries the answer, the AI will likely quote someone else.
Fix technical SEO. Crawlability, speed, and accessibility matter. If an AI bot times out or can’t parse your page, it won’t use it. Keep your site fast, clean, and bot-friendly.
Build credibility. AIs prefer citing trusted sources. Backlinks, mentions, expert bios, and updated content all send the right signals. Authority is everything here.
Monitor and refine. Check which sites AI tools are citing for your target queries. If competitors are getting the mentions, study why and adjust your content. This is early days for AEO, so iteration matters.

In my view, this is what real Answer Engine Optimization looks like: doing the hard work of building content that’s clear, authoritative, and easy for machines to interpret. No shortcuts, no gimmicky text files — just solid fundamentals, adapted to the way AI systems fetch and cite information.

Is it going to be bad if I add `llms.txt` on my website?

I won’t say it’s “bad.” Adding an llms.txt file won’t hurt your site in any way. It’s just a simple Markdown or text file sitting at your root directory — search engines and AI crawlers will ignore it if they don’t care about it.

The real issue isn’t risk, it’s false expectation. Don’t expect traffic boosts, citations, or better AI visibility just because you added it. Think of it as harmless busywork: it won’t break anything, but it also won’t move the needle for AEO.

Conclusion

The buzz around llms.txt is a good reminder that in SEO — and now AEO — there are no shortcuts. We’ve seen this before with meta keywords, quick-fix tags, and other fads that never delivered. A text file isn’t going to suddenly make ChatGPT or Perplexity cite you. Not today, and probably not tomorrow.

For now, the real play is the same as it’s always been: create high-quality content, make it easy for machines to parse, and build authority that AI systems can trust. If you’re doing that, you’ll naturally end up in the pool of sources LLMs pull from. If you’re not, no llms.txt file will save you.

So my advice? Don’t waste cycles on placebo tactics. Put your energy into becoming the best answer in your niche — because that’s what answer engines are looking for.

And hey, if llms.txt does magically become a standard a year from now, you can spin one up in five minutes. But at least by then, you’ll already have the real foundation that matters.

Will LLMs.txt Help My Site Show Up in ChatGPT or Perplexity?

What Is LLMs.txt and Where Did the Idea Come From?

LLMs.txt vs Robots.txt vs Sitemap.xml

Who Is Using `llms.txt`?

Who are the early adopters?

But What About the Big AI Providers?

So, is llms.txt actually useful?

Then, Why Is Everyone in SEO Talking About llms.txt?

What do I think can help you rank in AI answers?

Is it going to be bad if I add `llms.txt` on my website?

Conclusion

People Also Ask:

Does LLMs.txt actually work? Has anyone seen real results?

Do AI crawlers (ChatGPT, Perplexity, Claude, Gemini) actually read or obey LLMs.txt?

Is LLMs.txt like robots.txt? Can it control or block AI training and crawling?

If it’s easy to add, should I add LLMs.txt anyway?

What should I do instead to rank in AI answer engines like ChatGPT and Perplexity?

On This page

Take a Pause with Intervals

A Sunday letter on building, writing, and thinking deeper as a developer — short, honest, and worth your time.

Related Posts

Will LLMs.txt Help My Site Show Up in ChatGPT or Perplexity?

What Is LLMs.txt and Where Did the Idea Come From?

LLMs.txt vs Robots.txt vs Sitemap.xml

Who Is Using llms.txt?

Who are the early adopters?

But What About the Big AI Providers?

So, is llms.txt actually useful?

Then, Why Is Everyone in SEO Talking About llms.txt?

What do I think can help you rank in AI answers?

Is it going to be bad if I add llms.txt on my website?

Conclusion

People Also Ask:

Does LLMs.txt actually work? Has anyone seen real results?

Do AI crawlers (ChatGPT, Perplexity, Claude, Gemini) actually read or obey LLMs.txt?

Is LLMs.txt like robots.txt? Can it control or block AI training and crawling?

If it’s easy to add, should I add LLMs.txt anyway?

What should I do instead to rank in AI answer engines like ChatGPT and Perplexity?

On This page

Take a Pause with Intervals

A Sunday letter on building, writing, and thinking deeper as a developer — short, honest, and worth your time.

Related Posts

How to Document GraphQL APIs for Developers

How to Build AI Agents with LangGraph: A Step-by-Step Tutorial (2026)

How to Use the Diátaxis Framework for Developer Docs

How to Write Error Messages That Help Developers Debug Faster

How I Built an Automated Local Business Lead Finder with n8n and SearchAPI

How to Use Terraform to Provision an AWS EC2 Instance

Who Is Using `llms.txt`?

Is it going to be bad if I add `llms.txt` on my website?