Crawlability 8 min read

llms.txt in 2026: do AI crawlers actually read it?

Multiple independent server-log studies land on the same answer: the AI search crawlers almost never request your llms.txt. Yet Google PageSpeed Insights just started checking that you have one. Both are true, and here is what to do about it.

Last updated July 5, 2026

Key takeaway

As of mid-2026, the major AI search crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended) almost never fetch llms.txt. Independent server-log studies put real-world consumption near zero. Ship it anyway if you publish docs or developer tools, because that is the one reader genuinely using it today, and because Google PageSpeed Insights now audits for it.

Independent log studies agree: AI search bots request llms.txt in well under 1% of AI visits, and several logged none at all.
Google contradicts itself: Search says you don’t need llms.txt, while Google’s own Lighthouse and PageSpeed Insights started auditing for it in May 2026. The momentum is on the agent layer, not search ranking.
The real consumer is developer tooling: Cursor, Claude Code and Copilot fetch it to load your docs cleanly, and that works right now.
It is cheap and risk-free, so shipping one is a fair bet. It is just not the citation lever the hype sells.

Mostly, no. As of mid-2026, the AI search and answer crawlers people most want to reach (GPTBot, ClaudeBot, PerplexityBot, and Google’s AI bots) almost never request your llms.txt. That is not a guess; it is what independent server logs keep showing, study after study. And yet the file is gaining ground in an unexpected place: Google’s own PageSpeed Insights now checks whether you have one. Both things are true, and this guide keeps them straight, so you know exactly what an llms.txt does and does not do for you today. For the wider shift this file is now part of, see what agentic browsing is.

01 · The evidenceWhat the server logs actually show

The honest test is simple, and several teams have run it: drop a valid llms.txt at your root, then watch your access logs for known AI user agents and count who actually requests it. The results line up to an uncomfortable degree.

Who logged it	Sample	What they found
OtterlyAI	90 days, 60,000+ AI bot hits	`llms.txt` requested in 0.1% of AI visits
Search Engine Land	Server logs, Aug to Oct 2025	Zero hits from GPTBot, ClaudeBot, PerplexityBot or Google-Extended
AEM CDN audit	1,000 domains, 30 days of raw CDN logs	No GPTBot, ClaudeBot or PerplexityBot at all; Googlebot was 95% of hits
limy.ai	500M+ AI bot visits, 90 days	408 requests to `llms.txt`, which it calls negligible

A few patterns repeat across all of them. Traditional crawlers like Googlebot and Bingbot touch the file occasionally. The dedicated answer-engine bots mostly do not, and in several logs never appear against the file at all. Meta’s and Microsoft’s AI crawlers are the most likely to show up, and even they tend to prefer structured files over plain text.

0.1%

of AI bot visits requested llms.txt over a 90-day window of more than 60,000 hits, in OtterlyAI’s server-log study. The file was correctly installed. The crawlers simply did not ask for it.

The takeaway is not that the file is broken. It is that, right now, the open-web answer engines are not its audience.

02 · The mixed signalWhat Google actually says about llms.txt

It depends which part of Google you ask, and in 2026 the two halves stopped agreeing.

Google Search is blunt. In a May 2026 optimisation guide it listed llms.txt among the tactics you do not need for its generative AI features, grouping it with other GEO folklore. Its search advocates had already said for months that Google does not use the file, with one likening it to the long-discredited keywords meta tag. No major AI lab (OpenAI, Anthropic, Meta, Mistral) has publicly committed to reading it in production either. The answer engines are steered by robots.txt, which they honour. llms.txt is not part of that contract.

Then, the same month, Google’s Chrome team pulled the other way. Lighthouse 13.3, the engine behind PageSpeed Insights, shipped a new Agentic Browsing audit category in its default configuration, and one of its checks is for a valid llms.txt: it verifies the file exists, is reachable without server errors, and is markdown with at least one H1, pointing site owners at the official spec. Because PageSpeed Insights inherits Lighthouse within a couple of weeks, every site owner who runs a report now gets flagged for a missing or malformed llms.txt, whether they wanted the signal or not.

Search ranking and agent readiness are different layers

Google Search ignoring llms.txt and Google’s Lighthouse auditing for it are not actually a contradiction. One is about ranking pages in search results, which the file does not affect. The other is about how well an AI agent can read and act on your site, which is exactly what the file was designed for. Same company, two different jobs.

So is Google for it or against it? Both, and coherently so once you separate the layers. Search ranking does not use llms.txt. Agentic browsing, where an AI agent reads and acts on your site, is a different surface, and that is where Lighthouse frames the file as a discoverability and efficiency signal: without it, its documentation notes, agents spend longer crawling your pages just to work out your structure. That distinction is the whole point of this article. The file’s future is the agent layer, not the search results page.

It is also a tell about where this is heading. Lighthouse bundles the llms.txt check with audits for WebMCP, the accessibility tree, and layout stability, which together read as a first attempt at scoring whether your site is built for AI agents at all. That bundle is Chrome’s new Agentic Browsing category, and it maps to what we call agent readiness: llms.txt is one row in it.

03 · The real use caseThe reader that does use it

Here is the part the doom takes miss. llms.txt has a genuine, working consumer today: the AI tools your technical users already run.

When a developer points Cursor, Claude Code, or GitHub Copilot at your documentation, those tools fetch llms.txt to load your docs as clean markdown instead of parsing navigation, scripts, and styling out of raw HTML. It keeps them inside their token budget and makes their suggestions about your product more accurate. This is why documentation platforms like Fern, and framework sites such as Next.js and Tailwind, now ship one as a matter of course. It is plumbing for AI-assisted development, not a search-ranking play.

The file works. It just works for the assistant inside your user’s editor, not the answer engine out on the open web.

· The reframe that matters

That reframes the whole question. The audience for llms.txt is not, yet, the open-web answer engine. It is the assistant sitting in your user’s editor. If that user is a developer, the file is doing real work the moment you publish it. You can push it further with an instructions section, a pattern Stripe is often credited with, that tells assistants how to describe your product and steers them away from deprecated APIs. That is influence over what the tool says about you, which is the actual prize.

04 · The decisionSo should you ship one?

Yes, in almost every case, but go in with the right expectation. Find your row:

If you are	What llms.txt does for you	Worth it?
A docs, API, or developer-tools product	A real retrieval channel for Cursor, Claude Code and Copilot, working now	Yes, clear benefit today
A general product hoping for AI citations	Cheap future-proofing, plus a passing PageSpeed audit; no citation lift yet	Yes, but keep expectations low
Hoping it replaces SEO, indexing, or blocking	Nothing useful: it is not a ranking signal, a sitemap, or an access control	No, fix the fundamentals first

The throughline: ship llms.txt because it is cheap, harmless, useful to AI coding tools, and now flagged by Google’s own PageSpeed Insights, not because it will get you quoted in ChatGPT this quarter. And do not let it crowd out the work that actually drives AI visibility, which is the same work that drives everything else: pages a machine can parse, content worth citing, and enough presence that you actually get indexed and mentioned in the first place.

Free · 30 seconds

Is your llms.txt even valid?

Google’s PageSpeed Insights now flags a missing or malformed one, and a file with broken links helps no one regardless. Nilkick checks whether yours is present and well-formed, alongside schema, sitemap, and the rest of your readiness signals.

Get your free scoreNo account · no email wall

05 · VerificationHow to check whether anyone reads yours

The fastest validity check is now Google’s own: run your URL through PageSpeed Insights and read the Agentic Browsing section, which reports whether your llms.txt is present and well-formed. That tells you the file passes. It does not tell you anyone is reading it.

For who is actually fetching it, you need your own logs. Filter your CDN or server access logs for the AI user agents and count requests to /llms.txt and /llms-full.txt:

AI user-agents to grep for

GPTBot              # OpenAI (crawl)
OAI-SearchBot       # OpenAI (search)
ClaudeBot           # Anthropic
PerplexityBot       # Perplexity
Google-Extended     # Google AI
Applebot-Extended   # Apple AI
Meta-ExternalAgent  # Meta AI

If you are on Cloudflare, its bot analytics breaks traffic down by user agent without you touching raw logs. For a sharper signal, drop a honeypot link inside the file pointing at a URL nothing else references. Any hit to that URL means something actually parsed the file and followed a link. Set your expectations low, watch over weeks rather than hours, and you will get an honest read on who, if anyone, is reading you.

FAQ

Common questions

Do AI crawlers actually read llms.txt in 2026?

Mostly no. Every independent server-log study published so far finds that the dedicated AI search crawlers (GPTBot, ClaudeBot, PerplexityBot, Google-Extended) request llms.txt in a tiny fraction of visits, often zero. One 90-day study across 60,000+ AI bot hits saw the file used in 0.1% of them. Developer and IDE tools are a different story: they do fetch it.

Does Google use llms.txt?

Why does PageSpeed Insights flag my missing llms.txt now?

If the crawlers ignore it, why ship llms.txt at all?

Will llms.txt get me cited in ChatGPT or Perplexity?

The nudge off zero

Get your free launch-readiness score

See what else is between your product and its first real users. Nilkick scores your readiness and hands you the map. Free, no login.

https:// optional · no account · we don't email you

Keep going · Agent readiness cluster All guides →

Guide

The Agentic Browsing checks in PageSpeed Insights (and how to pass them)

PageSpeed Insights now shows an Agentic Browsing score. Here are the three checks it runs, how to pass each, and why your ratio jumps between runs.

5 min read

Guide

Lighthouse's new Agentic Browsing category, explained

A field guide to Google's Lighthouse Agentic Browsing category: the four audits, why it scores a pass ratio not 0-100, and the Chrome 150 caveats.

6 min read

Guide

Schema.org for indie products: the JSON-LD to actually paste in

The two structured-data blocks worth adding to an indie product: Organization with sameAs, and SoftwareApplication. Copy-paste JSON-LD, where it goes, and what it does (and does not) do.

7 min read