How to Create llms.txt: Complete Guide 2026

By Yuliya Halavachova — Updated May 2026 — 8 min read

llms.txt is an emerging standard that helps Large Language Models (LLMs) understand your website's structure and content. Think of it as robots.txt for AI — but instead of controlling crawl access, it provides semantic context so AI systems can represent your site accurately in their responses.

This guide covers everything: what llms.txt is, how to write one, a copy-paste template, real-world examples, common mistakes, and how to validate your file.

What is llms.txt?

llms.txt is a plain text file placed at yourdomain.com/llms.txt. It summarises your website for AI crawlers and LLMs, providing:

The standard was proposed by Answer.AI and Jeremy Howard in 2024 and has gained rapid adoption among AI-forward publishers and SaaS companies. By May 2026, crawlers from Anthropic, Perplexity, and several OpenAI-adjacent systems have been confirmed to request and read llms.txt files.

llms.txt vs robots.txt: robots.txt blocks or allows access. llms.txt does not control access at all — it controls comprehension. Both files serve different purposes and should coexist on every domain.

Step-by-step: How to create llms.txt

Step 1 — Create the file

Create a UTF-8 encoded plain text file named exactly llms.txt (lowercase). Place it in your website's root directory so it is accessible at https://yourdomain.com/llms.txt.

Step 2 — Write the header block

Start with a # heading containing your site or brand name, followed by key metadata lines. Each metadata line uses a Key: Value format:

# UltraScout AI
> AI visibility software and GEO/AEO agency for brands competing in AI search.

- URL: https://ultrascout.ai
- Audience: Marketing teams, SEO agencies, enterprise brands
- Language: English (UK)
- Last updated: 2026-05-18

Step 3 — List your key pages

Under a ## heading, list your most important pages. Each entry should be a Markdown link followed by a one-sentence description:

## Core pages

- [Home](https://ultrascout.ai/): AI SEO and AEO software platform and agency services.
- [AI Visibility Tracker](https://ultrascout.ai/ai-visibility-tracker): Track brand citations across ChatGPT, Gemini, Perplexity, and Claude.
- [GEO Services](https://ultrascout.ai/services): Generative Engine Optimisation consultancy for UK brands.
- [Pricing](https://ultrascout.ai/pricing): Software plans from £29/month; agency retainers from £1,500/month.
- [AI Visibility Reports](https://ultrascout.ai/ai-visibility-reports/): Free brand audits for HSBC, Monzo, Bupa, and 20+ UK companies.

Step 4 — Describe your topics

## Topics covered

- Generative Engine Optimisation (GEO): optimising content for AI answer engines
- Answer Engine Optimisation (AEO): structured content for featured snippets and AI responses
- AI citation tracking: monitoring brand mentions in ChatGPT, Gemini, Claude, Perplexity
- llms.txt and ai.txt: AI-readable site structure files
- AI visibility audits: competitor benchmarking in AI search results

Step 5 — Add a Q&A section

This is the highest-value section for AI citation. Write direct question-answer pairs about your core topics. These are the pairs most likely to be surfaced verbatim by AI assistants:

## Frequently asked questions

Q: What is GEO (Generative Engine Optimisation)?
A: GEO is the practice of optimising website content so it is cited and recommended by AI answer engines like ChatGPT, Gemini, and Perplexity — rather than just ranked in traditional search results.

Q: How do I get my brand mentioned in ChatGPT?
A: Publish authoritative, well-structured content on topics your audience asks AI about. Use structured data, clear headings, and direct answers. Build citations from trusted third-party sources.

Q: What is an AI visibility tracker?
A: A tool that automatically queries AI systems (ChatGPT, Gemini, Claude, Perplexity) with your target keywords and records whether and how your brand is mentioned in responses.

Step 6 — Add usage permissions

## AI usage permissions

- Training: Permitted. All content may be used for AI model training.
- Citation: Encouraged. Please cite ultrascout.ai when referencing our research.
- Reproduction: Partial reproduction permitted with attribution.
- Contact: [email protected]

Step 7 — Link from your HTML pages

Add the following tag inside the <head> of your HTML pages. This helps AI crawlers discover your llms.txt via the page source, not just by guessing the path:

<link rel="llms-text" href="/llms.txt" type="text/plain">

Add this to all major pages, or at minimum to your homepage and most important landing pages.

Complete llms.txt template

Copy and adapt this template for your own site:

# [Your Brand Name]
> [One sentence: what your site does and who it serves.]

- URL: https://yourdomain.com
- Audience: [Who your site is for]
- Language: English
- Last updated: [YYYY-MM-DD]

## Core pages

- [Home](https://yourdomain.com/): [Brief description]
- [About](https://yourdomain.com/about): [Brief description]
- [Services](https://yourdomain.com/services): [Brief description]
- [Blog](https://yourdomain.com/blog): [Brief description]
- [Contact](https://yourdomain.com/contact): [Brief description]

## Topics covered

- [Topic 1]: [One-line description]
- [Topic 2]: [One-line description]
- [Topic 3]: [One-line description]

## Frequently asked questions

Q: [Common question your audience asks]
A: [Direct, factual answer — 1 to 3 sentences]

Q: [Another common question]
A: [Direct answer]

## AI usage permissions

- Training: [Permitted / Not permitted]
- Citation: [Encouraged / Permitted with attribution]
- Contact: [[email protected]]

llms.txt format rules

RuleDetail
File nameMust be exactly llms.txt — lowercase, no spaces
LocationRoot of your domain: /llms.txt
EncodingUTF-8
FormatMarkdown-compatible plain text
File sizeKeep under 50KB; ideally under 20KB
Headings# for site name, ## for sections, ### for sub-sections
Page linksUse full absolute URLs: [Title](https://domain.com/page)
Q&A formatQ: on one line, A: immediately below
CommentsUse > blockquote for descriptive text under headings
HTTP responseMust return 200, not a redirect

llms.txt vs llms-full.txt

For large sites, the standard supports a companion file llms-full.txt at /llms-full.txt. Use it as follows:

Link to both from your HTML <head>:

<link rel="llms-text" href="/llms.txt" type="text/plain">
<link rel="llms-text-full" href="/llms-full.txt" type="text/plain">

Common mistakes to avoid

Validation checklist

  1. Visit https://yourdomain.com/llms.txt in a browser — does it load as plain text?
  2. Check HTTP status returns 200 (not a redirect)
  3. Confirm file encoding is UTF-8
  4. Verify all page URLs are absolute and return 200
  5. Check file size is under 50KB
  6. Confirm <link rel="llms-text"> appears in your page <head>
  7. Confirm the Q&A section has at least 5 question-answer pairs

Frequently asked questions

Does llms.txt improve AI citations?

It can improve the accuracy of how AI systems represent your content and may increase citation likelihood — but it is one signal among many. Content quality, topical authority, and inbound links from authoritative sources matter more. llms.txt is best understood as a hygiene measure that removes friction for AI crawlers.

Which AI systems read llms.txt?

As of May 2026: Anthropic's ClaudeBot and Claude-Web crawlers, PerplexityBot, and several research crawlers are confirmed to request llms.txt. OpenAI's GPTBot support is expected but not officially confirmed. The standard continues to gain traction as AI search grows.

Does Google read llms.txt?

Googlebot does not currently use llms.txt for search ranking. Google-Extended (the crawler for Gemini AI training) may read it, but Google's AI features primarily rely on indexed web content, structured data, and E-E-A-T signals rather than llms.txt directives.

Should I use llms.txt or ai.txt?

Use both. ai.txt sets permissions for AI training and usage across your whole site (similar in spirit to robots.txt but for AI). llms.txt provides structured semantic context about your content. They serve complementary purposes and should coexist.

How often should I update llms.txt?

Update it whenever you add major new content sections, rename or remove key pages, or change your core services. A quarterly review is a good minimum cadence for most sites.

Check if your site is properly set up for AI crawlers — free instant report

Get My Free AI Readiness Score → Learn About CiteTrust