llms.txt Builder

Build and maintain `/llms.txt` + `/llms-full.txt` discovery files (Jeremy Howard 2024 proposed…

Builds and maintains the /llms.txt and /llms-full.txt discovery files that let AI crawlers like GPTBot, ClaudeBot, and PerplexityBot find your content hierarchy without parsing HTML. It generates both files from a single CMS or MDX source at build time, wires up cache purge on content change, and runs conformance checks so your site stays AI-discoverable.

$15 one-time
Prices include 20% VAT · Forged on real agency work · one-time, no lock-in

  • Type: Skill
  • Category: Search & AEO
  • Delivery: Email · instant
  • License: One-time

Inside the run · no black box

See the actual work before you buy it.

The skill generates /llms.txt (short index, 500KB cap) and /llms-full.txt (full content, 5MB cap) from one canonical CMS source at build time, then wires the validation and cache purge chain around them. It is positioned honestly: a cheap side benefit for Perplexity and agent tools, never sold as a Google ranking lever.

  1. Fetches the content inventory from the single source (a Sanity query or MDX files) grouped by priority: primary pages first, then pillar guides, recent blog posts, and a low-priority Optional section.
  2. Builds llms.txt to spec: one H1 site name, a blockquote description, H2 sections ordered by priority (the order itself is the signal AI bots read), and link lines with descriptions capped at 100 characters.
  3. Builds llms-full.txt with each page's full markdown body inline, so an AI agent reads everything in one request without HTML parsing or JS hydration.
  4. Runs the conformance validator: exactly one H1, absolute https URLs only, zero admin/staging/preview/PII leaks, size limits enforced; any error exits the build with a nonzero code.
  5. Wires the freshness chain: a CMS webhook triggers a rebuild or ISR revalidate, then purges the CDN cache, so both files update within about 60 seconds of a content change.
  6. Adds drift detection: a cron compares the llms.txt entry count against sitemap.xml URLs, and a delta above 10 percent triggers a rebuild.
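
The generate, validate, and drift steps above can be sketched in a few lines. This is a minimal, language-neutral illustration in Python, not the shipped Astro or Next.js generator: the `Page` shape, the function names, the blocked-segment list, reading 500KB as 500 × 1024 bytes, and measuring drift relative to the sitemap count are all assumptions made for the example. PII scanning and the llms-full.txt body are left out.

```python
from __future__ import annotations

import re
from dataclasses import dataclass
from urllib.parse import urlparse

# Assumed values: they mirror the numbers quoted in the steps above.
INDEX_CAP_BYTES = 500 * 1024   # 500KB cap for llms.txt (KB read as 1024 bytes)
DESCRIPTION_CAP = 100          # characters per link description
DRIFT_THRESHOLD = 0.10         # 10 percent
BLOCKED_SEGMENTS = {"admin", "staging", "preview"}  # hypothetical leak markers


@dataclass
class Page:  # hypothetical shape of one content-inventory entry
    title: str
    url: str
    description: str
    section: str


def build_llms_txt(site: str, summary: str, pages: list[Page], order: list[str]) -> str:
    """Render one H1, a blockquote, then H2 sections in the given priority order."""
    lines = [f"# {site}", "", f"> {summary}"]
    for section in order:
        entries = [p for p in pages if p.section == section]
        if not entries:
            continue  # skip empty sections rather than emit a bare heading
        lines += ["", f"## {section}", ""]
        for p in entries:
            lines.append(f"- [{p.title}]({p.url}): {p.description[:DESCRIPTION_CAP]}")
    return "\n".join(lines) + "\n"


def validate(text: str) -> list[str]:
    """Return conformance errors; an empty list means the file passes."""
    errors = []
    h1_count = sum(1 for line in text.splitlines() if line.startswith("# "))
    if h1_count != 1:
        errors.append(f"expected exactly one H1, found {h1_count}")
    # Check every markdown link target of the form [title](url).
    for url in re.findall(r"\]\(([^)\s]+)\)", text):
        parsed = urlparse(url)
        if parsed.scheme != "https" or not parsed.netloc:
            errors.append(f"not an absolute https URL: {url}")
        parts = set(parsed.netloc.lower().split(".")) | set(parsed.path.lower().split("/"))
        if parts & BLOCKED_SEGMENTS:
            errors.append(f"possible admin/staging/preview leak: {url}")
    if len(text.encode("utf-8")) > INDEX_CAP_BYTES:
        errors.append("llms.txt exceeds the 500KB cap")
    return errors


def needs_rebuild(index_count: int, sitemap_count: int) -> bool:
    """Flag drift when the entry counts differ by more than the threshold."""
    if sitemap_count == 0:
        return index_count != 0
    return abs(index_count - sitemap_count) / sitemap_count > DRIFT_THRESHOLD
```

In a build script, a non-empty result from `validate` would end the run with a nonzero exit code (for example via `sys.exit(1)`), and a scheduled job would call `needs_rebuild` with the llms.txt entry count and the sitemap.xml URL count.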
Use cases · what happens when you plug it in

One power source. 6 lines out.

  1. Making a large multilingual site discoverable to AI crawlers
  2. Increasing AI citation odds for ChatGPT, Perplexity, and AI Overviews
  3. Signaling content priority with grouped sections and an Optional tier
  4. Rapidly exposing new content to AI bots after a deploy
  5. Resolving 'ChatGPT isn't reading or citing us' complaints
  6. Keeping llms.txt in sync with sitemap.xml via drift detection

Benefits · what you walk away with

Yours to keep.

Forever

That's what owning means.

The rented stack

  • ai writing tool: subscription (expired · access lost)
  • analytics suite: subscription (expired · access lost)
  • design platform: subscription (expired · access lost)

(nothing left)

Your forge

  1. Get AI engines to see new content in hours instead of weeks
  2. Steer crawl budget toward your highest-value pages first
  3. Prevent admin and staging URL leaks with a built-in validator
  4. Keep AI discovery files in sync automatically on every content change

license: perpetual

subscriptions expire · deeds don't

What's included · the full manifest

Everything in the box.

Build-time generators for Astro and Next.js from a CMS or MDX source

part 01 of 06 · in the box

6 parts · one working system · ships instantly by email

From the field · a real case

This wasn’t written at a desk.

Who it's for

This wasn't forged for everyone.

  • Not for you if you'd rather rent a tool than own one.
  • Not for you if you want someone else to run your stack.
  • Not for you if you're happy guessing.

Still here? Good.

If you are an SEO engineer or developer who wants your content reliably discovered and cited by AI search engines, then this was forged for you.

Works with

Universal by design: these skills run in any AI assistant. They ship in the open Agent Skills + MCP format, which is native in Claude; ChatGPT, Gemini, Cursor, and Copilot adapt the same files in their own way.

  • Claude: native format
  • ChatGPT: adapts via open standards
  • Gemini: adapts via open standards
  • Cursor: adapts via open standards
  • Copilot: adapts via open standards

Questions · still in the air

Catch what's on your mind.

  1. My site is not on Astro or Next.js. Can I still use this?

    The build-time generators target Astro 5 and Next.js 15 with a Sanity or MDX single source. On another stack you can borrow the file structure, the conformance validator logic, and the sync discipline, but you would wire the generation step into your own build yourself.

  2. Why not just write llms.txt by hand once?

    Because it goes stale the moment content changes. The pipeline regenerates both llms.txt and llms-full.txt from your CMS or MDX source at build time, purges the CDN cache on content change, and runs drift detection against sitemap.xml so the file never silently falls behind your site.

  3. Will this guarantee that ChatGPT or Perplexity cites my site?

    No. llms.txt is a proposed standard (llmstxt.org) and crawler adoption varies. What it does is remove the discovery barrier: GPTBot, ClaudeBot, and PerplexityBot get a clean content hierarchy instead of HTML soup. Whether you get cited still depends on the engine and your content.

  4. How is it delivered?

    By email right after purchase: ready to run, downloaded instantly, no setup wait.

  5. One-time or subscription?

    A one-time purchase; no subscription or hidden fees. VAT (20%) is included.

  6. Can I get a refund?

    As a digital product, it can’t be refunded once downloaded. That’s why we show exactly what’s inside and who it’s for, right here.