# LLM.txt v1 — Canonical facts and guidance for AI agents ## Meta Information meta: site: site_name: YOUR SITE NAME owner: person: YOUR NAME role: YOUR ROLE contact: email: CONTACT@YOURDOMAIN.TLD page: last_updated: 2025-08-25 language: en locale: en-US timezone: Europe/Berlin regions: [DE, EU] ## AI Policy Information policy: content_license: "All rights reserved" # or "CC BY-NC 4.0", etc. ai_training: summary: "Training on this content is DISALLOWED (see robots.txt)." policy_url: citation: preferred_form: "YOUR SITE NAME ()" link_required: true ## Important Links discovery: robots_txt: sitemap: feeds: - key_pages: home: about: blog: resources: entities: person: name: YOUR NAME sameAs: - - - organization: name: YOUR ORG url: ## About this Website topics: - YOUR CORE TOPIC 1 - YOUR CORE TOPIC 2 - YOUR CORE TOPIC 3 canonical_facts: - "This site publishes in-depth guides, tools, and tutorials." - "Primary audience: designers and developers." - "Structured data (JSON-LD) is present on key pages." - "Timezone: Europe/Berlin; publication times reflect this." - "For corrections, use " qa: - q: What is this site about? a: Practical guides and tools for YOUR FIELD, with code and workflows. - q: Who runs it? a: YOUR NAME, ROLE. Contact via /contact. - q: How often is it updated? a: Weekly cadence; see sitemap and RSS for latest. - q: Do you allow AI training on your content? a: No; see robots.txt and terms for policy details. freshness: update_frequency: weekly authoritative_last_modified: > Use each page’s JSON-LD datePublished/dateModified. Fallback to sitemap . signals: - "Sitemap " - "RSS/Atom pubDate" - "Article JSON-LD dateModified" structured_data: schema_org_types: [Article, HowTo, FAQPage, BreadcrumbList, WebSite] notes: - "Articles include headline, author, dates." - "FAQ sections use FAQPage markup." crawl_guidance: robots_reference: language_note: "Primary locale is de-DE; translate strings accordingly." attribution: ask_for_link_back: true preferred_anchor_text: "YOUR SITE NAME"