GUIDEUPDATED FEBRUARY 2026

ROBOTS.TXT FOR AI BOTS

Complete reference for configuring robots.txt to control AI crawler access to your website. Covers all major AI bots: OpenAI (GPTBot, ChatGPT-User, OAI-SearchBot), Anthropic (ClaudeBot), Google (Google-Extended), Perplexity, ByteDance, Meta, and more.

COPY-PASTE TEMPLATES

✅ ALLOW ALL AI BOTS (Recommended for most sites)

Maximizes your visibility in AI search results and AI-generated answers

# robots.txt — Allow AI Bots
# Generated by NORAD.io (https://norad.io/guides/robots-txt-ai-bots)

# OpenAI
User-agent: GPTBot
Allow: /

User-agent: ChatGPT-User
Allow: /

User-agent: OAI-SearchBot
Allow: /

# Anthropic (Claude)
User-agent: ClaudeBot
Allow: /

User-agent: claude-web
Allow: /

User-agent: anthropic-ai
Allow: /

# Google AI
User-agent: Google-Extended
Allow: /

# Perplexity
User-agent: PerplexityBot
Allow: /

# Other AI Crawlers
User-agent: Bytespider
Allow: /

User-agent: CCBot
Allow: /

User-agent: meta-externalagent
Allow: /

User-agent: cohere-ai
Allow: /

Sitemap: https://yoursite.com/sitemap.xml

🚫 BLOCK ALL AI BOTS

Prevents AI training on your content — but also removes you from AI search results

# robots.txt — Block All AI Bots
# WARNING: This removes your content from ChatGPT, Claude, Perplexity answers

# OpenAI
User-agent: GPTBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: OAI-SearchBot
Disallow: /

# Anthropic
User-agent: ClaudeBot
Disallow: /

User-agent: claude-web
Disallow: /

User-agent: anthropic-ai
Disallow: /

# Google AI (blocks Gemini training, NOT regular Google Search)
User-agent: Google-Extended
Disallow: /

# Perplexity
User-agent: PerplexityBot
Disallow: /

# ByteDance
User-agent: Bytespider
Disallow: /

# Common Crawl
User-agent: CCBot
Disallow: /

# Meta AI
User-agent: meta-externalagent
Disallow: /

# Cohere
User-agent: cohere-ai
Disallow: /

⚡ SELECTIVE ACCESS (Best practice)

Allow search/citation bots, block training-only crawlers

# robots.txt — Selective AI Bot Access
# Allow citation/search bots, restrict training crawlers

# ALLOW: Search and citation bots (appear in AI answers)
User-agent: ChatGPT-User
Allow: /

User-agent: OAI-SearchBot
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: Perplexity-User
Allow: /

# RESTRICT: Training-only crawlers (protect content)
User-agent: GPTBot
Disallow: /premium/
Disallow: /members/
Allow: /blog/
Allow: /docs/
Allow: /

User-agent: ClaudeBot
Disallow: /premium/
Disallow: /members/
Allow: /blog/
Allow: /docs/
Allow: /

User-agent: Google-Extended
Disallow: /premium/
Allow: /

# BLOCK: Aggressive/unwanted crawlers
User-agent: Bytespider
Disallow: /

User-agent: CCBot
Disallow: /

Sitemap: https://yoursite.com/sitemap.xml

COMPLETE AI BOT REFERENCE

All known AI bot User-Agent directives for robots.txt as of February 2026:

ROBOTS.TXT DIRECTIVEORGANIZATIONTYPERESPECTS ROBOTS.TXT
User-agent: GPTBotOpenAITraining✅ Yes
User-agent: ChatGPT-UserOpenAISearch/Browse✅ Yes
User-agent: OAI-SearchBotOpenAISearch✅ Yes
User-agent: ClaudeBotAnthropicTraining✅ Yes
User-agent: claude-webAnthropicBrowse✅ Yes
User-agent: anthropic-aiAnthropicTraining✅ Yes
User-agent: Google-ExtendedGoogleAI Training✅ Yes
User-agent: PerplexityBotPerplexity AISearch Index⚠️ Disputed
User-agent: BytespiderByteDanceTraining⚠️ Partial
User-agent: CCBotCommon CrawlArchive✅ Yes
User-agent: meta-externalagentMetaTraining✅ Yes
User-agent: cohere-aiCohereTraining✅ Yes
User-agent: AmazonbotAmazonAlexa/AI✅ Yes
User-agent: YouBotYou.comSearch✅ Yes
User-agent: AI2BotAllen AIResearch✅ Yes
User-agent: ApplebotAppleSiri/Search✅ Yes

IMPORTANT NOTES

⚠️

robots.txt is advisory, not enforced

Legitimate crawlers respect robots.txt, but malicious ones ignore it. For enforcement, use IP blocking or authentication.

🔍

Blocking Google-Extended does NOT affect Google Search

Google-Extended controls only Gemini AI training. Googlebot (regular search) is a separate directive.

📊

Monitor before blocking

Use NORAD to see which bots visit your site and how often. Data-driven decisions are better than blanket blocks.

🔄

New bots appear regularly

The AI crawler landscape changes monthly. Bookmark this page — we update it as new bots emerge.

← DETECT AI BOTSBOT DATABASE →PROTECT YOUR SITE →