AI BOT & CRAWLER DATABASE

40 bot types tracked across the NORAD.io global radar network. Complete reference with User-Agent strings, IP ranges, detection guides, and robots.txt configurations.

BOTS TRACKED

CRAWLERS

HIGH RISK

ORGANIZATIONS

WHAT ARE AI BOTS?

AI bots are automated programs that access websites to collect data for artificial intelligence systems. They include search engine crawlers like Googlebot and Bingbot that index content for search results, AI training crawlers like GPTBot and ClaudeBot that collect data for training large language models (LLMs), and AI assistant browsers like ChatGPT-User and Perplexity-User that fetch pages in real-time during AI conversations.

The rise of generative AI has dramatically increased bot traffic across the web. In 2025-2026, AI-related crawlers account for a growing share of website traffic, often exceeding human visitors on content-heavy sites. Understanding which AI bots access your site — and controlling that access through robots.txt, User-Agent detection, and IP-level policies — is essential for modern web operations.

NORAD.io monitors all major AI bots globally, providing real-time visibility into crawl activity, behavioral patterns, and compliance with site access policies. Each bot profile below includes the complete User-Agent string, known IP ranges, detection tips, and robots.txt configuration examples.

BOT	ORGANIZATION	USER-AGENT	RISK	ROBOTS.TXT
GPTBot OpenAI	OpenAI	`Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko;…`	LOW	✓
ClaudeBot Anthropic	Anthropic	`Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko;…`	LOW	✓
ChatGPT-User OpenAI	OpenAI	`Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko;…`	LOW	✓
Googlebot Google	Google	`Mozilla/5.0 (compatible; Googlebot/2.1; +http://ww…`	LOW	✓
Google-Extended Google	Google	`Mozilla/5.0 (compatible; Google-Extended; +https:/…`	LOW	✓
Bingbot Microsoft	Microsoft	`Mozilla/5.0 (compatible; bingbot/2.0; +http://www.…`	LOW	✓
PerplexityBot Perplexity AI	Perplexity AI	`Mozilla/5.0 (compatible; PerplexityBot/1.0; +https…`	LOW	✓
Bytespider ByteDance	ByteDance	`Mozilla/5.0 (Linux; Android 5.0) AppleWebKit/537.3…`	MEDIUM	✗
CCBot Common Crawl	Common Crawl	`CCBot/2.0 (https://commoncrawl.org/faq/)…`	LOW	✓
Amazonbot Amazon	Amazon	`Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) Ap…`	LOW	✓
FacebookBot Meta	Meta	`facebookexternalhit/1.1 (+http://www.facebook.com/…`	LOW	✓
AhrefsBot Ahrefs	Ahrefs	`Mozilla/5.0 (compatible; AhrefsBot/7.0; +http://ah…`	LOW	✓
SemrushBot Semrush	Semrush	`Mozilla/5.0 (compatible; SemrushBot/7~bl; +http://…`	LOW	✓
Applebot Apple	Apple	`Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_5) Ap…`	LOW	✓
YandexBot Yandex	Yandex	`Mozilla/5.0 (compatible; YandexBot/3.0; +http://ya…`	LOW	✓
Headless Chrome Unknown	Unknown	`Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36…`	HIGH	✗
Playwright Microsoft	Microsoft	`Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36…`	HIGH	✗
Scrapy Open Source	Open Source	`Scrapy/2.11 (+https://scrapy.org)…`	MEDIUM	✓
Python Requests Unknown	Unknown	`python-requests/2.31.0…`	MEDIUM	✗
DuckDuckBot DuckDuckGo	DuckDuckGo	`DuckDuckBot/1.1; (+http://duckduckgo.com/duckduckb…`	LOW	✓
Puppeteer Google	Google	`Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36…`	HIGH	✗
cURL Open Source	Open Source	`curl/8.5.0…`	MEDIUM	✗
Selenium Open Source	Open Source	`Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36…`	HIGH	✗
Twitterbot X (Twitter)	X (Twitter)	`Twitterbot/1.0…`	LOW	✓
LinkedInBot LinkedIn	LinkedIn	`LinkedInBot/1.0 (compatible; Mozilla/5.0; Apache-H…`	LOW	✓
Baiduspider Baidu	Baidu	`Mozilla/5.0 (compatible; Baiduspider/2.0; +http://…`	LOW	✓
Sogou Spider Sogou	Sogou	`Sogou web spider/4.0(+http://www.sogou.com/docs/he…`	LOW	✓
MJ12bot Majestic	Majestic	`Mozilla/5.0 (compatible; MJ12bot/v1.4.8; http://mj…`	LOW	✓
DotBot Moz	Moz	`Mozilla/5.0 (compatible; DotBot/1.2; +https://open…`	LOW	✓
Anthropic-AI Anthropic	Anthropic	`Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko;…`	LOW	✓
OAI-SearchBot OpenAI	OpenAI	`Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko;…`	LOW	✓
Perplexity-User Perplexity AI	Perplexity AI	`Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko;…`	LOW	✓
Claude-Web Anthropic	Anthropic	`Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko;…`	LOW	✓
Meta-ExternalAgent Meta	Meta	`Mozilla/5.0 (compatible; Meta-ExternalAgent/1.0; +…`	MEDIUM	✓
Cohere-AI Cohere	Cohere	`Mozilla/5.0 (compatible; cohere-ai; +https://coher…`	LOW	✓
AI2Bot Allen Institute for AI	Allen Institute for AI	`Mozilla/5.0 (compatible; AI2Bot/1.0; +https://alle…`	LOW	✓
YouBot You.com	You.com	`Mozilla/5.0 (compatible; YouBot/1.0; +https://abou…`	LOW	✓
PetalBot Huawei	Huawei	`Mozilla/5.0 (compatible; PetalBot;+https://webmast…`	LOW	✓
DataForSeoBot DataForSEO	DataForSEO	`Mozilla/5.0 (compatible; DataForSeoBot/1.0; +https…`	LOW	✓
PhantomJS Open Source	Open Source	`Mozilla/5.0 (Unknown; Linux x86_64) AppleWebKit/53…`	HIGH	✗

AI BOT & CRAWLER DATABASE

WHAT ARE AI BOTS?

📋 ALL TRACKED BOTS

🔍 SEARCH & AI CRAWLERS

🤖 AI ASSISTANTS

📊 SEO & DATA SCRAPERS

⚡ AUTOMATION AGENTS

📡 ALL AI BOT USER-AGENT STRINGS

PROTECT YOUR WEBSITE