YANDEXBOT

LOW RISK🔍 SEARCH & AI CRAWLER

Yandex search engine crawler — Russia's largest search engine

ORGANIZATION
Yandex
FIRST SEEN
2005-01
RESPECTS ROBOTS.TXT
✓ YES
DOCUMENTATION
yandex.com
DAILY VISITS
COUNTRIES ACTIVE
TRACKING
STATUS
LAST SEEN

📡 YANDEXBOT USER-AGENT STRING

Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)

This is the User-Agent header sent by YandexBot in HTTP requests. Use this to identify YandexBot in your server access logs.

📋 ABOUT YANDEXBOT

YandexBot is the web crawler for Yandex, the largest search engine in Russia with significant market share across Russian-speaking countries. YandexBot indexes web content for Yandex Search, Yandex Images, Yandex Video, and Yandex's AI assistant Alice. It is one of the older and more established search engine crawlers still in active operation.

YandexBot is notable for supporting several unique robots.txt directives that other search engines don't implement, including the Clean-param directive (which helps handle URL parameters) and the Host directive (for specifying preferred domain versions). Yandex provides comprehensive webmaster tools and detailed documentation about its crawler's behavior.

NORAD.io tracks YandexBot activity globally, noting that its crawl patterns are particularly important for sites targeting Russian-speaking audiences. Even for sites outside this market, YandexBot activity is a useful signal — unexpected YandexBot crawling of typically non-Russian content can sometimes indicate indexing anomalies or crawler behavior worth investigating.

🎯 HOW TO DETECT YANDEXBOT

  • User-Agent contains 'YandexBot/3.0'
  • Verify via reverse DNS — should resolve to *.yandex.ru, *.yandex.net, or *.yandex.com
  • Multiple Yandex bot variants: YandexImages, YandexVideo, YandexMedia, YandexDirect
  • Yandex publishes extensive IP ranges for verification
  • Supports unique robots.txt directives like Clean-param and Host

🌐 YANDEXBOT KNOWN IP RANGES

5.45.192.0/185.255.192.0/1837.9.64.0/1837.140.128.0/1877.88.0.0/1887.250.224.0/1993.158.128.0/1895.108.128.0/17100.43.64.0/19130.193.32.0/19141.8.128.0/18178.154.128.0/17199.21.96.0/22213.180.192.0/19

Use these CIDR ranges to verify YandexBot identity at the network level. Always combine with User-Agent verification for accurate detection.

🔄 CRAWL BEHAVIOR

Systematic crawling with moderate rates. Respects robots.txt including Crawl-delay and Clean-param directives. Supports Yandex-specific robots.txt extensions. Can be managed via Yandex Webmaster Tools.

PURPOSE

Indexes web content for Yandex Search, the dominant search engine in Russia and Russian-speaking countries, as well as Yandex's AI assistant Alice.

🤖 ROBOTS.TXT CONFIGURATION

User-agent: YandexBot
Allow: /
Crawl-delay: 2

# Yandex supports Clean-param directive:
# Clean-param: utm_source&utm_medium /

YandexBot respects robots.txt directives. Add this to your robots.txt file at the root of your domain.

🗺️ WHERE IS YANDEXBOT ACTIVE?

⚠️ RELATED THREATS

📂 MORE 🔍 SEARCH & AI CRAWLERS

📚 RELATED GUIDES

PROTECT YOUR WEBSITE

Deploy SiteTrust to monitor and control AI bot access to your site with the Agent Passport Standard.

INSTALL SITETRUST →