Search bot request filtering
Enable basic search bot filtering through a single switch in the Request Filtering section of the Dashboard
Search bots generate legitimate internet traffic that allows search engines to index your website. However, you might not want search bots to trigger Fingerprint Identification or Smart Signals processing.
Although you can filter requests based on the User-Agent header yourself, you would need to maintain your own list of all the different search bot User-Agent strings. Instead, we keep and update our own list of search bot filtering rules, which you can turn on or off through a switch in the Dashboard.
Requests from search bots filtered through this setting are handled in the same way as requests matched by other request filtering rules, and you don't get billed for them. Keep in mind that we won't return any information except requestId in those cases.
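Because a filtered request returns nothing except requestId, code that consumes the result may want to detect that case before reading other fields. The following is a minimal sketch, assuming the result has already been parsed into a plain dictionary. The function name `is_filtered_response` and the `visitorId` field in the usage note are hypothetical and used only for illustration.

```python
def is_filtered_response(result: dict) -> bool:
    """Return True when the payload carries requestId and nothing else.

    Assumption: the parsed result is a flat dict. The real payload
    shape may differ, so adapt the check to your integration.
    """
    return set(result.keys()) == {"requestId"}
```

With this check, `is_filtered_response({"requestId": "abc"})` is true, while a payload with any additional key (for example a hypothetical `visitorId`) is treated as a regular, unfiltered result.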
Search bot filtering based on User-Agent is a basic mechanism we provide for free to filter out commonly encountered and generally well-behaved search bots. Because of its simplicity, it can produce both false positives and false negatives. If you need better protection against sophisticated bots, check out our Smart Signals, which include a paid Bot Detection product.
Configuration
Search Bots Filtering can be turned on or off in our Dashboard.
- Navigate to Dashboard > Security and switch to the Bots tab.
- Enable Exclude Search Bots to start filtering search bots from your billable API calls.
List of supported search bot User-Agent strings
The list of search bot User-Agent strings we support is public, and we keep it in sync with the User-Agent strings currently supported. The following table contains all supported User-Agent strings with their respective information sources.
Owner | User-Agent (Substrings) | Public doc |
---|---|---|
Mail.ru | mail.ru | source |
Microsoft | bingbot, AdIdxBot, MicrosoftPreview | source |
Apple | applebot | source |
Huawei | petalbot | source |
Naver | naver.me/spd | source |
Yahoo | slurp | source |
Baidu | baiduspider | source |
Sogou | sogou web spider, sogou wap spider, sogou pic spider | source |
Exalead | exabot | source |
Coccoc | coccocbot | source |
Seznam | seznamBot | source |
Google | googlebot, AdsBot-Google, Storebot-Google, APIs-Google, AdsBot-Google-Mobile, Mediapartners-Google, Googlebot-Image, Googlebot-News, Googlebot-Video, AdsBot-Google-Mobile-Apps, FeedFetcher-Google, Google-Read-Aloud, Google-Site-Verification | source |
Yandex | yandex.com/bots | source |
Bytedance | bytespider | source |
Duckduckgo | duckduckbot | source |
Ahrefs | AhrefsBot, AhrefsSiteAudit | source |
Facebook | facebookexternalhit, facebookcatalog | source |
Pinterest | Pinterestbot | source |
Cincraw | Cincraw | source |
Semrush | http://www.semrush.com/bot.html | |
Seekport | SeekportBot | source |
Amazon | Amazonbot/0.1 | source |
DataDog | datadog | source |
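
To illustrate how substring-based matching on the User-Agent header works in principle, here is a minimal sketch that checks a header value against a handful of substrings taken from the table above. It is not our actual implementation: the function name `is_search_bot` is hypothetical, the list is deliberately incomplete, and case-insensitive matching is an assumption.

```python
# A few substrings copied from the table above (not the full list).
SEARCH_BOT_SUBSTRINGS = (
    "googlebot",
    "bingbot",
    "applebot",
    "duckduckbot",
    "yandex.com/bots",
    "sogou web spider",
)


def is_search_bot(user_agent: str) -> bool:
    """Return True if the User-Agent contains any listed substring.

    Assumption: matching ignores case. The table does not state
    whether the real rules are case-sensitive.
    """
    ua = user_agent.lower()
    return any(substring.lower() in ua for substring in SEARCH_BOT_SUBSTRINGS)
```

A check like this also shows why false positives and false negatives are possible: any client can send a header containing one of these substrings, and a bot that changes or omits its User-Agent is not matched.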