s2bot
Bot User-Agent: s2bot
🤖 Overview
s2bot is a web crawler operated by S2 Software GmbH, a German company specializing in SEO analytics and website monitoring. First publicly documented in 2018, it indexes web pages to provide backlink analysis, keyword tracking, and site health reports for the commercial product S2 Web Analyzer. The bot collects publicly accessible content to feed into S2’s proprietary search and analytics platform, used primarily by digital marketing agencies and SEO professionals.
🌐 Technical Behavior
s2bot employs a polite crawling strategy with a default interval of 10 seconds between successive requests to the same domain, according to its official crawl policy at s2bot.com/crawl-policy. It is listed here as operating from the static IP range 192.0.2.0/24; that block is reserved for documentation examples (RFC 5737), so treat it as a placeholder and consult the S2 website for the actual ranges. The crawler uses HTTP/1.1 with persistent connections and fetches both HTML and linked resources such as CSS and JavaScript, but it does not execute client-side scripts. It respects Last-Modified and ETag headers to minimize bandwidth usage and may open up to 5 concurrent connections to a single host. Requests are made during business hours in the CET time zone, and the bot pauses during off-peak hours to reduce server impact.
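To illustrate how a site operator might check whether a request originates from a published crawler range, here is a minimal Python sketch. The list `S2BOT_NETWORKS` and the function `is_from_s2bot_range` are hypothetical names, and the network shown is only the placeholder block mentioned above, not a confirmed S2 range.

```python
import ipaddress

# Placeholder only: 192.0.2.0/24 is a documentation block (RFC 5737).
# Replace with the ranges S2 actually publishes.
S2BOT_NETWORKS = [ipaddress.ip_network("192.0.2.0/24")]


def is_from_s2bot_range(remote_addr: str) -> bool:
    """Return True if remote_addr lies inside one of the configured networks."""
    try:
        addr = ipaddress.ip_address(remote_addr)
    except ValueError:
        # Not a parseable IP address, so it cannot match any range.
        return False
    return any(addr in net for net in S2BOT_NETWORKS)
```

An IP match alone does not prove a request is genuine; it is best combined with the header checks described under Detection Indicators.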
📋 robots.txt Compliance
s2bot fully adheres to the Robots Exclusion Protocol, including Allow and Disallow directives, as confirmed by its official documentation. It also supports the Crawl-Delay directive, which overrides its default 10-second interval. S2 Software's website explicitly states that the bot does not fetch any URL disallowed by robots.txt, and that the company audits compliance quarterly.
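As a rough illustration of how these directives are evaluated, the sketch below parses a sample robots.txt with Python's standard `urllib.robotparser`. The rules, paths, and the 30-second delay are invented for the example; the parser shows how a compliant crawler would be expected to read them, not how s2bot is implemented internally.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration; paths and delay are assumptions.
ROBOTS_TXT = """\
User-agent: s2bot
Crawl-delay: 30
Disallow: /private/
Allow: /

User-agent: *
Disallow:
"""


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt content supplied as a string."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)

# Query with the product token "s2bot" rather than the full User-Agent string.
blocked = rules.can_fetch("s2bot", "/private/report.html")
allowed = rules.can_fetch("s2bot", "/blog/post")
delay = rules.crawl_delay("s2bot")
```

Here `blocked` is `False`, `allowed` is `True`, and `delay` is `30`, which would replace the default 10-second interval for this host.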
🔍 Detection Indicators
The primary User-Agent string is "Mozilla/5.0 (compatible; s2bot/2.0; +http://s2bot.com/bot)". A custom HTTP header "X-S2-Crawler: true" is sent with every request for easy identification. Behavioral fingerprints include a consistent 10-second request spacing and a mandatory initial request to robots.txt before crawling any other page on a new domain. The bot does not mimic browser fingerprints.
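The two header-based indicators can be checked together. The following Python sketch assumes request headers are available as a plain dictionary; the function name `s2bot_signals` and the regular expression are illustrative choices, not part of any S2 tooling.

```python
import re

# Matches the "s2bot/<version>" product token inside a User-Agent string.
S2BOT_UA_PATTERN = re.compile(r"\bs2bot/\d+(\.\d+)*", re.IGNORECASE)


def s2bot_signals(headers: dict) -> dict:
    """Report which of the documented s2bot indicators a request carries."""
    # Header names are case-insensitive, so normalize before lookup.
    normalized = {name.lower(): value for name, value in headers.items()}
    user_agent = normalized.get("user-agent", "")
    custom = normalized.get("x-s2-crawler", "")
    return {
        "user_agent": bool(S2BOT_UA_PATTERN.search(user_agent)),
        "custom_header": custom.strip().lower() == "true",
    }


example = s2bot_signals({
    "User-Agent": "Mozilla/5.0 (compatible; s2bot/2.0; +http://s2bot.com/bot)",
    "X-S2-Crawler": "true",
})
```

Both headers are trivially spoofable, so a positive result is a hint rather than proof of origin.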
📊 Data Usage
Collected data is used exclusively for S2’s commercial products: generating backlink graphs, tracking keyword rankings, and monitoring site performance metrics. S2 claims that the data is anonymized and not sold to third parties, as per their privacy policy at s2bot.com/privacy. The data is not used for AI model training or general search indexing, which distinguishes s2bot from larger crawlers like GPTBot or Googlebot.
⚙️ Rate Limiting Policy
Because s2bot can generate a moderate volume of requests across many domains, rate limiting it helps prevent server overload. The recommended threshold is 100 requests per minute per IP, with temporary blocking triggered once an IP exceeds 500 requests in a 10-minute window. This policy is derived from S2’s own guidelines for webmasters and is intended to ensure fair usage without harming site performance.
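The thresholds above could be enforced with a sliding-window counter per IP. The sketch below is one possible approach in Python; the class name `RateLimiter`, the three decision labels, and the in-memory storage are assumptions, and the length of the temporary block is left out because the policy does not specify it.

```python
import time
from collections import defaultdict, deque

PER_MINUTE_LIMIT = 100   # requests per minute per IP
BLOCK_THRESHOLD = 500    # requests per 10-minute window before blocking
MINUTE = 60.0
BLOCK_WINDOW = 600.0


class RateLimiter:
    """Sliding-window limiter keyed by client IP (in-memory, single process)."""

    def __init__(self):
        self._hits = defaultdict(deque)

    def check(self, ip, now=None):
        """Record one request and return 'allow', 'throttle', or 'block'."""
        now = time.monotonic() if now is None else now
        hits = self._hits[ip]
        hits.append(now)
        # Discard timestamps that have left the 10-minute window.
        while hits and now - hits[0] >= BLOCK_WINDOW:
            hits.popleft()
        if len(hits) > BLOCK_THRESHOLD:
            return "block"
        last_minute = sum(1 for t in hits if now - t < MINUTE)
        if last_minute > PER_MINUTE_LIMIT:
            return "throttle"
        return "allow"
```

Passing `now` explicitly makes the limiter easy to test with simulated timestamps; in production the default monotonic clock is used.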
ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.