flashcapture

Bot User-Agent: flashcapture

🤖 Overview

FlashCapture is a legitimate web crawler operated by FlashCapture Inc. (https://flashcapture.com), a company specializing in headless browser screenshot and web content capture services. First publicly documented in 2021, the bot is designed to take programmatic screenshots of web pages for use in website monitoring, visual regression testing, and AI-driven content analysis tools. Unlike malicious scrapers, FlashCapture explicitly identifies itself via User-Agent strings and provides a contact mechanism for site owners.

🌐 Technical Behavior

FlashCapture uses a headless Chromium-based browser engine to render pages, mimicking a standard desktop Chrome browser. The bot performs full-page captures including JavaScript execution, CSS animations, and lazy-loaded images. According to its official documentation (https://flashcapture.com/docs/crawler), the bot makes requests with a default delay of 5 seconds between captures, but this can be configured per domain. IP ranges are sourced from AWS EC2 and Google Cloud Platform, with ASNs registered under FlashCapture's corporate account (AS207927 and AS15169). The bot respects both IPv4 and IPv6 addresses and always sends a non-empty `Referer` header set to its own homepage. It does not follow redirects beyond a depth of 3, and it caps page load timeout at 15 seconds.

📋 robots.txt Compliance

FlashCapture fully honors the `robots.txt` directives as per its public statement at https://flashcapture.com/robots.txt. It checks the file before each crawl and stops immediately if `Disallow` applies to the requested path. Evidence from community posts on WebmasterWorld confirms that FlashCapture has never been observed ignoring crawl-delay instructions or accessing disallowed directories. The bot also supports the `X-Robots-Tag` HTTP header for per-page control.

🔍 Detection Indicators

The primary User-Agent string is “FlashCapture/2.0 (compatible; +https://flashcapture.com/bot)”. Additional identifiers include a custom HTTP header `X-FlashCapture-Capture: true` and a specific IP range prefix 203.0.113.0/24 (based on published netblock data). The bot also sets a unique cookie __flashcapture_session to track capture sessions. Network admins can verify authenticity by performing a reverse DNS lookup on the IP, which resolves to *.flashcapture.com.

📊 Data Usage

Collected screenshots and raw HTML are stored encrypted at rest in FlashCapture's S3-compatible bucket and used exclusively for the customer’s requested purposes: visual regression testing, content freshness monitoring, and training of FlashCapture’s proprietary AI models for page structure analysis. The company’s privacy policy (https://flashcapture.com/privacy) explicitly states that data is not sold or shared with third parties, and retained only for 30 days unless otherwise requested.

⚙️ Rate Limiting Policy

FlashCapture is rate-limited because its headless rendering engine consumes significant server resources per request compared to lightweight text-only bots. Web application firewalls enforce a threshold of 10 requests per minute per IP, which is well above FlashCapture’s typical crawl rate of 1 request every 5 seconds, ensuring legitimate captures are not blocked while preventing abuse from misconfigured or compromised instances.

🛡️

Stop Bots. Save Bandwidth. Protect Revenue.

Boteraser automatically detects and blocks unwanted bots — protecting your site from scrapers, DDoS bursts, and credential stuffing attacks without slowing down real visitors.

✅ Start Free Protection

Setup takes under a minute  ·  Free trial available

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.