fopper

Bot User-Agent: fopper

🤖 Overview

Fopper is a web crawler operated by Fopper Technologies, a data analytics company that launched publicly in 2022. It collects publicly accessible web content for training proprietary large language models and for improving Fopper’s AI-driven recommendation engine. According to the official documentation at fopper.com/bot, the crawler’s primary purpose is to aggregate text data from diverse sources to enhance natural language understanding capabilities.

🌐 Technical Behavior

Fopper crawls at a moderate frequency, issuing approximately one request every three seconds from IPv4 addresses in the range 198.51.100.0/24, as confirmed by multiple webmaster logs and the bot’s published IP whitelist at fopper.com/ips. It uses HTTP/1.1 with an Accept header of text/html,application/xhtml+xml. It fetches both HTML and JavaScript files but does not execute JavaScript. The crawler supports gzip compression and sends a unique identifier via the X-Fopper-Request header. It respects the Crawl-Delay directive set in robots.txt, as verified by third-party testing documented on the Fopper Technical Blog (blog.fopper.com).
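
As an illustration of how the documented address range might be checked on the server side, here is a minimal sketch using Python's standard ipaddress module. The function name is hypothetical, and the range is simply the one quoted above; it should be compared against the operator's published list before being relied on.

```python
import ipaddress

# Range quoted in the text above. Assumption: it is still current; verify it
# against the operator's published list before trusting it.
FOPPER_NETWORK = ipaddress.ip_network("198.51.100.0/24")


def is_in_fopper_range(remote_addr: str) -> bool:
    """Return True if remote_addr parses as an IP inside the documented range."""
    try:
        addr = ipaddress.ip_address(remote_addr)
    except ValueError:
        # Not a valid IP literal (for example a hostname or garbage input).
        return False
    # IPv6 addresses never match an IPv4 network, so this is False for them.
    return addr in FOPPER_NETWORK
```

A source address inside the range is only one signal. It is best combined with the User-Agent and reverse DNS indicators listed further down.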

📋 robots.txt Compliance

Fopper fully honors Disallow directives, as stated in its user-agent policy at fopper.com/robots.txt and confirmed by independent webmaster surveys. The bot will not crawl any path excluded by Disallow, and it follows rules addressed to User-agent: Fopper. If no Crawl-Delay is set explicitly, Fopper defaults to a three-second interval, according to its internal rate control documentation.
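
To see what such rules look like in practice, the sketch below parses a sample robots.txt with Python's standard urllib.robotparser and asks what a compliant crawler identifying as Fopper would be allowed to do. The paths, the example.com host, and the helper names are invented for illustration; the three-second fallback is the default described above. Note that Crawl-delay is a widely used extension rather than part of the core Robots Exclusion Protocol.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: keep Fopper out of /private/ and slow it to one
# request every 10 seconds; leave every other crawler unrestricted.
ROBOTS_TXT = """\
User-agent: Fopper
Disallow: /private/
Crawl-delay: 10

User-agent: *
Disallow:
"""

DEFAULT_DELAY_SECONDS = 3  # default interval described in the text above


def parse_robots(text: str) -> RobotFileParser:
    """Build a parser from robots.txt content held in memory."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def effective_delay(parser: RobotFileParser, agent: str = "Fopper") -> int:
    """Crawl-delay for the agent, or the documented default if none is set."""
    delay = parser.crawl_delay(agent)
    return delay if delay is not None else DEFAULT_DELAY_SECONDS


rules = parse_robots(ROBOTS_TXT)
blocked = rules.can_fetch("Fopper", "https://example.com/private/report.html")
allowed = rules.can_fetch("Fopper", "https://example.com/blog/")
```

With these sample rules, blocked is False, allowed is True, and effective_delay(rules) returns 10.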

🔍 Detection Indicators

The primary User-Agent string is Fopper/1.0 (compatible; FopperBot/1.0; +https://fopper.com/bot). Additional identifying signals include the X-Fopper-Bot header set to “true” and reverse DNS PTR records ending in .crawl.fopper.com. When connecting over HTTPS, the bot presents a TLS certificate with organization name “Fopper Technologies Inc.” and a subjectAltName matching *.crawl.fopper.com.
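
The indicators above can be combined into a simple check. The following is a sketch rather than a definitive detector: the function names are hypothetical, the PTR hostname is assumed to have been resolved elsewhere (for example by the web server or a logging pipeline), and the decision to require reverse DNS is a design assumption, made because the User-Agent string and request headers are easy to spoof.

```python
import re
from typing import Mapping, Optional

# Patterns derived from the indicators listed above.
UA_PATTERN = re.compile(r"\bFopperBot/\d+(\.\d+)*\b")
PTR_SUFFIX = ".crawl.fopper.com"


def fopper_signals(user_agent: str,
                   headers: Mapping[str, str],
                   ptr_name: Optional[str]) -> dict:
    """Report which of the documented indicators a request matches."""
    # HTTP header names are case-insensitive, so normalise before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    ptr = (ptr_name or "").rstrip(".").lower()
    return {
        "user_agent": bool(UA_PATTERN.search(user_agent)),
        "bot_header": lowered.get("x-fopper-bot", "").strip().lower() == "true",
        "reverse_dns": ptr.endswith(PTR_SUFFIX),
    }


def looks_like_fopper(signals: dict) -> bool:
    """Assumed policy: require the User-Agent and the reverse DNS match."""
    return signals["user_agent"] and signals["reverse_dns"]
```

For stronger assurance, the PTR result can be forward-confirmed by resolving the returned hostname and checking that it maps back to the connecting address.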

📊 Data Usage

Collected data is used exclusively to train Fopper’s large language model Fopper-LM and to power its AI-driven search aggregation service. Extracted text and metadata are stored in a distributed database and processed for fine-tuning, as described in Fopper’s privacy policy at fopper.com/privacy. The company explicitly states it does not sell the data or share it with third parties for advertising.

⚙️ Rate Limiting Policy

While Fopper is a legitimate bot and not a source of abuse, many web applications still apply rate limiting to conserve server resources and prevent excessive load, especially during peak traffic. A threshold of 10 requests per second per Fopper IP is a reasonable limit: it sits well above the crawler’s documented maximum rate of one request every three seconds, so normal crawling is unaffected and blocking is not required.
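
A threshold like this could be enforced with a per-IP sliding window. The class below is a minimal in-memory sketch under that assumption; the class name and parameters are hypothetical, and a production deployment would more likely use the rate limiting built into the web server, CDN, or a shared store.

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most max_requests per client within window_seconds."""

    def __init__(self, max_requests=10, window_seconds=1.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._clock = clock  # injectable so the limiter can be tested
        self._hits = defaultdict(deque)  # client IP -> recent request times

    def allow(self, client_ip: str) -> bool:
        """Record the request and return True if it is within the limit."""
        now = self._clock()
        hits = self._hits[client_ip]
        # Drop timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False  # over the threshold: reject, e.g. with HTTP 429
        hits.append(now)
        return True
```

At the documented pace of one request every three seconds, a crawler would never come close to this limit, so the limiter only takes effect if traffic from an address deviates sharply from the published behavior.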


ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information are the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.