offline explorer

Bot User-Agent: offline-explorer

🤖 Overview

Offline Explorer is a legitimate web-crawling utility developed and maintained by MetaProducts Corporation, primarily designed for downloading entire websites for offline browsing, archiving, and local content mirroring. First released in the late 1990s, it is not an AI crawler but a desktop application that can be configured to perform automated, recursive HTTP requests to collect static and dynamic web resources for personal or organizational use.

🌐 Technical Behavior

The tool operates by sending standard HTTP/1.1 GET and HEAD requests, following all HTML links, sources, CSS, JavaScript, and other referenced assets up to a user-defined depth limit (typically 1–20 levels). It supports IPv4 and IPv6 addresses and uses HTTP Keep-Alive connections to reduce overhead. By default, it can spawn between 1 and 50 concurrent threads per project, which may cause aggressive request bursts if not throttled. The official documentation (MetaProducts Offline Explorer Help) recommends setting a crawl delay of at least 1–5 seconds per host to avoid server overload. IP addresses originate from the user’s own network, not from a fixed MetaProducts pool, making per‑IP rate‑limiting dependent on the operator’s configuration.

📋 robots.txt Compliance

According to MetaProducts’ published specifications, Offline Explorer can be configured to obey robots.txt directives by enabling the “Respect robots.txt” option in its project settings. When enabled, the crawler parses the Disallow and Crawl-delay directives. However, if the user disables this feature (which is allowed in the software’s advanced options), the crawler will ignore robots.txt entirely. There is no publicly documented default; the behavior depends entirely on the operator’s choice.

🔍 Detection Indicators

The default User-Agent string is Offline Explorer/2.0 (version number may vary). Additional versions include Offline Explorer/3.0 and MetaProducts Offline Explorer. The application also sends a Referer header matching the current download URL, and often includes an Accept: */* header. Behavioral fingerprints include rapid sequential requests for linked resources (e.g., images and scripts) without human pause, and the absence of mouse-movement or JavaScript execution signals. Network administrators can log the User-Agent and check for high concurrency from a single IP.

📊 Data Usage

Collected data is stored locally on the user’s machine as a complete offline copy of the website, preserving directory structure, file names, and relative links. The primary use cases are offline research, website backups, educational archiving, and local content analysis. It is not used for AI training, search indexing, or any automated inference. The software does not share collected data with third parties by default.

⚙️ Rate Limiting Policy

Because Offline Explorer can be configured to launch dozens of simultaneous connections without built-in throttle limits, website operators should apply threshold-based rate limiting (e.g., max 10 requests per second per IP) to protect server resources. The policy rationale is that even a benign, legitimate tool can degrade performance if misconfigured, and setting a crawl‑delay disallow directive in robots.txt is the recommended first line of defense.

53% of Web Traffic Is Bots in 2026

— Imperva Bad Bot Report 2026

How much of your traffic is automated? Get your personal bot traffic report and see exactly what's hitting your server — completely free.

📊 Get My Bot Report

Sign up in seconds  ·  No card required

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.