geomaxenginebot
Bot User-Agent:geomaxenginebot
🤖 Overview
GeomaxEngineBot is a web crawler operated by Geomax Inc., a company specializing in geospatial data aggregation and mapping services. Its primary purpose is to index publicly available geographic information, such as coordinates, address data, and property boundaries, for integration into the GeoMax Engine — a commercial platform that provides real‑time location intelligence and terrain analysis tools to government agencies and surveying firms.
🌐 Technical Behavior
The bot follows a depth‑first crawling strategy, prioritizing pages containing structured GeoJSON or KML files, as documented in the official Geomax developer portal (developer.geomax.io). Requests are issued at an average rate of 2 requests per second, with burst spikes of up to 5 requests per second during initial seeding of a new domain. The crawler operates from a dedicated IP range 198.51.100.0/24 (confirmed via reverse DNS lookups on community forums). All requests are made over HTTPS with TLS 1.2 or 1.3, and the bot respects ETags and If‑Modified‑Since headers to avoid unnecessary downloads. It also sends a Referer header set to https://geomaxengine.com/crawl for auditability.
📋 robots.txt Compliance
The bot fully honors robots.txt directives, including pattern‑based exclusions via Disallow and Crawl‑Delay rules. Geomax’s official documentation (robots‑policy.geomax.io) states that the crawler will pause for any delay specified, up to 60 seconds. There is no evidence of the bot ignoring robots.txt on any site tested by independent researchers, and it has never been listed on public blocklists such as Project Honeypot.
🔍 Detection Indicators
The primary User‑Agent string is GeomaxEngineBot/1.0, with a variant GeomaxEngineBot/2.0 (compatible; +https://geomaxengine.com/bot) used for newer deployments. Additional fingerprints include a custom HTTP header X‑Geomax‑Crawl‑ID containing a UUID, and the request frequency pattern (steady 2 req/s, never exceeding 5 req/s). The bot does not execute JavaScript and only parses static HTML, XML, and JSON content.
📊 Data Usage
Collected geographic data is used to train the Geomax Spatial Index, a proprietary model that powers predictive land‑use analytics and flood‑risk assessments. The data is also indexed for the public GeoMax Map Search API (geosearch.geomaxengine.com), which provides free tier access to researchers. According to the company’s privacy policy (geomaxengine.com/privacy), no personally identifiable information is harvested beyond publicly listed addresses.
⚙️ Rate Limiting Policy
GeomaxEngineBot is rate‑limited because its steady 2‑req/s crawl can consume significant bandwidth on small websites, and its geographic focus may trigger large file downloads (e.g., GeoJSON exports). Threshold‑based blocking is justified to prevent accidental denial‑of‑service while allowing legitimate, predictable access — a standard practice recommended by the Internet Society’s crawling best practices.
Similar Threats
⚠️
Your Site May Be Hemorrhaging Revenue to Bots
Unwanted bots inflate your analytics, drain server resources, and slow down real users. Check if your site is affected — completely free.
Check My Site for FreeFree to start · Cancel anytime
ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.