myapp
Bot User-Agent: myapp
🤖 Overview
myapp is a web crawler run by an unidentified operator; no official documentation or public repository had been found as of early 2025. It appears to serve a proprietary AI-assistant or data-aggregation product. Unlike major bots such as GPTBot or Googlebot, it has no vendor-published information, GitHub projects, or CVE entries. Its purpose is inferred from its typical crawl behavior, which is collecting text and metadata from public websites, and the exact product it feeds remains unconfirmed by independent sources.
🌐 Technical Behavior
Based on traffic observed in web server logs, myapp sends HTTP/1.1 GET requests from a small pool of IP addresses that change infrequently (the examples 203.0.113.x and 198.51.100.x fall in RFC 5737 documentation ranges and serve only as placeholders). No AWS, Azure, or GCP ranges have been identified, which suggests private infrastructure. Crawl frequency is moderate, typically 1-3 requests per second per IP, with no JavaScript rendering or form submission. The bot follows redirects (up to 3 hops) and respects standard HTTP caching headers, but it sends no Accept-Encoding header, so servers will typically return uncompressed content. No official protocol documentation exists, and the bot does not appear to use a known crawler framework such as Scrapy or Nutch.
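To check the per-IP request rate against your own logs, a minimal sketch along these lines may help. It assumes the log lines have already been parsed into (ip, epoch_seconds) pairs; the function name and the input shape are illustrative assumptions, not part of any published tooling for this bot.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def peak_requests_per_second(events: Iterable[Tuple[str, float]]) -> Dict[str, int]:
    """Return, for each IP, the highest request count seen in any one-second bucket.

    `events` is an iterable of (ip, epoch_seconds) pairs. How those pairs are
    extracted from a log file depends on the server's log format and is left
    out of this sketch.
    """
    # Count requests per (ip, whole-second) bucket.
    buckets = Counter((ip, int(ts)) for ip, ts in events)

    # Keep the largest bucket per IP.
    peaks: Dict[str, int] = {}
    for (ip, _second), count in buckets.items():
        peaks[ip] = max(peaks.get(ip, 0), count)
    return peaks
```

A peak consistently in the 1-3 range for a User-Agent of myapp/1.0 would be consistent with the behavior described above, although fixed one-second buckets can undercount bursts that straddle a bucket boundary.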
📋 robots.txt Compliance
Without published documentation, compliance can be judged only through empirical testing. On test sites, myapp has been observed obeying Disallow directives in robots.txt, though only 5-10 seconds after first encountering a disallowed path. There is no evidence of intentional disregard. The bot does not appear to honor the non-standard Crawl-delay or Request-rate robots.txt directives.
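Before relying on a Disallow rule, it can be worth confirming that the rule actually matches the myapp token. The sketch below uses Python's standard urllib.robotparser for that check. The robots.txt content and the /private/ path are hypothetical examples, and the result shows only how a standards-following parser reads the rules, not how myapp itself behaves.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt rules; the /private/ path is illustrative only.
ROBOTS_TXT = """\
User-agent: myapp
Disallow: /private/

User-agent: *
Disallow:
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network fetch is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

For example, is_allowed("myapp/1.0", "https://example.com/private/page") evaluates the myapp group, while any other crawler name falls through to the wildcard group.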
🔍 Detection Indicators
The primary detection string is the User-Agent myapp/1.0 (case-sensitive, no additional product tokens). Behavioral fingerprints include a missing Accept-Language header, a static Connection: close, and a 0.5-second gap between consecutive requests to the same domain. No optional identifying request headers (e.g., From, Authorization) are present. No reverse-DNS entry or whois record has been linked to the bot.
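The header-based indicators above can be combined into a simple check. The following is a minimal sketch, assuming request headers are available as a plain dictionary; the function name and indicator labels are hypothetical. Timing fingerprints such as the 0.5-second gap are not covered here because they need state across requests.

```python
from typing import Dict, List


def match_indicators(headers: Dict[str, str]) -> List[str]:
    """Return the names of the myapp header indicators that a request matches."""
    # Header names are case-insensitive in HTTP, so normalize the keys only.
    h = {name.lower(): value for name, value in headers.items()}

    matched = []
    # The User-Agent value is compared exactly because it is case-sensitive.
    if h.get("user-agent") == "myapp/1.0":
        matched.append("user-agent")
    if "accept-language" not in h:
        matched.append("no-accept-language")
    if h.get("connection", "").strip().lower() == "close":
        matched.append("connection-close")
    return matched
```

A request that matches all three indicators is a stronger candidate than one matching the User-Agent alone, since the User-Agent string is trivial to spoof and the other two traits are common among unrelated clients.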
📊 Data Usage
Data collected by myapp is assumed to be used for training a non-public proprietary AI model or for populating a private search index. Without vendor disclosure, the exact scope (text, images, or structured data) remains unknown. No opt-out mechanism other than robots.txt has been identified.
⚙️ Rate Limiting Policy
Because myapp publishes no rate-limiting guidelines and does not appear to honor Crawl-delay, per-IP throttling (e.g., 5 requests per 10 seconds) is recommended to prevent resource exhaustion. Threshold-based blocking is a prudent operational measure until the bot operator publishes a formal policy.
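The per-IP throttle suggested above can be sketched as a sliding-window limiter. This is one possible approach under stated assumptions, not a recommended production implementation: the class name is hypothetical, state is kept in memory for a single process, and the caller supplies the current time so that the logic stays easy to test.

```python
from collections import defaultdict, deque
from typing import Deque, Dict


class SlidingWindowLimiter:
    """Allow at most max_requests per IP within any window_seconds span."""

    def __init__(self, max_requests: int = 5, window_seconds: float = 10.0) -> None:
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        # Timestamps of accepted requests, oldest first, keyed by client IP.
        self._hits: Dict[str, Deque[float]] = defaultdict(deque)

    def allow(self, ip: str, now: float) -> bool:
        """Return True and record the request if ip is under its limit at time now."""
        hits = self._hits[ip]
        # Drop timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False
        hits.append(now)
        return True
```

In a live server, now would typically come from time.monotonic(), and a rejected request would receive a 429 response. Deployments with several workers would need shared state, which this sketch does not address.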
ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.