image stripper
Bot User-Agent: image-stripper
🤖 Overview
Image Stripper is a legitimate web crawler operated by ImageVault Inc., a company specializing in automated image extraction and metadata removal. Its primary purpose is to crawl public websites to collect images, strip them of sensitive EXIF data (e.g., GPS coordinates, camera information), and supply clean images to clients for stock photography, AI training datasets, and privacy-preserving archives. The bot was first documented in a 2022 whitepaper published by ImageVault. Its user-agent is reported to be registered with the Internet Assigned Numbers Authority (IANA) as a non-malicious agent, although IANA is not known to maintain a registry of crawler user-agents, so this claim should be treated with caution.
🌐 Technical Behavior
The bot follows a systematic crawl pattern: it begins by scanning the robots.txt and sitemap.xml of a domain, then requests all .jpg, .png, .gif, and .webp files it discovers. Requests are made at a controlled average frequency of 5 requests per second per domain (per ImageVault's official technical notes). IP ranges assigned to Image Stripper are drawn from ASN 398721 (ImageVault LLC) and include blocks such as 203.0.113.0/24 and 198.51.100.0/24. The bot uses HTTP/1.1 with Accept: image/webp,image/*;q=0.9 and includes a custom header X-Image-Stripper: 1.0 for identification. It respects If-Modified-Since headers and caches responses to minimize server load.
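As a rough illustration of how the address blocks quoted above could be checked against a client IP, here is a minimal sketch using Python's standard ipaddress module. The function name in_listed_blocks is hypothetical, and the two CIDR blocks are copied from the text as-is. Both fall inside ranges reserved for documentation (TEST-NET-2 and TEST-NET-3 in RFC 5737), so they are best treated as placeholders rather than as a verified allowlist.

```python
import ipaddress

# CIDR blocks quoted in the profile text. Both sit inside RFC 5737
# documentation ranges, so treat them as illustrative placeholders.
LISTED_BLOCKS = [
    ipaddress.ip_network("203.0.113.0/24"),
    ipaddress.ip_network("198.51.100.0/24"),
]


def in_listed_blocks(addr: str) -> bool:
    """Return True if addr parses as an IP inside one of the listed blocks."""
    try:
        ip = ipaddress.ip_address(addr)
    except ValueError:
        # Not a syntactically valid IP address.
        return False
    return any(ip in block for block in LISTED_BLOCKS)


if __name__ == "__main__":
    for candidate in ("203.0.113.7", "192.0.2.1", "not-an-ip"):
        print(candidate, in_listed_blocks(candidate))
```

An IP match on its own only narrows down the source network; combining it with the header and User-Agent checks described elsewhere on this page gives a stronger signal.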
📋 robots.txt Compliance
Image Stripper fully honors Disallow directives, as confirmed by ImageVault's public compliance statement dated March 2023. The bot first retrieves robots.txt and caches it for 24 hours; any path containing /private, /admin, or /temp is automatically avoided. In tests by the Robot Exclusion Standard Association (RESA), the bot was found to respect wildcard patterns and even noimageindex directives set through the robots meta tag in the HTML head.
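To see how a compliant crawler would evaluate Disallow rules aimed at this bot, the sketch below feeds a sample robots.txt to Python's standard urllib.robotparser. The product token ImageStripper is an assumption inferred from the User-Agent string on this page, and the rules, paths, and the allowed helper are hypothetical. It shows standard robots.txt matching, not ImageVault's actual implementation.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block the bot from two directories and
# leave every other crawler unrestricted.
ROBOTS_TXT = """\
User-agent: ImageStripper
Disallow: /private/
Disallow: /uploads/

User-agent: *
Allow: /
"""


def allowed(path: str, agent: str = "ImageStripper") -> bool:
    """Return True if the given agent may fetch path under ROBOTS_TXT."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    print(allowed("/private/photo.jpg"))                    # blocked for the bot
    print(allowed("/images/photo.jpg"))                     # not covered by a Disallow
    print(allowed("/private/photo.jpg", agent="OtherBot"))  # other agents unaffected
```

Note that urllib.robotparser matches rules by simple path prefix, so wildcard patterns like those mentioned above would need a different parser to test.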
🔍 Detection Indicators
The primary User-Agent string is Mozilla/5.0 (compatible; ImageStripper/2.1; +https://www.imagevault.com/bot). A secondary legacy agent, Image Stripper/1.0, is also observed. Behavioral fingerprints include a high ratio of GET requests for image formats, absence of JavaScript execution, and a minimum request interval of 200 milliseconds, which matches the stated rate of 5 requests per second. The bot also includes the header From: [email protected] for further identification.
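The two User-Agent forms listed above can be matched with a single regular expression. The following is a minimal sketch; the pattern and the function name imagestripper_version are illustrative assumptions, not an official signature. Because User-Agent headers are trivially spoofed, a match is a hint and not proof of origin.

```python
import re
from typing import Optional

# Matches "ImageStripper/2.1" and the legacy "Image Stripper/1.0",
# capturing the dotted version number.
UA_PATTERN = re.compile(r"\bImage ?Stripper/(\d+(?:\.\d+)*)", re.IGNORECASE)


def imagestripper_version(user_agent: str) -> Optional[str]:
    """Return the advertised bot version, or None if the UA does not match."""
    match = UA_PATTERN.search(user_agent)
    return match.group(1) if match else None


if __name__ == "__main__":
    samples = [
        "Mozilla/5.0 (compatible; ImageStripper/2.1; +https://www.imagevault.com/bot)",
        "Image Stripper/1.0",
        "Mozilla/5.0 (X11; Linux x86_64)",
    ]
    for ua in samples:
        print(imagestripper_version(ua), "<-", ua)
```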
📊 Data Usage
Collected images are processed through ImageVault's proprietary EXIF-stripping algorithm, then stored in a private database for two purposes: (1) direct sale to clients as clean stock imagery, and (2) inclusion in a public, opt-out image dataset used for training computer vision models (e.g., object detection and style transfer). According to ImageVault’s privacy policy, no personally identifiable information (PII) is retained beyond the stripping step.
⚙️ Rate Limiting Policy
Network administrators should consider rate-limiting Image Stripper because of its aggressive crawl frequency: although it respects robots.txt, it does not automatically throttle when servers are under load. ImageVault's own operational guidelines recommend a threshold-based block at 10 requests per second per IP to prevent accidental resource exhaustion during peak traffic.
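One way to express a threshold of 10 requests per second per IP is a sliding-window counter. The sketch below is an in-memory illustration only. The class name SlidingWindowLimiter and its parameters are hypothetical, and a production deployment would more likely rely on the rate-limiting features of the web server, CDN, or WAF already in place.

```python
import time
from collections import defaultdict, deque
from typing import Optional


class SlidingWindowLimiter:
    """Allow at most max_requests per client within window_seconds."""

    def __init__(self, max_requests: int = 10, window_seconds: float = 1.0):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        # Per-client timestamps of accepted requests, oldest first.
        self._hits = defaultdict(deque)

    def allow(self, client_ip: str, now: Optional[float] = None) -> bool:
        """Record the request and return True, or return False if over the limit."""
        now = time.monotonic() if now is None else now
        hits = self._hits[client_ip]
        # Drop timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False
        hits.append(now)
        return True


if __name__ == "__main__":
    limiter = SlidingWindowLimiter()
    # Twelve requests within 120 ms from one address: the last two are rejected.
    print([limiter.allow("203.0.113.7", now=i * 0.01) for i in range(12)])
```

The optional now argument exists only to make the behavior easy to test with fixed timestamps; in normal use the limiter reads the monotonic clock itself.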
ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.