vocusbot
Bot User-Agent:vocusbot
๐ค Overview
VocusBot is a web crawler operated by Vocus, a U.S.-based provider of public relations and marketing software (now part of Cision). Its primary purpose is to collect publicly accessible news articles, blog posts, and media content to feed into the Vocus platform for media monitoring, PR analytics, and campaign measurement. The bot is documented in the official Vocus product documentation and is listed in common robots.txt standards as a legitimate automated agent.
๐ Technical Behavior
VocusBot performs HTTP GET requests with a configurable crawl rate, typically one request every few seconds per domain. It follows both internal and external links within a site to index pages that contain news or press-related content. According to publicly available IP ranges used by Vocus (e.g., 204.9.177.0/24 and 208.91.196.0/24 as reported in webmaster forums and the Vocus support site), the bot originates from a limited set of static IP addresses. It supports HTTP/1.1 and respects `Cache-Control` and `Last-Modified` headers to reduce server load. The bot does not execute JavaScript; it only examines raw HTML and text content.
๐ robots.txt Compliance
VocusBot honors the Disallow directives found in `robots.txt`. Vocus official documentation explicitly states that the bot checks for `robots.txt` before crawling each site and will not access paths blocked by those rules. However, if a site does not have a `robots.txt`, the bot will crawl at full speed unless otherwise throttled. Evidence from the Vocus support site (knowledge base article KB-4521) confirms that webmasters can block VocusBot using standard `User-agent: VocusBot` directives.
๐ Detection Indicators
The primary User-Agent string is VocusBot/1.0, optionally followed by the platform version (e.g., `VocusBot/1.0 (+https://www.vocus.com/bot)`). The bot includes the HTTP header `From: [email protected]` on some requests. It does not use common header modifications, making it identifiable by its consistent User-Agent and IP ranges. Behavioral fingerprints include a fixed crawl interval when not throttled and no cookie support.
๐ Data Usage
Collected data is processed by Vocus's proprietary algorithms to extract named entities, sentiment, and publication metadata. This information populates client dashboards that track media coverage, share of voice, and PR campaign effectiveness. Data is not shared externally or used for AI training; it is exclusively retained within the Vocus platform for client analytics. According to the Vocus privacy policy, crawled content may be cached for up to 90 days for deduplication purposes.
โ๏ธ Rate Limiting Policy
Even though VocusBot is a legitimate agent, it can generate heavy request volumes when allowed to crawl unrestricted. Rate limiting is applied with a threshold of 5 requests per second per IP to prevent degradation of web application performance for other users, aligning with standard defensive practices for any high-frequency crawler.
Similar Threats
๐ก๏ธ
Stop Bots. Save Bandwidth. Protect Revenue.
Boteraser automatically detects and blocks unwanted bots โ protecting your site from scrapers, DDoS bursts, and credential stuffing attacks without slowing down real visitors.
โ Start Free ProtectionSetup takes under a minute ยท Free trial available
ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.