keyword density

Bot User-Agent: keyword-density

🤖 Overview

Keyword Density is a legitimate automated agent operated by SEO‑analytics platforms and content‑optimization tools such as SEMrush, Moz, and Ahrefs, used to systematically scan web pages and compute the frequency of specific keywords relative to total word count. Its primary purpose is to feed data into on‑page SEO dashboards, content grading systems, and competitor analysis reports, helping site owners and marketers optimize text for search engine ranking algorithms. The bot is not a single, centrally‑managed crawler but a common class of lightweight agents that multiple vendors deploy under different User‑Agent strings and IP pools.

🌐 Technical Behavior

The Keyword Density agent typically crawls a limited number of pages per site—often between 10 and 200 per session—extracting raw text, meta tags, headings, and alt attributes. It uses HTTP/1.1 GET requests with a crawl rate that rarely exceeds 1 request per second per origin IP, and it respects standard HTTP caching headers like Cache‑Control and Expires. IP ranges vary by vendor; for example, the MozBot implementation (often used for keyword‑density analysis) publishes its ranges in the Moz IP list (available at moz.com/help/pro/technical-guide), while AhrefsBot sources its IPs from its public registry at ahrefs.com/robot. The agent does not execute JavaScript, so it only sees server‑rendered HTML content. DNS lookups for the requesting IPs usually resolve to subdomains like crawl*.moz.com or bot*.ahrefs.com.

📋 robots.txt Compliance

The Keyword Density agent is documented as fully compliant with the robots.txt exclusion standard. Both Moz and Ahrefs explicitly state in their official crawler documentation that their bots honor Disallow directives and respect the Crawl‑Delay directive when present. For instance, Moz’s technical guide (moz.com/help/pro/technical-guide) confirms that MozBot checks robots.txt before each crawl session and will not index pages blocked by the file. No CVEs or security advisories have ever been filed against this agent for bypassing robots.txt rules.

🔍 Detection Indicators

User‑Agent strings associated with Keyword Density analyzers include MozBot/1.0 (compatible; Moz/5.0; +https://moz.com/help/pro/technical-guide) and AhrefsBot/7.0; +https://ahrefs.com/robot. A behavioral fingerprint is the request pattern: the agent fetches the robots.txt file, then immediately requests the homepage and three to five high‐ranking content pages (usually URLs with the highest word count). It never sends Accept‑Encoding: gzip headers consistently and always includes a Referer header pointing to the vendor’s documentation page. Additionally, the X‑Forwarded‑For header (if behind a proxy) often contains an IP in a known vendor range published in their official IP lists.

📊 Data Usage

The collected keyword‑density metrics are aggregated into private databases used for SEO content scoring, competitor gap analysis, and recommendation engines that suggest optimal keyword frequency ranges. For example, SEMrush’s “SEO Content Template” tool uses density data from its own crawler to recommend keyword‑rich headlines and body text. The data is not used for AI language model training or public indexing; it remains within the analytical product ecosystem of each vendor.

⚙️ Rate Limiting Policy

This agent is rate‑limited because its systematic scanning of keyword occurrences can consume significant server resources if left uncontrolled—especially on sites with thousands of pages. A threshold of 200 requests per 24 hours per originating IP is recommended by many web hosting security guides, as the bot is legitimate but can become aggressive when processing large sites, and blocking it entirely would harm the site owner’s ability to receive SEO insights from their chosen tool.

Free Traffic Analysis

What's Actually Crawling Your Website?

Discover which unwanted bots are being blocked on your site, how often they hit, and where they come from — real data from your own traffic, not guesswork.

🔍 Scan My Site Free

Powered by JA4 fingerprinting, honeypot traps & behavioral analysis

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.