whatweb

Bot User-Agent: whatweb

⚠️ Overview

WhatWeb is an open-source web reconnaissance tool originally created by Andrew Horton (alias urbanadventurer) and first released in 2011. It is currently maintained on GitHub under the urbanadventurer/WhatWeb repository, with contributions from dozens of security researchers. While designed for legitimate security assessments, it is widely used by malicious actors for automated fingerprinting of web applications to identify vulnerable content management systems, frameworks, and server software.

🔧 Technical Capabilities

WhatWeb operates by sending HTTP requests to target URLs and analyzing response headers, HTML content, cookies, and JavaScript files to identify over 1,800 distinct web technologies. It leverages a plugin-based architecture where each plugin targets a specific technology, such as WordPress, Joomla, Apache, Nginx, or jQuery. The tool supports multiple scanning modes including aggressive, which performs additional requests to extract version numbers and configuration details, and stealth mode, which introduces random delays to evade basic detection. It can also scan entire subnets, follow redirects, and output results in JSON, XML, or plain text formats. By default, WhatWeb uses the User-Agent string “WhatWeb/0.5.5” (or the current version), making it trivially identifiable unless the operator overrides it. The tool does not exploit vulnerabilities itself but provides critical intelligence for subsequent attacks, such as SQL injection or remote code execution tailored to the identified technology stack.

📜 History & Notable Incidents

WhatWeb was first released in 2011 and quickly became a staple in penetration testing toolkits like Kali Linux. It has been referenced in multiple cybersecurity incident reports as a reconnaissance tool used in advanced persistent threat (APT) operations, particularly during the initial footprinting phase of attacks against government and educational websites. No CVEs are assigned to WhatWeb itself because it is a passive scanner, but it has been observed in the wild as part of automated scanning campaigns that precede ransomware deployments, such as the LockBit and Conti groups using it to identify outdated WordPress plugins. The tool’s source code at https://github.com/urbanadventurer/WhatWeb has been forked over 2,000 times, indicating its widespread adoption.

🔍 Detection Indicators

The primary detection indicator for WhatWeb is its default User-Agent string: “WhatWeb/0.5.5” or “WhatWeb/0.4.9” for older versions. However, advanced users can modify this, so secondary behavioral fingerprints include rapid sequential requests to common paths like /wp-admin, /admin, /robots.txt, and /readme.html, combined with unusual Accept headers that request multiple content types. Traffic from WhatWeb often exhibits a pattern of requesting a large number of distinct URLs within a short timeframe, each with a unique set of HTTP headers designed to elicit technology-specific responses. Network intrusion detection systems (NIDS) can create signatures for the default User-Agent and for the plugin-specific URL patterns, such as query parameters like ?q=wordpress.

☠️ Risk & Impact

While WhatWeb does not directly exploit vulnerabilities, its reconnaissance capabilities enable attackers to build a detailed and accurate inventory of target technologies. This information can then be cross-referenced with public vulnerability databases to select precise attack vectors, leading to data breaches, defacements, or ransomware infections. For web applications running unpatched CMS versions, the presence of a WhatWeb scan often precedes a successful compromise, as demonstrated in numerous incident response case studies.

🛡️ Mitigation

WhatWeb is blocked immediately upon detection because it provides attackers with a free and highly detailed map of the target’s software stack, drastically reducing the time and effort needed to launch a successful exploit. Organizations should deploy web application firewalls (WAF) that filter requests containing the default User-Agent string and implement rate limiting to slow down automated scanning tools. Additionally, regular patching of identified technologies reduces the value of the intelligence gathered by WhatWeb.

Free Bot Analysis

Is Your Site Under Bot Attack Right Now?

Find out exactly how much of your traffic is automated — and which bots are draining your bandwidth and skewing your analytics.

Run Free Bot Scan →

No credit card required  ·  Results in minutes

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.