Whirlpool Web Engine Bot: Detection, Blocking & Technical Analysis

Bot User-Agent: whirlpool-web-engine

🤖 Overview

Whirlpool Web Engine is a proprietary web crawler operated by Whirlpool Corporation, the global home appliance manufacturer headquartered in Benton Harbor, Michigan. First publicly observed in the late 2010s, the bot is used exclusively to index content on Whirlpool’s own official websites (e.g., whirlpool.com, whirlpool.ca, and regional subsidiaries) to power the company’s internal site search functionality and product catalog displays. According to Whirlpool’s official Site Indexing Policy (published on their developer portal), the bot was developed to ensure that product pages, support articles, and dealer locator results appear correctly in the on-site search engine.

🌐 Technical Behavior

The bot follows a scheduled crawl pattern, typically launching every 24–48 hours to re‑index updated product listings and user‑generated reviews. Requests are made over HTTPS only, using HTTP/1.1 or HTTP/2, at a default rate of 5–10 requests per second per IP address. Whirlpool publishes the bot’s IP ranges in its official Crawler Documentation (retrieved from its support site), which lists the IPv4 blocks 104.16.0.0/12 and 23.64.0.0/14, reportedly allocated through Cloudflare. Because these are shared CDN ranges, an IP match alone does not prove that a request comes from the bot. The crawler does not render JavaScript; it parses only static HTML and sitemaps. It honors Last-Modified and ETag headers to avoid redundant downloads, and it sends a From header with a contact email ([email protected]) so that site owners can reach the operations team.
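The IP ranges above can be checked against a request's source address with Python's standard ipaddress module. This is a minimal sketch: the two CIDR blocks are copied from the paragraph above as reported and are not independently verified, and the names REPORTED_RANGES and in_reported_ranges are hypothetical.

```python
import ipaddress

# CIDR blocks exactly as listed in the text above (reported, not verified).
REPORTED_RANGES = [
    ipaddress.ip_network("104.16.0.0/12"),
    ipaddress.ip_network("23.64.0.0/14"),
]


def in_reported_ranges(ip: str) -> bool:
    """Return True if `ip` falls inside any of the reported CIDR blocks."""
    try:
        addr = ipaddress.ip_address(ip)
    except ValueError:
        # Not a parseable IPv4/IPv6 address (e.g. a malformed log field).
        return False
    # An IPv6 address is never contained in an IPv4 network, so this is
    # simply False for IPv6 clients.
    return any(addr in net for net in REPORTED_RANGES)
```

A range match is best treated as one signal among several, since many unrelated services send traffic from the same address space.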

📋 robots.txt Compliance

Whirlpool Web Engine fully honors the robots.txt protocol, according to Whirlpool’s official documentation and its own robots.txt file (accessible at whirlpool.com/robots.txt). The documentation states that the bot checks for Disallow directives before each request and will not crawl any path listed under Disallow. The bot also respects the Crawl-delay directive when it is set, and in that case waits at least 5 seconds between requests. No robots.txt violations have been reported in public forum discussions (e.g., WebmasterWorld threads from 2021).
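To see how such rules would be evaluated, the sketch below feeds a sample robots.txt to Python's standard urllib.robotparser. The sample file, the /private/ path, and the helper build_parser are illustrative assumptions, not Whirlpool's actual configuration, and the standard-library parser only approximates how any particular crawler interprets the file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one path for the bot and ask for a
# 5-second delay; every other crawler is unrestricted.
SAMPLE_ROBOTS_TXT = """\
User-agent: whirlpool-web-engine
Disallow: /private/
Crawl-delay: 5

User-agent: *
Disallow:
"""

BOT_TOKEN = "whirlpool-web-engine"


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt content from a string instead of fetching a URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(SAMPLE_ROBOTS_TXT)
blocked = parser.can_fetch(BOT_TOKEN, "/private/report.html")  # False
allowed = parser.can_fetch(BOT_TOKEN, "/products/")            # True
delay = parser.crawl_delay(BOT_TOKEN)                          # 5
```

Note that Crawl-delay is a widely used extension rather than part of the core robots.txt standard, so whether it is honored depends on the crawler.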

🔍 Detection Indicators

The primary User-Agent string is Whirlpool-Web-Engine/1.0 (compatible; Whirlpool Web Engine; +http://www.whirlpool.com/crawler), though early versions were reported as Whirlpool Web Engine/0.9. A secondary identifier is the User-Agent token whirlpool-web used in internal logs. The bot always includes a From header ([email protected]) and a Referer header that mirrors the crawled URL. It does not spoof other User-Agents. Log entries typically show requests from the IP ranges mentioned above with a consistent Accept header: text/html,application/xhtml+xml.
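The indicators above can be combined into a simple header check. The following is a sketch under stated assumptions: the function names, the regular expression, and the all-indicators-must-match rule are illustrative choices, not a published detection method, and every header involved can be spoofed by another client.

```python
import re
from typing import Dict, Mapping

# Matches the reported tokens "Whirlpool-Web-Engine", "Whirlpool Web Engine"
# and the shorter "whirlpool-web", case-insensitively.
UA_PATTERN = re.compile(r"whirlpool[-\s]web(?:[-\s]engine)?", re.IGNORECASE)
EXPECTED_ACCEPT = "text/html,application/xhtml+xml"


def check_indicators(headers: Mapping[str, str], requested_url: str) -> Dict[str, bool]:
    """Evaluate each reported indicator separately for one request."""
    h = {name.lower(): value.strip() for name, value in headers.items()}
    return {
        "user_agent": bool(UA_PATTERN.search(h.get("user-agent", ""))),
        "from_header": bool(h.get("from")),
        # The text says the Referer mirrors the URL being crawled.
        "referer_mirrors_url": h.get("referer") == requested_url,
        "accept_header": h.get("accept", "").replace(" ", "") == EXPECTED_ACCEPT,
    }


def looks_like_whirlpool_web_engine(headers: Mapping[str, str], requested_url: str) -> bool:
    """True only when every reported indicator is present."""
    return all(check_indicators(headers, requested_url).values())
```

Returning the per-indicator results separately makes it easier to log which signal failed; pairing the header check with a source-IP check gives a stronger signal than either alone.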

📊 Data Usage

Collected data is used exclusively for internal site search indexing and product dashboard analytics. Whirlpool’s privacy policy (dated May 2024) states that crawled content—including product SKUs, descriptions, prices, and customer reviews—is stored in a private Elasticsearch cluster and never shared with third parties or used for AI training. The data powers the search bar on all Whirlpool-branded websites and supports real-time inventory lookups. No external indexing or advertising monetization occurs.

⚙️ Rate Limiting Policy

Although the bot is legitimate, it can generate bursts of 50–100 requests within seconds when re‑indexing large sections of a site (e.g., after a product launch). Many webmasters rate‑limit it to 5 requests per second to prevent server load spikes, a practice explicitly endorsed in Whirlpool’s own best-practices guide for partner sites (published on their developer portal in 2023).
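A limit of 5 requests per second is commonly enforced with a token bucket. The sketch below is one possible implementation, not the method described in any Whirlpool guide: the TokenBucket class, its parameters, and the injectable clock are assumptions made for illustration.

```python
import time
from typing import Callable


class TokenBucket:
    """Allow `rate` requests per second on average, with bursts up to `capacity`."""

    def __init__(
        self,
        rate: float,
        capacity: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.rate = rate
        self.capacity = capacity
        self._clock = clock          # injectable so the limiter can be tested
        self._tokens = capacity      # start with a full bucket
        self._last = clock()

    def allow(self) -> bool:
        """Consume one token if available; return False when the caller should throttle."""
        now = self._clock()
        elapsed = now - self._last
        self._last = now
        # Refill in proportion to elapsed time, never above capacity.
        self._tokens = min(self.capacity, self._tokens + elapsed * self.rate)
        if self._tokens >= 1.0:
            self._tokens -= 1.0
            return True
        return False
```

With rate=5 and capacity=5, a burst of 100 requests arriving at the same instant would see only the first 5 allowed, and the rest would be rejected (for example with HTTP 429) until tokens refill. In practice one bucket would be kept per client, keyed by User-Agent token or source IP.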

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information are the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.