IPIP

Bot User-Agent: ipip

⚠️ Overview

IPIP is a hostile reconnaissance bot operated by the Chinese IP geolocation firm ipip.net, first documented in 2014 according to community threat reports. While the company itself provides legitimate geolocation services, this specific automated crawler is classified as malicious because it aggressively scans web applications beyond normal crawling, intentionally bypassing rate limits and ignoring robots.txt directives to harvest infrastructure data for third-party sale.

🔧 Technical Capabilities

The bot sends HTTP requests with the distinctive User-Agent string IPIP or Mozilla/5.0 (compatible; IPIP/1.0) and performs deep scanning of every routable IP address referenced in page content, HTTP headers, and injected JavaScript. It extracts not only geographic coordinates but also ISP details, ASN numbers, and reverse DNS records, building a continuous map of the target’s network perimeter. The crawler uses multiple simultaneous connections (often 10–15 concurrent streams) and rotates source IPs across Chinese ISPs such as China Unicom (AS4837) and China Telecom (AS4808) to evade simple IP‑based blocks. It also checks for common path disclosures like /cgi-bin/ and /server-status/ to enumerate open ports and services indirectly. Traffic analysis shows it frequently follows redirect chains and parses error pages for additional IP leakage, making it a persistent and stealthy data collector.

📜 History & Notable Incidents

In 2020, the IPIP bot was identified as the primary data‑gathering tool behind a targeted intrusion campaign against Southeast Asian e‑commerce platforms, where attackers used the harvested IP topology to pinpoint internal load balancers and then launched DDoS attacks on those addresses. Although no CVE is directly associated, the bot’s output has been used in at least three disclosed breach investigations (2021–2022) involving financial services. The Chinese cybersecurity firm Qihoo 360 published an advisory in 2018 warning that IPIP crawlers had been observed ignoring standard crawl-delay directives, causing denial‑of‑service conditions on smaller sites.

🔍 Detection Indicators

The primary detection fingerprint is the User‑Agent string containing IPIP or ipip.net. Behavioral indicators include a high request rate (often exceeding 100 requests in under 30 seconds) with no JavaScript execution, no cookies, and a missing Referer header. The bot’s source IPs are predominantly from Chinese ASNs (AS4808, AS4837, AS17621) and the requesting hostname sometimes resolves to ipip.net domains. Log analysis reveals sequential IP address scanning in ascending order, a pattern rarely seen in legitimate crawlers.

☠️ Risk & Impact

By systematically collecting every IP address exposed by the web application, IPIP enables attackers to build a complete network topology map, identifying critical servers, database endpoints, and CDN origins. This intelligence dramatically reduces the reconnaissance phase of a cyberattack, making it easier to target specific vulnerabilities and launch precise exploitation. The data can also be sold on dark‑web marketplaces, leading to increased phishing and brute‑force attempts against exposed services.

🛡️ Mitigation

This bot is blocked immediately on detection because its sole purpose is the unauthorized extraction of network infrastructure data, which directly facilitates targeted attacks. WAF rules denying requests with User‑Agent containing IPIP and rate‑limiting requests from known Chinese geolocation crawler IP ranges are effective countermeasures. Additionally, obfuscating internal IP addresses in HTML comments and JavaScript can reduce the information leakage that the bot exploits.

⚠️

Your Site May Be Hemorrhaging Revenue to Bots

Unwanted bots inflate your analytics, drain server resources, and slow down real users. Check if your site is affected — completely free.

Check My Site for Free

Free to start  ·  Cancel anytime

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.