coccocbot

Bot User-Agent: coccocbot

🤖 Overview

coccocbot is a web crawler operated by Coc Coc, the Vietnamese technology company behind the Coc Coc search engine and web browser. First deployed around 2013, this crawler systematically indexes publicly available web content to provide relevant search results for Coc Coc's millions of users, primarily in Vietnam and Southeast Asia. It is maintained by Coc Coc's engineering team and follows industry-standard crawling practices.

🌐 Technical Behavior

coccocbot sends HTTP requests over both IPv4 and IPv6, using HTTP/1.1 and HTTP/2. It respects robots.txt, including the Disallow and Crawl-Delay directives. The default crawl rate is one request per second, adjustable via Crawl-Delay. It supports conditional requests using If-Modified-Since and ETag, and it obeys noindex meta tags and the X-Robots-Tag header. Requests originate from IP ranges such as 202.43.0.0/19 and 103.28.36.0/22, which geolocate to Vietnam. Each request carries a From header set to [email protected] and the User-Agent "Mozilla/5.0 (compatible; coccocbot/1.0; +http://help.coccoc.com/coccocbot/)". The crawler also follows sitemaps and canonical tags.
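To illustrate the conditional-request behavior described above, here is a minimal server-side sketch of how an origin might decide between 200 and 304. The function name `conditional_status` and the plain-dict header format are hypothetical, and the sketch assumes the crawler echoes a stored ETag back in the standard If-None-Match request header. It compares ETags as exact strings and does not handle weak validators.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def conditional_status(request_headers: dict, etag: str, last_modified: datetime) -> int:
    """Return 304 if the client's cached copy still looks valid, else 200.

    Hypothetical helper: `request_headers` is a plain dict of header names to values,
    and `last_modified` is a timezone-aware datetime for the resource.
    """
    if_none_match = request_headers.get("If-None-Match")
    if if_none_match is not None:
        # If-None-Match takes precedence over If-Modified-Since when both are sent.
        candidates = [token.strip() for token in if_none_match.split(",")]
        return 304 if etag in candidates or "*" in candidates else 200

    since = request_headers.get("If-Modified-Since")
    if since is not None:
        try:
            since_dt = parsedate_to_datetime(since)
            # HTTP dates have one-second resolution, so drop microseconds first.
            if last_modified.replace(microsecond=0) <= since_dt:
                return 304
        except (TypeError, ValueError):
            # Unparseable or timezone-less date: fall back to a full response.
            return 200
    return 200


modified = datetime(2015, 10, 21, 7, 28, 0, tzinfo=timezone.utc)
print(conditional_status({"If-None-Match": '"abc"'}, '"abc"', modified))
print(conditional_status({"If-Modified-Since": "Wed, 21 Oct 2015 07:28:00 GMT"}, '"abc"', modified))
```

Answering with 304 where possible saves bandwidth on pages the crawler has already fetched.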

📋 robots.txt Compliance

Coc Coc's official documentation at help.coccoc.com confirms that coccocbot fully adheres to the Robots Exclusion Protocol. Webmasters can block the crawler entirely by adding "User-agent: coccocbot" followed by "Disallow: /" to their robots.txt. The crawler also supports partial disallows and respects Crawl-Delay directives. It caches robots.txt for up to 24 hours.
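The rules described above can be checked locally before deployment. The sketch below uses Python's standard `urllib.robotparser` to evaluate a sample robots.txt against the `coccocbot` token. The `/private/` path, the 5-second delay, and the `example.com` URLs are made-up values for illustration, and the parser shows how Python interprets the file, which may differ in detail from how Coc Coc's crawler does.

```python
from urllib.robotparser import RobotFileParser

# Sample robots.txt: a partial disallow plus a Crawl-delay for coccocbot only.
# The path and delay are illustrative assumptions, not recommended settings.
ROBOTS_TXT = """\
User-agent: coccocbot
Crawl-delay: 5
Disallow: /private/

User-agent: *
Disallow:
"""


def parse_rules(text: str) -> RobotFileParser:
    """Build a parser from robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = parse_rules(ROBOTS_TXT)
print(rules.can_fetch("coccocbot", "https://example.com/private/report.html"))  # False
print(rules.can_fetch("coccocbot", "https://example.com/blog/post.html"))       # True
print(rules.crawl_delay("coccocbot"))                                           # 5
```

To block the crawler entirely, replace the coccocbot group's `Disallow: /private/` line with `Disallow: /`, as described above.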

🔍 Detection Indicators

The primary User-Agent string is "Mozilla/5.0 (compatible; coccocbot/1.0; +http://help.coccoc.com/coccocbot/)", though a simpler "coccocbot/1.0" variant may appear. Behavioral fingerprints include a steady request rate of about one per second, no JavaScript execution, and the presence of a "From" header. All requests originate from Vietnamese IP addresses within the ranges listed under Technical Behavior (for example 202.43.0.0/19 and 103.28.36.0/22). The crawler does not set cookies or maintain session state, which makes it easy to distinguish from human traffic.
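As a rough sketch of how these indicators might be combined, the function below treats a request as coccocbot only when the User-Agent contains the bot token and the source address falls inside one of the ranges quoted in this profile. The name `looks_like_coccocbot` is hypothetical, and the range list is an assumption taken from this page and may be incomplete or out of date, so this is a heuristic and not proof of identity.

```python
import ipaddress

# Ranges quoted in this profile. Assumption: they may be incomplete or outdated.
QUOTED_RANGES = [
    ipaddress.ip_network("202.43.0.0/19"),
    ipaddress.ip_network("103.28.36.0/22"),
]


def looks_like_coccocbot(user_agent: str, remote_ip: str) -> bool:
    """Heuristic check: the UA contains 'coccocbot' and the IP is in a quoted range."""
    if "coccocbot" not in user_agent.lower():
        return False
    try:
        addr = ipaddress.ip_address(remote_ip)
    except ValueError:
        # Malformed address in the log line: treat as not matching.
        return False
    return any(addr in network for network in QUOTED_RANGES)


ua = "Mozilla/5.0 (compatible; coccocbot/1.0; +http://help.coccoc.com/coccocbot/)"
print(looks_like_coccocbot(ua, "202.43.31.255"))  # True: inside 202.43.0.0/19
print(looks_like_coccocbot(ua, "202.43.32.1"))    # False: just outside the /19
```

A request that presents the coccocbot User-Agent from an address outside these ranges may be a spoofed bot and deserves closer inspection.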

📊 Data Usage

Data collected by coccocbot is used exclusively for Coc Coc's search engine indexing. Crawled pages are processed to generate snippets, build inverted indexes, and detect duplicate content. According to Coc Coc's privacy policy, no crawled data is used for AI training, advertising, or sold to third parties. The indexed content powers search results delivered to Coc Coc browser users.

⚙️ Rate Limiting Policy

Although coccocbot is a legitimate crawler, administrators may still rate-limit it to prevent server overload. According to Coc Coc's webmaster guidelines, threshold-based blocking is appropriate if the crawler exceeds 10 requests per second even though a Crawl-Delay directive is in place. This balances efficient indexing with server protection.
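One way to implement the threshold check is a sliding-window counter over request timestamps. The sketch below is a minimal in-memory version. The class name `RateMonitor` and its parameters are hypothetical, and a production setup would more likely rely on the rate-limiting features of the web server or firewall.

```python
from collections import deque


class RateMonitor:
    """Sliding-window counter that flags a client exceeding a request threshold."""

    def __init__(self, max_requests: int = 10, window_seconds: float = 1.0):
        self.max_requests = max_requests
        self.window = window_seconds
        self._hits = deque()  # timestamps of recent requests, oldest first

    def record(self, timestamp: float) -> bool:
        """Record one request; return True if the rate now exceeds the threshold."""
        self._hits.append(timestamp)
        # Drop requests that have fallen out of the window ending at `timestamp`.
        while self._hits and self._hits[0] <= timestamp - self.window:
            self._hits.popleft()
        return len(self._hits) > self.max_requests


monitor = RateMonitor(max_requests=10, window_seconds=1.0)
# Ten requests spread over 0.9 seconds stay at the threshold, not above it.
flags = [monitor.record(i / 10) for i in range(10)]
print(any(flags))            # False
print(monitor.record(0.95))  # True: eleventh request inside the same second
```

Keeping one monitor per client IP lets the server block only the address that is misbehaving.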

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.