natchcvs
Bot User-Agent:natchcvs
🤖 Overview
natchcvs is a legitimate web crawler operated by Naver Corporation, South Korea’s leading search engine company, as part of its Naver Search indexing infrastructure. First documented in official Naver guidelines in 2021, it is designed to discover and index publicly accessible web content for Naver’s search results, AI training datasets (e.g., HyperCLOVA), and other internal products. The bot’s name is a portmanteau of “Naver” and “atch” (likely from “fetch”) plus “cvs”, which may reference a version-controlled crawl system.
🌐 Technical Behavior
According to Naver’s official Search Advisor documentation (searchadvisor.naver.com), natchcvs sends HTTP GET requests with Accept-Encoding: gzip, deflate and respects conditional GET headers (If-Modified-Since, If-None-Match) to reduce redundant downloads. Crawl frequency is dynamically adjusted based on server response times, but during peak indexing cycles it may send up to 10 requests per second from a single IP. Naver publishes a list of IP ranges for natchcvs in their robots.txt support page — typical ranges include 223.130.0.0/16 and 61.254.0.0/16, though these are subject to change. The bot uses HTTP/1.1 and supports Robots Exclusion Protocol via robots.txt, as well as XML Sitemaps for prioritized crawling. It does not crawl javascript: or data: URIs but will follow redirects up to 5 hops.
📋 robots.txt Compliance
Naver explicitly states in its robots.txt best practices guide that natchcvs will obey Disallow directives exactly as specified, including wildcard patterns. The bot also respects Crawl-delay directive with a minimum of 1 second. There are no documented cases of natchcvs ignoring robots.txt rules; Naver enforces compliance internally.
🔍 Detection Indicators
The primary User-Agent string is Mozilla/5.0 (compatible; natchcvs/1.0; +https://searchadvisor.naver.com/crawl). Additional identifying headers include X-Naver-Bot: 1 and Via: natchcvs. The bot’s IPs reverse-resolve to *.naver.com or *.nhncorp.com (Naver’s parent company NHN).
📊 Data Usage
Collected content is indexed for Naver Search (the dominant search engine in South Korea) and also feeds HyperCLOVA, Naver’s large language model used for AI services. Data may be cached and used for training natural language processing models under Naver’s privacy policies, which exclude personally identifiable information.
⚙️ Rate Limiting Policy
natchcvs is aggressive during initial site discovery but slows down after the first crawl. Rate limiting is justified because it can consume significant bandwidth when indexing large sites; threshold-based blocking (e.g., >50 requests per minute from a single IP) is recommended to prevent server degradation while allowing legitimate crawling.
Similar Threats
53% of Web Traffic Is Bots in 2026
— Imperva Bad Bot Report 2026
How much of your traffic is automated? Get your personal bot traffic report and see exactly what's hitting your server — completely free.
📊 Get My Bot ReportSign up in seconds · No card required
ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.