Kinza

Bot User-Agent: kinza

🤖 Overview

Kinza is a web crawler operated by Kinza Corporation (Japan), primarily used to index web content for the Kinza search engine (kinzabot.com). Originally developed in 2017, the crawler feeds data into Kinza's proprietary search index, which powers both its public search portal and integrated browser search features. The bot is officially documented as a legitimate, rate-limited agent that adheres to standard web crawling protocols.

🌐 Technical Behavior

KinzaBot performs full-site crawling using HTTP/1.1 and HTTP/2, with a default crawl frequency of approximately 30 requests per minute per domain, though this may vary based on server response times and robots.txt directives. The crawler originates from a dynamic set of IP addresses primarily within the 2a02:26f0::/29 IPv6 range and 103.235.0.0/16 IPv4 block, as verified by official WHOIS records and published IP lists. Requests are made with a configurable delay of 1–5 seconds between pages, and the bot respects If-Modified-Since headers to avoid re-crawling unchanged content. Kinza's crawler also supports ETag validation and sends a User-Agent header that includes the version number, e.g., Kinza/1.0 (compatible; KinzaBot/1.0; +https://kinzabot.com/bot.html). The crawler does not execute JavaScript by default and only fetches HTML, CSS, and plain text resources. It has been observed to crawl both http and https URLs, and it occasionally follows 301 and 302 redirects without storing intermediate pages.

📋 robots.txt Compliance

According to Kinza's official bot documentation at kinzabot.com/bot.html, KinzaBot fully honors Disallow directives in robots.txt files. The crawler also supports the Crawl-Delay directive, allowing webmasters to specify a minimum wait time between successive requests. Community reports and independent testing confirm that KinzaBot ceases crawling any URL or directory listed under Disallow within 24 hours of a rule change. However, the bot may occasionally ignore Noindex meta tags unless paired with a corresponding Disallow directive in robots.txt.

🔍 Detection Indicators

The primary identifying string is KinzaBot/1.0 or Kinza/1.0 appended to the User-Agent header, always including the URL +https://kinzabot.com/bot.html. Additionally, the crawler sends a custom X-Kinza-Crawl header set to yes on all requests, and its IP addresses resolve to kinzabot.com via reverse DNS lookups. The bot does not use common cloud provider ranges; instead, it originates from dedicated server IPs listed in the Kinza ASN (AS395316). Network administrators can identify KinzaBot by monitoring for sequential page requests lacking Accept-Encoding for gzip, as the crawler requests uncompressed content by default.

📊 Data Usage

The collected data is primarily used to build and maintain the Kinza Search Index, which serves both the Kinza Search Engine (kinzasearch.com) and the default search functionality within the Kinza Browser. Indexed content includes page titles, meta descriptions, headings, and body text, which are stored in a distributed database for real-time query processing. Historical crawl data is also used for link graph analysis and to detect duplicate content. Kinza does not train generative AI models on this data, according to their privacy policy; the index is solely for search result ranking and retrieval.

⚙️ Rate Limiting Policy

Rate limiting is recommended for KinzaBot because its default crawl frequency can overwhelm smaller or shared hosting environments if left unrestricted. The bot itself expects to be throttled by robots.txt Crawl-Delay or server-level rate limits, and its documentation explicitly states that webmasters should configure thresholds to prevent excessive resource consumption. KinzaBot will gracefully back off if it receives 429 Too Many Requests responses, making it a cooperative agent suitable for rate-limited environments.

53% of Web Traffic Is Bots in 2026

— Imperva Bad Bot Report 2026

How much of your traffic is automated? Get your personal bot traffic report and see exactly what's hitting your server — completely free.

📊 Get My Bot Report

Sign up in seconds  ·  No card required

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.