riight

Bot User-Agent: riight

🤖 Overview

riight is a legitimate web crawler operated by Riight B.V., a European search engine company headquartered in the Netherlands. First publicly documented in 2022, its primary purpose is to index publicly accessible web pages to feed the Riight Search Engine, which emphasizes privacy, unbiased results, and no personalized tracking. The bot collects content for search result display and does not use data for AI training.

🌐 Technical Behavior

Riightbot (also identified as Riight/1.0) issues HTTP/1.1 GET requests with standard headers and waits at least 1 second between requests, as documented in its official robots.txt compliance statement at riight.com/crawler. Requests originate from a pool of IP addresses located primarily in the European Union, with blocks aggregated under ASN AS60404 (Riight B.V.). Crawling frequency varies by site authority but typically stays under 50 requests per domain per hour to avoid overloading servers. The crawler follows sitemap.xml directives and indexes text, HTML, and structured data such as Schema.org markup.
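
Because the crawler is described as operating from a known address pool, one way to corroborate a request is to check its source IP against a list of crawler networks. The sketch below is a minimal illustration in Python: the CIDR blocks are placeholders taken from the RFC 5737 documentation ranges, not real Riight addresses, and the function name is hypothetical. The actual prefixes announced under AS60404 would have to be obtained from a routing registry or from the operator.

```python
import ipaddress

# ASSUMPTION: placeholder networks (RFC 5737 documentation ranges).
# Replace with the prefixes actually published for the crawler.
PLACEHOLDER_CRAWLER_NETWORKS = [
    ipaddress.ip_network("192.0.2.0/24"),
    ipaddress.ip_network("198.51.100.0/24"),
]


def ip_in_crawler_ranges(ip, networks=PLACEHOLDER_CRAWLER_NETWORKS):
    """Return True if `ip` parses as an address inside one of `networks`."""
    try:
        addr = ipaddress.ip_address(ip)
    except ValueError:
        # Malformed input is treated as "not a crawler address".
        return False
    return any(addr in net for net in networks)
```

A request whose User-Agent claims to be Riightbot but whose address falls outside the published ranges is more likely to be an impersonator than the real crawler.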

📋 robots.txt Compliance

Riightbot fully honors Disallow directives in robots.txt, including wildcard and path-based rules, as confirmed by its public documentation at riight.com/robots-txt. It also supports the Crawl-Delay directive, allowing webmasters to set a minimum interval between requests. No known violations or complaints have been recorded on major webmaster forums or security advisories.
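
To illustrate how such rules are evaluated, the sketch below parses a sample robots.txt with Python's standard `urllib.robotparser` and asks what a crawler identifying as Riightbot may fetch. The robots.txt content, the `/private/` path, the 5-second delay, and the example.com URLs are all invented for illustration. The standard-library parser does plain prefix matching only, so wildcard rules are deliberately left out of this example.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: an illustrative robots.txt, not taken from any real site.
ROBOTS_TXT = """\
User-agent: Riightbot
Disallow: /private/
Crawl-delay: 5

User-agent: *
Disallow:
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Paths outside /private/ are allowed for Riightbot; paths inside are not.
allowed_public = parser.can_fetch("Riightbot", "https://example.com/blog/post")
allowed_private = parser.can_fetch("Riightbot", "https://example.com/private/report")

# The requested minimum interval between requests, in seconds.
delay = parser.crawl_delay("Riightbot")
```

This only shows what a compliant crawler should conclude from the file; whether a given crawler actually behaves that way has to be checked against server logs.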

🔍 Detection Indicators

The primary User-Agent token is Riightbot/1.0 (or Riight/1.0), often embedded in a browser-style string such as Mozilla/5.0 (compatible; Riightbot/1.0; +https://riight.com/bot). Behavioral fingerprints include a consistent HTTP Referer header set to https://riight.com/ and a default Accept-Language of en-US,en;q=0.9. No geographic variation in source IPs has been observed; all observed requests originated from the Netherlands.
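
A simple first-pass check on the User-Agent header can be sketched as follows. This is a minimal example in Python, assuming only the tokens listed above; the pattern and function name are hypothetical. User-Agent strings are trivially spoofed, so a match should be treated as a hint rather than proof of identity.

```python
import re

# Matches the "Riight" or "Riightbot" token as a whole word, case-insensitively.
RIIGHT_TOKEN = re.compile(r"\briight(?:bot)?\b", re.IGNORECASE)


def is_riight_user_agent(user_agent):
    """Return True if the header value contains a Riight crawler token."""
    return bool(RIIGHT_TOKEN.search(user_agent or ""))
```

For example, `is_riight_user_agent("Mozilla/5.0 (compatible; Riightbot/1.0; +https://riight.com/bot)")` matches, while an ordinary browser string does not.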

📊 Data Usage

All collected content is exclusively used for building and updating the Riight Search Engine index, which delivers organic search results without tracking user behavior or generating ad revenue. The bot does not scrape for machine learning training, large language models, or any secondary commercial purpose. Data retention policies are aligned with GDPR, with crawled pages stored temporarily for indexing and removed upon site owner request.

⚙️ Rate Limiting Policy

Most web applications rate-limit Riightbot because it crawls continuously from a relatively small IP range (under 50 addresses), and even its default crawl delay can generate noticeable load on high-traffic sites. The recommended threshold is 100 requests per hour per IP; requests beyond that receive a 429 Too Many Requests response, balancing legitimate indexing needs against server protection.
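
A threshold of this kind can be sketched as a per-IP sliding-window counter. The example below is a minimal in-memory illustration in Python, not a production implementation: the class and method names are hypothetical, state is lost on restart, and a real deployment would usually keep counters in a shared store.

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most `limit` requests per IP within any `window_seconds` span."""

    def __init__(self, limit=100, window_seconds=3600.0):
        self.limit = limit
        self.window = window_seconds
        self._hits = defaultdict(deque)  # ip -> timestamps of accepted requests

    def status_for(self, ip, now=None):
        """Return the HTTP status to send: 200 if allowed, 429 if over the limit."""
        now = time.monotonic() if now is None else now
        hits = self._hits[ip]
        # Drop timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window:
            hits.popleft()
        if len(hits) >= self.limit:
            return 429
        hits.append(now)
        return 200
```

With the defaults, the first 100 requests from an address within an hour return 200 and the 101st returns 429 until older requests age out of the window. The optional `now` argument exists only to make the behavior easy to test with fixed timestamps.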

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.