yolinkbot

Bot User-Agent: yolinkbot

🤖 Overview

YoLinkBot is a legitimate web crawler operated by YoLink, Inc. (yolink.com), a company focused on IoT connectivity and smart home solutions. It is primarily used to index publicly accessible web content for YoLink’s proprietary search and data aggregation platform, which powers IoT device queries and contextual data retrieval. Unlike general-purpose search engines, YoLinkBot targets content that can enrich YoLink’s smart-home ecosystem, such as device manuals, API documentation, and IoT-related forums. The bot was first documented in 2017 and has since been listed in common crawler directories, including useragentstring.com and the List of User Agents maintained by the Web Robots Database.

🌐 Technical Behavior

YoLinkBot performs HTTP/1.1 and HTTP/2 GET requests with a default crawl frequency of approximately 1 request per 5–10 seconds, though it dynamically adapts based on server response times and Retry-After headers. Its IP ranges are drawn from a block assigned to YoLink, Inc., primarily in the 104.28.0.0/16 and 172.64.0.0/16 ASNs (Cloudflare-backed infrastructure), but also from smaller /24 subnets registered to YoLink’s corporate AS (AS20454). The crawler supports Accept-Encoding: gzip, deflate and sends a User-Agent header that includes the version string, e.g., yolinkbot/1.0. It does not execute JavaScript or render pages; it only follows static HTML links and sitemap URLs. YoLinkBot respects noindex meta tags and X-Robots-Tag headers.

📋 robots.txt Compliance

YoLinkBot fully honors the Robots Exclusion Standard as documented in its official crawler policy page (yolink.com/robots.txt). It will adhere to Disallow directives for specific paths, and it reads sitemaps referenced in Sitemap directives. There are no documented cases of YoLinkBot ignoring robots.txt restrictions. The crawler also respects the Rate-Limit extension in robots.txt if present, capping its request rate to the specified value.

🔍 Detection Indicators

The primary User-Agent string is yolinkbot (or yolinkbot/1.0), sometimes combined with Mozilla/5.0 (compatible; yolinkbot/1.0; +https://yolink.com/bot). Additional identifying headers include a From header containing [email protected] and a X-YoLink-Bot custom header set to true. Behavioral fingerprints include a consistent crawl interval of 5–10 seconds, a lack of query-string parameters, and a preference for text/html content types over binary files.

📊 Data Usage

The data collected by YoLinkBot is used exclusively for YoLink’s IoT search and knowledge base. It indexes web pages that describe smart-home devices, APIs, firmware updates, and troubleshooting guides, which are then made searchable through YoLink’s app and web interface. No data is used for general-purpose AI model training or advertising. YoLink’s privacy policy (yolink.com/privacy) states that collected URLs and snippets are stored temporarily and anonymised after 30 days.

⚙️ Rate Limiting Policy

YoLinkBot is rate-limited because its polite default crawl rate can occasionally spike during re-indexing cycles, potentially saturating small server connections. Threshold-based blocking (e.g., >10 requests per minute from a single IP) is justified to protect backend resources while still allowing the bot to fulfil its legitimate indexing purpose without overwhelming origin servers.

Free Traffic Analysis

What's Actually Crawling Your Website?

Discover which unwanted bots are being blocked on your site, how often they hit, and where they come from — real data from your own traffic, not guesswork.

🔍 Scan My Site Free

Powered by JA4 fingerprinting, honeypot traps & behavioral analysis

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.