Wonderbot

Bot User-Agent: wonderbot

🤖 Overview

Wonderbot is a web crawler operated by Wondercraft AI, a company focused on AI-powered content creation and audio generation. It is designed to collect publicly accessible web content for training their proprietary AI models and improving their text-to-speech and content summarization services.

🌐 Technical Behavior

Wonderbot follows a rate-limited crawl pattern, making requests at intervals of 5-10 seconds per domain to avoid server overload. It primarily crawls HTML pages, RSS feeds, and sitemap.xml files. Based on observed traffic, it operates from IP ranges in the 45.87.80.0/23 block (registered to Wondercraft AI via Hetzner) and uses IPv4 connections with HTTP/1.1 keep-alive. It does not fetch binary assets like images or PDFs, focusing solely on textual content.

📋 robots.txt Compliance

Wonderbot fully honors robots.txt directives. Official documentation from Wondercraft AI (wondercraft.ai/robots) confirms it checks the file before each crawl and respects Disallow rules. It does not cache or bypass the file.

🔍 Detection Indicators

The primary User-Agent string is Wonderbot/1.0 (compatible; +https://wondercraft.ai/wonderbot). Some variations include a version suffix like Wonderbot/1.1. It also sends a custom X-Wonderbot-Request header set to true, as per their developer documentation. No other identifying headers are documented.

📊 Data Usage

Collected data is used exclusively for training Wondercraft’s generative AI models, particularly for improving the naturalness of synthesized voices and content summarization accuracy. The company states it does not sell or share raw crawl data; processed model weights are internal.

⚙️ Rate Limiting Policy

Wonderbot is rate-limited because its requests, even at 5-second intervals, can accumulate across multiple URLs and become aggressive if misconfigured. Threshold-based blocking with a limit of 50 requests per minute per IP is recommended per Wondercraft’s own guidance to protect small servers from unintended load.

53% of Web Traffic Is Bots in 2026

— Imperva Bad Bot Report 2026

How much of your traffic is automated? Get your personal bot traffic report and see exactly what's hitting your server — completely free.

📊 Get My Bot Report

Sign up in seconds  ·  No card required

ⓘ Data Notice: The information presented above has been compiled from publicly available internet sources. Boteraser aggregates this data solely for informational purposes and does not independently classify, evaluate, or endorse any findings about the bots listed. The accuracy and completeness of this information is the sole responsibility of the original publishers. Boteraser and its operators accept no liability for any decisions made based on this data.