3 of 12 AI bots correctly configured · 4 critical issues
Search bots are missing from your allowlist. Two deprecated user-agent strings are doing nothing. Cloudflare is serving a different robots.txt at the CDN. Four issues to address below, roughly 10 minutes of edits.
AI bot matrix
| User-agent | Operator | Role | Status |
|---|---|---|---|
GPTBot | OpenAI | Training | ● Blocked |
OAI-SearchBot | OpenAI | Search | ○ Implicit allow |
ChatGPT-User | OpenAI | User-fetch | ○ Implicit allow |
ClaudeBot | Anthropic | Training | ● Blocked |
Claude-SearchBot | Anthropic | Search | ○ Implicit allow |
Claude-User | Anthropic | User-fetch | ○ Implicit allow |
anthropic-ai | Anthropic (deprecated) | Training | ⊘ Deprecated · noop |
Claude-Web | Anthropic (deprecated) | Training | ⊘ Deprecated · noop |
PerplexityBot | Perplexity | Search | ○ Implicit allow |
Perplexity-User | Perplexity | User-fetch | ○ Implicit allow |
CCBot | Common Crawl | Training | ● Blocked |
Google-Extended | Google AI | Training | ✓ Allowed |
Top 4 issues to fix
- Search bots only implicitly allowed. OAI-SearchBot, Claude-SearchBot, PerplexityBot have no explicit Allow rule. They fall back to the wildcard User-agent: * Allow: /, but operators recommend explicit allowlist for clarity and to override CDN bot management defaults that block on no-rule. Fix: add explicit Allow blocks per the snippet below.
- Two deprecated user-agent strings (anthropic-ai, Claude-Web) doing nothing. Anthropic deprecated both. Your robots.txt has Disallow rules targeting them; the real ClaudeBot ignores those rules. Fix: remove the deprecated entries (purely cosmetic, but the file is also misleading any human auditor reading it).
- Cloudflare is overriding your origin robots.txt. The CDN serves a different file to bot user-agents than to humans, with broader Disallow rules. This is the Cloudflare-managed AI crawler block enabled in Security > Bots. Fix: in Cloudflare dashboard, disable "Manage your robots.txt" so your origin file takes precedence, or move the AI bot rules into Cloudflare directly.
- No /admin/ or /internal/ Disallow rules. Best practice for crypto sites is selective access: allow /docs/, /blog/, /pricing/ but Disallow gated and admin areas. Yours has no Disallow on common gated paths. Fix: add Disallow: /admin/, Disallow: /internal/, Disallow: /api-docs/private/.
Recommended robots.txt for crypto AEO
# ============================================================ # robots.txt for crypto AEO baseline # Block AI training crawlers, allow AI search crawlers # ============================================================ # --- AI search & user-fetch (allow for AEO visibility) --- User-agent: OAI-SearchBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: Claude-SearchBot Allow: / User-agent: Claude-User Allow: / User-agent: PerplexityBot Allow: / User-agent: Perplexity-User Allow: / # --- AI training bulk crawlers (block by default) --- User-agent: GPTBot Disallow: / User-agent: ClaudeBot Disallow: / User-agent: CCBot Disallow: / User-agent: Google-Extended Disallow: / User-agent: Bytespider Disallow: / # --- Default policy + crypto-specific Disallow --- User-agent: * Allow: / Disallow: /admin/ Disallow: /internal/ Disallow: /api-docs/private/ Sitemap: https://example-protocol.xyz/sitemap.xml
Want the full AEO readiness audit, not just robots.txt?
Robots.txt is one of four AI Visibility readiness dimensions. The full audit also covers schema readiness, factual density and authority signals plus 30+ category prompts tested across all 3 LLMs. From $25 one-time per domain.
Output above is representative. Actual checks return the real bot-by-bot status for the domain you submit.
