Audit module · 05 · Classifier

Crypto keyword intent classifier. Six buckets, calibrated for Web3.

Q: What does intent-content mismatch look like in practice?

Common pattern: a landing page targets 'how does liquid staking work' (informational) but the page is built like a product page with a prominent 'Stake Now' CTA above the fold (transactional). Users searching the informational query bounce because they came to learn, not act. The fix is to lead with the explanation, then introduce the product mid-page once the user has the context. Intent mismatches like this are the most common single ranking issue we see across crypto sites.

Generic intent classifiers map crypto queries into 4 SaaS buckets. Web3 actually splits into 6, including investigation (commercial with risk overlay) and disambiguation (token vs protocol vs company).

Try the classifier

6 intent bucketsPer-page intent mapPowered by Gemini AI Analyzer

By the numbers

~30%of crypto queries that generic classifiers tag as "commercial" actually have an investigation overlay (need risk disclosure first).Illustrative figure based on TG3 client audit patterns

15-30%of typical landing pages have a primary keyword whose intent does not match the page content. The single most common ranking issue we see.Illustrative; varies by site

~9sto classify a typical keyword corpus through the Gemini AI Analyzer, in parallel with the seven other audit modules.Median per-corpus classification time

// What the classifier sees

A representative classifier output

Click the example queries to see how the classifier maps each one across the 6 buckets, plus what it recommends as the matching content type.

crypto keyword intent classifierlive

best DeFi yield protocols 2026

Primary intent

Investigation

InvestigationCommercial overlay

User wants to compare yield options but expects risk disclosure first. Pure commercial pages bounce; informational-first pages with a clean "compare" section convert.

Crypto keyword intent in six buckets

Each bucket maps to a different content type, schema combination and CTA pattern. Mismatches between keyword bucket and page content are the most common single ranking issue on crypto sites.

Informational

User wants to learn. No commercial intent. Generic classifiers handle this bucket well; we keep it for completeness.

Example queries

how does liquid staking work
what is yield farming
ethereum gas explained

Recommended: Long-form explainer · Article schema · No CTAs above the fold

Navigational

User is going somewhere known. Brand or feature names, app actions. They want to land fast and act fast.

Example queries

connect metamask
uniswap interface
aave dashboard login

Recommended: Action-first utility page · HowTo schema · No marketing copy

Commercial

User is evaluating products to use. Higher-funnel commercial. Pure product pages work here.

Example queries

usdc lending interest rates
aave vs compound
best hardware wallet 2026

Recommended: Product comparison · FinancialProduct + Article schema · Visible "Try it" CTA

Transactional

User is ready to act now. Direct conversion path. Less explanation, more action.

Example queries

swap eth to usdc
buy bitcoin with credit card
stake eth on lido

Recommended: Action-first product page · CryptoExchange schema · CTA above the fold

Investigation

Unique to YMYL/crypto. Commercial intent but with risk disclosure expected first. Pure product pages bounce.

Example queries

best DeFi yield protocols 2026
safest stablecoin to hold
highest APY crypto savings

Recommended: Comparison guide with risk-first opener · Article + FAQPage + FinancialProduct schema · Full disclosure

Disambiguation

Unique to crypto. Query references a symbol mapping to multiple entities (token, protocol, company, foundation).

Example queries

aave price (token vs protocol)
uni founder (token vs protocol vs Uniswap Labs)
sol staking (Solana token vs protocol)

Recommended: Entity-resolved page · Distinct Organization + FinancialProduct entities with @id · Canonical strategy

// What an audit finding looks like

A typical intent-content mismatch finding

This is the most common single ranking issue we see across crypto sites. Almost always fixable with copy and structure changes, no migration needed.

● HIGHFinding 06 of 19 · Keyword Intent

example-protocol.com / yield

Investigation-intent keyword targeting transactional page

The /yield page targets "best DeFi yield protocols 2026". Users searching this come to compare and evaluate, then bounce on a transactional pitch.

Keyword classifier verdict: Investigation intent (commercial with risk overlay)
Page reality: Transactional · "Stake Now" hero CTA above the fold
No risk disclosure visible until scroll
Ahrefs data: high impressions, CTR well below category median

Keyword intentInvestigation

Current page intentTransactional

Effort to fix~4 hrs · copy + structure

→ Recommended fix path

Restructure the page to lead with methodology and risk framing.
Move comparison table mid-page with FinancialProduct schema per row.
Move "Stake Now" CTA below the comparison, post-explanation.
Add a 6-question FAQ block targeting investigation queries (APY calculation, impermanent loss, audit firms).
Re-audit at 60 days to measure CTR delta.

Illustrative finding based on common patterns observed in TG3 client audits, not a real Crawlux scan output.

// How the audit runs

How the keyword intent module runs

Four phases, all in parallel with the seven other audit modules. The classifier itself is the Gemini AI Analyzer's Crypto Keyword Intent Classifier check.

01
Keyword corpus extraction
Crawlux extracts the per-page primary keyword from each page on your domain by parsing title tags, H1s, meta descriptions and structured data. Secondary keywords are pulled from H2s and the surrounding context. The corpus is deduplicated and normalized before classification.
02
Gemini classification pass
Each keyword is sent through the Gemini AI Analyzer's Crypto Keyword Intent Classifier. The model returns a primary bucket, an optional secondary overlay (e.g. Investigation + Commercial), a confidence score and the recommended content type. Low-confidence classifications are flagged for manual review.
03
Page content intent inference
In parallel, the page content itself is classified by the same model. CTA placement, headline structure, content depth, schema present and presence of risk disclosure all feed the page-intent verdict. The two verdicts are then compared per page.
04
Mismatch report
Pages where keyword intent differs from page intent are flagged. Each mismatch is rated by gap size and effort to fix. The PDF report includes the recommended content restructure per page, with example sections from competitors who match the intent correctly.

// vs the alternatives

Crawlux 6-bucket model vs Ahrefs and Semrush

Same crypto query set, three different verdicts. The 4-bucket models conflate Investigation into Commercial and Disambiguation into Navigational, which costs ranking on the misclassified pages.

Query	Ahrefs intent	Semrush intent	Crawlux 6-bucket
"best DeFi yield protocols 2026"	Commercial	Commercial	✓ Investigation (commercial overlay)
"safest stablecoin to hold"	Commercial	Commercial	✓ Investigation
"aave price"	Navigational	Navigational	✓ Disambiguation (token vs protocol)
"uni founder"	Informational	Informational	✓ Disambiguation (token vs Uniswap Labs)
"connect metamask"	Navigational	Navigational	✓ Navigational (agreement)
"how does liquid staking work"	Informational	Informational	✓ Informational (agreement)
"swap eth to usdc on uniswap"	Transactional	Transactional	✓ Transactional (agreement)
"highest APY crypto savings"	Commercial	Commercial	✓ Investigation (regulatory risk flag)
"buy bitcoin with credit card"	Transactional	Transactional	✓ Transactional (jurisdiction flag)
Crypto-aware regulatory risk flag	✕ Not measured	✕ Not measured	✓ Per query

Verdicts illustrative. Note: Crawlux agrees with Ahrefs/Semrush on roughly 70% of crypto queries; the 30% disagreement is concentrated in Investigation and Disambiguation buckets.

// Keyword intent FAQ

Keyword intent questions, answered

Common questions from teams running keyword strategy on crypto and Web3 sites.

Why does crypto need its own keyword intent classifier?

Generic intent classifiers (built into Ahrefs, Semrush, Surfer) were trained on the open web. They classify queries into 4 buckets: informational, navigational, commercial, transactional. That works for SaaS. Crypto search behavior splits two of those buckets further. "Best DeFi yield" is commercial but with an investigation overlay because users expect risk disclosure first. "AAVE price" is navigational but ambiguous between AAVE token and Aave protocol. The 4-bucket model collapses these into the wrong bucket; the 6-bucket Crawlux model keeps them distinct.

What are the six crypto intent buckets?

Commercial (evaluating products to buy/use), Transactional (ready to act now), Informational (learning), Navigational (looking for a known site), Investigation (commercial but requires risk disclosure first, unique to YMYL/crypto) and Disambiguation (ambiguous query needing entity resolution, e.g. AAVE token vs Aave protocol). Each bucket maps to a different content type and ranking strategy. See the bucket grid above.

Does this work for non-English queries?

The classifier currently runs in English with high confidence and supports queries in 9 additional languages (Spanish, Portuguese, French, German, Russian, Turkish, Korean, Japanese, Chinese) at lower confidence. Language detection runs first; if confidence is below threshold the query is flagged and skipped from the audit. Multi-language support expands per quarter as we collect labeled query data.

Can the classifier detect token disambiguation issues?

Yes. Disambiguation is one of the 6 buckets specifically because it is so common in crypto. Many queries reference symbols (BTC, ETH, AAVE, UNI) that map to multiple entities (the token itself, the protocol, the company, the foundation). The classifier flags these and recommends the page implement disambiguation schema (Organization vs FinancialProduct distinct entities with @id references) so AI models can resolve which entity the page is about.

How does this differ from Ahrefs keyword intent?

Ahrefs uses a 4-bucket model (informational, navigational, commercial, transactional) trained on the general web. The model is reliable for SaaS and e-commerce queries. For crypto-specific queries, it tends to over-classify into commercial (everything that mentions a token or protocol) which masks the investigation and disambiguation patterns Crawlux flags separately. Both tools agree on roughly 70% of queries; the 30% disagreement is where Crawlux adds value for crypto sites.

How does intent affect ranking?

Search engines and AI models reward content that matches user intent. Ranking commercial content for an informational query produces high bounce and low engagement, which feeds back into ranking. The Crawlux audit flags pages where the page content does not match the primary keyword intent. Most teams find 15-30% of their landing pages have intent mismatches that are easier to fix than building new pages.

Does the classifier flag regulatory risk?

Yes, in parallel. The Gemini AI Analyzer runs both the keyword intent classifier and the regulatory risk detector together. Queries that imply regulated financial activity (lending, derivatives, security tokens) in jurisdictions with stricter regulation (US, UK, EU under MiCA) are flagged so you can ensure the matching page has the right disclosures. This is more important for AEO than for Google ranking, since AI models surface jurisdiction-aware answers.

What does intent-content mismatch look like in practice?

Common pattern: a landing page targets "how does liquid staking work" (informational) but the page is built like a product page with a prominent "Stake Now" CTA above the fold (transactional). Users searching the informational query bounce because they came to learn, not act. The fix is to lead with the explanation, then introduce the product mid-page once the user has the context. Intent mismatches like this are the most common single ranking issue we see across crypto sites.

RUN THIS AUDIT FREE

Run this audit on your crypto site.

No signup, no credit card. Full 8-module report in 60 seconds.

Free first audit · No signup · 60 seconds · Full PDF report

Crypto keyword intent classifier. Six buckets, calibrated for Web3.

By the numbers

A representative classifier output

Crypto keyword intent in six buckets

Informational

Navigational

Commercial

Transactional

Investigation

Disambiguation

A typical intent-content mismatch finding

Investigation-intent keyword targeting transactional page

How the keyword intent module runs

Keyword corpus extraction

Gemini classification pass

Page content intent inference

Mismatch report

Crawlux 6-bucket model vs Ahrefs and Semrush

Keyword intent questions, answered

Run this audit on your crypto site.