Methodology · ~3,500 words · v3 · Apr 2026

How Crawlux scores a crypto site.

Open methodology is verifiable methodology. This page documents the 23 analyzers, the 6 check groups, the scoring weights, where the calibration data comes from, the versioning policy and how to challenge a score. The audit logic is opinionated where it needs to be and open where it can be.

View pricingAll 8 modules free on first audit

Table of contents

8 sections · ~10 min read

Why the methodology is open
The 6 check groups
The 23 analyzers in detail
Scoring model and weights
Calibration sources
Where it is open vs proprietary
Versioning and review cadence
How to challenge a score

Section 01

// Philosophy

Why the methodology is open

Closed audit methodologies are black boxes. Buyers cannot verify the logic. Findings feel arbitrary. Disagreements have no path to resolution. The Crawlux methodology is the opposite: every weight, every threshold, every calibration source is documented.

Closed methodology

What most audit tools do

Proprietary scores with no published rules
No calibration sources disclosed
No version history or change log
No path to challenge a score
Black box for buyer evaluation

Open methodology

What Crawlux publishes

Every analyzer documented with thresholds
Calibration sources cited per dimension
Quarterly version history tracked
Email path to challenge findings
Verifiable for buyers and auditors

The moat is execution, not the rules

Other auditors are welcome to use the Crawlux methodology with or without Crawlux. The competitive advantage is running 23 analyzers across 30+ AI prompts in 2 minutes, not the rules themselves.

Section 02

// Structure

The 6 check groups

Every analyzer belongs to one of six check groups. Each group answers a distinct question about the site. The groups exist because crypto rankings get determined by signals that span the marketing site, the on-chain layer and the AI search layer.

Technical SEO foundation

Core Web Vitals, robots.txt, XML sitemap, canonical, render-blocking resources, mobile viewport, JavaScript rendering, internal linking. Standard SEO foundation, applied to crypto context.

Schema and structured data

FinancialProduct on token pages, CryptoExchange on platforms, SoftwareApplication for wallets and SDKs, Organization plus Person for E-E-A-T. Schema correctness validated against type-specific rules.

YMYL and trust signals

Audit firm citations (Trail of Bits, OpenZeppelin, CertiK), founder identity coverage, regulatory disclosure, domain WHOIS trust, GitHub repository activity. Trust layer specific to financial sites.

AI Visibility (AEO)

4-factor citation model: schema correctness, factual density, AI bot access, authority citations. Citation rate testing across ChatGPT, Perplexity and Claude with category-tuned prompt sets.

Authority and backlinks

Web3 Backlink Toxicity Rubric. Tier 1 sources (DefiLlama, CoinGecko, Etherscan, audit firms) weighted higher than generic high-DA sources for crypto. PBN cluster detection.

On-chain signals

TVL, holder distribution, contract security, GitHub activity, audit firm coverage. On-chain signals provide trust verification that traditional SEO cannot capture.

Why 6 groups, not 8

The 8 audit dimensions in the Crypto SEO Guide describe what the audit covers. The 6 check groups describe how the analyzers are organized internally. Keyword Intent and Competitor Analysis are workflows that span multiple check groups, not standalone groups.

Section 03

// Analyzers

The 23 analyzers in detail

Each analyzer runs as a discrete check with named inputs, calibrated thresholds and explicit output. The analyzers below are listed with their check group prefix and a one-line description.

A01

Core Web Vitals

Technical · LCP, CLS, INP thresholds

Measures LCP under 2.5s, CLS under 0.1, INP under 200ms. Sources from PageSpeed Insights API. Tests mobile and desktop separately.

A02

Robots and crawl access

Technical · robots.txt validation

Validates robots.txt syntax, checks for accidental Disallow rules on key paths, confirms 7 AI bot allow rules (GPTBot, OAI-SearchBot, ClaudeBot, Claude-SearchBot, PerplexityBot, Google-Extended, Applebot-Extended).

A03

Canonical and indexability

Technical · canonical correctness

Tests canonical tag presence, self-referencing canonicals on key templates, parameter handling, multi-chain canonical patterns. Catches the most common crypto site indexation issue.

A04

JavaScript render coverage

Technical · server-render detection

Tests whether key facts (tokenomics, team info, audit citations) are server-rendered or behind JavaScript. 69% of AI crawlers cannot execute JS so this is a citation-blocking check.

A05

XML sitemap and internal linking

Technical · structure validation

Validates sitemap presence and freshness, identifies orphan pages, checks pillar-cluster topology, validates breadcrumb consistency. Crawl architecture audit.

B01

FinancialProduct schema validator

Schema · token page schema

Detects token information pages, validates FinancialProduct usage, checks for required and recommended properties. Catches the most common crypto SEO mistake (generic Product schema on tokens).

B02

CryptoExchange schema validator

Schema · exchange page schema

Detects exchange platform pages, validates CryptoExchange usage, checks supportedAssets array, tradingFee, address, hasOfferCatalog patterns.

B03

SoftwareApplication validator

Schema · wallet/SDK schema

Detects wallet, SDK and protocol client pages, validates SoftwareApplication with correct applicationCategory (FinanceApplication for wallets, DeveloperApplication for SDKs).

B04

Organization and Person schema

Schema · E-E-A-T signal validation

Validates Organization on company pages, Person on team pages, sameAs array linking to LinkedIn, GitHub and Twitter. Critical for YMYL trust signals.

C01

Audit firm citation detection

YMYL · trust signal

Detects citations to Trail of Bits, OpenZeppelin, CertiK, Quantstamp and other audit firms. Validates audit dates, firm names, PDF link integrity.

C02

Founder identity coverage

YMYL · E-E-A-T author signals

Checks for named founders with Person schema, sameAs links to verified social profiles, professional history. Anonymous-founder projects score lower on YMYL trust.

C03

Regulatory disclosure

YMYL · compliance signal

Detects regulatory status disclosure (FinCEN MSB, MiCAR, BitLicense, FCA), geographic restrictions, "not investment advice" language on relevant pages.

C04

Domain WHOIS trust

YMYL · domain age and history

Checks domain registration date, WHOIS privacy patterns, historical ownership changes. Newer domains score lower on YMYL trust by default.

D01

AEO schema readiness

AI Visibility · factor 01

Tests whether key facts are exposed in JSON-LD that AI extractors can parse reliably. Cross-references with B01-B04 schema validators.

D02

Factual density scorer

AI Visibility · factor 02

Measures named entities, numbers and dates per 100 words. Below 3 per 100 the page reads as marketing rather than reference, which AI engines weight lower.

D03

AI bot access tester

AI Visibility · factor 03

Tests crawl access for 7 AI bots using actual user-agent strings. Catches CDN-level filtering that robots.txt does not show. Edge filtering blocks bots before robots.txt.

D04

Citation rate measurement

AI Visibility · live LLM testing

Runs 30+ category-relevant prompts against ChatGPT, Perplexity and Claude. Records citation rate per LLM, per intent bucket, per competitor cited alongside.

E01

Web3 Backlink Toxicity Rubric

Authority · backlink scoring

Scores backlinks against the open Tier 1/2/3 framework. ~80 Tier 1 crypto authority sources recognized. PBN clusters and toxic neighborhoods flagged for disavow.

E02

Authority source coverage

Authority · DefiLlama, CoinGecko

Detects whether the project is listed on DefiLlama, CoinGecko, CoinMarketCap, Etherscan. These two cover ~60% of crypto AI citation prompts.

E03

PBN cluster detection

Authority · toxicity flags

Detects PBN cluster patterns by hosting analysis, registration overlaps, content thinness signatures and link velocity anomalies. Generates disavow file output.

F01

On-chain trust signals

On-chain · TVL, holders

Pulls TVL from DefiLlama, holder distribution from Etherscan, market cap from CoinGecko. On-chain trust signals weight heavily for crypto AI citation.

F02

GitHub activity scorer

On-chain · repository freshness

Measures public repository activity: commit frequency, named contributor count, recent release tags. AI engines weight active GitHub activity as a "is this project alive" signal.

F03

Smart contract security signals

On-chain · contract trust

Cross-references audit firm reports, checks contract verification status on block explorers, validates ownership renouncement claims, detects honeypot patterns.

Section 04

// Weighting

Scoring model and weights

Each check group contributes a fixed percentage to the overall site score. Within each group, individual analyzers contribute roughly equal weight. The total adds to 100.

Check group	Analyzers	Weight	Why this weight
Technical SEO foundation	5	15%	Foundation. Necessary but not crypto-differentiating.
Schema and structured data	4	20%	High leverage for AEO. Wrong type kills citation eligibility.
YMYL and trust signals	4	15%	YMYL is mandatory for crypto rankings. Anonymous projects score lower.
AI Visibility (AEO)	4	20%	Crypto AEO matters more than for any other vertical.
Authority and backlinks	3	15%	Web3 Backlink Toxicity Rubric weights crypto authority sources higher.
On-chain signals	3	15%	On-chain trust signals are unique to crypto, weight matches AEO.
Overall score	23	100%	Weights reviewed quarterly against 53+ live audits.

Within each group, analyzer-level weights are roughly equal. Schema group weights skew slightly toward FinancialProduct (B01) because it is the highest-impact single fix on most crypto sites. AI Visibility group weights skew toward citation rate measurement (D04) because that is the only direct outcome measurement.

Site type adjustments

The keyword intent classifier (separate from the 23 analyzers) detects site type and applies adjustments. Token information sites get more weight on schema and on-chain signals. Exchange platforms get more weight on YMYL and authority. Wallet sites get more weight on schema and reviews.

Section 05

// Data sources

Calibration sources

Calibration is the part most audit tools hide. The Crawlux methodology calibrates against three source types: public crypto data APIs, public audit firm reports and internal audit data across live sites.

Source type 01

Public crypto data APIs

On-chain authority signals, market data, contract verification, holder distribution. Calibrates the on-chain check group and authority source coverage analyzer.

DefiLlama API (TVL data)
CoinGecko API (market cap, volume)
Etherscan, Polygonscan, Solscan
CoinMarketCap (asset listings)

Source type 02

Audit firm public reports

YMYL trust signal calibration. Public audit reports define what "audited and secure" actually means in citable terms for the YMYL group.

Trail of Bits public reports
OpenZeppelin Audits
CertiK Skynet
Quantstamp public reports

Source type 03

Internal audit calibration

Dimensional weight calibration across 53+ live site audits. Identifies which analyzers correlate with actual ranking and citation outcomes.

53+ audited live sites
Quarterly weight recalibration
30+ prompts × 3 LLMs per audit
Anonymized aggregate metrics

Source type 04

External SEO calibration

Cross-validation against established SEO data sources for technical SEO and backlink dimensions. Used to verify Crawlux scores align with external benchmarks.

Google PageSpeed Insights API
DataForSEO (backlinks)
Schema.org validator
Google Rich Results Test

What calibration does NOT use

Generic Domain Authority and Domain Rating are intentionally excluded from crypto authority calibration. Generic DA models score DefiLlama and Etherscan as moderate when they are Tier 1 for crypto. Using them would propagate that error into Crawlux scores.

Section 06

// Boundaries

Where it is open vs proprietary

The methodology is open. The execution is proprietary. The line between the two is worth being explicit about.

Open and documented

Anyone can use these

The 6 check group structure
The 23 analyzer definitions
The scoring weights table
Web3 Backlink Toxicity Rubric tiers
4-factor AEO citation model
Per-analyzer thresholds and rules
Calibration source list

Proprietary execution

What Crawlux runs

Internal calibration data across 53+ audits
Specific weight tunings per category
Full prompt sets (30+ per category)
Auto-detection logic for site categorization
Performance optimizations and caching
Programmatic LLM citation parsing
White-label PDF generation pipeline

Other auditors are welcome to use the open methodology, build their own implementations and even compete with Crawlux directly. The published rules are not the moat.

The moat is running 23 analyzers across 30+ AI prompts in 2 minutes, generating a 23-page report and integrating into a quarterly audit workflow at agency scale.

Section 07

// Versioning

Versioning and review cadence

The methodology gets reviewed on a fixed cadence. Minor revisions adjust weights and thresholds. Major revisions reconsider the analyzer set.

Quarterly minor revisions

90-day cadence

Weight adjustments based on new audit data
Threshold updates per analyzer
New AI bot user-agents added to D03
Calibration source refreshes
Per-category weight tuning

Annual major reviews

Once per year

Reconsider analyzer set (add or deprecate)
Restructure check groups if needed
Revise scoring model fundamentals
Update Web3 Backlink Toxicity Rubric tiers
Cross-validate against industry benchmarks

Version history is public

Every version change gets logged with the date, the change description and the reasoning. The current methodology is v3, published April 2026. Previous versions are archived for audit reproducibility.

Sites audited in earlier versions can be re-scored in the current version on request. Score changes between versions get flagged with the version delta so users understand what changed and why.

Section 08

// Disputes

How to challenge a score

Every finding in the audit report includes the analyzer that fired, the rule it failed and the calibration source. If you believe the rule is wrong or the calibration is outdated, the methodology team reviews challenges quarterly.

Identify the analyzer

The audit report lists each finding with its analyzer code (A01 through F03). Reference this code in any methodology challenge so the team knows which rule is being disputed.

Cite the calibration source

Each analyzer's calibration source is documented in section 5 of this page. If you believe the source is outdated or inappropriate for your context, that is the technical basis for a challenge.

Submit via contact

Contact the methodology team via the link on the about page. Include the analyzer code, the calibration concern and (ideally) supporting data or sources. Justified challenges feed into quarterly review.

The most common challenges that have changed weights in past reviews: backlink toxicity scoring for newer DeFi protocols where standard PBN signatures need adjustment, YMYL signals for non-US regulatory frameworks (MiCAR, FCA, MAS, ASIC), GitHub activity scoring for projects using monorepos versus distributed repositories.

What does not get changed

Subjective disagreements ("I do not like getting a low score") are not methodology challenges. Justified challenges include: outdated calibration data, miscategorized site type, inappropriate weighting for specific subverticals.

// Methodology FAQ

Common questions

Six questions covering the open methodology philosophy, calibration and review process.

Why publish the methodology openly?

Open methodology is verifiable methodology. Buyers evaluating audit tools should be able to see how scores get calculated, where the calibration data comes from and where the rubric is opinionated versus open. Closed methodology is a black box that makes scores feel arbitrary. The Crawlux methodology is documented openly so other auditors can use it with or without Crawlux. The moat is execution at scale, not the rules.

Where does the calibration data come from?

Three sources. First, public crypto data APIs (DefiLlama, CoinGecko, CoinMarketCap, Etherscan) provide on-chain authority signals. Second, audit firm public reports (Trail of Bits, OpenZeppelin, CertiK) calibrate the YMYL trust dimension. Third, internal audit runs across 53+ live sites provide the dimensional weight calibration. The weights get reviewed quarterly against new data.

How often does the methodology get updated?

Quarterly minor revisions, annual major reviews. Minor revisions adjust weights based on new audit data and changes in AI search behavior. Major reviews reconsider the analyzer set: adding new checks, deprecating outdated checks, restructuring check groups. The current version is v3, published April 2026. Version history is tracked publicly.

Can I challenge a score I disagree with?

Yes. Each finding in the audit report includes the rule it failed and the calibration source. If you believe the rule is wrong or the calibration is outdated, contact the methodology team via the contact link on the about page. Justified challenges feed into the quarterly review cycle. The most common challenges that have changed weights: backlink toxicity for newer DeFi protocols and YMYL signals for non-US regulatory frameworks.

Is the methodology better than Ahrefs or Semrush?

It is different, not strictly better. Ahrefs and Semrush methodologies are excellent for general SEO across news, ecommerce and SaaS verticals. The Crawlux methodology is tuned specifically for crypto and Web3, where authority signals (DefiLlama, audit firms) and schema requirements (FinancialProduct, CryptoExchange) differ from generic SEO. Use the right methodology for the vertical. For non-crypto sites, generic SEO methodologies score better. For crypto sites, the crypto-tuned methodology catches more.

What are the 6 check groups?

Technical SEO foundation (Core Web Vitals, robots.txt, canonical, render-blocking). Schema and structured data (FinancialProduct, CryptoExchange, Person). YMYL and trust signals (audit firm citations, founder identity, regulatory disclosure). AI Visibility (4-factor citation model). Authority and backlinks (Web3 toxicity rubric, PBN detection). On-chain signals (TVL, holders, contract security, GitHub activity). Each group contains 3 to 5 analyzers totaling 23 across the audit.

RUN YOUR FIRST AUDIT FREE

See Crawlux on your own crypto site.

No signup, no credit card. Full Web3-tuned audit report in 60 seconds.

Free first audit · No signup · 60 seconds · Full PDF report