The Agent Readiness Score Is Here – And It's Already Misleading
On April 17, 2026, Cloudflare launched isitagentready.com, a public scanner that grades any website on its preparedness for AI agents. Paste a URL, get a score out of 100, see which of 16 checks pass or fail. For the first time, agent-readiness moved from gut feel to a measurable number. But that number, by itself, is structurally misleading for the majority of websites that run the default scan.
I ran the scanner on my own content site, nohacks.co. The default All Checks preset returned a score of 33 out of 100, Level 2 “Bot-Aware.” Yet when I switched to the Content Site preset—hidden inside a Customize dropdown—the same website scored 67 out of 100. A 34-point gap from the same scanner on the same day. The only difference: which checks were included.
This is not a bug. It is a design choice that will shape how thousands of web professionals, consultancies, and executives interpret their agent-readiness posture. And if you share the default score without understanding the preset, you are broadcasting a number that understates your actual readiness—especially if you run a content-heavy site.
What the Scanner Actually Checks
The scanner groups 16 checks into five categories: Discoverability (robots.txt, sitemap, Link headers), Content (Markdown content negotiation), Bot Access Control (AI bot rules, Content Signals, Web Bot Auth), API, Auth, MCP & Skill Discovery (six checks including API Catalog, OAuth discovery, MCP Server Card, Agent Skills index, WebMCP), and Commerce (three optional checks for payment protocols).
The Commerce category is correctly excluded from scoring on non-commerce sites. But the API/Auth/MCP/Skill Discovery category—six checks—is included by default on every scan, even for a blog that has no APIs, no OAuth endpoints, and no MCP server. The Content Site preset removes those six checks, leaving only the six that matter for content publishers. That is why the score jumps from 33 to 67.
The problem: the Content Site preset is hidden behind a Customize dropdown that most users will never open. The default scan is All Checks. The shareable number—the one that gets tweeted, compared, and optimized—is the All Checks composite. For content websites, that number is structurally too low.
Goodhart's Law Is Already in Motion
Goodhart's Law states: when a measure becomes a target, it ceases to be a good measure. The Agent Readiness Score is now a public, shareable, compared number. Three predictable failures will follow.
First, score-chasing. Website owners will add empty MCP Server Cards, publish Agent Skills indexes with no actual skills, and ship WebMCP tools that do nothing—just to pass the scanner's checks. The score goes up; real agent behavior does not change.
Second, a consultancy gold rush. Just as PageRank spawned link farms and Core Web Vitals spawned performance theater, the Agent Readiness Score will spawn “Agent Readiness optimization” services that sell the number rather than the underlying architecture. The history of SEO tells us this is inevitable.
Third, premature standardization. The scanner scores emerging proposals—like Content Signals, Web Bot Auth, and llms.txt—alongside ratified RFCs. Including them as scored signals accelerates adoption past the point where the specs are ready for real traffic. A website owner fixing a failing check becomes the marginal adopter who tips a draft into a de facto standard before the governance work is done.
Winners and Losers
Winners: Cloudflare gains a new moat. The scanner integrates with Radar, the URL Scanner API, and an MCP endpoint that agents themselves can call. This positions Cloudflare as the arbiter of agent-readiness, driving adoption of its own proposals (Content Signals, Web Bot Auth) and locking developers into its ecosystem. AI developers using Cloudflare get a benchmarking tool that improves trust in their agents.
Losers: Independent AI agent testing services lose mindshare as Cloudflare's integrated scanner becomes the default. Competing cloud providers (AWS, Google Cloud) without a similar scoring system risk being perceived as less AI-friendly. Content websites that share the default All Checks score will appear less ready than they actually are, potentially harming partnerships or ad revenue.
Second-Order Effects
Expect other vendors to ship their own scanners with different checklists. A Google scanner would emphasize UCP and structured data; a Perplexity scanner might prioritize different bot rules. Website owners will see different scores from different scanners, fragmenting the metric. The real standard will emerge from the intersection of checks that appear in every scanner. Those are the de facto standards worth optimizing for.
Also watch for regulatory attention. If the Agent Readiness Score becomes a de facto gatekeeper for AI agent access, it could raise antitrust concerns—especially if Cloudflare uses it to steer traffic toward its own services.
Market and Industry Impact
The scanner establishes a new layer of infrastructure services focused on AI agent quality assurance. It will become part of the AI development lifecycle, similar to how PageSpeed Insights became part of web development. Expect integration with CI/CD pipelines, monitoring dashboards, and SEO toolkits. The market for agent-readiness consulting will emerge within 12 months.
Executive Action
- Run the scanner on your website using the Content Site or API/Application preset, not the default. Track individual check outcomes, not the composite score.
- Fix the checks that map to real agent runtime behavior today: robots.txt, sitemap, Markdown content negotiation, and structured data. Ignore proposal-stage formats until they gain ecosystem traction.
- Do not hire a consultancy to optimize your Agent Readiness Score. The work is straightforward: a half-day audit and a focused remediation sprint will outperform any packaged service.
Why This Matters
The Agent Readiness Score is the first public metric for a new layer of the web. It will shape investment, partnership decisions, and competitive positioning for years. If you optimize for the wrong number—or share a misleading default score—you will make strategic decisions based on flawed data. The cost of that mistake compounds as agent traffic grows.
Final Take
Cloudflare has shipped a genuinely useful tool. But the composite score is a marketing layer, not a strategic signal. The real intelligence lives in the per-check detail. Read the report. Fix what matters. Ignore the number. And always, always check the preset.
Rate the Intelligence Signal
Intelligence FAQ
The default 'All Checks' preset includes six API/Auth/MCP checks that are irrelevant for content sites. Switch to the 'Content Site' preset inside the Customize dropdown to get an accurate score.
No. Optimize for the checks that map to real agent runtime behavior today: robots.txt, sitemap, Markdown content negotiation, and structured data. Ignore proposal-stage formats until they gain ecosystem traction.



