Context: The Science Behind AI Source Selection
A study analyzing 1.2 million ChatGPT responses reveals that AI citation distribution is not democratic but highly concentrated, with a small set of domains controlling visibility. The data, spanning 21,482 citation rows across 127 unique prompts, shows that AI prioritizes sources differently than traditional search, with concentration levels varying significantly by industry.
Strategic Analysis: Decoding the Citation Hierarchy
The core finding is that AI citations are concentrated, though the degree varies widely by vertical. In Education, the top 10% of domains capture 59.5% of all citations, while in Healthcare the top 10% of domains capture only 13.0%, a far more fragmented field. This variation indicates that industry maturity and query type determine where the strategic openings for visibility lie.
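The concentration figure above can be reproduced on your own citation export. The following is a minimal sketch, assuming one row per citation with the cited domain already extracted; the function name `top_share` and the sample domains are hypothetical and not taken from the study.

```python
from collections import Counter
import math

def top_share(cited_domains, fraction=0.10):
    """Return the share of all citations captured by the top `fraction` of domains."""
    counts = Counter(cited_domains)
    if not counts:
        return 0.0
    # Rank domains by citation count, most-cited first.
    ranked = sorted(counts.values(), reverse=True)
    # Always keep at least one domain so small samples still give a result.
    top_n = max(1, math.ceil(len(ranked) * fraction))
    return sum(ranked[:top_n]) / sum(ranked)

# Hypothetical sample: 10 distinct domains, 23 citations in total.
rows = ["a.example"] * 12 + ["b.example"] * 3 + [f"site{i}.example" for i in range(8)]
share = top_share(rows)
print(f"Top 10% of domains hold {share:.1%} of citations")
```

Running this per vertical and comparing the results gives a rough read on whether a field is winner-take-most or fragmented.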
Concentration by Vertical: From Winner-Take-Most to Fragmented Fields
In Education and Crypto, citation patterns are highly concentrated due to narrow query spaces and established authority sources, such as tefl.org in Education. In contrast, Healthcare and CRM/SaaS exhibit lower concentration, with citations spread across hundreds of domains, suggesting entry points for new players. Finance displays moderate concentration but query-specific dominance, like forfiduciary.com in advisor locator pages.
Length and Structure: Industry-Specific Impacts
Word count correlates with citation frequency, but the effect is industry-specific. Finance content peaks at 5,000-10,000 words with 10.9 citations per page, then drops sharply to 4.92 citations per page at 10,000-20,000 words. Education shows steady gains with length, while CRM/SaaS shows weaker length effects, which puts the emphasis on structural optimization rather than word count. Position within the page matters too: the final 10% of a page's text earns only 2.4-4.4% of citations, roughly a quarter of the peak band's share.
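To see where your own vertical peaks, pages can be grouped into word-count bands and averaged. This is a rough sketch: the 5,000-10,000 and 10,000-20,000 bands come from the figures above, while the lower band edges, the function names, and the sample pages are assumptions for illustration.

```python
from bisect import bisect_right
from collections import defaultdict

# Band edges in words. Bands are half-open: [low, high).
# Only the 5,000-10,000 and 10,000-20,000 bands appear in the article;
# the lower edges are assumed.
BAND_EDGES = [0, 1000, 2500, 5000, 10000, 20000]

def band_label(word_count):
    """Map a word count to its band label, e.g. 6000 -> '5000-10000'."""
    i = max(bisect_right(BAND_EDGES, word_count) - 1, 0)
    low = BAND_EDGES[i]
    if i + 1 < len(BAND_EDGES):
        return f"{low}-{BAND_EDGES[i + 1]}"
    return f"{low}+"

def citations_per_page_by_band(pages):
    """pages: iterable of (word_count, citation_count) pairs, one per page."""
    totals = defaultdict(lambda: [0, 0])  # band -> [citations, pages]
    for words, cites in pages:
        bucket = totals[band_label(words)]
        bucket[0] += cites
        bucket[1] += 1
    return {band: cites / n for band, (cites, n) in totals.items()}

# Hypothetical pages: (word count, citations earned).
sample_pages = [(6000, 12), (8000, 8), (15000, 5), (400, 1)]
averages = citations_per_page_by_band(sample_pages)
```

A band whose average falls below that of its shorter neighbor, as Finance does past 10,000 words, suggests that extra length is no longer paying off.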
Evergreen Pages: Key to Citation Breadth
A small percentage of URLs, cited 10 or more times, serve as evergreen pages covering multiple query intents, such as category-level guides or comparison roundups. These comprehensive resources, like chainstack.com's Solana RPC provider page, answer clusters of prompts and earn broad citation reach, demonstrating that clustered, authoritative content compounds value over time.
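Evergreen candidates can be pulled out of a citation log by counting citations per URL and the number of distinct prompts each URL answers. The sketch below assumes rows of (prompt, URL) pairs; the threshold of 10 citations follows the text, while the function name and sample data are hypothetical.

```python
from collections import defaultdict

def evergreen_urls(citation_rows, min_citations=10):
    """Return {url: (citations, distinct_prompts)} for URLs at or above the threshold.

    citation_rows: iterable of (prompt, url) pairs, one per citation.
    """
    citations = defaultdict(int)
    prompts = defaultdict(set)
    for prompt, url in citation_rows:
        citations[url] += 1
        prompts[url].add(prompt)
    return {
        url: (count, len(prompts[url]))
        for url, count in citations.items()
        if count >= min_citations
    }

# Hypothetical log: one roundup page cited 12 times across 4 prompts,
# one narrow post cited 3 times for a single prompt.
log = [(f"prompt {i % 4}", "https://guide.example/roundup") for i in range(12)]
log += [("prompt 0", "https://other.example/post")] * 3
evergreen = evergreen_urls(log)
```

A high distinct-prompt count alongside a high citation count is the signal of interest here: it separates pages that answer a cluster of queries from pages that are cited repeatedly for one query.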
Implications and Strategic Actions
This concentration calls for a shift from keyword-focused SEO to a cluster-centric content architecture. Businesses should audit existing content for citation potential and invest in evergreen pages tailored to industry-specific patterns, for example by tuning word length for Finance or Education. They should also prioritize structural elements, such as front-loading key data in the first 30% of pages, to improve AI visibility.
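As part of a content audit, the advice on front-loading key data in the first 30% of pages can be checked mechanically. This is a rough sketch that measures position by character offset in the extracted page text, which is an assumption; the study may measure position differently. The function name and sample text are hypothetical.

```python
def appears_in_first_fraction(text, phrase, fraction=0.30):
    """Return True if `phrase` first occurs entirely within the first `fraction` of `text`.

    Position is measured in characters, and matching is case-insensitive.
    """
    cutoff = int(len(text) * fraction)
    index = text.lower().find(phrase.lower())
    return index != -1 and index + len(phrase) <= cutoff

# Hypothetical page text with the key figure near the top.
front_loaded = "Fees average 1.2% per year. " + "Filler sentence. " * 20
# The same figure buried at the end of the page.
buried = "Filler sentence. " * 20 + "Fees average 1.2%."

front_ok = appears_in_first_fraction(front_loaded, "1.2%")
buried_ok = appears_in_first_fraction(buried, "1.2%")
```

Pages where the key figures fail this check are candidates for restructuring before any rewrite for length.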
Source: Search Engine Journal
Intelligence FAQ
If your domain is not among the top 30 in your topic, AI may rarely cite you, which reduces exposure in platforms like ChatGPT and can hurt lead generation.
The ideal content length varies by vertical: aim for 5,000-10,000 words in Finance, go longer in Education and Crypto, and prioritize structure over length in CRM/SaaS.
Newer or smaller sites can still earn citations by focusing on underserved sub-topics or by creating authoritative evergreen pages that cover query clusters, rather than competing directly with dominant domains.


