What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 3 questions

Less than 30 seconds. Find out how much you really know about Google search.

🕒 ~30s 🎯 3 questions 📚 SEO Google

Official statement

Search Console aggregate reports (mobile-friendly, structured data, Core Web Vitals) only show a sample of the indexed pages, not the entirety. In extreme cases, this sample may be limited to a single page. Therefore, the number of pages in these reports does not reflect the actual number of indexed pages. To know the exact number of indexed pages, you must refer to the Index Coverage report.
4:51
🎥 Source video

Extracted from a Google Search Central video

⏱ 48:25 💬 EN 📅 26/06/2020 ✂ 16 statements
Watch on YouTube (4:51) →
Other statements from this video 15
  1. 0:38 Désactiver temporairement son panier e-commerce pénalise-t-il vraiment le référencement ?
  2. 3:15 Faut-il bloquer complètement un site e-commerce en période de fermeture temporaire ?
  3. 4:51 La taille d'échantillon Search Console varie-t-elle selon la qualité perçue de votre site ?
  4. 4:51 Pourquoi les agrégateurs de liens ont-ils tant de mal à ranker ?
  5. 9:29 Googlebot ignore-t-il vraiment les banners de consentement cookies lors de l'indexation ?
  6. 12:12 Faut-il encore utiliser le Disavow Tool pour gérer les liens spam ?
  7. 20:56 Comment Google actualise-t-il vraiment le cache AMP de vos pages ?
  8. 20:56 Pourquoi Google affiche-t-il parfois les versions HTML et AMP d'une même page simultanément dans les SERP ?
  9. 23:41 Comment organiser les sitemaps quand on gère des milliers de sous-domaines ?
  10. 23:41 Pourquoi vos milliers de sous-domaines ralentissent-ils le crawl de Google ?
  11. 23:41 Comment gérer efficacement des milliers de sous-domaines dans Search Console ?
  12. 27:54 Search Console compte-t-elle vraiment tous les clics que vous croyez ?
  13. 30:58 Le contenu masqué en CSS est-il vraiment indexé en mobile-first ?
  14. 34:12 Pourquoi votre site SEO oscille-t-il entre bon et pénalisé sans raison apparente ?
  15. 37:52 Quelle structure d'URL choisir pour maximiser votre ranking international ?
📅
Official statement from (5 years ago)
TL;DR

Search Console aggregate reports (mobile-friendly, structured data, Core Web Vitals) only show a sample of the indexed pages, sometimes reduced to a single page in extreme cases. The direct consequence is that you cannot rely on these reports to measure the actual coverage of your site. Only the Index Coverage report provides a comprehensive view of the number of pages actually indexed by Google.

What you need to understand

What does Google really mean by 'sample' in these reports?

When Mueller talks about a sample, he refers to a partial selection of the indexed pages that Google analyzes to create the aggregate reports. The algorithm selects a representative subset rather than crawling your entire site for these specific views.

This sampling approach specifically applies to Mobile-Friendly, Structured Data, and Core Web Vitals reports. Google does not crawl every URL in your index to populate these dashboards - it takes what it considers sufficiently indicative of the overall state of your site.

Why might a sample be limited to a single page?

The extreme case mentioned by Mueller — a sample reduced to a single page — typically occurs on small sites or those with a very homogeneous structure. If Google detects that all your pages share the same template, the same mobile configuration, and the same technical patterns, it may deem it unnecessary to analyze more.

This crawl resource optimization logic makes sense from Google's perspective, but it poses a reliability issue for SEO professionals who want complete data. A site with 50,000 pages may have its Core Web Vitals report fed by only 200 analyzed URLs.

What is the difference with the Index Coverage report?

The Index Coverage report operates differently: it relies on actual indexing logs from Google, not on a post-crawl sampling. Every time Googlebot attempts to index a page, that attempt is recorded and reflected in this report.

It is thus the only reliable source to know the exact number of indexed pages. Other aggregate reports only serve to identify qualitative issues on a subset of your site — not to measure comprehensive coverage.

  • Mobile-Friendly, Structured Data, and Core Web Vitals reports use a variable sample, sometimes very small (down to a single page in extreme cases)
  • Only the Index Coverage report reflects the actual number of pages indexed by Google
  • Never rely on the volume of URLs shown in aggregate reports to estimate the overall health of your indexing
  • Sampling is a resource optimization on Google's side, not a bug or a temporary technical limitation
  • A technically homogeneous site will see a more restricted sample than a heterogeneous site with varied templates

SEO Expert opinion

Is this statement consistent with field observations?

Absolutely. For years, there have been massive discrepancies between the number of URLs reported in aggregate reports and the actual indexing volume measured via Index Coverage or site: queries. Mueller finally confirms what many suspected: these reports are not designed to be exhaustive.

The problem is that this crucial information is never clearly indicated in the Search Console interface. Nothing signals to the user that the displayed figures are partial and unrepresentative of the total volume. This opacity has led thousands of SEOs to make strategic decisions based on truncated data.

What nuances should be added to this statement?

Mueller remains deliberately vague about the sampling criteria. We do not know how Google selects the analyzed pages: is it random? Based on internal PageRank? On crawl frequency? On the diversity of detected templates? [To be verified] — Google has never publicly documented this selection algorithm.

Another point: Mueller talks about 'extreme cases' for the sample reduced to one page but provides no threshold or statistic. From what site size does sampling become significant? What average proportion of the site is analyzed? This data is missing to contextualize the extent of the problem.

Does this limitation impact all types of sites the same way?

No. Sites with a heterogeneous architecture — multiple templates, varied technical sections, mix desktop/mobile/PWA — likely benefit from a broader sample. Google must analyze more pages to capture diversity.

Conversely, an e-commerce site with 100,000 structured product listings using the same template might see an absurdly small sample. The danger? A markup error on a specific category may never be detected if that section is not included in the analyzed sample.

Warning: If you fix an issue detected in an aggregate report, do not rely on the update delay of this report to validate your fix. The re-crawl of the sample may take weeks or may never include the corrected URLs. Validate your corrections through manual tests (Mobile-Friendly Test, Rich Results Test) or via third-party tools that crawl your entire site.

Practical impact and recommendations

How can I effectively measure the indexing coverage of my site?

Forget the aggregate reports for this task. Focus on Index Coverage as the source of truth for indexing volume. Regularly export this report and compare it to your XML sitemap to identify excluded or undiscovered URLs.

Supplement with targeted site: queries to check the indexing of critical sections. For example: site:example.com/category-a/ to measure the coverage of a specific category. Cross-reference this data with your server logs to detect crawled but non-indexed pages.

What should I do if an aggregate report signals a problem in a sample?

Take the signal seriously, but audit the entire site — not just the URLs listed in the report. If Google detects a structured data issue on 5 pages in the sample, chances are the problem affects hundreds or thousands of other un-analyzed pages.

Use comprehensive crawlers (Screaming Frog, OnCrawl, Botify) to scan 100% of your URLs and identify all occurrences of the issue. Never settle for fixing just the few pages listed by Search Console — you would only be addressing a visible fraction of the actual problem.

Should I continue monitoring these aggregate reports?

Yes, but interpret them as qualitative indicators rather than quantitative. A sudden spike in Core Web Vitals errors signals a real issue, even if only 50 URLs are shown while your site has 10,000. What matters is the trend and nature of the error, not the absolute volume.

Set up automatic alerts for abnormal variations in these reports. A sharp drop in the number of valid structured data URLs may indicate a faulty deployment — even if that number represents only a sample.

  • Use exclusively Index Coverage to measure actual indexing volume
  • Audit the entire site when an aggregate report signals a problem, do not only fix the listed URLs
  • Validate your technical corrections via manual tests (Mobile-Friendly Test, Rich Results Test) rather than wait for the Search Console report updates
  • Regularly export Index Coverage and cross-reference with your XML sitemaps to detect coverage discrepancies
  • Set alerts for sharp variations in aggregate reports to quickly detect regressions
  • Complement Search Console with third-party crawlers capable of analyzing 100% of your site
Mueller's revelation radically changes how you should interpret Search Console. Aggregate reports are merely partial probes — useful for detecting typical issues, but unusable for measuring comprehensive coverage. Only Index Coverage provides a complete view of your indexing. This distinction is critical for effectively steering your SEO strategy. Implementing robust monitoring and cross-referencing multiple data sources (Search Console, crawlers, server logs) requires sharp expertise and rigorous processes. If this complexity exceeds your internal resources, engaging a specialized SEO agency can prevent you from blindly navigating with truncated data.

❓ Frequently Asked Questions

Le rapport Core Web Vitals affiche 200 URLs — est-ce que cela signifie que seules 200 pages de mon site sont indexées ?
Non, absolument pas. Ce chiffre représente uniquement l'échantillon analysé par Google pour ce rapport spécifique. Votre site peut contenir 50 000 pages indexées dont seulement 200 ont été sélectionnées pour alimenter le rapport Core Web Vitals. Consultez Index Coverage pour connaître le nombre réel de pages indexées.
Si Google ne crawle qu'un échantillon, comment puis-je être sûr que toutes mes pages sont conformes mobile-friendly ?
Vous ne pouvez pas vous fier uniquement au rapport Mobile-Friendly de Search Console. Utilisez un crawler qui analyse l'intégralité de votre site (Screaming Frog, OnCrawl) ou testez manuellement les templates critiques via le Mobile-Friendly Test de Google pour valider la conformité de toutes vos pages.
J'ai corrigé des erreurs de données structurées signalées dans Search Console, mais le rapport ne se met pas à jour — est-ce normal ?
Oui, c'est cohérent avec le fonctionnement par échantillon. Les pages corrigées ne font peut-être pas partie de l'échantillon analysé, ou Google n'a pas encore re-crawlé cet échantillon. Validez vos corrections via le Rich Results Test plutôt que d'attendre la mise à jour du rapport.
Peut-on forcer Google à analyser davantage de pages dans ces rapports agrégés ?
Non, la taille et la composition de l'échantillon sont entièrement contrôlées par Google. Vous ne pouvez pas influencer directement ce processus. La seule approche consiste à améliorer la diversité technique de votre site, ce qui pourrait théoriquement inciter Google à élargir l'échantillon — mais c'est purement spéculatif.
Le rapport Index Coverage est-il lui aussi basé sur un échantillon ou reflète-t-il 100% des tentatives d'indexation ?
Index Coverage reflète la totalité des tentatives d'indexation enregistrées par Google — c'est la seule source exhaustive dans Search Console. Ce rapport s'appuie sur les logs d'indexation réels, pas sur un échantillonnage post-crawl comme les rapports agrégés.
🏷 Related Topics
Domain Age & History Crawl & Indexing Mobile SEO Web Performance Search Console

🎥 From the same video 15

Other SEO insights extracted from this same Google Search Central video · duration 48 min · published on 26/06/2020

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.