What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 3 questions

Less than 30 seconds. Find out how much you really know about Google search.

🕒 ~30s 🎯 3 questions 📚 SEO Google

Official statement

Google has no specific ratio regarding how many pages a site should have or how many should be indexed. Generally, fewer pages with more concentrated value perform better than many diluted pages.
🎥 Source video

Extracted from a Google Search Central video

💬 EN 📅 18/02/2022 ✂ 24 statements
Watch on YouTube →
Other statements from this video 23
  1. Google compte-t-il vraiment tous les liens visibles dans Search Console ?
  2. Faut-il vraiment concentrer son contenu sur moins de pages pour ranker ?
  3. Les critères d'avis produits Google s'appliquent-ils même si votre site n'est pas classé comme site d'avis ?
  4. L'API Indexing de Google fonctionne-t-elle vraiment pour tous les contenus ?
  5. L'E-A-T influence-t-il vraiment le classement Google ou n'est-ce qu'un mythe ?
  6. Les mentions de marque sans lien ont-elles un impact sur votre référencement ?
  7. Les commentaires d'utilisateurs améliorent-ils vraiment le classement dans Google ?
  8. Les certificats SSL premium influencent-ils vraiment le référencement Google ?
  9. PDF et HTML avec le même contenu : faut-il craindre une cannibalisation dans les SERPs ?
  10. Peut-on vraiment piloter l'indexation des PDF via les headers HTTP ?
  11. Faut-il encore utiliser rel=next et rel=prev pour la pagination ?
  12. Googlebot peut-il vraiment indexer vos contenus en défilement infini ?
  13. Faut-il s'inquiéter de la page référente affichée dans Google Search Console ?
  14. Faut-il vraiment rediriger l'ancien sitemap en 301 ou soumettre le nouveau directement ?
  15. Pourquoi 97% de crawl refresh est-il un signal positif pour votre site ?
  16. Comment Google détermine-t-il réellement la vitesse de crawl de votre site ?
  17. Vitesse de crawl et Core Web Vitals : pourquoi Google fait-il la distinction ?
  18. Pourquoi Google ralentit-il son crawl après un changement d'hébergement ?
  19. Le paramètre de taux de crawl est-il vraiment un plafond et non un objectif ?
  20. Le CTR peut-il vraiment pénaliser le reste de votre site ?
  21. Le maillage interne est-il vraiment l'élément le plus déterminant pour le SEO ?
  22. Le linking interne agit-il vraiment instantanément après recrawl ?
  23. Faut-il s'inquiéter si Google ne crawle pas toutes vos pages ?
📅
Official statement from (4 years ago)
TL;DR

Google doesn't impose any ideal ratio between the total number of pages on a site and those that should be indexed. The preferred approach: concentrate value on fewer pages rather than dilute authority across excessive volume. Quality consistently trumps quantity when it comes to indexation.

What you need to understand

Why does Google refuse to give a precise ratio?

Google doesn't set a numerical rule because each site has unique architecture and different objectives. An e-commerce store with 50,000 products is nothing like a blog with 200 articles. Imposing a universal percentage would be absurd.

This position also reflects a technical reality: the algorithm evaluates the individual relevance of each page, not a global indexation score. The search engine doesn't need you to reach a threshold — it simply wants to find content that deserves to be ranked.

What does "fewer pages with more value" actually mean in practice?

The statement directly targets sites that artificially inflate their volume: pointless tag pages, nearly identical variations, auto-generated content with no real utility. This phenomenon dilutes crawl budget and scatters quality signals.

Concentrating value means merging redundant content, removing orphaned pages, deindexing what adds nothing to the user experience. Every indexed URL must justify its existence through a specific search intent.

Does this apply to all types of websites?

Not exactly. A news site can legitimately publish hundreds of pages per week. A local directory needs to index each business listing. The principle remains valid, but application varies depending on your editorial model.

The crucial nuance: even with high volume, each page must have an identifiable added value. It's not the absolute number that's problematic, it's the proportion of weak or duplicate content.

  • Google sets no ratio of total pages / indexed pages
  • Concentrating value on fewer pages outperforms spreading it across many
  • Evaluation happens page by page, not at the site-wide level
  • Business context dictates the legitimate volume of indexation
  • Every indexed URL must respond to a specific search intent

SEO Expert opinion

Is this recommendation consistent with real-world observations?

Absolutely. Technical audits consistently show that sites that prune their index see their strategic pages perform better. Crawl budget concentrates on relevant URLs, quality signals no longer get dispersed.

However — and this is where Google remains vague — the definition of "concentrated value" remains subjective. [To be verified] because Google provides no measurable criteria for qualifying what deserves indexation. We're still navigating blind on this notion of "quality."

What gray areas remain in this statement?

Mueller carefully avoids discussing technical thresholds. From how many weak pages is the entire site penalized? What percentage of thin content triggers overall devaluation? Radio silence.

Another evasive point: treatment of very high-volume sites. A pure-play e-commerce player with 500,000 product references can't "concentrate value" like a brochure site. Google suggests a universal approach while constraints differ radically by sector.

Warning: This statement shouldn't serve as an excuse to underdevelop a site. Some sectors require exhaustive coverage. Artificially reducing page count out of dogmatism can destroy your long-tail visibility.

In what cases does this rule not apply?

News sites and large editorial platforms can't play the minimalism card. Their model relies on freshness and volume. As long as each article covers a specific angle, mass indexation remains legitimate.

Same goes for comparison sites, directories, or marketplaces: their value comes precisely from comprehensiveness. Massive deindexation would kill their value proposition. Here, the challenge becomes fine-grained quality management rather than crude volume reduction.

Practical impact and recommendations

How do you identify pages that dilute your site's value?

Start by cross-referencing Search Console with your analytics tool. Extract indexed URLs generating zero clicks over 6 months, zero impressions, zero organic traffic. These are your first deindexation candidates.

Next examine technical pages: internal search results, parameterized filters, poorly managed pagination, tags without unique content. These URLs burn crawl budget without return. A well-calibrated robots.txt or noindex tags solve the problem.

What concrete actions should you take following this statement?

Launch a complete indexation audit: how many pages does Google have in its index vs how many do you actually want indexed? The gap often reveals massive leaks (archives, empty categories, duplicate content).

For each type of weak content, make a decision: improve, merge, redirect, or deindex. Orphaned pages disappear. Similar content consolidates via 301 redirects. Pointless variations get noindexed.

Strengthen retained pages: targeted internal linking, enriched content, on-page optimizations. If you cut your index in half, the remaining pages must become twice as powerful.

  • Extract all indexed URLs via Search Console and exhaustive crawl
  • Identify pages with zero traffic / zero impressions over 6+ months
  • Spot duplicate or near-identical content to merge
  • Block indexation of pages with no SEO value (filters, internal search)
  • Consolidate weak content via 301 redirects to pillar pages
  • Strengthen internal linking toward retained strategic pages
  • Monitor indexation rate evolution and performance post-cleanup
This statement confirms what practitioners have observed for years: selective indexation outperforms maximum indexation. Google rewards sites that make its job easier by offering only high-value-added content. Implementing this strategy requires pointed technical expertise and comprehensive architecture vision. If the scope seems complex or you lack internal resources, partnering with a specialized SEO agency can save you precious time and secure performance gains.

❓ Frequently Asked Questions

Quel est le bon ratio de pages indexées pour un site e-commerce ?
Il n'existe pas de ratio universel. L'essentiel est que chaque fiche produit, catégorie ou page de contenu apporte une valeur unique. Un site avec 10 000 produits peut légitimement indexer autant de pages si chacune cible une intention de recherche spécifique.
Faut-il désindexer les anciennes pages de blog peu performantes ?
Pas systématiquement. Si l'article traite d'un sujet toujours recherché mais performe mal, enrichissez-le plutôt que de le supprimer. En revanche, les contenus obsolètes sans potentiel de mise à jour peuvent être redirigés vers des pages actualisées ou désindexés.
Comment Google mesure-t-il la « valeur concentrée » d'une page ?
Google ne détaille pas ses critères précis, mais combine probablement profondeur du contenu, engagement utilisateur, autorité du domaine, qualité du maillage interne et pertinence thématique. L'absence de définition chiffrée oblige à se fier aux signaux indirects : trafic, temps de session, taux de rebond.
Les pages en noindex consomment-elles toujours du crawl budget ?
Oui, tant qu'elles restent accessibles et crawlables. Le noindex empêche l'indexation mais pas le crawl. Pour économiser vraiment du budget, il faut bloquer ces URLs via robots.txt ou supprimer les liens internes qui y mènent.
Un site peut-il être pénalisé pour avoir trop de pages indexées ?
Pas directement. Google ne punit pas le volume en soi, mais un index gonflé de contenu faible dilue les signaux de qualité et ralentit le crawl des pages stratégiques. L'impact est indirect mais bien réel sur les performances globales.
🏷 Related Topics
Domain Age & History Crawl & Indexing AI & SEO

🎥 From the same video 23

Other SEO insights extracted from this same Google Search Central video · published on 18/02/2022

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.