What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

Directory sites are not automatically categorized as doorway pages. Their purpose must provide real value to users; otherwise, they risk being seen as obsolete.
14:44
🎥 Source video

Extracted from a Google Search Central video

⏱ 56:09 💬 EN 📅 08/12/2015 ✂ 13 statements
Watch on YouTube (14:44) →
Other statements from this video 12
  1. 4:10 Les erreurs hreflang pénalisent-elles vraiment votre référencement ?
  2. 9:13 Faut-il vraiment pointer les canonicals vers chaque version linguistique ?
  3. 11:00 Les citations et liens vers des sources reconnues améliorent-ils vraiment le classement Google ?
  4. 11:38 Faut-il vraiment pointer x-default vers une page générique plutôt que vers une langue principale ?
  5. 11:47 Le balisage rel=author est-il encore utile pour le SEO ?
  6. 12:26 Pourquoi un site pénalisé manuellement ne retrouve-t-il pas son classement après levée de la sanction ?
  7. 24:12 Les liens internes doivent-ils vraiment être « naturels » pour Google ?
  8. 27:24 Le balisage schema incorrect nuit-il vraiment au classement Google ?
  9. 30:08 Les 200 facteurs de classement Google : faut-il encore investir dans les backlinks ?
  10. 35:56 Faut-il vraiment rediriger toutes les pages obsolètes après une refonte ?
  11. 37:02 Pourquoi hreflang ne fonctionne-t-il que sur les pages canoniques ?
  12. 48:10 Google peut-il supprimer des fonctionnalités de recherche sans prévenir les SEO ?
📅
Official statement from (10 years ago)
TL;DR

Google differentiates legitimate directory pages from doorway pages based on one criterion: real user value. A directory that only aims to manipulate crawling or intercept traffic will be treated as obsolete. The distinction lies in the purpose of the content: if your category pages exist solely for SEO without providing unique information, you may be in trouble.

What you need to understand

What distinction does Google make between directories and doorway pages?

A directory (or category page) organizes content in a hierarchical manner. Google accepts it when it helps users navigate or discover relevant resources. A doorway page, however, exists solely to capture organic traffic and direct it to a final destination without any inherent value.

The boundary is often blurry. Many e-commerce sites or directories create hundreds of location-based or thematic filtering pages that technically fall under directories but behave like doorways if they present no unique or useful content.

What does Google mean by 'real value' for users?

Google never precisely defines this term, and that's the problem. We can infer that real value means: editorialized content, comparative information, functional filters, personalized recommendations, or any data that justifies the existence of the page beyond simply ranking for a query.

If your page 'Plumber Paris 15' differs from 'Plumber Paris 16' only by the postal code in the H1, you are in the gray area. If it contains an interactive map, verified location-based reviews, average neighborhood rates, that's different.

What does 'perceived as obsolete' mean in practice?

Google does not mention an explicit manual penalty here but talks about algorithmic downgrading. A site packed with empty or duplicate directory pages will be treated as light spam: less crawl budget allocated, gradual de-indexing, or near-zero ranking.

The term 'obsolete' is significant. Google suggests that this type of structure belongs to the past of SEO, when it was possible to manipulate the index with minimal variations. The modern algorithm favors the consolidation of strong content rather than the scattering of weak pages.

  • Legitimate Directory: Provides navigation, context, or unique data at each hierarchical level.
  • Doorway Page: Created solely to intercept a long-tail query without offering original content.
  • Gray Area: Filter pages or multiple geolocations with minimal templated content – risk of downgrading if there is no real differentiation.
  • Key Criterion: Would the page be useful if it offered no SEO traffic? If the answer is no, you're likely outside the limits.

SEO Expert opinion

Does this statement align with what is observed in the field?

Yes, but with massive inconsistencies across sectors. In real estate, travel, or local directories, some sites still rank with thousands of ultra-templated directory pages. Google does not treat all of them as obsolete, far from it.

The reality is that the algorithm tolerates these structures better if the domain has strong authority, a significant volume of backlinks, or a long history. A new site attempting the same approach gets crushed. The double standard is obvious. [To be verified] if Google really applies this filter uniformly or lets major players slip through due to index inertia.

What nuances should be added to this official position?

Google states 'the purpose must provide value' but gives no quantifiable threshold. How many unique words? What duplication rate is acceptable? What depth of taxonomy is okay? Nothing. It’s all vague, protecting Google from any contractual obligation.

A second nuance: Mueller talks about perception, not binary penalties. A site can very well have pages perceived as obsolete without facing a visible sanction, just a slowdown in crawling or poor ranking. The signal is never clear, and you’ll never know if your issue comes from that or another factor.

In what cases does this rule not really apply?

Dominant marketplaces (Amazon, Booking, etc.) blatantly violate this rule with millions of self-generated filter pages. Google leaves them alone because they have massive direct user demand and huge brand search. Their 'value' does not come from editorialized content, but from inventory and liquidity.

Another exception: news or reference sites that create tag or subject pages with automatic aggregation. If the site has a high crawl frequency and a loyal audience, Google tolerates lighter structures as long as navigation remains coherent. The bar is not the same for everyone.

Warning: If you're creating dozens of location-based or thematic directory pages without differentiated content, monitor your indexing rate and crawl budget in Search Console. A sudden drop in the number of pages crawled per day is often the first signal of a 'doorway' perception issue.

Practical impact and recommendations

How can you assess if your directory pages are compliant or at risk?

Start with a manual audit of your category, filter, or geolocation pages. Ask yourself: does this page have a reason to exist if I remove SEO traffic? If the answer is 'no' for more than 30% of your indexed pages, you have a structural problem.

Next, analyze the crawl rate by page type in Search Console. If Google crawls your product pages 10 times more than your category pages, it’s a signal that the algorithm assesses them as less priority, even unnecessary. Also compare organic CTR: directory pages with a CTR < 1% are likely perceived as weak.

What concrete actions can secure your existing directories?

Add unique editorial content to each important category page: written introduction, specific FAQ, selection criteria, comparisons. If you manage 500 filter pages, prioritize the 50 generating 80% of the traffic, and enrich them first.

Use strategic noindex on low-value directory pages: improbable combined filters, granular geolocations without audience, automatically generated tags without content. It’s better to have a clean index of 200 strong pages than a polluted index of 2000 ghost pages.

How to structure new directory pages without risk?

Adopt a demand-driven approach: create a category page only if you have real search data (Search Console, Google Trends, keyword tools). No speculation on geo or thematic variations without proven volume.

For each new page, impose a minimum content threshold: 300 unique words, at least 5 differentiating elements (map, reviews, local stats, etc.), and a coherent internal linking structure. If you can’t meet these criteria, consolidate into a broader parent page.

  • Audit all category/filter pages and identify those with no unique content or significant traffic.
  • Enhance the 20% of directory pages generating 80% of the traffic with editorialized content, FAQs, specific data.
  • Noindex or remove combined filter pages, hyper-granular geolocations, or auto-generated tags without audience.
  • Monitor crawl budget in Search Console: any sharp decrease in crawl on categories = alert signal.
  • Create new directory pages only if proven search volume and ability to produce 300+ unique words.
  • Test a sample of poorly performing pages by temporarily de-indexing them to measure the overall impact on crawl and ranking.
The distinction between a legitimate directory and a doorway page rests on a subjective criterion: the user value perceived by Google. Without clear thresholds, the safest strategy is to prioritize quality over quantity, enhance pages with high potential, and de-index weak variations. These optimizations require a detailed analysis of your crawl budget, traffic patterns, and time-consuming editorial rewriting. If you manage a complex site with hundreds of categories or filters, working with a specialized SEO agency can be invaluable for prioritizing actions, automating large-scale analysis, and avoiding pitfalls that trigger algorithmic downgrading.

❓ Frequently Asked Questions

Une page de catégorie e-commerce sans produits en stock peut-elle être considérée comme doorway page ?
Oui, si elle reste indexée et n'apporte aucune information alternative (prévision de réassort, produits similaires, contenu éditorial). Google la verra comme une coquille vide destinée uniquement à ranker.
Faut-il noindexer systématiquement les pages de filtres combinés (ex: « chaussures rouges taille 42 Paris ») ?
Pas systématiquement, mais si aucune recherche réelle n'existe pour cette combinaison et que la page est 100 % templée, le noindex limite le risque de dilution de crawl budget et de perception doorway.
Combien de mots uniques minimum faut-il sur une page de répertoire pour éviter le filtre doorway ?
Google ne donne aucun chiffre. Par expérience, 250-300 mots uniques et structurés réduisent le risque, mais le vrai critère reste la différenciation réelle par rapport aux pages sœurs.
Un site avec 10 000 pages de répertoire géolocalisées peut-il ranker sans problème si l'autorité de domaine est forte ?
Oui, en pratique beaucoup de gros acteurs le font. Mais un nouveau site tentant la même structure sera probablement ignoré ou déclassé, car l'algo tolère mieux les structures anciennes ou autoritaires.
Si Google crawle moins mes pages de catégories, est-ce un signal qu'il les perçoit comme doorways ?
Pas forcément, mais c'est un indicateur de faible priorité. Si le crawl chute brutalement sans raison technique (vitesse, serveur), c'est souvent que l'algo juge ces pages peu utiles ou redondantes.
🏷 Related Topics
Domain Age & History AI & SEO Penalties & Spam

🎥 From the same video 12

Other SEO insights extracted from this same Google Search Central video · duration 56 min · published on 08/12/2015

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.