What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

If a page is identified as irrelevant or a product is unavailable, it may be treated as a soft 404. This means Google could choose not to display this page in search results until it is updated to reflect valid content.
25:10
🎥 Source video

Extracted from a Google Search Central video

⏱ 54:18 💬 EN 📅 17/05/2018 ✂ 23 statements
Watch on YouTube (25:10) →
Other statements from this video 22
  1. 2:37 Le maillage entre plusieurs projets web est-il risqué pour le SEO ?
  2. 3:41 L'attribut hreflang influence-t-il vraiment le classement de vos pages internationales ?
  3. 6:00 Le ciblage géographique influence-t-il vraiment le classement local de votre site ?
  4. 10:21 Les liens ont-ils vraiment perdu de leur importance pour le ranking ?
  5. 13:12 Les signaux sociaux influencent-ils vraiment le classement Google ?
  6. 13:26 L'indexation Mobile First fonctionne-t-elle vraiment sans optimisation mobile ?
  7. 13:44 Pourquoi votre site ne retrouve-t-il pas son classement après la levée d'une pénalité manuelle ?
  8. 14:34 Comment Google choisit-il vraiment la version canonique d'une page en cas de contenu dupliqué ?
  9. 16:15 Le cache Google révèle-t-il vraiment les différences mobile-desktop qui impactent votre classement ?
  10. 17:42 L'indexation mobile-first signifie-t-elle que Google pénalise les sites non optimisés pour mobile ?
  11. 19:34 Faut-il vraiment implémenter hreflang sur tous les sites multilingues ?
  12. 23:41 La balise canonical écrase-t-elle vraiment toutes vos variations produit ?
  13. 25:20 Les soft 404 sur produits indisponibles peuvent-ils faire chuter vos positions ?
  14. 27:12 Les signaux sociaux influencent-ils réellement le référencement naturel ?
  15. 29:38 Les liens vers une page canonicalisée perdent-ils leur valeur SEO ?
  16. 31:44 Les canonicals et en-têtes rendus en JavaScript sont-ils réellement ignorés par Google ?
  17. 36:40 Faut-il encore optimiser la longueur de ses meta descriptions pour Google ?
  18. 50:01 Peut-on bloquer les fichiers vidéo MP4 dans robots.txt sans risquer de pénalités SEO ?
  19. 60:20 Faut-il vraiment optimiser la longueur de ses meta descriptions ?
  20. 70:24 Pourquoi Search Console affiche-t-il certaines ressources comme bloquées alors qu'elles sont censées être accessibles ?
  21. 73:40 Google indexe-t-il vraiment les réponses JSON brutes ?
  22. 75:16 Pourquoi le HTML statique initial d'une SPA conditionne-t-il son indexation ?
📅
Official statement from (7 years ago)
TL;DR

Google treats certain pages as soft 404s even if they return a 200 code, particularly unavailable product pages or content deemed irrelevant. These pages vanish from results until they are updated with valid content. Detection remains vague: Google does not clearly detail the criteria that turn a page into a soft 404, leaving SEOs uncertain about the exact threshold for triggering.

What you need to understand

What is a soft 404 and why does Google care about it?

A soft 404 refers to a page that returns an HTTP 200 (success) code when it should technically return a 404 (not found) or a 410 (definitely gone). Technically, the server says, "all is well," but the content is empty, generic, or worthless.

Google seeks to identify these situations where the response code does not reflect the reality of the page. An out-of-stock product showing "Out of stock" with an almost empty page, a category emptied of all its items, or an internal search that returns no results—these are typical cases.

How does Google detect that a page is a soft 404?

The engine relies on several behavioral and technical signals. Extremely short content compared to other pages on the site, recurring textual markers ("no results," "unavailable," "page not found"), or an HTML structure similar to real 404 pages trigger alerts.

Google also compares the page to its history: if it previously contained substantial content and has suddenly gone hollow, the risk increases. The bounce rate and lack of clicks in Search Console may strengthen the signal, though Google doesn’t explicitly admit this.

Why does this statement change the game for e-commerce?

Many online stores keep unavailable product listings live to retain SEO traffic or sales history. They display a message like "Temporarily out of stock" with a notification form.

If Google deems the content insufficient, the page disappears from results. It doesn't necessarily lose its complete indexing, but it becomes invisible. As long as you don’t update it with valid content, it remains out of play.

  • HTTP 200 doesn’t guarantee indexing: content takes precedence over HTTP status
  • Pages flagged as soft 404 are temporarily removed from results, not permanently deindexed
  • Google does not specify the exact content threshold required to avoid this treatment
  • Textual markers ("unavailable," "no results") are red flags for the algorithm
  • The time for restoration remains unclear: how long before Google recrawls and reevaluates the page?

SEO Expert opinion

Does this statement align with what’s observed in the field?

Yes, and it confirms behaviors seen for years. E-commerce sites with hundreds of empty product listings regularly see their indexing rate drop in Search Console. Google no longer counts them as "discovered but not indexed"; it categorizes them as soft 404s.

What’s new is Mueller’s clarity: he confirms it’s not a bug, it’s a architectural choice of the engine. If the content doesn’t hold up, the page is out of the scope. The issue is that “valid content” remains a subjective notion. How many words? What type of media? Google doesn’t say.

What nuances should be added to this rule?

Google does not treat all unavailable products the same way. A product sheet enhanced with customer reviews, FAQs, videos, and recommendations is at less risk than a skeletal page with just "Out of stock." Context matters.

Then there’s the issue of existing traffic. A page that still receives organic visits or quality backlinks seems to enjoy greater tolerance. Google hesitates to treat it as a soft 404 if it continues generating engagement signals. [To be verified]: this tolerance is undocumented anywhere; it’s an empirical observation.

In what cases does this rule not apply?

Internal search result pages with no results are a borderline case. If your site generates thousands of such URLs, they can be treated as soft 404s even if they technically display a coherent user interface. Here, the right reflex is to block these URLs via robots.txt or noindex.

Pages of temporarily empty categories pose the same challenge. If you sell swimsuits in winter and empty the category, Google may deem it irrelevant. The solution? Add editorial content, guides, or related products to maintain content density.

Caution: Google does not provide a specific timeframe between detecting a soft 404 and removing it from results. Some sites report a disappearance within days, while others take weeks. Crawl frequency likely plays a role, but without official confirmation.

Practical impact and recommendations

What steps should you take to avoid treatment as a soft 404?

The first step: audit all unavailable product pages or empty categories. Use Screaming Frog or your favorite crawler to identify pages with little textual content (fewer than 200 words) and markers of "out of stock," "unavailable," or "no results."

For each page, decide on a strategy: either enrich the content (reviews, FAQs, alternatives), redirect with a 301 to a parent or similar page, or go for a 404 or 410 if the product will not return. The HTTP status must reflect reality.

What mistakes should you absolutely avoid in managing unavailable products?

Never allow a page to return a 200 code with almost empty content. This is the perfect scenario for a soft 404. Google prefers a true 404 to a false 200 because it simplifies its crawling and indexing work.

Avoid chain redirections: product A unavailable → category B → sub-category C → homepage. Google loses patience after 3-4 hops and may give up. A direct redirection to the best alternative is always preferable.

How can you check that your pages are not being treated as soft 404s?

Check the “Pages” section in Search Console, tab “Why pages are not indexed.” If you see “Page with redirection” or “Soft 404” on URLs expected to be indexed, it’s a warning sign.

Also compare the number of indexed URLs in Search Console with the actual number of URLs in your sitemap. A significant gap (30% or more) often indicates that Google is filtering out many pages deemed irrelevant.

  • Audit pages with fewer than 200 words and markers “unavailable” or “out of stock”
  • Enhance exhausted product sheets with FAQs, reviews, guides, or alternatives
  • Use a true 404 or 410 for permanently discontinued products
  • Redirect temporary empty pages with a 301 to the parent category or a similar product
  • Monitor Search Console for soft 404 alerts
  • Block automatically generated URLs without value (internal search, empty filters) via robots.txt or noindex
Managing soft 404s requires ongoing monitoring and a keen understanding of site architecture. Between technical auditing, editorial enrichment, and redirect management, the task can quickly become complex, especially with catalogs featuring thousands of references. If you lack time or internal resources, a specialized SEO agency can structure a strategy suited to your sector and automate part of the monitoring to avoid silent traffic losses.

❓ Frequently Asked Questions

Un soft 404 entraîne-t-il une désindexation complète de la page ?
Non, la page reste techniquement indexée mais Google choisit de ne pas la montrer dans les résultats. Elle peut réapparaître si le contenu est corrigé et re-crawlé.
Combien de temps faut-il pour qu'une page soft 404 soit à nouveau visible ?
Google ne fournit pas de délai précis. Cela dépend de la fréquence de crawl du site et de la rapidité avec laquelle vous corrigez le contenu. Certains signalent quelques jours, d'autres plusieurs semaines.
Faut-il préférer un 404 réel ou un 200 avec peu de contenu ?
Un 404 réel est préférable. Il indique clairement à Google que la page n'existe plus, évitant ainsi le traitement ambiguë en soft 404 et permettant au moteur d'optimiser son crawl budget.
Les pages de recherche interne sans résultat sont-elles considérées comme soft 404 ?
Oui, si elles renvoient un code 200 et affichent un contenu très pauvre. Il est recommandé de les bloquer via robots.txt ou de leur appliquer une balise noindex.
Peut-on enrichir une fiche produit indisponible pour éviter le soft 404 ?
Oui, ajouter des avis clients, une FAQ, des produits alternatifs ou du contenu éditorial augmente les chances que Google la considère comme valable et la maintienne dans les résultats.
🏷 Related Topics
Domain Age & History Content E-commerce AI & SEO Local Search

🎥 From the same video 22

Other SEO insights extracted from this same Google Search Central video · duration 54 min · published on 17/05/2018

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.