Official statement

For classified sites with expired content, either redirect to the category page (soft 404) or return a 404 error. Both options remove the page from search results. Do not keep old pages labeled 'expired' for too long as Google needs to crawl and recognize that they are outdated.
722:53
🎥 Source video

Extracted from a Google Search Central video

⏱ 996h50 💬 EN 📅 12/03/2021 ✂ 43 statements
TL;DR

Google recommends returning a 404 for expired content (ads, events, promotions) or redirecting it to a category page, rather than leaving it indexed with a simple 'expired' label. Leaving these pages crawlable forces Google to keep recrawling them to confirm they are obsolete, which dilutes crawl budget and pollutes the index. In practice, a 404 or a 301 redirect to the relevant category speeds up de-indexation and frees up crawl resources for active pages.

What you need to understand

Why does Google insist on removing expired content?

Google's logic is simple: every crawlable page consumes crawl budget. If a classified site keeps 10,000 expired listings marked 'expired' but still accessible with a 200 OK status, Googlebot has to revisit them regularly to check whether their status has changed. This is resource-intensive and slows down the discovery of new listings and of updates to active content.

Mueller specifies that a redirect or a 404 allows Google to quickly remove these pages from the index. A 404 clearly signals 'this content no longer exists,' triggering a gradual de-indexation. A 301 redirect to the parent category transfers the relevance signal (and some link equity, if the page had any) to a page that is still useful. Google treats a redirect to a generic page as a soft 404: it understands that the original content no longer exists and adjusts its handling accordingly.

What is the difference between a real 404 and a soft 404?

A soft 404 occurs when a page returns a 200 OK status but displays empty or generic content (a 'this ad no longer exists' message, or a blank page with just a header and footer). Google detects these signals and treats the page as if it returned a 404, but this process takes time and consumes crawl budget unnecessarily.

A real 404 is an explicit HTTP signal. Google understands that the resource no longer exists as soon as it recrawls the URL, which accelerates de-indexation. A 301 redirect to a relevant category is an interesting compromise: it limits the loss of authority and guides the user (and the bot) to content that is still relevant. A word of caution, however: mass redirecting to the homepage is treated as a soft 404 by Google, as the destination has no semantic relation to the original URL.
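
To make the distinction concrete, here is a minimal Python sketch of a check that flags soft-404 candidates on your own site. It is only an illustrative heuristic and not Google's detection logic: the marker phrases, the `min_chars` threshold, and the function name are assumptions chosen for the example.

```python
from typing import Sequence

# Hypothetical phrases that an "expired listing" template might display.
EXPIRED_MARKERS: Sequence[str] = (
    "this ad no longer exists",
    "this listing has expired",
)


def looks_like_soft_404(status: int, body_text: str, min_chars: int = 200) -> bool:
    """Flag pages that answer 200 OK but carry empty or 'expired' content."""
    if status != 200:
        # A real 404 (or a redirect) is an explicit signal, not a soft 404.
        return False
    text = body_text.strip().lower()
    if len(text) < min_chars:
        # Near-empty page: likely just a header and footer.
        return True
    return any(marker in text for marker in EXPIRED_MARKERS)
```

Running such a check over a crawl of your expired URLs gives a rough list of pages that should return a real 404 or a targeted 301 instead.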

What is the real cost of keeping thousands of indexable expired pages?

The first impact is wasted crawl budget. A site with 50,000 active listings and 200,000 expired listings still accessible with a 200 OK status forces Google to crawl an excessive volume of outdated pages: four out of five crawlable URLs are obsolete. This ratio pulls the bot's attention away from what really matters: fresh listings, updated categories, conversion pages.

The second effect is index pollution. Google can temporarily keep these pages in its results, generating clicks to expired content, which degrades the user experience and increases the bounce rate. Over time, Google may adjust its perception of the site's quality and reduce crawl frequency or downgrade certain sections. On fast-turnover sites (real estate, automotive, events), the problem compounds quickly.

  • 404 or 301: both options are valid depending on the context; a 404 is cleaner for truly outdated content, a 301 preserves authority if the page had backlinks.
  • Avoid soft 404s: do not return 200 OK with an 'expired page' message; it is the worst compromise for crawl budget.
  • Smart redirection: redirect to the category or a relevant similar page, never to a generic homepage.
  • Timeliness: do not keep indexable expired pages 'just in case'; Google recommends quick and proactive management.
  • High-turnover sites: classifieds, events, and flash promotions are particularly affected by this directive.

SEO Expert opinion

Is this recommendation consistent with observational data?

Yes, largely. SEO audits of sites with high volumes of temporary content (real estate, automotive, events) consistently show crawl budget wasted on expired pages. Server logs reveal that Googlebot may spend 60 to 80% of its time on outdated URLs if they remain accessible with a 200 OK status. Search Console confirms the pattern: coverage reports show thousands of 'Crawled - currently not indexed' pages that often correspond to expired content that Google visits but deems irrelevant.

The nuance lies in the residual value of certain expired pages. A real estate listing that has generated quality backlinks or sustained organic traffic may benefit from being redirected to a similar listing or a geolocated category rather than being deleted abruptly. In this case, the 301 preserves some authority. However, this assumes granular handling, which is rarely scalable across tens of thousands of URLs.

What are the limitations of this directive for certain site models?

E-commerce sites with frequent restocks pose a real dilemma. A product out of stock today may be back in stock in 15 days. Returning a 404 or redirecting prematurely risks losing SEO history (positions, backlinks, organic traffic). For this case, Google itself recommends keeping the page at 200 OK with a clear 'temporarily unavailable' message and Product structured data whose availability is set to OutOfStock. This is not a contradiction, but a distinction between 'definitively expired' (sold listing) and 'temporarily unavailable' (out of stock).
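
As an illustration of the 'temporarily unavailable' case, the Python sketch below builds a JSON-LD snippet for an out-of-stock product using the schema.org `Product`, `Offer`, and `OutOfStock` vocabulary. The function name and the sample values are hypothetical, and a real product page would normally carry more properties (image, description, and so on).

```python
import json


def out_of_stock_jsonld(name: str, url: str, price: str, currency: str) -> str:
    """Return a <script> tag with Product markup flagged as out of stock."""
    data = {
        "@context": "https://schema.org",
        "@type": "Product",
        "name": name,
        "url": url,
        "offers": {
            "@type": "Offer",
            "price": price,
            "priceCurrency": currency,
            # The page stays at 200 OK; only the availability value changes.
            "availability": "https://schema.org/OutOfStock",
        },
    }
    return '<script type="application/ld+json">' + json.dumps(data) + "</script>"
```

When the product is restocked, the same template can switch the availability value to `https://schema.org/InStock` without touching the URL.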

Another edge case: news or press sites. An article about a past event remains relevant for informational searches even if the event is over. Deleting or redirecting this content would be counterproductive. Mueller specifically refers to 'classifieds' and content with no inherent value once expired. Context therefore matters: the rule applies to ephemeral transactional content, not to lasting informational content. [To be verified] Hybrid cases such as seasonal buying guides ('Best smartphone 2023') remain an open question: should they be redirected or updated?

What about managing soft 404s and chain redirects?

Google detects soft 404s with increasing accuracy thanks to machine learning, but detection is not instantaneous. Pages that return 200 OK with an 'expired ad' message can remain indexed for several weeks or even months before Google corrects course. This delay is a black hole for crawl budget. An explicit 404 or a well-targeted 301 avoids this latency.

However, be careful with chain redirects. If ad A redirects to category B, which in turn redirects to landing page C, Google may lose part of the signal and treat the journey as a dead end. Redirects should be direct and point to a stable destination. Similarly, redirecting 10,000 expired ads en masse to 5 generic categories can raise a flag on Google's side: the bot will read these redirects as a technical workaround, not a true semantic match. The result is that they are treated as soft 404s.

Warning: Massive redirects to generic pages may be interpreted as soft 404s by Google. Always prioritize granularity and semantic relevance of the destination.
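
One way to keep redirects direct is to flatten the redirect table before deploying it. The Python sketch below assumes the redirects are held as a simple source-to-target mapping of paths (a hypothetical representation; real rules may live in server config or a database) and rewrites every source so it points straight at its final destination.

```python
from typing import Dict


def flatten_redirects(redirects: Dict[str, str]) -> Dict[str, str]:
    """Point each source directly at the end of its redirect chain."""
    flattened: Dict[str, str] = {}
    for source, target in redirects.items():
        seen = {source}
        # Follow the chain while the current target is itself redirected.
        while target in redirects:
            if target in seen:
                raise ValueError(f"redirect loop detected at {target}")
            seen.add(target)
            target = redirects[target]
        flattened[source] = target
    return flattened
```

With this, a chain such as ad A to category B to landing page C becomes two single-hop redirects, A to C and B to C, and loops are rejected instead of being shipped.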

Practical impact and recommendations

How to effectively handle expired content on a classified site?

First step: segment expired content according to its residual value. Listings with no backlinks, no historical organic traffic, and no potential for reactivation should simply return a 404. This is the most common scenario on fast-turnover sites. For listings that have built authority (backlinks, social shares, sustained traffic), prioritize a 301 redirect to the most relevant category, or to a similar active listing if you have the technical capacity to match them automatically.

Second point: automate the process. On a site with thousands of ads changing daily, managing expirations manually is impossible. Set up a server-side script that, X days after expiration (e.g., 7 days to leave a margin for reactivation), automatically switches the page's HTTP status to 404 or triggers a 301 redirect according to predefined rules (presence of backlinks, historical traffic volume, category). This process should be auditable via logs so that large-scale errors can be caught.
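
The rule set described above could be sketched in Python as follows. This is a minimal illustration, not a reference implementation: the `Listing` fields, the 7-day grace period (taken from the example in the text), and the backlink and traffic thresholds are all assumptions to adapt to your own data.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Optional, Tuple

GRACE_PERIOD = timedelta(days=7)  # assumption: reactivation margin from the text


@dataclass
class Listing:
    url: str
    expired_on: Optional[date]       # None while the listing is active
    backlinks: int
    monthly_organic_visits: int
    category_url: Optional[str]      # most relevant category, if known


def expired_response(
    listing: Listing,
    today: date,
    min_backlinks: int = 1,          # hypothetical threshold
    min_visits: int = 50,            # hypothetical threshold
) -> Tuple[int, Optional[str]]:
    """Return (HTTP status, redirect target) for a listing on a given day."""
    if listing.expired_on is None or today - listing.expired_on < GRACE_PERIOD:
        return 200, None
    has_value = (
        listing.backlinks >= min_backlinks
        or listing.monthly_organic_visits >= min_visits
    )
    if has_value and listing.category_url:
        return 301, listing.category_url
    return 404, None
```

Logging each decision (URL, date, status, target) alongside this function keeps the process auditable.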

What mistakes to avoid during implementation?

Never redirect en masse to the homepage. This is the worst mistake: Google treats these redirects as soft 404s and you lose the advantage of a 301. The destination must have semantic consistency with the source URL. An ad for a rental in Paris 15th should redirect to the category 'Rentals Paris 15th,' not to 'All our listings.'
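
A simple way to enforce this rule in code is to walk up the listing's category path and stop before reaching a generic level. The Python sketch below assumes categories are addressed by hierarchical paths such as `/rentals/paris/15th/` (a hypothetical URL scheme) and returns `None` when no sufficiently specific category exists, in which case a 404 is the safer answer.

```python
from typing import Optional, Sequence, Set


def most_specific_category(
    path: Sequence[str],
    active_categories: Set[str],
    min_depth: int = 2,  # assumption: depth-1 pages are too generic to target
) -> Optional[str]:
    """Return the deepest active category URL for a listing, or None."""
    for depth in range(len(path), min_depth - 1, -1):
        candidate = "/" + "/".join(path[:depth]) + "/"
        if candidate in active_categories:
            return candidate
    return None
```

Because the function never falls back to the homepage or a top-level 'all listings' page, a missing match surfaces explicitly instead of turning into a mass redirect.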

Another trap: keeping pages with an 'expired' banner for too long. Mueller says 'too long' without specifying a threshold, but field observations suggest that beyond 2-3 weeks, crawl budget is already significantly impacted. If you need to retain a record for UX (user history, commercial follow-up), keep it out of the index (a robots.txt block or a temporary noindex, but not both on the same URL, since Google has to crawl a page to see its noindex), then switch to 404 or 301 as soon as possible. [To be verified] Whether a temporary noindex is preferable to a 200 OK with an 'expired' label: theoretically yes, but it adds a technical step.

How to check that expired content management aligns with Google recommendations?

Analyze your server logs to measure how much crawling goes to expired URLs. If Googlebot spends more than 20-30% of its crawl on these pages, you have a problem. Compare the ratio of active to obsolete crawl over a rolling 30-day period. Use Search Console to track 'Crawled - currently not indexed' URLs, which often correspond to undeclared soft 404s.
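
The share of Googlebot requests spent on expired URLs can be estimated with a few lines of Python. This sketch assumes access logs in the common 'combined' format and a set of expired paths exported from your own database; both are assumptions about your setup. It also trusts the user-agent string, which can be spoofed, so a stricter audit would additionally verify Googlebot by reverse DNS.

```python
import re
from typing import Iterable, Set

# Matches the request path and the final quoted field (the user agent)
# of a combined-format access log line.
LOG_LINE = re.compile(
    r'"(?:GET|HEAD) (?P<path>\S+) [^"]*" \d{3} .*"(?P<agent>[^"]*)"$'
)


def expired_crawl_share(lines: Iterable[str], expired_paths: Set[str]) -> float:
    """Fraction of Googlebot requests that hit known expired paths."""
    total = 0
    expired = 0
    for line in lines:
        match = LOG_LINE.search(line.strip())
        if not match or "Googlebot" not in match.group("agent"):
            continue
        total += 1
        if match.group("path") in expired_paths:
            expired += 1
    return expired / total if total else 0.0
```

Feeding it the last 30 days of logs gives a figure to compare directly against the 20-30% threshold mentioned above.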

Manually test a sample of expired URLs: check the returned HTTP code (404 or 301), the destination if redirected, and the speed of de-indexation (for example with a search such as site:yourdomain.com "exact title of the ad"). If expired pages remain indexed after 4-6 weeks, it is a warning signal. Finally, monitor your crawl stats and response times: an improvement after cleaning up expired pages suggests that crawl budget is being better allocated.
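
The status-code part of this spot check can be scripted. The Python sketch below uses only the standard library and sends HEAD requests without following redirects, so the 301 itself stays visible. The verdict labels are hypothetical, treating 410 like 404 is an assumption not covered by the statement, and some servers answer HEAD differently from GET, so adjust as needed.

```python
import urllib.error
import urllib.request
from typing import Callable, Dict, Iterable, Optional, Tuple

Fetch = Callable[[str], Tuple[int, Optional[str]]]


class NoRedirect(urllib.request.HTTPRedirectHandler):
    """Stop urllib from following redirects so the 3xx response is reported."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        return None  # urllib then raises HTTPError for the redirect


def fetch_status(url: str) -> Tuple[int, Optional[str]]:
    """Return (status, Location header) for one URL."""
    opener = urllib.request.build_opener(NoRedirect)
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener.open(request, timeout=10) as response:
            return response.status, None
    except urllib.error.HTTPError as err:
        return err.code, err.headers.get("Location")


def audit_expired_urls(urls: Iterable[str], fetch: Fetch = fetch_status) -> Dict[str, str]:
    """Classify each expired URL by what the server actually returns."""
    report: Dict[str, str] = {}
    for url in urls:
        status, location = fetch(url)
        if status in (404, 410):  # assumption: 410 handled like 404
            report[url] = "gone"
        elif status in (301, 308) and location:
            report[url] = f"redirects to {location}"
        else:
            report[url] = f"needs review (HTTP {status})"
    return report
```

Any URL reported as "needs review" (typically a 200 OK or a temporary redirect) is a candidate soft 404, and redirect targets can then be checked by hand for semantic relevance.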

  • Segment expired content according to its residual value (backlinks, traffic, potential for reactivation).
  • Automate switching to 404 or 301 after expiration, with granular rules.
  • Avoid massive redirects to the homepage or generic pages.
  • Regularly audit server logs to measure crawl on obsolete content.
  • Check effective de-indexation of expired pages via Search Console and manual searches.
  • Test returned HTTP codes and relevance of redirect destinations.

Managing expired content is a critical lever for optimizing crawl budget and maintaining a clean index. A clear strategy (404 vs 301) coupled with technical automation allows this issue to be addressed at scale. These optimizations, while conceptually simple, often require advanced technical expertise to be deployed effectively on high-volume sites. If you manage a site with thousands of temporary pages, support from a specialized SEO agency may prove valuable to design a robust architecture, audit impacts, and carry out migrations without loss of visibility.

❓ Frequently Asked Questions

Does a 404 hurt the site's overall SEO?
No. Google has confirmed on many occasions that 404s are normal and do not affect the ranking of other pages. They simply signal that a resource no longer exists. What hurts SEO is keeping obsolete pages indexable, which dilutes crawl budget.
How long does it take for a page returning a 404 to disappear from the index?
Generally between a few days and 4-6 weeks, depending on the site's crawl frequency and the age of the page. A heavily crawled page disappears quickly. A rarely visited page can stay indexed for several months before Google removes it for good.
Should you redirect to the category or to a similar listing?
To a similar listing if you can match automatically and relevantly (same geographic area, same property type, same price range). Otherwise, to the most specific category possible. Never to the homepage.
Can you keep an expired page as noindex instead of deleting it?
Technically yes, but it is suboptimal. A noindex prevents indexing, but the page remains crawlable and so still consumes crawl budget. A 404 or a 301 is more effective at fully freeing up crawl resources.
What is a reasonable delay before switching an expired listing to 404 or 301?
Between 7 and 14 days at most after expiration, unless you have a business reason to extend it (customer follow-up, reactivation window). Beyond 2-3 weeks, the negative impact on crawl budget becomes measurable.
