
Official statement

For classified sites with expired content, either redirect to the category page (soft 404) or return a 404 error. Both options remove the page from search results. Do not keep old pages labeled 'expired' for too long as Google needs to crawl and recognize that they are outdated.
722:53
🎥 Source video

Extracted from a Google Search Central video

⏱ 996h50 💬 EN 📅 12/03/2021 ✂ 43 statements
TL;DR

Google recommends returning a 404 or redirecting expired content (ads, events, promotions) to a category page rather than leaving it indexed with a simple 'expired' label. Leaving these pages crawlable forces Google to continuously recrawl to check their obsolescence, which dilutes the crawl budget and pollutes the index. Specifically, a 404 or a 301 redirect to the relevant category speeds up de-indexation and frees up crawl resources for active pages.

What you need to understand

Why does Google insist on removing expired content?

Google's logic is simple: every crawlable page consumes crawl budget. If a classified site keeps 10,000 expired listings marked 'expired' but still accessible with a 200 OK status, Googlebot has to revisit them regularly to check whether their status has changed. This process is resource-intensive and slows down the discovery of new listings and of updates to active content.

Mueller specifies that a redirect or a 404 allows Google to quickly remove these pages from the index. A 404 clearly signals 'this content no longer exists,' triggering a gradual de-indexation. A 301 redirect to the parent category transfers the relevance signal (and some link equity, if the page had any) to a page that is still useful. Google treats a redirect to a generic page as a soft 404: it understands that the original content no longer exists and handles the URL accordingly.

What is the difference between a real 404 and a soft 404?

A soft 404 occurs when a page returns a 200 OK status but displays empty or generic content (a 'this ad no longer exists' message, or a blank page with just a header and footer). Google detects these signals and treats the page as if it returned a 404, but this process takes time and consumes crawl budget unnecessarily.

A real 404 is an explicit HTTP signal. Google understands as soon as it recrawls the URL that the resource no longer exists, which accelerates de-indexation. A 301 redirect to a relevant category is a useful compromise: it avoids the loss of authority and guides the user (and the bot) to content that is still relevant. One caution, however: mass redirecting to the homepage is treated as a soft 404 by Google, as the destination has no semantic relation to the original URL.
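To make the three outcomes concrete, here is a minimal Python sketch of the status decision for a listing URL. The `Listing` class, its fields, and the `respond` function are hypothetical names introduced for illustration; they are not tied to any particular framework.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Listing:
    slug: str
    expired: bool
    # Hypothetical field: URL of the closest matching category, if one exists.
    category_url: Optional[str] = None


def respond(listing: Optional[Listing]) -> tuple[int, dict[str, str]]:
    """Return (status_code, headers) for a request to a listing URL."""
    if listing is None:
        return 404, {}  # unknown URL: real 404
    if not listing.expired:
        return 200, {}  # active listing: serve normally
    if listing.category_url:
        # Expired, with a semantically related destination: 301 to it.
        return 301, {"Location": listing.category_url}
    # Expired and nothing relevant to point to: explicit 404,
    # never a 200 page that merely says "expired" (soft 404).
    return 404, {}
```

The point of the sketch is the branch that is missing: there is no path that returns 200 for an expired listing.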

What is the real cost of keeping thousands of indexable expired pages?

The first impact is the waste of crawl budget. A site with 50,000 active listings and 200,000 expired listings still accessible with a 200 OK status forces Google to crawl an excessive volume of outdated pages. This ratio diverts the bot's attention from what really matters: fresh listings, updated categories, conversion pages.

The second effect is pollution of the index. Google can temporarily keep these pages in its results, generating clicks to expired content, which degrades the user experience and increases the bounce rate. Over time, Google adjusts its quality perception of the site and may reduce crawling frequency or downgrade certain sections. On fast-turnaround sites (real estate, automotive, events), the problem compounds quickly as expired pages pile up.

  • 404 or 301: both options are valid depending on the context; a 404 is cleaner for truly outdated content, a 301 preserves authority if the page had backlinks.
  • Avoid soft 404s: do not return 200 OK with an 'expired page' message; it is the worst compromise for crawl budget.
  • Smart redirection: redirect to the category or a relevant similar page, never to a generic homepage.
  • Timeliness: do not keep indexable expired pages 'just in case'; Google recommends quick and proactive management.
  • High turnover sites: classifieds, events, and flash promotions are particularly affected by this directive.

SEO Expert opinion

Is this recommendation consistent with observational data?

Yes, largely. SEO audits of high-volume temporary content sites (real estate, automotive, events) consistently show crawl budget wasted on expired pages. Server logs reveal that Googlebot may spend 60 to 80% of its time on outdated URLs if they remain accessible with a 200 OK status. This pattern is confirmed in Search Console: coverage reports show thousands of 'crawled, currently not indexed' pages that often correspond to expired content that Google visits but deems irrelevant.

The nuance lies in the residual value of certain expired pages. A real estate listing that has generated quality backlinks or sustained organic traffic may benefit from being redirected to a similar listing or a geolocated category rather than being deleted abruptly. In this case, the 301 preserves some authority. However, this assumes granular handling, which is rarely scalable across tens of thousands of URLs.

What are the limitations of this directive for certain site models?

E-commerce sites with frequent restocks pose a real dilemma. A product out of stock today may be back in stock in 15 days. Switching it to a 404 or redirecting it prematurely risks losing SEO history (positions, backlinks, organic traffic). Google itself recommends in this case keeping the page at 200 OK with a clear 'temporarily unavailable' message and Product structured data with availability set to OutOfStock. This is not a contradiction, but a distinction between 'definitively expired' (sold listing) and 'temporarily unavailable' (out of stock).
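As an illustration of the out-of-stock case, the sketch below builds a schema.org Product JSON-LD payload in Python. The function name, parameters, and sample values are assumptions made for the example; only the schema.org types and the `InStock` / `OutOfStock` availability values come from the vocabulary itself.

```python
import json


def product_jsonld(name: str, url: str, price: str, currency: str, in_stock: bool) -> str:
    """Serialize minimal schema.org Product markup with an availability flag."""
    availability = (
        "https://schema.org/InStock" if in_stock else "https://schema.org/OutOfStock"
    )
    data = {
        "@context": "https://schema.org",
        "@type": "Product",
        "name": name,
        "offers": {
            "@type": "Offer",
            "url": url,
            "price": price,
            "priceCurrency": currency,
            "availability": availability,
        },
    }
    return json.dumps(data, indent=2)


# Hypothetical product that is temporarily unavailable but keeps its 200 OK page.
markup = product_jsonld(
    "Example phone", "https://www.example.com/p/phone", "299.00", "EUR", in_stock=False
)
```

The resulting string would be embedded in a `<script type="application/ld+json">` element on the product page, which stays live while stock is replenished.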

Another edge case: news or press sites. An article about a past event remains relevant for informational search even if the event is over. Deleting or redirecting this content would be counterproductive. Mueller specifically refers to 'classifieds' and content with no inherent value once expired. Therefore, context is important: the rule applies to ephemeral transactional content, not to lasting informational content. [To be verified] Hybrid cases such as seasonal buying guides ('Best smartphone 2023') remain open: should they be redirected or updated?

What about managing soft 404s and chain redirects?

Google detects soft 404s with increasing accuracy thanks to machine learning, but this is not instantaneous. A site that returns 200 OK on 'expired ad' pages can remain indexed for several weeks or even months before Google corrects course. This delay is a black hole for crawl budget. An explicit 404 or a well-targeted 301 avoids this latency.

However, be careful with chain redirects. If an ad A redirects to a category B, which in turn redirects to a landing page C, Google may lose part of the signal and consider the journey a dead end. Redirects should be direct and point to a stable destination. Similarly, massively redirecting 10,000 expired ads to 5 generic categories can raise a flag at Google: the bot will understand that these redirects are a technical workaround, not a true semantic match. Result: they are treated as soft 404s.
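One way to keep redirects direct is to flatten the redirect map before deploying it, so that every source points straight at its final destination. The sketch below is a minimal Python version; the function name and the `max_hops` limit are assumptions (an arbitrary safety bound, not a documented Google threshold).

```python
def flatten_redirects(redirects: dict[str, str], max_hops: int = 5) -> dict[str, str]:
    """Rewrite a source -> target map so that no target is itself redirected."""
    flat = {}
    for source in redirects:
        seen = {source}
        target = redirects[source]
        hops = 1
        # Follow the chain until the target is no longer a redirected URL.
        while target in redirects:
            if target in seen or hops >= max_hops:
                raise ValueError(f"redirect loop or overlong chain starting at {source}")
            seen.add(target)
            target = redirects[target]
            hops += 1
        flat[source] = target
    return flat


# Ad A -> category B -> landing page C becomes A -> C and B -> C.
chain = {"/ad/a": "/category/b", "/category/b": "/landing/c"}
flat = flatten_redirects(chain)
```

Raising on loops rather than silently skipping them is a deliberate choice here: a looping rule is a configuration error that should block the deployment.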

Warning: Massive redirects to generic pages may be interpreted as soft 404s by Google. Always prioritize granularity and semantic relevance of the destination.

Practical impact and recommendations

How to effectively handle expired content on a classified site?

First step: segment expired content according to its residual value. Listings with no backlinks, no historical organic traffic, and no potential for reactivation should simply return a 404. This is the most common scenario on fast-turnover sites. For listings that have generated authority (backlinks, social shares, sustained traffic), prioritize a 301 redirect to the most relevant category, or to a similar active listing if you have the technical capacity to match automatically.
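The segmentation can be sketched as a small classification function. Everything below is illustrative: the field names, the thresholds, and the 'hold' outcome for listings that may be reactivated are assumptions to adapt to your own data, not values taken from Google.

```python
from dataclasses import dataclass


@dataclass
class ExpiredListing:
    url: str
    backlinks: int            # referring links found by your backlink tool
    organic_visits_90d: int   # hypothetical metric: organic visits over the last 90 days
    may_be_reactivated: bool


def classify(listing: ExpiredListing, min_backlinks: int = 1, min_visits: int = 100) -> str:
    """Return 'hold', 'redirect_301' or 'return_404' for an expired listing."""
    if listing.may_be_reactivated:
        return "hold"  # assumption: wait before changing the status
    has_authority = (
        listing.backlinks >= min_backlinks
        or listing.organic_visits_90d >= min_visits
    )
    return "redirect_301" if has_authority else "return_404"
```

Keeping the thresholds as parameters makes it easy to review how many URLs fall into each bucket before committing to a rule.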

Second point: automate the process. On a site with thousands of ads changing daily, manually managing expirations is impossible. Set up a server-side script that, X days post-expiration (e.g., 7 days to allow a reactivation margin), automatically switches the HTTP status of the page to 404 or triggers a 301 redirect according to predefined rules (presence of backlinks, historical traffic volume, category). This process should be auditable via logs to prevent major errors.
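A batch job along these lines could look like the following Python sketch. It only computes and logs the planned status for each URL; applying the plan to your server or CMS is left out. The `Listing` fields, the logger name, and the `has_authority` flag are hypothetical, and the 7-day grace period is simply the example value from the text.

```python
import logging
from dataclasses import dataclass
from datetime import date, timedelta

GRACE_PERIOD = timedelta(days=7)  # the "X days" reactivation margin (example value)

audit_log = logging.getLogger("expired_listings")


@dataclass
class Listing:
    url: str
    expired_on: date
    has_authority: bool  # hypothetical flag: backlinks or traffic above your thresholds


def plan_status_changes(listings: list[Listing], today: date) -> dict[str, int]:
    """Map each URL past its grace period to the HTTP status it should now serve."""
    plan = {}
    for listing in listings:
        if today - listing.expired_on < GRACE_PERIOD:
            continue  # still inside the reactivation window: leave untouched
        status = 301 if listing.has_authority else 404
        plan[listing.url] = status
        # One audit line per decision, so a bad rule can be traced and reverted.
        audit_log.info("%s expired on %s -> %d", listing.url, listing.expired_on, status)
    return plan
```

Separating the planning step from the step that applies it makes a dry run possible: the plan can be reviewed in the logs before any status actually changes.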

What mistakes to avoid during implementation?

Never redirect en masse to the homepage. This is the worst mistake: Google treats these redirects as soft 404s and you lose the advantage of a 301. The destination must have semantic consistency with the source URL. An ad for a rental in Paris 15th should redirect to the category 'Rentals Paris 15th,' not to 'All our listings.'
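A simple way to enforce this rule is to look for the most specific category that actually exists and to refuse any generic fallback. The Python sketch below assumes a hypothetical URL scheme of the form `/<type>/<city>/<district>/`; the function and parameter names are illustrative.

```python
from typing import Optional


def redirect_target(
    listing_type: str,
    city: str,
    district: Optional[str],
    existing_categories: set[str],
) -> Optional[str]:
    """Return the most specific existing category URL, or None if there is none."""
    candidates = []
    if district:
        candidates.append(f"/{listing_type}/{city}/{district}/")
    candidates.append(f"/{listing_type}/{city}/")
    for url in candidates:
        if url in existing_categories:
            return url
    # No relevant category: the caller should serve a 404
    # instead of redirecting to the homepage or "all listings".
    return None


categories = {"/rentals/paris/", "/rentals/paris/15/"}
target = redirect_target("rentals", "paris", "15", categories)
```

Returning `None` instead of a default URL is the important design choice: it forces the calling code to choose a 404 explicitly when no semantic match exists.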

Another trap: keeping pages with an 'expired' banner for too long. Mueller talks about 'too long' without specifying a threshold, but field observations suggest that beyond 2-3 weeks, crawl budget is already significantly impacted. If you need to retain a record for UX (user history, commercial follow-up), do it in an area closed to Googlebot (robots.txt disallow) or under a temporary noindex, then switch to 404 or 301 as soon as possible. Note that a noindexed page is still crawled, so only the robots.txt option saves crawl budget. [To be verified] Whether a temporary noindex is preferable to a 200 OK with an 'expired' label: theoretically yes, but it adds a technical step.

How to check that expired content management aligns with Google recommendations?

Analyze your server logs to identify the volume of crawl on expired URLs. If Googlebot spends more than 20-30% of its time on these pages, you have a problem. Compare the ratio of active crawl to obsolete crawl over a rolling 30-day period. Use Search Console to track 'crawled, currently not indexed' URLs, which often correspond to undeclared soft 404s.
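The share of Googlebot hits spent on expired URLs can be estimated with a short log-parsing script. The Python sketch below assumes access logs in the common 'combined' format and a set of expired paths exported from your own database; it matches the user-agent string only and does not verify Googlebot by reverse DNS, so spoofed bots would be counted too.

```python
import re
from typing import Iterable

# Request line, status code, then the last quoted field (the user agent).
LOG_LINE = re.compile(
    r'"(?:GET|HEAD) (?P<path>\S+) [^"]*" (?P<status>\d{3}) .*"(?P<agent>[^"]*)"$'
)


def expired_crawl_share(lines: Iterable[str], expired_paths: set[str]) -> float:
    """Fraction of Googlebot requests that hit a known expired path."""
    total = 0
    expired = 0
    for line in lines:
        match = LOG_LINE.search(line.strip())
        if not match or "Googlebot" not in match.group("agent"):
            continue
        total += 1
        if match.group("path") in expired_paths:
            expired += 1
    return expired / total if total else 0.0
```

Run over a rolling 30-day window of logs, the returned ratio can be compared directly with the 20-30% warning level mentioned above.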

Manually test a sample of expired URLs: check the returned HTTP code (404 or 301), the destination if redirected, and the speed of de-indexation (via a search such as site:yourdomain.com "exact title of the ad"). If expired pages remain indexed after 4-6 weeks, it's a warning signal. Finally, monitor your Core Web Vitals and crawl times: an improvement after cleaning expired pages suggests that the crawl budget is better allocated.
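For the HTTP part of this check, a small Python script can request each sampled URL without following redirects and label the result. This is a sketch under assumptions: the function names and verdict labels are invented for the example, 410 is accepted alongside 404 as a 'gone' signal, and some servers answer HEAD requests differently from GET, in which case the request method should be changed.

```python
import http.client
from typing import Optional
from urllib.parse import urlsplit


def fetch_status(url: str, timeout: float = 10.0) -> tuple[int, Optional[str]]:
    """Send one HEAD request (no redirect following); return (status, Location)."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.netloc, timeout=timeout)
    else:
        conn = http.client.HTTPConnection(parts.netloc, timeout=timeout)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        conn.request("HEAD", path, headers={"User-Agent": "expired-url-audit"})
        response = conn.getresponse()
        return response.status, response.getheader("Location")
    finally:
        conn.close()


def verdict(status: int, location: Optional[str]) -> str:
    """Label the response of an expired URL."""
    if status in (404, 410):
        return "ok: gone"
    if status in (301, 308):
        if location is None or urlsplit(location).path in ("", "/"):
            return "warning: redirects to homepage"
        return "ok: permanent redirect"
    if status == 200:
        return "problem: expired page still returns 200"
    return "review manually"
```

A typical use would be `verdict(*fetch_status(url))` over a few dozen expired URLs, with the flagged ones reviewed by hand.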

  • Segment expired content according to its residual value (backlinks, traffic, potential for reactivation).
  • Automate switching to 404 or 301 after expiration, with granular rules.
  • Avoid massive redirects to the homepage or generic pages.
  • Regularly audit server logs to measure crawl on obsolete content.
  • Check effective de-indexation of expired pages via Search Console and manual searches.
  • Test returned HTTP codes and relevance of redirect destinations.

Managing expired content is a critical lever for optimizing crawl budget and maintaining a clean index. A clear strategy (404 vs 301) coupled with technical automation allows this issue to be addressed at scale. These optimizations, while conceptually simple, often require advanced technical expertise to be deployed effectively on high-volume sites. If you manage a site with thousands of temporary pages, support from a specialized SEO agency may prove valuable to design a robust architecture, audit impacts, and carry out migrations without loss of visibility.

❓ Frequently Asked Questions

Does a 404 hurt the site's overall SEO?
No. Google has confirmed many times that 404s are normal and do not affect the ranking of other pages. They simply signal that a resource no longer exists. What hurts SEO is keeping obsolete pages indexable, which dilutes the crawl budget.
How long does it take for a 404 page to disappear from the index?
Generally between a few days and 4-6 weeks, depending on the site's crawl frequency and the age of the page. A heavily crawled page disappears quickly. A rarely visited page can stay indexed for several months before Google removes it for good.
Should you redirect to the category or to a similar listing?
To a similar listing if you can match automatically and relevantly (same geographic area, same type of property, same price range). Otherwise, to the most specific category possible. Never to the homepage.
Can you keep an expired page as noindex instead of removing it?
Technically yes, but it is suboptimal. Noindex prevents indexing, but the page remains crawlable and therefore consumes crawl budget. A 404 or a 301 is more effective at fully freeing up crawl resources.
What is a reasonable delay before switching an expired listing to 404 or 301?
Between 7 and 14 days at most after expiration, unless you have a business reason to extend it (customer follow-up, reactivation window). Beyond 2-3 weeks, the negative impact on crawl budget becomes measurable.
