What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

If a site that has been closed for a year is relaunched on a new domain and its content has been copied by scraping sites, Google will likely not penalize the legitimate site for duplication. The algorithm generally detects who the original author is. If in doubt, report spam sites via the Spam Report.
50:14
🎥 Source video

Extracted from a Google Search Central video

⏱ 1h14 💬 EN 📅 04/06/2020 ✂ 44 statements
Watch on YouTube (50:14) →
Other statements from this video 43
  1. 2:22 What should you do if your site lost traffic after a Core Update without making any mistakes?
  2. 2:22 Are Core Web Vitals Really Going to Transform Your SEO Strategy?
  3. 3:50 Does a ranking drop after a Core Update really indicate an issue with your site?
  4. 3:50 Should You Really Wait Before Optimizing Core Web Vitals?
  5. 3:50 Why is Google delaying the complete transition to the Mobile-First Index?
  6. 7:07 Can Google really delay Mobile-First Indexing indefinitely?
  7. 11:00 Why doesn't Google canonicalize URLs with fragments in sitelinks and rich results?
  8. 11:00 Do URLs with fragments (#) in Search Console mean you need to rethink your tracking and analysis strategy?
  9. 14:34 Why do the numbers from Analytics, Search Console, and My Business never match?
  10. 14:35 Why do your Google metrics never align between Search Console, Analytics, and Business Profile?
  11. 16:37 How are FAQ clicks really counted in Search Console?
  12. 18:44 Are mobile and desktop accordions really neutral for SEO?
  13. 18:44 Is it true that mobile accordion hidden content is indexed as visible content?
  14. 29:45 Does the rel=canonical via HTTP header really still work?
  15. 30:09 Does the HTTP header rel=canonical really work to manage duplicate content?
  16. 31:00 Why does Search Console still show 'PC Googlebot' on recent sites when Mobile-First Index is supposed to be the standard?
  17. 31:02 Is it true that all sites indexed after July 2019 default to Mobile-First Indexing?
  18. 33:28 Why does Google emphasize textual context in Search Console feedback?
  19. 33:31 Are Search Console tools really enough to solve your indexing problems?
  20. 33:59 Why are your pages still not indexed after 60 days in Search Console?
  21. 37:24 What happens when Google occasionally indexes HTTP instead of HTTPS even after an SSL migration?
  22. 37:53 Is it really necessary to combine both 301 redirections AND canonical tags for an HTTPS migration?
  23. 39:16 What really causes your sitemap to fail in Search Console and how can you effectively resolve the issue?
  24. 41:29 Is your brand disappearing from the SERPs for no apparent reason: can Google feedback really fix it?
  25. 44:07 Should you choose a subdomain or a new domain for launching a service?
  26. 44:34 Subdomain or New Domain: What Does Google Really Think for SEO?
  27. 44:34 Do Google penalties really transfer between domains and subdomains?
  28. 45:27 Do Google penalties really spread between domains and subdomains?
  29. 48:24 Should you really overlook PageRank when deciding between a domain and a subdomain?
  30. 48:33 Do links between root domains and subdomains really pass PageRank?
  31. 49:58 Should you really be worried about duplicate content from scraping?
  32. 50:14 Should you really report every scraping URL via the Spam Report to prompt action from Google?
  33. 57:15 Is it really necessary to report spam URL by URL to assist Google?
  34. 58:57 Why does Google refuse to show your FAQs in rich results despite perfect markup?
  35. 59:54 Why doesn't Google display your FAQ rich results even with perfect markup?
  36. 65:15 Is it possible to add FAQs to your pages just to secure rich results in SEO?
  37. 65:45 Can you really add a FAQ just to get the rich result without risking penalties?
  38. 67:27 Should you still optimize rel=next/prev tags for pagination?
  39. 67:58 Should you really submit all paginated pages in the XML sitemap?
  40. 70:10 Should you really index all category pages to optimize your crawl budget?
  41. 70:18 Should you really stop placing category pages in noindex?
  42. 72:04 Does the number of JavaScript files really slow down Google indexing?
  43. 72:24 Does Googlebot really render all JavaScript in a single pass?
📅
Official statement from (6 years ago)
TL;DR

Google states that relaunching a closed site on a new domain typically does not trigger a duplicate penalty, even if scrapers have copied the content during the downtime. The algorithm is supposed to identify the legitimate original author. In practice, this means that migrating or reactivating archived content should not expose you to a duplicate filter — but if in doubt, reporting spam sites remains the only concrete action suggested.

What you need to understand

What does this statement really mean for a site restarting its activities?

When a site remains closed for an extended period, it's not uncommon for opportunistic scrapers to siphon its content and republish it elsewhere. The classic fear: relaunching the original domain or migrating content to a new domain could trigger a duplicate content filter. Google states that this scenario should not pose a major issue.

Google's algorithm uses temporal and authority signals to determine who the legitimate author is. Indexing history, content age, link profile, user behavior: these are all criteria that theoretically help distinguish the original from the copy. In this specific case, the relaunched site would retain its original status even in the face of parasitic duplicates.

Why doesn't Google systematically penalize duplicate content?

Contrary to a persistent misconception, duplicate content is not a penalty in the strict sense. Google simply filters out redundant versions to display only one URL in the results. The question is: which URL?

In the case of massive scraping, Google relies on trust signals: domain age, natural backlink profile, editorial consistency, brand signals. A legitimate site that republishes its archived content normally possesses these signals, whereas an opportunistic scraper does not have them. That’s the theory, at least.

What should you do if Google makes a mistake and favors copies?

Google recommends using the Spam Report to report sites that have siphoned your content. This is the only corrective action mentioned here, which speaks volumes about the limits of automation: the algorithm can make mistakes, especially if the authority signals of the legitimate site have been weakened by prolonged inactivity.

There isn’t an automatic mechanism to claim the precedence of content. No interdomain canonical tag, no universal authorship declaration. Manual reporting remains the only lever — and its effectiveness is never guaranteed or measurable.

  • The algorithm theoretically favors the original author through temporal and authority signals
  • Duplicate content is not a penalty but a deduplication filter in the SERPs
  • The Spam Report is the only recourse if Google preferentially indexes the copies
  • No technical mechanism to claim precedence exists from the webmaster side
  • Prolonged inactivity weakens authority signals, which can complicate automatic recognition

SEO Expert opinion

Does this statement truly reflect what we observe in the field?

Let's be honest: Google is right most of the time. In the majority of cases, an established domain that resumes its activities does not suffer a duplicate penalty against second-tier scrapers. But that "probably" in the statement is telling. There are situations where the algorithm hesitates or makes mistakes, particularly when the original site has lost its trust signals during inactivity.

A domain that has been offline for a year may see its backlinks becoming obsolete, its traffic profile disappearing, and its reputation eroding. If, in the meantime, scrapers have built an artificial link ecosystem around their copies, Google may temporarily favor those versions — especially if they generate traffic or engagement signals, even if they are false. [To be verified]: Google never discloses the precise thresholds that trigger a reversal of priority between original and copy.

What are the concrete limits of this algorithmic protection?

The problem intensifies in three specific scenarios. First case: content is republished on a new domain without history. Even if it's the same owner, Google has no technical means to automatically establish the link between the old and new domain. The signals reset to zero.

Second case: scrapers have massively enriched or modified the copied content. If the copy contains substantial additions, Google may consider it an improved version and favor it. Third case: the legitimate site generates no engagement signals upon restart (zero traffic, zero fresh backlinks), while the copies artificially generate them via PBNs or bot traffic. The algorithm may be temporarily deceived.

How effective is the Spam Report as a solution?

Google's advice — "report spam sites" — is technically correct but operationally weak. The Spam Report works on a large scale for obvious content farms, but for isolated cases of targeted scraping, the effectiveness is anecdotal. No SLA, no feedback, no guarantee of processing.

In practice, waiting for a manual action from Google is not a strategy. It's better to focus your efforts on rapidly rebuilding positive signals: qualified traffic, fresh editorial backlinks, social engagement, brand mentions. This is what convinces the algorithm, not a reporting form. [To be verified]: Google never publishes statistics on processing rates or the real impact of Spam Reports submitted by individual webmasters.

Attention: If you relaunch a site after long inactivity on a new domain, don't rely solely on Google’s automatic detection. Implement an active re-legitimation strategy: 301 redirects if possible, targeted link-building campaigns, reactivating social accounts, strong brand signals. The algorithm needs tangible proof of continuity.

Practical impact and recommendations

What should you do before relaunching an inactive site?

Before any republication, audit the extent of scraping of your content. Use tools like Copyscape, Ahrefs Content Explorer, or a simple Google search with exact excerpts in quotes. Identify how many domains have copied your pages and assess their respective authority. If you detect copies on high-authority sites, the risk of algorithmic confusion increases.

Next, prepare for the reactivation of your authority signals. If you are relaunching on the same domain, ensure that the old URLs are restored exactly to benefit from indexing history. If migrating to a new domain, set up 301 redirects from the old one — even if Google says it's not mandatory, it's a strong signal of continuity.

How can you maximize your chances of being recognized as a legitimate source?

Publish fresh content immediately after reactivation. A site that restarts with only dated content appears suspicious. Add update dates, new sections, and editorial enrichments. This proves that the site is actively maintained and not just a static clone.

Launch a mini-campaign of targeted editorial link-building in the initial weeks. A few mentions in industry media, a press release announcing the relaunch, strategic guest posts: these are all signals that enhance your legitimacy. Google places enormous weight on fresh backlinks in its authority and precedence calculations.

What mistakes should you absolutely avoid?

Never republish your archived content without any modification. Even minor adjustments — updating dates, adding sections, optimizing tags — send a signal of freshness. Content that is strictly identical to existing copies may raise doubts in the algorithm.

Also, avoid neglecting Search Console. Actively submit your sitemap, monitor crawling errors, and check that indexing is progressing normally. If you notice that copies are indexed but not your original pages, it's time to fill out Spam Reports — but without expecting miracles, as we have seen.

  • Audit the extent of scraping with Copyscape or Ahrefs Content Explorer
  • Restore the original URLs exactly or set up clean 301 redirects
  • Publish fresh content upon reactivation to prove editorial activity
  • Launch a targeted link-building campaign within the first 30 days
  • Update dates and enrich archived content before republication
  • Submit the sitemap in Search Console and actively monitor indexing
Relaunching a site after long inactivity requires a proactive approach to maximize your chances of being recognized as a legitimate source by Google. The algorithm generally makes the right choice, but it needs clear signals: technical continuity via redirects, editorial freshness via updates, regained authority through strategic link-building. These optimizations can be complex to orchestrate alone, especially when managing technical migration, content production, and link acquisition simultaneously. In this context, relying on a specialized SEO agency can ensure a methodical reactivation and prevent mistakes that might compromise the algorithmic recognition of your legitimacy.

❓ Frequently Asked Questions

Google pénalise-t-il vraiment le duplicate content ?
Non, le duplicate content n'est pas une pénalité mais un filtre de dédoublonnage. Google choisit simplement quelle version afficher dans les résultats, généralement en privilégiant l'original détecté via des signaux d'autorité et d'ancienneté.
Comment Google identifie-t-il l'auteur original d'un contenu ?
Google utilise des signaux temporels (première indexation, historique d'archives), d'autorité (profil de backlinks, confiance du domaine) et comportementaux (trafic, engagement) pour déterminer la source légitime. Ces signaux ne sont jamais détaillés publiquement.
Faut-il obligatoirement garder le même domaine pour éviter les problèmes de duplicate ?
Non, mais migrer sur un nouveau domaine efface votre historique d'indexation et affaiblit vos signaux d'autorité. Des redirections 301 depuis l'ancien domaine et une stratégie de re-légitimation active sont alors indispensables.
Le Spam Report est-il efficace contre les scrapers ?
Son efficacité est limitée pour des cas isolés. Google traite en priorité les fermes de contenu à grande échelle. Pour un scraping ciblé, mieux vaut renforcer vos propres signaux d'autorité que d'attendre une action manuelle hypothétique.
Combien de temps faut-il pour que Google reconnaisse le site légitime après un relaunch ?
Cela dépend de la vitesse à laquelle vous reconstituez vos signaux d'autorité. Avec du contenu frais, du netlinking actif et un bon suivi Search Console, quelques semaines suffisent généralement. Sans ces signaux, la reconnaissance peut prendre plusieurs mois ou ne jamais se faire correctement.
🏷 Related Topics
Algorithms Content AI & SEO JavaScript & Technical SEO Domain Name Penalties & Spam

🎥 From the same video 43

Other SEO insights extracted from this same Google Search Central video · duration 1h14 · published on 04/06/2020

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.