What does Google say about SEO? /

Official statement

Google may select a different canonical than the declared one when multiple country versions contain exactly the same content in the same language. The algorithms group these URLs and choose a canonical version to save on crawl, even if hreflang is present. To force separate indexing, the content must be significantly different between versions.
22:15
🎥 Source video

Extracted from a Google Search Central video

⏱ 56:47 💬 EN 📅 04/08/2020 ✂ 39 statements
Watch on YouTube (22:15) →
Other statements from this video 38
  1. 1:08 How does my site get included in the Chrome User Experience Report without signing up?
  2. 1:08 How does your site end up in the Chrome User Experience Report?
  3. 2:10 How can you measure Core Web Vitals when your site isn't in CrUX?
  4. 3:14 Can negative reviews really penalize your Google ranking?
  5. 3:14 Can negative reviews really hurt your Google ranking?
  6. 7:57 Should you really separate sitemaps for pages and images?
  7. 7:57 Does splitting your sitemaps truly impact crawling and indexing?
  8. 9:01 Could a 304 Not Modified code actually prevent your pages from being indexed?
  9. 9:01 Is the 304 Not Modified code really a trap for your indexing?
  10. 11:39 Does Google Cache Really Influence the Ranking of Your Pages?
  11. 11:39 Is Google Cache really not useful for assessing a page's SEO quality?
  12. 13:51 Why doesn't your niche change generate any traffic despite all your SEO efforts?
  13. 14:51 Are link directories truly dead for SEO?
  14. 17:59 Do translated pages really count as duplicate content in Google's eyes?
  15. 17:59 Are translated pages really treated as unique content by Google?
  16. 20:20 Why does Google ignore your canonical tags, and how can you enforce separate indexing for your regional URLs?
  17. 23:14 Why is your Search Console crawl budget skyrocketing for seemingly no reason?
  18. 23:18 Why is your Search Console crawl budget skyrocketing for no apparent reason?
  19. 25:52 Should you really limit the crawl rate in Search Console?
  20. 26:58 Hreflang and geo-targeting: Can Google really ignore your international signals?
  21. 28:58 Are Hreflang and Canonical really reliable for geographic targeting?
  22. 34:26 Why is Search Console showing the wrong URL for Hreflang and Canonical?
  23. 34:26 Why does Search Console display a different canonical than what appears in the SERP for your hreflang pages?
  24. 38:38 How does Google really differentiate between two sites in the same language but targeting different countries?
  25. 38:42 Should you canonicalize all your country versions to a single URL?
  26. 38:42 Should you really keep each hreflang page self-canonical?
  27. 39:13 How can local signals help you prevent canonicalization between your multi-country pages?
  28. 43:13 Should you really abandon country variations in hreflang?
  29. 45:34 Is it really necessary to use hreflang for a multilingual website?
  30. 47:44 Do Facebook comments really impact your site's SEO and EAT?
  31. 48:51 Should you isolate UGC and News content in subdomains to avoid penalties?
  32. 50:58 Should you create a lightweight version for Googlebot to speed up crawling?
  33. 50:58 Should you focus on optimizing your site speed for Googlebot or your actual users?
  34. 50:58 Should you serve a streamlined version of your pages to Googlebot to improve crawl efficiency?
  35. 52:33 Can you create local pages by city without risking penalties for doorway pages?
  36. 52:33 How can you tell a legitimate city page from a penalizable doorway page?
  37. 54:38 Has Google's manual action for doorway pages disappeared in favor of algorithmic solutions?
  38. 54:38 Are doorway pages still subject to manual penalties from Google?
📅
Official statement from (5 years ago)
TL;DR

Google can ignore your canonical and hreflang tags if multiple country versions strictly publish the same content in the same language. The algorithms merge these URLs by selecting an arbitrary version to save on crawl budget. The only solution: significantly differentiate the content among each country version, beyond mere cosmetic adjustments.

What you need to understand

Why does Google group supposedly distinct URLs?

When a French e-commerce site publishes exactly the same catalog in French for .fr, .be, and .ch, Google considers this as duplication. It doesn't matter if you have correctly implemented hreflang and self-referential canonical tags on each version. The algorithms detect the identity of the content and operate clustering to avoid wasting crawl budget.

The result? Google chooses one single canonical URL from your country versions, often the one it deems most relevant according to opaque criteria: domain popularity, geographical signals, indexing history. Other versions end up as duplicates ignored, even if they target distinct markets.

Does hreflang become useless in this specific case?

No, but its role then becomes limited to display selection in the SERPs based on the user's geolocation. If Google has grouped .fr, .be, and .ch under a single canonical, hreflang can still direct a Belgian user to .be rather than .fr — as long as that version is actually indexed.

The trap: Search Console may show .be as “Alternate page with proper canonical tag”, which can create the illusion that everything is functioning properly. But if the content is identical, Google does not guarantee separate indexing. You end up with only one version indexed, and the others are in canonical limbo.

What does “significantly different” mean in Google's eyes?

Mueller provides no numeric threshold — as is typical. In practice, a simple change in currency, legal mentions, or footer is not enough. There must be substantial variations: localized product descriptions, editorial content unique to each market, different pricing structures with contextual explanations.

Concrete example? If you sell sneakers, don’t just change € to CHF. Write product descriptions tailored to Swiss preferences, add local guides (“Delivery in Switzerland: timelines and fees”), create region-specific editorial sections. Google needs to perceive a distinct editorial intent, not a copy-paste job with monetary variables.

  • Algorithmic clustering: Google merges URLs with identical content to save on crawl, regardless of your hreflang/canonical structure
  • Opaque selection: the canonical version chosen by Google does not necessarily correspond to your declared preference
  • Conditional indexing: the other versions remain technically “known” but are not indexed separately
  • Substantial differentiation required: cosmetic changes are insufficient; real editorial variations between markets are needed
  • Hreflang retains a role: it can guide geolocated display, but does not enforce indexing if the content is duplicated

SEO Expert opinion

Is this statement consistent with real-world observations?

Absolutely, and it’s even a relief that Google finally acknowledges it publicly. For years, we have seen multi-country sites with perfect hreflang mysteriously lose national versions from the index. Search Console showed “Alternate page”, everything seemed technically correct, but .be or .ch URLs never ranked. The issue was not implementation; it was duplicated content.

What still shocks some practitioners: hreflang was marketed as the anti-duplicate solution for international sites. Mueller reminds us that no — hreflang indicates linguistic alternatives; it does not magically cleanse duplication. If the content is identical, Google applies its clustering logic, period. The tags then become simply display suggestions, not indexing orders.

What nuances should be added to this rule?

First point: “exactly the same content” remains vague. [To be verified] but we can assume that Google uses some form of internal similarity calculation, probably a derivative of shingling or fingerprinting. A textual duplication rate over 85-90% likely triggers the clustering. But again, no official figures, just empirical deductions.

Second nuance: the statement explicitly concerns “same content in the same language.” If you have a .fr in French and a .ca in English, the problem obviously doesn’t arise. But what about a .fr and .ca both in French, with Quebec orthographic variations? Totally gray area. My intuition: if the differences are limited to “colour” vs “color” or a few scattered Quebecisms, Google is likely to cluster anyway.

In what cases does this rule not fully apply?

If your site has a very strong domain-level authority and a long indexing history on each ccTLD, Google may tolerate more duplication without aggressively merging. I’ve seen big e-commerce players maintain several indexed country versions despite 95% identical content — probably because each domain generates significant traffic and local backlinks.

Another exception: sites with strong geographical signals outside content. Local hosting, backlinks from media in the target country, Google Business profiles by market — all of this can weigh on Google’s decision. But be careful, don’t rely solely on this: it’s support, not a solution. Differentiated content remains the only reliable strategy.

Warning: some third-party SEO tools may display your country versions as “indexed” while they are just “known” to Google. Always check in Search Console the “Pages” → “Why pages aren't indexed” tab to see if your URLs are marked as “Duplicate, Google chose different canonical.”

Practical impact and recommendations

What should you concretely do to avoid this clustering?

First, honestly audit the degree of differentiation between your country versions. Open two URLs side-by-side (.fr vs .be for instance) and compare line by line. If only the header, footer, and a few legal mentions change, you’re in the red zone. You need to rewrite strategic content: product sheets, categories, service pages.

Next, create editorial content unique to each market. A blog with localized articles, buying guides tailored to local regulations, specific FAQs. The goal: that Google scans the page and detects a unique textual footprint. An identical template engine with replaced variables is no longer enough — you need tailored writing.

How to check that Google is correctly indexing each version separately?

In Search Console, connect each property (fr, be, ch) and compare the metrics of effective indexing. If .be shows 500 submitted URLs but only 50 indexed, with the other 450 as “Duplicate, Google chose different canonical”, you have proof of clustering. Cross-reference with site:votresite.be in Google: if the results mainly show .fr URLs, it means Google has made a decision.

Also, use the URL inspection tool on a few key pages of each version. Check the section “Google-selected canonical”: if it consistently points to .fr when inspecting .be, the diagnosis is clear. Be careful, this tool only shows the decision for the tested URL, not an overview — but repeating this on 10-15 strategic URLs will give you a reliable trend.

What mistakes should you absolutely avoid in this setup?

Do not cannibalize your own efforts by declaring a cross-canonical “for safety”. If .be points to .fr as canonical, you validate the clustering yourself — Google doesn’t even need to decide. Each version must have a strict self-referencing canonical, and hreflang must link the versions together without hierarchy.

Avoid the trap of automatically generated “spin content”. Just replacing a few words with synonyms via a script will not create sufficient differentiation — Google has detected these patterns for years. Authentic writing work is needed, not algorithmic camouflage. Finally, do not overlook local on-site signals: mentions of physical addresses, national phone numbers, native currencies displayed everywhere (not just a JS selector), all reinforce the perception of distinct pages.

  • Differentiate at least 30-40% of the textual content between each country version (descriptions, editorial, FAQ)
  • Create unique content sections for each market: local guides, regional case studies, customer testimonials from the country
  • Check in Search Console that each property shows an indexing rate >80% of the submitted URLs
  • Use the URL inspection tool to validate that Google respects your self-referencing canonicals
  • Maintain strong geographical signals: local hosting if possible, backlinks from local media, LocalBusiness schema with addresses
  • Monthly monitor indexing metrics to detect any gradual clustering
Managing a robust multi-country architecture requires sharp expertise in SEO internationalization and constant arbitration between technical constraints and editorial resources. These optimizations can quickly become time-consuming, especially if you manage multiple markets simultaneously. Engaging an SEO agency specialized in international websites can be wise to structure an effective differentiation strategy, monitor cross-market indexing, and avoid the clustering traps that undermine your local visibility.

❓ Frequently Asked Questions

Hreflang peut-il forcer l'indexation de versions pays à contenu identique ?
Non. Hreflang indique des alternatives linguistiques/géographiques mais ne contraint pas Google à indexer toutes les versions. Si le contenu est identique, Google applique son clustering et choisit une canonical unique, même avec hreflang correct.
Quel pourcentage de différenciation textuelle faut-il entre versions pays ?
Google ne communique aucun seuil officiel. Les observations terrain suggèrent qu'un minimum de 30-40% de contenu unique substantiel (pas juste cosmétique) est nécessaire pour éviter le clustering, mais cela reste empirique.
Un site peut-il avoir .fr indexé mais .be et .ch ignorés malgré hreflang ?
Oui, c'est exactement le scénario décrit par Mueller. Google indexe la version qu'il juge canonique et marque les autres comme duplicates, même si elles ciblent des pays différents et ont un hreflang correct.
Changer uniquement la devise et les mentions légales suffit-il ?
Non. Ces modifications sont considérées comme cosmétiques. Il faut des variations éditoriales substantielles : descriptions réécrites, contenus locaux additionnels, sections spécifiques à chaque marché.
Comment savoir quelle version Google a choisie comme canonical ?
Utilisez l'outil d'inspection d'URL dans Search Console sur chaque version pays. La section 'Google-selected canonical' indique quelle URL Google considère comme référence, même si elle diffère de votre déclaration.
🏷 Related Topics
Algorithms Content Crawl & Indexing Domain Name International SEO

🎥 From the same video 38

Other SEO insights extracted from this same Google Search Central video · duration 56 min · published on 04/08/2020

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.