What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

When two pages have identical or nearly identical content (same language, different countries), Google often chooses one as canonical, even with hreflang. Hreflang allows for swapping the displayed URL according to the search country, but indexing will be based on the canonical. To enforce separate indexing, the content must be significantly different, not simply adding a local phone number.
41:05
🎥 Source video

Extracted from a Google Search Central video

⏱ 54:50 💬 EN 📅 15/05/2020 ✂ 23 statements
Watch on YouTube (41:05) →
Other statements from this video 22
  1. 3:03 Do temporary 404 errors during a migration really kill your SEO?
  2. 4:56 Is it true that Googlebot crawls from the USA: how can you avoid the geo-IP cloaking trap?
  3. 8:42 Can you really block Googlebot state by state in the U.S. without breaking everything?
  4. 11:31 Why does Google not index all your pages despite active crawling?
  5. 12:17 Are Reddit's nofollow links really useless for SEO?
  6. 14:14 Should you always enable loading='lazy' on all your images to boost SEO?
  7. 15:25 Should you really reduce the number of language versions for hreflang?
  8. 18:27 Should you really fix every 404 error reported in Search Console?
  9. 20:47 Are jump links really useless for Google's crawling?
  10. 21:55 Should you disavow ghost backlinks that are only visible in Search Console?
  11. 23:20 Why doesn't the Disavow file hide bad links in Search Console?
  12. 29:18 Should you really contextualize the alt attribute beyond a visual description?
  13. 32:47 Should you really worry about 301 redirects and multiple 404 pages?
  14. 33:02 Is Google algorithmically downgrading specific sectors during health crises?
  15. 34:06 Should you really use different domain names for a multilingual site?
  16. 36:28 Should you really make all recipe images indexable to perform well in SEO?
  17. 37:49 Should you encode non-ASCII characters in XML sitemap URLs?
  18. 38:15 Does Hreflang Really Ensure Accurate Geographic Targeting for Your International Traffic?
  19. 45:51 Should you develop unique content to effectively index various versions of the same service?
  20. 46:27 Should you create a new page or update the existing one for a temporary change?
  21. 49:01 Is it really necessary to avoid using multiple title and meta description tags on a single page?
  22. 52:13 Are 500/503 errors lasting a few hours really invisible to your indexing?
📅
Official statement from (6 years ago)
TL;DR

Google often canonicalizes multi-country pages with identical or nearly identical content into a single URL, even when hreflang is properly set up. Hreflang allows the displayed URL to swap based on the user's geolocation, but does not prevent back-end canonicalization. To force separate country indexing, substantial content differentiation is necessary, not just modifying a phone number or local address.

What you need to understand

What is the difference between canonicalization and hreflang display?

The confusion arises from the fact that canonicalization and SERP display are two distinct mechanisms. When Google crawls two nearly identical pages (same language, different countries), it selects one URL as the canonical version for indexing. This URL accumulates ranking signals and serves as the reference in the index.

Hreflang then comes into play at the display stage: depending on the user's geolocation, Google can swap the displayed URL in the results to show the local version. But behind the scenes, it is always the canonical that carries the SEO weight. This distinction is crucial: having two indexed URLs and having two URLs that display based on country is not the same thing.

Why does Google canonicalize even with hreflang configured?

Because hreflang is not an indexing directive, it is a geographical and linguistic targeting indication. Google uses it to understand that a French page fr-FR and a French page fr-BE are variants, but if the content is 95% identical, the engine considers there is no value in indexing both.

The content deduplication algorithm triggers before hreflang. Google detects the similarity, merges signals onto a single canonical URL, then uses hreflang to route the displays. This is consistent with the engine's logic: avoid duplicate content in the index while providing the best user experience at the front-end.

What constitutes a 'significant' difference for Google?

This is the crux of the matter, and Google remains willingly vague about the exact threshold. Changing a phone number, an address, or a few local mentions is clearly insufficient. The main textual content must differ substantially: paragraphs written differently, information specific to the local market, entire sections tailored to regional needs.

We are talking about a minimum of 30-40% unique content per page, with real added value for the local user. A simple template engine with local variables will fool no one. Google analyzes the ratio of identical/different tokens — if the delta is marginal, canonicalization applies.

  • Canonicalization vs display: two distinct mechanisms, hreflang does not prevent deduplication
  • Hreflang does not force indexing: it is a targeting directive, not inclusion in the index
  • Substantial differentiation required: 30-40% unique content, not just cosmetic local variables
  • Impact on SEO signals: a single canonical URL accumulates backlinks, authority, and history
  • Risk of misrouting: if the canonical chosen by Google is not the one you want, local visibility is lost

SEO Expert opinion

Is this statement consistent with field observations?

Absolutely. We see this behavior daily on multi-country e-commerce sites with identical product catalogs. Google often chooses the .com version or the first one crawled as canonical, even when hreflang is perfectly implemented. SEOs are pulling their hair out because Search Console clearly shows the alternate pages, but only one actually accumulates organic impressions.

The classic trap: a site deploys .fr, .be, .ch with the same CMS, the same content translated once, and hopes to rank three times. Result: Google consolidates on a single URL, and the other two become ghosts in the index. They technically exist, but never rank alone; they only serve as display swaps. This is exactly what Mueller describes.

What nuances should be added to this rule?

First point: canonicalization is not always definitive. If one of the pages receives massive local backlinks or a very strong regional popularity signal (local branded searches, significant direct traffic), Google may reconsider and index separately. But this is rare and requires a significant signal delta.

Second nuance: the definition of 'significantly different content' varies by sector. For local news, a few specific paragraphs may suffice. For evergreen generic content ('how to write a CV'), the angle, examples, and tone really need to be reworked. [To be confirmed]: Google has never communicated a numerical threshold, so any recommendation on an exact percentage remains empirical.

In what cases does this rule not apply?

When pages target different languages (fr-FR vs en-GB), cross-language canonicalization is much rarer. Google understands that these are distinct contents by nature. The issue arises mostly for variants same language, different countries: fr-FR vs fr-BE vs fr-CH, en-US vs en-GB vs en-AU.

Another exception: pages with user-generated content (reviews, comments) can naturally diverge between countries and escape canonicalization. But relying on this is risky. Finally, if you force an explicit canonical tag to each local version, Google generally respects it — but it creates other problems (no consolidation of signals, dilution of SEO juice). Let's be honest: forcing separate indexing without differentiated content is a losing battle.

Practical impact and recommendations

What should be done concretely to avoid unintentional canonicalization?

The only sustainable solution: create genuinely localized content. Not pseudo-localization with dynamic variables. We are talking about specific writing by market: local case studies, customer testimonials from the country, paragraphs adapted to regional regulations or habits, lexicon and phrase turns unique to each area.

Concretely, this means an editorial budget multiplied by the number of countries. If you cannot invest in that, it’s better to adopt a single-domain strategy with hreflang for display swapping, rather than creating 5 ghost sites that will cannibalize your SEO efforts. This is a strategic choice to be made in advance, not a surprise to discover 6 months after launch.

What mistakes should be avoided in multi-country management?

Mistake #1: believing hreflang = guaranteed indexing. Hreflang says 'these pages are variants', not 'index them all separately'. If content is identical, Google makes its choice and you suffer. Mistake #2: just adding a 'our local offices' block at the bottom of the page and hoping it’s enough. Google analyzes the main content, not the footers.

Mistake #3: not monitoring which URL Google chooses as canonical. You may deploy .be thinking it will be indexed, only to discover 3 months later that .fr ranks and .be only serves as a swap. Result: your .be backlinks point to a secondary URL, your local PR efforts fall flat. Regularly check in Search Console the canonical URL chosen by Google for each page cluster.

How can I check if my multi-country architecture is properly indexed?

Use the Search Console by property (one GSC per domain/sub-folder). Look at the 'Pages' report and the 'Why pages are not indexed' tab. If you see large numbers of pages marked 'Duplicates, canonical URL chosen by Google different', that is the exact symptom described by Mueller. Then dig deeper with a Screaming Frog or Oncrawl crawl to identify clusters of nearly identical pages.

Also test in search: type site:yourdomain.be and see if Google displays the .be URLs or if it redirects to .fr in the snippets. If your .be pages never appear alone in organic SERPs (beyond geolocated search), it’s likely they’re not indexed on their own. And that’s where it gets tricky: you have the infrastructure, but not the substance.

  • Page-by-page content audit to measure similarity rates between country versions (goal:

❓ Frequently Asked Questions

Hreflang suffit-il à garantir l'indexation de toutes mes pages pays ?
Non. Hreflang est une directive de ciblage géographique et linguistique, pas une instruction d'indexation. Si vos pages sont quasi-identiques, Google choisira une seule URL canonique même avec hreflang parfaitement configuré.
Quel pourcentage de contenu différent faut-il pour éviter la canonicalisation ?
Google ne communique pas de seuil exact. Les observations terrain suggèrent 30-40% de contenu textuel unique minimum, avec une vraie valeur ajoutée locale, pas juste des variables cosmétiques comme un numéro de téléphone.
Si Google canonicalise mes pages pays, est-ce que je perds tout mon SEO local ?
Vous perdez l'indexation séparée et la possibilité de ranker indépendamment par pays. Hreflang permettra de swapper l'URL affichée selon la géolocalisation, mais une seule version accumule les signaux de ranking (backlinks, autorité, historique).
Comment savoir quelle URL Google a choisie comme canonique ?
Utilisez Google Search Console, section « Pages », puis vérifiez les URLs marquées comme « Doublons, URL canonique choisie par Google différente ». Vous pouvez aussi inspecter une URL spécifique pour voir la canonique détectée.
Peut-on forcer l'indexation séparée avec une balise canonical auto-référencée ?
Une balise canonical auto-référencée sur chaque page pays aide, mais si le contenu reste quasi-identique, Google peut quand même canonicaliser via son algorithme. La seule solution fiable reste la différenciation de contenu substantielle.
🏷 Related Topics
Domain Age & History Content Crawl & Indexing AI & SEO Domain Name Local Search International SEO

🎥 From the same video 22

Other SEO insights extracted from this same Google Search Central video · duration 54 min · published on 15/05/2020

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.