What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

To manage duplicate content from authoritative sources, it is recommended to use the rel=canonical tag to reinforce the canonical source. Hreflang should be used if the content is specifically translated for different language markets.
4:07
🎥 Source video

Extracted from a Google Search Central video

⏱ 1h04 💬 EN 📅 20/07/2018 ✂ 13 statements
Watch on YouTube (4:07) →
Other statements from this video 12
  1. 1:03 Pourquoi se focaliser sur les facteurs de classement fait-il perdre de vue l'essentiel ?
  2. 2:33 Google My Business et SEO classique : vraiment deux mondes séparés ?
  3. 5:15 Les redirections 301 transfèrent-elles réellement 100% du PageRank et des signaux SEO ?
  4. 6:15 La balise canonical fonctionne-t-elle vraiment comme une redirection 301 ?
  5. 11:19 Comment accélérer le crawl de votre site e-commerce sans gaspiller le budget Google ?
  6. 13:37 Peut-on vraiment réactiver des liens désavoués sans pénalité ?
  7. 18:36 L'indexation mobile-first modifie-t-elle vraiment les extraits visibles par tous les utilisateurs mobiles ?
  8. 26:22 HTTPS et indexation mobile : pourquoi Google traite-t-il HTTP et HTTPS comme deux sites distincts ?
  9. 27:04 Le robots.txt peut-il vraiment bloquer l'indexation de vos pages ?
  10. 30:08 Comment supprimer une section de site entière de Google en moins de 24h ?
  11. 32:12 Le désaveu de liens est-il encore utile contre les attaques SEO négatives ?
  12. 35:42 Hreflang : quelle méthode d'implémentation fonctionne vraiment pour l'international ?
📅
Official statement from (7 years ago)
TL;DR

Google recommends using rel=canonical to consolidate signals to the authoritative source of duplicate content, while hreflang should be reserved for translated content aimed at different language markets. This distinction is game-changing: hreflang is not a generic anti-duplicate solution. Combining both tags requires rigorous logic to avoid conflicting signals that muddle indexing.

What you need to understand

What’s the difference between duplicate content and translated content?

Duplicate content comes from the same source disseminated across multiple URLs (parameters, printable versions, multiple domains). Google may hesitate over which page to index. The rel=canonical tag explicitly indicates which URL deserves the reference credit.

Translated content, on the other hand, targets users in different languages with tailored versions. These pages are not accidental duplicates but intentional variations. Hreflang allows Google to serve the correct language version based on geolocation and user preferences without penalizing others.

Why does Google emphasize the distinction between the two tags?

Confusing canonical and hreflang generates conflicting signals. If a French page points its canonical to the English version while declaring hreflang fr, Google receives two opposing instructions: "Ignore me, index the English" on one side and "I am the correct version for France" on the other.

This ambiguity slows crawling, dilutes PageRank, and can even remove the local variant from indexing. Google needs consistency: canonical to say "this is the same, take this one", hreflang to say "these are different, serve the right one depending on the context". Mixing them up asks Google to choose between two incompatible logics.

When should you use canonical and hreflang together?

Both tags legitimately coexist when a translated version exists in multiple technical variants (HTTPS/HTTP, with/without www, session parameters). Each language then has its canonical URL, and the canonicals of each language connect through hreflang.

For example: site.com/fr/?session=123 points its canonical to site.com/fr/, while site.com/fr/ declares hreflang to site.com/en/ and site.com/de/. Each language consolidates its internal duplicates before declaring its international relationships. It's clean, it's logical.

  • Canonical: consolidates the URLs of the same version of content, reinforces the authoritative source, avoids diluting ranking signals.
  • Hreflang: connects translations to each other, guides Google toward the correct language/region, preserves the indexing of each legitimate variant.
  • Combining the two requires a clear architecture: first solve technical duplicates, then declare language variants.
  • A poorly configured site can send contradictory signals that de-index whole pages or strip them of local visibility.
  • Regular checks in Search Console (coverage report, hreflang errors) become essential to detect inconsistencies.

SEO Expert opinion

Is this recommendation really being followed in the field?

Honestly, many international sites mix everything up. I've seen setups where every French page points its canonical to the default English version, hoping hreflang will fix it. The result: Google indexes the English version, ignores the French, and local organic traffic plummets.

Google's distinction seems clear on paper, but it requires a strict technical governance. Common CMSs (multilingual WordPress, Shopify Markets) often generate awkward combinations. A manual audit remains essential, especially for sites with over 1000 pages. [Check] regularly with Screaming Frog or Oncrawl crawls to spot inconsistencies.

When do canonical and hreflang conflict?

The classic conflict: a French page declares hreflang to English but also points its canonical to English. Google receives the order to index English while serving French to the French audience. Impossible. It usually chooses to follow the canonical, leading the French version to disappear from indexing or lose its local positioning.

Another tricky case: missing self-referential canonical tags. If /fr/ declares hreflang but does not point its canonical to itself, Google may interpret a parameterized URL as the canonical source. Traffic then flows to a broken or non-optimized URL. The rule: each page with hreflang must have an explicit canonical, even if it's to itself.

Is it really necessary to duplicate hreflang in the sitemap AND in the HTML?

Google accepts both methods, but in practice, HTML remains more reliable. Hreflang sitemaps are often poorly generated, incomplete, or forgotten during updates. A tag in the <head> travels with the page, ensuring consistency.

Large sites (10,000+ pages) may prefer the sitemap for performance reasons, but then it’s essential to monitor every generation. A broken or outdated hreflang sitemap can silently de-index hundreds of pages. Personally, I prefer HTML for any site under 5000 pages, doubling with the sitemap only if maintenance is automated and tested.

Practical impact and recommendations

How can you verify that canonical and hreflang are properly configured?

Start with a complete crawl using Screaming Frog or Sitebulb. Export the "Canonical URL" and "Hreflang" columns. Filter the pages with hreflang: each should point its canonical to itself or to a URL in the same language. Any cross-language canonical is suspect.

Then, check in Search Console the "Coverage" report and hreflang errors. Google reports loops, missing canonicals, and unilateral declarations (a French page declares English, but English does not declare French in return). These asymmetries break indexing. Correct them as a priority.

What fatal errors should you absolutely avoid?

Never point a cross-language canonical if hreflang is active. This is the most common contradiction. If your French page needs to be indexed, its canonical must be French. Period. The only exception: a primary language (English) and regional variants (EN-GB, EN-US) that canonicalize to EN-US while declaring hreflang.

Another trap: forgetting x-default in hreflang. Google uses this tag to serve a default page to users whose language is not explicitly covered. Without x-default, a FR/EN/DE site might display the German version to a Spanish user. Always define an x-default to the main language or a selection page.

What should you do if your site displays persistent hreflang errors?

Prioritize errors by volume. An error on 10 strategic pages is more critical than an issue on 500 archive pages. First, fix the high-traffic pages: categories, flagship products, SEO landing pages. Then test the correction with the URL inspection tool in Search Console.

If errors multiply despite corrections, it's often an architecture issue: a CMS generating incorrect dynamic tags, a CDN injecting parameters, 302 redirects instead of 301 that break signals. At this stage, a complete technical audit becomes necessary. These configurations can quickly become too complex to manage in-house without sharp expertise. Consulting a specialized SEO agency for international projects provides a precise diagnosis and a tailored action plan, especially if your technical stack includes multiple tools (CMS, CDN, translation middleware).

  • Crawl the site to extract all canonical and hreflang tags
  • Check that each page with hreflang points its canonical to itself or a URL of the same language
  • Audit hreflang errors in Search Console and correct asymmetries
  • Test strategic pages with the URL inspection tool
  • Add a hreflang x-default to cover undeclared languages
  • Monitor server logs to identify crawl loops or 404 errors on canonical URLs
The combined management of canonical and hreflang relies on a simple rule: use canonical to eliminate technical duplicates of the same version, and hreflang to link legitimate translations. Any contradiction between the two tags weakens indexing. Regular audits and coherent architecture are essential to maintain international visibility without diluting the ranking signal.

❓ Frequently Asked Questions

Peut-on utiliser canonical et hreflang sur la même page ?
Oui, c'est même souvent nécessaire. Chaque variante linguistique peut avoir son canonical pour consolider ses duplicatas techniques, puis déclarer hreflang vers les autres langues. L'important est que le canonical reste dans la même langue que la page.
Que se passe-t-il si une page FR pointe son canonical vers EN tout en déclarant hreflang FR ?
Google suit généralement le canonical et indexe la version EN, ignorant le signal hreflang FR. La page française perd sa visibilité locale et peut disparaître de l'index. C'est une contradiction à corriger immédiatement.
Faut-il mettre hreflang dans le HTML ou dans le sitemap ?
Les deux fonctionnent, mais le HTML est plus fiable pour les sites de taille moyenne. Les sitemaps hreflang sont souvent mal générés ou oubliés lors des mises à jour. Pour les très gros sites, le sitemap reste une option viable si la maintenance est automatisée.
Le x-default hreflang est-il vraiment obligatoire ?
Pas obligatoire, mais fortement recommandé. Il permet de définir une page par défaut pour les utilisateurs dont la langue ou la région n'est pas explicitement couverte, évitant que Google serve une variante aléatoire.
Comment détecter les erreurs hreflang qui passent inaperçues ?
Utilise Search Console pour repérer les erreurs signalées par Google, et complète avec un crawl Screaming Frog pour vérifier la cohérence des balises. Les asymétries (page A déclare B, mais B ne déclare pas A) sont fréquentes et invisibles sans outil.
🏷 Related Topics
Content Crawl & Indexing International SEO

🎥 From the same video 12

Other SEO insights extracted from this same Google Search Central video · duration 1h04 · published on 20/07/2018

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.