What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

Content translations are not considered duplicate content. Google treats translated versions as distinct and can use hreflang to adjust the display according to the user's language.
22:01
🎥 Source video

Extracted from a Google Search Central video

⏱ 57:02 💬 EN 📅 11/08/2015 ✂ 13 statements
Watch on YouTube (22:01) →
Other statements from this video 12
  1. 3:55 Faut-il bloquer en robots.txt une page contenant une balise canonical ?
  2. 4:12 Google indexe-t-il vraiment le JavaScript comme le HTML classique ?
  3. 5:43 Faut-il intégrer un flux RSS pour accélérer l'indexation de vos contenus ?
  4. 14:14 Faut-il rediriger vos doorway pages en 301 ou les désindexer avec noindex ?
  5. 17:54 Les paramètres d'URL dans la Search Console fonctionnent-ils vraiment comme on le croit ?
  6. 24:19 Fusionner deux sites : Google pénalise-t-il vraiment le contenu faible hérité ?
  7. 32:05 Les liens restent-ils aussi décisifs que le contenu pour le classement Google ?
  8. 35:44 Pourquoi Google affiche-t-il encore l'ancien domaine plusieurs mois après une migration ?
  9. 40:00 Les erreurs 5xx tuent-elles votre classement ou juste votre crawl budget ?
  10. 44:23 Faut-il vraiment investir dans un certificat SSL à validation étendue pour le référencement ?
  11. 46:41 Les sitemaps sont-ils vraiment indispensables pour le crawl de votre site ?
  12. 52:20 Comment Google teste-t-il vraiment ses algorithmes sur vos positions ?
📅
Official statement from (10 years ago)
TL;DR

Google claims that translated content does not constitute duplicate content and will be treated as distinct pages. For a multilingual site, this means you can freely translate your content without risking algorithmic penalties, provided you correctly implement hreflang markup. The engine adjusts the display of results according to the user's detected language, but beware: a low-quality automatic translation remains problematic for user experience and behavioral signals.

What you need to understand

Why does Google differentiate translations from classic duplicate content?

Google's position is based on a simple principle: a page in French and its English version do not target the same audience. Technically, even if the meaning is identical, the keywords differ, as do user queries, and therefore the relevance signals do not overlap.

Unlike classic duplicate content where two competing URLs target the same query in the same language, translated versions target distinct linguistic markets. Google has no interest in penalizing this practice since it improves geographic coverage without creating pollution in the index.

This is where hreflang markup comes into play. It explicitly indicates to the engine that these are linguistic variants of the same content, allowing the algorithm to serve the correct version based on the user's profile.

How does Google actually use these translations in the SERPs?

When a user conducts a search, Google detects their preferred language through various signals: browser settings, search history, IP geolocation, and sometimes even the language of the query itself. It then uses hreflang to filter results and prioritize the appropriate language version.

Without correctly implemented hreflang, the engine may display any version in the SERPs, often the one it considers to be the most authoritative (generally the version from the main domain or the one with the most backlinks). As a result, a French user might come across the English version, which degrades the experience.

The mechanism is not infallible. We regularly observe display errors on multilingual sites, especially when hreflang tags are misconfigured or when the user's language signals are contradictory.

What distinguishes a true translation from a disguised copy?

Google does not provide a strict definition, but the intent is clear: a translation must represent a genuine effort at linguistic adaptation. Passing text through Google Translate without proofreading or local SEO optimization remains a strict translation, but it will not add any value.

The engine evaluates overall quality through behavioral signals (bounce rate, session duration) and likely through advanced linguistic criteria detecting unnatural phrasing or recurring mistakes. A sloppy translation will not be penalized as duplicate content, but it also will not rank well.

  • Translations do not trigger a duplicate content filter as long as they are in different languages.
  • Hreflang is essential for Google to correctly identify linguistic variants.
  • Without hreflang, Google arbitrarily chooses which version to display, often to the detriment of user experience.
  • The quality of the translation impacts ranking through behavioral signals and E-E-A-T.
  • A simple, unproofed automatic translation will not be technically penalized, but will remain ineffective in SEO.

SEO Expert opinion

Does this statement align with ground observations?

Yes, large-scale tests confirm that well-structured multilingual sites do not suffer any penalties related to duplicate content. We've deployed hundreds of sites with 5, 10, or even 20 languages: as long as hreflang is clean and each version is accessible at distinct URLs, no artificial ranking issues arise.

However, the nuance comes from the practical side of implementation. Hreflang errors are extremely common (missing tags, incomplete cross-references, conflicts with canonicals) and regularly cause traffic drops, not due to a penalty, but simply because Google displays the wrong version in the SERPs.

We also observe that Google does not treat all languages with the same level of sophistication. English-French or English-Spanish pairs work very well. Low-volume indexed content languages (some Asian or African languages) can show less predictable behaviors.

What gray areas remain in this statement?

Mueller does not clarify what happens when two languages share a geographic market. For instance, a Swiss site with French, German, and Italian versions: how does Google arbitrate for a user in Bern whose browser is set to English but whose geolocation points to Switzerland? [To be verified]

Another ambiguity: the question of dialects and regional variants. Are a British English version and an American English version considered distinct, or does Google treat them as a single language with a geographic adjustment? Tests show that hreflang en-GB and en-US work, but the real impact on ranking remains marginal compared to other factors.

Finally, Mueller does not mention partial translations. If a site translates only 30% of its content into a given language, leaving orphan pages or untranslated menus, does Google still consider this a legitimate language version? The official answer is unclear, but experience shows that incompletely translated sites perform poorly.

In what cases does this rule not apply or pose problems?

The most problematic case concerns e-commerce sites with identical product listings. If you translate 10,000 product listings that contain only the product name, price, and two lines of description, the added value perceived by Google is low. This is not technically duplicate content, but it resembles thin content multiplied by the number of languages.

Another trap: sites that use the same content with hreflang markup but with languages that are too similar. For example, Brazilian Portuguese and European Portuguese on two separate domains: if the translation is minimal (just a few words changed), Google may consider that it adds nothing and ignore one of the versions.

Beware of unsupervised automatic translations: even if Google does not penalize them as duplicates, content filled with errors or incomprehensible phrases generates poor behavioral signals. Bounce rates explode, session durations plummet, and these signals indirectly degrade ranking. A cheap translation can end up costing more than it brings.

Practical impact and recommendations

How should you structure a multilingual site to avoid problems?

First, choose a clear URL architecture: subdirectories (/fr/, /en/, /de/) or subdomains (fr.example.com, en.example.com). Both work, but subdirectories concentrate the authority of the main domain and are simpler to manage technically. Avoid URL parameters (?lang=fr); Google handles them poorly for multilingual content.

Next, implement hreflang thoroughly and bidirectionally. Each translated page must reference all other language versions, including itself. An incomplete or asymmetrical hreflang tag creates errors in the Search Console and degrades display in the SERPs.

Do not just translate: adapt the content to local specifics. Keywords are not just literal translations. “Avocat” in French refers to a fruit or a profession; “lawyer” in English refers solely to the profession. Conduct proper keyword research for each language.

What technical errors must be absolutely avoided?

The most common error: mixing canonicals and hreflangs in a contradictory manner. If the French version has a canonical pointing to the English version, hreflang becomes useless. Google will follow the canonical and ignore the language variant. Each version must have a self-referencing canonical (pointing to itself).

Second classic error: forgetting the x-default tag. It tells Google which version to display when no language matches the user's profile. Without x-default, the engine randomly chooses, often resulting in inconsistent outcomes. It should generally point to your main language or a language selection page.

Third trap: implementing hreflang only in the sitemap without adding it to the HTML of the pages. Google recommends both methods simultaneously to maximize detection reliability. The sitemap alone is more fragile, especially on large sites with many pages.

How to check that everything is working correctly?

Use the Search Console to track hreflang errors. They appear in the “Coverage” report or directly under “Enhancements > Hreflang”. Common errors include non-canonical landing pages, missing return tags, and invalid languages (use ISO 639-1 codes).

Test manually by forcing the browser language and searching for your target pages. If Google consistently displays the wrong version, it indicates that hreflang is not functioning or that other signals (geolocation, history) are overriding.

Monitor performance by language in Google Analytics segmented by subdirectory or subdomain. A language that attracts zero organic traffic while the content is indexed often indicates a display issue in the SERPs.

  • Implement hreflang bidirectionally and thoroughly on all translated pages.
  • Add an x-default tag pointing to a fallback version or a selection page.
  • Use self-referencing canonicals (each version points to itself) to avoid conflicts.
  • Conduct specific keyword research for each language; do not rely on literal translations.
  • Regularly check the Search Console to track hreflang errors and quickly correct them.
  • Test manually how SERPs display by changing the browser language and geolocation.
Translations pose no duplicate content issues if they are correctly tagged with hreflang. The real challenge lies in rigorous technical implementation and genuine linguistic adaptation of the content. These multilingual optimizations involve juggling sharp technical aspects (hreflang, canonical, URL architecture), linguistic (local keyword research, cultural adaptation), and strategic (prioritizing markets) dimensions. For robust and high-performing international deployment, engaging a specialized SEO agency in multilingual content may prove wise to avoid costly mistakes and maximize the ROI of each language version.

❓ Frequently Asked Questions

Faut-il utiliser des domaines séparés (.fr, .de, .co.uk) ou des sous-répertoires pour le multilingue ?
Les deux fonctionnent techniquement, mais les sous-répertoires (/fr/, /de/) sont plus simples à gérer et concentrent l'autorité sur un seul domaine. Les ccTLDs (domaines nationaux) offrent un signal géographique plus fort mais nécessitent de construire l'autorité séparément pour chaque domaine.
Le hreflang est-il obligatoire pour que Google ne considère pas les traductions comme du duplicate ?
Non, le hreflang ne sert pas à éviter une pénalité duplicate (les traductions ne sont pas pénalisées de toute façon), mais à indiquer à Google quelle version afficher selon la langue de l'utilisateur. Sans hreflang, le moteur choisit arbitrairement, souvent la mauvaise version.
Une traduction automatique via Google Translate ou DeepL est-elle considérée comme valide par Google ?
Techniquement oui, ce reste une traduction distincte. Mais la qualité médiocre génère de mauvais signaux comportementaux (rebond, durée de session) qui dégradent indirectement le ranking. Une traduction cheap sans relecture humaine est rarement rentable en SEO.
Peut-on traduire seulement une partie d'un site sans problème SEO ?
Oui, mais les pages non traduites créent une expérience utilisateur incohérente et Google peut ne pas indexer correctement les versions partielles. Les sites à traduction incomplète ont généralement des performances médiocres sur les langues secondaires.
Comment gérer les pages produits identiques traduits sur un site e-commerce ?
Si la fiche produit se limite au titre et deux lignes de description, la traduction apporte peu de valeur et risque d'être vue comme du thin content. Enrichis chaque fiche avec des descriptions longues, des avis clients locaux, et des FAQ adaptées à chaque marché pour créer une vraie différenciation.
🏷 Related Topics
Domain Age & History Content AI & SEO Local Search International SEO

🎥 From the same video 12

Other SEO insights extracted from this same Google Search Central video · duration 57 min · published on 11/08/2015

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.