Official statement
Other statements from this video 16 ▾
- 4:03 Pourquoi un contenu de qualité ne garantit-il pas un bon classement dans Google ?
- 7:37 Faut-il encore prévoir un fallback JavaScript pour le lazy loading natif ?
- 9:21 HTTPS améliore-t-il vraiment le référencement ou est-ce un mythe SEO ?
- 11:53 Les URLs en caractères japonais bloquent-elles l'indexation au-delà de 100 pages ?
- 15:27 Peut-on choisir quelle page de son domaine Google affiche dans les SERP ?
- 18:17 Existe-t-il vraiment une limite au nombre d'items dans les carousels de recettes ?
- 21:17 Pourquoi les pages indexées persistent-elles dans site: après la fermeture d'un service ?
- 26:37 Les soft 404 pénalisent-ils vraiment votre SEO global ?
- 29:45 Pourquoi les nouveaux sites basculent-ils automatiquement en mobile-first indexing ?
- 33:14 Faut-il vraiment s'inquiéter de la distinction entre / et /index.html ?
- 34:38 L'outil de désaveu de liens sert-il vraiment à combattre le negative SEO ?
- 40:54 Google neutralise-t-il vraiment la majorité des liens spam automatiquement ?
- 42:38 L'URL canonique peut-elle changer selon la géolocalisation du visiteur ?
- 45:54 Pourquoi max-image-preview:large est-il indispensable pour Google Discover ?
- 48:25 Un redirect mal configuré puis corrigé peut-il quand même transférer le PageRank ?
- 54:52 Peut-on forcer Google à afficher une page plutôt qu'une autre pour une même requête ?
Google, through Gary Illyes, states that if multiple pages present the same text with purely visual variations (layout, display order), canonicalization is optional. The absolute imperative remains to provide precisely the same information to crawlers and users — any discrepancy between the two would be penalized as cloaking. Specifically, if your textual content is strictly identical and only the design changes, you can treat these URLs as duplicates or keep them separate, depending on your architecture.
What you need to understand
What does “same content, different appearance” really mean?
Google refers here to pages that display the same raw text but with variations in presentation: modified block order, different typography, colors adapted based on user profiles, or responsive design that reorganizes elements.
Typically, this concerns sites that personalize the UI based on user parameters (dark/light mode, display preferences, dynamic sorting) without altering the semantic content. As long as the informational substance remains identical, Google considers these variations to not provide differentiated value for ranking.
Why does Google leave the choice of whether to canonicalize or not?
Because these visual variations do not create problematic semantic duplication for the algorithm. The engine already identifies these pages as nearly identical and will naturally apply a consolidation process if necessary.
On the other hand, if you choose not to canonicalize, Google might index multiple URLs without guaranteeing which one will be served in the SERPs. The decision thus depends on your need for strict control over the preferred URL versus the flexibility of architecture.
Where is the line between acceptable variation and distinct content?
The boundary remains blurry — and this is where the issue arises. Google talks about “placement order” or “visual appearance,” but does not specify if modifying the informational hierarchy (block A before block B versus B before A) changes the semantic signal.
In practice, if the H1-H6 tags, the title, main paragraphs, and HTML structure remain identical or nearly identical, you are in the tolerance zone. As soon as the textual content differs substantially (addition/removal of sections, rephrasing), it falls outside the scope of this declaration.
- Acceptable variation: same text, reorganized block order, different CSS, UI customization
- Problematic variation: different textual content, added/removed sections, rephrased titles or paragraphs
- Absolute imperative: serve exactly the same information to users and Googlebot — any divergence = cloaking
- Strategic choice: canonicalize if you want to control the reference URL, keep distinct if the architecture requires it without major SEO impact
- Blurred limit: Google does not precisely define where “visual variation” ends and “semantic difference” begins
SEO Expert opinion
Is this statement consistent with observed practices in the field?
Yes, overall. Tests conducted on highly customized UI sites show that Google tolerates cosmetic variations without penalizing indexing. Problematic cases arise when developers confuse “different appearance” with “adaptive content” — for example, an e-commerce site that modifies product descriptions based on the device.
But be cautious: the wording “基本的な文章内容が同じ” (“the core textual content is identical”) leaves room for interpretation. In reality, even a change in HTML structure can influence Google’s semantic understanding. [To be verified] to what extent a mere reordering of blocks impacts topical relevance score.
What concrete risks are there if you do not canonicalize these variants?
First, loss of control over the URL served in the SERPs. Google will arbitrarily choose which variant to index and display, which can create inconsistencies in your analytics and A/B tests. Secondly, potential dilution of popularity signals if backlinks point to different URLs of the same page.
Thirdly — and this is the most insidious — risk of wasted crawl budget on redundant pages for large sites. If you generate 10 URLs per product just for sorting variations, Googlebot will spend time on these duplicates instead of exploring new strategic pages.
In what cases does this rule absolutely not apply?
As soon as you serve differential content based on the user-agent, you step outside the boundaries of this tolerance. Even a minor nuance — like hiding a paragraph from crawlers to “improve” bot readability — places you into pure and simple cloaking.
Similarly, if your visual variants involve changes in meta title/description tags, you create contradictory signals. Google cannot treat two pages as identical if their metadata differs. Lastly, be wary of multilingual sites: serving the same content with just a UI translation is not a simple visual variation — each language must have its own canonical URL with appropriate hreflang.
Practical impact and recommendations
What should I do if my site generates these variants?
First, audit all generated URLs to identify which are true visual variations versus distinct content. Use a crawler configured with different user-agents and compare the extracted raw text — if the diff shows discrepancies beyond CSS, these are not acceptable variants.
Second, decide on a consistent canonicalization strategy. If you opt for consolidation, choose the “cleanest” URL (without unnecessary parameters, descriptive URL) as canonical and apply it systematically. If you keep distinct, ensure your sitemaps and internal linking prioritize a reference URL to avoid dilution.
How can I check that Googlebot sees exactly what my users see?
Use the URL Inspection Tool in Search Console and compare the HTML rendering captured by Google with the actual rendering in your browser. Enable the “View Rendered Source Code” mode to spot any dynamic content that might differ. If you are using JavaScript to modify the display, verify that Googlebot actually executes the JS and sees the final result.
Also test with curl simulating Googlebot (user-agent “Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)”) and compare it to a conventional desktop user-agent. Any differences in the <title>, <h1>, or main paragraphs is a warning signal.
What critical mistakes should be absolutely avoided in this context?
Never hide content from Googlebot under the pretext of optimizing crawl time. Even if your intention is innocent, Google interprets this as cloaking. Also, avoid serving a “lighter” version to bots: they must see exactly the same informational richness as humans.
Be cautious of URL parameters that change presentation without semantic value (e.g., ?layout=grid vs ?layout=list). If you do not canonicalize, Google may index both and randomly choose which one to serve. Instead, use cookies or localStorage to store user preferences without modifying the URL.
- Scrape the site with multiple user-agents and compare the extracted text to identify any discrepancies
- Define a canonical reference URL for each group of visual variants
- Check in Search Console that Googlebot's rendering is strictly identical to the user's rendering
- Test with curl + Googlebot user-agent to spot any conditional content served to bots
- Document in a table the acceptable variants (pure UI) versus content variations (to be treated as separate pages)
- Configure the XML sitemap to reference only canonical URLs, not variants
❓ Frequently Asked Questions
Si je change l'ordre des produits affichés sur une page catégorie via un tri utilisateur, dois-je canonicaliser ?
Le mode sombre/clair qui ne change que le CSS nécessite-t-il une canonicalisation ?
Google peut-il pénaliser si je laisse plusieurs variantes visuelles non canonicalisées ?
Comment traiter les pages AMP qui présentent le même contenu avec une mise en page allégée ?
Si mon site sert du contenu personnalisé selon la géolocalisation mais que l'URL reste la même, est-ce acceptable ?
🎥 From the same video 16
Other SEO insights extracted from this same Google Search Central video · duration 59 min · published on 02/07/2020
🎥 Watch the full video on YouTube →
💬 Comments (0)
Be the first to comment.