What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

Google chooses a canonical URL based on two main criteria: the site’s preference and the utility for the user. To indicate the site's preference, one can use the link rel=canonical annotation, redirects, internal links, and specify preferred URLs in the sitemap file. Google favors HTTPS URLs and those that are aesthetically pleasing.
1:36
🎥 Source video

Extracted from a Google Search Central video

⏱ 3:09 💬 EN 📅 04/09/2019 ✂ 3 statements
Watch on YouTube (1:36) →
Other statements from this video 2
  1. 2:38 Pourquoi la cohérence de canonisation de vos URLs détermine-t-elle votre visibilité dans Google ?
  2. 3:09 Une URL canonique incorrecte peut-elle vraiment nuire à votre classement SEO ?
📅
Official statement from (6 years ago)
TL;DR

Google selects the canonical URL of a page by weighing two dimensions: the technical signals you emit (canonical, redirects, sitemap, internal linking) and what it deems useful for the end user. HTTPS and 'aesthetic' URLs carry weight — but Google has the final say. In practical terms, your annotations are suggestions, not commands. If your site sends conflicting signals or if Google finds that another URL better serves the user, it can ignore your preference.

What you need to understand

Why doesn't Google always respect your canonical tag?

The rel=canonical tag is a recommendation, not a directive. Google takes it into account, but it has never been mandatory. If your technical signals contradict each other — a sitemap pointing to one URL, internal links pointing to another, poorly configured redirects — the algorithm will decide based on what it finds most consistent.

The second criterion, utility for the user, remains deliberately vague. Google does not detail the specific metrics, but we can deduce that the quality of the URL (readability, structure), the HTTPS protocol, and potentially the click-through rate play a role. A cryptic URL with lengthy parameters is likely to be overlooked in favor of a clean version, even if your canonical points to the former.

What does Google mean by 'aesthetically pleasing URLs'?

Mueller refers to 'clean' URLs, free from session IDs, unnecessary tracking parameters, and parameter duplicates. A URL like /product/mens-running-shoes will always be preferred over /product.php?id=1234&ref=abc&session=xyz. This is a signal of editorial and technical quality.

This preference is also based on user behavior: a readable URL inspires more trust in the SERPs. Consequently, Google integrates this dimension before the click, favoring the URL it believes is most likely to be clicked and shared. As a result, if you have two versions of the same page, one with a clean URL and the other with tracking, you lose control over the canonical choice.

How does Google arbitrate between conflicting signals?

Imagine a site that declares URL A in its sitemap, but 100% of the internal links point to URL B, with a canonical tag referencing A. Google will cross these signals and may very well choose B if the crawl finds it to be the version actually highlighted by the site. Internal linking is a permanent vote of confidence — much more reliable than a standalone tag in the <head>.

301 redirects remain the strongest signal. If you properly redirect A to B, Google considers B to be the canonical version, period. But if you multiply the chains of redirects or loops, the algorithm may decide to not canonize either and choose a third URL discovered elsewhere.

  • Google treats the canonical tag as a strong suggestion, but not enforceable
  • 301 redirects are the most powerful canonicalization signal
  • Internal linking carries significant weight: if all your links point to one URL, that’s the one Google will likely retain
  • HTTPS is favored by default over HTTP, even in the absence of other signals
  • 'Clean' URLs (without superfluous parameters) are favored for their user utility

SEO Expert opinion

Is this statement consistent with field observations?

Yes, broadly speaking. It is regularly observed that Google ignores canonicals when the internal linking heavily points to another URL, or when the sitemap contains different URLs than those declared as canonical. This aligns with the idea that Google cross-references multiple signals and does not blindly trust a tag.

However, opacity remains regarding what exactly constitutes 'utility for the user.' [To be verified] Does Google measure the CTR of URLs in the SERPs to adjust its canonical choice? Does it analyze the semantic structure of the URL to deduce its relevance? Mueller does not provide any specific numerical details. We remain in ambiguity — and this is intentional.

What limits should be placed on this recommendation?

Mueller's discourse oversimplifies significantly. He does not mention cases where Google cannot choose a canonical — typically when two URLs differ too much in content or intent. In this case, neither is canonized, leading to unresolved duplicate content.

A second limitation: the notion of 'aesthetically pleasing' URLs is subjective. A URL with technical identifiers can be perfectly legitimate for an API or CMS. Google should not systematically penalize these structures — but in practice, it does. We have seen sites with URLs like /product/12345 losing their canonical in favor of redirected versions like /product/product-name, even though the first was declared canonical.

In what cases does this rule not apply?

On sites with complex pagination or filter facets, Google may choose a canonical URL that does not match your tag or your internal linking, simply because it deems it more 'representative.' This is particularly true in e-commerce: a page /shoes?size=42&color=black may be canonized to /shoes if Google believes the filtered version does not provide unique value.

Another exception: multilingual or multi-regional sites. Google may choose a canonical URL in a different language than the one you specified if the content is deemed equivalent and the other version has more authority. Hreflang tags do not prevent this behavior — they serve for internationalization, not canonicalization.

Warning: If you have conflicting signals (canonical to A, linking to B, sitemap with C), Google may choose D — a URL you have never mentioned but which it has discovered through external crawling. Always check in Search Console which URL is actually retained.

Practical impact and recommendations

What practical steps should you take to control the canonical choice?

The first step: audit the consistency of your signals. Extract your sitemap, crawl your site to identify all canonical tags, list your 301 redirects, and compare. If you find contradictions, fix them immediately. A sitemap containing non-canonical URLs is a signal of confusion for Google.

Next, clean up your URLs. Remove unnecessary tracking parameters from your internal links. If you absolutely must track, use server-side solutions or intermediary redirects, but do not pollute your crawlable URLs. A clean URL is more likely to be retained as canonical.

How to check that Google has understood your preference?

Use the 'Coverage' report in Search Console and filter by 'Detected, currently not indexed' or 'Excluded by canonical tag'. If URLs you thought were canonical appear as excluded, it means Google has chosen another version. Dig deeper by inspecting the URL using the live test tool.

A second check: compare indexed URLs via a search site:yourdomain.com with the URLs you declared canonical. If you see variants with parameters or HTTP versions despite migrating everything to HTTPS, it's an alarm signal. Google did not follow your preference.

What technical errors block canonical recognition?

The first classic error: redirect chains. If A redirects to B, which redirects to C, which has a canonical pointing to D, Google can get lost and choose a random URL. Limit yourself to one redirect per URL, and ensure that the final target bears a canonical to itself.

The second error: poorly formatted relative canonicals. If your tag points to /product instead of https://yourdomain.com/product, and your site has subdomains or protocol variants, Google may misinterpret the canonical. Always favor absolute URLs.

  • Crawl your site and extract all canonical tags: check that they point to accessible URLs (200), not to 404s or redirects
  • Compare the XML sitemap with the declared canonicals: each URL in the sitemap should be self-canonical (canonical to itself)
  • Audit the internal linking: if 80% of your links point to URL A but your canonical designates B, correct the linking
  • Migrate 100% of the site to HTTPS and enforce the HTTP → HTTPS redirect: Google systematically favors HTTPS
  • Clean up tracking parameters in internal URLs: use UTMs only on external links (campaigns, social)
  • Test duplicate URLs via the Search Console inspection tool to see which version Google retained as canonical
The issue of canonical choice is central to avoiding authority dilution and duplicate content. But the devil is in the technical details: an inconsistency between sitemap, canonical, and internal linking can be enough to lose control. These optimizations demand a sharp expertise in information architecture and crawling — if your site exceeds a few hundred pages or if you manage an e-commerce with facets, support from a specialized SEO agency can help you avoid costly mistakes and save you months of post-deployment corrections.

❓ Frequently Asked Questions

Google peut-il ignorer complètement ma balise canonical ?
Oui. La balise canonical est une recommandation, pas une directive. Si Google détecte des signaux contradictoires (maillage interne, sitemap, redirections) ou juge qu'une autre URL est plus utile pour l'utilisateur, il peut choisir une URL canonique différente de celle que vous avez spécifiée.
Les redirections 301 sont-elles plus fortes que la balise canonical ?
Oui. Une redirection 301 est le signal le plus puissant pour indiquer à Google qu'une URL A doit être remplacée par une URL B. La balise canonical, elle, suggère qu'une URL est la version préférée parmi plusieurs variantes accessibles. En cas de conflit, la redirection l'emporte.
Pourquoi Google indexe-t-il une URL HTTP alors que j'ai migré en HTTPS ?
Si vous n'avez pas mis en place de redirection 301 permanente de HTTP vers HTTPS, ou si votre maillage interne pointe encore vers des URLs en HTTP, Google peut continuer à crawler et indexer l'ancienne version. Vérifiez aussi que votre sitemap ne contient que des URLs HTTPS.
Faut-il ajouter une balise canonical sur toutes les pages ?
Oui, par précaution. Même sur une page unique sans doublon, une balise canonical auto-référencée (pointant vers elle-même) clarifie votre intention et évite que Google ne crée un canonical vers une variante découverte ailleurs (par exemple avec des paramètres de session).
Comment savoir quelle URL Google a choisie comme canonical ?
Utilisez l'outil d'inspection d'URL dans la Search Console. Entrez l'URL que vous souhaitez vérifier, et Google vous indiquera quelle URL il considère comme canonical. Si elle diffère de votre balise, c'est un signal que vos signaux techniques sont contradictoires.
🏷 Related Topics
Crawl & Indexing HTTPS & Security Links & Backlinks Domain Name PDF & Files Redirects Search Console

🎥 From the same video 2

Other SEO insights extracted from this same Google Search Central video · duration 3 min · published on 04/09/2019

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.