Is it true that Google really respects the canonical tag?

Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

The rel=canonical tag is a signal for Google, indicating a preference for indexing a different URL. Google must first index and process the page to see the rel=canonical tag. There is a possibility that Google will choose a different URL as canonical based on several other factors such as redirects, internal links, and the sitemap.

3:12

🎥 Source video

Extracted from a Google Search Central video

⏱ 50:59 💬 EN 📅 11/03/2016 ✂ 27 statements

Watch on YouTube (3:12) →

✂ Other statements from this video 26 ▾

📅

Official statement from March 11, 2016 (10 years ago)

⚠ A more recent statement exists on this topic Does Google really treat noindex as an absolute rule, or does it bend the rules ... Martin Splitt · February 3, 2022 View statement →

TL;DR

Google treats the rel=canonical tag as a mere preference signal, not as an absolute directive. The engine must first index and crawl the page to discover this tag, which creates an inevitable delay. In practice, Google reserves the right to select a different canonical URL by cross-referencing several signals: 301 redirects, internal linking, presence in the XML sitemap, and content consistency.

What you need to understand

Why doesn’t Google always follow the canonical tag?

Mueller’s statement clarifies things: rel=canonical is not an instruction, it’s a suggestion. Google analyzes this tag as a clue among others to determine which URL deserves to be indexed as the main version.

Specifically, the engine collects several conflicting or converging signals. If your redirects point to URL A, your sitemap references URL B, and your internal links favor URL C, Google arbitrates according to its own logic. The canonical tag weighs in, but does not decide alone.

What is the delay before Google sees the tag?

The crucial point lies in the timeline: Google must index the page before reading the canonical tag. In other words, if you publish a new URL with a tag pointing to an existing version, Google will crawl this new page first, process it, analyze its source code, and only then discover your preference.

This delay creates a time window during which the “undesired” page may appear in the index. This is particularly problematic for sites with a limited crawl budget or large volumes of dynamically generated pages.

What other signals compete with the canonical?

Mueller explicitly mentions redirects, internal linking, and the sitemap. In reality, Google aggregates much more data: URL age, volume of backlinks pointing to each variant, content consistency between the two versions, crawl history, perceived quality of each page.

If URL B receives a massive amount of external links while your canonical points to the little-known URL A, Google might decide that B deserves primary indexing. The engine aims to offer the best user experience, not just to follow your technical preferences.

The canonical tag is a signal, never an imperative directive like a 301
Google must crawl and index the page before discovering the tag, creating an unavoidable delay
Several competing factors influence the final choice: redirects, internal links, sitemap, backlinks, URL age
The engine retains its arbitration power and may ignore your preference if other signals are stronger
A poorly configured canonical can be completely ignored without explicit notification in Search Console

SEO Expert opinion

Is this statement consistent with field observations?

Absolutely. SEOs regularly encounter cases where Google indexes a different URL than the one specified in the canonical. The URL inspection report in Search Console often shows “Canonical URL chosen by Google” that differs from “Canonical URL declared by the user.”

What still surprises some practitioners is the frequency of these discrepancies. On e-commerce sites with multiple URL parameters, Google sometimes ignores 20-30% of canonicals. The engine does what it wants, and it says so openly.

What nuances should we add to this official discourse?

Mueller remains vague about the actual weighting of each signal. We know Google aggregates several factors, but which ones weigh the most? [To be verified]: no official data quantifies the relative weight of a canonical versus a 301 versus internal linking.

Another area of ambiguity: the indexing delay before the tag is discovered. For a site crawled daily, this isn’t an issue. For a site with a tight crawl budget, a new page may remain “orphaned” for weeks before Google returns to process it and finally discovers the canonical. Mueller never specifies these magnitudes.

In what cases does this rule fail completely?

First classic case: circular canonicals. Page A points to B, page B points to C, page C points to A. Google abandons the tag and chooses arbitrarily. Second scenario: canonical pointing to a URL that returns a 404 or a 301. The signal becomes contradictory, and Google ignores everything.

Third problematic situation: significantly different content between the source page and the canonical target. If you place a canonical from a product page to a category, Google may decide that you are mistaken and index the product page anyway. The engine analyzes the content, not just the tags.

Warning: Never use rel=canonical as a lazy substitute for a real content strategy. Google detects abuses (massive canonicals pointing to a pillar page to “concentrate juice”) and may simply ignore all your tags if the pattern seems manipulative.

Practical impact and recommendations

What concrete steps should be taken to maximize respect for the canonical?

The first rule: absolute consistency between all signals. If you want Google to index URL A, your sitemap must reference only A, your internal links must point to A, your redirects must lead to A, and of course your canonical must designate A. Every conflicting signal reduces the likelihood that Google will follow your preference.

The second action: regularly audit the “Coverage” report in Search Console. Filter the “Excluded” pages with the reason “Alternate page with appropriate canonical tag.” Check that these are indeed pages you wanted to exclude. Conversely, inspect indexed pages to identify those where Google chose a canonical different from yours.

What mistakes should be absolutely avoided?

Never use canonical to a paginated or temporary URL. Google will eventually index this URL but will lose it during pagination changes. The result: unstable indexing. Similarly, avoid canonicals to URLs with session or tracking parameters: these URLs change, making your canonical invalid.

Another frequent mistake: canonical in relative rather than absolute form. Technically acceptable, but a source of bugs if your CMS incorrectly generates the paths. Always prefer complete absolute URLs with HTTPS protocol. Finally, never stack multiple canonical tags in the same <head>: Google only takes the first one or ignores them all.

How can you verify that your setup is actually working?

Test with the URL inspection tool in Search Console. Enter the relevant URL, look at “Canonical URL chosen by Google.” If it differs from your declaration, dig deeper: analyze the internal links pointing to this page, check the sitemap, and review redirects. Google explicitly tells you which URL it recognized.

Second method: targeted site: query. Search site:yourdomain.com "exact page title". Which URL appears first? If it’s not the one you canonicalized, it means Google made another choice. Cross-reference with a crawler like Screaming Frog to identify inconsistencies in your internal linking.

Align all signals: sitemap, internal links, redirects, canonical all point to the same target URL
Only use absolute HTTPS URLs in canonical tags
Audit Search Console monthly to spot discrepancies between declared canonical and retained canonical
Never canonicalize to a temporary, paginated URL, or one with session parameters
Test with URL inspection for each strategic page after modifying the canonical
Avoid chains of canonicals (A to B to C): point directly to the final target

The canonical tag remains a powerful tool for managing duplicate content, but it requires rigorous and consistent configuration across the site. Discrepancies between technical signals can be costly in terms of indexing. For complex sites with thousands of URLs and multi-faceted architectures, maintaining this consistency quickly becomes challenging to manage manually. If you notice persistent gaps between your canonicals and Google's choices despite your adjustments, a thorough technical audit by a specialized agency can identify the invisible contradictions and get your architecture back on solid rails.

❓ Frequently Asked Questions

Google respecte-t-il toujours la balise canonical ?

Non. Google traite rel=canonical comme un signal de préférence, pas une directive absolue. Le moteur croise cette balise avec d'autres signaux (redirections, liens internes, sitemap) et peut choisir une URL différente s'il estime que d'autres indices sont plus pertinents.

Combien de temps faut-il pour que Google découvre une balise canonical ?

Google doit d'abord crawler et indexer la page pour lire la balise canonical dans le code source. Le délai dépend de votre crawl budget et de la fréquence de passage du bot. Pour un site régulièrement crawlé, cela peut prendre quelques jours ; pour un site peu visité, plusieurs semaines.

Canonical en relatif ou absolu, quelle différence ?

Les deux formats sont techniquement valides, mais l'URL absolue (avec https:// complet) évite les bugs liés aux chemins relatifs mal générés par certains CMS. Privilégiez toujours l'absolu pour éliminer toute ambiguïté.

Peut-on utiliser canonical pour concentrer le "jus SEO" sur une page pilier ?

Non, c'est un abus détecté par Google. Canonical sert à gérer le contenu dupliqué ou quasi-dupliqué, pas à manipuler le flux de popularité. Si vous canonicalisez massivement des contenus distincts vers une page unique, Google peut ignorer toutes vos balises.

Comment savoir si Google a ignoré ma canonical ?

Utilisez l'outil d'inspection d'URL dans Search Console. Comparez « URL canonique déclarée par l'utilisateur » et « URL canonique choisie par Google ». Si elles diffèrent, Google a arbitré autrement. Analysez alors les autres signaux (liens internes, sitemap, redirections) pour identifier les contradictions.

🏷 Related Topics

canonical indexation crawl budget contenu dupliqué Search Console maillage interne sitemap XML redirections

Domain Age & History Crawl & Indexing AI & SEO Links & Backlinks Domain Name Redirects Search Console

🎥 From the same video 26

Other SEO insights extracted from this same Google Search Central video · duration 50 min · published on 11/03/2016

🎥 Watch the full video on YouTube →

Related statements

« Previous

Using the RankBrain Algorithm...

A/B Testing and Its Impact on SEO...

« Back to results