What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

Google considers the simultaneous use of 'noindex' and 'canonical' tags on the same page to be contradictory. If you want a page not to be indexed, use 'noindex' alone. The canonical tag sends the page's signals to another URL designated as the preferred version.
6:50
🎥 Source video

Extracted from a Google Search Central video

⏱ 42:25 💬 EN 📅 28/01/2016 ✂ 7 statements
Watch on YouTube (6:50) →
Other statements from this video 6
  1. 1:09 Pourquoi Google ignore-t-il les contenus de vos footers pour le ranking ?
  2. 2:41 Les meta descriptions sont-elles vraiment inutiles pour le SEO ?
  3. 4:47 Les pages anciennes perdent-elles leur ranking après chaque mise à jour Google ?
  4. 27:58 JavaScript et SEO : Google indexe-t-il vraiment vos contenus chargés dynamiquement ?
  5. 31:31 Les redirections bloquées par robots.txt cassent-elles vraiment vos liens ?
  6. 34:04 Les employés de Google peuvent-ils vraiment manipuler le ranking de votre site ?
📅
Official statement from (10 years ago)
TL;DR

Google claims that combining noindex and canonical on the same page is contradictory: the canonical tag transfers signals to a canonical URL, while noindex requests de-indexing. For an SEO practitioner, this means one must choose a side: either you want a page to be crawled but not indexed (noindex only), or you want to consolidate signals towards a preferred version (canonical only). The situation gets complicated when inheriting a poorly configured site with both tags active.

What you need to understand

Why does Google's directive create practical problems?

Google's statement seems clear on paper: noindex and canonical should never coexist. Nonetheless, this situation frequently arises in the reality of SEO audits, often accidentally during poorly managed migrations or redesigns.

The technical issue is simple: the canonical tag tells Google which URL should receive ranking signals (backlinks, authority, content). The noindex tag explicitly instructs not to index the page. Thus, Google receives two incompatible instructions: transfer signals to a canonical URL while the source page should not even exist in the index.

How does Google actually handle these pages in production?

According to field observations, Google generally favors the noindex instruction when both tags are present. The page ends up de-indexed, and the signals it could have transmitted via the canonical are lost in the crawl budget limbo.

However, this is not systemic. On some high-authority sites, Google continues to crawl these pages for months before completely de-indexing them. Behavior varies according to domain trust, allocated crawl budget, and overall consistency of the site's robot directives.

What are the exceptions that would justify this configuration?

None, according to Google. And for once, their position aligns with technical logic. If you do not want a page to be indexed, noindex alone is more than sufficient.

The only situation where one might theoretically defend this combination is during a complex technical migration where one wants to temporarily block indexing while preserving the signal consolidation structure. But even in that case, it's a patch that conceals a planning issue. It is better to properly manage 301 redirects rather than tinker with contradictory directives.

  • Google favors noindex when both tags coexist, neutralizing the effect of the canonical
  • This configuration often arises by mistake during migrations or poorly orchestrated redesigns
  • No legitimate use case justifies combining noindex and canonical on the same page
  • Ranking signals are lost when a canonical page is de-indexed via noindex
  • The crawl budget is wasted on pages that Google will end up ignoring anyway

SEO Expert opinion

Does this directive truly reflect Google's observed behavior?

Yes, and it's rare enough to be noteworthy. Empirical tests confirm that Google indeed treats this combination as a configuration error. In the majority of cases, the noindex tag wins out, and the page disappears from the index within 2 to 8 weeks following the crawl.

But let's be honest: the de-indexing delay varies greatly. On low-authority sites, Google can entirely ignore these pages from the first crawl. On established domains with a solid history, I've seen pages remain indexed for 4 to 6 months despite the simultaneous presence of noindex and canonical. The behavior is not binary; it depends on the site's context.

What are the situations where this becomes truly problematic?

The real tragedy occurs on e-commerce sites with thousands of product variations. Imagine a catalog where each color generates a distinct URL, with a canonical pointing to the main page. If a developer applies a global noindex on the variations to avoid duplicate content, the signals accumulated by these pages (customer reviews, backlinks to specific variants) never reach the canonical page.

Another frequent scenario involves multilingual sites with hreflang. When a country is temporarily closed for business reasons, some add a noindex to these versions while maintaining the canonical and hreflang. The result: Google completely loses track of the site's international structure, and clean re-indexing becomes a technical nightmare.

Are there nuances that Google fails to mention?

Absolutely. Google does not clarify what happens to signals already accumulated before applying the noindex. If a page received quality backlinks for months before a noindex was added, are those signals permanently lost? The documentation remains vague. [To be verified]

Similarly, Google does not clarify the behavior when the page with noindex+canonical is eventually cleaned up. Do historical signals return to the canonical if the noindex is removed? Observations suggest that no: once a page has been de-indexed with this awkward configuration, it often requires a forced complete recrawl and waiting several cycles before Google correctly reassesses the proper directives.

Warning: If you inherit a site with this active configuration on hundreds of pages, do not fix everything at once. A massive and simultaneous removal of noindex tags can trigger an explosive indexing of low-quality content, which can temporarily degrade the site's overall performance in the SERPs. Proceed in gradual waves prioritizing pages with high SEO potential.

Practical impact and recommendations

How can you quickly audit your site to detect this issue?

The first reflex: extract all URLs with a noindex tag from your CMS or via a Screaming Frog crawl. Then, filter those that also contain a canonical tag. If you find matches, you have identified your problem pages.

For medium to large sites, a direct SQL query on your database might be quicker. Look for records where the meta_robots field contains "noindex" AND where the canonical_url field is not empty. The export should be null if your configuration is clean.

What is the safest correction procedure?

Never remove both tags at the same time, as you would create a directive void. The correct sequence depends on your ultimate goal. If you want the page to remain out of the index, remove only the canonical and keep the noindex. If you want to consolidate signals, remove the noindex and let the canonical do its job.

In 80% of cases, the correct decision is to remove the noindex. The affected pages are usually marked noindex by mistake or out of over-caution regarding duplicate content. Once the noindex is removed, force a recrawl via the Search Console to speed up the consideration. Monitor actual indexing via a targeted "site:" query during the following 3 to 4 weeks.

What mistakes should you absolutely avoid during the correction?

Don't just correct the code: check that your templates, plugins, and middlewares do not reinsert these directives. On WordPress, some SEO plugins apply cascading rules that can recreate the problem after manual correction.

Another classic pitfall: correcting the HTML tags but forgetting the X-Robots-Tag directives in the HTTP headers. If your server sends "X-Robots-Tag: noindex" in the headers while having a clean canonical in the HTML, you have solved nothing. Use a tool like HTTP Header Checker or Chrome DevTools to verify the actual headers.

  • Crawl the entire site with Screaming Frog or Oncrawl filtering for "noindex + canonical present"
  • Export the list of affected URLs and categorize them by intention (de-index or consolidate)
  • Remove the superfluous tag based on the objective: canonical if the page is to remain out of the index, noindex if it should transmit its signals
  • Check that the X-Robots-Tag HTTP headers do not contradict the HTML tags
  • Force a recrawl of the corrected pages via Google Search Console ("Request indexing" function)
  • Monitor indexing progression over 30 days using targeted site: queries and Search Console coverage reports
The combination of noindex + canonical is a configuration error that Google penalizes by favoring de-indexing, nullifying any benefit of signal consolidation. Auditing this problem is technically simple, but correcting it requires case-by-case analysis to avoid temporarily degrading site performance. These technical optimizations, while conceptually clear, often demand sharp expertise to be implemented safely at scale. If your site has hundreds of URLs in this situation or if you manage a complex architecture (multilingual, e-commerce facets, dynamic parameters), the support of a specialized SEO agency can secure the transition and avoid costly visibility errors.

❓ Frequently Asked Questions

Si je retire le noindex d'une page avec canonical, les signaux remontent-ils immédiatement vers l'URL canonique ?
Non, il faut que Google recrawle la page corrigée et réintègre les signaux progressivement. Comptez 2 à 6 semaines selon le crawl budget de votre site et l'autorité de la page concernée.
Une page avec noindex et canonical continue d'être crawlée par Googlebot ?
Oui, le noindex n'empêche pas le crawl, il bloque uniquement l'indexation. Google continuera de visiter la page mais n'ajoutera pas son contenu à l'index. La canonical sera techniquement lue mais ignorée en pratique.
Peut-on utiliser noindex dans le robots.txt au lieu de la balise meta pour éviter ce conflit ?
Non, la directive noindex n'existe pas dans robots.txt selon les standards officiels. Seuls User-agent, Disallow, Allow, Crawl-delay et Sitemap sont valides. Utiliser noindex dans robots.txt sera simplement ignoré par Google.
Si ma page noindex reçoit des backlinks de qualité, ces signaux sont-ils définitivement perdus ?
Dans la plupart des cas, oui. Une page noindex ne transmet pas son autorité, même si elle possède une canonical. Les backlinks pointant vers elle ne bénéficient ni à la page elle-même ni à l'URL canonique désignée.
Comment gérer les variantes produit en e-commerce sans tomber dans ce piège ?
Utilisez canonical sur les variantes sans ajouter de noindex. Si le contenu des variantes est vraiment pauvre, travaillez plutôt sur l'enrichissement (avis spécifiques, images dédiées) ou consolidez réellement via des redirections 301 si ces pages n'apportent aucune valeur utilisateur.
🏷 Related Topics
Domain Age & History Crawl & Indexing AI & SEO Domain Name

🎥 From the same video 6

Other SEO insights extracted from this same Google Search Central video · duration 42 min · published on 28/01/2016

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.