What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

If products are accessible through filters but these filters are canonicalized to a non-filtered URL, it may affect indexing if the internal links to the products are not clear.
16:47
🎥 Source video

Extracted from a Google Search Central video

⏱ 59:08 💬 EN 📅 04/04/2017 ✂ 20 statements
Watch on YouTube (16:47) →
Other statements from this video 19
  1. 1:34 Les redirections font-elles vraiment perdre du PageRank ou pas ?
  2. 1:35 Les redirections multiples diluent-elles réellement le jus de lien transmis ?
  3. 2:05 Les redirections sur sous-domaines vers l'externe pénalisent-elles vraiment votre SEO ?
  4. 2:36 Les redirections diluent-elles vraiment la puissance de vos liens ?
  5. 7:28 Pourquoi vos pages n'apparaissent-elles pas dans l'index malgré votre sitemap ?
  6. 15:33 Les erreurs 404 impactent-elles vraiment votre positionnement dans Google ?
  7. 15:42 Faut-il supprimer les pages de profil avec peu de contenu pour éviter une pénalité ?
  8. 17:41 Faut-il encore utiliser 'noindex' dans robots.txt ou est-ce déjà obsolète ?
  9. 19:56 Faut-il vraiment passer tous vos liens externes en nofollow par défaut ?
  10. 21:14 La canonisation vers la page 1 peut-elle ruiner l'indexation de vos produits ?
  11. 26:02 Le texte d'ancrage des liens internes influence-t-il vraiment le positionnement ?
  12. 26:17 Le texte d'ancrage interne influence-t-il vraiment la compréhension de vos pages par Google ?
  13. 39:23 La compression d'images impacte-t-elle vraiment votre classement Google ?
  14. 46:01 Le Data Highlighter reste-t-il pertinent pour tester les données structurées ?
  15. 46:05 Faut-il abandonner le Data Highlighter pour implémenter du balisage structuré directement ?
  16. 54:42 Faut-il vraiment éviter les redirections IP automatiques sur les sites multilingues ?
  17. 55:16 Faut-il vraiment limiter les redirections IP à la page d'accueil pour le SEO multilingue ?
  18. 60:12 Les appels publicitaires non affichés impactent-ils vraiment l'indexation de vos pages ?
  19. 90:15 Faut-il vraiment conserver les redirections après la suppression d'un produit ?
📅
Official statement from (9 years ago)
TL;DR

Google confirms that canonicalizing filtered URLs to a non-filtered URL can block product indexing if the direct internal links to these products are insufficient. Specifically, your product listings may never appear in search results if Google discovers them only through canonicalized filtered pages. The solution: create a strong internal linking structure with direct links to each product, regardless of filters.

What you need to understand

What happens when a filtered page points to a non-filtered page?

You have a category page with color, size, and price filters. Each combination generates a different URL: /t-shirts, /t-shirts?color=blue, /t-shirts?color=blue&size=M.

To avoid content duplication, you add a canonical tag from /t-shirts?color=blue to /t-shirts. This is a common, even recommended practice. The problem? If products are only accessible via these filtered pages, Google might never see them.

Why doesn’t Google follow links on a canonicalized page?

A canonical tag tells Google: "This page is a variant, the original is elsewhere." The search engine will transfer signals (authority, content, links) to the canonical URL.

However, the links present on the canonicalized page are not always followed with the same priority. Google discovers them, sure, but if these links are the only entry points to your product listings, indexing becomes uncertain. The crawl budget focuses on canonical URLs, not on variants.

In what context does this statement make sense?

This issue mainly affects e-commerce sites with exclusively filtered navigation. Imagine a catalog of 10,000 products where each listing is only accessible by applying 2-3 filters from a category page.

If all these filtered URLs are canonicalized to /category, and there are no direct links to the products (neither in the XML sitemap, nor in a "all our products" page, nor in the menu), Google may simply never index these listings. The result? Invisible products in organic search.

  • Canonical does not mean "do not crawl", but Google prioritizes the canonical URL for indexing and PageRank distribution.
  • Links on a canonicalized page are discovered, but not necessarily explored with the same depth as a link from an indexable page.
  • A clear internal linking structure to the products (from non-filtered categories, sitemaps, hub pages) is essential to ensure their indexing.
  • The XML sitemap is not always enough: Google prefers HTML links to understand a page's importance.
  • This issue is amplified on large catalogs where the crawl budget becomes a limiting factor.

SEO Expert opinion

Is this statement consistent with real-world observations?

Yes, and it’s even a welcome confirmation. On many analyzed e-commerce sites, I observed non-indexed product listings while they were technically accessible. The common point? No direct link, only paths through canonicalized filtered URLs.

Google does not say that links on canonicalized pages are totally ignored, but their weight in the internal link graph is clearly reduced. If these links are the only access point, indexing becomes random. [To verify]: Google does not specify the exact threshold of "clarity" in the internal linking structure needed to counter this effect.

What nuances should be added to this rule?

Not all sites are equal when it comes to this issue. A site with a high crawl budget (strong authority, few pages) will cope better than a marketplace with 500,000 URLs. Click depth also plays a role: a product 6 clicks away from a canonicalized page is at greater risk than a product 2 clicks away.

Another nuance: XML sitemaps do not solve everything. Google uses them for discovery, but a product present only in the sitemap, without a direct HTML link, remains fragile. The engine interprets this absence as a signal of low importance.

When does this rule not really apply?

If your architecture already has robust direct links to each product (from the homepage, major non-filtered categories, "new arrivals" or "bestsellers" pages), the risk is minimal. Canonicalized filters then become merely alternative paths, not the main route.

Similarly, on a small catalog (a few hundred products), Google generally crawls everything without difficulty, even with a suboptimal linking structure. The problem becomes critical from a few thousand pages, where the crawl budget starts to segment.

Attention: Do not confuse canonical and noindex. A page with a canonical tag is crawlable, its links are discovered. But if these links are the only paths to your products, Google may explore them without prioritizing them for indexing. The risk is silent: no errors in Search Console, just an absence of indexing.

Practical impact and recommendations

What concrete steps should be taken to secure product indexing?

Create a strong internal linking structure with direct HTML links to each product listing. Specifically: a paginated "All our products" page, blocks for "Latest Additions" or "Best Sellers" on the homepage, links from main categories (non-filtered).

Ensure that each product is accessible within 3 clicks maximum from the homepage, via links that do not go through canonicalized URLs. Add these URLs to your XML sitemap, but do not rely solely on that.

What mistakes should absolutely be avoided in filter management?

Do not canonicalize all your filtered pages by reflex. If a filter generates a page with a distinct search intent (e.g., /organic-cotton-t-shirts), leave it indexable. Canonical should serve to eliminate variants without value, not to blindly homogenize.

Do not depend on JavaScript links to save indexing. Even if Google crawls JS, traditional HTML links remain the most reliable signal. A site that reveals its products only after JS interaction on a canonicalized filtered page combines two handicaps.

How can you check that your site is not affected?

Compare the number of product listings in your database with the number of pages indexed in Search Console ("Coverage" report). A significant gap is a warning sign. Use the URL Inspection Tool on some less visible products: if Google says "URL discovered, currently not indexed," investigate the internal linking structure.

Analyze the server logs: if Googlebot crawls your canonicalized filtered pages but ignores the product listings, it means the link between the two is not clear enough. Use a tool like Screaming Frog to map click depth: any product more than 4 clicks away is suspicious.

  • Create hub pages ("New Arrivals", "Top Sellers") with direct links to individual products.
  • Add a "Recommended Products" block in the sidebar or footer of main category pages.
  • Ensure that each product appears in the XML sitemap with an up-to-date lastmod.
  • Audit click depth: no product should exceed 3-4 clicks from the homepage.
  • Segment filters: let highly intent-driven combinations be indexable, canonicalize the rest.
  • Monitor the "Coverage" report in Search Console to detect products marked as "Discovered, not indexed."
Optimizing the internal linking structure and canonicalization requires specialized technical expertise, especially on large catalogs with thousands of references. Analyzing logs, mapping crawl patterns, and identifying optimal indexing paths are time-consuming tasks. If your team lacks resources or specialized skills, engaging an SEO agency skilled in e-commerce issues can expedite resolution and avoid costly visibility losses. A thorough audit can quickly diagnose blockages and prioritize high-impact actions.

❓ Frequently Asked Questions

Les produits présents uniquement dans le sitemap XML sont-ils garantis d'être indexés ?
Non. Le sitemap aide à la découverte, mais Google privilégie les liens HTML internes pour évaluer l'importance d'une page. Un produit sans lien direct risque d'être ignoré même s'il est dans le sitemap.
Faut-il supprimer toutes les balises canonical sur les pages filtrées ?
Non, ce serait contre-productif. Gardez-les pour les variantes sans valeur ajoutée (tri, pagination simple). Laissez indexables les filtres correspondant à des intentions de recherche distinctes.
Combien de liens directs vers un produit sont nécessaires pour garantir son indexation ?
Il n'y a pas de seuil magique. Un seul lien depuis une page bien crawlée et proche de la homepage peut suffire. L'essentiel est que ce lien soit HTML, suivi, et sur une page indexable.
Les liens en JavaScript sont-ils pris en compte pour l'indexation des produits ?
Google les crawle, mais ils sont moins fiables que les liens HTML classiques. Sur une page canonicalisée, un lien JS vers un produit cumule deux handicaps : signal faible et page non prioritaire.
Comment prioriser les produits à relier en interne si j'en ai des milliers ?
Commencez par les produits à forte marge, forte demande ou nouveautés. Utilisez des blocs dynamiques (bestsellers, promotions) pour créer des liens contextuels sans tout lier manuellement.
🏷 Related Topics
Crawl & Indexing E-commerce AI & SEO Links & Backlinks Domain Name

🎥 From the same video 19

Other SEO insights extracted from this same Google Search Central video · duration 59 min · published on 04/04/2017

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.