Official statement
Other statements from this video 13 ▾
- 1:04 Faut-il rediriger ou laisser en 404 les pages obsolètes ?
- 3:17 Comment gérer efficacement une pénalité manuelle Google sans perdre des mois de trafic ?
- 8:06 Changer de CMS fait-il vraiment chuter vos positions Google ?
- 14:35 Le contenu généré par les utilisateurs peut-il nuire au classement de votre site ?
- 16:07 Panda est-il vraiment devenu un signal de qualité permanent pour tous les algorithmes Google ?
- 17:13 Pourquoi vos balises hreflang doivent-elles pointer vers les URL canoniques ?
- 19:11 Les liens nofollow nuisent-ils vraiment au classement SEO de votre site ?
- 21:37 Les backlinks toxiques peuvent-ils vraiment détruire votre SEO ?
- 24:58 Pourquoi vos rich results chutent-ils sans que votre trafic ne bouge ?
- 26:02 Pourquoi Google cache-t-il certaines de vos pages dans les résultats de recherche ?
- 31:27 Les pop-ups mobiles tuent-ils vraiment votre référencement ?
- 35:56 Les chaînes de redirections tuent-elles vraiment votre PageRank ?
- 45:49 La balise unavailable_after peut-elle vraiment anticiper vos 404 et accélérer la désindexation ?
Google recommends allowing filtered Magento pages to be crawlable while using canonical tags pointing to the main page. This approach enables Googlebot to discover products without diluting PageRank. However, be cautious: this strategy assumes a sufficient crawl budget and well-managed technical architecture, or you risk burying your strategic pages.
What you need to understand
Why does Google want to crawl pages that it won't index?
The logic may seem contradictory at first. Google recommends allowing the crawling of filtered variants generated by Magento (sorts, price filters, product attributes) even if these pages will not be indexed. The engine needs these paths to discover the entire catalog.
Specifically, a user may access a product sheet via the filter "Red dresses size 38" while Googlebot might never find this product through the main navigation. By blocking these filtered URLs, you potentially cut off exploration routes to strategic pages. Google uses these variants as bridges, not as final destinations.
How does the canonical tag solve the duplication issue?
The canonical tag tells Google which version to index when multiple URLs display identical or nearly identical content. In the case of Magento filters, all variants (with or without sorting, with or without filters) point to the mother page via rel=canonical.
This approach preserves PageRank by consolidating signals onto a single URL while allowing content discovery. Google crawls the variants, follows the internal links they contain, but focuses authority on the canonical page. It's a compromise between accessibility and SEO effectiveness.
What are the risks if we block these pages in robots.txt?
Blocking filtered URLs seems logical to save crawl budget. The problem is: Googlebot can't see the links present on these pages. If your internal linking mainly goes through filtered listing pages, you're creating SEO orphans.
Another trap: robots.txt prevents crawling but does not stop indexing. Google can index a blocked URL if it discovers it via an external link, but without being able to read its content or its canonical tag. You end up with indexed URLs without a title or description, a nightmare for the SERP.
- Allowing the crawling of filtered pages enables complete discovery of the product catalog
- The canonical tag consolidates PageRank on the main page despite URL variants
- Blocking in robots.txt creates orphans and prevents Google from reading canonical tags
- This strategy assumes a sufficient crawl budget and a clean technical architecture
- Filters must generate consistent URLs to facilitate canonical management
SEO Expert opinion
Does this recommendation work for all e-commerce sites?
Let's be honest: Mueller's strategy is optimal for medium-sized catalogs with a well-structured technical architecture. For a site with 50,000 products and 15 combinable filters, you're looking at millions of potential URLs. The crawl budget becomes a real issue.
I've seen e-commerce sites ruin their indexing by leaving everything open. Google spends its time crawling worthless variants while new product sheets stay queued for weeks. [To verify]: Google does not provide any figures on the threshold where this approach becomes counterproductive.
Is the canonical tag really enough to solve everything?
In theory, yes. In practice, it’s more complicated. Google does not guarantee to respect the canonical 100%, it’s a strong directive but not an absolute command. I've observed cases where Google indexes the filtered page despite a correct canonical, especially if it receives direct backlinks.
Another concern: Magento sometimes generates shaky dynamic canonicals, especially on poorly configured 1.x versions. A technical audit is essential before you open the floodgates. And that’s where it gets tricky: how many sites have verified that their canonicals point to URLs with the correct protocol, without residual parameters, without cascading redirects?
When should we still block certain variants?
Not all filters are equal. Sorts (ascending/descending price, new arrivals, etc.) generally do not create any SEO value. Infinite paginations do not either. In these cases, a noindex, follow or selective blocking may be smarter than a canonical.
Specifically, I would tend to allow relevant business filters (size, color, material) that truly help product discovery, and to block or noindex purely UX variants. Mueller's recommendation lacks granularity: it assumes that all Magento filters have the same value, which is rarely the case in practice.
Practical impact and recommendations
How to configure Magento to implement this strategy?
By default, Magento 2 generates canonicals to the main category page, but only if the configuration is correct. Go to Stores > Configuration > Catalog > Catalog > Search Engine Optimization and check that "Use Canonical Link Meta Tag For Categories" and "Use Canonical Link Meta Tag For Products" are enabled.
Then verify that your filters indeed generate consistent URLs. Some third-party modules create chaotic parameters (?filter_color=red&sort=price vs /color-red/sort-price/). The more predictable your URLs are, the more reliable canonical management becomes. Test a few combinations manually and inspect the source code.
What technical errors are lurking with this approach?
The most common error: chained canonicals. The filtered page points to an intermediate page which itself points to the main page. Google hates that and may completely ignore the directive. Each canonical must point directly to the final URL, without detours.
Another classic trap: mixing canonical and noindex on the same page. If you set noindex, the canonical becomes useless (Google won't index either one). Choose your side. For Magento filters, the logic remains: crawlable + canonical, not noindex.
How to monitor that everything is working correctly?
Search Console is your best friend here. Go to "Pages" and then filter by "Excluded" to see if any filtered variants appear as "Detected, currently not indexed" or "Crawled, currently not indexed." This is normal and desirable. If they appear indexed despite the canonical, you have a problem.
Also monitor the crawl budget in "Crawl Stats." If the number of crawled pages explodes without any indexing improvement, you're wasting resources. A well-optimized site sees its ratio of crawled pages to indexed pages remain stable or improve, not deteriorate.
- Activate automatic canonicals in SEO config for categories and products in Magento
- Audit the URLs generated by filters to verify their consistency and predictability
- Check that each canonical points directly to the final URL without a chain of redirects
- Do not mix noindex and canonical on the same filtered pages
- Monitor Search Console to confirm that the variants remain "crawled but not indexed"
- Watch the crawl budget to detect potential resource wastage
❓ Frequently Asked Questions
Peut-on utiliser noindex au lieu de canonical sur les pages filtrées Magento ?
Faut-il mettre les URLs filtrées dans le sitemap XML ?
Comment gérer les filtres qui créent du contenu unique (descriptions différentes) ?
Le paramètre URL doit-il être dans un ordre spécifique pour le canonical ?
Combien de temps avant que Google respecte les canonical après un déploiement ?
🎥 From the same video 13
Other SEO insights extracted from this same Google Search Central video · duration 55 min · published on 30/05/2017
🎥 Watch the full video on YouTube →
💬 Comments (0)
Be the first to comment.