What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

For parameters defined as specifying content ('item id' or 'sku'), Google generally recommends exploring each URL. The same applies to 'translates', unless you wish to exclude automatic translations from search results.
11:23
🎥 Source video

Extracted from a Google Search Central video

⏱ 15:05 💬 EN 📅 14/08/2012 ✂ 6 statements
Watch on YouTube (11:23) →
Other statements from this video 5
  1. 0:34 Faut-il vraiment configurer les paramètres d'URL dans Google Search Console ?
  2. 10:17 Faut-il vraiment bloquer les paramètres de filtrage dans le crawl ?
  3. 11:46 Faut-il vraiment laisser Googlebot explorer vos paramètres de tri ?
  4. 12:00 Faut-il vraiment placer ses traductions dans des sous-dossiers pour ranker à l'international ?
  5. 12:32 Faut-il vraiment laisser Google crawler toutes vos pages paginées ?
📅
Official statement from (13 years ago)
TL;DR

Google recommends systematically exploring each URL containing product specification parameters (item ID, SKU) and translation. This stance directly impacts the management of crawl budget and indexing strategy for e-commerce and multilingual sites. A notable exception concerns automatic translations, which you can exclude if the quality of the generated content does not meet your standards.

What you need to understand

What distinction does Google make between specification parameters and navigation parameters?

Google differentiates between two families of URL parameters. Navigation parameters (sorting, filtering, pagination) do not fundamentally change the content: they reorganize or restrict the display. Specification parameters (SKU, item ID, product references) identify unique and distinct content.

This nuance is crucial. A parameter ?color=red on a product page does not create a new product; it displays a variant. In contrast, ?sku=ABC123 points to a specific item in your catalog. Google considers that each SKU deserves its own crawl because it represents a distinct entity in your inventory.

Why does Google insist on the systematic crawl of item IDs and SKUs?

The reasoning is straightforward: each product reference is a potential landing page. A user can directly land on /product?sku=XYZ789 via an external link, a price comparison site, or a product search. If Google does not crawl this URL, it cannot index or rank it in the SERPs.

For an e-commerce site with thousands of references, this means exposing each SKU to crawl. Google's recommendation aims to ensure that your indexing coverage is comprehensive, without gaps in the catalog. This is particularly critical for merchant sites where the business model relies on visibility product by product.

What should be done with automatic translation parameters?

Google introduces a willing exception: you can block the crawl of URLs with the translates parameter if you do not wish to index auto-generated translations. This is an implicit acknowledgment that machine translation often produces lower quality content.

The engine allows you to choose. If your automatic translations are only meant to improve internal UX without aspirations for ranking, exclude them from the crawl. Conversely, if you stand by their quality and want them indexed, let Google explore them. This flexibility contrasts with the strict directive concerning SKUs.

  • Product specification parameters (SKU, item ID): Google recommends exhaustive crawling to ensure catalog coverage
  • Automatic translation parameters: optional crawl, to block if the quality of generated content does not justify indexing
  • Crawl budget impact: this approach can multiply the crawled URLs, requiring fine management of priorities
  • Key distinction: a parameter that identifies unique content (SKU) is not treated as a simple navigation filter
  • Indexing strategy: each product reference must be discoverable and indexed individually

SEO Expert opinion

Is this recommendation consistent with field observations?

Yes and no. For small catalogs (a few hundred products), letting Google crawl every URL with SKU poses no problem. The crawl budget is more than sufficient, and indexing coverage improves. Average merchant sites do see better indexing rates by directly exposing product variants.

On the other hand, for massive catalogs (tens of thousands of references), this approach can saturate the crawl budget. Google does not crawl indefinitely: an e-commerce site with 50,000 SKUs generating 3-4 parameterized URLs each ends up with 150,000-200,000 pages to crawl. In this case, the engine prioritizes, and some URLs remain undiscovered for weeks. [To verify] based on your catalog size.

What nuances should be added to this directive?

Google does not specify how to handle parameter combinations. A product may have a SKU + a color + a size, generating URLs like ?sku=ABC&color=red&size=L. Should all combinations be crawled or just the base SKU? The statement remains unclear on this point.

Additionally, nothing is mentioned regarding canonicalization. If you expose /product/ABC123 AND /product?sku=ABC123, which version should be indexed? The crawl recommendation does not imply duplicate indexing. Smart use of rel=canonical remains essential, but Google does not mention it here.

Finally, the distinction between automatic translations and human translations is not addressed. Does the translates parameter only apply to machine translations, or does it cover all forms of translation? [To verify]: the wording raises some doubt.

Warning: On very large catalogs, applying this recommendation without adjustment may cause crawl congestion. Prioritize strategic SKUs and monitor the Search Console for URLs discovered but not crawled.

In what cases does this rule not apply?

If your site generates dynamic URLs for user personalization (e.g., ?userid=12345), these parameters are not considered "product specifications" in Google's view. They should be blocked via robots.txt or noindex to avoid exposing billions of unnecessary variations.

Moreover, session, tracking, or sorting parameters are not subject to this recommendation. Google explicitly mentions item ID and SKU, which means product identifiers. Everything else (filters, display order, analytics parameters) must be managed with traditional tools: URL Parameters Tool (now obsolete), rel=canonical, or outright blocking.

Practical impact and recommendations

What should be done practically on an e-commerce site?

Start by auditing your URL structures. Identify all parameters that carry a SKU, item ID, or unique product reference. Check in Search Console (Coverage report) how many of these URLs are discovered, crawled, and indexed. A significant gap indicates a crawl budget or configuration issue.

Next, streamline your canonical URLs. If you offer /product/ABC123 and /product?sku=ABC123, choose a master version and canonicalize the other. Google can crawl both, but only one should be indexed. Avoid diluting internal PageRank over duplicates.

How can automatic translations be managed without polluting the index?

If you use a machine translation service to generate multilingual content, evaluate its quality. Have a few pages proofread by a native speaker. If the result is roughly done, it’s better to block these URLs with noindex or via robots.txt.

Alternatively, set up a dedicated parameter like ?translates=auto for machine translations, distinct from your manually translated URLs. This way, you can allow the crawl of human translations while blocking the auto-generated versions. This granularity eases management within the robots.txt file.

What mistakes should be avoided in managing SKU parameters?

Never blindly block all URL parameters in robots.txt. This is the classic mistake: a developer adds Disallow: /*? and cuts access to all parameterized URLs, including SKUs. The result: thousands of products become invisible to Google.

Another pitfall is multiplying parameter combinations without control. A product with 5 attributes (color, size, material, finish, option) can potentially generate hundreds of URLs. If each combination results in a distinct page, you will explode the crawl budget. Favor a main URL per SKU, with attributes handled in client-side JavaScript or through anchors.

  • Identify all parameters carrying a SKU or item ID in your URL architecture
  • Check in Search Console the coverage rate of these URLs (discovered vs indexed)
  • Define a unique canonical URL per product and canonicalize the parameterized variants
  • Audit the quality of automatic translations before exposing them to crawl
  • Configure robots.txt to block only non-product parameters (session, tracking, sorting)
  • Monitor the crawl budget via server logs to detect saturations
Implementing this Google recommendation requires a careful analysis of your URL architecture and decisions regarding crawl prioritization. On a large catalog, technical complexity can quickly become significant: between managing canonicals, optimizing crawl budget, and configuring robots.txt, mistakes can be costly in terms of visibility. If you lack internal resources or your catalog exceeds a few thousand references, hiring a specialized SEO agency can help you secure the implementation and avoid classic pitfalls that hinder indexing.

❓ Frequently Asked Questions

Dois-je crawler toutes les combinaisons de paramètres produit (SKU + couleur + taille) ?
Google recommande de crawler chaque SKU, mais reste flou sur les combinaisons d'attributs. En pratique, privilégiez une URL principale par SKU et gérez les variantes (couleur, taille) via des canonicales ou du JavaScript côté client pour éviter d'exploser le crawl budget.
Que faire si mon crawl budget ne suffit pas à explorer tous mes SKU ?
Priorisez les produits stratégiques (best-sellers, marges élevées, stock disponible) en facilitant leur découverte via le maillage interne. Utilisez les logs serveur pour identifier les URLs découvertes mais non crawlées, et ajustez votre architecture de liens en conséquence.
Les traductions automatiques doivent-elles systématiquement être bloquées ?
Non. Google vous laisse le choix : si la qualité est acceptable et que vous assumez de les indexer, laissez-les être crawlées. Si elles sont approximatives, bloquez-les avec <code>noindex</code> ou <code>robots.txt</code> pour préserver la qualité de votre index.
Comment distinguer un paramètre de spécification d'un paramètre de navigation ?
Un paramètre de spécification (SKU, item ID) identifie un contenu unique et distinct. Un paramètre de navigation (tri, filtre, pagination) réorganise ou restreint l'affichage d'un même ensemble de contenus. Le premier mérite le crawl, le second nécessite canonicalisation ou blocage.
Faut-il utiliser rel=canonical sur les URLs avec paramètres SKU ?
Oui, si vous exposez plusieurs URLs pour le même produit (ex: <code>/produit/ABC123</code> et <code>/produit?sku=ABC123</code>). Définissez une version maître et canonicalisez les autres pour éviter la dilution du PageRank et les doublons dans l'index.
🏷 Related Topics
Content AI & SEO Domain Name International SEO

🎥 From the same video 5

Other SEO insights extracted from this same Google Search Central video · duration 15 min · published on 14/08/2012

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.