Official statement
Other statements from this video 27 ▾
- 13:31 Can your slow pages drag down the rankings of your entire site?
- 13:33 Do Core Web Vitals really affect your entire site or just your slow pages?
- 13:33 Can you really block the collection of Core Web Vitals using robots.txt or noindex?
- 14:54 Why does CrUX collect your Core Web Vitals even if you block Googlebot?
- 15:50 Does Google really underplay the true importance of Page Experience in rankings?
- 16:36 Is Page Experience really just a secondary ranking signal?
- 17:28 Does LCP truly measure the speed perceived by the user?
- 19:57 Do Core Web Vitals really measure continuously throughout the user session?
- 20:04 Do Core Web Vitals really change after the initial page load?
- 21:22 How does Google estimate your Core Web Vitals when CrUX data is lacking?
- 27:07 How does Google now assign AMP cache's CrUX data to the origin?
- 29:47 Is AMP still necessary to rank in Top Stories on mobile?
- 32:31 How can you leverage server logs to uncover 4xx errors in Search Console?
- 34:34 Why do new sites experience extreme volatility in indexing and ranking?
- 34:34 Should you really analyze server logs to diagnose 4xx errors in Search Console?
- 34:34 Why does your new site fluctuate like a yo-yo in the SERPs?
- 40:03 Should you really report copied content from your site using Google's spam form?
- 40:20 How can you effectively report copied content spam to Google?
- 43:43 Are your franchise pages considered doorway pages by Google?
- 45:46 Is duplicate content really harmless to your SEO?
- 45:46 Is it true that duplicate content won't penalize your SEO?
- 45:46 Are your franchise pages seen as doorway pages by Google?
- 51:52 Does the http:// or https:// namespace in an XML sitemap really affect crawlability?
- 52:00 Does using HTTPS for your XML sitemap namespace hurt your SEO ranking?
- 55:56 Is it really sufficient to include only one version, mobile or desktop, in your XML sitemap?
- 56:00 Should you really submit both mobile AND desktop versions in your sitemap?
- 61:54 Should you give up on AMP if you’re using GA4 to measure your performance?
Google uses scores from similar pages on the same site to estimate the Core Web Vitals of a page lacking CrUX data. This estimation logic—based on proximity or a default site-wide score—directly impacts the ranking of under-visited pages. For an SEO, this means that optimizing high-traffic pages can indirectly enhance the overall domain score.
What you need to understand
What is CrUX and why do some pages lack data?
The Chrome User Experience Report (CrUX) aggregates real user performance metrics collected from Chrome users. For a page to have sufficient CrUX data, it must receive a significant volume of visits over a given period.
Pages with low organic traffic (deep pages, niche content, new pages) often do not meet this threshold. Google then has no reliable metrics to evaluate their Core Web Vitals. Specifically, a little-visited product page or an old blog page with few visitors will not have its own CrUX score.
How does the 'similar site' estimation work?
Google employs a fallback logic: if a page lacks data, the algorithm first looks for 'similar' pages within the same domain. Similarity likely depends on HTML structure, common templates, content type, or navigation depth.
If the site's structure is too complex or heterogeneous, Google defaults to the overall domain score. In other words, the average rating of all CWV for the site becomes the proxy for this isolated page. This approach prevents penalizing an entire site for a handful of pages without data, but it also obscures local performance variations.
Why is this statement important for an SEO?
It confirms that Google does not rely solely on direct measures for all content. Therefore, a site can improve the perceived score of hundreds of pages by optimizing only those that generate traffic—and thus measurable CrUX data.
Conversely, a site with a few slow yet highly visited pages risks contaminating the score of the entire domain. This estimation logic creates a side effect: optimizing poor performers can disproportionately impact the overall ranking.
- Pages without CrUX data inherit the score from similar pages or the entire domain
- The site's structure influences the choice of estimation (similarity or global)
- Optimizing high-traffic pages can improve the score of invisible pages
- A heterogeneous site (varied templates, disparate performances) risks more fallback to the global score
- This logic is opaque: Google does not detail the 'similarity' criteria
SEO Expert opinion
Is this statement consistent with field observations?
Yes, it aligns with feedback from SEOs who have observed ranking improvements on lesser-visited pages after optimizing main templates. If Google were relying solely on PageSpeed Insights (synthetic tests), these variations would be inexplicable. The effect of CrUX score propagation confirms this mechanism.
However, Google remains deliberately vague about the criteria for 'similar pages'. It’s believed to involve DOM patterns, common CSS classes, similar load times—but there’s no confirmation. [To verify]: sites with multiple templates (blog, e-commerce, landing pages) could experience inconsistent estimations if the algorithm does not segment correctly.
What are the limitations and grey areas of this approach?
The statement does not specify the threshold of sufficient data. Does a page with 100 visits per month have usable CrUX data? 500? 1000? This opacity makes it difficult to anticipate exactly which pages will be estimated and which will have their own score.
Another point: what happens if a 'similar' page itself has an estimated score? Does Google apply a cascading estimation or switch directly to the global score? This question remains unresolved. Furthermore, the notion of 'complex structure' is subjective—does a site with 10 different templates qualify as 'complex'? [To verify].
What is the real risk for an e-commerce or media site?
An e-commerce site with thousands of little-visited product pages may see the majority of its pages estimated based on the global score or a handful of prototype pages. If the homepage and main categories are slow, the entire catalog inherits this penalty, even if the product pages are technically light.
For media sites, the risk is the opposite: highly consulted news articles (often overloaded with ad scripts) can drag down the score of the entire archive. Let's be honest: this logic favors homogeneous and well-structured sites—and penalizes technical patchworks accumulated over the years.
Practical impact and recommendations
What should you prioritize auditing on your site?
Start by identifying the pages that generate real traffic and thus measurable CrUX data. Google Search Console, combined with CrUX via PageSpeed Insights API, allows you to cross-reference URLs with organic traffic and available CWV scores.
Next, segment your site by templates or content types: homepage, categories, product pages, blog posts, conversion pages. For each segment, measure CWV under real conditions (RUM, CrUX Dashboard). If a high-volume template shows poor scores, it potentially contaminates the entire estimated segment.
How can you optimize to reduce the negative impact of estimations?
Favor a homogeneous architecture: the fewer different templates you have, the more accurately Google can refine its similarity estimations. Unify third-party scripts, critical CSS, and loading patterns as much as possible. A site with 3 well-optimized templates is better than a site with 15 disparate templates.
For high-traffic pages, apply classic optimizations: aggressive lazy loading, AVIF/WebP compression, font preloading, elimination of blocking JavaScript, efficient CDN. These pages serve as a reference for the estimation—taking care of them equates to caring for the entire domain. And that’s where it gets tricky: if your infrastructure is aging or technical teams do not prioritize CWV, these optimizations can remain in the backlog for months.
What mistakes should you absolutely avoid?
Do not focus solely on strategic pages (conversion, top 3 ranking). A little-visited page but belonging to a large segment can inherit the average score of the segment. If all your product pages are slow except for 10, the 10 fast ones will not be sufficient to compensate.
Avoid multiplying custom templates for every type of page. The more you fragment, the more likely Google is to default to the global domain score. Lastly, do not neglect pages without organic traffic but accessible (sitemap, internal linking): they are indexed, estimated, and can influence the site's overall perception. Specifically? Regularly clean up obsolete or poorly performing pages that pollute your average score.
- Identify pages with real vs estimated CrUX data (CrUX API + Search Console)
- Segment the site by templates and measure the average CWV by segment
- Prioritize optimizing high-traffic templates
- Homogenize the technical architecture (scripts, CSS, loading patterns)
- Remove or noindex obsolete pages with poor CWV that drag down the global score
- Monitor the evolution of CrUX scores at the domain level (CrUX Dashboard BigQuery)
❓ Frequently Asked Questions
Comment savoir si ma page utilise des données CrUX réelles ou estimées ?
Le score PageSpeed Insights reflète-t-il le score CrUX utilisé pour le ranking ?
Est-ce que toutes les pages d'un même template ont le même score CrUX estimé ?
Faut-il optimiser les pages sans trafic organique pour améliorer le score global ?
Un site avec plusieurs sous-domaines a-t-il un score CrUX global unique ?
🎥 From the same video 27
Other SEO insights extracted from this same Google Search Central video · duration 1h07 · published on 28/01/2021
🎥 Watch the full video on YouTube →
💬 Comments (0)
Be the first to comment.