Official statement
Google confirms that in the absence of sufficient CrUX data for a page, it may rely on scores from similar pages on the same site, or even the overall domain score if the architecture is complex. For an SEO, this means that a little-visited page may inherit the performance of neighboring pages — an estimation mechanism that can either work in your favor or penalize you depending on the overall state of the site. The recommendation: optimize the performance of the entire domain, not just strategic pages.
What you need to understand
What is CrUX and why do some pages lack data?
The Chrome User Experience Report (CrUX) gathers real performance metrics from users' Chrome browsers. When a page receives little traffic, it does not accumulate enough data to generate a statistically reliable report.
Google therefore needs a solution: either ignore these pages, or estimate their score. The statement confirms that similarity estimation is the chosen method, preventing an entire segment of sites — particularly smaller sites or deep pages — from being entirely excluded from the Core Web Vitals ranking.
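One way to see which side of that line a given URL falls on is to query the CrUX API directly: it returns a record when enough field data exists, and a 404 otherwise. A minimal sketch in Python, assuming a Google Cloud API key with the CrUX API enabled (the example URLs are placeholders):

```python
import requests

API_KEY = "YOUR_API_KEY"  # placeholder: a Google Cloud key with the CrUX API enabled
ENDPOINT = f"https://chromeuserexperience.googleapis.com/v1/records:queryRecord?key={API_KEY}"

def has_url_level_data(url: str) -> bool:
    """True if CrUX holds enough field data for this specific URL.

    The API answers 404 when the URL's traffic is too low to produce
    a statistically reliable record."""
    resp = requests.post(ENDPOINT, json={"url": url})
    return resp.status_code == 200

# Placeholder URLs: compare a popular page with a deep, little-visited one.
for url in ["https://example.com/", "https://example.com/deep/orphan-page"]:
    status = "URL-level data" if has_url_level_data(url) else "no URL-level data"
    print(f"{url} -> {status}")
```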
How does this similarity estimation work?
Google operates in two steps. First, it looks for similar pages on the same domain with usable CrUX data. Similarity is based on structure, template, loaded resources — not on textual content.
If this approach fails because the site architecture is too complex or the patterns too varied, Google then applies the overall domain score. This is a safety net that ensures a signal, even if approximate.
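Google has not published the actual algorithm, so the sketch below is purely illustrative and every name in it is hypothetical. It only models the two-step fallback just described: real field data first, then template-level peers, then the origin-wide average.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Optional

# Purely illustrative: Google has not published this algorithm.
# Page, template and crux_score are hypothetical names modeling the
# two-step fallback described above.

@dataclass
class Page:
    url: str
    template: str                # e.g. "product", "blog-post"
    crux_score: Optional[float]  # None = not enough field data

def estimated_score(page: Page, site_pages: list[Page]) -> float:
    # Real field data wins: no estimation needed.
    if page.crux_score is not None:
        return page.crux_score
    # Step 1: similar pages on the same site, matched on structure and
    # template rather than textual content.
    peers = [p.crux_score for p in site_pages
             if p.template == page.template and p.crux_score is not None]
    if peers:
        return mean(peers)
    # Step 2: safety net, the aggregated origin-level score.
    return mean(p.crux_score for p in site_pages if p.crux_score is not None)
```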
Why does this approach pose a problem for SEO practitioners?
The estimation masks real disparities. A poorly optimized orphan page with no traffic can inherit a good score if the rest of the site performs well, and vice versa. This complicates auditing: you no longer know whether the displayed score reflects reality or a smoothed average.
Another issue: sites with heterogeneous architecture (e-commerce with blog sections, interactive tools, product sheets) risk having radically different pages aggregated under a single estimated score, diluting the signal’s relevance.
- CrUX relies on real field data, so certain pages without traffic have no exploitable history.
- Google uses similar pages from the same site to fill the gaps, matching on architecture and resources rather than content.
- If the architecture is too complex, the overall domain score is applied by default.
- This estimation method makes it difficult to accurately identify underperforming pages without direct CrUX data.
- Optimization should therefore aim for overall coherence rather than a page-by-page isolated approach.
SEO Expert opinion
Is this statement consistent with field observations?
On paper, yes. Tools like PageSpeed Insights and Search Console have long displayed scores aggregated by URL group, which points to this estimation mechanism. The problem is that Google never specifies the data threshold required for a page to be considered “sufficiently documented”.
Indeed, we observe cases where orphan pages inherit scores that do not correspond to their technical reality. But without access to the raw CrUX metrics per page, it is impossible to verify whether the estimation works in your favor or hinders you. [To verify]: Google does not publish either the similarity criteria or the respective weight of different metrics in the estimation algorithm.
What nuances should be added to this statement?
The phrase “similar pages” remains vague. Structural similarity, yes, but to what degree? A product page template with 10 different components can show enormous performance variation depending on the images, third-party scripts, and the number of recommended products. The estimation therefore risks smoothing over critical gaps.
A second point: applying the overall site score to a complex architecture means drowning out the specifics. A site with a fast blog and a heavy JavaScript configurator will get an average score that reflects neither one nor the other. For an SEO, this means treating each section as a micro-site with its own performance target.
In what cases does this rule not apply?
If a page has sufficient CrUX data, Google estimates nothing: it uses the real metrics. But the threshold of “insufficiency” is never publicly defined. Empirically, pages with fewer than a few hundred monthly Chrome visits seem to fall under estimation, but this is a field observation, not an official rule.
Another exception: noindex or crawl-blocked pages do not count in the calculation, even if they generate traffic. Google does not include them in the CrUX reports. Finally, very new sites without sufficient history may not have a score at all for several weeks.
Practical impact and recommendations
What concrete steps should be taken to manage this estimation?
First, map your templates. Identify groups of pages that share the same technical structure: product sheets, blog articles, landing pages, category pages. Each group must have homogeneous performance, as Google will likely treat them as a unit.
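As a starting point, a rough template map can be derived from URL paths alone, assuming the first path segment identifies the template; real sites may need CMS metadata or HTML fingerprints instead. A minimal sketch:

```python
from collections import defaultdict
from urllib.parse import urlparse

def group_by_template(urls):
    """Group URLs by their first path segment as a template proxy."""
    groups = defaultdict(list)
    for url in urls:
        path = urlparse(url).path.strip("/")
        template = path.split("/")[0] if path else "home"
        groups[template].append(url)
    return groups

# Placeholder URLs, e.g. pulled from a sitemap or crawl export.
urls = [
    "https://example.com/",
    "https://example.com/blog/core-web-vitals-guide",
    "https://example.com/product/red-widget",
    "https://example.com/product/blue-widget",
]
for template, members in group_by_template(urls).items():
    print(f"{template}: {len(members)} page(s)")
```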
Then, concentrate your optimization efforts on the most critical templates: those that generate SEO traffic or represent a significant volume of pages. If your blog performs poorly, every orphan blog page will inherit that poor score via estimation.
What mistakes to avoid in this estimation context?
Do not assume that a page nobody visits escapes the calculation. If it shares a template with well-crawled pages, it contaminates the group’s estimation. The result: a slow page with no traffic can drag down the score of an entire cluster.
Another trap: optimizing only high-traffic pages. If the rest of the site is a technical disaster, the overall score will remain mediocre and hinder your strategic pages as a side effect. The silo approach no longer works with this aggregation logic.
How can I check if my site benefits from this estimation logic?
In Search Console, open the “Experience” tab > “Core Web Vitals”. Look at the URL groups classified as “Good”, “Needs Improvement”, or “Poor”. If pages with no traffic appear in these reports, they are benefiting from (or suffering from) the estimation.
Then compare with PageSpeed Insights on specific URLs. If PSI shows “No field data available” but Search Console still classifies the page, the estimation is at play. At that point, dig into the technical analysis of the template to understand where loading time goes.
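The PageSpeed Insights API exposes this distinction programmatically: its loadingExperience object carries an origin_fallback flag when URL-level CrUX data was unavailable and origin-level data was substituted. A small check in Python, assuming an API key (the URL is a placeholder):

```python
import requests

API_KEY = "YOUR_API_KEY"  # placeholder
PSI = "https://www.googleapis.com/pagespeedonline/v5/runPagespeed"

def field_data_source(url: str) -> str:
    data = requests.get(PSI, params={"url": url, "key": API_KEY}).json()
    exp = data.get("loadingExperience", {})
    if not exp.get("metrics"):
        return "no field data at all"
    # PSI sets origin_fallback when it substituted origin-level data
    # because the URL itself lacks sufficient CrUX records.
    if exp.get("origin_fallback"):
        return "origin-level fallback (estimation territory)"
    return "URL-level field data"

print(field_data_source("https://example.com/deep/orphan-page"))  # placeholder URL
```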
- Map the page templates and measure the performance of each group using Lighthouse or WebPageTest.
- Prioritize the optimization of strategic page clusters (high volume, high SEO traffic) to avoid contamination by estimation.
- Check in the Search Console if pages without CrUX data still appear in Core Web Vitals reports.
- Monitor the evolution of the overall domain score in CrUX via BigQuery to anticipate aggregation impacts (see the query sketch after this list).
- Avoid overly heterogeneous architectures: the more distinct your templates are technically, the blurrier the overall estimation will be.
- Test orphaned or little-visited pages with synthetic tools (Lighthouse) to detect discrepancies between technical reality and displayed estimated score.
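For the BigQuery monitoring mentioned in the list above, here is a sketch against the public chrome-ux-report dataset using the google-cloud-bigquery client. It computes the share of real page loads with a “good” LCP (under 2.5 s) for an entire origin, which approximates the origin-level aggregate Google falls back on:

```python
from google.cloud import bigquery  # pip install google-cloud-bigquery

# Requires a GCP project with BigQuery enabled; the origin is a placeholder
# and the YYYYMM suffix of the table selects the month to inspect.
client = bigquery.Client()

query = """
SELECT
  SUM(IF(bin.start < 2500, bin.density, 0)) / SUM(bin.density) AS good_lcp_share
FROM
  `chrome-ux-report.all.202012`,
  UNNEST(largest_contentful_paint.histogram.bin) AS bin
WHERE
  origin = 'https://example.com'
"""

for row in client.query(query).result():
    share = row.good_lcp_share
    print("No CrUX data for this origin" if share is None
          else f"Share of page loads with LCP < 2.5 s: {share:.1%}")
```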
❓ Frequently Asked Questions
Can you tell whether a page is using real or estimated CrUX data?
Can the site-wide score penalize a fast page with no traffic?
How does Google define “similarity” between pages?
Is a site with no CrUX data at all penalized in rankings?
Should you optimize zero-traffic pages if they have no CrUX data?