What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

PageSpeed Insights does not use Googlebot to fetch the page, which can lead to discrepancies in evaluation if the robots.txt prevents Googlebot from accessing crucial resources like CSS or JavaScript.
18:51
🎥 Source video

Extracted from a Google Search Central video

⏱ 33:51 💬 EN 📅 13/03/2015 ✂ 8 statements
Watch on YouTube (18:51) →
Other statements from this video 7
  1. 4:40 Le mobile-first indexing rend-il vraiment votre SEO desktop obsolète ?
  2. 5:11 Quels outils Google faut-il vraiment utiliser pour tester la compatibilité mobile de son site ?
  3. 6:15 Quel outil Google choisir pour diagnostiquer vos problèmes mobiles ?
  4. 9:49 L'expérience mobile pénalise-t-elle réellement votre positionnement Google ?
  5. 11:26 Pourquoi Google Search Console reste-t-elle incontournable pour diagnostiquer les problèmes d'indexation ?
  6. 27:10 Les futurs changements algorithmiques de Google affecteront-ils uniquement le mobile ?
  7. 30:08 Le responsive design est-il vraiment obligatoire pour le référencement mobile ?
📅
Official statement from (11 years ago)
TL;DR

PageSpeed Insights uses its own user agent to fetch pages, which is different from Googlebot. If your robots.txt blocks Googlebot from accessing CSS or JavaScript resources, you will see an optimistic PSI score that does not reflect what Google actually indexes. As a result, you might think everything is fine while Googlebot is crawling a degraded version of your site.

What you need to understand

Are PageSpeed Insights and Googlebot the same thing?

No. PageSpeed Insights uses its own crawler, which is technically based on Lighthouse and Chrome infrastructure, not Googlebot. When you run a PSI test, Google fetches your page with a different user agent than the one it utilizes for indexing your content.

Specifically? If your robots.txt blocks certain resources to Googlebot (typically CSS files, JavaScript, or fonts), PageSpeed Insights will still have access. It will return a score based on a complete version of the page, with all styles and scripts loaded. But Googlebot will crawl a truncated version.

Why does this distinction pose a problem in SEO?

Because you are optimizing blindly. You run PSI, see green across the board, and think that Core Web Vitals are top-notch. However, Google, on its side, indexes a page without critical CSS or missing certain scripts, which can degrade the actual user experience measured by the engine.

The classic trap: blocking /wp-content/themes/ or /assets/ in the robots.txt out of fear for crawl budget or due to misconfiguration. PSI still loads everything, but Googlebot sees a broken page, poorly structured, with incomplete rendering. Your Search Console metrics then show degraded signals that do not match your PSI tests.

How can I tell if my site is affected?

Three warning signals: significant discrepancies between PSI and Core Web Vitals reports in Search Console, warnings in GSC about blocked resources, or pages that display poorly in the URL inspection tool (incomplete HTML rendering).

Check your robots.txt line by line. Any Disallow pointing to CSS, JS, or fonts directories is suspect. Google has recommended for years not to block any critical rendering resources, but this mistake remains common in poorly configured CMS or haphazard migrations.

  • PageSpeed Insights does not go through Googlebot, so it bypasses any robots.txt restrictions you impose on the engine.
  • An optimal PSI score does not guarantee that Google indexes a performant version of your page.
  • Blocking CSS or JavaScript to Googlebot creates a gap between your tests and the crawling reality.
  • The Core Web Vitals measured by Google rely on what Googlebot can actually load, not on what PSI sees.
  • Always check the URL inspection tool to compare Googlebot's rendering with your local version.

SEO Expert opinion

Is this statement consistent with what we observe on the ground?

Absolutely. We regularly see sites with impeccable PageSpeed Insights scores but persistent Core Web Vitals alerts in Search Console. A typical case: a WordPress site that blocks /wp-includes/ or /wp-content/ as a precaution, thinking it protects sensitive files.

PSI loads the page without restriction and shows 95+ everywhere. However, Googlebot, encounters a 403 on critical CSS, renders the page in degraded mode, and reports catastrophic metrics. The client cannot understand why their optimizations do not reflect in the rankings when all their tests show green.

What nuances should we apply to this statement?

Google does not explicitly specify which user agent PSI uses or how it handles potential redirects or geographic variations. We know that PSI is based on Lighthouse infrastructure, but the technical details remain vague. [To verify]: does PSI respect robots.txt directives for other user agents or does it completely ignore the file?

Another point: Google talks about "crucial resources" without precisely defining what is crucial. In reality, everything that impacts initial rendering is crucial: inline or external CSS, deferred loading JavaScript, webfonts, even certain SVGs. But a practitioner must test on a case-by-case basis as the boundary is not documented.

In what cases does this rule not apply?

If your robots.txt is clean and does not block any rendering resources, the difference between PSI and Googlebot will be marginal. This is primarily an issue with legacy sites, poorly configured CMS, or migrations where a robots.txt was copied and pasted without auditing.

Also beware of sites with overly broad Disallow rules (like Disallow: /wp-content/), often inherited from outdated SEO recommendations. In such cases, PSI and Googlebot consistently diverge. If you have a modern site with a minimalist robots.txt, you will likely never see this inconsistency.

Alert: Never rely solely on PageSpeed Insights to validate your Core Web Vitals. Always cross-check with the URL inspection tool in Search Console and the actual field data in the CWV report.

Practical impact and recommendations

What concrete steps should I take to avoid this discrepancy?

First action: audit your robots.txt immediately. Open it, identify all Disallow lines that point to directories containing critical CSS, JavaScript, fonts, or images. If in doubt, temporarily comment out the line and test Googlebot's rendering via the URL inspection tool.

Then, use the "Test robots.txt file" tool in Search Console (Settings > Explorers > robots.txt section). Enter the URLs of your main CSS and JS files, and check that they are not blocked for Googlebot Desktop and Googlebot Smartphone. If a critical file is prohibited, remove the corresponding rule.

What errors should I absolutely avoid?

Never block /wp-content/themes/, /assets/, /static/, /dist/, or any directory containing rendering resources. This is a classic error inherited from old SEO practices where it was thought to protect content or save crawl budget. Today, it just breaks Googlebot's rendering.

Another trap: relying solely on PageSpeed Insights tests to validate optimizations. PSI is a good indicator, but it does not reflect crawling reality. Always validate with the URL inspection tool and the actual CWV data from Search Console, collected over 28-day rolling periods.

How can I verify that my site is compliant after fixes?

Once your robots.txt is cleaned up, force a reindexing of key pages via the URL inspection tool. Wait 48-72 hours, then compare the HTML rendering captured by Googlebot with your local version. If styles and scripts are loaded properly, you are good to go.

Then monitor the Core Web Vitals report in Search Console over 4 weeks. If the metrics improve and get closer to your PSI scores, it means Googlebot is finally seeing the same thing as your testing tool. Otherwise, there are likely other blocks or stray redirects remaining.

  • Audit the robots.txt and remove any Disallow rule blocking CSS, JavaScript, fonts, or critical images.
  • Test each resource URL via the "Test robots.txt file" tool in Search Console.
  • Validate Googlebot's rendering with the URL inspection tool, comparing it with the local version.
  • Cross-check PageSpeed Insights scores with actual data from the Core Web Vitals report (28-day rolling).
  • Force reindexing of strategic pages after correcting the robots.txt.
  • Monitor discrepancies between PSI and GSC over a month to confirm the issue has been resolved.
This type of robots.txt audit combined with Googlebot validation can quickly become time-consuming, especially on sites with complex architecture or customized CMS. If you observe persistent discrepancies between your tests and the metrics from Search Console, or if your robots.txt contains dozens of inherited rules with unknown origins, seeking assistance from an SEO agency specialized in crawling and indexing can save you months of trial and error.

❓ Frequently Asked Questions

PageSpeed Insights utilise-t-il vraiment un user-agent différent de Googlebot ?
Oui. PSI se base sur l'infrastructure Lighthouse et Chrome, avec son propre user-agent. Il ne passe pas par Googlebot, donc il contourne les restrictions robots.txt appliquées au crawler Google.
Si mon score PSI est excellent mais mes Core Web Vitals dans GSC sont mauvais, c'est forcément un problème de robots.txt ?
Pas forcément, mais c'est une piste prioritaire. D'autres causes possibles : variations géographiques, trafic mobile vs desktop, ou problèmes de serveur sous charge réelle. Vérifiez d'abord le robots.txt et le rendu Googlebot via l'outil d'inspection.
Peut-on bloquer certains fichiers JavaScript non critiques sans impacter le rendu Googlebot ?
En théorie oui, si ces scripts sont vraiment non critiques (analytics, chat, publicité). Mais en pratique, c'est risqué : Google recommande de ne rien bloquer. Testez systématiquement le rendu après chaque modification.
Les données Core Web Vitals de Search Console reflètent-elles toujours ce que Googlebot voit ?
Les CWV de GSC agrègent des données terrain collectées via Chrome User Experience Report (CrUX), donc des vrais utilisateurs. Elles complètent le rendu Googlebot mais ne le remplacent pas. Les deux sources doivent converger si tout est bien configuré.
Faut-il auditer le robots.txt à chaque mise à jour majeure du site ?
Oui, systématiquement. Chaque refonte, migration ou changement de CMS peut introduire de nouvelles règles Disallow par défaut. Un audit robots.txt doit faire partie de votre checklist post-déploiement.
🏷 Related Topics
Domain Age & History Crawl & Indexing AI & SEO JavaScript & Technical SEO Web Performance

🎥 From the same video 7

Other SEO insights extracted from this same Google Search Central video · duration 33 min · published on 13/03/2015

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.