Official statement
Other statements from this video 19 ▾
- 27:21 Why does it take 28 days for your Core Web Vitals to update in Search Console?
- 36:39 Is it really necessary to test your Core Web Vitals in the lab to prevent regressions?
- 98:33 Do CSS animations really hurt your Core Web Vitals?
- 121:49 Will Core Web Vitals Change Again, and How Can You Prepare for Upcoming Updates?
- 146:15 Are city-specific pages really just doorway pages doomed by Google?
- 185:36 Does the crawl budget really depend on your server speed?
- 203:58 Should you really start small to unlock your crawl budget?
- 228:24 Should you really regenerate your sitemaps to remove obsolete URLs?
- 259:19 Why does Google refuse to provide Voice Search data in Search Console?
- 295:52 How can you compel Google to refresh your JavaScript and CSS files during rendering?
- 317:32 How can you effectively map URLs and verify redirects during migration to avoid losing rankings?
- 353:48 Do you really need to include dates in structured data?
- 390:26 Is it really necessary to change the date of an article with every update?
- 432:21 Should you really count the number of H1 tags on a page?
- 450:30 Do headings really hold as much importance as Google thinks?
- 555:58 Are LSI keywords really beneficial for Google SEO?
- 585:16 Is there a magic number of links per page to optimize internal PageRank?
- 674:32 Do JSON requests really impact your crawl budget?
- 789:13 Can Google really figure out that a URL is duplicated without even crawling it?
Blocking JSON via robots.txt prevents Google from indexing content that relies on these files after rendering. This rule applies to both your own site and third-party sites using your public APIs. Specifically, if your visible content requires JavaScript loaded via JSON, blocking these resources renders your pages invisible to Google.
What you need to understand
Why does blocking JSON cause indexing problems?
Google operates in two stages: initial crawl and then JavaScript rendering. When Googlebot retrieves your raw HTML, it then initiates a rendering process to execute the JS and load dynamic resources. If your JSON files are blocked in robots.txt, the bot can download your HTML but cannot retrieve the data needed for the final rendering. The result: it indexes an empty or incomplete page, even if everything works visually on the user side. Applications using React, Vue, or Angular often load their content via JSON API calls. If you block \/api\/*.json, for example, Google will never see the content generated after hydration. This is particularly critical for e-commerce sites where product listings, prices, and availability are loaded dynamically. Without access to the JSON, Google indexes product pages without descriptions or prices — essentially invisible in the results. Yes, and it's less intuitive. If you provide a public API that is consumed by other sites, blocking your JSON endpoints prevents the indexing of content displayed on those third-party sites. Imagine a review aggregator using your API: if you block \/reviews.json, the aggregated content will not be indexable by Google, even if it's not your own site. You indirectly penalize your partners.How does this rule impact sites using modern frameworks?
Are third-party sites using your APIs affected as well?
SEO Expert opinion
Does this statement truly reflect observed behavior in the field?
Yes, absolutely. Technical audits regularly show sites with misconfigured robots.txt blocking \/wp-json\/, \/api\/, or \/\*.json out of excessive caution. The problem is that many developers believe they are "protecting" their data by blocking these endpoints without realizing they are sabotaging their own indexing. I've seen Shopify stores lose 40% of their organic traffic after mistakenly blocking their collection JSONs. Of course. If your JSON contains sensitive data (user info, B2B pricing, internal stock), it should be blocked — but then, do not use it to display indexable public content. The distinction is simple: JSON used for client-side rendering of visible content = do not block. Purely backend or admin JSON = it's up to you. [To verify]: Google has never specified whether authentication mechanisms (tokens, headers) are sufficient to circumvent this issue without blocking in robots.txt. None. Unlike other SEO signals where you can compensate (weak backlinks but excellent content), blocking a critical JSON equates to making your page invisible. It’s binary. Always test your robots.txt modifications with Search Console > URL Inspection > Test live URL. If the rendered output is empty while your page functions normally, you've blocked an essential resource.Are there cases where blocking JSON remains legitimate?
What is the acceptable margin of error in this configuration?
Practical impact and recommendations
How can you quickly audit your current robots.txt rules?
Download your robots.txt and look for all lines containing .json, \api\/, \data\/, or \content\/. For each Disallow rule found, ask yourself: "Does this file serve to display visible content for users?” Then use the Test robots.txt tool in Search Console. Paste a JSON URL that you suspect is blocked and check if Googlebot can access it. If it’s blocked while that JSON loads your product listings, you’ve found your culprit. Immediately remove the corresponding Disallow rule in robots.txt. Then, force a quick reindexing via Search Console by requesting inspection of the affected pages. Monitor your server logs in the following days: you should see Googlebot crawling the previously blocked JSONs. If not within 72 hours, it may indicate that this rule wasn’t the only cause (also check HTTP headers, X-Robots-Tag, etc.). For public data (product listings, articles, reviews), keep JSONs accessible without restriction. For sensitive data, consider using token authentication or serving these JSONs from a non-public subdomain. You can also implement server-side rendering (SSR) or static site generation (SSG) to ensure that critical content is present in the initial HTML, without relying on JavaScript rendering. Less elegant technically, but much more robust from an SEO perspective.What should you do if you discover critical blocked JSON for indexing?
What strategy should you adopt to secure your APIs without blocking indexing?
❓ Frequently Asked Questions
Bloquer un JSON dans robots.txt affecte-t-il uniquement Googlebot ou aussi les autres moteurs ?
Peut-on bloquer partiellement les JSON, par exemple uniquement pour certains crawlers ?
Si mon JSON est accessible mais retourne une 401 ou 403, est-ce équivalent à un blocage robots.txt ?
Les JSON chargés via fetch() côté client sont-ils concernés par cette règle ?
Comment savoir si mes pages sont indexées avec ou sans le contenu JSON ?
🎥 From the same video 19
Other SEO insights extracted from this same Google Search Central video · duration 912h44 · published on 05/03/2021
🎥 Watch the full video on YouTube →
💬 Comments (0)
Be the first to comment.