Should you block pages receiving backlinks with robots.txt?

Quick SEO Quiz

Test your SEO knowledge in 3 questions

Less than 30 seconds. Find out how much you really know about Google search.

🕒 ~30s 🎯 3 questions 📚 SEO Google

Official statement

If a page is blocked by robots.txt, Google does not know its content and cannot indirectly transfer external links to the main content of the site. It is important to avoid blocking important pages that receive links.

5:50

🎥 Source video

Extracted from a Google Search Central video

⏱ 1h14 💬 EN 📅 11/12/2020 ✂ 46 statements

Watch on YouTube (5:50) →

✂ Other statements from this video 45 ▾

📅

Official statement from December 11, 2020 (5 years ago)

⚠ A more recent statement exists on this topic Should You Really Use Noindex Rather Than Robots.txt to Deindex a Page? John Mueller · March 15, 2021 View statement →

TL;DR

Google states that a page blocked by robots.txt cannot transfer the value of external links to the rest of the site. Essentially, blocking a page that receives backlinks is a waste of SEO juice. The recommendation is to identify these pages and allow them to be crawled, even if their content seems less strategic.

What you need to understand

Why does blocking by robots.txt prevent value transfer?

When a page is blocked by robots.txt, Googlebot simply cannot crawl it. Without access to the content, the algorithm ignores the internal structure of that page — its outgoing links, its interlinking, its thematic context.

While the engine does detect the existence of the page and may even index its URL if mentioned elsewhere, it cannot follow the outgoing links from this page or assess their relevance. As a result, backlinks pointing to this URL do not propagate to the rest of the site.

How is this different from a noindex tag or a 404 page?

A page with noindex remains crawlable. Google can read its content, follow its internal links, and transfer the PageRank assigned to it. The noindex directive simply prevents indexing but does not stop the flow of juice.

A 404 or deleted page also interrupts value transfer, but for a different reason: it no longer exists. The external link then becomes a lost backlink — no redirection, no transmission.

In what contexts is this mistake common?

Some SEOs block by robots.txt pages they consider unhelpful: old landing pages, redundant category pages, temporary content. The problem is that these pages may have accumulated backlinks over time, particularly through past campaigns or organic mentions.

Another common case is technical or admin pages (login, cart, checkout). If these URLs receive links — which happens more often than one might think — their blocking prevents any value transfer to the strategic pages.

Blocking by robots.txt prevents Googlebot from crawling the content and following internal links
Backlinks to a blocked page do not propagate to the rest of the site
A page with noindex remains crawlable and passes juice through its internal links
Technical or temporary pages can receive backlinks unbeknownst to you
It is crucial to regularly audit blocked pages to identify those receiving links

SEO Expert opinion

Is this statement consistent with field observations?

Yes, this is one of the most consensus-based claims from Google. Empirical tests confirm it: blocking a page with robots.txt does indeed cut off the transmission of PageRank through its outgoing links. This is not surprising — it's even the fundamental logic of crawl budget.

Where it gets interesting is with pages that receive backlinks without the SEO being aware. A link profile audit often reveals blocked URLs that capture juice without redistributing it. That's pure and simple waste.

What nuances should be added to this rule?

First nuance: if a blocked page receives toxic or spammy backlinks, blocking can serve as a barrier. While you lose potential juice, you also avoid contamination. In this case, it's best to redirect with a 301 to a healthy page after cleaning.

Second nuance: the internal links of a blocked page are not followed, but the external outgoing links are also not detected. If you block a page that links to partner sites, Google will not see those links — which can be strategic in some cases. [To be verified] according to your objectives.

When does this rule not apply?

If a page is blocked by robots.txt AND subject to a 301 redirection, the value of the backlinks normally transfers to the target URL. The robots.txt does not prevent HTTP redirections from working — it only prevents crawling of the final content.

Another exception: pages blocked by robots.txt can still be indexed by URL if backlinks point to them. Google lists them as "No information available". These URLs indexed without content transfer nothing, but their presence in the index can cause duplication or dilution issues.

Caution: a page blocked by robots.txt that receives backlinks and is redirected with 301 transmits link value. However, if the redirection is in JavaScript or meta refresh, Google will not detect it without crawling the page. Always prefer server redirects.

Practical impact and recommendations

What concrete steps should be taken to avoid this problem?

Your first reflex: audit your robots.txt file and cross-reference the blocked URLs with your backlink profile. Tools like Ahrefs, Majestic, or SEMrush help export the list of pages receiving external links. Compare this list with the Disallow directives in your robots.txt.

If you identify blocked pages that capture juice, there are two options. You can either unblock them to allow for value transfer through their internal links or set up a 301 redirection to a strategic page — the juice will transfer directly, without passing through the blocked page's content.

What errors should be avoided during the audit?

A common mistake is unblocking all pages that receive backlinks without checking the quality of those links. A spam or toxic backlink does not deserve to have a whole page unblocked. First, analyze the profile, disavow if necessary, then decide.

Another trap is forgetting that some pages blocked by robots.txt are still indexed. If you unblock a technical page (login, cart) to recover juice, you risk indexing it with poor content. Combine unblocking with a noindex tag if you want to retain control.

How can I check if my site is compliant?

Use Google Search Console to spot URLs blocked by robots.txt that appear in your link profile. The "Links" tab lists all pages receiving backlinks — filter those marked as "Blocked by robots.txt".

Complete this with a technical crawl via Screaming Frog or Botify. Enable the option "Respect robots.txt" and identify the blocked pages. Cross-reference this crawl with your backlink export to spot potential juice losses.

Export the complete list of URLs blocked by robots.txt
Cross-check this list with the backlink profile (Ahrefs, Majestic, GSC)
Analyze the quality of the backlinks on the blocked pages before unblocking them
Unblock strategic pages or set up 301 redirections
Combine unblocking and noindex for technical pages with low editorial value
Regularly check the "Links" tab in Google Search Console to spot new blocked pages

Blocking by robots.txt a page that receives backlinks is like leaving SEO juice on the table. Regularly auditing the robots.txt file and link profile is essential to prevent this value leak. If you find that several strategic pages are affected, it may be wise to consult a specialized SEO agency for personalized support — these cross-optimizations (crawl, backlinks, redirections) require sharp technical expertise and an overall vision that is difficult to master alone.

❓ Frequently Asked Questions

Bloquer une page par robots.txt empêche-t-il Google de la voir dans les backlinks ?

Non. Google détecte les backlinks vers une page bloquée, mais ne peut pas crawler son contenu ni suivre ses liens internes. Le jus reste bloqué à l'entrée.

Peut-on bloquer une page par robots.txt et la rediriger en 301 en même temps ?

Oui. La redirection 301 fonctionne avant que Googlebot ne lise le robots.txt. Les backlinks vers la page bloquée se transmettent normalement vers l'URL cible.

Une page en noindex transmet-elle le jus de ses backlinks ?

Oui, car Google peut crawler la page et suivre ses liens internes. Le noindex empêche l'indexation, pas la circulation du PageRank.

Comment identifier les pages bloquées qui reçoivent des backlinks ?

Exportez les URLs bloquées par robots.txt, puis croisez avec votre profil de backlinks dans Ahrefs, Majestic ou Google Search Console. Les outils de crawl comme Screaming Frog facilitent cette analyse.

Faut-il débloquer toutes les pages qui reçoivent des backlinks ?

Pas nécessairement. Analysez d'abord la qualité des liens. Si ce sont des backlinks spam, mieux vaut désavouer et éventuellement garder le blocage. Sinon, débloquez ou redirigez en 301.

🏷 Related Topics

robots.txt backlinks PageRank crawl budget noindex redirections 301 maillage interne jus SEO

Domain Age & History Content Crawl & Indexing Links & Backlinks

🎥 From the same video 45

Other SEO insights extracted from this same Google Search Central video · duration 1h14 · published on 11/12/2020

🎥 Watch the full video on YouTube →

Related statements

« Previous

Hidden text for accessibility is not considered sp...

Old Press Release Links Hold Little Value...

« Back to results