Why does Google refuse to read a Robots.txt file with an uppercase letter?

Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

Google only recognizes the robots.txt file with a lowercase 'r'; any other case will not be recognized.

41:18

🎥 Source video

Extracted from a Google Search Central video

⏱ 50:53 💬 EN 📅 21/01/2016 ✂ 14 statements

Watch on YouTube (41:18) →

✂ Other statements from this video 13 ▾

□ Pourquoi Google vous pousse-t-il à poster vos problèmes d'indexation dans son forum ?
1:06 Pourquoi Google impose-t-il les URLs www plutôt que m-dot comme source principale pour les applications ?
2:46 Les pages 404 nuisent-elles vraiment au classement SEO ?
3:26 Comment Google Panda juge-t-il vraiment la qualité de votre contenu ?
6:08 Pourquoi Panda ne fonctionne-t-il pas en temps réel et qu'est-ce que ça change pour votre site ?
10:14 Le budget de crawl dépend-il vraiment de la qualité du contenu ?
12:32 La vitesse mobile affecte-t-elle vraiment le classement Google ou est-ce un mythe SEO ?
14:16 Le deep linking fonctionne-t-il sans site mobile m-dot ?
15:24 La personnalisation des résultats Google repose-t-elle vraiment sur votre historique de navigation ?
25:39 AdWords booste-t-il vraiment votre référencement naturel ?
26:11 Pourquoi vos redirections mobile-desktop cassent-elles votre SEO sans que vous le sachiez ?
33:59 Les liens de faible qualité peuvent-ils vraiment pénaliser votre site ?
40:11 Un site hors ligne perd-il son référencement Google ?

📅

Official statement from January 21, 2016 (10 years ago)

⚠ A more recent statement exists on this topic Should You Block Crawling of the robots.txt File in the robots.txt Itself? John Mueller · July 2, 2018 View statement →

TL;DR

Google only recognizes robots.txt written with a lowercase 'r'. Any other case (Robots.txt, ROBOTS.TXT) will be ignored by the crawler, rendering the file invisible. This strict rule reflects the historical specification of the Robots Exclusion Protocol (REP) and requires precise naming of the file to ensure it is correctly interpreted by Googlebot.

What you need to understand

What is the origin of this case requirement?

The Robots Exclusion Protocol (REP) was formalized in 1994 by Martijn Koster, and the original specification mandated robots.txt in strict lowercase. This technical convention stems from the Unix systems of the time, where file names are case-sensitive.

Google upholds this rule not out of outdated practices, but out of respect for the RFC 9309 standard published in September 2022. The engine makes no exceptions: a file named Robots.txt, ROBOTS.TXT, or RoBots.TxT will simply be ignored as if it does not exist. Googlebot will only look for /robots.txt at the root of the domain.

How do web servers respond to this constraint?

Case sensitivity depends on the operating system of the server. On Linux/Unix, robots.txt and Robots.txt are two distinct files. On Windows/IIS, the NTFS file system is case-insensitive by default, but Apache or Nginx will serve the exact requested file.

This discrepancy creates tricky situations: a Windows administrator can create Robots.txt, test it locally with success, but find that Googlebot cannot find it in production. The error is silent, with no alerts in Search Console explicitly indicating this case confusion.

What are the practical consequences of incorrect case?

An incorrectly named robots.txt file equates to a total absence of the file. Googlebot will interpret this absence as universal permission to crawl. All your Disallow directives, specific User-agents, or references to the XML sitemap will be lost.

Specifically, sensitive pages (staging, admin, session parameters) may be crawled and indexed. The crawl budget can be wasted on unnecessary URLs. And if you block CSS/JS resources via robots.txt for testing, that protection disappears instantly.

robots.txt (strict lowercase): the only version recognized by Google
Robots.txt, ROBOTS.TXT: completely ignored, no error message
Server sensitivity: Linux/Unix distinguishes case, Windows IIS may hide the issue locally
Search Console: the robots.txt testing tool works only if the file is correctly named
Crawl impact: absence = total permission, loss of all directives

SEO Expert opinion

Is this statement consistent with field observations?

Absolutely. Tests confirm that Google applies this rule without exception or grace period. A migration audit regularly shows sites losing their crawl restrictions after a server change simply because the new sysadmin created the file with an uppercase letter.

The behavior is consistent across all Google User-agents (Googlebot, Googlebot-Image, Google-InspectionTool). Strict case is not a technical preference; it is a mandatory RFC compliance. Bing, for reference, applies the same rule — no major engine makes exceptions.

What nuances should be considered regarding this seemingly simple rule?

The apparent simplicity hides server configuration traps. Some shared Windows hosting environments serve the file regardless of the requested case, creating a false impression of compatibility. Therefore, a manual test in a browser is insufficient.

Another subtlety: 301 redirects to robots.txt. If /Robots.txt redirects to /robots.txt, Google will follow this redirect and read the correct file. But this workaround introduces unnecessary latency and consumes crawl budget. It's better to fix the source. [To be verified] if Google tolerates permanent redirects without response time penalties in all contexts.

In what cases can this rule cause practical issues?

CMS migrations frequently generate recurring errors. An export from WordPress to an IIS server may automatically create Robots.txt if the script uses .NET conventions. CI/CD deployment systems that create files via scripts must explicitly enforce lowercase casing.

Multi-domain environments with dynamically generated robots.txt files must ensure the template respects the case. A single misconfigured domain in a pool of 50 sites can go unnoticed for months until mass indexing of staging pages signals the problem.

Windows IIS servers require heightened vigilance: the local file system can obscure case errors, but Apache/Nginx in production will expose them brutally.

Practical impact and recommendations

How can you check if your robots.txt file is correctly named?

First step: test the exact URL https://yourdomain.com/robots.txt (strict lowercase) in a private browsing window. The file should display with a Content-Type of text/plain and an HTTP status of 200. If you get a 404, the file does not exist or is improperly named.

Next, test with an uppercase: https://yourdomain.com/Robots.txt. If this URL also works, your server is case-insensitive (typical of Windows IIS), but Google will only search for the lowercase version. Therefore, you must rename the file via SSH/FTP or through your hosting panel.

What mistakes should be avoided when creating or modifying the file?

Never create the file via a Windows editor that might impose a hidden extension (.txt.txt) or a default case. Use a code editor with UTF-8 without BOM encoding. Check Unix permissions (644 minimum) to ensure readability by the web server.

CDN caching systems (Cloudflare, Fastly) may serve an old version of the file if you rename without purging the cache. After any case change, force a complete cache purge and wait 5-10 minutes before testing with the Search Console tool.

How can you ensure compliance across multiple sites?

For a multi-site infrastructure, script an automated check using curl or wget that verifies the presence of /robots.txt (lowercase) and the absence of /Robots.txt. Integrate this check into CI/CD pipelines to block non-compliant deployments.

Agencies managing client portfolios should systematically audit the case of the file during onboarding. A simple Python script can test 100 domains in a few minutes and report naming anomalies before they impact crawling.

Manually test https://domain.com/robots.txt (lowercase) → expected status 200
Test https://domain.com/Robots.txt → should ideally return a 404
Check UTF-8 without BOM encoding and Unix permissions 644
Purge CDN cache after any file renaming
Validate content using the Search Console robots.txt tool
Automate case verification in CI/CD deployment scripts

Strict compliance with the lowercase robots.txt case is non-negotiable. A silent error can expose hundreds of sensitive pages to crawling without visible alerts. For complex infrastructures or critical migrations, assistance from a specialized SEO agency can help detect these configuration traps before they affect organic visibility. A complete technical audit identifies these invisible details that make the difference between controlled crawling and wasted budget.

❓ Frequently Asked Questions

Googlebot lit-il un fichier nommé ROBOTS.TXT en majuscules ?

Non, Google ignore totalement toute casse autre que robots.txt (minuscules strictes). Un fichier ROBOTS.TXT sera traité comme inexistant.

Un serveur Windows IIS peut-il poser problème pour la casse du fichier ?

Oui, NTFS est insensible à la casse localement, mais le serveur web doit servir exactement /robots.txt. Un test local peut masquer une erreur visible en production.

Une redirection 301 de Robots.txt vers robots.txt fonctionne-t-elle ?

Google suivra la redirection et lira le fichier correct, mais cette solution introduit une latence inutile. Mieux vaut renommer directement le fichier.

Search Console signale-t-il explicitement une erreur de casse du fichier robots.txt ?

Non, aucune alerte spécifique n'est émise. L'outil de test robots.txt ne fonctionnera simplement pas si le fichier est mal nommé, comme s'il était absent.

Les autres moteurs de recherche appliquent-ils la même règle de casse stricte ?

Oui, Bing et les moteurs conformes RFC 9309 exigent également robots.txt en minuscules. Aucun moteur majeur ne tolère une casse différente.

🏷 Related Topics

robots.txt crawl protocole REP Googlebot configuration serveur nomenclature fichier RFC 9309 audit technique

Crawl & Indexing PDF & Files

🎥 From the same video 13

Other SEO insights extracted from this same Google Search Central video · duration 50 min · published on 21/01/2016

🎥 Watch the full video on YouTube →

Related statements

« Previous

Encouragement to Report Technical Issues in the Fo...

Panda does not operate in real time...

« Back to results