What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

Google can crawl certain protocols in addition to HTTP and HTTPS, such as FTP. However, it is best to stick to HTTP or HTTPS for pages to ensure optimal indexing.
0:01
🎥 Source video

Extracted from a Google Search Central video

⏱ 0:32 💬 EN 📅 14/01/2011
Watch on YouTube (0:01) →
📅
Official statement from (15 years ago)
TL;DR

Google can technically crawl certain protocols like FTP, but strongly advises against using them for web content. To ensure reliable indexing, HTTP and HTTPS remain the only truly supported protocols. Anything outside these two standards risks partial indexing or outright exclusion.

What you need to understand

Can Google really index FTP content or other protocols?

The official answer is yes, technically Google can crawl FTP. In practice, this capability has existed since the early days of the engine, when public FTP servers hosted vast amounts of PDF documents, archives, and text files.

But this "can" doesn't mean "should." The distinction is crucial. Google's crawler is optimized for HTTP/HTTPS, not for other protocols. FTP, gopher, or even some proprietary protocols can be read, but there is no guarantee of equivalent processing.

Why does Google maintain this technical compatibility if it is discouraged?

Technical legacy above all. Millions of FTP links still exist on the web, especially in academic archives, technical forums, old documentation. Ignoring these protocols entirely would create gaps in the index.

That said, support remains minimal. No JavaScript rendering, obviously, no tracking of complex redirects, no quality signals comparable to those of a modern HTTPS page. A file on FTP may appear in the results, but it will come with a huge disadvantage in terms of ranking.

What is the concrete difference between "crawling" and "optimal indexing"?

Crawling is retrieving the content. Optimal indexing means analyzing all modern signals: Core Web Vitals, HTTPS, structured data, crawl rendering, internal links, domain authority, freshness, mobile-first, etc.

An FTP file would be crawled like a plain text document. No rich semantic context, no analyzed layout, no measured UX. It is akin to indexing a black-and-white photocopy of a color magazine: technically possible but factually useless for ranking properly.

  • HTTP/HTTPS benefit from hundreds of ranking signals that other protocols completely ignore
  • FTP and other protocols are crawled without rendering, without behavioral analysis, without performance metrics
  • Switching to HTTPS is an explicit trust signal, absent in alternative protocols
  • Search Console and Analytics tools only work with HTTP/HTTPS, making any SEO management impossible elsewhere
  • No support for modern standards (schema.org, OpenGraph, meta robots) on non-web protocols

SEO Expert opinion

Is this statement consistent with field observations?

Absolutely. I have never seen an FTP site rank properly for a competitive query. The rare instances of FTP results involve ultra-niche searches, often exact file names, where no competing HTTP content exists.

Google's wording is actually diplomatic but unambiguous. "Can crawl" does not mean "will treat fairly." It's a technical tolerance, not a strategic recommendation. In 15 years of practice, I have never had to defend an FTP choice to a client, and that's a good thing.

In what cases might this rule not apply strictly?

Let's be honest: no legitimate business case justifies using FTP for production web content. The only theoretical scenarios where it might make sense are historical archives accessible only in read mode, where preservation is prioritized over ranking.

But even then, the best practice remains to expose that content via an HTTP interface with enriched metadata. An HTTP wrapper around backend FTP resources provides the best of both worlds: stable archival on the server side, modern accessibility on the user and engines side.

What nuances should be added to this official position?

The main nuance concerns embedded resources. A HTTPS page can point to an FTP resource (like a technical PDF) without penalizing the host page directly. It’s not optimal, but it’s not catastrophic either.

What matters is that the main navigation, the priority indexable content, and the user experience are on HTTP/HTTPS. If 99% of your site is in HTTPS and 1% of ancillary resources point to legacy FTP, no one is going to downgrade your domain because of it. [To be verified]: no public data confirms the exact impact of outgoing FTP links on the PageRank of an HTTPS page, but empirically, the effect appears negligible as long as the volume remains marginal.

Practical impact and recommendations

What should you do if your site still uses FTP?

Immediate migration to HTTP/HTTPS, with no negotiations possible. Every day on FTP is a day of lost ranking. The migration itself is technically simple: standard web hosting, free SSL certificate (Let's Encrypt), 301 redirects from old FTP URLs if they are referenced elsewhere.

The real work is in the inventory. List all content accessible via FTP, map out incoming external links, prepare a redirect plan. If internal tools or partners link to your FTP resources, warn them in advance of the protocol change.

How can you check that your infrastructure doesn’t expose anything inadvertently in alternative protocols?

Audit open ports using tools like Nmap or Shodan. Ensure that only ports 80 (HTTP) and 443 (HTTPS) respond publicly. Port 21 (FTP) should never be open on the public web, except for specific internal needs with strong authentication.

Also test your site’s URLs by replacing https:// with ftp:// in the address bar. If something loads, you have a configuration leak to seal immediately. Many misconfigured servers inadvertently expose entire directories via anonymous FTP.

What mistakes should be avoided when transitioning to an exclusive HTTP/HTTPS?

First mistake: leaving internal links pointing to old FTP URLs after migration. This creates unnecessary redirect chains and dilutes link juice. Clean up the HTML at the source, not just via .htaccess.

Second mistake: neglecting HTTPS when migrating from FTP. Switching from FTP to HTTP without SSL is a waste. You lose a positive ranking signal and expose your users to browser warnings. HTTPS must be the default standard, without exception, even for static content.

  • Inventory all content currently served via non-HTTP/HTTPS protocols
  • Set up web hosting with native HTTPS support and valid SSL certificate
  • Configure permanent 301 redirects from old FTP URLs to their HTTPS equivalents
  • Close public FTP ports (21, 20) on the firewall if there is no legitimate business need
  • Audit incoming external links pointing to old FTP resources and contact the concerned webmasters
  • Check in Search Console that no FTP URL appears in coverage or error reports
The transition from a legacy protocol to modern HTTPS is technically simple but requires rigor and method. Between the initial audit, server configuration, redirects, and post-migration monitoring, these optimizations can quickly become complex if your infrastructure is heterogeneous or if you lack internal technical resources. Engaging a specialized SEO agency can secure this migration without service interruption or loss of positions, while also benefiting from support on ancillary aspects (certificates, performance, optimal server configuration).

❓ Frequently Asked Questions

Google indexe-t-il encore des pages FTP en pratique ?
Oui, techniquement Google peut indexer des ressources FTP, mais elles apparaissent rarement dans les résultats car elles manquent de tous les signaux modernes de ranking. Leur présence se limite à des niches très spécifiques sans concurrence HTTP.
Un lien FTP sur ma page HTTPS peut-il me pénaliser ?
Pas directement, mais c'est une mauvaise pratique. Un lien sortant vers FTP n'apporte aucune valeur SEO et peut dégrader l'expérience utilisateur. Privilégiez toujours des ressources HTTPS pour conserver cohérence et confiance.
Peut-on utiliser des protocoles personnalisés et être indexé ?
Non, en pratique seuls HTTP et HTTPS garantissent une indexation fiable. Les protocoles custom (app://, myprotocol://) ne sont pas crawlables par Googlebot et resteront invisibles pour le moteur.
Les archives FTP académiques disparaissent-elles de Google ?
Pas nécessairement, mais leur visibilité décline naturellement face à des contenus HTTPS mieux structurés. Si vous gérez de telles archives, migrez-les vers une interface web moderne pour préserver leur accessibilité long terme.
Faut-il rediriger les anciennes URLs FTP après migration ?
Oui, absolument. Configurez des redirections 301 depuis ftp://votredomaine.com/fichier vers https://votredomaine.com/fichier pour transférer l'autorité des liens entrants existants et éviter les erreurs 404.
🏷 Related Topics
Domain Age & History Crawl & Indexing HTTPS & Security AI & SEO

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.