Official statement
Google cannot cache POST requests, unlike GET requests. As a result, each crawl re-fetches these resources in full, which eats into your crawl budget. If your pages rely on POST APIs to display content, you pay the price every time the bot visits.
What you need to understand
Why doesn't Google cache POST requests?
The difference between GET and POST is not just a technical convention — it carries a semantic intention. GET requests are meant to be idempotent: same URL, same result, every time. Google can safely store the response in cache and serve it again on the next crawl.
POST requests, on the other hand, are designed to modify state or transmit variable data. Googlebot cannot assume that two identical calls will return the same content, so it cannot cache the response.
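To make the semantic difference concrete, here is a minimal sketch with hypothetical endpoints: the first call is a pure read that any intermediary can safely reuse, the second creates something and can never be safely replayed from a cache.

```typescript
// Read: idempotent GET. Calling it twice changes nothing, so a stored
// response can be reused (by a browser, a CDN, or Googlebot).
const product = await fetch("https://www.example.com/api/products/42")
  .then((r) => r.json());

// Write: POST. Replaying it would create a second order, so no intermediary
// may serve a stored response; every call has to reach the server.
const order = await fetch("https://www.example.com/api/orders", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ productId: 42, quantity: 1 }),
}).then((r) => r.json());
```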
How does this impact crawl budget?
Each POST request must be executed in full with every crawl. No shortcuts, no 304 Not Modified, no reuse of a previous response. If your page loads 10 POST endpoints to assemble its DOM, the bot must query all of them on every visit.
Crawl budget is a limited envelope of requests that Google agrees to perform on your site within a given time frame. The more resources you consume per page, the less is left to explore other URLs — or to recrawl your important pages more frequently.
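A rough back-of-the-envelope illustration (the figures are invented for the example; Google publishes no such numbers) shows how the per-page resource cost directly reduces the number of URLs crawled:

```typescript
// Illustrative figures only: suppose Googlebot grants ~5,000 fetches per day.
const dailyFetchBudget = 5_000;

// A page whose HTML is self-sufficient costs 1 fetch (subresources are cached).
const pagesPerDayIfCached = dailyFetchBudget / 1;           // 5,000 pages/day

// A page assembled from 10 uncacheable POST endpoints costs 11 fetches per visit.
const pagesPerDayIfPostHeavy = dailyFetchBudget / (1 + 10); // ≈ 455 pages/day
```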
Which architectures are particularly at risk?
Single Page Applications (SPAs) that assemble their content via API calls are the first targets. If these calls go through POST, often out of habit or bad practice, the bot experiences the full latency on every visit.
Headless or JAMstack sites that rely on external APIs to hydrate pages on the client side face the same risk. Crawling becomes expensive, slow, and Google may decide to return less often.
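As an illustration of the anti-pattern, here is a sketch of client-side hydration going through POST. A GraphQL endpoint is used as the hypothetical example, since GraphQL clients send reads as POST by default; the indexable content only exists once this uncacheable call resolves.

```typescript
// Hypothetical SPA hydration: the product name and price that should be
// indexed only appear after this POST resolves.
async function renderProductPage(productId: number): Promise<void> {
  const res = await fetch("/api/graphql", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      query: `{ product(id: ${productId}) { name price } }`,
    }),
  });
  const { data } = await res.json();

  // Googlebot must re-execute this uncacheable POST on every crawl before
  // it can see the content at all.
  document.querySelector("h1")!.textContent = data.product.name;
  document.querySelector(".price")!.textContent = `${data.product.price} €`;
}
```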
- GET = cacheable: Google reuses previous responses if nothing has changed
- POST = always fresh: each request is executed in full, even if the content is identical
- A site with many POST calls consumes more crawl budget per page
- Modern JS architectures (SPA, headless) are particularly exposed if misconfigured
SEO Expert opinion
Is this statement consistent with what we observe in the field?
Yes, and it has been documented in the HTTP specs for years. This is not a Google quirk; it is an intrinsic property of the protocol. POST requests are not idempotent, so they are not cacheable by default (unless caching is explicitly allowed by specific headers, which remains rare).
On the crawl budget side, field observations confirm: sites that abuse POST requests to load content see their crawl frequency stagnate, especially if API latency is high. Google cannot afford to wait 500 ms per POST endpoint across millions of pages.
What nuance should be added to this statement?
Martin Splitt doesn't specify how critical this overconsumption is. Is it 10% extra crawl budget? 50%? It depends on the number of POST calls, their latency, the size of responses. [To verify]: Google has never published a detailed benchmark on this topic.
Second nuance: not all POST requests are equal. A POST that consistently returns the same JSON can technically be cached server-side with a well-configured CDN. Googlebot will then see an instant response, even if the initial request is a POST. But you have to set that up yourself — Google won't do it for you.
In which cases does this rule not pose a problem?
If your POST pages only serve non-indexable user actions (contact forms, cart, checkout), no SEO impact. Google doesn't crawl these interactions — or shouldn't, if you've properly marked them noindex.
Another case: sites with excess crawl budget. A 50-page blog can afford a few misplaced POST requests without it changing anything. The problem really arises on large e-commerce catalogs or content portals with tens of thousands of URLs.
Practical impact and recommendations
What should you audit first on your site?
Start by identifying all API calls that participate in rendering indexable content. Chrome DevTools, Network tab, filter by "Fetch/XHR" and track the POST requests. If an endpoint serves critical SEO content (titles, descriptions, prices, reviews), it should switch to GET.
Next, check the latency of these requests. A POST that responds in 50 ms is less of a concern than a GET that takes 2 seconds. But at equal latency, GET always wins — so you might as well switch everything that can be switched.
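If you want to repeat the DevTools check across several templates, a small headless-browser script can list the POST calls fired during rendering. This is only a sketch, assuming Puppeteer is available; the URL is a placeholder.

```typescript
import puppeteer from "puppeteer";

// List every POST fetch/XHR call fired while a page renders, so you can
// check whether any of them serve indexable content.
async function listPostCalls(url: string): Promise<void> {
  const browser = await puppeteer.launch();
  const page = await browser.newPage();

  page.on("request", (req) => {
    const isApiCall = req.resourceType() === "xhr" || req.resourceType() === "fetch";
    if (isApiCall && req.method() === "POST") {
      console.log(`POST ${req.url()}`);
    }
  });

  await page.goto(url, { waitUntil: "networkidle0" });
  await browser.close();
}

listPostCalls("https://www.example.com/category/shoes").catch(console.error);
```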
How do you convert a POST to a GET without breaking everything?
Most of the time, it's a matter of convention. If your POST only retrieves data without modifying state server-side, it can become a GET. Pass parameters in the URL or query string rather than in the body.
If you need to transmit a lot of data (for example, complex filters), consider a hash system: store the request server-side, return an identifier, and call that identifier via GET. Google will then be able to cache the response.
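A minimal before/after sketch, with hypothetical endpoints: the same read expressed as POST, then as a cache-friendly GET, plus the identifier pattern for filter sets too large for a query string.

```typescript
// Before: a read-only search sent as POST. Googlebot cannot reuse the response.
const before = await fetch("/api/products/search", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ category: "shoes", color: "red", page: 2 }),
}).then((r) => r.json());

// After: the same read as GET, parameters in the query string. The response
// can now carry Cache-Control/ETag and be reused between crawls.
const params = new URLSearchParams({ category: "shoes", color: "red", page: "2" });
const after = await fetch(`/api/products/search?${params}`).then((r) => r.json());

// Identifier pattern for complex filters: POST the filter definition once,
// receive a short id, then fetch the results through a cacheable GET.
const { filterId } = await fetch("/api/filters", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ facets: { size: [40, 41], brand: ["acme"] } }),
}).then((r) => r.json());

const results = await fetch(`/api/products/search?filter=${filterId}`)
  .then((r) => r.json());
```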
What mistakes should you absolutely avoid?
Don't switch all your POST requests to GET without thinking. Actions that modify data (adding to cart, form submission, voting) must remain POST for security and HTTP semantics reasons.
Another trap: converting a POST to a GET without adjusting cache headers server-side. If your API returns Cache-Control: no-cache, Google won't cache anything even with a GET. Properly configure your ETag and Last-Modified.
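As a server-side sketch (Express is assumed here; the endpoint and values are hypothetical), the handler below exposes Cache-Control, ETag and Last-Modified, and answers 304 when the crawler revalidates an unchanged response.

```typescript
import express from "express";
import { createHash } from "crypto";

const app = express();

app.get("/api/products/:id", (req, res) => {
  // In a real application this would come from the database, along with
  // its actual last-modified timestamp.
  const product = { id: req.params.id, name: "Demo product", price: 49.9 };
  const updatedAt = new Date("2022-08-25T00:00:00Z");

  const body = JSON.stringify(product);
  const etag = `"${createHash("sha1").update(body).digest("hex")}"`;

  // Revalidation: if the client already holds this version, answer 304
  // without re-sending the body.
  if (req.headers["if-none-match"] === etag) {
    res.status(304).end();
    return;
  }

  res
    .set("Cache-Control", "public, max-age=300")
    .set("ETag", etag)
    .set("Last-Modified", updatedAt.toUTCString())
    .type("application/json")
    .send(body);
});

app.listen(3000);
```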
- Audit all API calls that serve indexable content
- Identify which ones are POST when they could be GET
- Measure the latency of each endpoint to prioritize optimizations
- Switch read-only (idempotent) requests to GET
- Configure cache headers server-side (Cache-Control, ETag)
- Verify that Googlebot can properly execute your APIs (no CORS or firewall blocking)
- Monitor crawl budget evolution in Search Console after making changes
❓ Frequently Asked Questions
Is a site with only a few POST requests penalized by Google?
Can you force Google to cache a POST response?
Do modern JS frameworks use POST by default?
How can you tell whether your POST requests are a problem?
Can a CDN compensate for the lack of Google-side caching on POST requests?
🎥 From the same video
Other SEO insights were extracted from the same Google Search Central video, published on 25/08/2022.