Official statement
Other statements from this video 16 ▾
- □ Faut-il vraiment supprimer les balises meta keywords de votre site ?
- □ Faut-il modifier la date lastmod du sitemap à chaque mise à jour mineure ?
- □ Pourquoi Google ignore-t-il votre meta description alors que vous l'avez soigneusement rédigée ?
- □ Faut-il vraiment nettoyer les backlinks spammés de votre profil de liens ?
- □ Faut-il encore optimiser la densité de mots-clés pour le SEO ?
- □ Le désaveu de liens suffit-il à récupérer vos positions perdues après une pénalité ?
- □ Pourquoi les redirections 301 restent-elles le nerf de la guerre lors d'un changement de domaine ?
- □ Un code 404 ciblé sur Googlebot peut-il bloquer l'indexation de vos pages ?
- □ Faut-il vraiment avoir le même contenu sur mobile et desktop pour l'indexation mobile-first ?
- □ Faut-il vraiment demander la suppression des URLs redirigées de l'index Google ?
- □ Vérifier son site dans Search Console améliore-t-il vraiment son référencement ?
- □ Pourquoi Google refuse-t-il le contenu multilingue dynamique sur une même URL ?
- □ Que se passe-t-il quand vos liens hreflang ne se valident pas tous ?
- □ Les liens footer « Made by X » sont-ils vraiment sans danger pour votre SEO ?
- □ Comment configurer correctement les balises canonical et alternate pour un site m-dot ?
- □ Les données EXIF des images sont-elles inutiles pour le SEO ?
Google confirms that having the same URLs in both a news sitemap and a general sitemap poses no technical problem. Mueller clarifies that it's not ideal and recommends separate sitemaps to simplify management, but no penalty or malfunction should be feared if duplication occurs.
What you need to understand
Why does this question about duplicate URLs come up?
News sites often need to manage two types of sitemaps: a general sitemap that lists all pages on the site, and a news-specific sitemap in Google News format.
The temptation is great to include recent articles in both files. Some CMS platforms do this automatically, creating a gray area about potential risks.
What exactly does Mueller say about the technical functioning?
Google's position is clear: no technical problem results from this duplication. The search engine knows how to handle identical URLs present in multiple sitemaps.
The "not ideal" qualifier used by Mueller relates more to optimizing management than to any impact on crawling or indexing. This is an important nuance — it's not an error, just an approach that could be improved.
What's the difference between "causes no problem" and "not ideal"?
Google distinguishes two levels here: functional impact (none) and organizational efficiency (improvable). Your site will not be penalized, your pages will be crawled normally.
The "not ideal" refers to maintenance simplicity: separating feeds makes it easier to track errors, analyze logs, and avoid unnecessary redundancies in Search Console.
- No negative impact on crawling or indexing if you duplicate URLs
- Separating sitemaps improves management clarity and monitoring
- Google treats each sitemap independently — duplicates are simply ignored
- The news sitemap has specific criteria (freshness, format) that the general sitemap doesn't have
SEO Expert opinion
Does this statement match real-world observations?
Yes, and it's consistent with how Googlebot works. In practice, sites that have URLs present in multiple sitemaps experience no crawling anomalies.
Server logs confirm that Google doesn't crawl the same URL twice just because it appears in two sitemaps. The search engine intelligently deduplicates. Let's be honest: this isn't where your indexing problems lie.
Why does Mueller still say it's "not ideal"?
It's a pragmatic position that reflects Google's experience. Separating sitemaps avoids confusion when diagnosing errors or analyzing performance in Search Console.
If your news sitemap contains 50,000 URLs instead of 1,000 recent articles, you dilute the freshness signal. And that's where it gets stuck — not technically, but strategically.
The news sitemap should remain focused on recent news content (typically 48 hours-7 days). Mixing it with your entire catalog dilutes its effectiveness.
In what cases could this duplication still cause problems?
If your news sitemap becomes huge because of duplicates, you risk exceeding the 50,000 URL or 50 MB per file limit. Technically, this remains manageable with sitemap indexes, but you unnecessarily complicate things.
Another point: some SEO analysis or monitoring tools will flag these duplicates as anomalies, generating noise in your reports. [To verify]: the actual impact on crawl budget prioritization remains unclear in this statement — Mueller doesn't say whether Google allocates resources differently depending on the sitemap type.
Practical impact and recommendations
What should you actually do with your sitemaps?
The recommendation remains simple: separate your sitemaps even if it's not mandatory. Your general sitemap contains all your pages, your news sitemap only recent articles eligible for Google News.
If you're in a situation where URLs overlap (CMS configured that way, technical constraint), don't panic. It works — but plan a migration to clean architecture when possible.
How should you properly structure sitemaps for a news site?
The news sitemap should contain only articles published in the last 2-7 days. Strict format with mandatory <news:news> tags.
The general sitemap includes everything: recent articles, archives, editorial pages, categories. No time limit, just the size limit (50,000 URLs per file).
Use a sitemap index to group both types together. Search Console lets you submit multiple sitemaps — take advantage of it to segment clearly.
What mistakes should you avoid in this configuration?
- Don't let your news sitemap grow indefinitely — implement automatic purging of content older than 7 days
- Don't duplicate out of laziness: if your CMS automatically generates duplicates, look for a configuration option to separate feeds
- Verify in Search Console that both sitemaps are properly recognized and treated separately
- Make sure your news sitemap properly follows the Google News XML format with specific tags
- Monitor your file sizes — a news sitemap of 40 MB likely signals a configuration problem
❓ Frequently Asked Questions
Googlebot crawle-t-il deux fois la même URL si elle apparaît dans deux sitemaps différents ?
Quel délai maximum pour garder un article dans le sitemap news ?
Peut-on soumettre plusieurs sitemaps news pour un même site ?
Les erreurs dans le sitemap news affectent-elles l'indexation générale du site ?
Faut-il obligatoirement un sitemap news pour apparaître dans Google Actualités ?
🎥 From the same video 16
Other SEO insights extracted from this same Google Search Central video · published on 31/01/2023
🎥 Watch the full video on YouTube →
💬 Comments (0)
Be the first to comment.