What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 3 questions

Less than 30 seconds. Find out how much you really know about Google search.

🕒 ~30s 🎯 3 questions 📚 SEO Google

Official statement

Including the same URLs in both a news sitemap and a general sitemap causes no problems, although it's not ideal. It's generally simpler to have separate sitemaps.
🎥 Source video

Extracted from a Google Search Central video

💬 EN 📅 31/01/2023 ✂ 17 statements
Watch on YouTube →
Other statements from this video 16
  1. Faut-il vraiment supprimer les balises meta keywords de votre site ?
  2. Faut-il modifier la date lastmod du sitemap à chaque mise à jour mineure ?
  3. Pourquoi Google ignore-t-il votre meta description alors que vous l'avez soigneusement rédigée ?
  4. Faut-il vraiment nettoyer les backlinks spammés de votre profil de liens ?
  5. Faut-il encore optimiser la densité de mots-clés pour le SEO ?
  6. Le désaveu de liens suffit-il à récupérer vos positions perdues après une pénalité ?
  7. Pourquoi les redirections 301 restent-elles le nerf de la guerre lors d'un changement de domaine ?
  8. Un code 404 ciblé sur Googlebot peut-il bloquer l'indexation de vos pages ?
  9. Faut-il vraiment avoir le même contenu sur mobile et desktop pour l'indexation mobile-first ?
  10. Faut-il vraiment demander la suppression des URLs redirigées de l'index Google ?
  11. Vérifier son site dans Search Console améliore-t-il vraiment son référencement ?
  12. Pourquoi Google refuse-t-il le contenu multilingue dynamique sur une même URL ?
  13. Que se passe-t-il quand vos liens hreflang ne se valident pas tous ?
  14. Les liens footer « Made by X » sont-ils vraiment sans danger pour votre SEO ?
  15. Comment configurer correctement les balises canonical et alternate pour un site m-dot ?
  16. Les données EXIF des images sont-elles inutiles pour le SEO ?
📅
Official statement from (3 years ago)
TL;DR

Google confirms that having the same URLs in both a news sitemap and a general sitemap poses no technical problem. Mueller clarifies that it's not ideal and recommends separate sitemaps to simplify management, but no penalty or malfunction should be feared if duplication occurs.

What you need to understand

Why does this question about duplicate URLs come up?

News sites often need to manage two types of sitemaps: a general sitemap that lists all pages on the site, and a news-specific sitemap in Google News format.

The temptation is great to include recent articles in both files. Some CMS platforms do this automatically, creating a gray area about potential risks.

What exactly does Mueller say about the technical functioning?

Google's position is clear: no technical problem results from this duplication. The search engine knows how to handle identical URLs present in multiple sitemaps.

The "not ideal" qualifier used by Mueller relates more to optimizing management than to any impact on crawling or indexing. This is an important nuance — it's not an error, just an approach that could be improved.

What's the difference between "causes no problem" and "not ideal"?

Google distinguishes two levels here: functional impact (none) and organizational efficiency (improvable). Your site will not be penalized, your pages will be crawled normally.

The "not ideal" refers to maintenance simplicity: separating feeds makes it easier to track errors, analyze logs, and avoid unnecessary redundancies in Search Console.

  • No negative impact on crawling or indexing if you duplicate URLs
  • Separating sitemaps improves management clarity and monitoring
  • Google treats each sitemap independently — duplicates are simply ignored
  • The news sitemap has specific criteria (freshness, format) that the general sitemap doesn't have

SEO Expert opinion

Does this statement match real-world observations?

Yes, and it's consistent with how Googlebot works. In practice, sites that have URLs present in multiple sitemaps experience no crawling anomalies.

Server logs confirm that Google doesn't crawl the same URL twice just because it appears in two sitemaps. The search engine intelligently deduplicates. Let's be honest: this isn't where your indexing problems lie.

Why does Mueller still say it's "not ideal"?

It's a pragmatic position that reflects Google's experience. Separating sitemaps avoids confusion when diagnosing errors or analyzing performance in Search Console.

If your news sitemap contains 50,000 URLs instead of 1,000 recent articles, you dilute the freshness signal. And that's where it gets stuck — not technically, but strategically.

The news sitemap should remain focused on recent news content (typically 48 hours-7 days). Mixing it with your entire catalog dilutes its effectiveness.

In what cases could this duplication still cause problems?

If your news sitemap becomes huge because of duplicates, you risk exceeding the 50,000 URL or 50 MB per file limit. Technically, this remains manageable with sitemap indexes, but you unnecessarily complicate things.

Another point: some SEO analysis or monitoring tools will flag these duplicates as anomalies, generating noise in your reports. [To verify]: the actual impact on crawl budget prioritization remains unclear in this statement — Mueller doesn't say whether Google allocates resources differently depending on the sitemap type.

Attention: Don't confuse "absence of problem" with "absence of impact". A news sitemap polluted by thousands of old URLs loses its freshness signal role, even if Google doesn't penalize you for it.

Practical impact and recommendations

What should you actually do with your sitemaps?

The recommendation remains simple: separate your sitemaps even if it's not mandatory. Your general sitemap contains all your pages, your news sitemap only recent articles eligible for Google News.

If you're in a situation where URLs overlap (CMS configured that way, technical constraint), don't panic. It works — but plan a migration to clean architecture when possible.

How should you properly structure sitemaps for a news site?

The news sitemap should contain only articles published in the last 2-7 days. Strict format with mandatory <news:news> tags.

The general sitemap includes everything: recent articles, archives, editorial pages, categories. No time limit, just the size limit (50,000 URLs per file).

Use a sitemap index to group both types together. Search Console lets you submit multiple sitemaps — take advantage of it to segment clearly.

What mistakes should you avoid in this configuration?

  • Don't let your news sitemap grow indefinitely — implement automatic purging of content older than 7 days
  • Don't duplicate out of laziness: if your CMS automatically generates duplicates, look for a configuration option to separate feeds
  • Verify in Search Console that both sitemaps are properly recognized and treated separately
  • Make sure your news sitemap properly follows the Google News XML format with specific tags
  • Monitor your file sizes — a news sitemap of 40 MB likely signals a configuration problem
In practice? Separate your sitemaps for clarity, but don't worry if URLs end up in both — Google handles it. Sitemap architecture may seem trivial, but it often reveals deeper issues with content management and editorial strategy. If you notice your current configuration generates inconsistencies or if you want to fine-tune your Google News presence, the expertise of a specialized SEO agency can save you time and avoid costly mistakes.

❓ Frequently Asked Questions

Googlebot crawle-t-il deux fois la même URL si elle apparaît dans deux sitemaps différents ?
Non. Google déduplique automatiquement les URLs avant le crawl. Une URL présente dans plusieurs sitemaps ne sera crawlée qu'une seule fois selon la logique habituelle de priorisation du moteur.
Quel délai maximum pour garder un article dans le sitemap news ?
Google recommande de ne conserver que les contenus publiés dans les 2 derniers jours, avec une tolérance jusqu'à 7 jours maximum. Au-delà, retirez-les du sitemap news même s'ils restent dans le sitemap général.
Peut-on soumettre plusieurs sitemaps news pour un même site ?
Oui, c'est possible et parfois nécessaire si vous dépassez la limite de 1 000 URLs par sitemap (recommandation Google News). Utilisez un index de sitemaps pour les regrouper.
Les erreurs dans le sitemap news affectent-elles l'indexation générale du site ?
Non. Les sitemaps news et généraux sont traités indépendamment. Une erreur dans le sitemap news n'impacte pas le crawl des autres pages via le sitemap principal.
Faut-il obligatoirement un sitemap news pour apparaître dans Google Actualités ?
Non, ce n'est pas obligatoire mais fortement recommandé. Le sitemap news accélère la découverte et l'indexation des contenus frais, ce qui est crucial pour l'actualité.
🏷 Related Topics
Crawl & Indexing Discover & News Domain Name Search Console

🎥 From the same video 16

Other SEO insights extracted from this same Google Search Central video · published on 31/01/2023

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.