What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 3 questions

Less than 30 seconds. Find out how much you really know about Google search.

🕒 ~30s 🎯 3 questions 📚 SEO Google

Official statement

The internal links report in Search Console is based on a sample of pages from the site, not all indexed pages. It is not on the same level as the index coverage report.
152:31
🎥 Source video

Extracted from a Google Search Central video

⏱ 996h50 💬 EN 📅 12/03/2021 ✂ 43 statements
Watch on YouTube (152:31) →
Other statements from this video 42
  1. 42:49 Peut-on vraiment utiliser hreflang entre plusieurs domaines distincts ?
  2. 48:45 Peut-on vraiment utiliser hreflang entre plusieurs domaines distincts ?
  3. 58:47 Faut-il vraiment éviter de dupliquer son contenu sur deux sites distincts ?
  4. 58:47 Faut-il vraiment éviter de créer plusieurs sites pour le même contenu ?
  5. 91:16 Faut-il vraiment indexer les pages de recherche interne de votre site ?
  6. 91:16 Faut-il bloquer les pages de recherche interne pour éviter l'indexation d'un espace infini ?
  7. 125:44 Les Core Web Vitals influencent-ils vraiment le budget de crawl de Google ?
  8. 125:44 Réduire la taille de page améliore-t-il vraiment le budget crawl ?
  9. 152:31 Pourquoi le rapport de liens internes de Search Console ne montre-t-il qu'un échantillon ?
  10. 172:13 Faut-il vraiment s'inquiéter des chaînes de redirections pour le crawl Google ?
  11. 172:13 Combien de redirections Google suit-il réellement avant de fractionner le crawl ?
  12. 201:37 Comment Google segmente-t-il réellement vos Core Web Vitals par groupes de pages ?
  13. 201:37 Comment Google segmente-t-il réellement vos Core Web Vitals par groupes de pages ?
  14. 248:11 AMP ou canonique : qui récolte vraiment les signaux SEO ?
  15. 257:21 Le Chrome UX Report compte-t-il vraiment vos pages AMP en cache ?
  16. 272:10 Faut-il vraiment rediriger vos URLs AMP lors d'un changement ?
  17. 272:10 Faut-il vraiment rediriger vos anciennes URLs AMP vers les nouvelles ?
  18. 294:42 AMP est-il vraiment neutre pour le classement Google ou cache-t-il un levier de visibilité invisible ?
  19. 296:42 AMP est-il vraiment un facteur de classement Google ou juste un ticket d'entrée pour certaines features ?
  20. 342:21 Pourquoi le contenu copié surclasse-t-il parfois l'original malgré le DMCA ?
  21. 342:21 Le DMCA est-il vraiment efficace pour protéger votre contenu dupliqué sur Google ?
  22. 359:44 Pourquoi le contenu copié surclasse-t-il votre contenu original dans Google ?
  23. 409:35 Pourquoi vos featured snippets disparaissent-ils sans raison technique ?
  24. 409:35 Les featured snippets et résultats enrichis fluctuent-ils vraiment par hasard ?
  25. 455:08 Le contenu masqué en responsive mobile est-il vraiment indexé par Google ?
  26. 455:08 Le contenu caché en CSS responsive est-il vraiment indexé par Google ?
  27. 563:51 Les structured data peuvent-elles vraiment forcer l'affichage d'un knowledge panel ?
  28. 563:51 Existe-t-il un balisage structuré qui garantit l'apparition d'un Knowledge Panel ?
  29. 583:50 Pourquoi la plupart des sites n'obtiennent-ils jamais de sitelinks dans Google ?
  30. 583:50 Peut-on vraiment forcer l'affichage des sitelinks dans Google ?
  31. 649:39 Les redirections 301 transfèrent-elles vraiment 100 % du jus SEO sans perte ?
  32. 649:39 Les redirections 301 transfèrent-elles vraiment 100% du PageRank et des signaux SEO ?
  33. 722:53 Faut-il vraiment supprimer ou rediriger les contenus expirés plutôt que de les garder indexables ?
  34. 722:53 Faut-il vraiment supprimer les pages expirées ou peut-on les laisser avec un label 'expiré' ?
  35. 859:32 Les mots-clés dans l'URL : facteur de ranking ou simple béquille temporaire ?
  36. 859:32 Les mots dans l'URL influencent-ils vraiment le classement Google ?
  37. 908:40 Faut-il vraiment ajouter des structured data sur les vidéos YouTube embarquées ?
  38. 909:01 Faut-il vraiment ajouter des données structurées vidéo quand on embed déjà YouTube ?
  39. 932:46 Les Core Web Vitals impactent-ils vraiment le SEO desktop ?
  40. 932:46 Pourquoi Google ignore-t-il les Core Web Vitals desktop dans son algorithme de classement ?
  41. 952:49 L'API et l'interface Search Console affichent-elles vraiment les mêmes données ?
  42. 963:49 Peut-on utiliser des templates différents par version linguistique sans pénaliser son SEO international ?
📅
Official statement from (5 years ago)
TL;DR

Google confirms that the internal links report in Search Console is based on a sample of pages, not the entire indexed site. Unlike the index coverage report, this tool does not guarantee a comprehensive view of your linking structure. For a reliable audit, you need to cross-reference Search Console with third-party tools capable of crawling your entire structure.

What you need to understand

What does "based on a sample" actually mean? <\/h3>

When John Mueller <\/strong> specifies that the internal links report relies on a sample, he openly admits that Search Console does not scan all of your pages <\/strong>. Google selects a representative subset—without specifying the selection criteria or the size of this sample.<\/p>

The problem? If your site has 10,000 pages, you have no guarantee that all 10,000 are analyzed. Orphan pages, deep sections, or rarely crawled URLs <\/strong> are likely to fly under the radar. This is not a bug, it’s a structural limitation of the tool.<\/p>

Why does this approach differ from the index coverage report? <\/h3>

The index coverage report <\/strong> (now "Indexed Pages") documents all the URLs that Google has attempted to index, whether they are valid, excluded, or erroneous. It provides an almost comprehensive view, powered by crawl and indexing logs.<\/p>

The internal links report, however, does not pretend to this exhaustiveness. It offers a statistical overview <\/strong>, useful for detecting macro trends—highly linked pages, isolated pages—but inadequate for a detailed link audit. In other words: don’t count on it to validate that every strategic URL is receiving its internal links.<\/p>

Which pages are likely to escape the sample? <\/h3>

Google does not communicate the rules for selecting the sample, but reasonable hypotheses can be made. Rarely crawled pages, deep pagination URLs, recently published content not yet stabilized <\/strong> in the index are more likely to be ignored.<\/p>

Sites with complex architecture—multi-language, multi-domain, thousands of categories—are particularly exposed. If your strategic linking relies on level 4 or 5 pages <\/strong>, you cannot rely on Search Console to verify their receipt of internal links.<\/p>

  • The internal links report only covers a sample of pages <\/strong>, with no guarantee of completeness.<\/li>
  • It is not designed to replace a comprehensive link audit <\/strong>, unlike the index coverage report, which aims for completeness.<\/li>
  • Deep, orphan, or rarely crawled pages <\/strong> may not appear in the displayed data.<\/li>
  • This tool remains useful for detecting macro trends <\/strong> (over-linked pages, under-linked pages), but cannot serve as an absolute reference.<\/li>
  • A reliable audit requires cross-referencing Search Console with a third-party crawler <\/strong> capable of analyzing your entire structure.<\/li><\/ul>

SEO Expert opinion

Does this statement align with real-world observations? <\/h3>

Yes, and it confirms what many practitioners have empirically found. The numbers in the internal links report never perfectly match those from Screaming Frog or Oncrawl <\/strong>. The discrepancies are not anecdotal: we’re sometimes talking about 20 to 40% missing pages in Search Console.<\/p>

What is appreciated is that Google acknowledges this. Too often, official tools are presented as absolute truths, while they are partial indicators <\/strong>. Here, Mueller dots the i's: Search Console is not your sole source of truth for internal linking.<\/p>

What nuances should be added to this claim? <\/h3>

The fact that the sample is partial does not mean it is useless. Search Console reflects Google’s view <\/strong>, and that is precisely what matters. If a strategic page does not appear in the internal links report, it may be because Google has never crawled it—or very rarely.<\/p>

In other words, the absence of a URL in this report can be a warning sign <\/strong>: orphan page, excessive depth, unintentional blockage. The tool then becomes an indirect diagnostic of crawlability, even if it wasn’t designed for that. [To be verified] <\/strong>: Google never specifies whether the sample is random, weighted by crawl budget, or filtered by other criteria.<\/p>

In what cases does this report remain relevant nonetheless? <\/h3>

For small to medium-sized sites (< 5,000 pages), the sample is likely to cover a significant portion of the architecture. Strategic pages—home, main categories, SEO landing pages <\/strong>—are generally well represented, as they are crawled frequently.<\/p>

The report also becomes relevant for identifying flagrant anomalies <\/strong>: a contact page that would have 500 internal links (a sign of a template issue), a pillar category with only 2 incoming links (likely structural isolation). These are macro alerts that the sample can reveal, even if incomplete.<\/p>

Warning: <\/strong> Never rely on the internal links report to validate a link structure before a migration, redesign, or launch of a new architecture. A complete crawler is essential in these critical contexts.<\/div>

Practical impact and recommendations

What should you do to audit your internal linking? <\/h3>

First, never rely solely on Search Console <\/strong> for a serious link audit. Use a third-party crawler—Screaming Frog, Oncrawl, Botify, Sitebulb—that can explore your entire site, respecting your robots.txt rules and following internal links as Googlebot would.<\/p>

Then cross-reference the data: compare the list of pages crawled by your tool with those present in Search Console. URLs absent from Search Console but present in your crawl <\/strong> are likely under-crawled by Google, or even orphaned from the perspective of its algorithms. This is a strong signal to investigate.<\/p>

What mistakes should be avoided when interpreting this report? <\/h3>

Do not draw definitive conclusions about the exact number of links pointing to a given page. If Search Console shows 12 internal links to a URL, that does not mean there are only 12— this is what Google detected in its sample <\/strong>. The reality may be much higher.<\/p>

Also, avoid over-optimizing based on this report. If a strategic page shows few internal links in Search Console, don’t blindly multiply anchors everywhere. First check with a crawler if the problem is real or just an artifact of sampling. <\/strong> The goal remains a coherent architecture, not a cosmetic score in a tool.<\/p>

How can this report be integrated into a broader SEO strategy? <\/h3>

Use the internal links report as a trend detection tool <\/strong>, not as an absolute source of truth. Identify pages that consistently appear with few incoming links—these may potentially be orphan pages or poorly integrated into your architecture.<\/p>

Complement this analysis with your crawl budget data: if a page receives few internal links and <\/em> is rarely crawled (see server logs), you have a structural problem to correct. The Search Console report then becomes an element of a broader diagnosis <\/strong>, never an end in itself.<\/p>

  • Crawl the entire site with a third-party tool <\/strong> to obtain a comprehensive view of the linking structure.<\/li>
  • Compare crawl data with that from Search Console <\/strong> to identify underrepresented or orphan pages.<\/li>
  • Never rely on the exact internal link counts <\/strong> displayed in Search Console—they are partial indicators.<\/li>
  • Use the report to detect macro anomalies <\/strong> (over-linking, isolation) rather than for a detailed audit.<\/li>
  • Cross-reference with server logs <\/strong> to confirm that a poorly linked page is also rarely crawled.<\/li>
  • Review the architecture and linking of strategic pages <\/strong> identified as under-linked or absent from the report.<\/li><\/ul>
    The internal links report in Search Console remains a valuable tool for detecting weak signals, but it can in no way replace a complete link audit. Always systematically combine it with a comprehensive crawler and your server logs. Optimizing a complex architecture often requires a methodical and technical approach—if you lack resources or expertise internally, consulting a specialized SEO agency can help secure these critical projects and avoid costly mistakes.<\/div>

❓ Frequently Asked Questions

Le rapport de liens internes dans Search Console est-il fiable pour auditer mon maillage ?
Non, il repose sur un échantillon de pages et ne couvre pas l'intégralité de votre site. Utilisez un crawler tiers pour un audit exhaustif.
Pourquoi certaines de mes pages n'apparaissent-elles pas dans le rapport de liens internes ?
Parce qu'elles ne font probablement pas partie de l'échantillon sélectionné par Google. Cela peut aussi signaler un problème de crawlabilité ou d'isolation structurelle.
Quelle différence entre le rapport de liens internes et le rapport de couverture d'index ?
Le rapport de couverture vise l'exhaustivité et documente toutes les URLs que Google a tenté d'indexer. Le rapport de liens internes est basé sur un échantillon et ne garantit pas une vision complète.
Puis-je me fier aux chiffres exacts de liens internes affichés dans Search Console ?
Non, ces chiffres reflètent ce que Google a détecté dans son échantillon, pas la réalité exhaustive. Croisez toujours avec un crawler complet.
Comment savoir si une page stratégique est bien maillée selon Google ?
Vérifiez si elle apparaît dans le rapport de liens internes avec un nombre de liens cohérent. Si elle est absente ou affiche peu de liens, crawlez votre site et consultez vos logs serveur pour confirmer un éventuel problème de crawlabilité.

🎥 From the same video 42

Other SEO insights extracted from this same Google Search Central video · duration 996h50 · published on 12/03/2021

🎥 Watch the full video on YouTube →

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.