Official statement
Other statements from this video 42 ▾
- 42:49 Can hreflang really be used across multiple distinct domains?
- 48:45 Can hreflang really be used across multiple distinct domains?
- 58:47 Should you really avoid duplicating your content across two distinct sites?
- 58:47 Should you really avoid creating multiple sites for the same content?
- 91:16 Is it really necessary to index the internal search pages on your site?
- 91:16 Should you block internal search pages to prevent indexing of infinite space?
- 125:44 Do Core Web Vitals Really Influence Google's Crawl Budget?
- 125:44 Can reducing page size really enhance your crawl budget?
- 152:31 Why does the Search Console's internal links report show only a sample?
- 172:13 Should you really be concerned about redirect chains for Google's crawl?
- 172:13 How many redirects does Google really follow before it splits the crawl?
- 201:37 How does Google actually segment your Core Web Vitals by groups of pages?
- 201:37 How does Google actually segment your Core Web Vitals by page groups?
- 248:11 Is it true that AMP or canonical really captures the SEO signals?
- 257:21 Does the Chrome UX Report really count your cached AMP pages?
- 272:10 Is it necessary to redirect your AMP URLs during a change?
- 272:10 Should you really redirect your old AMP URLs to the new ones?
- 294:42 Is AMP really neutral for Google rankings, or does it hide an invisible visibility lever?
- 296:42 Is AMP really a Google ranking factor or just a ticket to access certain features?
- 342:21 Why does copied content sometimes outrank the original despite the DMCA?
- 342:21 Is the DMCA really effective in protecting your duplicated content on Google?
- 359:44 Why does copied content outrank your original material on Google?
- 409:35 Why do your featured snippets disappear seemingly without a technical reason?
- 409:35 Do featured snippets and rich results really fluctuate randomly?
- 455:08 Is it true that mobile hidden content is really indexed by Google?
- 455:08 Is it true that Google really indexes hidden content in responsive CSS?
- 563:51 Can structured data really force the display of a knowledge panel?
- 563:51 Is there any structured markup that guarantees the appearance of a Knowledge Panel?
- 583:50 Why do most websites never get sitelinks in Google?
- 583:50 Can you really force sitelinks to appear in Google?
- 649:39 Do 301 redirects really transfer 100% of SEO juice without any loss?
- 649:39 Do 301 redirects really transfer 100% of PageRank and SEO signals?
- 722:53 Should you really delete or redirect expired content instead of keeping it indexable?
- 722:53 Should you really remove expired pages or can you leave them labeled 'expired'?
- 859:32 Are keywords in the URL a ranking factor or just a temporary crutch?
- 859:32 Do words in the URL really influence Google rankings?
- 908:40 Should you really add structured data to embedded YouTube videos?
- 909:01 Should you really add video structured data when you're already embedding YouTube?
- 932:46 Does Page Experience really only matter for mobile SEO?
- 932:46 Why is Google ignoring desktop Core Web Vitals in its ranking algorithm?
- 952:49 Do the API and Search Console interface really display the same data?
- 963:49 Can you use different templates for each language version without harming international SEO?
Google confirms that the internal links report in Search Console is based on a sample of pages, not the entire indexed site. Unlike the index coverage report, this tool does not guarantee a comprehensive view of your linking structure. For a reliable audit, you need to cross-reference Search Console with third-party tools capable of crawling your entire structure.
What you need to understand
What does "based on a sample" actually mean?
When John Mueller specifies that the internal links report relies on a sample, he openly admits that Search Console does not scan all of your pages . Google selects a representative subset—without specifying the selection criteria or the size of this sample. The problem? If your site has 10,000 pages, you have no guarantee that all 10,000 are analyzed. Orphan pages, deep sections, or rarely crawled URLs are likely to fly under the radar. This is not a bug, it’s a structural limitation of the tool. The index coverage report (now "Indexed Pages") documents all the URLs that Google has attempted to index, whether they are valid, excluded, or erroneous. It provides an almost comprehensive view, powered by crawl and indexing logs. The internal links report, however, does not pretend to this exhaustiveness. It offers a statistical overview , useful for detecting macro trends—highly linked pages, isolated pages—but inadequate for a detailed link audit. In other words: don’t count on it to validate that every strategic URL is receiving its internal links. Google does not communicate the rules for selecting the sample, but reasonable hypotheses can be made. Rarely crawled pages, deep pagination URLs, recently published content not yet stabilized in the index are more likely to be ignored. Sites with complex architecture—multi-language, multi-domain, thousands of categories—are particularly exposed. If your strategic linking relies on level 4 or 5 pages , you cannot rely on Search Console to verify their receipt of internal links.Why does this approach differ from the index coverage report?
Which pages are likely to escape the sample?
SEO Expert opinion
Does this statement align with real-world observations?
Yes, and it confirms what many practitioners have empirically found. The numbers in the internal links report never perfectly match those from Screaming Frog or Oncrawl . The discrepancies are not anecdotal: we’re sometimes talking about 20 to 40% missing pages in Search Console. What is appreciated is that Google acknowledges this. Too often, official tools are presented as absolute truths, while they are partial indicators . Here, Mueller dots the i's: Search Console is not your sole source of truth for internal linking. The fact that the sample is partial does not mean it is useless. Search Console reflects Google’s view , and that is precisely what matters. If a strategic page does not appear in the internal links report, it may be because Google has never crawled it—or very rarely. In other words, the absence of a URL in this report can be a warning sign : orphan page, excessive depth, unintentional blockage. The tool then becomes an indirect diagnostic of crawlability, even if it wasn’t designed for that. [To be verified] : Google never specifies whether the sample is random, weighted by crawl budget, or filtered by other criteria. For small to medium-sized sites (< 5,000 pages), the sample is likely to cover a significant portion of the architecture. Strategic pages—home, main categories, SEO landing pages —are generally well represented, as they are crawled frequently. The report also becomes relevant for identifying flagrant anomalies : a contact page that would have 500 internal links (a sign of a template issue), a pillar category with only 2 incoming links (likely structural isolation). These are macro alerts that the sample can reveal, even if incomplete.What nuances should be added to this claim?
In what cases does this report remain relevant nonetheless?
Practical impact and recommendations
What should you do to audit your internal linking?
First, never rely solely on Search Console for a serious link audit. Use a third-party crawler—Screaming Frog, Oncrawl, Botify, Sitebulb—that can explore your entire site, respecting your robots.txt rules and following internal links as Googlebot would. Then cross-reference the data: compare the list of pages crawled by your tool with those present in Search Console. URLs absent from Search Console but present in your crawl are likely under-crawled by Google, or even orphaned from the perspective of its algorithms. This is a strong signal to investigate. Do not draw definitive conclusions about the exact number of links pointing to a given page. If Search Console shows 12 internal links to a URL, that does not mean there are only 12— this is what Google detected in its sample . The reality may be much higher. Also, avoid over-optimizing based on this report. If a strategic page shows few internal links in Search Console, don’t blindly multiply anchors everywhere. First check with a crawler if the problem is real or just an artifact of sampling. The goal remains a coherent architecture, not a cosmetic score in a tool. Use the internal links report as a trend detection tool , not as an absolute source of truth. Identify pages that consistently appear with few incoming links—these may potentially be orphan pages or poorly integrated into your architecture. Complement this analysis with your crawl budget data: if a page receives few internal links and is rarely crawled (see server logs), you have a structural problem to correct. The Search Console report then becomes an element of a broader diagnosis , never an end in itself.What mistakes should be avoided when interpreting this report?
How can this report be integrated into a broader SEO strategy?
❓ Frequently Asked Questions
Le rapport de liens internes dans Search Console est-il fiable pour auditer mon maillage ?
Pourquoi certaines de mes pages n'apparaissent-elles pas dans le rapport de liens internes ?
Quelle différence entre le rapport de liens internes et le rapport de couverture d'index ?
Puis-je me fier aux chiffres exacts de liens internes affichés dans Search Console ?
Comment savoir si une page stratégique est bien maillée selon Google ?
🎥 From the same video 42
Other SEO insights extracted from this same Google Search Central video · duration 996h50 · published on 12/03/2021
🎥 Watch the full video on YouTube →
💬 Comments (0)
Be the first to comment.