What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 5 questions

Less than a minute. Find out how much you really know about Google search.

🕒 ~1 min 🎯 5 questions

Official statement

The content of FAQ structured data must be visible word-for-word on the page. Algorithms check that the text is present to avoid promising something in the results that the user cannot find on the page. The algorithms may become stricter.
36:48
🎥 Source video

Extracted from a Google Search Central video

⏱ 1h03 💬 EN 📅 29/10/2020 ✂ 25 statements
Watch on YouTube (36:48) →
Other statements from this video 24
  1. 1:21 Does lazy loading really harm your content's indexing by Google?
  2. 5:18 How can you confirm if Google is truly indexing your lazy-loaded content?
  3. 6:19 Why do your images stay indexed long after the text content has disappeared?
  4. 8:26 Should you really archive out-of-stock products instead of leaving them marked as unavailable?
  5. 9:27 Do out of stock pages really hurt your Google rankings?
  6. 12:05 Should you really delete your out-of-stock product pages to avoid a quality penalty?
  7. 17:16 Is it really necessary to avoid any migration following a failed domain migration?
  8. 20:36 Should you really cancel a failed domain migration or commit to it fully?
  9. 21:40 How does Google really handle the separation of a site into two distinct entities?
  10. 24:10 Does Google really analyze the audio of your podcasts for SEO?
  11. 26:27 Should you really index all your pagination pages?
  12. 30:06 Can paged pages really disappear from Google search results?
  13. 32:45 Do outbound links that are 404 really harm the perceived quality of a page?
  14. 33:49 Is EAT really a ranking factor or just a Google smokescreen?
  15. 34:54 Do structured FAQs really help improve rankings on Google?
  16. 39:10 Is it true that Google still indexes Flash content, or should everything be migrated to pure HTML?
  17. 41:36 Should you hide GDPR consent banners from Googlebot to avoid cloaking?
  18. 43:57 Are Quality Raters really evaluating your site to lower its ranking?
  19. 45:30 Can your website's language versions really have completely different designs?
  20. 47:42 Do 302 redirects really pass on as much PageRank as 301 redirects?
  21. 50:58 Does Google instantly change the canonical URL after removing a redirect?
  22. 53:43 Do 302 redirects really end up being treated as permanent 301s?
  23. 55:45 Can you really migrate multiple sites to a single domain using Google's Change of Address tool?
  24. 58:54 Why does keeping your old sites live kill your new domain?
📅
Official statement from (5 years ago)
TL;DR

Google requires that the content of FAQ structured data be visible word-for-word on the page. Algorithms automatically check this match to ensure that users find what was promised in the rich results. Mueller notes that the algorithms could become stricter on this point, making this requirement even stricter in the future.

What you need to understand

Why does Google enforce this strict visibility rule?

The reason lies in user experience in the SERPs. When a user sees an FAQ answer directly in the rich results, they expect to find that exact text when they arrive on the page.

If the content differs, even slightly, it creates a break in trust. Google wants to avoid situations where a site markup has an invisible or partially hidden FAQ to capture clicks. This is an anti-abuse measure that protects the quality of rich results.

What does “visible word-for-word” actually mean?

This phrasing allows for no paraphrasing or rephrasing. The text present in the JSON-LD or microdata must match exactly the text displayed in the visible HTML of the page.

Even minor variations — replacing a synonym, reversing phrases, adding a word — can trigger a rejection of the FAQ markup. Google scans the visible DOM and compares it with the structured data. No exact match? No rich display.

Can algorithms really become stricter?

Mueller uses the conditional, which leaves a strategic uncertainty. Currently, some sites get away with minor discrepancies, but Google reserves the right to tighten the algorithmic checks.

This could translate into a more rigid matching: detection of syntactical variations, zero tolerance for punctuation discrepancies, checking the actual CSS visibility (display:none, opacity, etc.). It’s safe to say that it’s better to be in strict compliance from now on.

  • Exact match required between structured data and visible content
  • Automatic algorithmic check comparing the markup to the rendered DOM
  • Risk of gradual tightening of validation rules
  • Goal: total consistency between SERP promise and page content
  • No tolerance for hidden or rephrased FAQs

SEO Expert opinion

Is this statement consistent with observed practices in the field?

Yes, and it’s even a late confirmation of what many of us have already observed. The cases of FAQ rich snippets disappearing after an update are multiplying when the markup content does not match the visible content.

However — and this is where it gets interesting — the application of this rule remains uneven across sectors. Some health or finance sites are penalized for an extra semicolon, while others in e-commerce get away with minor variations. [To verify]: does Google apply different tolerance thresholds based on YMYL domains?

What nuances should be added to this absolute rule?

The phrasing “word-for-word” poses a practical issue: what about HTML tags in the visible content? If your FAQ contains a link or a , should it be replicated in the JSON-LD?

Field experience shows that only plain text matters. You can have HTML markup in the visible field as long as the extracted text corresponds to the structured text. But be careful: some CMSs automatically generate non-breaking spaces or invisible Unicode characters that break the matching.

Another vague point: punctuation. A question with or without a period at the end, a space before a question mark (French typography) — does that break the correspondence? [To verify] because Mueller does not specify the level of granularity for the matching.

In what cases could this rule pose a problem?

Multilingual sites with automatically translated FAQs are particularly exposed. If the translation in the JSON-LD slightly differs from the displayed one (different translation engine, cache, etc.), Google may refuse the snippet.

The same issue arises for sites with personalized or A/B tested content. If Googlebot sees one version of the FAQ and a user sees another, you are in violation. The solution? Exclude FAQs from testing or ensure that the structured data always reflects the default version.

Note: FAQs loaded in JavaScript after the initial rendering may not be detected by Google if the markup text does not exactly match the text present at the time of crawling. Timing matters.

Practical impact and recommendations

What should you do concretely to remain compliant?

First, automate the verification. Don’t rely on manual checks: a script that extracts the visible text from your FAQs and compares it to the JSON-LD will save you a lot of time. Several SEO tools offer this functionality, but a simple Python script with BeautifulSoup will do the job.

Next, unify your source of truth. If your CMS generates FAQs on the front end and your SEO plugin generates structured data independently, you create a risk of divergence. Ideally, one data source should feed both the visible HTML and the JSON-LD.

Finally, monitor rich snippets in the Search Console. A sharp drop in impressions with FAQ rich results? Likely that Google detected a non-compliance. GSC won’t explicitly tell you, but it’s a warning signal.

What mistakes should be absolutely avoided?

The classic mistake: markup of FAQs that are not visible above the fold. Technically, the content is present in the DOM, but if the user has to scroll or click to see it, Google may consider that you are over-promising.

Another frequent trap: accordion FAQs with content hidden in display:none. If the text is not visible in its initial state (even if a click reveals it), it’s risky. Prefer accordions that render the content in the DOM even in the closed state, with a simple max-height:0 and overflow:hidden.

And beware of “optimized” rephrasing in the structured data. You cannot rewrite a visible question “How to choose X?” into “What is the best way to choose X?” in the markup, even if it seems more SEO-friendly.

How can I verify that my site is compliant?

Use the Google Rich Results Test on your FAQ pages. The tool will show you what Google extracts from the structured data. Compare visually with what is displayed on the page. Any slight discrepancy? Correct it.

Supplement with a programmatic check: a script that parses the JSON-LD, extracts the FAQ text, then parses the visible HTML and compares. If you have hundreds of FAQ pages, this is the only scalable approach. Otherwise, you might overlook variations introduced by contributors or template updates.

These optimizations may seem simple in theory, but their technical implementation at scale — especially on sites with multiple contributors, complex CMSs, or multi-application stacks — requires deep expertise and meticulous monitoring. If you manage a large site or if your technical stack makes auditing complex, hiring a specialized SEO agency will help you quickly identify and correct non-compliances before they impact your performance in the SERPs.

  • Verify exact match between visible text and JSON-LD on all FAQ pages
  • Eliminate any FAQ markup that is not visible without user interaction
  • Automate detection of discrepancies with a regular monitoring script
  • Exclude FAQs from A/B tests or ensure that structured data reflects the default version
  • Check performances of FAQ rich snippets in the Search Console
  • Test each page with the Rich Results Test after modifying FAQ content
Google provides no leeway on the visibility of structured FAQs. The match between visible text and structured data must be perfect, with no rephrasing or omissions. Algorithms may become stricter, so it's better to be in strict compliance right now. Automate verification and unify your data sources to avoid discrepancies.

❓ Frequently Asked Questions

Peut-on baliser des FAQ qui s'affichent uniquement après un clic sur un accordéon ?
Oui, à condition que le texte soit présent dans le DOM visible, même si masqué visuellement par CSS (max-height, opacity). Si le contenu est en display:none ou chargé dynamiquement après le clic, c'est risqué. Google veut que le texte soit techniquement accessible sans interaction.
Les variations de ponctuation (espace avant ?, guillemets typographiques) cassent-elles le matching ?
Mueller ne précise pas le niveau de tolérance exact. L'expérience terrain montre que des écarts mineurs de ponctuation passent parfois, mais mieux vaut viser une correspondance stricte. Les espaces insécables et caractères Unicode invisibles sont particulièrement problématiques.
Faut-il reproduire les balises HTML (strong, em, liens) dans le JSON-LD ?
Non. Seul le texte brut compte. Vous pouvez avoir du markup HTML dans le contenu visible tant que le texte extrait (sans balises) correspond exactement au texte structuré dans le JSON-LD.
Comment gérer les FAQ multilingues avec traduction automatique ?
Assurez-vous que le moteur de traduction utilisé pour le contenu visible et pour le structured data est le même, ou que les deux sont générés depuis une source unique. Sinon, des variations mineures de traduction peuvent casser la correspondance et bloquer les rich snippets.
Quel outil utiliser pour vérifier la conformité à grande échelle ?
Combinez le Rich Results Test de Google pour des vérifications ponctuelles avec un script custom (Python + BeautifulSoup ou Puppeteer) qui extrait et compare automatiquement le texte visible vs JSON-LD sur toutes vos pages FAQ. C'est la seule approche scalable pour des centaines de pages.
🏷 Related Topics
Algorithms Domain Age & History Content Structured Data AI & SEO

🎥 From the same video 24

Other SEO insights extracted from this same Google Search Central video · duration 1h03 · published on 29/10/2020

🎥 Watch the full video on YouTube →

Related statements

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.