What does Google say about SEO? /
Quick SEO Quiz

Test your SEO knowledge in 3 questions

Less than 30 seconds. Find out how much you really know about Google search.

🕒 ~30s 🎯 3 questions 📚 SEO Google

Official statement

The content of FAQ structured data must be visible word-for-word on the page. Algorithms check that the text is present to avoid promising something in the results that the user cannot find on the page. The algorithms may become stricter.
36:48
🎥 Source video

Extracted from a Google Search Central video

⏱ 1h03 💬 EN 📅 29/10/2020 ✂ 25 statements
Watch on YouTube (36:48) →
Other statements from this video 24
  1. 1:21 Le lazy loading tue-t-il l'indexation de votre contenu par Google ?
  2. 5:18 Comment vérifier si Google indexe vraiment votre contenu lazy-loaded ?
  3. 6:19 Pourquoi vos images restent-elles indexées bien après la disparition du contenu textuel ?
  4. 8:26 Faut-il vraiment archiver les produits épuisés plutôt que les laisser en rupture de stock ?
  5. 9:27 Les pages en rupture de stock nuisent-elles vraiment à votre référencement Google ?
  6. 12:05 Faut-il vraiment supprimer vos pages de produits épuisés pour éviter une pénalité qualité ?
  7. 17:16 Faut-il vraiment éviter toute migration après une première migration de domaine ratée ?
  8. 20:36 Faut-il vraiment annuler une migration de domaine ratée ou l'assumer jusqu'au bout ?
  9. 21:40 Comment Google traite-t-il réellement la séparation d'un site en deux entités distinctes ?
  10. 24:10 Google analyse-t-il vraiment l'audio de vos podcasts pour le référencement ?
  11. 26:27 Faut-il vraiment indexer toutes vos pages de pagination ?
  12. 30:06 Les pages paginées peuvent-elles vraiment disparaître des résultats Google ?
  13. 32:45 Les liens sortants en 404 pénalisent-ils vraiment la qualité perçue d'une page ?
  14. 33:49 L'EAT est-il vraiment un facteur de classement ou juste un écran de fumée Google ?
  15. 34:54 Les FAQ structurées aident-elles vraiment à mieux ranker dans Google ?
  16. 39:10 Google indexe-t-il encore le contenu Flash, ou faut-il tout migrer vers le HTML pur ?
  17. 41:36 Faut-il masquer les bannières RGPD à Googlebot pour éviter le cloaking ?
  18. 43:57 Les Quality Raters notent-ils vraiment votre site pour le déclasser ?
  19. 45:30 Peut-on vraiment avoir un design complètement différent entre les versions linguistiques d'un site ?
  20. 47:42 Les redirections 302 peuvent-elles vraiment transmettre autant de PageRank que les 301 ?
  21. 50:58 Google change-t-il immédiatement l'URL canonique après la suppression d'une redirection ?
  22. 53:43 Les redirections 302 finissent-elles vraiment par être traitées comme des 301 permanentes ?
  23. 55:45 Peut-on vraiment migrer plusieurs sites vers un seul domaine avec l'outil Change of Address de Google ?
  24. 58:54 Pourquoi garder vos anciens sites en ligne tue-t-il votre nouveau domaine ?
📅
Official statement from (5 years ago)
TL;DR

Google requires that the content of FAQ structured data be visible word-for-word on the page. Algorithms automatically check this match to ensure that users find what was promised in the rich results. Mueller notes that the algorithms could become stricter on this point, making this requirement even stricter in the future.

What you need to understand

Why does Google enforce this strict visibility rule?

The reason lies in user experience in the SERPs. When a user sees an FAQ answer directly in the rich results, they expect to find that exact text when they arrive on the page.

If the content differs, even slightly, it creates a break in trust. Google wants to avoid situations where a site markup has an invisible or partially hidden FAQ to capture clicks. This is an anti-abuse measure that protects the quality of rich results.

What does “visible word-for-word” actually mean?

This phrasing allows for no paraphrasing or rephrasing. The text present in the JSON-LD or microdata must match exactly the text displayed in the visible HTML of the page.

Even minor variations — replacing a synonym, reversing phrases, adding a word — can trigger a rejection of the FAQ markup. Google scans the visible DOM and compares it with the structured data. No exact match? No rich display.

Can algorithms really become stricter?

Mueller uses the conditional, which leaves a strategic uncertainty. Currently, some sites get away with minor discrepancies, but Google reserves the right to tighten the algorithmic checks.

This could translate into a more rigid matching: detection of syntactical variations, zero tolerance for punctuation discrepancies, checking the actual CSS visibility (display:none, opacity, etc.). It’s safe to say that it’s better to be in strict compliance from now on.

  • Exact match required between structured data and visible content
  • Automatic algorithmic check comparing the markup to the rendered DOM
  • Risk of gradual tightening of validation rules
  • Goal: total consistency between SERP promise and page content
  • No tolerance for hidden or rephrased FAQs

SEO Expert opinion

Is this statement consistent with observed practices in the field?

Yes, and it’s even a late confirmation of what many of us have already observed. The cases of FAQ rich snippets disappearing after an update are multiplying when the markup content does not match the visible content.

However — and this is where it gets interesting — the application of this rule remains uneven across sectors. Some health or finance sites are penalized for an extra semicolon, while others in e-commerce get away with minor variations. [To verify]: does Google apply different tolerance thresholds based on YMYL domains?

What nuances should be added to this absolute rule?

The phrasing “word-for-word” poses a practical issue: what about HTML tags in the visible content? If your FAQ contains a link or a , should it be replicated in the JSON-LD?

Field experience shows that only plain text matters. You can have HTML markup in the visible field as long as the extracted text corresponds to the structured text. But be careful: some CMSs automatically generate non-breaking spaces or invisible Unicode characters that break the matching.

Another vague point: punctuation. A question with or without a period at the end, a space before a question mark (French typography) — does that break the correspondence? [To verify] because Mueller does not specify the level of granularity for the matching.

In what cases could this rule pose a problem?

Multilingual sites with automatically translated FAQs are particularly exposed. If the translation in the JSON-LD slightly differs from the displayed one (different translation engine, cache, etc.), Google may refuse the snippet.

The same issue arises for sites with personalized or A/B tested content. If Googlebot sees one version of the FAQ and a user sees another, you are in violation. The solution? Exclude FAQs from testing or ensure that the structured data always reflects the default version.

Note: FAQs loaded in JavaScript after the initial rendering may not be detected by Google if the markup text does not exactly match the text present at the time of crawling. Timing matters.

Practical impact and recommendations

What should you do concretely to remain compliant?

First, automate the verification. Don’t rely on manual checks: a script that extracts the visible text from your FAQs and compares it to the JSON-LD will save you a lot of time. Several SEO tools offer this functionality, but a simple Python script with BeautifulSoup will do the job.

Next, unify your source of truth. If your CMS generates FAQs on the front end and your SEO plugin generates structured data independently, you create a risk of divergence. Ideally, one data source should feed both the visible HTML and the JSON-LD.

Finally, monitor rich snippets in the Search Console. A sharp drop in impressions with FAQ rich results? Likely that Google detected a non-compliance. GSC won’t explicitly tell you, but it’s a warning signal.

What mistakes should be absolutely avoided?

The classic mistake: markup of FAQs that are not visible above the fold. Technically, the content is present in the DOM, but if the user has to scroll or click to see it, Google may consider that you are over-promising.

Another frequent trap: accordion FAQs with content hidden in display:none. If the text is not visible in its initial state (even if a click reveals it), it’s risky. Prefer accordions that render the content in the DOM even in the closed state, with a simple max-height:0 and overflow:hidden.

And beware of “optimized” rephrasing in the structured data. You cannot rewrite a visible question “How to choose X?” into “What is the best way to choose X?” in the markup, even if it seems more SEO-friendly.

How can I verify that my site is compliant?

Use the Google Rich Results Test on your FAQ pages. The tool will show you what Google extracts from the structured data. Compare visually with what is displayed on the page. Any slight discrepancy? Correct it.

Supplement with a programmatic check: a script that parses the JSON-LD, extracts the FAQ text, then parses the visible HTML and compares. If you have hundreds of FAQ pages, this is the only scalable approach. Otherwise, you might overlook variations introduced by contributors or template updates.

These optimizations may seem simple in theory, but their technical implementation at scale — especially on sites with multiple contributors, complex CMSs, or multi-application stacks — requires deep expertise and meticulous monitoring. If you manage a large site or if your technical stack makes auditing complex, hiring a specialized SEO agency will help you quickly identify and correct non-compliances before they impact your performance in the SERPs.

  • Verify exact match between visible text and JSON-LD on all FAQ pages
  • Eliminate any FAQ markup that is not visible without user interaction
  • Automate detection of discrepancies with a regular monitoring script
  • Exclude FAQs from A/B tests or ensure that structured data reflects the default version
  • Check performances of FAQ rich snippets in the Search Console
  • Test each page with the Rich Results Test after modifying FAQ content
Google provides no leeway on the visibility of structured FAQs. The match between visible text and structured data must be perfect, with no rephrasing or omissions. Algorithms may become stricter, so it's better to be in strict compliance right now. Automate verification and unify your data sources to avoid discrepancies.

❓ Frequently Asked Questions

Peut-on baliser des FAQ qui s'affichent uniquement après un clic sur un accordéon ?
Oui, à condition que le texte soit présent dans le DOM visible, même si masqué visuellement par CSS (max-height, opacity). Si le contenu est en display:none ou chargé dynamiquement après le clic, c'est risqué. Google veut que le texte soit techniquement accessible sans interaction.
Les variations de ponctuation (espace avant ?, guillemets typographiques) cassent-elles le matching ?
Mueller ne précise pas le niveau de tolérance exact. L'expérience terrain montre que des écarts mineurs de ponctuation passent parfois, mais mieux vaut viser une correspondance stricte. Les espaces insécables et caractères Unicode invisibles sont particulièrement problématiques.
Faut-il reproduire les balises HTML (strong, em, liens) dans le JSON-LD ?
Non. Seul le texte brut compte. Vous pouvez avoir du markup HTML dans le contenu visible tant que le texte extrait (sans balises) correspond exactement au texte structuré dans le JSON-LD.
Comment gérer les FAQ multilingues avec traduction automatique ?
Assurez-vous que le moteur de traduction utilisé pour le contenu visible et pour le structured data est le même, ou que les deux sont générés depuis une source unique. Sinon, des variations mineures de traduction peuvent casser la correspondance et bloquer les rich snippets.
Quel outil utiliser pour vérifier la conformité à grande échelle ?
Combinez le Rich Results Test de Google pour des vérifications ponctuelles avec un script custom (Python + BeautifulSoup ou Puppeteer) qui extrait et compare automatiquement le texte visible vs JSON-LD sur toutes vos pages FAQ. C'est la seule approche scalable pour des centaines de pages.

🎥 From the same video 24

Other SEO insights extracted from this same Google Search Central video · duration 1h03 · published on 29/10/2020

🎥 Watch the full video on YouTube →

💬 Comments (0)

Be the first to comment.

2000 characters remaining
🔔

Get real-time analysis of the latest Google SEO declarations

Be the first to know every time a new official Google statement drops — with full expert analysis.

No spam. Unsubscribe in one click.