Within Platform Design
When Do Misinformation Labels Actually Work?
Warning labels can lower belief in false claims, but vague or distrusted labels may fail or even harden group resistance.
On this page
- What effective labels tell readers
- Why disputed labels can backfire
- Design choices that build label trust
Page outline Jump by section
Introduction
Warning labels are one of the most widely used platform tools for reducing belief in myths and misinformation. Rather than removing content, they add context: a post may be marked as disputed, fact-checked, misleading, state-affiliated, or lacking evidence. Research generally finds that well-designed labels can reduce belief in false claims and lower willingness to share them. However, their success depends heavily on trust. If users distrust the institution applying the label, view it as politically motivated, or cannot understand why it appears, the intervention becomes less effective and can sometimes strengthen resistance among particular groups. The central challenge is therefore not merely whether platforms label content, but whether users regard those labels as credible, fair and informative. [ScienceDirect]sciencedirect.comScienceDirectReview Misinformation warning labels are widely effectiveby C Martel · 2023 · Cited by 163 — Recent evidence indicates that…
What Effective Labels Tell Readers
The strongest warning labels do more than announce that something is questionable. They provide enough information for readers to understand why caution is warranted.
Studies reviewing misinformation interventions consistently find that labels are more effective when they are specific, visible and connected to a credible source. Labels that identify a claim as false, misleading or fact-checked generally reduce belief and sharing intentions more reliably than vague alerts. Their effect is particularly strong when users encounter the warning before accepting or sharing the claim. [ScienceDirect]sciencedirect.comScienceDirectReview Misinformation warning labels are widely effectiveby C Martel · 2023 · Cited by 163 — Recent evidence indicates that…
Researchers have also found benefits when labels explain their reasoning. Experiments examining AI-assisted and crowd-assisted warning systems suggest that users respond more positively when they receive some explanation of how the warning was generated rather than a bare warning alone. Explanatory labels can make the intervention appear more informative and less arbitrary. [Sage Journals]journals.sagepub.comSage JournalsReducing Misinformation Credibility: How Explanations…by P Mena · Cited by 4 — An online experiment found that explanatio…
A useful distinction exists between:
- Generic warnings, such as “This content may be misleading”.
- Evidence-based warnings, such as “Independent fact-checkers found no evidence supporting this claim”.
- Context labels, which provide additional information rather than directly judging truthfulness.
The latter two approaches tend to generate less suspicion because they give readers a reason for the intervention rather than simply issuing a command about what to believe. [Sage Journals]journals.sagepub.comSage JournalsReducing Misinformation Credibility: How Explanations…by P Mena · Cited by 4 — An online experiment found that explanatio…
Why Disputed Labels Can Backfire
The most controversial question is whether warning labels sometimes increase belief in misinformation rather than reduce it.
Early concerns about “backfire effects” emerged when platforms experimented with simple disputed tags. Facebook’s original red-flag system was eventually abandoned after evidence suggested that highly visible warning symbols could sometimes reinforce the perception that a claim was controversial rather than false. The company shifted towards providing related fact-checks and contextual information instead. [Time]time.comThe red warning icon, intended to combat misinformation, occasionally reinforced readers' beliefs and led to more sharing of disputed con…
Research has identified several mechanisms behind these failures.
Reactance and perceived censorship
People often resist information that appears to threaten their autonomy. When a label is interpreted as an attempt to control what users think, some individuals respond by defending the flagged claim more strongly. This phenomenon, known as psychological reactance, is especially relevant when misinformation aligns with a person’s political identity or worldview. The Journal of Social Media in Society [thejsms.org]thejsms.orgwarning label types in discrediting misinformation posts.Keywords: misinformation, warning label, worldview inconsistency, psychological…
A study examining Twitter’s disputed labels on election misinformation found that some politically knowledgeable supporters of the labelled politician became more likely to regard the misinformation as true after seeing the warning. Researchers suggested that distrust of the platform may have contributed to this reaction. [Misinformation Review]misinforeview.hks.harvard.eduMisinformation ReviewTrump, Twitter, and truth judgments: The effects of “…by JC Blanchar · 2024 · Cited by 10 — Trump voters with hig…
The trust transfer problem
Users do not evaluate labels in isolation. They evaluate the source behind them.
If readers distrust fact-checkers, media organisations, governments or technology companies, they may discount the warning regardless of its accuracy. In highly polarised environments, labels can become symbols of group conflict rather than signals of reliability. A warning intended to communicate evidence can instead be interpreted as evidence of bias. [ScienceDirect]sciencedirect.comScienceDirectReview Misinformation warning labels are widely effectiveby C Martel · 2023 · Cited by 163 — Recent evidence indicates that…
Interestingly, newer research suggests that complete trust in fact-checkers may not be necessary for labels to work. Experiments reported by MIT researchers found that fact-checking labels still reduced belief in misinformation even among participants who expressed scepticism towards fact-checkers. Nevertheless, effectiveness varied, showing that trust remains an important moderator rather than an all-or-nothing requirement. [MIT Sloan]mitsloan.mit.eduwarning labels fact checkers work even if you dont trust themMIT SloanWarning labels from fact checkers work — even if you don't…2 Sept 2024 — Fact-checker warning labels on social media can sign…
The implied truth effect
Another risk appears when platforms label only a small fraction of false content.
If users see warnings attached to some posts but not others, they may infer that unlabelled content has effectively been verified. Researchers call this the “implied truth effect”. In such cases, incomplete labelling can unintentionally increase trust in misinformation that escapes review. [Nieman Lab]niemanlab.orgNieman LabAdding a “disputed” label to fake news seems to work…Sep 12, 2017 — Labeling only some fake news stories as fake can make s…
This problem helps explain why researchers often argue for broad and consistent application rather than selective warnings that leave large amounts of questionable content untouched. [ScienceDirect]sciencedirect.comScienceDirectReview Misinformation warning labels are widely effectiveby C Martel · 2023 · Cited by 163 — Recent evidence indicates that…
Why Trust Is a Design Problem, Not Just a User Problem
It is tempting to treat distrust as something users bring with them, but platform design decisions can either increase or decrease trust in labels.
The source of a warning matters. Studies comparing labels from platforms, fact-checkers, other users and artificial intelligence systems show that effectiveness can vary depending on whom users believe is making the judgement. In some experiments, AI-generated warnings performed surprisingly well among participants who reported low trust in traditional news organisations, suggesting that perceptions of neutrality can influence outcomes. [arXiv]arxiv.orgSource details in endnotes.
Visibility also matters. A warning hidden behind multiple clicks may be ignored, while a dramatic warning may provoke defensive reactions. Research on content labels repeatedly finds that prominence helps only when paired with clarity and credibility. Highly conspicuous warnings that lack explanation can generate scepticism rather than reflection. ScienceDirect [Sage Journals]journals.sagepub.comSage JournalsReducing Misinformation Credibility: How Explanations…by P Mena · Cited by 4 — An online experiment found that explanatio…
Timing is equally important. Labels applied before engagement are generally more effective than corrections encountered after a belief has already formed. Once a false claim becomes familiar, repetition itself can increase perceived accuracy, making later corrections more difficult. [PMC]pmc.ncbi.nlm.nih.govPMCSocial media and the spread of misinformation - PMC - NIHby E Denniss · 2025 · Cited by 137 — There is evidence that warning and fact…
Design Choices That Build Label Trust
Evidence from platform experiments and misinformation research points towards several principles that make labels more trustworthy.
Explain the judgement. Users are more receptive when they understand why content has been flagged and what evidence supports the warning. [Sage Journals]journals.sagepub.comSage JournalsReducing Misinformation Credibility: How Explanations…by P Mena · Cited by 4 — An online experiment found that explanatio… [Sage Journals]journals.sagepub.comSage JournalsReducing Misinformation Credibility: How Explanations…by P Mena · Cited by 4 — An online experiment found that explanatio…
Use credible and transparent sources. Whether labels come from professional fact-checkers, community reviewers or automated systems, users should be able to understand who generated the warning and how. [ScienceDirect]sciencedirect.comScienceDirectReview Misinformation warning labels are widely effectiveby C Martel · 2023 · Cited by 163 — Recent evidence indicates that…
Apply labels consistently. Broad coverage reduces the implied truth effect created when only a small subset of misinformation receives warnings. [Nieman Lab]niemanlab.orgNieman LabAdding a “disputed” label to fake news seems to work…Sep 12, 2017 — Labeling only some fake news stories as fake can make s…
Provide context, not only judgement. Supplementary articles, explanations and corrective information often outperform simple disputed tags because they help users evaluate claims independently. [Time]time.comThe red warning icon, intended to combat misinformation, occasionally reinforced readers' beliefs and led to more sharing of disputed con… [Teen Vogue]teenvogue.comPreviously, disputed articles were flagged with a red warning, but research showed that related articles providing more context were more…
Avoid unnecessary alarm signals. Strong visual cues can attract attention, but they can also trigger defensiveness if users perceive them as attempts to shame or censor. [Time]time.comThe red warning icon, intended to combat misinformation, occasionally reinforced readers' beliefs and led to more sharing of disputed con…
The Main Lesson
The debate over misinformation labels is no longer mainly about whether they work at all. A growing body of evidence indicates that warning labels usually reduce belief in and sharing of false claims when they are visible, specific and timely. The harder question is how to preserve trust in the label itself. A warning that readers see as informative can slow the spread of myths. A warning that readers see as partisan, arbitrary or opaque may be ignored—or, among some audiences, become another reason to cling to the underlying misconception. The effectiveness of labels therefore depends as much on trust-building design choices as on the factual accuracy of the warning they deliver. [Carnegie Endowment]carnegieendowment.orgcountering disinformation effectively an evidence based policy guideCountering Disinformation Effectively: An Evidence-Based…31 Jan 2024 — A high-level, evidence-informed guide to some of the major prop… [3ScienceDirect 3MIT]sciencedirect.comScienceDirectReview Misinformation warning labels are widely effectiveby C Martel · 2023 · Cited by 163 — Recent evidence indicates that… Sloan](#endnote-6 “Snippet: MIT SloanWarning labels from fact checkers work — even if you don’t…2 Sept 2024 — Fact-checker warning labels on social media can sign”)
Amazon book picks
Further Reading
Books and field guides related to When Do Misinformation Labels Actually Work?. Use these as the next step if you want deeper reading beyond the article.
The Psychology of Fake News
Explains how audiences react to credibility signals and corrections.
The Constitution of Knowledge
Relevant to institutional credibility and trust in labels.
The Death of Truth [Hardcover] Michiko Kakutani
First published 2018. Subjects: United states, politics and government, 2017-2021, Political culture, Truth.
Endnotes
-
Source: sciencedirect.com
Link: https://www.sciencedirect.com/science/article/abs/pii/S2352250X23001550Source snippet
ScienceDirectReview Misinformation warning labels are widely effectiveby C Martel · 2023 · Cited by 163 — Recent evidence indicates that...
-
Source: pmc.ncbi.nlm.nih.gov
Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC11955583/Source snippet
PMCSocial media and the spread of misinformation - PMC - NIHby E Denniss · 2025 · Cited by 137 — There is evidence that warning and fact...
-
Source: arxiv.org
Link: https://arxiv.org/abs/2112.03450 -
Source: time.com
Link: https://time.com/5077002/facebook-fake-news-articles/Source snippet
The red warning icon, intended to combat misinformation, occasionally reinforced readers' beliefs and led to more sharing of disputed con...
-
Source: sciencedirect.com
Link: https://www.sciencedirect.com/science/article/pii/S1071581924000600Source snippet
ScienceDirectPeer-supplied credibility labels as an online misinformation...by S Pareek · 2024 · Cited by 19 — In such scenarios, our cr...
-
Source: mitsloan.mit.edu
Title: warning labels fact checkers work even if you dont trust them
Link: https://mitsloan.mit.edu/press/warning-labels-fact-checkers-work-even-if-you-dont-trust-themSource snippet
MIT SloanWarning labels from fact checkers work — even if you don't...2 Sept 2024 — Fact-checker warning labels on social media can sign...
-
Source: ide.mit.edu
Link: https://ide.mit.edu/wp-content/uploads/2024/10/10-21-24-research-brief-final.pdf?x65156=Source snippet
FACT-CHECKER LABELS WORK—EVEN WITH...We also investigate the rise of [fake news]({{ 'fake-news/' | relative_url }}) and misinformation and the development of a digital cultu...
-
Source: pmc.ncbi.nlm.nih.gov
Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC6279465/Source snippet
PMCPrior exposure increases perceived accuracy of fake newsby G Pennycook · 2018 · Cited by 1892 — Participants were randomized into two...
-
Source: arxiv.org
Link: https://arxiv.org/abs/2407.21592Source snippet
arXivDoes the Source of a Warning Matter? Examining the Effectiveness of Veracity Warning Labels Across WarnersJuly 31, 2024...
Published: July 31, 2024
-
Source: arxiv.org
Link: https://arxiv.org/html/2407.21592v1Source snippet
Does the Source of a Warning Matter? Examining...31 Jul 2024 — In this study, we conducted an online, between-subjects experiment (N = 2...
-
Source: arxiv.org
Link: https://arxiv.org/abs/2503.05711 -
Source: sciencedirect.com
Link: https://www.sciencedirect.com/science/article/pii/S004727272500043XSource snippet
[Debunking]({{ 'debunking/' | relative_url }}) “fake news” on social media: Immediate and...by LM Berger · 2025 · Cited by 21 — We conduct a randomized survey experiment to...
-
Source: mitsloan.mit.edu
Title: Experts look at how they spread and what can be done to stop them.Read more
Link: https://mitsloan.mit.edu/ideas-made-to-matter/mit-sloan-research-about-social-media-misinformation-and-electionsSource snippet
Sloan research about social media, misinformation...5 Oct 2020 — “Fake news” and misinformation peak online during presidential electio...
-
Source: facebook.com
Link: https://www.facebook.com/YaleUniversity/posts/fact-checking-makes-a-differencea-new-study-co-authored-by-yales-johan-ugander-f/1212740750883729/Source snippet
University Factual corrections published on Facebook's...Read more...
-
Source: journals.sagepub.com
Link: https://journals.sagepub.com/doi/abs/10.1177/10776990251347657Source snippet
Sage JournalsReducing Misinformation Credibility: How Explanations...by P Mena · Cited by 4 — An online experiment found that explanatio...
-
Source: journals.sagepub.com
Link: https://journals.sagepub.com/doi/10.1177/10776990251347657Source snippet
Sage JournalsReducing Misinformation Credibility: How Explanations...19 Jun 2025 — A study found some evidence that explanations may boo...
-
Source: journals.sagepub.com
Link: https://journals.sagepub.com/doi/10.1177/01634437241237936Source snippet
Sage JournalsFacebook's measures against information disorder do not...12 Mar 2024 — Research has indicated that warning labels on false...
-
Source: carnegieendowment.org
Title: countering disinformation effectively an evidence based policy guide
Link: https://carnegieendowment.org/russia-eurasia/research/2024/01/countering-disinformation-effectively-an-evidence-based-policy-guideSource snippet
Countering Disinformation Effectively: An Evidence-Based...31 Jan 2024 — A high-level, evidence-informed guide to some of the major prop...
-
Source: teenvogue.com
Link: https://www.teenvogue.com/story/facebook-changes-fake-news-flagging-approachSource snippet
Previously, disputed articles were flagged with a red warning, but research showed that related articles providing more context were more...
-
Source: thejsms.org
Link: https://thejsms.org/index.php/JSMS/article/view/1431/715Source snippet
warning label types in discrediting misinformation posts.Keywords: misinformation, warning label, worldview inconsistency, psychological...
-
Source: thejsms.org
Link: https://thejsms.org/index.php/JSMS/article/view/1431Source snippet
The Journal of Social Media in SocietyExamining the effectiveness of social media warning labelsby B Zhang · 2024 · Cited by 4 — The resu...
-
Source: misinforeview.hks.harvard.edu
Link: https://misinforeview.hks.harvard.edu/article/trump-twitter-and-truth-judgments-the-effects-of-disputed-tags-and-political-knowledge-on-the-judged-truthfulness-of-election-misinformation/Source snippet
Misinformation ReviewTrump, Twitter, and truth judgments: The effects of “...by JC Blanchar · 2024 · Cited by 10 — Trump voters with hig...
-
Source: niemanlab.org
Link: https://www.niemanlab.org/2017/09/adding-a-disputed-label-to-fake-news-seems-to-work-a-little-but-for-some-groups-it-actually-backfires/Source snippet
Nieman LabAdding a “disputed” label to fake news seems to work...Sep 12, 2017 — Labeling only some fake news stories as fake can make s...
-
Source: carnegieendowment.org
Title: countering disinformation effectively an evidence based policy guide
Link: https://carnegieendowment.org/research/2024/01/countering-disinformation-effectively-an-evidence-based-policy-guideSource snippet
Countering Disinformation Effectively: An Evidence-Based...Jan 31, 2024 — For these and other reasons, a growing number of experts rejec...
Additional References
-
Source: ftc.gov
Link: https://www.ftc.gov/system/files/documents/public_events/1582978/social_media_covid-19_misinformation_interventions_viewed_positively_but_have_limited_impact.pdfSource snippet
Social Media COVID-19 Misinformation Interventions...by C Geeng · 2020 · Cited by 18 — While having post-specific “disputed” labels migh...
-
Source: theguardian.com
Link: https://www.theguardian.com/technology/2024/sep/20/trump-tweets-false-label-credibility-supportersSource snippet
The research, analyzing data from a December 2020 survey of 1,072 Americans, indicated that Trump voters became more likely to believe mi...
Published: December 2020
-
Source: misinforeview.hks.harvard.edu
Title: state media warning labels can counteract the effects of foreign misinformation
Link: https://misinforeview.hks.harvard.edu/article/state-media-warning-labels-can-counteract-the-effects-of-foreign-misinformation/Source snippet
media warning labels can counteract the effects of...by J Nassetta · 2020 · Cited by 70 — We show that these labels have the ability to...
-
Source: news.yale.edu
Title: flagging misinformation social media reduces engagement study finds
Link: https://news.yale.edu/2025/09/25/flagging-misinformation-social-media-reduces-engagement-study-findsSource snippet
YaleNews25 Sept 2025 — Pointing out potentially misleading posts on social media significantly reduces the number of reposts, likes, repl...
-
Source: researchgate.net
Link: https://www.researchgate.net/publication/334740447_Cleaning_Up_Social_Media_The_Effect_of_Warning_Labels_on_Likelihood_of_Sharing_False_News_on_FacebookSource snippet
ebook may indeed help the current efforts to combat sharing of deceiving...Read more...
-
Source: osf.io
Link: https://osf.io/ugt2v/Source snippet
perceived credibility of a fake news post exaggerating the consequences of...
-
Source: tandfonline.com
Link: https://www.tandfonline.com/doi/full/10.1080/10810730.2024.2385638Source snippet
Examining the Effects of Social Media Warning Labels on...by B Zhang · 2024 · Cited by 7 — This study investigates the nuanced effects o...
-
Source: nature.com
Link: https://www.nature.com/articles/s41562-024-02086-1Source snippet
a systematic review and meta-analysis of news judgementsby J Pfänder · 2025 · Cited by 68 — Social media and trust in news: an experiment...
-
Source: pmc.ncbi.nlm.nih.gov
Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC11094033/Source snippet
PMCby S Williams-Ceci · 2024 · Cited by 15 — In contrast, while a warning about unreliable sources might decrease trust in misinformation...
-
Source: pmc.ncbi.nlm.nih.gov
Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC12015313/Source snippet
adhere to content warning labels even when they are...by BD Horne · 2025 · Cited by 5 — In a series of three online experiments, we find...
Topic Tree



