Exploring Generalizability of Fine-Tuned Models for Fake News Detection

Published 2022 in International Conference on Communications in Computing

ABSTRACT

The Covid-19 pandemic has caused a dramatic and parallel rise in dangerous misinformation, denoted an ‘infodemic’ by the CDC and WHO. Misinformation tied to the Covid-19 infodemic changes continuously; this can lead to performance degradation of fine-tuned models due to concept drift. Degredation can be mitigated if models generalize well-enough to capture some cyclical aspects of drifted data. In this paper, we explore generalizability of pre-trained and fine-tuned fake news detectors across 9 fake news datasets. We show that existing models often overfit on their training dataset and have poor performance on unseen data. However, on some subsets of unseen data that overlap with training data, models have higher accuracy. Based on this observation, we also present KMeans-Proxy, a fast and effective method based on K-Means clustering for quickly identifying these overlapping subsets of unseen data. KMeans-Proxy improves generalizability on unseen fake news datasets by 0.1-0.2 f1-points across datasets. We present both our generalizability experiments as well as KMeans-Proxy to further research in tackling the fake news problem.

PUBLICATION RECORD

Publication year
2022
Venue
International Conference on Communications in Computing
Publication date
2022-12-01
Fields of study
Computer Science
Identifiers
DOI 10.1109/CIC56439.2022.00022
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Explainable Misinformation Detection Across Multiple Social Media Platforms
2022cited by this paper
Data Management Opportunities for Foundation Models
2022cited by this paper
Convergence of online k-means
2022cited by this paper
Shoring Up the Foundations: Fusing Model Embeddings and Weak Supervision
2022influential reference
Multi-Source Domain Adaptation with Weak Supervision for Early Fake News Detection
2021cited by this paper
Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection
2021influential reference
On the Opportunities and Risks of Foundation Models
2021cited by this paper
Machine learning with a reject option: a survey
2021cited by this paper
End-to-End Weak Supervision
2021cited by this paper
Misinformation Adoption or Rejection in the Era of COVID-19
2021cited by this paper
A COVID-19 Rumor Dataset
2021cited by this paper
Transformer-based Language Model Fine-tuning Methods for COVID-19 Fake News Detection
2021cited by this paper
Model Generalization on COVID-19 Fake News Detection
2021cited by this paper
COVID-19 Fake News Dataset
2021cited by this paper
A Heuristic-driven Uncertainty based Ensemble Framework for Fake News Detection in Tweets and News Articles
2021cited by this paper
LightMBERT: A Simple Yet Effective Method for Multilingual BERT Distillation
2021cited by this paper
FakeSens: A Social Sensing Approach to COVID-19 Misinformation Detection on Social Media
2021cited by this paper
The Instagram Infodemic: Cobranding of Conspiracy Theories, Coronavirus Disease 2019 and Authority-Questioning Beliefs
2020cited by this paper
Exploring BERT Parameter Efficiency on the Stanford Question Answering Dataset v2.0
2020cited by this paper
Beyond Artificial Reality
2020cited by this paper
ProxyNCA++: Revisiting and Revitalizing Proxy Neighborhood Component Analysis
2020cited by this paper
What Happens To BERT Embeddings During Fine-tuning?
2020cited by this paper
COVID-Twitter-BERT: A natural language processing model to analyse COVID-19 content on Twitter
2020cited by this paper
CoAID: COVID-19 Healthcare Misinformation Dataset
2020cited by this paper
Characterizing COVID-19 Misinformation Communities Using a Novel Twitter Dataset
2020cited by this paper
A stance data set on polarized conversations on Twitter about the efficacy of hydroxychloroquine as a treatment for COVID-19
2020cited by this paper
NLP-based Feature Extraction for the Detection of COVID-19 Misinformation Videos on YouTube
2020cited by this paper
COVIDLies: Detecting COVID-19 Misinformation on Social Media
2020cited by this paper
The different forms of COVID-19 misinformation and their consequences
2020cited by this paper
MCNNet: Generalizing Fake News Detection with a Multichannel Convolutional Neural Network using a Novel COVID-19 Dataset
2020cited by this paper
SelectiveNet: A Deep Neural Network with an Integrated Reject Option
2019cited by this paper
Towards Lingua Franca Named Entity Recognition with BERT
2019cited by this paper
What Would Elsa Do? Freezing Layers During Transformer Fine-Tuning
2019cited by this paper
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
2019influential reference
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
2019influential reference
Concept Drift Adaptive Physical Event Detection for Social Media Streams
2019cited by this paper
Interpretable machine learning with reject option
2018cited by this paper
An introduction to domain adaptation and transfer learning
2018cited by this paper
Snorkel: Rapid Training Data Creation with Weak Supervision
2017cited by this paper
No Fuss Distance Metric Learning Using Proxies
2017cited by this paper
Analysing patterns of spatial and niche overlap among species at multiple resolutions
2016cited by this paper
Accurate estimation of influenza epidemics using Google search data via ARGO
2015cited by this paper
Google Flu Trends Still Appears Sick: An Evaluation of the 2013-2014 Flu Season
2014cited by this paper
A survey on concept drift adaptation
2014cited by this paper
Probabilistic Lipschitzness A niceness assumption for deterministic labels
2013influential reference
Learning under Concept Drift: an Overview
2010cited by this paper

CITED BY

A review of fake news detection based on transfer learning
2025cites this paper
An exploration of features to improve the generalisability of fake news detection models
2025cites this paper
CrediBench: Building Web-Scale Network Datasets for Information Integrity
2025cites this paper
Boosting generalization of fine-tuning BERT for fake news detection
2024cites this paper
Rough-Fuzzy Graph Learning Domain Adaptation for Fake News Detection
2024cites this paper
Towards Reliable Misinformation Mitigation: Generalization, Uncertainty, and GPT-4
2023cites this paper
Synthetic Lies: Understanding AI-Generated Misinformation and Evaluating Algorithmic and Human Solutions
2023cites this paper
Constructive Interpretability with CoLabel: Corroborative Integration, Complementary Features, and Collaborative Learning
2022cites this paper