Ensembles of Recurrent Networks for Classifying the Relationship of Fake News Titles

Published 2019 in Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

ABSTRACT

Nowadays, everyone can create and publish news and information anonymously online. However, the credibility of such news and information are not guaranteed. To differentiate fake news from genuine news, one can compare a recent news with earlier posted ones. Identified suspicious news can be debunked to stop the fake news from spreading further. In this paper, we investigate the advantages of recurrent neural networks-based language representations (e.g., BERT, BiLSTM) in order to build ensemble classifiers that can accurately predict if one news title is related to, and, additionally disagrees with an earlier news title. Our experiments, on a dataset of 321k news titles created for the WSDM 2019 challenge, show that the BERT-based models significantly outperform BiLSTM, which in-turn significantly outperforms a simpler embedding-based representation. Furthermore, even the state-of-the-art BERT approach can be enhanced when combined with a simple BM25 feature.

PUBLICATION RECORD

Publication year
2019
Venue
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Publication date
2019-07-18
Fields of study
Computer Science
Identifiers
DOI 10.1145/3331184.3331305
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

End-to-End Open-Domain Question Answering with BERTserini
2019cited by this paper
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
2019cited by this paper
MatchZoo: A Learning, Practicing, and Developing System for Neural Text Matching
2019cited by this paper
The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News
2018cited by this paper
DR-BiLSTM: Dependent Reading Bidirectional LSTM for Natural Language Inference
2018cited by this paper
Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information
2018cited by this paper
Neural Ranking Models with Weak Supervision
2017cited by this paper
Exploring the Limits of Language Modeling
2016cited by this paper
Rumor Identification and Belief Investigation on Twitter
2016cited by this paper
Siamese Recurrent Architectures for Learning Sentence Similarity
2016cited by this paper
Detecting Rumors from Microblogs with Recurrent Neural Networks
2016influential reference
LSTM-based Deep Learning Models for non-factoid answer selection
2015cited by this paper
Efficient Estimation of Word Representations in Vector Space
2013cited by this paper
Syntactic Stylometry for Deception Detection
2012cited by this paper
Rectified Linear Units Improve Restricted Boltzmann Machines
2010influential reference
Modern Information Retrieval : A Brief Overview
2001cited by this paper
GatfordCentre for Interactive Systems ResearchDepartment of Information
1996cited by this paper
An Efficient Recognition and Syntax-Analysis Algorithm for Context-Free Languages
1965cited by this paper

CITED BY

What is in a title? Characterizing product titles in e-commerce
2025cites this paper
RumorLLM: A Rumor Large Language Model-Based Fake-News-Detection Data-Augmentation Approach
2024cites this paper
SEN-CTD: semantic enhancement network with content-title discrepancy for fake news detection
2024cites this paper
Beyond Text: Multimodal Credibility Assessment Approaches for Online User-Generated Content
2024cites this paper
Hindi fake news detection using transformer ensembles
2023cites this paper
Ensembles to Detect Fake News – An Approach Based on Specialized Classifiers
2023cites this paper
Ax-to-Grind Urdu: Benchmark Dataset for Urdu Fake News Detection
2023cites this paper
Crowdsourced Fact-Checking at Twitter: How Does the Crowd Compare With Experts?
2022cites this paper
An ensemble model for classifying idioms and literal texts using BERT and RoBERTa
2022cites this paper
Entity-Assisted Language Models for Identifying Check-worthy Sentences
2022cites this paper
Leveraging Users' Social Network Embeddings for Fake News Detection on Twitter
2022cites this paper
Enhancing Zero-Shot Stance Detection via Targeted Background Knowledge
2022cites this paper
A multimodal fake news detection model based on crossmodal attention residual and multichannel convolutional neural networks
2021cites this paper
Large-Scale Question Tagging via Joint Question-Topic Embedding Learning
2020cites this paper