Global Entity Relationship Enhancement Network for Multimodal Sarcasm Detection
Xiaobao Wang, Meng Ge, Lingshan Li, Di Jin, Kai He, Erik Cambria
Published 2026 in IEEE Transactions on Affective Computing

ABSTRACT
Sarcasm is a distinct mode of communication in which the intended meaning runs contrary to the literal one. With the rapid proliferation of social networks, multimodal sarcasm has become widespread, and detecting sarcasm conveyed through multimodal data has drawn growing attention. Existing research often interprets images only superficially, without thoroughly exploring the contextual cues they contain, particularly the scene an image depicts. In this paper, we delve deeper into the information embedded in images. Specifically, we first extract entity relationships from images to capture the contextual information they convey, and we additionally apply image text recognition to extract textual content from the images. After comprehensively analyzing the image information, we model the consistency between image content and text with the help of external knowledge. Finally, we employ a graph neural network to process the constructed cross-modal graph and predict sarcasm. Extensive experiments validate the state-of-the-art performance of our model on publicly available multimodal Twitter datasets.
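The abstract outlines a four-stage pipeline: entity-relationship extraction from images, image text recognition, knowledge-assisted image-text consistency modeling, and prediction with a graph neural network over a cross-modal graph. The Python sketch below illustrates only the final stage, a small graph-convolutional classifier over a cross-modal graph. It is a minimal sketch under stated assumptions, not the authors' implementation: the GCN layers, node features, toy graph, and dimensions are all hypothetical stand-ins.

# Minimal sketch of the final pipeline stage described in the abstract,
# NOT the authors' released code. All module names, dimensions, and the
# toy graph below are hypothetical stand-ins; the paper's actual
# entity-relation extractor, image text recognition component, and
# external-knowledge consistency module are not specified here.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleGCNLayer(nn.Module):
    """One graph-convolution step: normalized adjacency times node features."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.linear = nn.Linear(in_dim, out_dim)

    def forward(self, x, adj):
        # Symmetrically normalize the adjacency (with self-loops) so that
        # aggregation averages over neighbors instead of summing.
        adj = adj + torch.eye(adj.size(0))
        deg_inv_sqrt = adj.sum(dim=1).pow(-0.5)
        adj_norm = deg_inv_sqrt.unsqueeze(1) * adj * deg_inv_sqrt.unsqueeze(0)
        return F.relu(self.linear(adj_norm @ x))

class CrossModalSarcasmClassifier(nn.Module):
    """Two GCN layers over a cross-modal graph, mean-pooled into a
    binary logit pair (non-sarcastic vs. sarcastic)."""
    def __init__(self, feat_dim, hidden_dim=64):
        super().__init__()
        self.gcn1 = SimpleGCNLayer(feat_dim, hidden_dim)
        self.gcn2 = SimpleGCNLayer(hidden_dim, hidden_dim)
        self.classifier = nn.Linear(hidden_dim, 2)

    def forward(self, node_feats, adj):
        h = self.gcn1(node_feats, adj)
        h = self.gcn2(h, adj)
        return self.classifier(h.mean(dim=0))  # graph-level prediction

# Toy cross-modal graph: nodes 0-2 stand for caption tokens, node 3 for an
# image entity, node 4 for text recognized inside the image; edges mark
# cross-modal links that the consistency-modeling stage would supply.
feats = torch.randn(5, 32)  # placeholder node embeddings
adj = torch.zeros(5, 5)
for i, j in [(0, 1), (1, 2), (2, 3), (3, 4), (0, 4)]:
    adj[i, j] = adj[j, i] = 1.0

model = CrossModalSarcasmClassifier(feat_dim=32)
logits = model(feats, adj)
print(logits)  # two unnormalized scores: [non-sarcastic, sarcastic]

In a full system, the node features would come from the extracted image entities, the recognized in-image text, and the caption tokens, with edge weights derived from the external-knowledge consistency modeling; here both are random placeholders.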