PHAED: A Speaker-Aware Parallel Hierarchical Attentive Encoder-Decoder Model for Multi-Turn Dialogue Generation

Published 2024 in IEEE Transactions on Big Data

ABSTRACT

This article presents a novel open-domain dialogue generation model emphasizing the differentiation of speakers in multi-turn conversations. Differing from prior work that treats the conversation history as a long text, we argue that capturing relative social relations among utterances (i.e., generated by either the same speaker or different persons) benefits the machine capturing fine-grained context information from a conversation history to improve context coherence in the generated response. Given that, we propose a Parallel Hierarchical Attentive Encoder-Decoder (PHAED) model that can effectively leverage conversation history by modeling each utterance with the awareness of its speaker and contextual associations with the same speaker's previous messages. Specifically, to distinguish the speaker roles over a multi-turn conversation (involving two speakers), we regard the utterances from one speaker as responses and those from the other as queries. After understanding queries via hierarchical encoder with inner-query and inter-query encodings, transformer-xl style decoder reuses the hidden states of previously generated responses to generate a new response. Our empirical results with three large-scale benchmarks show that PHAED significantly outperforms baseline models on both automatic and human evaluations. Furthermore, our ablation study shows that dialogue models with speaker tokens can generally decrease the possibility of generating non-coherent responses.

PUBLICATION RECORD

Publication year
2024
Venue
IEEE Transactions on Big Data
Publication date
2024-02-01
Fields of study
Linguistics, Computer Science
Identifiers
DOI 10.1109/TBDATA.2023.3316472
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Discourse Structure Extraction from Pre-Trained and Fine-Tuned Language Models in Dialogues
2023cited by this paper
ChatGPT
2023cited by this paper
A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation
2022influential reference
Medical Dialogue Response Generation with Pivotal Information Recalling
2022cited by this paper
LSTM Based Phishing Detection for Big Email Data
2022cited by this paper
Discovering Dialog Structure Graph for Coherent Dialog Generation
2021cited by this paper
Generating Relevant and Coherent Dialogue Responses using Self-Separated Conditional Variational AutoEncoders
2021cited by this paper
The Importance of Modeling Social Factors of Language: Theory and Practice
2021cited by this paper
Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight
2020cited by this paper
Speaker or Listener? The Role of a Dialogue Agent
2020cited by this paper
Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue
2020cited by this paper
A Large-Scale Chinese Short-Text Conversation Dataset
2020influential reference
PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning
2020cited by this paper
Learning a Simple and Effective Model for Multi-turn Response Generation with Auxiliary Tasks
2020influential reference
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
2019cited by this paper
Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
2019influential reference
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
2019influential reference
Language Models are Unsupervised Multitask Learners
2019cited by this paper
Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection
2019cited by this paper
Implicit Discourse Relation Identification for Open-domain Dialogues
2019cited by this paper
ReCoSa: Detecting the Relevant Contexts with Self-Attention for Multi-turn Dialogue Generation
2019influential reference
Effective Incorporation of Speaker Information in Utterance Encoding in Dialog
2019cited by this paper
BiLSTM-SSVM: Training the BiLSTM with a Structured Hinge Loss for Named-Entity Recognition
2019cited by this paper
Variational Hierarchical User-based Conversation Model
2019cited by this paper
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable
2019cited by this paper
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
2019cited by this paper
A Pre-training Based Personalized Dialogue Generation Model with Persona-sparse Data
2019cited by this paper
DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation
2019influential reference
Personalizing Dialogue Agents: I have a dog, do you have pets too?
2018cited by this paper
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems
2018cited by this paper
Wizard of Wikipedia: Knowledge-Powered Conversational agents
2018cited by this paper
Self-Attention with Relative Position Representations
2018cited by this paper
Attention is All you Need
2017influential reference
Speaker Role Contextual Modeling for Language Understanding and Dialogue Policy Learning
2017cited by this paper
Hierarchical Recurrent Attention Network for Response Generation
2017influential reference
How to Make Context More Useful? An Empirical Study on Context-Aware Neural Conversational Models
2017influential reference
Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models
2017cited by this paper
A Survey on Dialogue Systems: Recent Advances and New Frontiers
2017cited by this paper
A Persona-Based Neural Conversation Model
2016cited by this paper
Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning
2016cited by this paper
A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues
2016cited by this paper
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
2015influential reference
A Hierarchical Recurrent Encoder-Decoder for Generative Context-Aware Query Suggestion
2015cited by this paper
Adam: A Method for Stochastic Optimization
2014influential reference
Measuring nominal scale agreement among many raters.
1971influential reference

CITED BY

Terrain Scene Generation Using a Lightweight Vector Quantized Generative Adversarial Network
2025cites this paper
Optimal word order for non-causal text generation with Large Language Models: The Spanish case
2025cites this paper
HCN-RLR-CAN: A novel human-computer negotiation model based on round-level recurrence and causal attention networks
2025cites this paper
ParallelCVAE: A Parallel CVAE Mechanism for Multi-Turn Dialog Response Generation Model
2025cites this paper
Enhancing Clinical Accuracy of Medical Chatbots With Large Language Models
2024cites this paper