Improving Variational Encoder-Decoders in Dialogue Generation

Xiaoyu Shen,Hui Su,Shuzi Niu,Vera Demberg

Published 2018 in AAAI Conference on Artificial Intelligence

ABSTRACT

Variational encoder-decoders (VEDs) have shown promising results in dialogue generation. However, the latent variable distributions are usually approximated by a much simpler model than the powerful RNN structure used for encoding and decoding, yielding the KL-vanishing problem and inconsistent training objective. In this paper, we separate the training step into two phases: The first phase learns to autoencode discrete texts into continuous embeddings, from which the second phase learns to generalize latent representations by reconstructing the encoded embedding. In this case, latent variables are sampled by transforming Gaussian noise through multi-layer perceptrons and are trained with a separate VED model, which has the potential of realizing a much more flexible distribution. We compare our model with current popular models and the experiment demonstrates substantial improvement in both metric-based and human evaluations.

PUBLICATION RECORD

Publication year
2018
Venue
AAAI Conference on Artificial Intelligence
Publication date
2018-02-06
Fields of study
Computer Science
Identifiers
DOI 10.1609/aaai.v32i1.11960 arXiv 1802.02032
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

AUTO-ENCODING VARIATIONAL BAYES
2020cited by this paper
GENERATIVE ADVERSARIAL NETS
2018cited by this paper
A Hybrid Convolutional Variational Autoencoder for Text Generation
2017cited by this paper
Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders
2017cited by this paper
DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset
2017cited by this paper
Improved Variational Autoencoders for Text Modeling using Dilated Convolutions
2017cited by this paper
Piecewise Latent Variables for Neural Variational Text Processing
2017cited by this paper
A Conditional Variational Framework for Dialog Generation
2017cited by this paper
Data Noising as Smoothing in Neural Network Language Models
2017cited by this paper
Improved Variational Inference with Inverse Autoregressive Flow
2016influential reference
Variational Lossy Autoencoder
2016cited by this paper
How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation
2016cited by this paper
A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues
2016influential reference
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
2016cited by this paper
beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework
2016cited by this paper
Generating Sentences from a Continuous Space
2015cited by this paper
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
2015cited by this paper
Adversarial Autoencoders
2015cited by this paper
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
2015cited by this paper
A Neural Conversational Model
2015cited by this paper
Variational Inference with Normalizing Flows
2015cited by this paper
Learning Structured Output Representation using Deep Conditional Generative Models
2015cited by this paper
Stochastic Backpropagation and Approximate Inference in Deep Generative Models
2014cited by this paper
Adam: A Method for Stochastic Optimization
2014cited by this paper
Sequence to Sequence Learning with Neural Networks
2014cited by this paper
Markov Chain Monte Carlo and Variational Inference: Bridging the Gap
2014cited by this paper
Auto-Encoding Variational Bayes
2013cited by this paper
Sequence Transduction with Recurrent Neural Networks
2012cited by this paper
A Neural Probabilistic Language Model
2003cited by this paper
An Introduction to Variational Methods for Graphical Models
1999cited by this paper
The "wake-sleep" algorithm for unsupervised neural networks.
1995cited by this paper

CITED BY

ConSum4DCR: A Q&A Knowledge Extraction Method for Developer Chatroom Environments
2025cites this paper
Pick the Better and Leave the Rest: Leveraging Multiple Retrieved Results to Guide Response Generation
2025cites this paper
AISum4DCR: A Multi-Topic Dialog Summarization Approach for Developer Chat Rooms Integrating AI Chain and Prompt Engineering
2025cites this paper
Multi-party Response Generation with Relation Disentanglement
2024cites this paper
Rumor Detection on Social Media with Reinforcement Learning-based Key Propagation Graph Generator
2024cites this paper
Multimodal Dialogue Systems via Capturing Context-aware Dependencies and Ordinal Information of Semantic Elements
2024cites this paper
Continual Learning with Dirichlet Generative-based Rehearsal
2023cites this paper
A variational selection mechanism for article comment generation
2023cites this paper
DialogDiffAE: Dialogue Generation with Diffusion-Equipped Auto-Encoder
2023cites this paper
EmoKbGAN: Emotion controlled response generation using Generative Adversarial Network for knowledge grounded conversation
2023cites this paper
Mixing It Up: Inducing Empathy and Politeness using Multiple Behaviour-aware Generators for Conversational Systems
2023cites this paper
A Dual Latent Variable Personalized Dialogue Agent
2023cites this paper
Semi-Supervised Variational Autoencoders for Out-of-Distribution Generation
2023cites this paper
Partially Randomizing Transformer Weights for Dialogue Response Diversity
2023cites this paper
Boosting Summarization with Normalizing Flows and Aggressive Training
2023cites this paper
Adversarial Conversational Shaping for Intelligent Agents
2023cites this paper
Dior-CVAE: Diffusion Priors in Variational Dialog Generation
2023cites this paper
PoliSe: Reinforcing Politeness Using User Sentiment for Customer Care Response Generation
2022cites this paper
EmoSen: Generating Sentiment and Emotion Controlled Responses in a Multimodal Dialogue System
2022cites this paper
Flow-Based Variational Sequence Autoencoder
2022cites this paper
PixelSeg: Pixel-by-Pixel Stochastic Semantic Segmentation for Ambiguous Medical Images
2022cites this paper
Sequential or jumping: context-adaptive response generation for open-domain dialogue systems
2022cites this paper
PEVAE: A Hierarchical VAE for Personalized Explainable Recommendation.
2022cites this paper
Taming Continuous Posteriors for Latent Variational Dialogue Policies
2022cites this paper
Hierarchical Inductive Transfer for Continual Dialogue Learning
2022cites this paper
Dialogue management based on forcing a user through a discourse tree of a text
2022cites this paper
DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation
2021cites this paper
Speculative Sampling in Variational Autoencoders for Dialogue Response Generation
2021cites this paper
Controllable Semantic Parsing via Retrieval Augmentation
2021cites this paper
ADA-INCVAE: Improved data generation using variational autoencoder for imbalanced classification
2021cites this paper
TransVae:A Novel Variational Sequence-to-Sequence Framework for Semi-supervised Learning and Diversity Improvement
2021cites this paper
Variational Dialogue Generation with Normalizing Flows
2021influential citation
Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection
2021influential citation
A Simple and Efficient Multi-Task Learning Approach for Conditioned Dialogue Generation
2021cites this paper
Improving Conversation Modelling using Attention Based Variational Hierarchical RNN
2021cites this paper
Controllable and Diverse Text Generation in E-commerce
2021cites this paper
Aspect-Aware Response Generation for Multimodal Dialogue System
2021cites this paper
Controllable Text Generation with Focused Variation
2020cites this paper
Posterior-GAN: Towards Informative and Coherent Response Generation with Posterior Generative Adversarial Network
2020cites this paper
Collaborative filtering recommendation algorithm based on variational inference
2020cites this paper
EmpTransfo: A Multi-head Transformer Architecture for Creating Empathetic Dialog Systems
2020cites this paper
Diversity regularized autoencoders for text generation
2020cites this paper
Towards Multimodal Response Generation with Exemplar Augmentation and Curriculum Optimization
2020influential citation
APo-VAE: Text Generation in Hyperbolic Space
2020cites this paper
Knowledge-aware Attentive Wasserstein Adversarial Dialogue Response Generation
2020influential citation
Diversifying Dialogue Generation with Non-Conversational Text
2020cites this paper
Knowledge-Grounded Chatbot Based on Dual Wasserstein Generative Adversarial Networks with Effective Attention Mechanisms
2020cites this paper
Incorporating Politeness across Languages in Customer Care Responses: Towards building a Multi-lingual Empathetic Dialogue Agent
2020cites this paper
Natural Language Generation Using Transformer Network in an Open-Domain Setting
2020cites this paper
Towards building an affect-aware dialogue agent with deep neural networks
2020cites this paper
How to Generate Reasonable Texts with Controlled Attributes
2020cites this paper
Lexicon-Enhanced Transformer with Pointing for Domains Specific Generative Question Answering
2020cites this paper
Generalized Conditioned Dialogue Generation Based on Pre-trained Language Model
2020cites this paper
Condition-Transforming Variational Autoencoder for Generating Diverse Short Text Conversations
2020cites this paper
Stochasticity and Non-Autoregressive Modeling in Deep Generative Models of Text
2020cites this paper
Improving Variational Autoencoder for Text Modelling with Timestep-Wise Regularisation
2020cites this paper
More to diverse: Generating diversified responses in a task oriented multimodal dialog system
2020cites this paper
DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances
2020cites this paper
Choose Your Words Wisely: Leveraging Embedded Dialog Trajectories to Enhance Performance in Open-Domain Conversations
2020cites this paper
Parallel Variational Autoencoders for Multiple Responses Generation
2020cites this paper
On Dialogue Systems Based on Deep Learning
2020cites this paper
μ-Forcing
2019cites this paper
μ-Forcing: Training Variational Recurrent Autoencoders for Text Generation
2019cites this paper
Challenges in Building Intelligent Open-domain Dialog Systems
2019cites this paper
Improving Neural Conversational Models with Entropy-Based Data Filtering
2019cites this paper
Learning Disentangled Representation in Latent Stochastic Models: A Case Study with Image Captioning
2019cites this paper
Linguistically-Informed Specificity and Semantic Plausibility for Dialogue Generation
2019cites this paper
Adversarial Learning on the Latent Space for Diverse Dialog Generation
2019cites this paper
Improve Diverse Text Generation by Self Labeling Conditional Variational Auto Encoder
2019cites this paper
Knowledge-Grounded Response Generation with Deep Attentional Latent-Variable Model
2019cites this paper
It's How You Say It: Identifying Appropriate Register for Chatbot Language Design
2019cites this paper
Open-Domain Dialogue Generation: Presence, Limitation and Future Directions
2019cites this paper
Improving Variational Auto-Encoder with Self-Attention and Mutual Information for Image Generation
2019cites this paper
Seq2VAR: Multivariate Time Series Representation with Relational Neural Networks and Linear Autoregressive Model
2019cites this paper
Emotion-Aware and Human-Like Autonomous Agents
2019cites this paper
Flexible Text Modeling with Semi-Implicit Latent Representations
2019cites this paper
Revision in Continuous Space: Unsupervised Text Style Transfer without Adversarial Learning
2019cites this paper
Guiding Variational Response Generator to Exploit Persona
2019cites this paper
Conditional Response Generation Using Variational Alignment
2019influential citation
A Latent-Constrained Variational Neural Dialogue Model for Information-Rich Responses
2019cites this paper
Multi-Turn Chatbot Based on Query-Context Attentions and Dual Wasserstein Generative Adversarial Networks
2019cites this paper
Generalization in Generation: A closer look at Exposure Bias
2019influential citation
Modeling Personalization in Continuous Space for Response Generation via Augmented Wasserstein Autoencoders
2019cites this paper
A Semi-Supervised Stable Variational Network for Promoting Replier-Consistency in Dialogue Generation
2019cites this paper
Hierarchical Reinforcement Learning for Open-Domain Dialog
2019cites this paper
Select and Attend: Towards Controllable Content Selection in Text Generation
2019cites this paper
Boosting Variational Generative Model via Condition Enhancing and Lexical-Editing
2019cites this paper
Ordinal and Attribute Aware Response Generation in a Multimodal Dialogue System
2019cites this paper
Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems
2019cites this paper
Improving Multi-turn Dialogue Modelling with Utterance ReWriter
2019cites this paper
Courteously Yours: Inducing courteous behavior in Customer Care responses using Reinforced Pointer Generator Network
2019cites this paper
Revision in Continuous Space: Fine-Grained Control of Text Style Transfer
2019cites this paper
Generative Multi-Turn Chatbot Using Generative Adversarial Network
2018cites this paper
NEXUS Network: Connecting the Preceding and the Following in Dialogue Generation
2018cites this paper
Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Diversity
2018cites this paper
Importance of Search and Evaluation Strategies in Neural Dialogue Modeling
2018cites this paper
Modeling Psychotherapy Dialogues with Kernelized Hashcode Representations: A Nonparametric Information-Theoretic Approach.
2018cites this paper
Aiming to Know You Better Perhaps Makes Me a More Engaging Dialogue Partner
2018cites this paper
DialogWAE: Multimodal Response Generation with Conditional Wasserstein Auto-Encoder
2018influential citation
SAGNet
2018cites this paper