Adversarial Learning for Neural Dialogue Generation

Jiwei Li,Will Monroe,Tianlin Shi,Sébastien Jean,Alan Ritter,Dan Jurafsky

Published 2017 in Conference on Empirical Methods in Natural Language Processing

ABSTRACT

We apply adversarial training to open-domain dialogue generation, training a system to produce sequences that are indistinguishable from human-generated dialogue utterances. We cast the task as a reinforcement learning problem where we jointly train two systems: a generative model to produce response sequences, and a discriminator—analagous to the human evaluator in the Turing test— to distinguish between the human-generated dialogues and the machine-generated ones. In this generative adversarial network approach, the outputs from the discriminator are used to encourage the system towards more human-like dialogue. Further, we investigate models for adversarial evaluation that uses success in fooling an adversary as a dialogue evaluation metric, while avoiding a number of potential pitfalls. Experimental results on several metrics, including adversarial evaluation, demonstrate that the adversarially-trained system generates higher-quality responses than previous baselines

PUBLICATION RECORD

Publication year
2017
Venue
Conference on Empirical Methods in Natural Language Processing
Publication date
2017-01-23
Fields of study
Computer Science
Identifiers
DOI 10.18653/v1/D17-1230 arXiv 1701.06547
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

GENERATIVE ADVERSARIAL NETS
2018cited by this paper
Generating Long and Diverse Responses with Neural Conversation Models
2017cited by this paper
Aspect-augmented Adversarial Networks for Domain Adaptation
2017cited by this paper
Adversarial Evaluation of Dialogue Models
2017cited by this paper
Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses
2017cited by this paper
Generative Deep Neural Networks for Dialogue: A Short Review
2016cited by this paper
An Actor-Critic Algorithm for Sequence Prediction
2016cited by this paper
Mastering the game of Go with deep neural networks and tree search
2016influential reference
Deep Active Learning for Dialogue Generation
2016cited by this paper
Improved Techniques for Training GANs
2016cited by this paper
Incorporating Loose-Structured Knowledge into LSTM with Recall Gate for Conversation Modeling
2016cited by this paper
Strategy and Policy Learning for Non-Task-Oriented Conversational Systems
2016influential reference
Sequence-to-Sequence Learning as Beam-Search Optimization
2016cited by this paper
A Persona-Based Neural Conversation Model
2016influential reference
Deep Reinforcement Learning for Dialogue Generation
2016influential reference
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
2016influential reference
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
2016cited by this paper
Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation
2016cited by this paper
On the Evaluation of Dialogue Systems with Next Utterance Classification
2016cited by this paper
How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation
2016cited by this paper
Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification
2016cited by this paper
A Network-based End-to-End Trainable Task-oriented Dialogue System
2016cited by this paper
A Simple, Fast Diverse Decoding Algorithm for Neural Generation
2016influential reference
A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues
2016cited by this paper
Professor Forcing: A New Algorithm for Training Recurrent Networks
2016influential reference
Online Sequence-to-Sequence Reinforcement Learning for Open-Domain Conversational Agents
2016cited by this paper
LSTM based Conversation Models
2016influential reference
Continuously Learning Neural Dialogue Management
2016cited by this paper
Incorporating loose-structured knowledge into conversation modeling via recall-gate LSTM
2016cited by this paper
Minimum Risk Training for Neural Machine Translation
2015cited by this paper
A Neural Network Approach to Context-Sensitive Generation of Conversational Responses
2015influential reference
A Hierarchical Neural Autoencoder for Paragraphs and Documents
2015cited by this paper
Neural Responding Machine for Short-Text Conversation
2015cited by this paper
Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks
2015cited by this paper
A Diversity-Promoting Objective Function for Neural Conversation Models
2015cited by this paper
A Neural Attention Model for Abstractive Sentence Summarization
2015cited by this paper
Effective Approaches to Attention-based Neural Machine Translation
2015cited by this paper
Generating Sentences from a Continuous Space
2015cited by this paper
RECURRENT NEURAL NETWORKS
2015cited by this paper
Attention with Intention for a Neural Network Conversation Model
2015cited by this paper
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
2015cited by this paper
A Neural Conversational Model
2015cited by this paper
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models
2015cited by this paper
Sequence to Sequence Learning with Neural Networks
2014cited by this paper
Neural Machine Translation by Jointly Learning to Align and Translate
2014cited by this paper
Data-Driven Response Generation in Social Media
2011influential reference
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning
2004cited by this paper
Stochastic Optimization
2003cited by this paper
Learning to classify text using support vector machines - methods, theory and algorithms
2002cited by this paper
Long Short-Term Memory
1997influential reference
Likelihood ratio gradient estimation for stochastic systems
1990cited by this paper
Computing Machinery and Intelligence
1950cited by this paper

CITED BY

Reinforced Decoder: Towards Training Recurrent Neural Networks for Time Series Forecasting
2026cites this paper
Learning with Multiple Correct Answers -- A Trichotomy of Regret Bounds under Different Feedback Models
2026influential citation
ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders
2026cites this paper
Limitation Learning: Catching Adverse Dialog with GAIL
2025cites this paper
Text adversarial attacks using policy gradients against deep learning classifiers
2025cites this paper
Dual-LLM Adversarial Framework for Information Extraction from Research Literature
2025cites this paper
Integrated gradients-based defense against adversarial word substitution attacks
2025cites this paper
Adaptive Social Learning via Mode Policy Optimization for Language Agents
2025cites this paper
Conversations From Make-Believe: An Attentive Encoder–Decoder Chatbot Trained on Scripted Dialogue
2025cites this paper
Who's Asking? Simulating Role-Based Questions for Conversational AI Evaluation
2025cites this paper
DIAL: Direct Iterative Adversarial Learning for Realistic Multi-Turn Dialogue Simulation
2025influential citation
Enhancing LLM-Based Social Bot via an Adversarial Learning Framework
2025cites this paper
A Diffusion-TGAN Framework for Spatio-Temporal Speed Imputation and Trajectory Reconstruction
2025cites this paper
High-Quality Trajectory Generation via Domain-Knowledge Enhanced GANs
2025cites this paper
Generation of Mobility Patterns for Private Vehicles using Multi-headed Sequence Generative Adversarial Networks
2025cites this paper
Generative Networks for Image Generation in East Asia’s Innovation Systems: A Bibliometric Analysis
2025cites this paper
Dialogue-Based Disease Diagnosis Using Hierarchical Reinforcement Learning with Multi-Expert Feedback
2025cites this paper
Towards Cost-Effective Reward Guided Text Generation
2025cites this paper
Improving Matching Models With Contextual Attention for Multi-Turn Response Selection in Retrieval-Based Chatbots
2025cites this paper
Personalized Product Description Generation With Gated Pointer-Generator Transformer
2025cites this paper
Measuring the Robustness of Reference-Free Dialogue Evaluation Systems
2025cites this paper
Enhancing GPT-2 Text Generation Through Reinforcement Learning and Token-Based Environments
2025cites this paper
Neural Methods for Data-to-text Generation
2024cites this paper
A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation
2024cites this paper
An Emotional Dialogue System Using Conditional Generative Adversarial Networks with a Sequence-to-Sequence Transformer Encoder
2024cites this paper
Position-based focal loss for diverse and relevant response generation
2024cites this paper
EnronSR: A Benchmark for Evaluating AI-Generated Email Replies
2024cites this paper
A Generative Adversarial Framework for Dialogue Generation with Neural Architecture Search
2024cites this paper
HIPPL: Hierarchical Intent-Inferring Pointer Network With Pseudo Labeling for Consistent Persona-Driven Dialogue Generation [Research Frontier]
2024cites this paper
Wearable Wisdom: A Bi-Modal Behavioral Biometric Scheme for Smartwatch User Authentication
2024cites this paper
Harnessing Business and Media Insights with Large Language Models
2024cites this paper
Adversarial Attack and Defense for Transductive Support Vector Machine
2024cites this paper
RoBERTa-Augmented Synthesis for Detecting Malicious API Requests
2024cites this paper
SHADE: Speaker-History-Aware Dialog Generation Through Contrastive and Prompt Learning
2024cites this paper
Generative Adversarial Network-Based Data Augmentation for Enhancing Wireless Physical Layer Authentication
2024cites this paper
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator
2024cites this paper
Advancements and Challenges in Continual Learning for Natural Language Processing: Insights and Future Prospects
2024cites this paper
A deep pedestrian trajectory generator for complex indoor environments
2024cites this paper
Intent-Aware Dialogue Generation and Multi-Task Contrastive Learning for Multi-Turn Intent Classification
2024cites this paper
Multimodal Dialogue Systems via Capturing Context-aware Dependencies and Ordinal Information of Semantic Elements
2024cites this paper
CNN-Based Metrics for Performance Evaluation of Generative Adversarial Networks
2024cites this paper
Construction and application of a novel WGAN-CNN-based predicting approach for dust concentration at underground coal mine working faces
2024cites this paper
A Gradient Analysis Framework for Rewarding Good and Penalizing Bad Examples in Language Models
2024cites this paper
ProcessGAN: Generating Privacy-Preserving Time-Aware Process Data with Conditional Generative Adversarial Nets
2024cites this paper
The Critique of Critique
2024cites this paper
Cross-Domain Requirements Linking via Adversarial-based Domain Adaptation
2023cites this paper
User Behavior Simulation with Large Language Model-based Agents
2023cites this paper
VOLTA: Improving Generative Diversity by Variational Mutual Information Maximizing Autoencoder
2023cites this paper
CAPTCHA Types and Breaking Techniques: Design Issues, Challenges, and Future Research Directions
2023cites this paper
Controlled physics-informed data generation for deep learning-based remaining useful life prediction under unseen operation conditions
2023cites this paper
A Systematic survey on automated text generation tools and techniques: application, evaluation, and challenges
2023cites this paper
Adversarial learning of neural user simulators for dialogue policy optimisation
2023cites this paper
Neural Attention Model for Abstractive Text Summarization Using Linguistic Feature Space
2023cites this paper
Contrastive Adversarial Training for Multi-Modal Machine Translation
2023cites this paper
Application of conditional generative adversarial network to multi-step car-following modeling
2023cites this paper
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management
2023cites this paper
Toward Connecting Speech Acts and Search Actions in Conversational Search Tasks
2023cites this paper
Contrastive Learning with Dialogue Attributes for Neural Dialogue Generation
2023cites this paper
Improving Rumor Detection by Promoting Information Campaigns With Transformer-Based Generative Adversarial Learning
2023cites this paper
Open-Domain Text Evaluation via Contrastive Distribution Methods
2023cites this paper
Path-based multi-hop reasoning over knowledge graph for answering questions via adversarial reinforcement learning
2023cites this paper
Generative Methods for Social Media Analysis
2023influential citation
Improved Training Of Mixture-Of-Experts Language GANs
2023cites this paper
Commonsense-Aware Prompting for Controllable Empathetic Dialogue Generation
2023cites this paper
Learning Multi-turn Response Selection in Grounded Dialogues with Reinforced Knowledge and Context Distillation
2023cites this paper
GroupAligner: A Deep Reinforcement Learning with Domain Adaptation for Social Group Alignment
2023influential citation
EmoKbGAN: Emotion controlled response generation using Generative Adversarial Network for knowledge grounded conversation
2023cites this paper
Predicting Visit Cost of Obstructive Sleep Apnea Using Electronic Healthcare Records With Transformer
2023cites this paper
Metadial: A Meta-learning Approach for Arabic Dialogue Generation
2023cites this paper
Algorithm Comparison and Evaluation of GAN Models Based on Image Transferring from Desert to Green Field
2023cites this paper
A Unified Generative Adversarial Learning Framework for Improvement of Skip-Gram Network Representation Learning Methods
2023cites this paper
Open-Domain Text Evaluation via Meta Distribution Modeling
2023cites this paper
AIGC for Various Data Modalities: A Survey
2023cites this paper
Image Inpainting Using PatchGAN
2023cites this paper
Sentiment Aided Graph Attentive Contextualization for Task Oriented Negotiation Dialogue Generation
2023cites this paper
Generative Adversarial Networks (GANs) in Computer-Generated Imagery
2023cites this paper
Generative Adversarial Network (GAN) in Social Network: Introduction, Applications, Challenges and Future Directions
2023cites this paper
Multi-Source Probing for Open-Domain Conversational Understanding
2023influential citation
Information-Enhanced Hierarchical Self-Attention Network for Multiturn Dialog Generation
2023cites this paper
MRRL: Modifying the Reference via Reinforcement Learning for Non-Autoregressive Joint Multiple Intent Detection and Slot Filling
2023cites this paper
Adaptively Multi-Objective Adversarial Training for Medical Image Report Generation
2023cites this paper
A Missing Traffic Data Imputation Method Based on a Diffusion Convolutional Neural Network–Generative Adversarial Network
2023cites this paper
Partially Randomizing Transformer Weights for Dialogue Response Diversity
2023cites this paper
Two Birds with One Stone: Boosting Code Generation and Code Search via a Generative Adversarial Network
2023cites this paper
Lost in Dialogue: A Review and Categorisation of Current Dialogue System Approaches and Technical Solutions
2023cites this paper
Adversarial Conversational Shaping for Intelligent Agents
2023cites this paper
Feel You ’ : Enhancing conversational agents with empathy Name :
2023cites this paper
Learning to Rank Generation with Pairwise Partial Rewards
2023cites this paper
Adversarial Training with Comprehensive Objective for Medical Image Report Generation
2023cites this paper
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
2023cites this paper
Reinforcement Learning for Generative AI: A Survey
2023cites this paper
Bilevel Scheduled Sampling for Dialogue Generation
2023cites this paper
An Efficient 1 Iteration Learning Algorithm for Gaussian Mixture Model And Gaussian Mixture Embedding For Neural Network
2023cites this paper
Prediction of Time Series Using Generative Adversarial Networks
2023cites this paper
Deep Generative Modeling-based Data Augmentation with Demonstration using the BFBT Benchmark Void Fraction Datasets
2023cites this paper
Hi Model, generating 'nice' instead of 'good' is not as bad as generating 'rice'! Towards Context and Semantic Infused Dialogue Generation Loss Function and Evaluation Metric
2023cites this paper
Reinforcing personalized persuasion in task-oriented virtual sales assistant
2023cites this paper
Chatbots & Dialogue Systems
2023cites this paper
CL-CSP: Contrastive Learning with Continuous Semantic Perturbations for Neural Dialogue Generation
2023cites this paper
Graph Contrastive Learning with Generative Adversarial Network
2023cites this paper