We propose an online, end-to-end, neural generative conversational model for open-domain dialogue. It is trained using a unique combination of offline two-phase supervised learning and online human-in-the-loop active learning. While most existing research proposes offline supervision or hand-crafted reward functions for online reinforcement, we devise a novel interactive learning mechanism based on hamming-diverse beam search for response generation and one-character user-feedback at each step. Experiments show that our model inherently promotes the generation of semantically relevant and interesting responses, and can be used to train agents with customized personas, moods and conversational styles.
Deep Active Learning for Dialogue Generation
Nabiha Asghar,P. Poupart,Xin Jiang,Hang Li
Published 2016 in International Workshop on Semantic Evaluation
ABSTRACT
PUBLICATION RECORD
- Publication year
2016
- Venue
International Workshop on Semantic Evaluation
- Publication date
2016-12-12
- Fields of study
Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-33 of 33 references · Page 1 of 1
CITED BY
Showing 1-57 of 57 citing papers · Page 1 of 1