Building open-domain chatbots is a challenging area for machine learning research. While prior work has shown that scaling neural models in the number of parameters and the size of the data they are trained on gives improved results, we highlight other ingredients. Good conversation requires blended skills: providing engaging talking points, and displaying knowledge, empathy and personality appropriately, while maintaining a consistent persona. We show that large scale models can learn these skills when given appropriate training data and choice of generation strategy. We build variants of these recipes with 90M, 2.7B and 9.4B parameter models, and make our models and code publicly available. Human evaluations show our best models outperform existing approaches in multi-turn dialogue on engagingness and humanness measurements. We then discuss the limitations of this work by analyzing failure cases of our models.
Recipes for Building an Open-Domain Chatbot
Stephen Roller,Emily Dinan,Naman Goyal,Da Ju,Mary Williamson,Yinhan Liu,Jing Xu,Myle Ott,Kurt Shuster,Eric Michael Smith,Y-Lan Boureau,J. Weston
Published 2020 in Conference of the European Chapter of the Association for Computational Linguistics
ABSTRACT
PUBLICATION RECORD
- Publication year
2020
- Venue
Conference of the European Chapter of the Association for Computational Linguistics
- Publication date
2020-04-28
- Fields of study
Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
CONCEPTS
- empathy
A conversational skill the chatbot models are trained to display appropriately in dialogue.
AK (4715169a40) extractionAnonymous (12632b8b5f) review - engagingness
A human-evaluated measure of how compelling the chatbot feels in conversation.
AK (4715169a40) extractionAnonymous (12632b8b5f) review - generation strategy
The decoding choice that helps determine conversation quality in the chatbot recipes studied here.
AK (4715169a40) extractionAnonymous (12632b8b5f) review - humanness
A human evaluation metric used to judge how human-like chatbot responses appear.
AK (4715169a40) extractionAnonymous (12632b8b5f) review - multi-turn dialogue
An open-domain conversation task used to evaluate chatbot engagingness and humanness.
AK (4715169a40) extractionAnonymous (12632b8b5f) review - neural language model
A large neural model used to learn open-domain conversational behavior from training data.
Aliases: neural models
AK (4715169a40) extractionAnonymous (12632b8b5f) review - open-domain chatbot
A dialogue system designed to hold broad multi-turn conversations across unrestricted topics.
Aliases: open-domain chatbots
AK (4715169a40) extractionAnonymous (12632b8b5f) review - persona consistency
The chatbot's ability to maintain a stable persona across turns in conversation.
Aliases: consistent persona
AK (4715169a40) extractionAnonymous (12632b8b5f) review
REFERENCES
Showing 1-61 of 61 references · Page 1 of 1