Engagement Recognition in Spoken Dialogue via Neural Network by Aggregating Different Annotators' Models
K. Inoue, Divesh Lala, K. Takanashi, Tatsuya Kawahara
Published 2018 in Interspeech
ABSTRACT
This paper addresses engagement recognition based on four multimodal listener behaviors: backchannels, laughing, eye-gaze, and head nodding. Engagement is an indicator of how much a user is interested in the current dialogue. Multiple third-party annotators give ground-truth labels of engagement in a human-robot interaction corpus. Since perception of engagement is subjective, the annotations sometimes differ between individual annotators. Conventional methods directly use integrated labels, such as those generated through simple majority voting, and do not consider each annotator's individual perception. We propose a two-step engagement recognition in which each annotator's recognition is modeled and the different annotators' models are aggregated to recognize the integrated label. The proposed neural network consists of two parts. The first part corresponds to each annotator's model, which is trained independently with the corresponding labels. The second part aggregates the different annotators' models to obtain one integrated label. After each part is pre-trained, the whole network is fine-tuned through back-propagation of prediction errors. Experimental results show that the proposed network outperforms baseline models that directly recognize the integrated label without considering differing annotations.
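The two-step architecture described in the abstract can be sketched in outline. The code below is a minimal, hypothetical illustration (not the authors' implementation): one logistic model per annotator over shared behavior features (backchannels, laughing, eye-gaze, head nodding), followed by an aggregation layer that combines the annotator outputs into a single integrated engagement label. All class and variable names are assumptions for illustration; in the paper, each part is pre-trained separately before the whole network is fine-tuned end to end.

```python
import numpy as np

rng = np.random.default_rng(0)


def sigmoid(z):
    """Logistic activation used by both stages."""
    return 1.0 / (1.0 + np.exp(-z))


class TwoStepEngagementModel:
    """Hypothetical sketch of the two-step idea:
    step 1 models each annotator independently,
    step 2 aggregates their outputs into one integrated label."""

    def __init__(self, n_features, n_annotators):
        # One linear model per annotator over the shared behavior features.
        self.W = rng.normal(scale=0.1, size=(n_annotators, n_features))
        self.b = np.zeros(n_annotators)
        # Aggregation layer: initialized as a simple average, analogous
        # to majority voting, then refined during joint fine-tuning.
        self.agg_w = np.ones(n_annotators) / n_annotators
        self.agg_b = 0.0

    def annotator_probs(self, x):
        # Step 1: each annotator model predicts engagement independently.
        return sigmoid(self.W @ x + self.b)

    def predict(self, x):
        # Step 2: aggregate the annotator outputs into the integrated label.
        p = self.annotator_probs(x)
        return sigmoid(self.agg_w @ p + self.agg_b)


model = TwoStepEngagementModel(n_features=4, n_annotators=3)
x = np.ones(4)  # toy feature vector for the four listener behaviors
integrated = model.predict(x)
```

Initializing the aggregation weights to a uniform average makes the untrained second stage behave like soft majority voting, so fine-tuning only has to learn how far to deviate from that baseline.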
PUBLICATION RECORD
- Publication year: 2018
- Venue: Interspeech
- Publication date: 2018-09-02
- Fields of study: Computer Science
- Source metadata: Semantic Scholar
REFERENCES
- 44 references
CITED BY
- 11 citing papers