Task-independent Recognition of Communication Skills in Group Interaction Using Time-series Modeling

Published 2021 in ACM Trans. Multim. Comput. Commun. Appl.

ABSTRACT

Case studies of group discussions are considered an effective way to assess communication skills (CS). This method can help researchers evaluate participants’ engagement with each other in a specific realistic context. In this article, multimodal analysis was performed to estimate CS indices using a three-task-type group discussion dataset, the MATRICS corpus. The current research investigated the effectiveness of engaging both static and time-series modeling, especially in task-independent settings. This investigation aimed to understand three main points: first, the effectiveness of time-series modeling compared to nonsequential modeling; second, multimodal analysis in a task-independent setting; and third, important differences to consider when dealing with task-dependent and task-independent settings, specifically in terms of modalities and prediction models. Several modalities were extracted (e.g., acoustics, speaking turns, linguistic-related movement, dialog tags, head motions, and face feature sets) for inferring the CS indices as a regression task. Three predictive models, including support vector regression (SVR), long short-term memory (LSTM), and an enhanced time-series model (an LSTM model with a combination of static and time-series features), were taken into account in this study. Our evaluation was conducted by using the R2 score in a cross-validation scheme. The experimental results suggested that time-series modeling can improve the performance of multimodal analysis significantly in the task-dependent setting (with the best R2 = 0.797 for the total CS index), with word2vec being the most prominent feature. Unfortunately, highly context-related features did not fit well with the task-independent setting. Thus, we propose an enhanced LSTM model for dealing with task-independent settings, and we successfully obtained better performance with the enhanced model than with the conventional SVR and LSTM models (the best R2 = 0.602 for the total CS index). In other words, our study shows that a particular time-series modeling can outperform traditional nonsequential modeling for automatically estimating the CS indices of a participant in a group discussion with regard to task dependency.

PUBLICATION RECORD

Publication year
2021
Venue
ACM Trans. Multim. Comput. Commun. Appl.
Publication date
2021-11-12
Fields of study
Computer Science
Identifiers
DOI 10.1145/3450283
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Multimodal BigFive Personality Trait Analysis Using Communication Skill Indices and Multiple Discussion Types Dataset
2019cited by this paper
Recent trends in deep learning based personality detection
2019cited by this paper
A Sequential Data Analysis Approach to Detect Emergent Leaders in Small Groups
2019cited by this paper
OpenFace 2.0: Facial Behavior Analysis Toolkit
2018cited by this paper
Using Interlocutor-Modulated Attention BLSTM to Predict Personality Traits in Small Group Interaction
2018cited by this paper
The Handbook of Communication Skills
2018influential reference
Automatic assessment of communication skill in interview-based interactions
2018cited by this paper
Annotating and modeling empathy in spoken conversations
2017cited by this paper
Estimating communication skills using dialogue acts and nonverbal features in multiple discussion datasets
2016influential reference
Convolutional Experts Constrained Local Model for Facial Landmark Detection
2016cited by this paper
Estimating Communication Skills based on Multimodal Information in Group Discussions
2016cited by this paper
A Survey on perceived speaker traits: Personality, likability, pathology, and the first challenge
2015cited by this paper
Automatic Recognition of Emergent Social Roles in Small Group Interactions
2015cited by this paper
Rendering of Eyes for Eye-Shape Registration and Gaze Estimation
2015cited by this paper
Automated Analysis and Prediction of Job Interview Performance
2015cited by this paper
Evaluating Speech, Face, Emotion and Body Movement Time-series Features for Automated Multimodal Presentation Scoring
2015cited by this paper
Computational Analysis of Persuasiveness in Social Multimedia: A Novel Dataset and Multimodal Prediction Approach
2014cited by this paper
Hire me: Computational Inference of Hirability in Employment Interviews Based on Nonverbal Behavior
2014cited by this paper
Distributed Representations of Sentences and Documents
2014cited by this paper
Gene Selection for Cancer Classification using Support Vector Machines
2014cited by this paper
Predicting Influential Statements in Group Discussions using Speech and Head Motion Information
2014cited by this paper
Efficient Estimation of Word Representations in Vector Space
2013cited by this paper
One of a kind: inferring personality impressions in meetings
2013cited by this paper
A Nonverbal Behavior Approach to Identify Emergent Leaders in Small Groups
2012cited by this paper
The INTERSPEECH 2012 Speaker Trait Challenge
2012cited by this paper
Multimodal prediction of expertise and leadership in learning groups
2012cited by this paper
Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions
2012cited by this paper
FaceTube: predicting personality from facial expressions of emotion in online conversational video
2012cited by this paper
Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus
2012cited by this paper
Listening and Message Interpretation
2011cited by this paper
Opensmile: the munich versatile and fast open-source audio feature extractor
2010cited by this paper
Robots in the wild: observing human-robot social interaction outside the lab
2006cited by this paper
Recognizing facial expression: machine learning and application to spontaneous behavior
2005cited by this paper
The ICSI Meeting Recorder Dialog Act (MRDA) Corpus
2004cited by this paper
Applying Conditional Random Fields to Japanese Morphological Analysis
2004cited by this paper
Handbook of communication and social interaction skills
2003influential reference
Gene Selection for Cancer Classification using Support Vector Machines
2002cited by this paper
Communication Under the Microscope: The Theory and Practice of Microanalysis
2002cited by this paper
Recognizing Action Units for Facial Expression Analysis
2001cited by this paper
The Harvard Business School Guide to Careers in Management Consulting
2000cited by this paper
Long Short-Term Memory
1997cited by this paper
Coding Dialogs with the DAMSL Annotation Scheme
1997cited by this paper
Support Vector Regression Machines
1996influential reference
: The Conduct of Inquiry: Methodology for Behavioral Science
1965cited by this paper
Communicative Competence
year unknowncited by this paper

CITED BY

Modeling of Small Groups in Computational Sciences: A Prospecting Review
2024cites this paper
Multimodal Analysis for Communication Skill and Self-Efficacy Level Estimation in Job Interview Scenario
2022cites this paper