Bidirectional parallel echo state network for speech emotion recognition

Published 2022 in Neural computing & applications (Print)

ABSTRACT

Speech is an effective way for communicating and exchanging complex information between humans. Speech signal has involved a great attention in human-computer interaction. Therefore, emotion recognition from speech has become a hot research topic in the field of interacting machines with humans. In this paper, we proposed a novel speech emotion recognition system by adopting multivariate time series handcrafted feature representation from speech signals. Bidirectional echo state network with two parallel reservoir layers has been applied to capture additional independent information. The parallel reservoirs produce multiple representations for each direction from the bidirectional data with two stages of concatenation. The sparse random projection approach has been adopted to reduce the high-dimensional sparse output for each direction separately from both reservoirs. Random over-sampling and random under-sampling methods are used to overcome the imbalanced nature of the used speech emotion datasets. The performance of the proposed parallel ESN model is evaluated from the speaker-independent experiments on EMO-DB, SAVEE, RAVDESS, and FAU Aibo datasets. The results show that the proposed SER model is superior to the single reservoir and the state-of-the-art studies.

PUBLICATION RECORD

Publication year
2022
Venue
Neural computing & applications (Print)
Publication date
2022-05-31
Fields of study
Medicine, Computer Science
Identifiers
DOI 10.1007/s00521-022-07410-2 PMID 35669535 PMCID 9152839
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Automatic children's personality assessment from emotional speech
2022cited by this paper
Speech Emotion Recognition by Late Fusion for Bidirectional Reservoir Computing With Random Projection
2021influential reference
Deep speaker conditioning for speech emotion recognition
2021influential reference
A modified feature selection method based on metaheuristic algorithms for speech emotion recognition
2021cited by this paper
Augmented Audio Data in Improving Speech Emotion Classification Tasks
2021cited by this paper
Grouped Multi-Layer Echo State Networks with Self-Normalizing Activations
2021cited by this paper
Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models
2021cited by this paper
Combining a parallel 2D CNN with a self-attention Dilated Residual Network for CTC-based discrete speech emotion recognition
2021influential reference
Speech emotion recognition using recurrent neural networks with directional self-attention
2021cited by this paper
Functional deep echo state network improved by a bi-level optimization approach for multivariate time series classification
2021cited by this paper
Deep Echo State Network With Multiple Adaptive Reservoirs for Time Series Prediction
2021cited by this paper
Speech emotion recognition based on formant characteristics feature extraction and phoneme type convergence
2021cited by this paper
Similarity of Speech Emotion in Different Languages Revealed by a Neural Network with Attention
2020cited by this paper
MLT-DNet: Speech emotion recognition using 1D dilated CNN based on multi-learning trick approach
2020cited by this paper
Call Redistribution for a Call Center Based on Speech Emotion Recognition
2020cited by this paper
Speech emotion recognition using hybrid spectral-prosodic features of speech signal/glottal waveform, metaheuristic-based dimensionality reduction, and Gaussian elliptical basis function network classifier
2020cited by this paper
Speech Emotion Recognition Based on Selective Interpolation Synthetic Minority Over-Sampling Technique in Small Sample Environment
2020cited by this paper
Continuous Speech Emotion Recognition with Convolutional Neural Networks
2020cited by this paper
Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition
2019cited by this paper
Speech emotion recognition based on DNN-decision tree SVM model
2019cited by this paper
Cepstral Derivatives in MFCCs for Emotion Recognition
2019cited by this paper
Multiscale Amplitude Feature and Significance of Enhanced Vocal Tract Information for Emotion Classification
2019cited by this paper
Speech Emotion Recognition From 3D Log-Mel Spectrograms With Deep Learning Network
2019cited by this paper
Exploring Deep Spectrum Representations via Attention-Based Recurrent and Convolutional Neural Networks for Speech Emotion Recognition
2019cited by this paper
Learning From Imbalanced Data
2019cited by this paper
Short Utterance Based Speech Language Identification in Intelligent Vehicles With Time-Scale Modifications and Deep Bottleneck Features
2019cited by this paper
Parallelized Convolutional Recurrent Neural Network With Spectral Features for Speech Emotion Recognition
2019cited by this paper
Reservoir Topology in Deep Echo State Networks
2019cited by this paper
Reservoir Computing Approaches for Representation and Classification of Multivariate Time Series
2018cited by this paper
On the Statistical Challenges of Echo State Networks and Some Potential Remedies
2018influential reference
Wind Power Forecasting Based on Echo State Networks and Long Short-Term Memory
2018cited by this paper
Genesis of Basic and Multi-Layer Echo State Network Recurrent Autoencoders for Efficient Data Representations
2018cited by this paper
The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English
2018cited by this paper
3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition
2018cited by this paper
Emotion Inferring from Large-scale Internet Voice Data: A Multimodal Deep Learning Approach
2018cited by this paper
Learning from Imbalanced Data Sets
2018cited by this paper
Random Deep Belief Networks for Recognizing Emotions from Speech Signals
2017influential reference
Bidirectional deep-readout echo state networks
2017cited by this paper
Evaluating deep learning architectures for Speech Emotion Recognition
2017cited by this paper
Snore Sound Classification Using Image-Based Deep Spectrum Features
2017cited by this paper
Echo State Property of Deep Reservoir Computing Networks
2017cited by this paper
Comparison of Parametric Representation for Monosyllabic Word Recognition in Continuously Spoken Sentences
2017cited by this paper
Speech emotion recognition with skew-robust neural networks
2017cited by this paper
Deep reservoir computing: A critical experimental analysis
2017cited by this paper
Multilayered Echo State Machine: A Novel Architecture and Algorithm
2017cited by this paper
Functional echo state network for time series classification
2016cited by this paper
An Overview on Data Representation Learning: From Traditional Feature Learning to Recent Deep Learning
2016cited by this paper
Bidirectional reservoir networks trained using SVM+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$+$$\end{document} p
2016cited by this paper
High-level feature representation using recurrent neural network for speech emotion recognition
2015cited by this paper
Emotion Recognition in Car Industry
2015cited by this paper
Automatic Speech Emotion Recognition : feature space dimensionality and classification challenges
2015influential reference
Memristive computational architecture of an echo state network for real-time speech-emotion recognition
2015cited by this paper
A Preliminary Application of Echo State Networks to Emotion Recognition
2014cited by this paper
A new approach of audio emotion recognition
2014cited by this paper
Learning Salient Features for Speech Emotion Recognition Using Convolutional Neural Networks
2014cited by this paper
Anchor Models for Emotion Recognition from Speech
2013cited by this paper
Excitation source and low level descriptor features fusion for emotion recognition using SVM and ANN
2013cited by this paper
Multimodal Emotion Recognition
2013cited by this paper
Practical Bayesian Optimization of Machine Learning Algorithms
2012cited by this paper
Speech emotion recognition: Features and classification models
2012cited by this paper
Training and assessing classification rules with imbalanced data
2012cited by this paper
Automatic speech emotion recognition using modulation spectral features
2011cited by this paper
Machine Audition: Principles, Algorithms and Systems
2010cited by this paper
Opensmile: the munich versatile and fast open-source audio feature extractor
2010cited by this paper
The INTERSPEECH 2009 emotion challenge
2009cited by this paper
A systematic analysis of performance measures for classification tasks
2009cited by this paper
Automatic classification of emotion related user states in spontaneous children's speech
2009cited by this paper
Reservoir computing approaches to recurrent neural network training
2009cited by this paper
Real-Time Emotion Recognition from Speech Using Echo State Networks
2008cited by this paper
Combining frame and turn-level information for robust recognition of emotions within speech
2007cited by this paper
2007 Special Issue: Decoupled echo state networks with lateral inhibition
2007cited by this paper
Very sparse random projections
2006cited by this paper
A database of German emotional speech
2005cited by this paper
Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication
2004cited by this paper
Publisher's Note
2003cited by this paper

CITED BY

Acoustic Feature Excitation-and-Aggregation Network Based on Multi-Task Learning for Speech Emotion Recognition
2025cites this paper
Sign Language Recognition using Bidirectional Reservoir Computing
2025cites this paper
Quantum-enhanced cortical deep echo state network for fast and accurate speech emotion recognition
2025cites this paper
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
2024cites this paper
MBSSA-Bi-AESN: Classification prediction of bi-directional adaptive echo state network based on modified binary salp swarm algorithm and feature selection
2024cites this paper
Newman-Watts-Strogatz topology in deep echo state networks for speech emotion recognition
2024cites this paper
An enhanced speech emotion recognition using vision transformer
2024influential citation
Optimized multi-layer self-attention network for feature-level data fusion in emotion recognition
2024cites this paper
Echo State Network and Sparrow Search: Echo State Network for Modeling the Monthly River Discharge of the Biggest River in Buzău County, Romania
2024cites this paper
Topology-adaptive Bayesian optimization for deep ring echo state networks in speech emotion recognition
2024cites this paper
Automatic Persian Speech Emotion Recognition: A CNN-Based Approach Using PDREC and ShEMO Datasets
2024cites this paper
The Reservoir Topology of Echo State Network
2023cites this paper
Memory augmented echo state network for time series prediction
2023cites this paper
A Parallel-Model Speech Emotion Recognition Network Based on Feature Clustering
2023influential citation
Multimodal and Multitask Learning with Additive Angular Penalty Focus Loss for Speech Emotion Recognition
2023cites this paper
An octonion-based nonlinear echo state network for speech emotion recognition in Metaverse
2023cites this paper
Research on the Auditory Characteristics of Humanoid Robots to Assist the Older Population with Cognitive Impairments
2022cites this paper
A Pattern Recognition Framework for Signal Processing in Metaverse
2022cites this paper