Context-Aware Attention Mechanism for Speech Emotion Recognition

Gaetan Ramet, Philip N. Garner, Michael Baeriswyl, Alexandros Lazaridis

2018

In this work, we study the use of attention mechanisms to enhance the performance of the state-of-the-art deep learning model for Speech Emotion Recognition (SER). We introduce a new Long Short-Term Memory (LSTM)-based neural network attention model that takes the temporal information in speech into account when computing the attention vector. The proposed LSTM-based model is evaluated on the IEMOCAP dataset using a 5-fold cross-validation scheme and achieves 68.8% weighted accuracy on 4 classes, outperforming the state-of-the-art models.
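The idea of an attention vector that depends on temporal context can be illustrated with a minimal sketch. This is not the authors' model: the abstract does not give its equations, so a plain tanh recurrence stands in for the LSTM scorer here, and the names `recurrent_scores`, `attention_pool`, `w_in`, and `w_rec` are hypothetical. The sketch only shows how a score computed from the current frame and a running state differs from a score computed from each frame in isolation, and how softmax-normalised scores pool frame-level features into one utterance-level vector.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over scalar scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def recurrent_scores(
    frames: Sequence[Sequence[float]],
    w_in: Sequence[float],
    w_rec: float,
) -> List[float]:
    """One score per frame from a simple tanh recurrence.

    ASSUMPTION: this recurrence is a stand-in for an LSTM scorer, chosen
    to keep the sketch short. Each score depends on the current frame and
    on the previous state, so earlier frames influence later scores.
    """
    state = 0.0
    scores = []
    for frame in frames:
        drive = sum(w * x for w, x in zip(w_in, frame))
        state = math.tanh(drive + w_rec * state)
        scores.append(state)
    return scores


def attention_pool(
    frames: Sequence[Sequence[float]],
    scores: Sequence[float],
) -> List[float]:
    """Weighted sum of frame vectors using softmax-normalised scores."""
    if not frames or len(frames) != len(scores):
        raise ValueError("frames and scores must be non-empty and equal in length")
    weights = softmax(scores)
    dim = len(frames[0])
    return [sum(w * f[d] for w, f in zip(weights, frames)) for d in range(dim)]


if __name__ == "__main__":
    # Three toy 2-dimensional frames; the values are illustrative only.
    frames = [[0.2, 0.1], [0.9, 0.4], [0.3, 0.8]]
    scores = recurrent_scores(frames, w_in=[1.0, -0.5], w_rec=0.7)
    print(attention_pool(frames, scores))
```

With `w_rec` set to zero the scorer reduces to a per-frame score with no temporal context, which gives a simple baseline for comparison.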

Keywords:

Computer science
Speech recognition
Computation
Deep learning
Artificial neural network
Emotion recognition
Feature extraction
Artificial intelligence
Data modeling
Temporal information
Attention model
