학술논문

Can Spontaneous Emotions be Detected from Speech on TV Political Debates?
Document Type
Conference
Source
2019 10th IEEE International Conference on Cognitive Infocommunications (CogInfoCom) Cognitive Infocommunications (CogInfoCom), 2019 10th IEEE International Conference on. :289-294 Oct, 2019
Subject
Communication, Networking and Broadcast Technologies
General Topics for Engineers
Robotics and Control Systems
Human computer interaction
TV
Affective computing
Speech recognition
Emotion recognition
Support vector machines
Feedforward neural networks
speech processing
emotion detection from Speech
human-AI
affective computing
Language
Abstract
Decoding emotional states from multimodal signals is an increasingly active domain, within the framework of affective computing, which aims to a better understanding of Human-Human Communication as well as to improve Human- Computer Interaction. But the automatic recognition of spontaneous emotions from speech is a very complex task due to the lack of a certainty of the speaker states as well as to the difficulty to identify a variety of emotions in real scenarios.In this work we explore the extent to which emotional states can be decoded from speech signals extracted from TV political debates. The labelling procedure was supported by perception experiments where only a small set of emotions has been identified. In addition, some scaled judgements of valence, arousal and dominance were also provided. In this framework the paper shows meaningful comparisons between both, the dimensional and the categorical models of emotions, which is a new contribution when dealing with spontaneous emotions. To this end Support Vector Machines (SVM) as well as Feedforward Neural Networks (FNN) have been proposed to develop classifiers and predictors. The experimental evaluation over a Spanish corpus has shown the ability of both models to be identified in speech segments by the proposed artificial systems.