학술논문

Analysis of Emotion Recognition from Cross-lingual Speech: Arabic, English, and Urdu

Document Type

Conference

Author

Farhad, Moomal; Ismail, Heba; Harous, Saad; Masud, Mohammad Mehedy; Beg, Azam

Source

2021 2nd International Conference on Computation, Automation and Knowledge Management (ICCAKM) Computation, Automation and Knowledge Management (ICCAKM), 2021 2nd International Conference on. :42-47 Jan, 2021

Subject

Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Emotion recognition
Neural networks
Speech recognition
Feature extraction
Security
Random forests
Robots
Arabic
English
Urdu
Machine learning

Language

Abstract

In a system which involves interaction be- tween machines and humans, the recognition of emotion from audio has always been a focus of research. Emotion recognition can play an essential role in many fields, such as medicine, law, psychology, and customer services. In this paper, we present an empirical comparative analysis of several machine learning classifiers for emotion recognition in audio data. Evaluations are performed for a set of predefined emotions such as happy, sad, and angry from Arabic, English, and Urdu languages. Pitch and cepstral features are extracted from audio files and principal component analysis is applied for dimensionality reduction. Experiments show that random forest outperformed other classifiers on Urdu dataset with an accuracy of 78.75%. However, the performance of Meta iterative classifier on Arabic dataset was better than random forest and neural network with the accuracy of 70%. Classification of emotions on the English dataset, which do not differ much in terms of pitch and MFCC features, generated the lowest accuracies at or below 31%.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송