학술논문

Application of Personalized Emotion Speech Synthesis Technology in Human-Computer Interaction
Document Type
Conference
Source
2023 6th International Conference on Communication Engineering and Technology (ICCET) ICCET Communication Engineering and Technology (ICCET), 2023 6th International Conference on. :150-153 Feb, 2023
Subject
Computing and Processing
Human computer interaction
Emotion recognition
Speech synthesis
Task analysis
human-computer interaction
speech synthesis
Emotional recognition
Language
Abstract
Aiming at the problem that machine speech synthesis technology does not have emotion in human-computer interaction scenes at present, we propose a framework for personalized speech synthesis with emotion in human-computer interaction. Firstly, the emotion that the machine needs to convey is determined by the spoken text that has returned during the interaction process. Next, the Fastspeech 2 speech synthesis model is used to train the relevant individualized voice with emotion. The customized voice with emotion is then synthesized according to the emotion inferred from the text. In real-world scenarios involving emotional human-computer interactions, this technology has worked well.