학술논문

DNN Based Expressive Text-to-Speech with Limited Training Data

Document Type

Conference

Author

Suzie, Sinisa; Nosek, Tijana; Secujski, Milan; Pekar, Darko; Delie, Vlado

Source

2019 27th Telecommunications Forum (TELFOR) Telecommunications Forum (TELFOR), 2019 27th. :1-6 Nov, 2019

Subject

Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineering Profession
Fields, Waves and Electromagnetics
Signal Processing and Analysis
deep neural networks
expressive speech
style transplantation
text-to-speech synthesis

Language

Abstract

Modern text-to-speech synthesis systems should deliver speech which is not just intelligible, but whose style corresponds to the domain in which synthesized speech is used. In this paper three approaches based on deep neural networks aimed at synthesis of expressive speech are presented: style code, model re-training and an architecture using shared hidden layers. Their usability is tested on a speech corpus with a limited amount of expressive speech data. A new architecture for transplanting speech styles is also presented and compared with a referent approach from literature.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송