학술논문

Manipulation of the Fundamental Frequency Micro-Variations using a Fully Parametric and Computationally Efficient Speech Model
Document Type
Conference
Source
2020 IEEE Workshop on Signal Processing Systems (SiPS) Signal Processing Systems (SiPS), 2020 IEEE Workshop on. :1-6 Oct, 2020
Subject
Components, Circuits, Devices and Systems
Computing and Processing
Signal Processing and Analysis
Harmonic analysis
Vocoders
Speech processing
Analytical models
Estimation
Feature extraction
Real-time systems
Fundamental frequency contour
microvariations
speech processing
parametric model
perceptual tests
Language
ISSN
2374-7390
Abstract
In this paper, we present a computationally efficient and fully parametric harmonic speech model that is suitable for real-time flexible frame-based analysis and synthesis implementation in the frequency domain. We carry out a performance comparison between this vocoder and similar ones, such as WORLD and HPMD. Then, a deliberate manipulation of the speaker's fundamental frequency micro-variations is performed in order to understand in which way it conveys prosodic and idiosyncratic information. We conclude our discussion by evaluating the impact of these manipulations through the realization of perceptual tests.