Academic Paper

Analysis of Salient Feature Jitter in the Cochlea for Objective Prediction of Temporally Localized Distortion in Synthesized Speech
Document Type
article
Author
Source
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2009 (2009)
Subject
Acoustics. Sound
QC221-246
Electronic computers. Computer science
QA75.5-76.95
Language
English
ISSN
1687-4714
1687-4722
Abstract
Temporally localized distortions account for the highest variance in subjective evaluation of coded speech signals (Sen (2001) and Hall (2001)). The ability to discern perceptually relevant, temporally localized coding noise and separate it from other types of distortion is both of theoretical importance and a valuable tool for designing and deploying speech synthesis systems. The work described here uses a physiologically motivated cochlear model to provide a tractable analysis of salient feature trajectories as processed by the cochlea. Subsequent statistical analysis shows simple relationships between the jitter of these trajectories and temporal attributes of the Diagnostic Acceptability Measure (DAM).