학술논문

New time-frequency derived cepstral coefficients for automatic speech recognition
Document Type
Conference
Source
1996 8th European Signal Processing Conference (EUSIPCO 1996) European Signal Processing Conference, 1996. EUSIPCO 1996. 8th. :1-4 Sep, 1996
Subject
Signal Processing and Analysis
Mel frequency cepstral coefficient
Databases
Time-frequency analysis
Wavelet transforms
Speech
Cepstrum
Language
Abstract
The goal is to improve recognition rate by optimisation of Mel Frequency Cepstral Coefficients (MFCCs): modifications concern the time-frequency representation used to estimate these coefficients. There are many ways to obtain a spectrum out of a signal which differ in the method itself (Fourier, Wavelets,…), and in the normalisation. We show here that we can obtain noise resistant cepstral coefficients, for speaker independent connected word recognition. The recognition system is based on a continuous whole word hidden Markov model. An error reduction rate of approximately 50% is achieved. Moreover evaluation tests demonstrate that these results can be obtained with smaller databases: halving the training database have small effects on recognition rates (which is not the case with traditional MFCCs).