학술논문

Tangerine: a large vocabulary Mandarin dictation system
Document Type
Conference
Source
1995 International Conference on Acoustics, Speech, and Signal Processing Acoustics, speech and signal processing Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on. 1:77-80 vol.1 1995
Subject
Signal Processing and Analysis
Components, Circuits, Devices and Systems
Vocabulary
Cepstral analysis
Mel frequency cepstral coefficient
Cables
Speech recognition
Hidden Markov models
Speech analysis
Background noise
Noise reduction
Natural languages
Language
ISSN
1520-6149
Abstract
The text input for non-alphabetic languages, such as Chinese, has been a decades-long problem. Chinese dictation using large vocabulary speech recognition provides a convenient mode of text entry. In contrast to a character based dictation system, a word-based Mandarin dictation system has been designed (based on Apple's PlainTalk speech recognition technology for efficient entry of Chinese characters into a computer. New features and improvements to the dictation system are presented. The new features and improvements have produced an overall reduction in recognition error of 50-80%. The vocabulary has also been increased from 5000 words to over 11000 words.