학술논문

Adapting monolingual resources for code-mixed hindi-english speech recognition
Document Type
Conference
Source
2017 International Conference on Asian Language Processing (IALP) Asian Language Processing (IALP), 2017 International Conference on. :218-221 Dec, 2017
Subject
Computing and Processing
Signal Processing and Analysis
Machine-to-machine communications
code-mixed speech recognition
low-resource acoustic modeling
Language
Abstract
The paper presents an automatic speech recognition (ASR) system for code-mixed read speech in Hindi-English, developed upon the extrapolation of monolingual training resources. A monolingual Hindi acoustic model, mixed with code-mixed speech data has been implemented to train a neural network based speech recognition framework. The testing corpus also follows a similar structure: containing data from both monolingual and code-mixed speech. The shared phonetic transcription, captured in WX notation has been exploited to harness the commonality between the pooled phonesets of Hindi and English. The experiments have been conducted in two separate formulations of a trigram based language model 1) In the first experiment, the language model contains no out-of-vocabulary words, as the test utterances are included in the training of the language model. The word error rate in this case has been obtained to be 10.63 %. 2) In the second experiment, the testing utterances have been excluded from the training language model. The word error rate in this case has been obtained to be 41.66 %.