학술논문

An Empirical Study on Speech Recognition Performance in Low-Resource Radio Environments

Document Type

Conference

Author

Lu, Yunzhao; Lam, Sam W. O.; Lee, Justin Y. Y.; Ji, Jie; Chan, Asa M. C.

Source

2023 IEEE Global Conference on Artificial Intelligence and Internet of Things (GCAIoT) Artificial Intelligence and Internet of Things (GCAIoT), 2023 IEEE Global Conference on. :29-33 Dec, 2023

Subject

Communication, Networking and Broadcast Technologies
Computing and Processing
General Topics for Engineers
Transportation
Railway communication
Protocols
System performance
Training data
Speech recognition
Speech enhancement
Reliability engineering
Speech Recognition
Language Model
Radio Communication

Language

Abstract

This work presents the development of a robust speech recognition engine for a railway communication system using radio signals with a sampling frequency of no more than 8000 Hz, which can perform robust speech transcription in noisy environments. A domain-specific language model is constructed from limited training data and domain-specific corpus based on finetuned weights. And this corpus is created from a template of spreadsheet containing a Jargon table of common terms and a Corpus of common sentences tuned as conversation aligned to radio communication protocols. For evaluation, it is reported the model’s performance is robust comparing to human accuracy on a portion of trained data; while for un-trained data, the model out-performs human by 83.8% vs. 82% in accuracy. The design architecture of the speech recognition engine provides a robust and reliable solution for low-resource radio signal communication and can even out-perform human on poor quality data.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송