Academic Paper

Low-Rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Document Type
Conference
Source
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp. 1-8, Dec. 2023
Subject
Signal Processing and Analysis
Training
Degradation
Adaptation models
Costs
Computational modeling
Conferences
Computer architecture
Low-rank adaptation
neural language model rescoring
parameter-efficient speech recognition
Language
English
Abstract
We propose a neural language modeling system based on low-rank adaptation (LoRA) for speech recognition output rescoring. Although pretrained language models (LMs) like BERT have shown superior performance in second-pass rescoring, the high computational cost of scaling up the pretraining stage and adapting the pretrained models to specific domains limits their practical use in rescoring. Here we present a method based on low-rank decomposition to train a rescoring BERT model and adapt it to new domains using only a fraction (0.08%) of the pretrained parameters. The inserted low-rank matrices are optimized through a discriminative training objective along with a correlation-based regularization loss. The proposed low-rank adaptation RescoreBERT (LoRB) architecture is evaluated on LibriSpeech and internal datasets, with training times reduced by factors between 3.6 and 5.4.
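As a rough illustration of the mechanism the abstract describes (inserting trainable low-rank matrices while keeping the pretrained weights frozen), the following is a minimal sketch assuming PyTorch. The class name LoRALinear and the hyperparameters r and alpha are illustrative assumptions, not details from the paper; the paper's discriminative training objective and correlation-based regularization loss are not shown here.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update: W x + (alpha/r) * B A x.
    Illustrative sketch; names and defaults are not from the paper."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # pretrained weights stay frozen
        # Low-rank factors: A starts small and random, B starts at zero,
        # so the wrapped layer initially reproduces the pretrained model.
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Only A and B receive gradients; the frozen path is unchanged.
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

# Usage: wrap a projection inside a pretrained model; only 2 * r * d
# parameters are trainable instead of the full d * d weight matrix.
layer = LoRALinear(nn.Linear(768, 768), r=8)
out = layer(torch.randn(4, 768))
```

Because the trainable parameter count grows with the rank r rather than with the full weight matrix, this construction is what allows adaptation with only a small fraction of the pretrained parameters, as the abstract reports.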