Academic Paper

Spontaneous Speech Summarization: Transformers All The Way Through

Document Type

Conference

Author

Hayashi, Tomoki; Yoshimura, Takenori; Inuzuka, Masaya; Kuroyanagi, Ibuki; Segawa, Osamu

Source

2021 29th European Signal Processing Conference (EUSIPCO), pp. 456-460, Aug 2021

Subject

Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineering Profession
Signal Processing and Analysis
Training
Training data
Europe
Speech recognition
Signal processing
Transformers
Robustness
Speech summarization
Transformer
data augmentation
extractive summarization

Language

ISSN

2076-1465

Abstract

This paper proposes a speech summarization system for spontaneous speech. The proposed system consists of speech segmentation, speech recognition, and extractive text summarization modules. We use the Transformer architecture for all modules; its self-attention mechanism captures both global and local context in the sequence, which enables high performance. Furthermore, we introduce a novel data augmentation method for speech summarization that uses the outputs of the speech segmentation and recognition modules. The proposed data augmentation addresses the ambiguity of sentence boundaries in spontaneous speech, improving robustness to speech segmentation and recognition errors. We conduct an experimental evaluation using the Corpus of Spontaneous Japanese, which consists of Japanese speech such as lectures and conference talks. Through this evaluation, we investigate the performance of each module and how the modules relate to one another in terms of text summarization performance, and we demonstrate the effectiveness of the proposed data augmentation method.
