학술논문

Spontaneous Speech Summarization: Transformers All The Way Through
Document Type
Conference
Source
2021 29th European Signal Processing Conference (EUSIPCO) Signal Processing Conference (EUSIPCO), 2021 29th European. :456-460 Aug, 2021
Subject
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineering Profession
Signal Processing and Analysis
Training
Training data
Europe
Speech recognition
Signal processing
Transformers
Robustness
Speech summarization
Transformer
data augmentation
extractive summarization
Language
ISSN
2076-1465
Abstract
This paper proposes a speech summarization system for spontaneous speech. The proposed system consists of speech segmentation, speech recognition, and extractive text summarization modules. We utilize the Transformer architecture for all modules, enabling us to achieve outstanding performance by capturing global and local context information from the sequence thanks to the self-attention mechanism. Furthermore, we introduce a novel data augmentation method for speech summarization using the results of speech segmentation and recognition modules. The proposed data augmentation addresses each sentence boundary's ambiguity in spontaneous speech, making it possible to improve the robustness for speech segmentation and recognition errors. We conduct an experimental evaluation using the Corpus of Spontaneous Japanese, which consists of Japanese speech such as lecture and conference talks. Through the experimental evaluation, we investigate individual performance and each module's relationship in terms of text summarization performance and demonstrate the effectiveness of the proposed data augmentation method.