학술논문

Approaches to phoneme-based topic spotting: an experimental comparison

Document Type

Conference

Author

Source

1997 IEEE International Conference on Acoustics, Speech, and Signal Processing Acoustics, speech, and signal processing Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on. 3:1819-1822 vol.3 1997

Subject

Signal Processing and Analysis
Components, Circuits, Devices and Systems
Training data
Reconnaissance
Frequency
Dynamic programming
Heuristic algorithms
Laboratories
Educational institutions
Vocabulary
Data mining
Clustering algorithms

Language

ISSN

1520-6149

Abstract

Topic spotting is often performed on the output of a large vocabulary recognizer or a keyword spotter. However, this requires detailed knowledge about the vocabulary, and transcribed training data. If portability to new topics and languages is important, then a topic spotter based on phoneme recognition is preferable. A phoneme recognizer is run on training data consisting of audio files labeled by topic alone-no word transcripts are required. Phoneme sub-sequences which help to predict the topic are then extracted automatically. The work described was carried out by two teams exploring three very different approaches to phoneme-based topic spotting: the "DP-ngram", the "decision tree", and the "Euclidean" approach. Results obtained by each team on the ARM (Airborne Reconnaissance Mission) and Switchboard data sets were compared by means of receiver operating characteristic (ROC) curves. The best performance for each team was obtained via a similar type of discriminative training.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송