학술논문

Distributional semantics for understanding spoken meal descriptions

Document Type

Conference

Author

Korpusik, Mandy; Huang, Calvin; Price, Michael; Glass, James

Source

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on. :6070-6074 Mar, 2016

Subject

Signal Processing and Analysis
Semantics
Tagging
Prototypes
Speech recognition
Speech
Dairy products
Data models
CRF
Word vectors
Semantic tagging

Language

ISSN

2379-190X

Abstract

This paper presents ongoing language understanding experiments conducted as part of a larger effort to create a nutrition dialogue system that automatically extracts food concepts from a user's spoken meal description. We first discuss the technical approaches to understanding, including three methods for incorporating word vector features into conditional random field (CRF) models for semantic tagging, as well as classifiers for directly associating foods with properties. We report experiments on both text and spoken data from an in-domain speech recognizer. On text data, we show that the addition of word vector features significantly improves performance, achieving an F1 test score of 90.8 for semantic tagging and 86.3 for food-property association. On speech, the best model achieves an F1 test score of 87.5 for semantic tagging and 86.0 for association. Finally, we conduct an end-to-end system evaluation through a user study with human ratings of 83% semantic tagging accuracy.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송