학술논문

Distributed dialogue policies for multi-domain statistical dialogue management
Document Type
Conference
Source
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. :5371-5375 Apr, 2015
Subject
Fields, Waves and Electromagnetics
Training
Training data
Limiting
Databases
open-domain
multi-domain
dialogue systems
POMDP
Gaussian process
Language
ISSN
1520-6149
2379-190X
Abstract
Statistical dialogue systems offer the potential to reduce costs by learning policies automatically on-line, but are not designed to scale to large open-domains. This paper proposes a hierarchical distributed dialogue architecture in which policies are organised in a class hierarchy aligned to an underlying knowledge graph. This allows a system to be deployed using a modest amount of data to train a small set of generic policies. As further data is collected, generic policies can be adapted to give in-domain performance. Using Gaussian process-based reinforcement learning, it is shown that within this framework generic policies can be constructed which provide acceptable user performance, and better performance than can be obtained using under-trained domain specific policies. It is also shown that as sufficient in-domain data becomes available, it is possible to seamlessly improve performance, without subjecting users to unacceptable behaviour during the adaptation period and without limiting the final performance compared to policies trained from scratch.