학술논문

Search space reduction for strategy learning in sequential decision processes

Document Type

Conference

Author

Schoknecht, R.; Spott, M.; Liekweg, F.; Riedmiller, M.

Source

ICONIP'99. ANZIIS'99 & ANNES'99 & ACNN'99. 6th International Conference on Neural Information Processing. Proceedings (Cat. No.99EX378) Neural information processing Neural Information Processing, 1999. Proceedings. ICONIP '99. 6th International Conference on. 1:148-153 vol.1 1999

Subject

Computing and Processing
Components, Circuits, Devices and Systems
Signal Processing and Analysis
Control systems
Nonlinear control systems
Learning
Convergence
Adaptive control
Analytical models
Decision making
Operations research
Delay
State estimation

Language

Abstract

Sequential decision making in large domains requires high computational expense. With the classical dynamic programming approach, a rising problem size soon leads to intractability because of time and memory constraints. This situation can be significantly remedied by using more advanced reinforcement learning techniques in combination with generalizing function approximators. However, this may lead to unstable learning behaviour as the strict convergence results are no longer valid. The paper presents an approach to stabilize learning by gradually reducing the search space for the optimal decision policy. This is done by iteratively adapting the action set according to the progress of learning. Experiments are described within the FYNESSE control architecture that is a framework for autonomously learning adaptive control strategies.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송