학술논문

Learning programs for decision and control

Document Type

Conference

Author

Source

2001 International Conferences on Info-Tech and Info-Net. Proceedings (Cat. No.01EX479) Info-tech and Info-net proceedings Info-tech and Info-net, 2001. Proceedings. ICII 2001 - Beijing. 2001 International Conferences on. 3:462-467 vol.3 2001

Subject

Computing and Processing
Communication, Networking and Broadcast Technologies
Dynamic programming
State-space methods
Learning systems
Control theory
Sampling methods
Signal generators
Neural networks
Heuristic algorithms
Testing
Control systems

Language

Abstract

Introduces learning programs, an approximate dynamic programming (ADP) or otherwise named neural dynamic programming (NDP) algorithm developed and tested by the authors. We first introduce the basic framework of our learning programs, the associated learning algorithms, and then extensive case studies to demonstrate the effectiveness of our learning programs. This is probably the first time that neural dynamic programming type of learning algorithms has been applied to complex, real life continuous state problems. Until now, reinforcement learning (another learning approach for approximate dynamic programming) has been mostly successful in discrete state space problems. On the other hand, prior NDP based approaches to controlling continuous state space systems have all been limited to smaller, or linearized, or decoupled problems. Therefore the work presented here compliments and advances the existing literature in the general area of learning approaches in approximate dynamic programming.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송