학술논문

Learning programs for decision and control
Document Type
Conference
Source
2001 International Conferences on Info-Tech and Info-Net. Proceedings (Cat. No.01EX479) Info-tech and Info-net proceedings Info-tech and Info-net, 2001. Proceedings. ICII 2001 - Beijing. 2001 International Conferences on. 3:462-467 vol.3 2001
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
Dynamic programming
State-space methods
Learning systems
Control theory
Sampling methods
Signal generators
Neural networks
Heuristic algorithms
Testing
Control systems
Language
Abstract
Introduces learning programs, an approximate dynamic programming (ADP) or otherwise named neural dynamic programming (NDP) algorithm developed and tested by the authors. We first introduce the basic framework of our learning programs, the associated learning algorithms, and then extensive case studies to demonstrate the effectiveness of our learning programs. This is probably the first time that neural dynamic programming type of learning algorithms has been applied to complex, real life continuous state problems. Until now, reinforcement learning (another learning approach for approximate dynamic programming) has been mostly successful in discrete state space problems. On the other hand, prior NDP based approaches to controlling continuous state space systems have all been limited to smaller, or linearized, or decoupled problems. Therefore the work presented here compliments and advances the existing literature in the general area of learning approaches in approximate dynamic programming.