학술논문

BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning

Document Type

Conference

Author

Ivanovic, Boris; Harrison, James; Sharma, Apoorva; Chen, Mo; Pavone, Marco

Source

2019 International Conference on Robotics and Automation (ICRA) Robotics and Automation (ICRA), 2019 International Conference on. :15-21 May, 2019

Subject

Robotics and Control Systems
Robots
Task analysis
Training
Computational modeling
Heuristic algorithms
Complexity theory
Approximation algorithms

Language

ISSN

2577-087X

Abstract

Model-free Reinforcement Learning (RL) offers an attractive approach to learn control policies for high dimensional systems, but its relatively poor sample complexity often necessitates training in simulated environments. Even in simulation, goal-directed tasks whose natural reward function is sparse remain intractable for state-of-the-art model-free algorithms for continuous control. The bottleneck in these tasks is the prohibitive amount of exploration required to obtain a learning signal from the initial state of the system. In this work, we leverage physical priors in the form of an approximate system dynamics model to design a curriculum for a model-free policy optimization algorithm. Our Backward Reachability Curriculum (BaRC) begins policy training from states that require a small number of actions to accomplish the task, and expands the initial state distribution backwards in a dynamically-consistent manner once the policy optimization algorithm demonstrates sufficient performance. BaRC is general, in that it can accelerate training of any model-free RL algorithm on a broad class of goal-directed continuous control MDPs. Its curriculum strategy is physically intuitive, easy-to-tune, and allows incorporating physical priors to accelerate training without hindering the performance, flexibility, and applicability of the model-free RL algorithm. We evaluate our approach on two representative dynamic robotic learning problems and find substantial performance improvement relative to previous curriculum generation techniques and naive exploration strategies.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송