학술논문

LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

Document Type

Conference

Author

Hoque, Ryan; Balakrishna, Ashwin; Putterman, Carl; Luo, Michael; Brown, Daniel S.; Seita, Daniel; Thananjeyan, Brijen; Novoseller, Ellen; Goldberg, Ken

Source

2021 IEEE 17th International Conference on Automation Science and Engineering (CASE) Automation Science and Engineering (CASE), 2021 IEEE 17th International Conference on. :502-509 Aug, 2021

Subject

Robotics and Control Systems
Limiting
Computer aided software engineering
Conferences
Switches
Control systems
Robustness
Fabrics

Language

ISSN

2161-8089

Abstract

Corrective interventions while a robot is learning to automate a task provide an intuitive method for a human supervisor to assist the robot and convey information about desired behavior. However, these interventions can impose significant burden on a human supervisor, as each intervention interrupts other work the human is doing, incurs latency with each context switch between supervisor and autonomous control, and requires time to perform. We present LazyDAgger, which extends the interactive imitation learning (IL) algorithm SafeDAgger to reduce context switches between supervisor and autonomous control. We find that LazyDAgger improves the performance and robustness of the learned policy during both learning and execution while limiting burden on the supervisor. Simulation experiments suggest that LazyDAgger can reduce context switches by an average of 60% over SafeDAgger on 3 continuous control tasks while maintaining state-of-the-art policy performance. In physical fabric manipulation experiments with an ABB YuMi robot, LazyDAgger reduces context switches by 60% while achieving a 60% higher success rate than SafeDAgger at execution time.

Online Access

Full Text (IEEE) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송