Journal Article

Sample Efficient Reinforcement Learning Using Graph-Based Memory Reconstruction
Document Type
Periodical
Source
IEEE Transactions on Artificial Intelligence, 5(2):751-762, Feb. 2024
Subject
Computing and Processing
Memory management
Task analysis
Neuroscience
Brain modeling
Games
Automation
Writing
Experience replay (ER)
graph model
memory reconstruction
reinforcement learning (RL)
sample efficiency
Language
English
ISSN
2691-4581
Abstract
Reinforcement learning (RL) algorithms typically require orders of magnitude more interactions than humans to learn effective policies. Research on memory in neuroscience suggests that humans' learning efficiency benefits from associating their experiences and reconstructing potential events. Inspired by this finding, we introduce a human-brain-like memory structure for agents and build a general learning framework on this structure to improve RL sample efficiency. Because this framework resembles the memory reconstruction process in psychology, we name the newly proposed RL framework graph-based memory reconstruction (GBMR). In particular, GBMR first maintains an attribute graph over the agent's memory and then retrieves its critical nodes to build and update potential paths among these nodes. This novel pipeline drives the RL agent to learn faster with its memory-enhanced value functions and reduces interactions with the environment by reconstructing valuable paths. Extensive experimental analyses and evaluations in the grid maze and several challenging Atari environments demonstrate GBMR's superiority over traditional RL methods. We will release the source code and trained models to facilitate further studies in this research direction.
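The pipeline the abstract describes (maintain a graph over the agent's memory, retrieve critical nodes, propagate value along reconstructed paths) can be sketched roughly as below. This is a minimal illustration under assumed simplifications (tabular states, a max-style value backup), not the paper's implementation; all names here (GraphMemory, add_transition, critical_nodes, reconstruct_values) are hypothetical.

import networkx as nx


class GraphMemory:
    """Directed graph over visited states; edges carry actions and rewards."""

    def __init__(self, gamma=0.99):
        self.g = nx.DiGraph()
        self.gamma = gamma  # discount factor used when backing up values

    def add_transition(self, s, a, r, s_next):
        """Record one environment transition (s, a, r, s') as a directed edge."""
        self.g.add_edge(s, s_next, action=a, reward=r)
        self.g.nodes[s].setdefault("value", 0.0)
        self.g.nodes[s_next].setdefault("value", 0.0)

    def critical_nodes(self, k=10):
        """Retrieve the k highest-value nodes to anchor path reconstruction."""
        by_value = sorted(self.g.nodes,
                          key=lambda n: self.g.nodes[n]["value"],
                          reverse=True)
        return by_value[:k]

    def reconstruct_values(self, sweeps=3):
        """Propagate value backward along stored edges, so states that lead
        to valuable states inherit a discounted estimate of that value."""
        for _ in range(sweeps):
            for s, s_next, data in self.g.edges(data=True):
                backup = data["reward"] + self.gamma * self.g.nodes[s_next]["value"]
                if backup > self.g.nodes[s]["value"]:
                    self.g.nodes[s]["value"] = backup


# Usage: a two-step path to a rewarding state.
mem = GraphMemory()
mem.add_transition("s0", "right", 0.0, "s1")
mem.add_transition("s1", "right", 1.0, "goal")
mem.reconstruct_values()
# mem.g.nodes["s0"]["value"] is ~0.99 after the second sweep.

A memory-enhanced value function could then combine such a graph value with the learned Q-value (for example, by taking the maximum of the two as the update target), which is one common way episodic memories are used alongside parametric RL; whether GBMR does exactly this is not specified in the abstract.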