Academic Paper

Causally Correct Partial Models for Reinforcement Learning

Document Type

Working Paper

Author

Rezende, Danilo J.; Danihelka, Ivo; Papamakarios, George; Ke, Nan Rosemary; Jiang, Ray; Weber, Theophane; Gregor, Karol; Merzic, Hamza; Viola, Fabio; Wang, Jane; Mitrovic, Jovana; Besse, Frederic; Antonoglou, Ioannis; Buesing, Lars

Source

Subject

Computer Science - Machine Learning
Computer Science - Artificial Intelligence
Statistics - Machine Learning

Language

Abstract

In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this paper, we show that partial models can be causally incorrect: they are confounded by the observations they don't model, and can therefore lead to incorrect planning. To address this, we introduce a general family of partial models that are provably causally correct, yet remain fast because they do not need to fully model future observations.
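The confounding problem described in the abstract can be illustrated with a toy calculation. The sketch below is not the paper's method or environment. It is a hypothetical two-action setting, with invented names (`P_S`, `behavior_policy`, `reward`), in which the data-collecting policy sees a hidden observation `s` that the partial model does not. A model fit only on logged (action, reward) pairs learns E[r | a], while a planner needs E[r | do(a)], the expected reward when the action is forced regardless of `s`. The two quantities are computed exactly by enumeration.

```python
# Toy illustration (assumed setup, not taken from the paper):
# a hidden observation s confounds the action and the reward.

P_S = {0: 0.5, 1: 0.5}  # prior over the hidden observation s


def behavior_policy(a, s, eps=0.1):
    """P(a | s): the data-collecting policy sees s and mostly copies it."""
    return 1.0 - eps if a == s else eps


def reward(s, a):
    """Reward is 1 when the action matches the hidden observation."""
    return 1.0 if a == s else 0.0


def observational_reward(a):
    """E[r | a]: what a partial model trained on logged (a, r) pairs estimates."""
    num = sum(P_S[s] * behavior_policy(a, s) * reward(s, a) for s in P_S)
    den = sum(P_S[s] * behavior_policy(a, s) for s in P_S)
    return num / den


def interventional_reward(a):
    """E[r | do(a)]: expected reward when a planner forces a, ignoring s."""
    return sum(P_S[s] * reward(s, a) for s in P_S)


for a in (0, 1):
    print(f"a={a}: observational={observational_reward(a):.2f}, "
          f"interventional={interventional_reward(a):.2f}")
```

In this toy setting the observational estimate is noticeably more optimistic than the interventional one for both actions, so a planner that trusts the partial model would overestimate what it gains by choosing an action without access to `s`.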

Online Access

Open Access (arXiv)
