Academic Paper

Malthusian Reinforcement Learning

Document Type

Conference

Author

Leibo, Joel Z.; Perolat, Julien; Hughes, Edward; Wheelwright, Steven; Marblestone, Adam H.; Duéñez-Guzmán, Edgar; Sunehag, Peter; Dunning, Iain; Graepel, Thore

Source

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pp. 1099-1107

Subject

adaptive radiation
artificial general intelligence
demography
evolution
intrinsic motivation

Language

English

Abstract

Here we explore a new algorithmic framework for multi-agent reinforcement learning, called Malthusian reinforcement learning, which extends self-play to include fitness-linked population size dynamics that drive ongoing innovation. In Malthusian RL, increases in a subpopulation's average return drive subsequent increases in its size, just as Thomas Malthus argued in 1798 was the relationship between preindustrial income levels and population growth. Malthusian reinforcement learning harnesses the competitive pressures arising from growing and shrinking population size to drive agents to explore regions of state and policy spaces that they could not otherwise reach. Furthermore, in environments where there are potential gains from specialization and division of labor, we show that Malthusian reinforcement learning is better positioned to take advantage of such synergies than algorithms based on self-play.
