Academic Paper

Playing the Lottery of a Lifetime: The Effect of Socially Induced Aspiration on Q-Learning Agents

Document Type

article

Author

Hatekar, Yosi; Dubey, Rachit; Sumers, Ted; Sucholutsky, Ilia

Source

Proceedings of the Annual Meeting of the Cognitive Science Society. 44(44)

Subject

Artificial Intelligence
Psychology
Decision making
Machine learning
Agent-based Modeling

Language

Abstract

Our aspirations are influenced by the rewards obtained by people around us. How adaptive are these inherited aspirations in stochastic, lottery-like environments? We study the behavior of social Q-learning agents in two multi-armed bandit (MAB) settings: 1) a standard task where one arm gives a higher reward than the others, and 2) a lottery task where all arms give a high reward with some small probability. We define aspiration as a function of rewards attained by a previous generation, and happiness as a linear combination of rewards and aspiration. We find that in the standard MAB task, higher aspiration encourages exploration, and agents who learn from the ‘top’ agents accumulate more rewards and happiness. However, in the lottery task, higher aspiration doesn’t improve performance; instead, agents who learn from the ‘top’ agents are more unhappy. Together, this research highlights the context-dependent nature of aspirations and their implications for modern society.
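
The setup described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not give the exact aspiration function, the happiness coefficients, or the exploration rule. The sketch assumes aspiration is the mean per-step reward of the top fraction of a previous generation, happiness is reward minus aspiration, and an epsilon-greedy agent updates its Q-values from happiness. The names `aspiration_from_previous_generation` and `run_bandit` and all parameters are hypothetical.

```python
import random


def aspiration_from_previous_generation(prev_rewards, top_fraction=1.0):
    """Assumed aspiration: mean per-step reward of the top fraction of earlier agents."""
    ranked = sorted(prev_rewards, reverse=True)
    k = max(1, int(len(ranked) * top_fraction))
    return sum(ranked[:k]) / k


def run_bandit(arm_probs, arm_rewards, aspiration,
               steps=1000, alpha=0.1, epsilon=0.1, seed=0):
    """Epsilon-greedy Q-learning on a bandit; returns (total reward, total happiness)."""
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    q = [0.0] * n_arms
    total_reward = 0.0
    total_happiness = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore uniformly at random
        else:
            best = max(q)
            # Exploit, breaking ties between equally valued arms at random.
            arm = rng.choice([i for i, v in enumerate(q) if v == best])
        # Arm pays arm_rewards[arm] with probability arm_probs[arm], else 0.
        reward = arm_rewards[arm] if rng.random() < arm_probs[arm] else 0.0
        # Assumed happiness: a linear combination with weights +1 and -1.
        happiness = reward - aspiration
        # Assumption: the agent learns from happiness rather than raw reward.
        q[arm] += alpha * (happiness - q[arm])
        total_reward += reward
        total_happiness += happiness
    return total_reward, total_happiness


if __name__ == "__main__":
    # Hypothetical per-step rewards achieved by a previous generation.
    previous = [0.2, 0.4, 0.6, 0.8]
    top_aspiration = aspiration_from_previous_generation(previous, top_fraction=0.25)
    # Standard task: one arm pays more than the others.
    print(run_bandit([1.0, 1.0, 1.0], [0.2, 0.2, 1.0], top_aspiration))
    # Lottery task: every arm pays a high reward with small probability.
    print(run_bandit([0.01, 0.01, 0.01], [10.0, 10.0, 10.0], top_aspiration))
```

Under these assumptions, an arm whose payoff falls short of the aspiration gets a negative Q-value, so untried arms with their initial value of zero look more attractive. That is one possible route from aspiration to exploration, and it is not necessarily the mechanism studied in the paper.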

Online Access

Open Access (eScholarship)
