학술논문

Learning, exploitation and bias in games.
Document Type
Article
Source
PLoS ONE. 2/5/2021, Vol. 16 Issue 2, p1-14. 14p.
Subject
*GAMIFICATION
*GAMES
*REINFORCEMENT learning
*INDIVIDUAL development
*VIRTUAL communities
Language
ISSN
1932-6203
Abstract
We focus on learning during development in a group of individuals that play a competitive game with each other. The game has two actions and there is negative frequency dependence. We define the distribution of actions by group members to be an equilibrium configuration if no individual can improve its payoff by unilaterally changing its action. We show that at this equilibrium, one action is preferred in the sense that those taking the preferred action have a higher payoff than those taking the other, more prosocial, action. We explore the consequences of a simple 'unbiased' reinforcement learning rule during development, showing that groups reach an approximate equilibrium distribution, so that some achieve a higher payoff than others. Because there is learning, an individual's behaviour can influence the future behaviour of others. We show that, as a consequence, there is the potential for an individual to exploit others by influencing them to be the ones to take the non-preferred action. Using an evolutionary simulation, we show that population members can avoid being exploited by over-valuing rewards obtained from the preferred option during learning, an example of a bias that is 'rational'. [ABSTRACT FROM AUTHOR]