TY - JOUR
T1 - Synergistic effects of adaptive reward and reinforcement learning rules on cooperation
AU - Wang, Lu
AU - Fan, Litong
AU - Zhang, Long
AU - Zou, Rongcheng
AU - Wang, Zhen
N1 - Publisher Copyright:
© 2023 The Author(s). Published by IOP Publishing Ltd on behalf of the Institute of Physics and Deutsche Physikalische Gesellschaft.
PY - 2023/7/1
Y1 - 2023/7/1
N2 - Cooperative behavior in multi-agent systems has been a focal point of research, particularly in the context of pairwise interaction games. While previous studies have successfully used reinforcement learning rules to explain and predict the behavior of agents in two-agent interactions, multi-agent interactions are more complex, and the impact of reward mechanisms on agent behavior is often overlooked. To address this gap, we propose a framework that combines the public goods game (PGG) with reinforcement learning and adaptive reward mechanisms to better capture decision-making behavior in multi-agent interactions. In that, PGG is adopted to reflect the decision-making behavior of multi-agent interactions, self-regarding Q-learning emphasizes an experience-based strategy update, and adaptive reward focuses on the adaptability. We are mainly concentrating on the synergistic effects of them. The simulations demonstrate that while self-regarding Q-learning fails to prevent the collapse of cooperation in the traditional PGG, the fraction of cooperation increases significantly when the adaptive reward strategy is included. Meanwhile, the theoretical analyses aligned with our simulation results, which revealed that there is a specific reward cost required to maximize the fraction of cooperation. Overall, this study provides a novel perspective on establishing cooperative reward mechanisms in social dilemmas and highlights the importance of considering adaptive reward mechanisms in multi-agent interactions.
AB - Cooperative behavior in multi-agent systems has been a focal point of research, particularly in the context of pairwise interaction games. While previous studies have successfully used reinforcement learning rules to explain and predict the behavior of agents in two-agent interactions, multi-agent interactions are more complex, and the impact of reward mechanisms on agent behavior is often overlooked. To address this gap, we propose a framework that combines the public goods game (PGG) with reinforcement learning and adaptive reward mechanisms to better capture decision-making behavior in multi-agent interactions. In that, PGG is adopted to reflect the decision-making behavior of multi-agent interactions, self-regarding Q-learning emphasizes an experience-based strategy update, and adaptive reward focuses on the adaptability. We are mainly concentrating on the synergistic effects of them. The simulations demonstrate that while self-regarding Q-learning fails to prevent the collapse of cooperation in the traditional PGG, the fraction of cooperation increases significantly when the adaptive reward strategy is included. Meanwhile, the theoretical analyses aligned with our simulation results, which revealed that there is a specific reward cost required to maximize the fraction of cooperation. Overall, this study provides a novel perspective on establishing cooperative reward mechanisms in social dilemmas and highlights the importance of considering adaptive reward mechanisms in multi-agent interactions.
KW - adaptive reward
KW - cooperation
KW - public goods game
KW - self-regarding Q-learning
UR - http://www.scopus.com/inward/record.url?scp=85165234171&partnerID=8YFLogxK
U2 - 10.1088/1367-2630/acd26e
DO - 10.1088/1367-2630/acd26e
M3 - 文章
AN - SCOPUS:85165234171
SN - 1367-2630
VL - 25
JO - New Journal of Physics
JF - New Journal of Physics
IS - 7
M1 - 073008
ER -