Lévy noise promotes cooperation in the prisoner’s dilemma game with reinforcement learning

Lu Wang; Danyang Jia; Long Zhang; Peican Zhu; Matjaž Perc; Lei Shi; Zhen Wang

doi:10.1007/s11071-022-07289-7

Lévy noise promotes cooperation in the prisoner’s dilemma game with reinforcement learning

Lu Wang, Danyang Jia, Long Zhang, Peican Zhu, Matjaž Perc, Lei Shi, Zhen Wang

科研成果: 期刊稿件 › 文章 › 同行评审

53 引用（Scopus）

摘要

Uncertainties are ubiquitous in everyday life, and it is thus important to explore their effects on the evolution of cooperation. In this paper, the prisoner’s dilemma game with reinforcement learning subject to Lévy noise is studied. Specifically, diverse fluctuations mimicked by Lévy distributed noise are reflected in the payoff matrix of each player. At the same time, the self-regarding Q-learning algorithm is considered as the strategy update rule to learn the behavior that achieves the highest payoff. The results show that not only does Lévy noise promote the evolution of cooperation with reinforcement learning, it does so comparatively better than Gaussian noise. We explain this with the iterative updating pattern of the self-regarding Q-learning algorithm, which has an accumulative effect on the noise entering the payoff matrix. It turns out that under Lévy noise, the Q-value of cooperative behavior becomes significantly larger than that of defective behavior when the current strategy is defection, which ultimately leads to the prevalence of cooperation, while this is absent with Gaussian noise or without noise. This research thus unveils a particular positive role of Lévy noise in the evolutionary dynamics of social dilemmas.

源语言	英语
页（从-至）	1837-1845
页数	9
期刊	Nonlinear Dynamics
卷	108
期	2
DOI	https://doi.org/10.1007/s11071-022-07289-7
出版状态	已出版 - 4月 2022

访问文件

10.1007/s11071-022-07289-7

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{436e935745974beea80fbc18421d0d7b,

title = "L{\'e}vy noise promotes cooperation in the prisoner{\textquoteright}s dilemma game with reinforcement learning",

abstract = "Uncertainties are ubiquitous in everyday life, and it is thus important to explore their effects on the evolution of cooperation. In this paper, the prisoner{\textquoteright}s dilemma game with reinforcement learning subject to L{\'e}vy noise is studied. Specifically, diverse fluctuations mimicked by L{\'e}vy distributed noise are reflected in the payoff matrix of each player. At the same time, the self-regarding Q-learning algorithm is considered as the strategy update rule to learn the behavior that achieves the highest payoff. The results show that not only does L{\'e}vy noise promote the evolution of cooperation with reinforcement learning, it does so comparatively better than Gaussian noise. We explain this with the iterative updating pattern of the self-regarding Q-learning algorithm, which has an accumulative effect on the noise entering the payoff matrix. It turns out that under L{\'e}vy noise, the Q-value of cooperative behavior becomes significantly larger than that of defective behavior when the current strategy is defection, which ultimately leads to the prevalence of cooperation, while this is absent with Gaussian noise or without noise. This research thus unveils a particular positive role of L{\'e}vy noise in the evolutionary dynamics of social dilemmas.",

keywords = "Cooperation, Evolutionary dynamics, L{\'e}vy noise, Prisoner{\textquoteright}s dilemma, Self-regarding Q-learning",

author = "Lu Wang and Danyang Jia and Long Zhang and Peican Zhu and Matja{\v z} Perc and Lei Shi and Zhen Wang",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s), under exclusive licence to Springer Nature B.V.",

year = "2022",

month = apr,

doi = "10.1007/s11071-022-07289-7",

language = "英语",

volume = "108",

pages = "1837--1845",

journal = "Nonlinear Dynamics",

issn = "0924-090X",

publisher = "Springer Netherlands",

number = "2",

}

TY - JOUR

T1 - Lévy noise promotes cooperation in the prisoner’s dilemma game with reinforcement learning

AU - Wang, Lu

AU - Jia, Danyang

AU - Zhang, Long

AU - Zhu, Peican

AU - Perc, Matjaž

AU - Shi, Lei

AU - Wang, Zhen

PY - 2022/4

Y1 - 2022/4

N2 - Uncertainties are ubiquitous in everyday life, and it is thus important to explore their effects on the evolution of cooperation. In this paper, the prisoner’s dilemma game with reinforcement learning subject to Lévy noise is studied. Specifically, diverse fluctuations mimicked by Lévy distributed noise are reflected in the payoff matrix of each player. At the same time, the self-regarding Q-learning algorithm is considered as the strategy update rule to learn the behavior that achieves the highest payoff. The results show that not only does Lévy noise promote the evolution of cooperation with reinforcement learning, it does so comparatively better than Gaussian noise. We explain this with the iterative updating pattern of the self-regarding Q-learning algorithm, which has an accumulative effect on the noise entering the payoff matrix. It turns out that under Lévy noise, the Q-value of cooperative behavior becomes significantly larger than that of defective behavior when the current strategy is defection, which ultimately leads to the prevalence of cooperation, while this is absent with Gaussian noise or without noise. This research thus unveils a particular positive role of Lévy noise in the evolutionary dynamics of social dilemmas.

AB - Uncertainties are ubiquitous in everyday life, and it is thus important to explore their effects on the evolution of cooperation. In this paper, the prisoner’s dilemma game with reinforcement learning subject to Lévy noise is studied. Specifically, diverse fluctuations mimicked by Lévy distributed noise are reflected in the payoff matrix of each player. At the same time, the self-regarding Q-learning algorithm is considered as the strategy update rule to learn the behavior that achieves the highest payoff. The results show that not only does Lévy noise promote the evolution of cooperation with reinforcement learning, it does so comparatively better than Gaussian noise. We explain this with the iterative updating pattern of the self-regarding Q-learning algorithm, which has an accumulative effect on the noise entering the payoff matrix. It turns out that under Lévy noise, the Q-value of cooperative behavior becomes significantly larger than that of defective behavior when the current strategy is defection, which ultimately leads to the prevalence of cooperation, while this is absent with Gaussian noise or without noise. This research thus unveils a particular positive role of Lévy noise in the evolutionary dynamics of social dilemmas.

KW - Cooperation

KW - Evolutionary dynamics

KW - Lévy noise

KW - Prisoner’s dilemma

KW - Self-regarding Q-learning

UR - http://www.scopus.com/inward/record.url?scp=85125944891&partnerID=8YFLogxK

U2 - 10.1007/s11071-022-07289-7

DO - 10.1007/s11071-022-07289-7

M3 - 文章

AN - SCOPUS:85125944891

SN - 0924-090X

VL - 108

SP - 1837

EP - 1845

JO - Nonlinear Dynamics

JF - Nonlinear Dynamics

IS - 2

ER -

Lévy noise promotes cooperation in the prisoner’s dilemma game with reinforcement learning

摘要

访问文件

其它文件与链接

指纹

引用此