Driving key nodes to learn cooperation in social dilemma

Litong Fan; Hao Guo; Dengxiu Yu; Bowen Xu; Zhen Wang

doi:10.1007/s11071-024-10376-6

Driving key nodes to learn cooperation in social dilemma

Litong Fan, Hao Guo, Dengxiu Yu, Bowen Xu, Zhen Wang

School of Artificial Intelligence, OPtics and Electronics

Research output: Contribution to journal › Article › peer-review

Abstract

This paper presents a method that drives key nodes with minimal control cost to influence the agents in social dilemmas to learn to cooperate. Cooperation exists widely in human society and nature. Discovering the mechanisms that promote the evolution of cooperation has always been a concern across disciplines. In this context, continuous action social dilemma games are a fitting model to explore individual interactions and the spread of cooperative behaviors in social networks. This paper proposes a novel framework that applies optimal control theory to steer the evolution of cooperation within these games. Existing research lacks a definitive method for judicious selecting critical nodes capable of guiding the entire system toward a desired state with minimal cost. To bridge this gap, our framework offers a control mechanism that influences cooperative behavior and determines the optimal number and identity of nodes to be controlled, thereby maximizing outcomes. Building upon continuous action social dilemma games in social networks, we formulate a set of coupled Hamilton-Jacobi-Bellman (HJB) equations, employ a value iteration reinforcement learning (RL) algorithm to solve for the HJB function, and demonstrate the convergence of the proposed algorithm. Furthermore, we conduct a comprehensive qualitative and quantitative investigation into the selection of controlled nodes across diverse networks, focusing on optimizing control performance while minimizing associated costs. Our research substantiates the efficacy of the proposed algorithm across a spectrum of social networks, affirming its utility and potential impact in promoting the evolution of cooperative behavior.

Original language	English
Journal	Nonlinear Dynamics
DOIs	https://doi.org/10.1007/s11071-024-10376-6
State	Accepted/In press - 2024

Keywords

Continuous action
Cooperative behavior
Hamilton-Jacobi-Bellman (HJB)
Optimal control theory
Reinforcement learning (RL)
Social networks

Access to Document

10.1007/s11071-024-10376-6

Cite this

@article{e8b6c8d64a574e3090c6200e7c090d6d,

title = "Driving key nodes to learn cooperation in social dilemma",

abstract = "This paper presents a method that drives key nodes with minimal control cost to influence the agents in social dilemmas to learn to cooperate. Cooperation exists widely in human society and nature. Discovering the mechanisms that promote the evolution of cooperation has always been a concern across disciplines. In this context, continuous action social dilemma games are a fitting model to explore individual interactions and the spread of cooperative behaviors in social networks. This paper proposes a novel framework that applies optimal control theory to steer the evolution of cooperation within these games. Existing research lacks a definitive method for judicious selecting critical nodes capable of guiding the entire system toward a desired state with minimal cost. To bridge this gap, our framework offers a control mechanism that influences cooperative behavior and determines the optimal number and identity of nodes to be controlled, thereby maximizing outcomes. Building upon continuous action social dilemma games in social networks, we formulate a set of coupled Hamilton-Jacobi-Bellman (HJB) equations, employ a value iteration reinforcement learning (RL) algorithm to solve for the HJB function, and demonstrate the convergence of the proposed algorithm. Furthermore, we conduct a comprehensive qualitative and quantitative investigation into the selection of controlled nodes across diverse networks, focusing on optimizing control performance while minimizing associated costs. Our research substantiates the efficacy of the proposed algorithm across a spectrum of social networks, affirming its utility and potential impact in promoting the evolution of cooperative behavior.",

keywords = "Continuous action, Cooperative behavior, Hamilton-Jacobi-Bellman (HJB), Optimal control theory, Reinforcement learning (RL), Social networks",

author = "Litong Fan and Hao Guo and Dengxiu Yu and Bowen Xu and Zhen Wang",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive licence to Springer Nature B.V. 2024.",

year = "2024",

doi = "10.1007/s11071-024-10376-6",

language = "英语",

journal = "Nonlinear Dynamics",

issn = "0924-090X",

publisher = "Springer Netherlands",

}

TY - JOUR

T1 - Driving key nodes to learn cooperation in social dilemma

AU - Fan, Litong

AU - Guo, Hao

AU - Yu, Dengxiu

AU - Xu, Bowen

AU - Wang, Zhen

N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer Nature B.V. 2024.

PY - 2024

Y1 - 2024

N2 - This paper presents a method that drives key nodes with minimal control cost to influence the agents in social dilemmas to learn to cooperate. Cooperation exists widely in human society and nature. Discovering the mechanisms that promote the evolution of cooperation has always been a concern across disciplines. In this context, continuous action social dilemma games are a fitting model to explore individual interactions and the spread of cooperative behaviors in social networks. This paper proposes a novel framework that applies optimal control theory to steer the evolution of cooperation within these games. Existing research lacks a definitive method for judicious selecting critical nodes capable of guiding the entire system toward a desired state with minimal cost. To bridge this gap, our framework offers a control mechanism that influences cooperative behavior and determines the optimal number and identity of nodes to be controlled, thereby maximizing outcomes. Building upon continuous action social dilemma games in social networks, we formulate a set of coupled Hamilton-Jacobi-Bellman (HJB) equations, employ a value iteration reinforcement learning (RL) algorithm to solve for the HJB function, and demonstrate the convergence of the proposed algorithm. Furthermore, we conduct a comprehensive qualitative and quantitative investigation into the selection of controlled nodes across diverse networks, focusing on optimizing control performance while minimizing associated costs. Our research substantiates the efficacy of the proposed algorithm across a spectrum of social networks, affirming its utility and potential impact in promoting the evolution of cooperative behavior.

AB - This paper presents a method that drives key nodes with minimal control cost to influence the agents in social dilemmas to learn to cooperate. Cooperation exists widely in human society and nature. Discovering the mechanisms that promote the evolution of cooperation has always been a concern across disciplines. In this context, continuous action social dilemma games are a fitting model to explore individual interactions and the spread of cooperative behaviors in social networks. This paper proposes a novel framework that applies optimal control theory to steer the evolution of cooperation within these games. Existing research lacks a definitive method for judicious selecting critical nodes capable of guiding the entire system toward a desired state with minimal cost. To bridge this gap, our framework offers a control mechanism that influences cooperative behavior and determines the optimal number and identity of nodes to be controlled, thereby maximizing outcomes. Building upon continuous action social dilemma games in social networks, we formulate a set of coupled Hamilton-Jacobi-Bellman (HJB) equations, employ a value iteration reinforcement learning (RL) algorithm to solve for the HJB function, and demonstrate the convergence of the proposed algorithm. Furthermore, we conduct a comprehensive qualitative and quantitative investigation into the selection of controlled nodes across diverse networks, focusing on optimizing control performance while minimizing associated costs. Our research substantiates the efficacy of the proposed algorithm across a spectrum of social networks, affirming its utility and potential impact in promoting the evolution of cooperative behavior.

KW - Continuous action

KW - Cooperative behavior

KW - Hamilton-Jacobi-Bellman (HJB)

KW - Optimal control theory

KW - Reinforcement learning (RL)

KW - Social networks

UR - http://www.scopus.com/inward/record.url?scp=85208175217&partnerID=8YFLogxK

U2 - 10.1007/s11071-024-10376-6

DO - 10.1007/s11071-024-10376-6

M3 - 文章

AN - SCOPUS:85208175217

SN - 0924-090X

JO - Nonlinear Dynamics

JF - Nonlinear Dynamics

ER -

Driving key nodes to learn cooperation in social dilemma

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this