Complex Network Optimization for Fixed-Time Continuous Action Iteration Dilemma by Using Reinforcement Learning

Zhanxiao Jia; Dengxiu Yu; Zhen Wang; C. L.Philip Chen; Xuelong Li

doi:10.1109/TNSE.2024.3384509

Complex Network Optimization for Fixed-Time Continuous Action Iteration Dilemma by Using Reinforcement Learning

Zhanxiao Jia, Dengxiu Yu, Zhen Wang, C. L.Philip Chen, Xuelong Li

光电与智能研究院

科研成果: 期刊稿件 › 文章 › 同行评审

6 引用（Scopus）

摘要

In this paper, an optimization algorithm based on deep reinforcement learning is proposed to optimize complex networks in fixed-time convergence of continuous action iteration dilemmas. The field of continuous action iterative dilemmas has long been studied, with prior research primarily emphasizing the effectiveness of strategy selection and the stability of strategy evolution. However, the impact of topology on strategy evolution has remained under-explored. The present study fills this gap by examining how the structure of complex networks influences the time required for players to reach Nash Equilibrium and overall payoff. To identify the optimal complex network that ensures fixed-time convergence of continuous action iteration dilemma, achieves the shortest time, and attains the highest overall payoff in the Nash Equilibrium state, a deep reinforcement learning algorithm is designed to optimize the complex network. Firstly, the paper applies the Lyapunov stability theory to analyze the convergence of the fixed-time continuous action iteration dilemma and compute the upper bound of convergence time. Secondly, based on the fixed-time convergence of continuous action iteration dilemma, we establish evaluation criteria based on the time taken by players to reach the Nash Equilibrium and the overall payoff, subsequently designing evaluation functions for complex networks utilizing these criteria. Thirdly, this paper applies a deep reinforcement learning algorithm to resolve the optimization issue associated with the proposed evaluation function, while analyzing the convergence of complex network optimization methods. Lastly, the effectiveness of the proposed method is verified by simulating the dynamic model of snowdrift games and prisoner dilemmas.

源语言	英语
页（从-至）	3771-3781
页数	11
期刊	IEEE Transactions on Network Science and Engineering
卷	11
期	4
DOI	https://doi.org/10.1109/TNSE.2024.3384509
出版状态	已出版 - 1 7月 2024

访问文件

10.1109/TNSE.2024.3384509

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{3ab37459a4074b69b234f2e03cd8e95c,

title = "Complex Network Optimization for Fixed-Time Continuous Action Iteration Dilemma by Using Reinforcement Learning",

abstract = "In this paper, an optimization algorithm based on deep reinforcement learning is proposed to optimize complex networks in fixed-time convergence of continuous action iteration dilemmas. The field of continuous action iterative dilemmas has long been studied, with prior research primarily emphasizing the effectiveness of strategy selection and the stability of strategy evolution. However, the impact of topology on strategy evolution has remained under-explored. The present study fills this gap by examining how the structure of complex networks influences the time required for players to reach Nash Equilibrium and overall payoff. To identify the optimal complex network that ensures fixed-time convergence of continuous action iteration dilemma, achieves the shortest time, and attains the highest overall payoff in the Nash Equilibrium state, a deep reinforcement learning algorithm is designed to optimize the complex network. Firstly, the paper applies the Lyapunov stability theory to analyze the convergence of the fixed-time continuous action iteration dilemma and compute the upper bound of convergence time. Secondly, based on the fixed-time convergence of continuous action iteration dilemma, we establish evaluation criteria based on the time taken by players to reach the Nash Equilibrium and the overall payoff, subsequently designing evaluation functions for complex networks utilizing these criteria. Thirdly, this paper applies a deep reinforcement learning algorithm to resolve the optimization issue associated with the proposed evaluation function, while analyzing the convergence of complex network optimization methods. Lastly, the effectiveness of the proposed method is verified by simulating the dynamic model of snowdrift games and prisoner dilemmas.",

keywords = "continuous action iteration dilemma, deep reinforcement learning, fixed-time, Optimal complex network",

author = "Zhanxiao Jia and Dengxiu Yu and Zhen Wang and Chen, {C. L.Philip} and Xuelong Li",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2024",

month = jul,

day = "1",

doi = "10.1109/TNSE.2024.3384509",

language = "英语",

volume = "11",

pages = "3771--3781",

journal = "IEEE Transactions on Network Science and Engineering",

issn = "2327-4697",

publisher = "IEEE Computer Society",

number = "4",

}

TY - JOUR

T1 - Complex Network Optimization for Fixed-Time Continuous Action Iteration Dilemma by Using Reinforcement Learning

AU - Jia, Zhanxiao

AU - Yu, Dengxiu

AU - Wang, Zhen

AU - Chen, C. L.Philip

AU - Li, Xuelong

PY - 2024/7/1

Y1 - 2024/7/1

N2 - In this paper, an optimization algorithm based on deep reinforcement learning is proposed to optimize complex networks in fixed-time convergence of continuous action iteration dilemmas. The field of continuous action iterative dilemmas has long been studied, with prior research primarily emphasizing the effectiveness of strategy selection and the stability of strategy evolution. However, the impact of topology on strategy evolution has remained under-explored. The present study fills this gap by examining how the structure of complex networks influences the time required for players to reach Nash Equilibrium and overall payoff. To identify the optimal complex network that ensures fixed-time convergence of continuous action iteration dilemma, achieves the shortest time, and attains the highest overall payoff in the Nash Equilibrium state, a deep reinforcement learning algorithm is designed to optimize the complex network. Firstly, the paper applies the Lyapunov stability theory to analyze the convergence of the fixed-time continuous action iteration dilemma and compute the upper bound of convergence time. Secondly, based on the fixed-time convergence of continuous action iteration dilemma, we establish evaluation criteria based on the time taken by players to reach the Nash Equilibrium and the overall payoff, subsequently designing evaluation functions for complex networks utilizing these criteria. Thirdly, this paper applies a deep reinforcement learning algorithm to resolve the optimization issue associated with the proposed evaluation function, while analyzing the convergence of complex network optimization methods. Lastly, the effectiveness of the proposed method is verified by simulating the dynamic model of snowdrift games and prisoner dilemmas.

AB - In this paper, an optimization algorithm based on deep reinforcement learning is proposed to optimize complex networks in fixed-time convergence of continuous action iteration dilemmas. The field of continuous action iterative dilemmas has long been studied, with prior research primarily emphasizing the effectiveness of strategy selection and the stability of strategy evolution. However, the impact of topology on strategy evolution has remained under-explored. The present study fills this gap by examining how the structure of complex networks influences the time required for players to reach Nash Equilibrium and overall payoff. To identify the optimal complex network that ensures fixed-time convergence of continuous action iteration dilemma, achieves the shortest time, and attains the highest overall payoff in the Nash Equilibrium state, a deep reinforcement learning algorithm is designed to optimize the complex network. Firstly, the paper applies the Lyapunov stability theory to analyze the convergence of the fixed-time continuous action iteration dilemma and compute the upper bound of convergence time. Secondly, based on the fixed-time convergence of continuous action iteration dilemma, we establish evaluation criteria based on the time taken by players to reach the Nash Equilibrium and the overall payoff, subsequently designing evaluation functions for complex networks utilizing these criteria. Thirdly, this paper applies a deep reinforcement learning algorithm to resolve the optimization issue associated with the proposed evaluation function, while analyzing the convergence of complex network optimization methods. Lastly, the effectiveness of the proposed method is verified by simulating the dynamic model of snowdrift games and prisoner dilemmas.

KW - continuous action iteration dilemma

KW - deep reinforcement learning

KW - fixed-time

KW - Optimal complex network

UR - http://www.scopus.com/inward/record.url?scp=85189820714&partnerID=8YFLogxK

U2 - 10.1109/TNSE.2024.3384509

DO - 10.1109/TNSE.2024.3384509

M3 - 文章

AN - SCOPUS:85189820714

SN - 2327-4697

VL - 11

SP - 3771

EP - 3781

JO - IEEE Transactions on Network Science and Engineering

JF - IEEE Transactions on Network Science and Engineering

IS - 4

ER -

Complex Network Optimization for Fixed-Time Continuous Action Iteration Dilemma by Using Reinforcement Learning

摘要

访问文件

其它文件与链接

指纹

引用此