A formal model for multiagent Q-learning on graphs

Jinzhuo Liu; Guangchen Jiang; Chen Chu; Yong Li; Zhen Wang; Shuyue Hu

doi:10.1007/s11432-024-4289-6

A formal model for multiagent Q-learning on graphs

Jinzhuo Liu, Guangchen Jiang, Chen Chu, Yong Li, Zhen Wang, Shuyue Hu

网络空间安全学院

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

Understanding the dynamics of multi-agent learning has long been an important research topic. Existing research has focused mostly on 2-agent games or well-mixed populations. However, in real-world multi-agent systems, agents often interact in spatially or socially structured networks (or graphs). In this paper, we examine the dynamics of multi-agent Q-learning on graphs. Combining mean-field theory and combinatorics analysis, we present a new analytical approach to formally describe the time evolution of Q-values in the system with a topological structure. Through extensive numerical simulations, we show that our theory consistently provides an accurate depiction of the Q-learning dynamics across different typical games, initial conditions, and various graph structures, encompassing regular graphs, scale-free graphs, and random graphs. Moreover, we show that when comparing regular graphs to other types of graphs with the same average degree, the differences in the system evolution are largely attributed to the behaviors and Q-values of agents with lower degrees.

源语言	英语
文章编号	192206
期刊	Science China Information Sciences
卷	68
期	9
DOI	https://doi.org/10.1007/s11432-024-4289-6
出版状态	已出版 - 9月 2025

访问文件

10.1007/s11432-024-4289-6

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{cb9a5e8885164385a8443142b88fe978,

title = "A formal model for multiagent Q-learning on graphs",

abstract = "Understanding the dynamics of multi-agent learning has long been an important research topic. Existing research has focused mostly on 2-agent games or well-mixed populations. However, in real-world multi-agent systems, agents often interact in spatially or socially structured networks (or graphs). In this paper, we examine the dynamics of multi-agent Q-learning on graphs. Combining mean-field theory and combinatorics analysis, we present a new analytical approach to formally describe the time evolution of Q-values in the system with a topological structure. Through extensive numerical simulations, we show that our theory consistently provides an accurate depiction of the Q-learning dynamics across different typical games, initial conditions, and various graph structures, encompassing regular graphs, scale-free graphs, and random graphs. Moreover, we show that when comparing regular graphs to other types of graphs with the same average degree, the differences in the system evolution are largely attributed to the behaviors and Q-values of agents with lower degrees.",

keywords = "game theory, graph theory, multiagent, Q-learning dynamics",

author = "Jinzhuo Liu and Guangchen Jiang and Chen Chu and Yong Li and Zhen Wang and Shuyue Hu",

note = "Publisher Copyright: {\textcopyright} Science China Press 2025.",

year = "2025",

month = sep,

doi = "10.1007/s11432-024-4289-6",

language = "英语",

volume = "68",

journal = "Science China Information Sciences",

issn = "1674-733X",

publisher = "Science China Press ",

number = "9",

}

TY - JOUR

T1 - A formal model for multiagent Q-learning on graphs

AU - Liu, Jinzhuo

AU - Jiang, Guangchen

AU - Chu, Chen

AU - Li, Yong

AU - Wang, Zhen

AU - Hu, Shuyue

N1 - Publisher Copyright: © Science China Press 2025.

PY - 2025/9

Y1 - 2025/9

N2 - Understanding the dynamics of multi-agent learning has long been an important research topic. Existing research has focused mostly on 2-agent games or well-mixed populations. However, in real-world multi-agent systems, agents often interact in spatially or socially structured networks (or graphs). In this paper, we examine the dynamics of multi-agent Q-learning on graphs. Combining mean-field theory and combinatorics analysis, we present a new analytical approach to formally describe the time evolution of Q-values in the system with a topological structure. Through extensive numerical simulations, we show that our theory consistently provides an accurate depiction of the Q-learning dynamics across different typical games, initial conditions, and various graph structures, encompassing regular graphs, scale-free graphs, and random graphs. Moreover, we show that when comparing regular graphs to other types of graphs with the same average degree, the differences in the system evolution are largely attributed to the behaviors and Q-values of agents with lower degrees.

AB - Understanding the dynamics of multi-agent learning has long been an important research topic. Existing research has focused mostly on 2-agent games or well-mixed populations. However, in real-world multi-agent systems, agents often interact in spatially or socially structured networks (or graphs). In this paper, we examine the dynamics of multi-agent Q-learning on graphs. Combining mean-field theory and combinatorics analysis, we present a new analytical approach to formally describe the time evolution of Q-values in the system with a topological structure. Through extensive numerical simulations, we show that our theory consistently provides an accurate depiction of the Q-learning dynamics across different typical games, initial conditions, and various graph structures, encompassing regular graphs, scale-free graphs, and random graphs. Moreover, we show that when comparing regular graphs to other types of graphs with the same average degree, the differences in the system evolution are largely attributed to the behaviors and Q-values of agents with lower degrees.

KW - game theory

KW - graph theory

KW - multiagent

KW - Q-learning dynamics

UR - http://www.scopus.com/inward/record.url?scp=105008726553&partnerID=8YFLogxK

U2 - 10.1007/s11432-024-4289-6

DO - 10.1007/s11432-024-4289-6

M3 - 文章

AN - SCOPUS:105008726553

SN - 1674-733X

VL - 68

JO - Science China Information Sciences

JF - Science China Information Sciences

IS - 9

M1 - 192206

ER -

A formal model for multiagent Q-learning on graphs

摘要

访问文件

其它文件与链接

指纹

引用此