TY - JOUR
T1 - A formal model for multiagent Q-learning on graphs
AU - Liu, Jinzhuo
AU - Jiang, Guangchen
AU - Chu, Chen
AU - Li, Yong
AU - Wang, Zhen
AU - Hu, Shuyue
N1 - Publisher Copyright:
© Science China Press 2025.
PY - 2025/9
Y1 - 2025/9
N2 - Understanding the dynamics of multi-agent learning has long been an important research topic. Existing research has focused mostly on 2-agent games or well-mixed populations. However, in real-world multi-agent systems, agents often interact in spatially or socially structured networks (or graphs). In this paper, we examine the dynamics of multi-agent Q-learning on graphs. Combining mean-field theory and combinatorics analysis, we present a new analytical approach to formally describe the time evolution of Q-values in the system with a topological structure. Through extensive numerical simulations, we show that our theory consistently provides an accurate depiction of the Q-learning dynamics across different typical games, initial conditions, and various graph structures, encompassing regular graphs, scale-free graphs, and random graphs. Moreover, we show that when comparing regular graphs to other types of graphs with the same average degree, the differences in the system evolution are largely attributed to the behaviors and Q-values of agents with lower degrees.
AB - Understanding the dynamics of multi-agent learning has long been an important research topic. Existing research has focused mostly on 2-agent games or well-mixed populations. However, in real-world multi-agent systems, agents often interact in spatially or socially structured networks (or graphs). In this paper, we examine the dynamics of multi-agent Q-learning on graphs. Combining mean-field theory and combinatorics analysis, we present a new analytical approach to formally describe the time evolution of Q-values in the system with a topological structure. Through extensive numerical simulations, we show that our theory consistently provides an accurate depiction of the Q-learning dynamics across different typical games, initial conditions, and various graph structures, encompassing regular graphs, scale-free graphs, and random graphs. Moreover, we show that when comparing regular graphs to other types of graphs with the same average degree, the differences in the system evolution are largely attributed to the behaviors and Q-values of agents with lower degrees.
KW - game theory
KW - graph theory
KW - multiagent
KW - Q-learning dynamics
UR - http://www.scopus.com/inward/record.url?scp=105008726553&partnerID=8YFLogxK
U2 - 10.1007/s11432-024-4289-6
DO - 10.1007/s11432-024-4289-6
M3 - 文章
AN - SCOPUS:105008726553
SN - 1674-733X
VL - 68
JO - Science China Information Sciences
JF - Science China Information Sciences
IS - 9
M1 - 192206
ER -