TY - JOUR
T1 - Approximate Optimal Strategy for Multiagent System Pursuit-Evasion Game
AU - Xu, Zhiqiang
AU - Yu, Dengxiu
AU - Liu, Yan Jun
AU - Wang, Zhen
N1 - Publisher Copyright:
© 2007-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - In this article, we propose an approximate optimal control strategy for a class of nonlinear multiagent system pursuit-evasion games. Herein, multiple pursuers aim to capture multiple evaders trying to evade capture. Under the competitive framework, agents not only pursue their individual goals but also consider coordination with their teammates to achieve collective objectives. However, maintaining cohesion with teammates in existing distributed control methods has always been a challenge. To enhance team coordination, we employ a graph-theoretic approach to represent the relationships between agents. Based on this, we design a dynamic target graph algorithm to enhance the coordination among pursuers. The approximate optimal strategies for each agent are solved by utilizing the Hamilton-Jacobi-Isaacs equations of the system. As solving these equations becomes computationally intensive in multiagent scenarios, we propose a value-based single network adaptive critic network architecture. In addition, we consider scenarios where the numbers of agents on both sides are inconsistent and address the phenomenon of input saturation. Moreover, we provide sufficient conditions to prove the system's stability. Finally, simulations conducted in two representative scenarios, multiple-pursuer-one-evader and multiple-pursuer-multiple-evader, demonstrate the effectiveness of our proposed algorithm.
AB - In this article, we propose an approximate optimal control strategy for a class of nonlinear multiagent system pursuit-evasion games. Herein, multiple pursuers aim to capture multiple evaders trying to evade capture. Under the competitive framework, agents not only pursue their individual goals but also consider coordination with their teammates to achieve collective objectives. However, maintaining cohesion with teammates in existing distributed control methods has always been a challenge. To enhance team coordination, we employ a graph-theoretic approach to represent the relationships between agents. Based on this, we design a dynamic target graph algorithm to enhance the coordination among pursuers. The approximate optimal strategies for each agent are solved by utilizing the Hamilton-Jacobi-Isaacs equations of the system. As solving these equations becomes computationally intensive in multiagent scenarios, we propose a value-based single network adaptive critic network architecture. In addition, we consider scenarios where the numbers of agents on both sides are inconsistent and address the phenomenon of input saturation. Moreover, we provide sufficient conditions to prove the system's stability. Finally, simulations conducted in two representative scenarios, multiple-pursuer-one-evader and multiple-pursuer-multiple-evader, demonstrate the effectiveness of our proposed algorithm.
KW - Approximate optimal control
KW - multiagent systems
KW - pursuite evasion games
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85200246428&partnerID=8YFLogxK
U2 - 10.1109/JSYST.2024.3432796
DO - 10.1109/JSYST.2024.3432796
M3 - 文章
AN - SCOPUS:85200246428
SN - 1932-8184
VL - 18
SP - 1669
EP - 1680
JO - IEEE Systems Journal
JF - IEEE Systems Journal
IS - 3
ER -