TY - JOUR
T1 - Integral reinforcement learning based dynamic stackelberg pursuit-evasion game for unmanned surface vehicles
AU - Hu, Xiaoxiang
AU - Liu, Shuaizheng
AU - Xu, Jingwen
AU - Xiao, Bing
AU - Guo, Chenguang
N1 - Publisher Copyright:
© 2024 Faculty of Engineering, Alexandria University
PY - 2024/12
Y1 - 2024/12
N2 - The dynamic stackelberg pursuit-evasion (PE) game of unmanned surface vehicles (USVs) is discussed in this paper. The optimal solution method of the USVs’ PE game is proposed. The USVs’ PE game is firstly described by the pursuit motion on two-dimensional bounded surface, and considering the optimal decision order of evader and pursuer, the pursuit-evasion game is modeled by a dynamic stackelberg game. Then the optimal game solution problem is transformed into the solution of Hamilton–Jacobi–Isaacs equations, and on-policy iteration of integral reinforcement learning algorithm is utilized. Neural network is also utilized for the value iteration solution of the dynamic stackelberg pursuit-evasion game. The existence of the stackelberg equilibrium and the global asymptotic stability of the system are all discussed related to optimal control and differential game. Finally, the presented method is tested by pursuit-evasion game between USVs.
AB - The dynamic stackelberg pursuit-evasion (PE) game of unmanned surface vehicles (USVs) is discussed in this paper. The optimal solution method of the USVs’ PE game is proposed. The USVs’ PE game is firstly described by the pursuit motion on two-dimensional bounded surface, and considering the optimal decision order of evader and pursuer, the pursuit-evasion game is modeled by a dynamic stackelberg game. Then the optimal game solution problem is transformed into the solution of Hamilton–Jacobi–Isaacs equations, and on-policy iteration of integral reinforcement learning algorithm is utilized. Neural network is also utilized for the value iteration solution of the dynamic stackelberg pursuit-evasion game. The existence of the stackelberg equilibrium and the global asymptotic stability of the system are all discussed related to optimal control and differential game. Finally, the presented method is tested by pursuit-evasion game between USVs.
KW - Dynamic stackelberg game
KW - Integral reinforcement learning
KW - Policy iteration
KW - Pursuit-evasion game
UR - http://www.scopus.com/inward/record.url?scp=85200235262&partnerID=8YFLogxK
U2 - 10.1016/j.aej.2024.07.085
DO - 10.1016/j.aej.2024.07.085
M3 - 文章
AN - SCOPUS:85200235262
SN - 1110-0168
VL - 108
SP - 428
EP - 435
JO - Alexandria Engineering Journal
JF - Alexandria Engineering Journal
ER -