跳到主要导航 跳到搜索 跳到主要内容

CBPE-Based Assignment Policy for Multiple Player Pursuit-Evasion Game by D3QN Algorithm Applied to Uncrewed Vehicles

科研成果: 期刊稿件文章同行评审

摘要

This article proposes novel assignment policies based on constant-bearing and pure-evading control policies (CBPE-based assignment policies) for a multiple player pursuit-evasion (MP-PE) game by a dueling double deep Q-leaning (D3QN) algorithm applied to uncrewed vehicles (UVs). In the MP-PE game, the D3QN algorithm, which incorporates deep, dueling and double Q-learning methods is used for training CBPE-based assignment policies for both pursuer and evader teams. In this case, Nash equilibrium (NE) approximate solutions of CBPE-based assignment policies are derived, which are equivalent to NE control policies for the MP-PE game. The goal of this article is to develop approximate NE CBPE-based assignment policies trained by the D3QN algorithm which are equivalent to NE control policies for both pursuer and evader teams in the MP-PE game. Subsequently, sufficient conditions are provided for analyzing the equivalence between NE CBPE-based assignment policies and NE control policies. Furthermore, with the compression mapping condition, the convergence and optimality of the D3QN algorithm are obtained for NE CBPE-based assignment policies of the MP-PE game. Finally, illustrative examples and experiments are provided to demonstrate the feasibility and usefulness of the proposed NE CBPE-based assignment policies trained by the D3QN algorithm.

源语言英语
期刊IEEE Transactions on Industrial Electronics
DOI
出版状态已接受/待刊 - 2026

指纹

探究 'CBPE-Based Assignment Policy for Multiple Player Pursuit-Evasion Game by D3QN Algorithm Applied to Uncrewed Vehicles' 的科研主题。它们共同构成独一无二的指纹。

引用此