Reinforcement-Learning-Based Counter Deception for Nonlinear Pursuit-Evasion Game With Incomplete and Asymmetric Information

Yongkang Wang, Rongxin Cui, Weisheng Yan, Xinxin Guo, Shouxu Zhang, Zhuo Zhang, Zhexuan Zhao

科研成果: 期刊稿件文章同行评审

摘要

In this article, we investigate the problem of capturing a noncooperative target with deception behavior using reinforcement learning (RL) under incomplete information. The pursuer copes not only with its maneuverability constraint but also with the target's deception behavior, in which the target deliberately conceals its private preference information. The target capture game involving deception behavior is formulated as a nonlinear differential game framework where the information structure is incomplete and asymmetric. The solution to this differential game is proposed based on an RL policy that incorporates critic, actor, and virtual actor neural networks (NNs), when taking into consideration the maneuverability constraint and information structure of the pursuer. Moreover, the states of the constrained adversarial system and the weight errors are proven to be ultimately uniformly bounded (UUB). To counter the deception of the target, we adopt unscented Kalman filter (UKF) to obtain the target intention on energy preference, and integrate it into the pursuer strategy. The feasibility of the proposed strategy and its superiority are verified through comparisons with recent works.

指纹

探究 'Reinforcement-Learning-Based Counter Deception for Nonlinear Pursuit-Evasion Game With Incomplete and Asymmetric Information' 的科研主题。它们共同构成独一无二的指纹。

引用此