Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization

Xiaoyang Li, Hairuo Zhang, Teng Wang, Haonan Li, Ying Zhou, Deyun Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Collaborative guidance technology is a crucial means to enhance the effectiveness of strikes. To achieve precise collaboration among multiple missiles targeting a common objective, this paper addresses the issue of inaccurate calculation of the virtual impact point control expected flight time resulting from the use of fast iterative algorithms. We propose a method based on proximal policy optimization to calculate the virtual impact point. A collaborative guidance model under multiple constraints is established, and a proximal policy optimization algorithm is applied to optimize the collaborative guidance law. The calculation parameters of the virtual impact point are treated as actions of an intelligent agent acting on the environment, with velocity, desired pitch angle, and position coordinates serving as the algorithm's observations. A reward function reflecting the collaborative time is constructed, establishing a multi-constraint collaborative guidance law based on intelligent learning. Extensive simulation experiments targeting stationary targets demonstrate the rationality and effectiveness of the proposed method. After training, the intelligent agent provides different desired attack angles, and, based on the observation space, it can generate corresponding parameters. In some scenarios, the precision of hitting time surpasses that of fast iterative algorithms.

Original languageEnglish
Title of host publicationProceedings of the 36th Chinese Control and Decision Conference, CCDC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages807-812
Number of pages6
ISBN (Electronic)9798350387780
DOIs
StatePublished - 2024
Event36th Chinese Control and Decision Conference, CCDC 2024 - Xi'an, China
Duration: 25 May 202427 May 2024

Publication series

NameProceedings of the 36th Chinese Control and Decision Conference, CCDC 2024

Conference

Conference36th Chinese Control and Decision Conference, CCDC 2024
Country/TerritoryChina
CityXi'an
Period25/05/2427/05/24

Keywords

  • FOV constraint
  • angle constraint
  • cooperative guidance
  • reinforcement learning
  • three-dimensional guidance
  • time constraint

Fingerprint

Dive into the research topics of 'Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization'. Together they form a unique fingerprint.

Cite this