A Deep Reinforcement Learning-Based Cooperative Guidance Strategy Under Uncontrollable Velocity Conditions

Hao Cui, Ke Zhang, Minghu Tan, Jingyu Wang

Research output: Contribution to journalArticlepeer-review

Abstract

We present a novel approach to generating a cooperative guidance strategy using deep reinforcement learning to address the challenge of cooperative multi-missile strikes under uncontrollable velocity conditions. This method employs the multi-agent proximal policy optimization (MAPPO) algorithm to construct a continuous action space framework for intelligent cooperative guidance. A heuristically reshaped reward function is designed to enhance cooperative guidance among agents, enabling effective target engagement while mitigating the low learning efficiency caused by sparse reward signals in the guidance environment. Additionally, a multi-stage curriculum learning approach is introduced to smooth agent actions, effectively reducing action oscillations arising from independent sampling in reinforcement learning. Simulation results demonstrate that the proposed deep reinforcement learning-based guidance law can successfully achieve cooperative attacks across a range of randomized initial conditions.

Original languageEnglish
Article number411
JournalAerospace
Volume12
Issue number5
DOIs
StatePublished - May 2025

Keywords

  • cooperative guidance
  • curriculum learning
  • MAPPO
  • uncontrollable velocity

Fingerprint

Dive into the research topics of 'A Deep Reinforcement Learning-Based Cooperative Guidance Strategy Under Uncontrollable Velocity Conditions'. Together they form a unique fingerprint.

Cite this