Research on Maneuver Decision Algorithm of Multi-UAV Based on Action Intention Reinforcement Learning

  • Weiyu Huo
  • , Zhenjiang Lian
  • , Yang Liu
  • , Yeehom Fang
  • , Deyun Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In multi-UAV air combat, traditional reinforcement learning (RL) struggles with high computational demands and policy learning inefficiencies arising from complex mission tasks and intensive inter-agent interactions. To address these limitations, this study proposes an intention-based RL framework that leverages prior knowledge to enhance learning efficiency and improve state-action mapping. Within this framework, a multi-UAV maneuver decision-making algorithm is developed, defining four action intentions - attack, surveillance, support, and evasion - to represent tactical objectives, each mapped to feasible maneuver actions based on situational data. The algorithm integrates a situational assessment model, cooperative target allocation, and a deep Q-network (DQN) to guide decision-making. Simulation results confirm the approach's effectiveness in enhancing multi-UAV maneuver coordination and decision performance.

Original languageEnglish
Title of host publicationProceedings of the 44th Chinese Control Conference, CCC 2025
EditorsJian Sun, Hongpeng Yin
PublisherIEEE Computer Society
Pages2784-2789
Number of pages6
ISBN (Electronic)9789887581611
DOIs
StatePublished - 2025
Event44th Chinese Control Conference, CCC 2025 - Chongqing, China
Duration: 28 Jul 202530 Jul 2025

Publication series

NameChinese Control Conference, CCC
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference44th Chinese Control Conference, CCC 2025
Country/TerritoryChina
CityChongqing
Period28/07/2530/07/25

Keywords

  • maneuvering decision
  • multi-UAV
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Research on Maneuver Decision Algorithm of Multi-UAV Based on Action Intention Reinforcement Learning'. Together they form a unique fingerprint.

Cite this