Efficient Air Defense Temporal Decision-Making Methods Under Unstable Dimensional States

Jinlong Wei, Nan Jiang, Wu Sun, Chengli Fan, Dengxiu Yu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

To address the needs of future intelligent warfare, this paper proposes a hierarchical intelligent decision-making algorithm (ATT-PPO) based on the PPO algorithm and multi-head attention mechanism, designed to solve air defense sequential decision-making problems under unstable dimensional states. Traditional reinforcement learning struggles to handle variable-dimensional observation information, whereas ATT-PPO uses the attention mechanism to map variable-dimensional observations into fixed dimensions, thereby enhancing the representation of high-dimensional dynamic data. However, high-dimensional decision spaces increase the learning burden of the algorithm. To mitigate this, a hierarchical reinforcement learning architecture is designed, incorporating domain knowledge to narrow the exploration space and improve decision-making efficiency. Simulation results show that ATT-PPO significantly outperforms traditional expert control methods in cumulative rewards, validating its superior performance.
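As an illustration of how an attention mechanism can map a variable number of observed entities to a fixed-size vector, the following is a minimal sketch and not the authors' ATT-PPO architecture, which the abstract does not specify. It assumes a learned-query attention pooling per head. The class name `AttentionPool`, the dimensions, and the random weights are all hypothetical; a real model would train these weights jointly with the PPO policy.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class AttentionPool:
    """Hypothetical multi-head attention pooling (illustrative only).

    Each head owns a query vector plus key and value projections.
    The output length is num_heads * head_dim, whatever the entity count.
    """

    def __init__(self, feat_dim, head_dim, num_heads, seed=0):
        rng = random.Random(seed)  # random stand-ins for trained weights
        self.head_dim = head_dim
        self.num_heads = num_heads
        self.heads = [
            {
                "query": [rng.uniform(-0.5, 0.5) for _ in range(head_dim)],
                "W_k": random_matrix(head_dim, feat_dim, rng),
                "W_v": random_matrix(head_dim, feat_dim, rng),
            }
            for _ in range(num_heads)
        ]

    def __call__(self, entities):
        # No entities observed: return a zero vector of the fixed size.
        if not entities:
            return [0.0] * (self.num_heads * self.head_dim)
        out = []
        for head in self.heads:
            keys = [matvec(head["W_k"], e) for e in entities]
            values = [matvec(head["W_v"], e) for e in entities]
            # Scaled dot-product score of the head's query against each key.
            scores = [
                sum(q * k for q, k in zip(head["query"], key))
                / math.sqrt(self.head_dim)
                for key in keys
            ]
            weights = softmax(scores)  # one weight per entity, sums to 1
            # Weighted sum of values collapses the entity axis.
            pooled = [
                sum(w * v[j] for w, v in zip(weights, values))
                for j in range(self.head_dim)
            ]
            out.extend(pooled)
        return out


pool = AttentionPool(feat_dim=4, head_dim=8, num_heads=2)
three_targets = [[0.1 * i, 0.2, -0.3, 1.0] for i in range(3)]
seven_targets = [[0.1 * i, 0.2, -0.3, 1.0] for i in range(7)]
print(len(pool(three_targets)), len(pool(seven_targets)))
```

Both calls return a vector of length `num_heads * head_dim`, so a downstream policy network with a fixed input size can consume either observation. Because the pooling is a weighted sum over entities, the result also does not depend on the order in which the entities are listed.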

Original language: English
Title of host publication: Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024
Editors: Jian Wang, Witold Pedrycz
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 366-371
Number of pages: 6
ISBN (Electronic): 9798331517021
DOIs
State: Published - 2024
Event: 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024 - Qingdao, China
Duration: 18 Oct 2024 → 20 Oct 2024

Publication series

Name: Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024

Conference

Conference: 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024
Country/Territory: China
City: Qingdao
Period: 18/10/24 → 20/10/24

Keywords

  • attention mechanism
  • dynamic observation
  • intelligent combat
  • PPO algorithm
