Autonomous Decision Making of UAV in Short-Range Air Combat Based on DQN Aided by Expert Knowledge

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Recently, reinforcement learning (RL) has emerged in the field of autonomous air combat. However, it is well known that RL has the problems of low exploration efficiency and long training time in practical application. In this paper, we propose autonomous maneuver decision model based on deep Q-learning network (DQN) incorporating expert knowledge. First, we design a series of exploration rules based on expert knowledge. With the help of exploration rules, UAV is no longer randomly exploring in the whole space, but is able to avoid ineffective space exploration to improve exploration efficiency. In addition, we also introduce Imitation Learning (IL) to obtain an initial strategy for RL from the decision trajectory data demonstrated by human experts, which can speed up the training process. Finally, the simulation results verify the effectiveness of the UAV autonomous maneuver decision model.

Original languageEnglish
Title of host publicationProceedings of 2022 International Conference on Autonomous Unmanned Systems, ICAUS 2022
EditorsWenxing Fu, Mancang Gu, Yifeng Niu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages1661-1670
Number of pages10
ISBN (Print)9789819904785
DOIs
StatePublished - 2023
EventInternational Conference on Autonomous Unmanned Systems, ICAUS 2022 - Xi'an, China
Duration: 23 Sep 202225 Sep 2022

Publication series

NameLecture Notes in Electrical Engineering
Volume1010 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

ConferenceInternational Conference on Autonomous Unmanned Systems, ICAUS 2022
Country/TerritoryChina
CityXi'an
Period23/09/2225/09/22

Keywords

  • Air combat
  • Expert knowledge
  • Maneuver decision
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Autonomous Decision Making of UAV in Short-Range Air Combat Based on DQN Aided by Expert Knowledge'. Together they form a unique fingerprint.

Cite this