Skip to main navigation Skip to search Skip to main content

E-TD3: A Deep Reinforcement Learning-based Autonomous Flight Decision-Making Method for Unmanned Aerial Vehicles

  • Yi Zhang
  • , Yujie Cui
  • , Geng Wang
  • , Bo Li
  • Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

As the application of unmanned aerial vehicles(UAVs) in low-altitude airspace continues to broaden, higher requirements have been placed on their autonomous and intelligent manoeuvring and adaptive capabilities. To overcome this challenge, this paper proposes an end-to-end UAV flight decision-making method based on deep reinforcement learning, and provides a dynamic planning scheme for the mission of safely and stably avoiding the threat of environmental obstacles and tracking the target. The method is based on the Twin Delayed Deep Deterministic Policy Gradient (TD3) framework and introduces the Gated Recurrent Unit. To further improve the exploration capability and sample efficiency of the algorithm, we integrate expert experience into reinforcement learning and thus propose the E-TD3 algorithm. We reconstructed the experience replay buffer and designed a mixed sample collection mechanism to dynamically adjust the proportion of demonstration data. Finally, we perform experimental validation on the AirSim platform.

Original languageEnglish
Title of host publication2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350376739
DOIs
StatePublished - 2024
Event2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024 - Doha, Qatar
Duration: 8 Nov 202412 Nov 2024

Publication series

Name2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024

Conference

Conference2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024
Country/TerritoryQatar
CityDoha
Period8/11/2412/11/24

Keywords

  • deep reinforcement learning
  • expert experience
  • Gated Recurrent Unit
  • TD3 algorithm
  • UAV flight decision making

Fingerprint

Dive into the research topics of 'E-TD3: A Deep Reinforcement Learning-based Autonomous Flight Decision-Making Method for Unmanned Aerial Vehicles'. Together they form a unique fingerprint.

Cite this