E-TD3: A Deep Reinforcement Learning-based Autonomous Flight Decision-Making Method for Unmanned Aerial Vehicles

Yi Zhang, Yujie Cui, Geng Wang, Bo Li

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

As the application of unmanned aerial vehicles(UAVs) in low-altitude airspace continues to broaden, higher requirements have been placed on their autonomous and intelligent manoeuvring and adaptive capabilities. To overcome this challenge, this paper proposes an end-to-end UAV flight decision-making method based on deep reinforcement learning, and provides a dynamic planning scheme for the mission of safely and stably avoiding the threat of environmental obstacles and tracking the target. The method is based on the Twin Delayed Deep Deterministic Policy Gradient (TD3) framework and introduces the Gated Recurrent Unit. To further improve the exploration capability and sample efficiency of the algorithm, we integrate expert experience into reinforcement learning and thus propose the E-TD3 algorithm. We reconstructed the experience replay buffer and designed a mixed sample collection mechanism to dynamically adjust the proportion of demonstration data. Finally, we perform experimental validation on the AirSim platform.

源语言英语
主期刊名2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798350376739
DOI
出版状态已出版 - 2024
活动2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024 - Doha, 卡塔尔
期限: 8 11月 202412 11月 2024

出版系列

姓名2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024

会议

会议2024 International Conference on Cyber-Physical Social Intelligence, ICCSI 2024
国家/地区卡塔尔
Doha
时期8/11/2412/11/24

指纹

探究 'E-TD3: A Deep Reinforcement Learning-based Autonomous Flight Decision-Making Method for Unmanned Aerial Vehicles' 的科研主题。它们共同构成独一无二的指纹。

引用此