TY - GEN
T1 - Autonomous Penetration Trajectory Control for Unmanned Aerial Vehicle Based on Reinforcement Learning
AU - Liu, Yufei
AU - Yang, Zhen
AU - Chai, Shiyuan
AU - Wang, Xingyu
AU - Zhou, Yin
AU - Zhou, Deyun
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - With the rapid development of technology, the deployment of modern theater three-dimensional air defense systems is becoming increasingly rigorous. How to control the unmanned aerial vehicle (UAV) to quickly fly out of a high safety penetration trajectory under the threat of a tight ground air defense system is core of the penetration problem. Therefore, this paper proposes a TD3 algorithm based on LSTM target prediction for trajectory control of UAV in complex penetration environments. Firstly, the process of UAV penetration was analyzed and briefly described. Then, models of the UAV and threats were established separately. A target prediction method based on LSTM was designed to address the issue of errors in using the current target state iteration method. Using LSTM to extract and analyze the spatial features of the target during the penetration process to achieve accurate estimation of future situational rewards, and introducing it into the TD3 algorithm. The simulation results show that the TD3 penetration trajectory autonomous control algorithm based on LSTM target prediction is superior to other reinforcement learning algorithms. It can effectively control the trajectory of UAV and avoid ground threats, efficiently reach the vicinity of the target, and has certain application value.
AB - With the rapid development of technology, the deployment of modern theater three-dimensional air defense systems is becoming increasingly rigorous. How to control the unmanned aerial vehicle (UAV) to quickly fly out of a high safety penetration trajectory under the threat of a tight ground air defense system is core of the penetration problem. Therefore, this paper proposes a TD3 algorithm based on LSTM target prediction for trajectory control of UAV in complex penetration environments. Firstly, the process of UAV penetration was analyzed and briefly described. Then, models of the UAV and threats were established separately. A target prediction method based on LSTM was designed to address the issue of errors in using the current target state iteration method. Using LSTM to extract and analyze the spatial features of the target during the penetration process to achieve accurate estimation of future situational rewards, and introducing it into the TD3 algorithm. The simulation results show that the TD3 penetration trajectory autonomous control algorithm based on LSTM target prediction is superior to other reinforcement learning algorithms. It can effectively control the trajectory of UAV and avoid ground threats, efficiently reach the vicinity of the target, and has certain application value.
KW - Autonomous control
KW - TD3
KW - UAV
KW - complex threat environment
KW - ground penetration
UR - http://www.scopus.com/inward/record.url?scp=105000280865&partnerID=8YFLogxK
U2 - 10.1109/AIAC63745.2024.10899596
DO - 10.1109/AIAC63745.2024.10899596
M3 - 会议稿件
AN - SCOPUS:105000280865
T3 - 2024 2nd International Conference on Artificial Intelligence and Automation Control, AIAC 2024
SP - 213
EP - 219
BT - 2024 2nd International Conference on Artificial Intelligence and Automation Control, AIAC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Artificial Intelligence and Automation Control, AIAC 2024
Y2 - 20 December 2024 through 22 December 2024
ER -