TY - JOUR
T1 - A deep reinforcement learning approach incorporating genetic algorithm for missile path planning
AU - Xu, Shuangfei
AU - Bi, Wenhao
AU - Zhang, An
AU - Wang, Yunong
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023.
PY - 2024/5
Y1 - 2024/5
N2 - The flight path planning of the missile is important in long-range air-to-ground strike missions. Constraints about missile guidance and guidance handover are considered, and path planning is required to conform to the missile motion model. Therefore, the missile’s allowable flight space and flight mode are further restricted, and the decision-making scale and difficulty of the path planning problem are significantly increased. A genetic algorithm incorporated twin delayed deep deterministic policy gradient (GA-TD3) algorithm is proposed for missile path planning, which uses high-quality data generated by GA to improve the TD3 training effect. Firstly, a missile path planning model is established based on the missile’s motion equations, and the missile guidance and guidance handover constraints are stated in detail. Then a fast path generation method is proposed, which uses several leading points to generate a leading path based on the optimal control theory, and the genetic algorithm is used to improve the leading path quality. Finally, the deep reinforcement learning model for the missile path planning problem is established based on the TD3 framework, and the leading paths participate in the leading training to improve the training effect. Simulation cases of 4 threat areas and 3 guidance platforms demonstrate the efficiency of the GA-TD3. Furthermore, the influence of three factors on the algorithm’s performance is tested, including the leading path quality, leading path number, and leading training cycle.
AB - The flight path planning of the missile is important in long-range air-to-ground strike missions. Constraints about missile guidance and guidance handover are considered, and path planning is required to conform to the missile motion model. Therefore, the missile’s allowable flight space and flight mode are further restricted, and the decision-making scale and difficulty of the path planning problem are significantly increased. A genetic algorithm incorporated twin delayed deep deterministic policy gradient (GA-TD3) algorithm is proposed for missile path planning, which uses high-quality data generated by GA to improve the TD3 training effect. Firstly, a missile path planning model is established based on the missile’s motion equations, and the missile guidance and guidance handover constraints are stated in detail. Then a fast path generation method is proposed, which uses several leading points to generate a leading path based on the optimal control theory, and the genetic algorithm is used to improve the leading path quality. Finally, the deep reinforcement learning model for the missile path planning problem is established based on the TD3 framework, and the leading paths participate in the leading training to improve the training effect. Simulation cases of 4 threat areas and 3 guidance platforms demonstrate the efficiency of the GA-TD3. Furthermore, the influence of three factors on the algorithm’s performance is tested, including the leading path quality, leading path number, and leading training cycle.
KW - Deep reinforcement learning
KW - Genetic algorithm
KW - Path planning
KW - Relay guidance
UR - http://www.scopus.com/inward/record.url?scp=85176469400&partnerID=8YFLogxK
U2 - 10.1007/s13042-023-01998-0
DO - 10.1007/s13042-023-01998-0
M3 - 文章
AN - SCOPUS:85176469400
SN - 1868-8071
VL - 15
SP - 1795
EP - 1814
JO - International Journal of Machine Learning and Cybernetics
JF - International Journal of Machine Learning and Cybernetics
IS - 5
ER -