A deep reinforcement learning approach incorporating genetic algorithm for missile path planning

Shuangfei Xu; Wenhao Bi; An Zhang; Yunong Wang

doi:10.1007/s13042-023-01998-0

A deep reinforcement learning approach incorporating genetic algorithm for missile path planning

Shuangfei Xu, Wenhao Bi, An Zhang, Yunong Wang

School of Aeronautics

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

11 Scopus citations

Abstract

The flight path planning of the missile is important in long-range air-to-ground strike missions. Constraints about missile guidance and guidance handover are considered, and path planning is required to conform to the missile motion model. Therefore, the missile’s allowable flight space and flight mode are further restricted, and the decision-making scale and difficulty of the path planning problem are significantly increased. A genetic algorithm incorporated twin delayed deep deterministic policy gradient (GA-TD3) algorithm is proposed for missile path planning, which uses high-quality data generated by GA to improve the TD3 training effect. Firstly, a missile path planning model is established based on the missile’s motion equations, and the missile guidance and guidance handover constraints are stated in detail. Then a fast path generation method is proposed, which uses several leading points to generate a leading path based on the optimal control theory, and the genetic algorithm is used to improve the leading path quality. Finally, the deep reinforcement learning model for the missile path planning problem is established based on the TD3 framework, and the leading paths participate in the leading training to improve the training effect. Simulation cases of 4 threat areas and 3 guidance platforms demonstrate the efficiency of the GA-TD3. Furthermore, the influence of three factors on the algorithm’s performance is tested, including the leading path quality, leading path number, and leading training cycle.

Original language	English
Pages (from-to)	1795-1814
Number of pages	20
Journal	International Journal of Machine Learning and Cybernetics
Volume	15
Issue number	5
DOIs	https://doi.org/10.1007/s13042-023-01998-0
State	Published - May 2024

Keywords

Deep reinforcement learning
Genetic algorithm
Path planning
Relay guidance

Access to Document

10.1007/s13042-023-01998-0

Cite this

@article{e4b17e16c3534cb499ede5f388d1cc43,

title = "A deep reinforcement learning approach incorporating genetic algorithm for missile path planning",

abstract = "The flight path planning of the missile is important in long-range air-to-ground strike missions. Constraints about missile guidance and guidance handover are considered, and path planning is required to conform to the missile motion model. Therefore, the missile{\textquoteright}s allowable flight space and flight mode are further restricted, and the decision-making scale and difficulty of the path planning problem are significantly increased. A genetic algorithm incorporated twin delayed deep deterministic policy gradient (GA-TD3) algorithm is proposed for missile path planning, which uses high-quality data generated by GA to improve the TD3 training effect. Firstly, a missile path planning model is established based on the missile{\textquoteright}s motion equations, and the missile guidance and guidance handover constraints are stated in detail. Then a fast path generation method is proposed, which uses several leading points to generate a leading path based on the optimal control theory, and the genetic algorithm is used to improve the leading path quality. Finally, the deep reinforcement learning model for the missile path planning problem is established based on the TD3 framework, and the leading paths participate in the leading training to improve the training effect. Simulation cases of 4 threat areas and 3 guidance platforms demonstrate the efficiency of the GA-TD3. Furthermore, the influence of three factors on the algorithm{\textquoteright}s performance is tested, including the leading path quality, leading path number, and leading training cycle.",

keywords = "Deep reinforcement learning, Genetic algorithm, Path planning, Relay guidance",

author = "Shuangfei Xu and Wenhao Bi and An Zhang and Yunong Wang",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023.",

year = "2024",

month = may,

doi = "10.1007/s13042-023-01998-0",

language = "英语",

volume = "15",

pages = "1795--1814",

journal = "International Journal of Machine Learning and Cybernetics",

issn = "1868-8071",

publisher = "Springer Science + Business Media",

number = "5",

}

TY - JOUR

T1 - A deep reinforcement learning approach incorporating genetic algorithm for missile path planning

AU - Xu, Shuangfei

AU - Bi, Wenhao

AU - Zhang, An

AU - Wang, Yunong

N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023.

PY - 2024/5

Y1 - 2024/5

N2 - The flight path planning of the missile is important in long-range air-to-ground strike missions. Constraints about missile guidance and guidance handover are considered, and path planning is required to conform to the missile motion model. Therefore, the missile’s allowable flight space and flight mode are further restricted, and the decision-making scale and difficulty of the path planning problem are significantly increased. A genetic algorithm incorporated twin delayed deep deterministic policy gradient (GA-TD3) algorithm is proposed for missile path planning, which uses high-quality data generated by GA to improve the TD3 training effect. Firstly, a missile path planning model is established based on the missile’s motion equations, and the missile guidance and guidance handover constraints are stated in detail. Then a fast path generation method is proposed, which uses several leading points to generate a leading path based on the optimal control theory, and the genetic algorithm is used to improve the leading path quality. Finally, the deep reinforcement learning model for the missile path planning problem is established based on the TD3 framework, and the leading paths participate in the leading training to improve the training effect. Simulation cases of 4 threat areas and 3 guidance platforms demonstrate the efficiency of the GA-TD3. Furthermore, the influence of three factors on the algorithm’s performance is tested, including the leading path quality, leading path number, and leading training cycle.

AB - The flight path planning of the missile is important in long-range air-to-ground strike missions. Constraints about missile guidance and guidance handover are considered, and path planning is required to conform to the missile motion model. Therefore, the missile’s allowable flight space and flight mode are further restricted, and the decision-making scale and difficulty of the path planning problem are significantly increased. A genetic algorithm incorporated twin delayed deep deterministic policy gradient (GA-TD3) algorithm is proposed for missile path planning, which uses high-quality data generated by GA to improve the TD3 training effect. Firstly, a missile path planning model is established based on the missile’s motion equations, and the missile guidance and guidance handover constraints are stated in detail. Then a fast path generation method is proposed, which uses several leading points to generate a leading path based on the optimal control theory, and the genetic algorithm is used to improve the leading path quality. Finally, the deep reinforcement learning model for the missile path planning problem is established based on the TD3 framework, and the leading paths participate in the leading training to improve the training effect. Simulation cases of 4 threat areas and 3 guidance platforms demonstrate the efficiency of the GA-TD3. Furthermore, the influence of three factors on the algorithm’s performance is tested, including the leading path quality, leading path number, and leading training cycle.

KW - Deep reinforcement learning

KW - Genetic algorithm

KW - Path planning

KW - Relay guidance

UR - http://www.scopus.com/inward/record.url?scp=85176469400&partnerID=8YFLogxK

U2 - 10.1007/s13042-023-01998-0

DO - 10.1007/s13042-023-01998-0

M3 - 文章

AN - SCOPUS:85176469400

SN - 1868-8071

VL - 15

SP - 1795

EP - 1814

JO - International Journal of Machine Learning and Cybernetics

JF - International Journal of Machine Learning and Cybernetics

IS - 5

ER -

A deep reinforcement learning approach incorporating genetic algorithm for missile path planning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this