Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization

Xiaoyang Li; Hairuo Zhang; Teng Wang; Haonan Li; Ying Zhou; Deyun Zhou

doi:10.1109/CCDC62350.2024.10587609

Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization

Xiaoyang Li, Hairuo Zhang, Teng Wang, Haonan Li, Ying Zhou, Deyun Zhou

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Collaborative guidance technology is a crucial means to enhance the effectiveness of strikes. To achieve precise collaboration among multiple missiles targeting a common objective, this paper addresses the issue of inaccurate calculation of the virtual impact point control expected flight time resulting from the use of fast iterative algorithms. We propose a method based on proximal policy optimization to calculate the virtual impact point. A collaborative guidance model under multiple constraints is established, and a proximal policy optimization algorithm is applied to optimize the collaborative guidance law. The calculation parameters of the virtual impact point are treated as actions of an intelligent agent acting on the environment, with velocity, desired pitch angle, and position coordinates serving as the algorithm's observations. A reward function reflecting the collaborative time is constructed, establishing a multi-constraint collaborative guidance law based on intelligent learning. Extensive simulation experiments targeting stationary targets demonstrate the rationality and effectiveness of the proposed method. After training, the intelligent agent provides different desired attack angles, and, based on the observation space, it can generate corresponding parameters. In some scenarios, the precision of hitting time surpasses that of fast iterative algorithms.

Original language	English
Title of host publication	Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	807-812
Number of pages	6
ISBN (Electronic)	9798350387780
DOIs	https://doi.org/10.1109/CCDC62350.2024.10587609
State	Published - 2024
Event	36th Chinese Control and Decision Conference, CCDC 2024 - Xi'an, China Duration: 25 May 2024 → 27 May 2024

Publication series

Name	Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024

Conference

Conference	36th Chinese Control and Decision Conference, CCDC 2024
Country/Territory	China
City	Xi'an
Period	25/05/24 → 27/05/24

Keywords

FOV constraint
angle constraint
cooperative guidance
reinforcement learning
three-dimensional guidance
time constraint

Access to Document

10.1109/CCDC62350.2024.10587609

Cite this

Li, X., Zhang, H., Wang, T., Li, H., Zhou, Y., & Zhou, D. (2024). Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization. In Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024 (pp. 807-812). (Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CCDC62350.2024.10587609

Li, Xiaoyang ; Zhang, Hairuo ; Wang, Teng et al. / Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization. Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 807-812 (Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024).

@inproceedings{61cf3e8d5e4542b8947e5241b181b727,

title = "Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization",

abstract = "Collaborative guidance technology is a crucial means to enhance the effectiveness of strikes. To achieve precise collaboration among multiple missiles targeting a common objective, this paper addresses the issue of inaccurate calculation of the virtual impact point control expected flight time resulting from the use of fast iterative algorithms. We propose a method based on proximal policy optimization to calculate the virtual impact point. A collaborative guidance model under multiple constraints is established, and a proximal policy optimization algorithm is applied to optimize the collaborative guidance law. The calculation parameters of the virtual impact point are treated as actions of an intelligent agent acting on the environment, with velocity, desired pitch angle, and position coordinates serving as the algorithm's observations. A reward function reflecting the collaborative time is constructed, establishing a multi-constraint collaborative guidance law based on intelligent learning. Extensive simulation experiments targeting stationary targets demonstrate the rationality and effectiveness of the proposed method. After training, the intelligent agent provides different desired attack angles, and, based on the observation space, it can generate corresponding parameters. In some scenarios, the precision of hitting time surpasses that of fast iterative algorithms.",

keywords = "FOV constraint, angle constraint, cooperative guidance, reinforcement learning, three-dimensional guidance, time constraint",

author = "Xiaoyang Li and Hairuo Zhang and Teng Wang and Haonan Li and Ying Zhou and Deyun Zhou",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 36th Chinese Control and Decision Conference, CCDC 2024 ; Conference date: 25-05-2024 Through 27-05-2024",

year = "2024",

doi = "10.1109/CCDC62350.2024.10587609",

language = "英语",

series = "Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "807--812",

booktitle = "Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024",

}

Li, X, Zhang, H, Wang, T, Li, H, Zhou, Y & Zhou, D 2024, Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization. in Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024. Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024, Institute of Electrical and Electronics Engineers Inc., pp. 807-812, 36th Chinese Control and Decision Conference, CCDC 2024, Xi'an, China, 25/05/24. https://doi.org/10.1109/CCDC62350.2024.10587609

Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization. / Li, Xiaoyang; Zhang, Hairuo; Wang, Teng et al.
Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024. Institute of Electrical and Electronics Engineers Inc., 2024. p. 807-812 (Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization

AU - Li, Xiaoyang

AU - Zhang, Hairuo

AU - Wang, Teng

AU - Li, Haonan

AU - Zhou, Ying

AU - Zhou, Deyun

PY - 2024

Y1 - 2024

N2 - Collaborative guidance technology is a crucial means to enhance the effectiveness of strikes. To achieve precise collaboration among multiple missiles targeting a common objective, this paper addresses the issue of inaccurate calculation of the virtual impact point control expected flight time resulting from the use of fast iterative algorithms. We propose a method based on proximal policy optimization to calculate the virtual impact point. A collaborative guidance model under multiple constraints is established, and a proximal policy optimization algorithm is applied to optimize the collaborative guidance law. The calculation parameters of the virtual impact point are treated as actions of an intelligent agent acting on the environment, with velocity, desired pitch angle, and position coordinates serving as the algorithm's observations. A reward function reflecting the collaborative time is constructed, establishing a multi-constraint collaborative guidance law based on intelligent learning. Extensive simulation experiments targeting stationary targets demonstrate the rationality and effectiveness of the proposed method. After training, the intelligent agent provides different desired attack angles, and, based on the observation space, it can generate corresponding parameters. In some scenarios, the precision of hitting time surpasses that of fast iterative algorithms.

AB - Collaborative guidance technology is a crucial means to enhance the effectiveness of strikes. To achieve precise collaboration among multiple missiles targeting a common objective, this paper addresses the issue of inaccurate calculation of the virtual impact point control expected flight time resulting from the use of fast iterative algorithms. We propose a method based on proximal policy optimization to calculate the virtual impact point. A collaborative guidance model under multiple constraints is established, and a proximal policy optimization algorithm is applied to optimize the collaborative guidance law. The calculation parameters of the virtual impact point are treated as actions of an intelligent agent acting on the environment, with velocity, desired pitch angle, and position coordinates serving as the algorithm's observations. A reward function reflecting the collaborative time is constructed, establishing a multi-constraint collaborative guidance law based on intelligent learning. Extensive simulation experiments targeting stationary targets demonstrate the rationality and effectiveness of the proposed method. After training, the intelligent agent provides different desired attack angles, and, based on the observation space, it can generate corresponding parameters. In some scenarios, the precision of hitting time surpasses that of fast iterative algorithms.

KW - FOV constraint

KW - angle constraint

KW - cooperative guidance

KW - reinforcement learning

KW - three-dimensional guidance

KW - time constraint

UR - http://www.scopus.com/inward/record.url?scp=85200328097&partnerID=8YFLogxK

U2 - 10.1109/CCDC62350.2024.10587609

DO - 10.1109/CCDC62350.2024.10587609

M3 - 会议稿件

AN - SCOPUS:85200328097

T3 - Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024

SP - 807

EP - 812

BT - Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 36th Chinese Control and Decision Conference, CCDC 2024

Y2 - 25 May 2024 through 27 May 2024

ER -

Li X, Zhang H, Wang T, Li H, Zhou Y, Zhou D. Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization. In Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024. Institute of Electrical and Electronics Engineers Inc. 2024. p. 807-812. (Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024). doi: 10.1109/CCDC62350.2024.10587609

Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this