TY - GEN
T1 - Three-Dimensional Cooperative Guidance with Multiple Constraints Based on Proximal Policy Optimization
AU - Li, Xiaoyang
AU - Zhang, Hairuo
AU - Wang, Teng
AU - Li, Haonan
AU - Zhou, Ying
AU - Zhou, Deyun
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Collaborative guidance technology is a crucial means to enhance the effectiveness of strikes. To achieve precise collaboration among multiple missiles targeting a common objective, this paper addresses the issue of inaccurate calculation of the virtual impact point control expected flight time resulting from the use of fast iterative algorithms. We propose a method based on proximal policy optimization to calculate the virtual impact point. A collaborative guidance model under multiple constraints is established, and a proximal policy optimization algorithm is applied to optimize the collaborative guidance law. The calculation parameters of the virtual impact point are treated as actions of an intelligent agent acting on the environment, with velocity, desired pitch angle, and position coordinates serving as the algorithm's observations. A reward function reflecting the collaborative time is constructed, establishing a multi-constraint collaborative guidance law based on intelligent learning. Extensive simulation experiments targeting stationary targets demonstrate the rationality and effectiveness of the proposed method. After training, the intelligent agent provides different desired attack angles, and, based on the observation space, it can generate corresponding parameters. In some scenarios, the precision of hitting time surpasses that of fast iterative algorithms.
AB - Collaborative guidance technology is a crucial means to enhance the effectiveness of strikes. To achieve precise collaboration among multiple missiles targeting a common objective, this paper addresses the issue of inaccurate calculation of the virtual impact point control expected flight time resulting from the use of fast iterative algorithms. We propose a method based on proximal policy optimization to calculate the virtual impact point. A collaborative guidance model under multiple constraints is established, and a proximal policy optimization algorithm is applied to optimize the collaborative guidance law. The calculation parameters of the virtual impact point are treated as actions of an intelligent agent acting on the environment, with velocity, desired pitch angle, and position coordinates serving as the algorithm's observations. A reward function reflecting the collaborative time is constructed, establishing a multi-constraint collaborative guidance law based on intelligent learning. Extensive simulation experiments targeting stationary targets demonstrate the rationality and effectiveness of the proposed method. After training, the intelligent agent provides different desired attack angles, and, based on the observation space, it can generate corresponding parameters. In some scenarios, the precision of hitting time surpasses that of fast iterative algorithms.
KW - FOV constraint
KW - angle constraint
KW - cooperative guidance
KW - reinforcement learning
KW - three-dimensional guidance
KW - time constraint
UR - http://www.scopus.com/inward/record.url?scp=85200328097&partnerID=8YFLogxK
U2 - 10.1109/CCDC62350.2024.10587609
DO - 10.1109/CCDC62350.2024.10587609
M3 - 会议稿件
AN - SCOPUS:85200328097
T3 - Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024
SP - 807
EP - 812
BT - Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 36th Chinese Control and Decision Conference, CCDC 2024
Y2 - 25 May 2024 through 27 May 2024
ER -