Collaborative Guidance Algorithm Based on Offline Pre-training and Online Reinforcement Learning

Zhenrui Lv, Yifan Hu, Zijing Tian, Bin Fu, Hongguang Ren, Wenxing Fu

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

Abstract

Existing collaborative guidance laws commonly assume small-angle relationships and neglect the higher-order terms in the expansion of the remaining time (time-to-go). To address these limitations, this paper proposes a guidance law structure that combines a traditional guidance law with a collaborative correction term, and trains the correction term with reinforcement learning. The paper also constructs a guided pre-training algorithm based on offline reinforcement learning, combined with the twin delayed deep deterministic policy gradient (TD3) algorithm. Delayed updates and comparison between critics make the learning and training iterations fast and efficient, and effectively address the overestimation of action values that arises during reinforcement learning. Simulation results show that the reinforcement learning collaborative guidance law trained with the designed framework has wider applicability and higher time-coordination accuracy.
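The two mechanisms named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the discount factor, the delay of 2, and the choice of proportional navigation as the "traditional guidance law" are all assumptions made for illustration, since the abstract does not specify them.

```python
def td3_target(reward, done, q1_next, q2_next, gamma=0.99):
    """Critic comparison: bootstrap from the smaller of two target-critic estimates.

    Taking the minimum counteracts the overestimation of action values.
    gamma=0.99 is an assumed default, not a value from the paper.
    """
    return reward + gamma * (1.0 - float(done)) * min(q1_next, q2_next)


def should_update_actor(step, policy_delay=2):
    """Delayed update: refresh the actor only once every `policy_delay` critic updates.

    policy_delay=2 is an assumed default, not a value from the paper.
    """
    return step % policy_delay == 0


def guidance_command(closing_speed, los_rate, correction, nav_gain=3.0):
    """Baseline guidance law plus a learned collaborative correction term.

    Proportional navigation (nav_gain * closing_speed * los_rate) stands in
    for the unspecified traditional law; `correction` would be the output of
    the trained policy.
    """
    return nav_gain * closing_speed * los_rate + correction
```

For instance, `td3_target(1.0, False, 5.0, 3.0, gamma=0.9)` bootstraps from the smaller estimate 3.0 rather than 5.0, which is how the comparison between critics limits overestimation.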

Original language: English
Title of host publication: Proceedings of 4th 2024 International Conference on Autonomous Unmanned Systems, 4th ICAUS 2024
Editors: Lianqing Liu, Yifeng Niu, Wenxing Fu, Yi Qu
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 443-453
Number of pages: 11
ISBN (Print): 9789819635672
Publication status: Published - 2025
Event: 4th International Conference on Autonomous Unmanned Systems, ICAUS 2024 - Shenyang, China
Duration: 19 Sep 2024 to 21 Sep 2024

Publication series

Name: Lecture Notes in Electrical Engineering
Volume: 1377 LNEE
ISSN (Print): 1876-1100
ISSN (Electronic): 1876-1119

Conference

Conference: 4th International Conference on Autonomous Unmanned Systems, ICAUS 2024
Country/Territory: China
City: Shenyang
Period: 19/09/24 to 21/09/24
