Collaborative Guidance Algorithm Based on Offline Pre-training and Online Reinforcement Learning

Zhenrui Lv, Yifan Hu, Zijing Tian, Bin Fu, Hongguang Ren, Wenxing Fu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In response to the common assumption of small angle relationships in existing collaborative guidance laws and the neglect of high-order terms in the remaining time expansion, this paper proposes a guidance law structure based on a combination of traditional guidance laws and collaborative correction terms, and uses reinforcement learning methods to train the correction terms. This article also constructs a guided pre training algorithm based on offline reinforcement learning algorithms, combined with the dual delay deep deterministic policy gradient algorithm. Through methods such as delayed updates and critical comparison, fast and efficient learning and training iterations are carried out, effectively solving the problem of overestimation of actions and policies in the reinforcement learning process. The simulation results show that the reinforcement learning collaborative guidance law trained by the designed framework has obvious advantages of wider applicability and higher time collaboration accuracy.

Original languageEnglish
Title of host publicationProceedings of 4th 2024 International Conference on Autonomous Unmanned Systems, 4th ICAUS 2024
EditorsLianqing Liu, Yifeng Niu, Wenxing Fu, Yi Qu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages443-453
Number of pages11
ISBN (Print)9789819635672
DOIs
StatePublished - 2025
Event4th International Conference on Autonomous Unmanned Systems, ICAUS 2024 - Shenyang, China
Duration: 19 Sep 202421 Sep 2024

Publication series

NameLecture Notes in Electrical Engineering
Volume1377 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference4th International Conference on Autonomous Unmanned Systems, ICAUS 2024
Country/TerritoryChina
CityShenyang
Period19/09/2421/09/24

Keywords

  • Collaborative guidance
  • Reinforcement learning
  • Time collaboration

Fingerprint

Dive into the research topics of 'Collaborative Guidance Algorithm Based on Offline Pre-training and Online Reinforcement Learning'. Together they form a unique fingerprint.

Cite this