Reinforcement Learning-Based 3-D Sliding Mode Interception Guidance via Proximal Policy Optimization

Jianguo Guo, Mengxuan Li, Zongyi Guo, Zhiyong She

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

This article proposes a novel 3-D sliding mode interception guidance law for maneuvering targets that exploits reinforcement learning (RL) to enhance guidance accuracy and reduce chattering. The problem of intercepting a maneuvering target is abstracted into a Markov decision process whose reward function is designed to estimate the miss distance and the line-of-sight angular-rate chattering. Importantly, this yields a reward-function design framework suitable for general RL-based guidance problems. The proximal policy optimization algorithm, chosen for its satisfactory training performance, is then introduced to learn an action policy that maps the observed engagement states to sliding mode interception guidance commands. Finally, numerical simulations and comparisons demonstrate the effectiveness of the proposed guidance law.
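The reward shaping described in the abstract (jointly penalizing the estimated miss distance and line-of-sight angular-rate chattering) could be sketched as below. This is a minimal illustration, not the paper's exact formulation: the weights, the use of the LOS-rate norm as a miss-distance proxy, and a chattering term built from successive guidance-command differences are all assumptions made here for concreteness.

```python
import numpy as np

def step_reward(los_rate, accel_cmd, prev_accel_cmd,
                w_rate=1.0, w_chatter=0.1):
    """Per-step reward for an interception MDP (illustrative sketch).

    los_rate       : line-of-sight angular rate vector; its magnitude serves
                     as a proxy for the eventual miss distance (a vanishing
                     LOS rate corresponds to a collision course).
    accel_cmd      : current guidance acceleration command.
    prev_accel_cmd : previous command; successive differences measure
                     chattering of the sliding mode guidance output.
    """
    rate_penalty = w_rate * np.linalg.norm(los_rate)
    chatter_penalty = w_chatter * np.linalg.norm(
        np.asarray(accel_cmd) - np.asarray(prev_accel_cmd))
    # Reward is maximal (zero) on a chatter-free collision course.
    return -(rate_penalty + chatter_penalty)
```

A PPO agent would maximize the discounted sum of such rewards, so the learned policy is pushed toward engagements with small LOS angular rates and smooth commands.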

Original language: English
Pages (from-to): 423-430
Number of pages: 8
Journal: IEEE Journal on Miniaturization for Air and Space Systems
Volume: 4
Issue number: 4
DOIs
State: Published - 1 Dec 2023

Keywords

  • 3-D
  • guidance law
  • proximal policy optimization (PPO)
  • reinforcement learning (RL)
  • sliding mode control
