Optimal trajectory tracking control based on reinforcement learning for the deployment process of space tether system

Yiting Feng; Changqing Wang; Aijun Li

doi:10.1016/j.ifacol.2020.06.113

Optimal trajectory tracking control based on reinforcement learning for the deployment process of space tether system

Yiting Feng, Changqing Wang, Aijun Li

自动化学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 会议文章 › 同行评审

6 引用（Scopus）

摘要

Space tether system has a wide application prospect in space mission. Due to the characteristics of strong non-linearity and under-actuation, as well as the interference of complex space environment, it is difficult to model the tethered system accurately. Hence, the controller based on the parameters of the system model will cause large errors in the process of control. In this paper, an adaptive dynamic programming algorithm based on reinforcement learning theory is adopted. By training two Back Propagation (BP) neural networks, namely critic neural network (NN) and actor NN, the performance index function and control law of the system approach approximate optimal values respectively. The controller design is independent of the system model, so model-free control of the system is realized by implementing this control method. First, assuming that the out-of-plane motion of the system is stable, the optimal deployment trajectory of the tethered system is obtained by parameter optimization based on Nelder-Mead method. The optimal trajectory is taken as the nominal trajectory and the trajectory tracking is carried out by reinforcement learning controller. The simulation results show that the reinforcement learning algorithm has a good control effect on the in-plane trajectory tracking of the tethered system, which proves the feasibility and robustness of the control method.

源语言	英语
页（从-至）	679-684
页数	6
期刊	IFAC-PapersOnLine
卷	53
期	1
DOI	https://doi.org/10.1016/j.ifacol.2020.06.113
出版状态	已出版 - 2020
活动	6th Conference on Advances in Control and Optimization of Dynamical Systems, ACODS 2020 - Chennai, 印度期限: 16 2月 2020 → 19 2月 2020

访问文件

10.1016/j.ifacol.2020.06.113

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{12df7a55135a4d869db8395dee07cda2,

title = "Optimal trajectory tracking control based on reinforcement learning for the deployment process of space tether system",

abstract = "Space tether system has a wide application prospect in space mission. Due to the characteristics of strong non-linearity and under-actuation, as well as the interference of complex space environment, it is difficult to model the tethered system accurately. Hence, the controller based on the parameters of the system model will cause large errors in the process of control. In this paper, an adaptive dynamic programming algorithm based on reinforcement learning theory is adopted. By training two Back Propagation (BP) neural networks, namely critic neural network (NN) and actor NN, the performance index function and control law of the system approach approximate optimal values respectively. The controller design is independent of the system model, so model-free control of the system is realized by implementing this control method. First, assuming that the out-of-plane motion of the system is stable, the optimal deployment trajectory of the tethered system is obtained by parameter optimization based on Nelder-Mead method. The optimal trajectory is taken as the nominal trajectory and the trajectory tracking is carried out by reinforcement learning controller. The simulation results show that the reinforcement learning algorithm has a good control effect on the in-plane trajectory tracking of the tethered system, which proves the feasibility and robustness of the control method.",

keywords = "Adaptive dynamic programming, Deployment, Neural network, Reinforcement learning, Space tether system, Trajectory tracking",

author = "Yiting Feng and Changqing Wang and Aijun Li",

note = "Publisher Copyright: {\textcopyright} 2020, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.; 6th Conference on Advances in Control and Optimization of Dynamical Systems, ACODS 2020 ; Conference date: 16-02-2020 Through 19-02-2020",

year = "2020",

doi = "10.1016/j.ifacol.2020.06.113",

language = "英语",

volume = "53",

pages = "679--684",

journal = "IFAC-PapersOnLine",

issn = "2405-8963",

number = "1",

}

TY - JOUR

T1 - Optimal trajectory tracking control based on reinforcement learning for the deployment process of space tether system

AU - Feng, Yiting

AU - Wang, Changqing

AU - Li, Aijun

PY - 2020

Y1 - 2020

N2 - Space tether system has a wide application prospect in space mission. Due to the characteristics of strong non-linearity and under-actuation, as well as the interference of complex space environment, it is difficult to model the tethered system accurately. Hence, the controller based on the parameters of the system model will cause large errors in the process of control. In this paper, an adaptive dynamic programming algorithm based on reinforcement learning theory is adopted. By training two Back Propagation (BP) neural networks, namely critic neural network (NN) and actor NN, the performance index function and control law of the system approach approximate optimal values respectively. The controller design is independent of the system model, so model-free control of the system is realized by implementing this control method. First, assuming that the out-of-plane motion of the system is stable, the optimal deployment trajectory of the tethered system is obtained by parameter optimization based on Nelder-Mead method. The optimal trajectory is taken as the nominal trajectory and the trajectory tracking is carried out by reinforcement learning controller. The simulation results show that the reinforcement learning algorithm has a good control effect on the in-plane trajectory tracking of the tethered system, which proves the feasibility and robustness of the control method.

AB - Space tether system has a wide application prospect in space mission. Due to the characteristics of strong non-linearity and under-actuation, as well as the interference of complex space environment, it is difficult to model the tethered system accurately. Hence, the controller based on the parameters of the system model will cause large errors in the process of control. In this paper, an adaptive dynamic programming algorithm based on reinforcement learning theory is adopted. By training two Back Propagation (BP) neural networks, namely critic neural network (NN) and actor NN, the performance index function and control law of the system approach approximate optimal values respectively. The controller design is independent of the system model, so model-free control of the system is realized by implementing this control method. First, assuming that the out-of-plane motion of the system is stable, the optimal deployment trajectory of the tethered system is obtained by parameter optimization based on Nelder-Mead method. The optimal trajectory is taken as the nominal trajectory and the trajectory tracking is carried out by reinforcement learning controller. The simulation results show that the reinforcement learning algorithm has a good control effect on the in-plane trajectory tracking of the tethered system, which proves the feasibility and robustness of the control method.

KW - Adaptive dynamic programming

KW - Deployment

KW - Neural network

KW - Reinforcement learning

KW - Space tether system

KW - Trajectory tracking

UR - http://www.scopus.com/inward/record.url?scp=85092451478&partnerID=8YFLogxK

U2 - 10.1016/j.ifacol.2020.06.113

DO - 10.1016/j.ifacol.2020.06.113

M3 - 会议文章

AN - SCOPUS:85092451478

SN - 2405-8963

VL - 53

SP - 679

EP - 684

JO - IFAC-PapersOnLine

JF - IFAC-PapersOnLine

IS - 1

T2 - 6th Conference on Advances in Control and Optimization of Dynamical Systems, ACODS 2020

Y2 - 16 February 2020 through 19 February 2020

ER -

Optimal trajectory tracking control based on reinforcement learning for the deployment process of space tether system

摘要

访问文件

其它文件与链接

指纹

引用此