Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning

Jie Lan; Yan Jun Liu; Dengxiu Yu; Guoxing Wen; Shaocheng Tong; Lei Liu

doi:10.1109/TNNLS.2022.3158085

Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning

Jie Lan, Yan Jun Liu, Dengxiu Yu, Guoxing Wen, Shaocheng Tong, Lei Liu

光电与智能研究院

科研成果: 期刊稿件 › 文章 › 同行评审

48 引用（Scopus）

摘要

This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control. Furthermore, focusing on Hamilton-Jacobi-Bellman optimization, it is almost incapable of directly dealing with unknown equations. Above uncertainty and immeasurability processed by adaptive state observer and NN simplified RL are further designed to achieve desired second-order formation configuration at the least cost. The optimization protocol can not only solve the undetectable states and realize the prescribed time-varying formation performance on the premise that all the errors are SGUUB, but also prove the stability and update the critics and actors easily. Through the above-mentioned approaches offer an optimal control scheme to address time-varying formation control. Finally, the validity of the theoretical method is proven by the Lyapunov stability theory and digital simulation.

源语言	英语
页（从-至）	3144-3155
页数	12
期刊	IEEE Transactions on Neural Networks and Learning Systems
卷	35
期	3
DOI	https://doi.org/10.1109/TNNLS.2022.3158085
出版状态	已出版 - 1 3月 2024

访问文件

10.1109/TNNLS.2022.3158085

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{4517919c07a0479e8a87bb6f5538c320,

title = "Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning",

abstract = "This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control. Furthermore, focusing on Hamilton-Jacobi-Bellman optimization, it is almost incapable of directly dealing with unknown equations. Above uncertainty and immeasurability processed by adaptive state observer and NN simplified RL are further designed to achieve desired second-order formation configuration at the least cost. The optimization protocol can not only solve the undetectable states and realize the prescribed time-varying formation performance on the premise that all the errors are SGUUB, but also prove the stability and update the critics and actors easily. Through the above-mentioned approaches offer an optimal control scheme to address time-varying formation control. Finally, the validity of the theoretical method is proven by the Lyapunov stability theory and digital simulation.",

keywords = "Adaptive neural network (NN) observer, optimal formation control, reinforcement learning (RL), second-order time-varying formation",

author = "Jie Lan and Liu, {Yan Jun} and Dengxiu Yu and Guoxing Wen and Shaocheng Tong and Lei Liu",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2024",

month = mar,

day = "1",

doi = "10.1109/TNNLS.2022.3158085",

language = "英语",

volume = "35",

pages = "3144--3155",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "3",

}

TY - JOUR

T1 - Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning

AU - Lan, Jie

AU - Liu, Yan Jun

AU - Yu, Dengxiu

AU - Wen, Guoxing

AU - Tong, Shaocheng

AU - Liu, Lei

PY - 2024/3/1

Y1 - 2024/3/1

N2 - This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control. Furthermore, focusing on Hamilton-Jacobi-Bellman optimization, it is almost incapable of directly dealing with unknown equations. Above uncertainty and immeasurability processed by adaptive state observer and NN simplified RL are further designed to achieve desired second-order formation configuration at the least cost. The optimization protocol can not only solve the undetectable states and realize the prescribed time-varying formation performance on the premise that all the errors are SGUUB, but also prove the stability and update the critics and actors easily. Through the above-mentioned approaches offer an optimal control scheme to address time-varying formation control. Finally, the validity of the theoretical method is proven by the Lyapunov stability theory and digital simulation.

AB - This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control. Furthermore, focusing on Hamilton-Jacobi-Bellman optimization, it is almost incapable of directly dealing with unknown equations. Above uncertainty and immeasurability processed by adaptive state observer and NN simplified RL are further designed to achieve desired second-order formation configuration at the least cost. The optimization protocol can not only solve the undetectable states and realize the prescribed time-varying formation performance on the premise that all the errors are SGUUB, but also prove the stability and update the critics and actors easily. Through the above-mentioned approaches offer an optimal control scheme to address time-varying formation control. Finally, the validity of the theoretical method is proven by the Lyapunov stability theory and digital simulation.

KW - Adaptive neural network (NN) observer

KW - optimal formation control

KW - reinforcement learning (RL)

KW - second-order time-varying formation

UR - http://www.scopus.com/inward/record.url?scp=85128605811&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2022.3158085

DO - 10.1109/TNNLS.2022.3158085

M3 - 文章

C2 - 35417354

AN - SCOPUS:85128605811

SN - 2162-237X

VL - 35

SP - 3144

EP - 3155

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 3

ER -

Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning

摘要

访问文件

其它文件与链接

指纹

引用此