Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning

Jie Lan; Yan Jun Liu; Dengxiu Yu; Guoxing Wen; Shaocheng Tong; Lei Liu

doi:10.1109/TNNLS.2022.3158085

Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning

Jie Lan, Yan Jun Liu, Dengxiu Yu, Guoxing Wen, Shaocheng Tong, Lei Liu

School of Artificial Intelligence, OPtics and Electronics

Research output: Contribution to journal › Article › peer-review

48 Scopus citations

Abstract

This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control. Furthermore, focusing on Hamilton-Jacobi-Bellman optimization, it is almost incapable of directly dealing with unknown equations. Above uncertainty and immeasurability processed by adaptive state observer and NN simplified RL are further designed to achieve desired second-order formation configuration at the least cost. The optimization protocol can not only solve the undetectable states and realize the prescribed time-varying formation performance on the premise that all the errors are SGUUB, but also prove the stability and update the critics and actors easily. Through the above-mentioned approaches offer an optimal control scheme to address time-varying formation control. Finally, the validity of the theoretical method is proven by the Lyapunov stability theory and digital simulation.

Original language	English
Pages (from-to)	3144-3155
Number of pages	12
Journal	IEEE Transactions on Neural Networks and Learning Systems
Volume	35
Issue number	3
DOIs	https://doi.org/10.1109/TNNLS.2022.3158085
State	Published - 1 Mar 2024

Keywords

Adaptive neural network (NN) observer
optimal formation control
reinforcement learning (RL)
second-order time-varying formation

Access to Document

10.1109/TNNLS.2022.3158085

Cite this

@article{4517919c07a0479e8a87bb6f5538c320,

title = "Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning",

abstract = "This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control. Furthermore, focusing on Hamilton-Jacobi-Bellman optimization, it is almost incapable of directly dealing with unknown equations. Above uncertainty and immeasurability processed by adaptive state observer and NN simplified RL are further designed to achieve desired second-order formation configuration at the least cost. The optimization protocol can not only solve the undetectable states and realize the prescribed time-varying formation performance on the premise that all the errors are SGUUB, but also prove the stability and update the critics and actors easily. Through the above-mentioned approaches offer an optimal control scheme to address time-varying formation control. Finally, the validity of the theoretical method is proven by the Lyapunov stability theory and digital simulation.",

keywords = "Adaptive neural network (NN) observer, optimal formation control, reinforcement learning (RL), second-order time-varying formation",

author = "Jie Lan and Liu, {Yan Jun} and Dengxiu Yu and Guoxing Wen and Shaocheng Tong and Lei Liu",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2024",

month = mar,

day = "1",

doi = "10.1109/TNNLS.2022.3158085",

language = "英语",

volume = "35",

pages = "3144--3155",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "3",

}

TY - JOUR

T1 - Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning

AU - Lan, Jie

AU - Liu, Yan Jun

AU - Yu, Dengxiu

AU - Wen, Guoxing

AU - Tong, Shaocheng

AU - Liu, Lei

PY - 2024/3/1

Y1 - 2024/3/1

N2 - This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control. Furthermore, focusing on Hamilton-Jacobi-Bellman optimization, it is almost incapable of directly dealing with unknown equations. Above uncertainty and immeasurability processed by adaptive state observer and NN simplified RL are further designed to achieve desired second-order formation configuration at the least cost. The optimization protocol can not only solve the undetectable states and realize the prescribed time-varying formation performance on the premise that all the errors are SGUUB, but also prove the stability and update the critics and actors easily. Through the above-mentioned approaches offer an optimal control scheme to address time-varying formation control. Finally, the validity of the theoretical method is proven by the Lyapunov stability theory and digital simulation.

AB - This article addresses a distributed time-varying optimal formation protocol for a class of second-order uncertain nonlinear dynamic multiagent systems (MASs) based on an adaptive neural network (NN) state observer through the backstepping method and simplified reinforcement learning (RL). Each follower agent is subjected to only local information and measurable partial states due to actual sensor limitations. In view of the distributed optimized formation strategic needs, the uncertain nonlinear dynamics and undetectable states may jointly affect the stability of the time-varying cooperative formation control. Furthermore, focusing on Hamilton-Jacobi-Bellman optimization, it is almost incapable of directly dealing with unknown equations. Above uncertainty and immeasurability processed by adaptive state observer and NN simplified RL are further designed to achieve desired second-order formation configuration at the least cost. The optimization protocol can not only solve the undetectable states and realize the prescribed time-varying formation performance on the premise that all the errors are SGUUB, but also prove the stability and update the critics and actors easily. Through the above-mentioned approaches offer an optimal control scheme to address time-varying formation control. Finally, the validity of the theoretical method is proven by the Lyapunov stability theory and digital simulation.

KW - Adaptive neural network (NN) observer

KW - optimal formation control

KW - reinforcement learning (RL)

KW - second-order time-varying formation

UR - http://www.scopus.com/inward/record.url?scp=85128605811&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2022.3158085

DO - 10.1109/TNNLS.2022.3158085

M3 - 文章

C2 - 35417354

AN - SCOPUS:85128605811

SN - 2162-237X

VL - 35

SP - 3144

EP - 3155

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 3

ER -

Time-Varying Optimal Formation Control for Second-Order Multiagent Systems Based on Neural Network Observer and Reinforcement Learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this