TY - JOUR
T1 - Actor-critic-disturbance reinforcement learning algorithm-based fast finite-time stability of multiagent systems
AU - Zhao, Junsheng
AU - Gu, Yaqi
AU - Xie, Xiangpeng
AU - Yu, Dengxiu
N1 - Publisher Copyright:
© 2024 Elsevier Inc.
PY - 2025/5
Y1 - 2025/5
N2 - This paper proposes an actor-critic-disturbance (ACD) reinforcement learning algorithm for the fast finite-time stabilization of multiagent systems (MASs) with time-varying asymmetrical constraints. First, a barrier function is designed to transform the constrained system into an unconstrained one. Notably, the adaptive control strategy discussed in this paper can handle more general dynamic constraints than most existing literature. Second, for scenarios in which the disturbance affects the system in the worst way, an H∞ optimal control strategy based on the ACD reinforcement learning algorithm is proposed to enhance the robustness of the system and minimize the influence of disturbances. Third, fast finite-time theory is integrated into the optimal control protocol for MASs, which allows the system to complete the control objective in finite time with faster convergence. Finally, numerical and practical simulation examples confirm the validity of the theoretical results.
AB - This paper proposes an actor-critic-disturbance (ACD) reinforcement learning algorithm for the fast finite-time stabilization of multiagent systems (MASs) with time-varying asymmetrical constraints. First, a barrier function is designed to transform the constrained system into an unconstrained one. Notably, the adaptive control strategy discussed in this paper can handle more general dynamic constraints than most existing literature. Second, for scenarios in which the disturbance affects the system in the worst way, an H∞ optimal control strategy based on the ACD reinforcement learning algorithm is proposed to enhance the robustness of the system and minimize the influence of disturbances. Third, fast finite-time theory is integrated into the optimal control protocol for MASs, which allows the system to complete the control objective in finite time with faster convergence. Finally, numerical and practical simulation examples confirm the validity of the theoretical results.
KW - Actor-critic-disturbance reinforcement learning
KW - Fast finite-time stabilization
KW - Time-varying asymmetrical constraint
UR - http://www.scopus.com/inward/record.url?scp=85213250617&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2024.121802
DO - 10.1016/j.ins.2024.121802
M3 - Article
AN - SCOPUS:85213250617
SN - 0020-0255
VL - 699
JO - Information Sciences
JF - Information Sciences
M1 - 121802
ER -