TY - JOUR
T1 - Hierarchical dynamic graphical games for optimal leader-follower consensus control
AU - Long, Jia
AU - Xu, Tao
AU - Yu, Dengxiu
AU - Chen, C. L.Philip
N1 - Publisher Copyright:
© 2025 The Franklin Institute
PY - 2025/10/1
Y1 - 2025/10/1
N2 - In this paper, a hierarchical dynamic graphical game (HDGG) model is proposed to extend the existing optimal leader-follower consensus control framework for a class of linear stochastic systems. Owing to the interaction relationships and communication constraints among agents determined by the communication topology, the multi-agent system naturally exhibits a multi-stage hierarchical decision-making mechanism that relies solely on the local perceptions available to each agent. The proposed model introduces a multi-layer distributed structure through sequential decision-making, which accounts for both in-degree and out-degree tracking errors in a generalized leader-follower sense. Following the bottom-up principle, hierarchical and simultaneous optimal control policies for each agent are derived from the bottom to the top in a dynamic stochastic environment. A corresponding policy gradient algorithm is developed for the implementation of HDGG under both known and unknown parameter settings, and its convergence and optimality are guaranteed under certain conditions. Furthermore, the control policies are shown to constitute a Stackelberg-Nash equilibrium owing to the unique solution of the corresponding Riccati equation. Finally, an illustrative example is provided to demonstrate the efficacy of the proposed algorithm in both model-based and model-free settings.
AB - In this paper, a hierarchical dynamic graphical game (HDGG) model is proposed to extend the existing optimal leader-follower consensus control framework for a class of linear stochastic systems. Owing to the interaction relationships and communication constraints among agents determined by the communication topology, the multi-agent system naturally exhibits a multi-stage hierarchical decision-making mechanism that relies solely on the local perceptions available to each agent. The proposed model introduces a multi-layer distributed structure through sequential decision-making, which accounts for both in-degree and out-degree tracking errors in a generalized leader-follower sense. Following the bottom-up principle, hierarchical and simultaneous optimal control policies for each agent are derived from the bottom to the top in a dynamic stochastic environment. A corresponding policy gradient algorithm is developed for the implementation of HDGG under both known and unknown parameter settings, and its convergence and optimality are guaranteed under certain conditions. Furthermore, the control policies are shown to constitute a Stackelberg-Nash equilibrium owing to the unique solution of the corresponding Riccati equation. Finally, an illustrative example is provided to demonstrate the efficacy of the proposed algorithm in both model-based and model-free settings.
KW - Generalized leader-follower framework
KW - Hierarchical dynamic graphical game
KW - Optimal consensus control
KW - Stackelberg-Nash equilibrium
UR - https://www.scopus.com/pages/publications/105015036059
U2 - 10.1016/j.jfranklin.2025.108015
DO - 10.1016/j.jfranklin.2025.108015
M3 - 文章
AN - SCOPUS:105015036059
SN - 0016-0032
VL - 362
JO - Journal of the Franklin Institute
JF - Journal of the Franklin Institute
IS - 15
M1 - 108015
ER -