Cooperative Differential Graphical Game Theoretic for Tracking Control of Nonlinear Multi-Agent Systems With Unknown Dynamics

Yaning Guo, Quan Pan, Penglin Hu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper investigates the cooperative tracking control problem of networked nonlinear multi-agent systems (MASs) with completely unknown dynamics. By formulating the optimal tracking control problem into a cooperative differential graphical game, we can employ the off-policy integral reinforcement learning (IRL) scheme to find optimal tracking controllers online along with the system trajectories without requiring the knowledge of the system dynamics. In contrast to the existing literature where the Nash equilibrium is utilized to characterize the performance of the designed controllers for the cooperative control of MASs, we introduce a new solution concept regarded as Pareto optimality strategies which devote to minimize performance cost and risk of all agents simultaneously. A simulation example is presented to verify the effectiveness of the proposed approach.

Original languageEnglish
Title of host publicationProceeding - 2021 China Automation Congress, CAC 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3491-3496
Number of pages6
ISBN (Electronic)9781665426473
DOIs
StatePublished - 2021
Event2021 China Automation Congress, CAC 2021 - Beijing, China
Duration: 22 Oct 202124 Oct 2021

Publication series

NameProceeding - 2021 China Automation Congress, CAC 2021

Conference

Conference2021 China Automation Congress, CAC 2021
Country/TerritoryChina
CityBeijing
Period22/10/2124/10/21

Keywords

  • cooperative differential graphical games
  • Multi-agent systems (MASs)
  • Pareto optimality
  • reinforcement learning
  • tracking control

Fingerprint

Dive into the research topics of 'Cooperative Differential Graphical Game Theoretic for Tracking Control of Nonlinear Multi-Agent Systems With Unknown Dynamics'. Together they form a unique fingerprint.

Cite this