A Deep Reinforcement Learning Based Leader-Follower Control Policy for Swarm Systems

Di Cui; Huiping Li; Rizhong Wang

doi:10.1007/978-981-19-8915-5_23

A Deep Reinforcement Learning Based Leader-Follower Control Policy for Swarm Systems

Di Cui, Huiping Li, Rizhong Wang

School of Marine Science and Technology

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

This paper is concerned with the learning-based control problem for large-scale robotic swarm systems, which makes the single leader able to herd the follower swarm systems to form a target distribution. We use the mean-field model to describe the spatio-temporal evolution of the probability density of the follower swarm, under which the physical space is divided into several bins and the leader control policy only depends on the density distribution over these bins. Therefore, the designed control policy is free from the computation issue raised by the large number of follower agents N. A deep reinforcement learning (DRL) algorithm is designed here to learn the leader control policy and accommodate the variation of the follower density. It is verified that the proposed control policy is much more efficient than existing results in terms of control performance and training time.

Original language	English
Title of host publication	Intelligent Networked Things - 5th China Conference, CINT 2022, Revised Selected Papers
Editors	Lin Zhang, Wensheng Yu, Haijun Jiang, Yuanjun Laili
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	269-280
Number of pages	12
ISBN (Print)	9789811989148
DOIs	https://doi.org/10.1007/978-981-19-8915-5_23
State	Published - 2022
Event	5th China Conference on Intelligent Networked Things, CINT 2022 - Virtual, Online Duration: 7 Aug 2022 → 8 Aug 2022

Publication series

Name	Communications in Computer and Information Science
Volume	1714 CCIS
ISSN (Print)	1865-0929
ISSN (Electronic)	1865-0937

Conference

Conference	5th China Conference on Intelligent Networked Things, CINT 2022
City	Virtual, Online
Period	7/08/22 → 8/08/22

Keywords

Deep reinforcement learning (DRL)
Leader-follower control
Mean-field model
Swarm systems

Access to Document

10.1007/978-981-19-8915-5_23

Cite this

Cui, D., Li, H., & Wang, R. (2022). A Deep Reinforcement Learning Based Leader-Follower Control Policy for Swarm Systems. In L. Zhang, W. Yu, H. Jiang, & Y. Laili (Eds.), Intelligent Networked Things - 5th China Conference, CINT 2022, Revised Selected Papers (pp. 269-280). (Communications in Computer and Information Science; Vol. 1714 CCIS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-19-8915-5_23

Cui, Di ; Li, Huiping ; Wang, Rizhong. / A Deep Reinforcement Learning Based Leader-Follower Control Policy for Swarm Systems. Intelligent Networked Things - 5th China Conference, CINT 2022, Revised Selected Papers. editor / Lin Zhang ; Wensheng Yu ; Haijun Jiang ; Yuanjun Laili. Springer Science and Business Media Deutschland GmbH, 2022. pp. 269-280 (Communications in Computer and Information Science).

@inproceedings{7ded30f738a445b7b26bfd5c4aaf92c8,

title = "A Deep Reinforcement Learning Based Leader-Follower Control Policy for Swarm Systems",

abstract = "This paper is concerned with the learning-based control problem for large-scale robotic swarm systems, which makes the single leader able to herd the follower swarm systems to form a target distribution. We use the mean-field model to describe the spatio-temporal evolution of the probability density of the follower swarm, under which the physical space is divided into several bins and the leader control policy only depends on the density distribution over these bins. Therefore, the designed control policy is free from the computation issue raised by the large number of follower agents N. A deep reinforcement learning (DRL) algorithm is designed here to learn the leader control policy and accommodate the variation of the follower density. It is verified that the proposed control policy is much more efficient than existing results in terms of control performance and training time.",

keywords = "Deep reinforcement learning (DRL), Leader-follower control, Mean-field model, Swarm systems",

author = "Di Cui and Huiping Li and Rizhong Wang",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.; 5th China Conference on Intelligent Networked Things, CINT 2022 ; Conference date: 07-08-2022 Through 08-08-2022",

year = "2022",

doi = "10.1007/978-981-19-8915-5_23",

language = "英语",

isbn = "9789811989148",

series = "Communications in Computer and Information Science",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "269--280",

editor = "Lin Zhang and Wensheng Yu and Haijun Jiang and Yuanjun Laili",

booktitle = "Intelligent Networked Things - 5th China Conference, CINT 2022, Revised Selected Papers",

}

Cui, D, Li, H & Wang, R 2022, A Deep Reinforcement Learning Based Leader-Follower Control Policy for Swarm Systems. in L Zhang, W Yu, H Jiang & Y Laili (eds), Intelligent Networked Things - 5th China Conference, CINT 2022, Revised Selected Papers. Communications in Computer and Information Science, vol. 1714 CCIS, Springer Science and Business Media Deutschland GmbH, pp. 269-280, 5th China Conference on Intelligent Networked Things, CINT 2022, Virtual, Online, 7/08/22. https://doi.org/10.1007/978-981-19-8915-5_23

A Deep Reinforcement Learning Based Leader-Follower Control Policy for Swarm Systems. / Cui, Di; Li, Huiping; Wang, Rizhong.
Intelligent Networked Things - 5th China Conference, CINT 2022, Revised Selected Papers. ed. / Lin Zhang; Wensheng Yu; Haijun Jiang; Yuanjun Laili. Springer Science and Business Media Deutschland GmbH, 2022. p. 269-280 (Communications in Computer and Information Science; Vol. 1714 CCIS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - A Deep Reinforcement Learning Based Leader-Follower Control Policy for Swarm Systems

AU - Cui, Di

AU - Li, Huiping

AU - Wang, Rizhong

PY - 2022

Y1 - 2022

N2 - This paper is concerned with the learning-based control problem for large-scale robotic swarm systems, which makes the single leader able to herd the follower swarm systems to form a target distribution. We use the mean-field model to describe the spatio-temporal evolution of the probability density of the follower swarm, under which the physical space is divided into several bins and the leader control policy only depends on the density distribution over these bins. Therefore, the designed control policy is free from the computation issue raised by the large number of follower agents N. A deep reinforcement learning (DRL) algorithm is designed here to learn the leader control policy and accommodate the variation of the follower density. It is verified that the proposed control policy is much more efficient than existing results in terms of control performance and training time.

AB - This paper is concerned with the learning-based control problem for large-scale robotic swarm systems, which makes the single leader able to herd the follower swarm systems to form a target distribution. We use the mean-field model to describe the spatio-temporal evolution of the probability density of the follower swarm, under which the physical space is divided into several bins and the leader control policy only depends on the density distribution over these bins. Therefore, the designed control policy is free from the computation issue raised by the large number of follower agents N. A deep reinforcement learning (DRL) algorithm is designed here to learn the leader control policy and accommodate the variation of the follower density. It is verified that the proposed control policy is much more efficient than existing results in terms of control performance and training time.

KW - Deep reinforcement learning (DRL)

KW - Leader-follower control

KW - Mean-field model

KW - Swarm systems

UR - http://www.scopus.com/inward/record.url?scp=85148039207&partnerID=8YFLogxK

U2 - 10.1007/978-981-19-8915-5_23

DO - 10.1007/978-981-19-8915-5_23

M3 - 会议稿件

AN - SCOPUS:85148039207

SN - 9789811989148

T3 - Communications in Computer and Information Science

SP - 269

EP - 280

BT - Intelligent Networked Things - 5th China Conference, CINT 2022, Revised Selected Papers

A2 - Zhang, Lin

A2 - Yu, Wensheng

A2 - Jiang, Haijun

A2 - Laili, Yuanjun

PB - Springer Science and Business Media Deutschland GmbH

T2 - 5th China Conference on Intelligent Networked Things, CINT 2022

Y2 - 7 August 2022 through 8 August 2022

ER -

Cui D, Li H, Wang R. A Deep Reinforcement Learning Based Leader-Follower Control Policy for Swarm Systems. In Zhang L, Yu W, Jiang H, Laili Y, editors, Intelligent Networked Things - 5th China Conference, CINT 2022, Revised Selected Papers. Springer Science and Business Media Deutschland GmbH. 2022. p. 269-280. (Communications in Computer and Information Science). doi: 10.1007/978-981-19-8915-5_23