Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning

Yao Zhang; Zhiwen Yu; Jun Zhang; Liang Wang; Tom H. Luan; Bin Guo; Chau Yuen

doi:10.1109/TMC.2023.3332081

Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning

Yao Zhang, Zhiwen Yu, Jun Zhang, Liang Wang, Tom H. Luan, Bin Guo, Chau Yuen

School of Computer Science

Research output: Contribution to journal › Article › peer-review

1 Scopus citations

Abstract

This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms.

Original language	English
Pages (from-to)	7180-7195
Number of pages	16
Journal	IEEE Transactions on Mobile Computing
Volume	23
Issue number	6
DOIs	https://doi.org/10.1109/TMC.2023.3332081
State	Published - 1 Jun 2024

Keywords

Graph learning
intelligent transportation systems
MARL
traffic signal control

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/TMC.2023.3332081

Cite this

@article{4a0e5d9088d745fb80d2df3235165da0,

title = "Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning",

abstract = "This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms.",

keywords = "Graph learning, intelligent transportation systems, MARL, traffic signal control",

author = "Yao Zhang and Zhiwen Yu and Jun Zhang and Liang Wang and Luan, {Tom H.} and Bin Guo and Chau Yuen",

note = "Publisher Copyright: {\textcopyright} 2002-2012 IEEE.",

year = "2024",

month = jun,

day = "1",

doi = "10.1109/TMC.2023.3332081",

language = "英语",

volume = "23",

pages = "7180--7195",

journal = "IEEE Transactions on Mobile Computing",

issn = "1536-1233",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "6",

}

TY - JOUR

T1 - Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning

AU - Zhang, Yao

AU - Yu, Zhiwen

AU - Zhang, Jun

AU - Wang, Liang

AU - Luan, Tom H.

AU - Guo, Bin

AU - Yuen, Chau

PY - 2024/6/1

Y1 - 2024/6/1

N2 - This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms.

AB - This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms.

KW - Graph learning

KW - intelligent transportation systems

KW - MARL

KW - traffic signal control

UR - http://www.scopus.com/inward/record.url?scp=85177082081&partnerID=8YFLogxK

U2 - 10.1109/TMC.2023.3332081

DO - 10.1109/TMC.2023.3332081

M3 - 文章

AN - SCOPUS:85177082081

SN - 1536-1233

VL - 23

SP - 7180

EP - 7195

JO - IEEE Transactions on Mobile Computing

JF - IEEE Transactions on Mobile Computing

IS - 6

ER -

Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this