TY - JOUR
T1 - Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning
AU - Zhang, Yao
AU - Yu, Zhiwen
AU - Zhang, Jun
AU - Wang, Liang
AU - Luan, Tom H.
AU - Guo, Bin
AU - Yuen, Chau
N1 - Publisher Copyright:
© 2002-2012 IEEE.
PY - 2024/6/1
Y1 - 2024/6/1
N2 - This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms.
AB - This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms.
KW - Graph learning
KW - intelligent transportation systems
KW - MARL
KW - traffic signal control
UR - http://www.scopus.com/inward/record.url?scp=85177082081&partnerID=8YFLogxK
U2 - 10.1109/TMC.2023.3332081
DO - 10.1109/TMC.2023.3332081
M3 - 文章
AN - SCOPUS:85177082081
SN - 1536-1233
VL - 23
SP - 7180
EP - 7195
JO - IEEE Transactions on Mobile Computing
JF - IEEE Transactions on Mobile Computing
IS - 6
ER -