跳到主要导航 跳到搜索 跳到主要内容

Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning

  • Northwestern Polytechnical University Xian
  • Harbin Engineering University
  • Hong Kong University of Science and Technology
  • Xidian University
  • Nanyang Technological University

科研成果: 期刊稿件文章同行评审

26 引用 (Scopus)

摘要

This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcement Learning (MARL) is a promising solution. Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents. In this paper, we design a new decentralized control architecture with improved environmental observability to capture the spatial-temporal correlation. Specifically, we first develop a topology-aware information aggregation strategy to extract correlation-related information from unstructured data gathered in the road network. Particularly, we transfer the road network topology into a graph shift operator by forming a diffusion process on the topology, which subsequently facilitates the construction of graph signals. A diffusion convolution module is developed, forming a new MARL algorithm, which endows agents with the capabilities of graph learning. Extensive experiments based on both synthetic and real-world datasets verify that our proposal outperforms existing decentralized algorithms.

源语言英语
页(从-至)7180-7195
页数16
期刊IEEE Transactions on Mobile Computing
23
6
DOI
出版状态已出版 - 1 6月 2024

联合国可持续发展目标

此成果有助于实现下列可持续发展目标:

  1. 可持续发展目标 11 - 可持续城市和社区
    可持续发展目标 11 可持续城市和社区

指纹

探究 'Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此