跳到主要导航 跳到搜索 跳到主要内容

A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control

  • Northwestern Polytechnical University Xian

科研成果: 期刊稿件文章同行评审

11 引用 (Scopus)

摘要

Using reinforcement learning to control traffic signal systems has been discussed in recent years, but most works focused on simple scenarios such as a single crossroads, and the methods aiming at large-scale traffic scenarios face long-time training and suboptimal results. In this work, we develop a new multi-agent reinforcement model for large-scale traffic signal control tasks, and a curriculum transfer learning method is developed to optimize the joint policy step by step. The policies for different intersections are trained in a partially observable Markov decision process with centralized training and decentralized execution mechanism, and we design transformer modules for both the policy and evaluation networks by attention mechanism. We first train policies in a simple traffic scenario, and then these policies are transferred to the next curriculum by policy reloading, while the experiences of the source task are reused selectively. With the number of agents increasing, our method can achieve satisfactory performances quickly by reusing the knowledge from previous curriculums. We conduct several experiments on the Cityflow testbed. In the case of more than 10 crossroads, our model improve the mean reward from 3.0 to 5.0.

源语言英语
页(从-至)21433-21447
页数15
期刊Applied Intelligence
53
18
DOI
出版状态已出版 - 9月 2023

指纹

探究 'A multi-agent reinforcement learning method with curriculum transfer for large-scale dynamic traffic signal control' 的科研主题。它们共同构成独一无二的指纹。

引用此