Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning

Yao Jing; Bin Guo; Nuo Li; Yasan Ding; Yan Liu; Zhiwen Yu

doi:10.1016/j.eswa.2024.125792

Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning

Yao Jing, Bin Guo, Nuo Li, Yasan Ding, Yan Liu, Zhiwen Yu

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework's effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users’ requests, improving user experience.

源语言	英语
文章编号	125792
期刊	Expert Systems with Applications
卷	264
DOI	https://doi.org/10.1016/j.eswa.2024.125792
出版状态	已出版 - 10 3月 2025

访问文件

10.1016/j.eswa.2024.125792

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{6956a0c350174dd5baa1b952561956d2,

title = "Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning",

abstract = "Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework's effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users{\textquoteright} requests, improving user experience.",

keywords = "Federated learning, Multi-Agent Reinforcement Learning, Scalable order dispatching",

author = "Yao Jing and Bin Guo and Nuo Li and Yasan Ding and Yan Liu and Zhiwen Yu",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier Ltd",

year = "2025",

month = mar,

day = "10",

doi = "10.1016/j.eswa.2024.125792",

language = "英语",

volume = "264",

journal = "Expert Systems with Applications",

issn = "0957-4174",

publisher = "Elsevier Ltd",

}

TY - JOUR

T1 - Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning

AU - Jing, Yao

AU - Guo, Bin

AU - Li, Nuo

AU - Ding, Yasan

AU - Liu, Yan

AU - Yu, Zhiwen

PY - 2025/3/10

Y1 - 2025/3/10

N2 - Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework's effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users’ requests, improving user experience.

AB - Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework's effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users’ requests, improving user experience.

KW - Federated learning

KW - Multi-Agent Reinforcement Learning

KW - Scalable order dispatching

UR - http://www.scopus.com/inward/record.url?scp=85209925285&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2024.125792

DO - 10.1016/j.eswa.2024.125792

M3 - 文章

AN - SCOPUS:85209925285

SN - 0957-4174

VL - 264

JO - Expert Systems with Applications

JF - Expert Systems with Applications

M1 - 125792

ER -

Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning

摘要

访问文件

其它文件与链接

指纹

引用此