TY - GEN
T1 - Decentralized Multi-AGV Task Allocation based on Multi-Agent Reinforcement Learning with Information Potential Field Rewards
AU - Li, Mengyuan
AU - Guo, Bin
AU - Zhang, Jiangshan
AU - Liu, Jiaqi
AU - Liu, Sicong
AU - Yu, Zhiwen
AU - Li, Zhetao
AU - Xiang, Liyao
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Automated Guided Vehicles (AGVs) have been widely used for material handling on flexible shop floors. Each product requires various raw materials to complete assembly during production, and AGVs automate the handling of raw materials across different locations. An efficient AGV task allocation strategy can reduce transportation costs and improve distribution efficiency. However, traditional centralized approaches place high demands on the control center's computing power and real-time capability. In this paper, we present decentralized solutions to achieve flexible and self-organized AGV task allocation. In particular, we propose two improved multi-agent reinforcement learning algorithms, MADDPG-IPF (Information Potential Field) and BiCNet-IPF, to realize coordination among AGVs adapting to different scenarios. To address the reward-sparsity issue, we propose a reward shaping strategy based on the information potential field, which provides stepwise rewards and implicitly guides the AGVs to different material targets. We conduct experiments under different settings (3 AGVs and 6 AGVs), and the results indicate that, compared with baseline methods, our approach achieves up to a 47% improvement in task response and a 22% reduction in training iterations.
AB - Automated Guided Vehicles (AGVs) have been widely used for material handling on flexible shop floors. Each product requires various raw materials to complete assembly during production, and AGVs automate the handling of raw materials across different locations. An efficient AGV task allocation strategy can reduce transportation costs and improve distribution efficiency. However, traditional centralized approaches place high demands on the control center's computing power and real-time capability. In this paper, we present decentralized solutions to achieve flexible and self-organized AGV task allocation. In particular, we propose two improved multi-agent reinforcement learning algorithms, MADDPG-IPF (Information Potential Field) and BiCNet-IPF, to realize coordination among AGVs adapting to different scenarios. To address the reward-sparsity issue, we propose a reward shaping strategy based on the information potential field, which provides stepwise rewards and implicitly guides the AGVs to different material targets. We conduct experiments under different settings (3 AGVs and 6 AGVs), and the results indicate that, compared with baseline methods, our approach achieves up to a 47% improvement in task response and a 22% reduction in training iterations.
KW - AGVs
KW - Decentralized task allocation
KW - Information potential field
KW - Multi-agent reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85123916801&partnerID=8YFLogxK
U2 - 10.1109/MASS52906.2021.00066
DO - 10.1109/MASS52906.2021.00066
M3 - Conference contribution
AN - SCOPUS:85123916801
T3 - Proceedings - 2021 IEEE 18th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2021
SP - 482
EP - 489
BT - Proceedings - 2021 IEEE 18th International Conference on Mobile Ad Hoc and Smart Systems, MASS 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 18th IEEE International Conference on Mobile Ad Hoc and Smart Systems, MASS 2021
Y2 - 4 October 2021 through 7 October 2021
ER -