A multi-objective reinforcement learning approach for AGV task clustering

Jiawei Liu, Wentao Zhang, Tao Zhang, Ruyi Zheng

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The complete scheduling problem of unmanned warehouse AGV is a complex NP-hard problem with a process that includes three parts: task assignment dispatching, path planning, and traffic coordination. This study focuses on determining the optimal task assignment scheme for the AGV considering all known information. With the goal of balancing the workload at each picking station, an unsupervised reinforcement learning-based classification assignment method is proposed. The complex multi-task assignment problem is decomposed into a two-stage assignment problem. In order to reduce the task load of AGV and picking stations, the picking and storing region is first classified, and then tasks are assigned. The algorithm uses a hierarchical reinforcement learning method based on policy gradient to assign the storage nodes by class with the set optimization target. Experiments show that using this approach reduces the difficulty of the AGV scheduling problem and increases the efficiency of the solution.

Original languageEnglish
Title of host publicationInternational Conference on Computer Vision, Robotics, and Automation Engineering, CRAE 2024
EditorsQiang Cheng, Lei Chen
PublisherSPIE
ISBN (Electronic)9781510682283
DOIs
StatePublished - 2024
EventInternational Conference on Computer Vision, Robotics, and Automation Engineering, CRAE 2024 - Kunming, China
Duration: 21 Jun 202423 Jun 2024

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume13249
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

ConferenceInternational Conference on Computer Vision, Robotics, and Automation Engineering, CRAE 2024
Country/TerritoryChina
CityKunming
Period21/06/2423/06/24

Keywords

  • AGV
  • path planning
  • Reinforcement Learning
  • task assignment dispatching

Fingerprint

Dive into the research topics of 'A multi-objective reinforcement learning approach for AGV task clustering'. Together they form a unique fingerprint.

Cite this