跳到主要导航 跳到搜索 跳到主要内容

A policy-based Monte Carlo tree search method for container pre-marshalling

  • Ziliang Wang
  • , Chenhao Zhou
  • , Ada Che
  • , Jingkun Gao
  • Northwestern Polytechnical University Xian
  • Northwest Electric Power Design Institute Co., Ltd.

科研成果: 期刊稿件文章同行评审

9 引用 (Scopus)

摘要

The container pre-marshalling problem (CPMP) aims to minimise the number of reshuffling moves, ultimately achieving an optimised stacking arrangement in each bay based on the priority of containers during the non-loading phase. Given the sequential decision nature, we formulated the CPMP as a Markov decision process (MDP) model to account for the specific state and action of the reshuffling process. To address the challenge that the relocated container may trigger a chain effect on the subsequent reshuffling moves, this paper develops an improved policy-based Monte Carlo tree search (P-MCTS) to solve the CPMP, where eight composite reshuffling rules and modified upper confidence bounds are employed in the selection phases, and a well-designed heuristic algorithm is utilised in the simulation phases. Meanwhile, considering the effectiveness of reinforcement learning methods for solving the MDP model, an improved Q-learning is proposed as the compared method. Numerical results show that the P-MCTS outperforms all compared methods in scenarios where all containers have different priorities and scenarios where containers can share the same priority.

源语言英语
页(从-至)4776-4792
页数17
期刊International Journal of Production Research
62
13
DOI
出版状态已出版 - 2024

指纹

探究 'A policy-based Monte Carlo tree search method for container pre-marshalling' 的科研主题。它们共同构成独一无二的指纹。

引用此