Abstract
To address the incomplete search and slow convergence of conventional mobile robot path planning based on the deep double Q-network (DDQN), we propose an improved DDQN (I-DDQN) learning algorithm. First, I-DDQN estimates the value function of the DDQN algorithm with a dueling network architecture. Second, we propose a robot path exploration strategy based on a two-layer controller structure, in which the value function of the upper controller explores locally optimal actions of the mobile robot and the value function of the lower controller learns the global task policy. In addition, during learning, the algorithm collects and samples experience through a prioritized experience replay mechanism and trains the network on mini-batches. Finally, we compare I-DDQN with the conventional DDQN algorithm and an improved DDQN variant in two simulation environments, OpenAI Gym and Gazebo. The experimental results show that I-DDQN outperforms both baselines on all evaluation indicators in the two environments and effectively overcomes incomplete path search and slow convergence in the same complex environments.
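To make the abstract's ingredients concrete, the sketch below (Python/PyTorch, not the authors' code) shows the three named components: a dueling Q-network, the DDQN target that decouples action selection from evaluation, and a proportional prioritized replay buffer. All class names, layer sizes, and hyperparameters (e.g. `DuelingQNet`, `hidden=128`, `alpha=0.6`) are illustrative assumptions; the paper's two-layer controller would instantiate two such value networks, the upper one for local actions and the lower one for the global task policy.

```python
import numpy as np
import torch
import torch.nn as nn


class DuelingQNet(nn.Module):
    """Dueling architecture: Q(s, a) = V(s) + A(s, a) - mean_a' A(s, a')."""

    def __init__(self, state_dim: int, n_actions: int, hidden: int = 128):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(state_dim, hidden), nn.ReLU())
        self.value = nn.Linear(hidden, 1)        # state-value stream V(s)
        self.adv = nn.Linear(hidden, n_actions)  # advantage stream A(s, a)

    def forward(self, s: torch.Tensor) -> torch.Tensor:
        h = self.body(s)
        v, a = self.value(h), self.adv(h)
        return v + a - a.mean(dim=1, keepdim=True)


class PrioritizedReplay:
    """Proportional prioritized replay (flat arrays; real code would use a sum tree)."""

    def __init__(self, capacity: int = 10_000, alpha: float = 0.6):
        self.capacity, self.alpha = capacity, alpha
        self.buffer, self.priorities = [], []

    def push(self, transition, priority: float = 1.0) -> None:
        if len(self.buffer) >= self.capacity:  # drop the oldest transition
            self.buffer.pop(0)
            self.priorities.pop(0)
        self.buffer.append(transition)
        self.priorities.append(priority)

    def sample(self, batch_size: int):
        # Sampling probability proportional to priority^alpha.
        p = np.asarray(self.priorities) ** self.alpha
        p /= p.sum()
        idx = np.random.choice(len(self.buffer), size=batch_size, p=p)
        return idx, [self.buffer[i] for i in idx]

    def update(self, idx, td_errors) -> None:
        # New priority = |TD error| + small constant so no transition starves.
        for i, e in zip(idx, td_errors):
            self.priorities[i] = abs(float(e)) + 1e-6


def ddqn_targets(online: nn.Module, target: nn.Module, r: torch.Tensor,
                 s2: torch.Tensor, done: torch.Tensor,
                 gamma: float = 0.99) -> torch.Tensor:
    """DDQN target: the online net picks a', the target net evaluates it."""
    with torch.no_grad():
        a2 = online(s2).argmax(dim=1, keepdim=True)  # action selection
        q2 = target(s2).gather(1, a2).squeeze(1)     # action evaluation
        return r + gamma * (1.0 - done) * q2
```

A training step under these assumptions would minimize the TD error between `online(s).gather(1, a)` and `ddqn_targets(...)` on a sampled mini-batch, then feed the absolute TD errors back through `update` to re-prioritize the sampled transitions.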
| Translated title of the contribution | Mobile Robot Path Planning Algorithm with Improved Deep Double Q Networks |
| --- | --- |
| Original language | Traditional Chinese |
| Pages (from-to) | 365-376 |
| Number of pages | 12 |
| Journal | Information and Control |
| Volume | 53 |
| Issue | 3 |
| DOI | |
| Publication status | Published - 2024 |
Keywords
- deep learning
- dueling network architecture
- hierarchical deep reinforcement learning
- prioritized experience replay
- reinforcement learning
- robot path planning