TY - GEN
T1 - Path Planning Technology of Unmanned Vehicle Based on Improved Deep Reinforcement Learning
AU - Zhang, Kai
AU - Wang, Luhe
AU - Hu, Jinwen
AU - Xu, Zhao
AU - Guo, Chubing
N1 - Publisher Copyright:
© 2021 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2021/7/26
Y1 - 2021/7/26
N2 - As a basic problem of unmanned vehicle navigation control, path planning has been widely studied. Reinforcement learning (RL) has been found to be an effective approach to path optimization for highly nonlinear and unmodeled dynamics. However, RL-based methods suffer from the "curse of dimensionality" in high-dimensional state spaces. In this paper, the path planning of an unmanned vehicle with collision avoidance is considered, and an improved Deep Q-Network (DQN) algorithm is proposed to reduce the computational load in the high-dimensional state space. First, the states, actions, and rewards are defined according to the task requirements, and a smoothing function is introduced as an additional penalty term to modify the basic reward function. Then, the two-dimensional grid of the state space is mapped to a grayscale image, which is used as the input to the neural network, i.e., the Q-network. Finally, simulation results show that the modified DQN algorithm is more stable and that the fluctuation frequency is significantly reduced.
AB - As a basic problem of unmanned vehicle navigation control, path planning has been widely studied. Reinforcement learning (RL) has been found to be an effective approach to path optimization for highly nonlinear and unmodeled dynamics. However, RL-based methods suffer from the "curse of dimensionality" in high-dimensional state spaces. In this paper, the path planning of an unmanned vehicle with collision avoidance is considered, and an improved Deep Q-Network (DQN) algorithm is proposed to reduce the computational load in the high-dimensional state space. First, the states, actions, and rewards are defined according to the task requirements, and a smoothing function is introduced as an additional penalty term to modify the basic reward function. Then, the two-dimensional grid of the state space is mapped to a grayscale image, which is used as the input to the neural network, i.e., the Q-network. Finally, simulation results show that the modified DQN algorithm is more stable and that the fluctuation frequency is significantly reduced.
KW - DQN
KW - path planning
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85117292825&partnerID=8YFLogxK
U2 - 10.23919/CCC52363.2021.9549620
DO - 10.23919/CCC52363.2021.9549620
M3 - Conference contribution
AN - SCOPUS:85117292825
T3 - Chinese Control Conference, CCC
SP - 8392
EP - 8397
BT - Proceedings of the 40th Chinese Control Conference, CCC 2021
A2 - Peng, Chen
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 40th Chinese Control Conference, CCC 2021
Y2 - 26 July 2021 through 28 July 2021
ER -