TY - JOUR
T1 - A path planning method based on deep reinforcement learning for AUV in complex marine environment
AU - Zhang, An
AU - Wang, Weixiang
AU - Bi, Wenhao
AU - Huang, Zhanjun
N1 - Publisher Copyright:
© 2024 Elsevier Ltd
PY - 2024/12/1
Y1 - 2024/12/1
N2 - The potential of autonomous underwater vehicles (AUVs) in future applications is significant due to advancements in autonomy and intelligence. Path planning is a critical technology for AUVs performing operational missions in complex marine environments. To this end, this paper proposes a path planning method for AUVs based on deep reinforcement learning. First, considering practical requirements, a complex marine environment model incorporating underwater terrain, sonobuoy detection, and ocean currents is established. Next, the corresponding state space, action space, and reward function are formulated. Furthermore, to address the limited training efficiency of existing deep reinforcement learning algorithms, a mixed experience replay (MER) strategy is proposed, which enhances sample-learning efficiency by integrating prior knowledge with exploration experience. Finally, a novel HMER-SAC algorithm for AUV path planning is proposed by combining the Soft Actor–Critic (SAC) algorithm with a hierarchical reinforcement learning strategy and the MER strategy. Simulation and experimental results demonstrate that the method can efficiently plan executable paths in complex marine environments and exhibits superior training efficiency, stability, and performance.
AB - The potential of autonomous underwater vehicles (AUVs) in future applications is significant due to advancements in autonomy and intelligence. Path planning is a critical technology for AUVs performing operational missions in complex marine environments. To this end, this paper proposes a path planning method for AUVs based on deep reinforcement learning. First, considering practical requirements, a complex marine environment model incorporating underwater terrain, sonobuoy detection, and ocean currents is established. Next, the corresponding state space, action space, and reward function are formulated. Furthermore, to address the limited training efficiency of existing deep reinforcement learning algorithms, a mixed experience replay (MER) strategy is proposed, which enhances sample-learning efficiency by integrating prior knowledge with exploration experience. Finally, a novel HMER-SAC algorithm for AUV path planning is proposed by combining the Soft Actor–Critic (SAC) algorithm with a hierarchical reinforcement learning strategy and the MER strategy. Simulation and experimental results demonstrate that the method can efficiently plan executable paths in complex marine environments and exhibits superior training efficiency, stability, and performance.
KW - Autonomous underwater vehicle
KW - Deep reinforcement learning
KW - Hierarchical reinforcement learning
KW - Path planning
KW - Soft Actor–Critic
UR - http://www.scopus.com/inward/record.url?scp=85205326102&partnerID=8YFLogxK
U2 - 10.1016/j.oceaneng.2024.119354
DO - 10.1016/j.oceaneng.2024.119354
M3 - Article
AN - SCOPUS:85205326102
SN - 0029-8018
VL - 313
JO - Ocean Engineering
JF - Ocean Engineering
M1 - 119354
ER -