An improved DDPG reinforcement learning control of underwater gliders for energy optimization

Anyan Jing; Zuocheng Tang; Jian Gao; Guang Pan

doi:10.1109/ICUS50048.2020.9274883

An improved DDPG reinforcement learning control of underwater gliders for energy optimization

Anyan Jing, Zuocheng Tang, Jian Gao, Guang Pan

航海学院

Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

5 引用（Scopus）

摘要

As a novel underw ater vehicle, underw ater gliders are widely used in marine environment exploration. Underwater gliders are designed for long-term and longdistance operation, adaptivity and energy optimization is a critical requirement for controller design. In this paper, the reinforcement learning control is studied for underwater gliders, and the problem of slow learning convergence and unstable learning process of the DDPG reinforcement learning algorithm. The proposed solution is based on the priority experience replay method, which effectively increase the convergence speed and stability of the algorithm is addressed. The gliding control parameters are optimized to reduce the energy consumption is proposed, by using the improved DDPG algorithm and the energy consumption model. In the simulation experiments with an underwater glider, a set of glide parameters is obtained at a given gliding depth.

源语言	英语
主期刊名	Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020
出版商	Institute of Electrical and Electronics Engineers Inc.
页	621-626
页数	6
ISBN（电子版）	9781728180250
DOI	https://doi.org/10.1109/ICUS50048.2020.9274883
出版状态	已出版 - 27 11月 2020
活动	3rd International Conference on Unmanned Systems, ICUS 2020 - Harbin, 中国期限: 27 11月 2020 → 28 11月 2020

出版系列

姓名	Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020

会议

会议	3rd International Conference on Unmanned Systems, ICUS 2020
国家/地区	中国
市	Harbin
时期	27/11/20 → 28/11/20

联合国可持续发展目标

此成果有助于实现下列可持续发展目标：

访问文件

10.1109/ICUS50048.2020.9274883

其它文件与链接

链接到 Scopus 的出版物

引用此

Jing, A., Tang, Z., Gao, J., & Pan, G. (2020). An improved DDPG reinforcement learning control of underwater gliders for energy optimization. 在 Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020 (页码 621-626). 文章 9274883 (Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICUS50048.2020.9274883

Jing, Anyan ; Tang, Zuocheng ; Gao, Jian 等. / An improved DDPG reinforcement learning control of underwater gliders for energy optimization. Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020. Institute of Electrical and Electronics Engineers Inc., 2020. 页码 621-626 (Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020).

@inproceedings{4ce6e0248dcf4e26874579c9faa29f55,

title = "An improved DDPG reinforcement learning control of underwater gliders for energy optimization",

abstract = "As a novel underw ater vehicle, underw ater gliders are widely used in marine environment exploration. Underwater gliders are designed for long-term and longdistance operation, adaptivity and energy optimization is a critical requirement for controller design. In this paper, the reinforcement learning control is studied for underwater gliders, and the problem of slow learning convergence and unstable learning process of the DDPG reinforcement learning algorithm. The proposed solution is based on the priority experience replay method, which effectively increase the convergence speed and stability of the algorithm is addressed. The gliding control parameters are optimized to reduce the energy consumption is proposed, by using the improved DDPG algorithm and the energy consumption model. In the simulation experiments with an underwater glider, a set of glide parameters is obtained at a given gliding depth.",

keywords = "Deep deterministic policy gradient, Glide parameters optimization, Prioritized experience replay, Reinforcement learning, Underwater glider",

author = "Anyan Jing and Zuocheng Tang and Jian Gao and Guang Pan",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE.; 3rd International Conference on Unmanned Systems, ICUS 2020 ; Conference date: 27-11-2020 Through 28-11-2020",

year = "2020",

month = nov,

day = "27",

doi = "10.1109/ICUS50048.2020.9274883",

language = "英语",

series = "Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "621--626",

booktitle = "Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020",

}

Jing, A, Tang, Z, Gao, J & Pan, G 2020, An improved DDPG reinforcement learning control of underwater gliders for energy optimization. 在 Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020., 9274883, Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020, Institute of Electrical and Electronics Engineers Inc., 页码 621-626, 3rd International Conference on Unmanned Systems, ICUS 2020, Harbin, 中国, 27/11/20. https://doi.org/10.1109/ICUS50048.2020.9274883

An improved DDPG reinforcement learning control of underwater gliders for energy optimization. / Jing, Anyan; Tang, Zuocheng; Gao, Jian 等.
Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020. Institute of Electrical and Electronics Engineers Inc., 2020. 页码 621-626 9274883 (Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - An improved DDPG reinforcement learning control of underwater gliders for energy optimization

AU - Jing, Anyan

AU - Tang, Zuocheng

AU - Gao, Jian

AU - Pan, Guang

PY - 2020/11/27

Y1 - 2020/11/27

N2 - As a novel underw ater vehicle, underw ater gliders are widely used in marine environment exploration. Underwater gliders are designed for long-term and longdistance operation, adaptivity and energy optimization is a critical requirement for controller design. In this paper, the reinforcement learning control is studied for underwater gliders, and the problem of slow learning convergence and unstable learning process of the DDPG reinforcement learning algorithm. The proposed solution is based on the priority experience replay method, which effectively increase the convergence speed and stability of the algorithm is addressed. The gliding control parameters are optimized to reduce the energy consumption is proposed, by using the improved DDPG algorithm and the energy consumption model. In the simulation experiments with an underwater glider, a set of glide parameters is obtained at a given gliding depth.

AB - As a novel underw ater vehicle, underw ater gliders are widely used in marine environment exploration. Underwater gliders are designed for long-term and longdistance operation, adaptivity and energy optimization is a critical requirement for controller design. In this paper, the reinforcement learning control is studied for underwater gliders, and the problem of slow learning convergence and unstable learning process of the DDPG reinforcement learning algorithm. The proposed solution is based on the priority experience replay method, which effectively increase the convergence speed and stability of the algorithm is addressed. The gliding control parameters are optimized to reduce the energy consumption is proposed, by using the improved DDPG algorithm and the energy consumption model. In the simulation experiments with an underwater glider, a set of glide parameters is obtained at a given gliding depth.

KW - Deep deterministic policy gradient

KW - Glide parameters optimization

KW - Prioritized experience replay

KW - Reinforcement learning

KW - Underwater glider

UR - http://www.scopus.com/inward/record.url?scp=85098997917&partnerID=8YFLogxK

U2 - 10.1109/ICUS50048.2020.9274883

DO - 10.1109/ICUS50048.2020.9274883

M3 - 会议稿件

AN - SCOPUS:85098997917

T3 - Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020

SP - 621

EP - 626

BT - Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 3rd International Conference on Unmanned Systems, ICUS 2020

Y2 - 27 November 2020 through 28 November 2020

ER -

Jing A, Tang Z, Gao J, Pan G. An improved DDPG reinforcement learning control of underwater gliders for energy optimization. 在 Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020. Institute of Electrical and Electronics Engineers Inc. 2020. 页码 621-626. 9274883. (Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020). doi: 10.1109/ICUS50048.2020.9274883

An improved DDPG reinforcement learning control of underwater gliders for energy optimization

摘要

出版系列

会议

联合国可持续发展目标

访问文件

其它文件与链接

指纹

引用此