Depth Control of a Biomimetic Manta Robot via Reinforcement Learning

Daili Zhang; Guang Pan; Yonghui Cao; Qiaogao Huang; Yong Cao

doi:10.1007/978-981-99-0617-8_5

School of Marine Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citation

Abstract

This paper proposes a model-free depth control method for a biomimetic manta robot based on reinforcement learning. Unlike traditional control methods, the reinforcement learning method does not require a mathematical model of the controlled object; it learns the control law autonomously from training data. Based on the classical Q algorithm, the state space, action space, and reward function for depth control of the biomimetic manta robot are designed. The state-action function is trained offline using an experience replay mechanism and a random sampling strategy. Finally, the trained function is transplanted to the biomimetic manta robot prototype to establish a controller. Experiments verify the effectiveness of the proposed control method.
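The offline training loop described in the abstract can be illustrated with a minimal tabular sketch. It is not the authors' implementation: the abstract gives no state space, action space, reward function, or hyperparameters, so the discretised depth error, the three commands, the negative-error reward, the one-bin toy plant in `step`, and every numeric constant below are assumptions made only for illustration.

```python
import random

# All values below are illustrative assumptions; the paper's actual state
# space, action space, reward, and hyperparameters are not given in the abstract.
ERROR_BINS = range(-5, 6)   # hypothetical discretised depth error (target - current)
ACTIONS = (-1, 0, 1)        # hypothetical commands: ascend, hold, descend


def step(error, action):
    """Toy plant (not a manta model): the command shifts the error by one bin."""
    next_error = max(-5, min(5, error - action))
    reward = -abs(next_error)  # assumed reward: penalise remaining depth error
    return next_error, reward


def collect_experience(n_steps, rng, episode_len=20):
    """Fill a replay buffer with transitions from a uniformly random policy."""
    buffer = []
    error = 0
    for i in range(n_steps):
        if i % episode_len == 0:
            error = rng.choice(list(ERROR_BINS))  # restart from a random error
        action = rng.choice(ACTIONS)
        next_error, reward = step(error, action)
        buffer.append((error, action, reward, next_error))
        error = next_error
    return buffer


def train_offline(buffer, rng, n_updates=20000, alpha=0.5, gamma=0.9):
    """Q-learning updates on transitions sampled at random from the buffer."""
    q = {(s, a): 0.0 for s in ERROR_BINS for a in ACTIONS}
    for _ in range(n_updates):
        s, a, r, s2 = rng.choice(buffer)
        target = r + gamma * max(q[(s2, b)] for b in ACTIONS)
        q[(s, a)] += alpha * (target - q[(s, a)])
    return q


def greedy_action(q, error):
    """Controller step: pick the command with the highest learned value."""
    return max(ACTIONS, key=lambda a: q[(error, a)])


if __name__ == "__main__":
    rng = random.Random(0)
    q_table = train_offline(collect_experience(5000, rng), rng)
    print({e: greedy_action(q_table, e) for e in ERROR_BINS})
```

In this sketch the learned table plays the role of the trained state-action function: once training finishes, `greedy_action` is the only piece a controller would need to call at run time, which mirrors the idea of transplanting the trained function to the prototype.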

Original language	English
Title of host publication	Cognitive Systems and Information Processing - 7th International Conference, ICCSIP 2022, Revised Selected Papers
Editors	Fuchun Sun, Angelo Cangelosi, Jianwei Zhang, Yuanlong Yu, Huaping Liu, Bin Fang
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	59-69
Number of pages	11
ISBN (Print)	9789819906161
DOIs	https://doi.org/10.1007/978-981-99-0617-8_5
State	Published - 2023
Event	7th International Conference on Cognitive Systems and Information Processing, ICCSIP 2022 - Fuzhou, China
Duration	17 Dec 2022 → 18 Dec 2022

Publication series

Name	Communications in Computer and Information Science
Volume	1787 CCIS
ISSN (Print)	1865-0929
ISSN (Electronic)	1865-0937

Conference

Conference	7th International Conference on Cognitive Systems and Information Processing, ICCSIP 2022
Country/Territory	China
City	Fuzhou
Period	17/12/22 → 18/12/22

Keywords

Autonomous underwater vehicle
Biomimetic manta robot
Depth control
Reinforcement learning

Access to Document

10.1007/978-981-99-0617-8_5

Cite this

Zhang, D., Pan, G., Cao, Y., Huang, Q., & Cao, Y. (2023). Depth Control of a Biomimetic Manta Robot via Reinforcement Learning. In F. Sun, A. Cangelosi, J. Zhang, Y. Yu, H. Liu, & B. Fang (Eds.), Cognitive Systems and Information Processing - 7th International Conference, ICCSIP 2022, Revised Selected Papers (pp. 59-69). (Communications in Computer and Information Science; Vol. 1787 CCIS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-99-0617-8_5

Zhang, Daili ; Pan, Guang ; Cao, Yonghui et al. / Depth Control of a Biomimetic Manta Robot via Reinforcement Learning. Cognitive Systems and Information Processing - 7th International Conference, ICCSIP 2022, Revised Selected Papers. editor / Fuchun Sun ; Angelo Cangelosi ; Jianwei Zhang ; Yuanlong Yu ; Huaping Liu ; Bin Fang. Springer Science and Business Media Deutschland GmbH, 2023. pp. 59-69 (Communications in Computer and Information Science).

@inproceedings{cdfaf13f94c24e8ab2e18fd634c01415,

title = "Depth Control of a Biomimetic Manta Robot via Reinforcement Learning",

abstract = "This paper proposes a model-free biomimetic manta robot depth control method based on reinforcement learning. Different from the traditional control method, the reinforcement learning method does not need to establish a mathematical model of the control object, and autonomously learns the control law through data training. Based on the classical Q algorithm, the state space, the action space, and reward function of the depth control of the bionic manta robot are designed. The state-action function is trained offline using the experience replay mechanism and random sampling strategy. Finally, the trained function is transplanted to the biomimetic manta robot prototype to establish a controller. The effectiveness of the proposed control method is verified by experiments.",

keywords = "Autonomous underwater vehicle, Biomimetic manta robot, Depth control, Reinforcement learning",

author = "Daili Zhang and Guang Pan and Yonghui Cao and Qiaogao Huang and Yong Cao",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.; 7th International Conference on Cognitive Systems and Information Processing, ICCSIP 2022 ; Conference date: 17-12-2022 Through 18-12-2022",

year = "2023",

doi = "10.1007/978-981-99-0617-8_5",

language = "English",

isbn = "9789819906161",

series = "Communications in Computer and Information Science",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "59--69",

editor = "Fuchun Sun and Angelo Cangelosi and Jianwei Zhang and Yuanlong Yu and Huaping Liu and Bin Fang",

booktitle = "Cognitive Systems and Information Processing - 7th International Conference, ICCSIP 2022, Revised Selected Papers",

}

Zhang, D, Pan, G, Cao, Y, Huang, Q & Cao, Y 2023, Depth Control of a Biomimetic Manta Robot via Reinforcement Learning. in F Sun, A Cangelosi, J Zhang, Y Yu, H Liu & B Fang (eds), Cognitive Systems and Information Processing - 7th International Conference, ICCSIP 2022, Revised Selected Papers. Communications in Computer and Information Science, vol. 1787 CCIS, Springer Science and Business Media Deutschland GmbH, pp. 59-69, 7th International Conference on Cognitive Systems and Information Processing, ICCSIP 2022, Fuzhou, China, 17/12/22. https://doi.org/10.1007/978-981-99-0617-8_5

Depth Control of a Biomimetic Manta Robot via Reinforcement Learning. / Zhang, Daili; Pan, Guang; Cao, Yonghui et al.
Cognitive Systems and Information Processing - 7th International Conference, ICCSIP 2022, Revised Selected Papers. ed. / Fuchun Sun; Angelo Cangelosi; Jianwei Zhang; Yuanlong Yu; Huaping Liu; Bin Fang. Springer Science and Business Media Deutschland GmbH, 2023. p. 59-69 (Communications in Computer and Information Science; Vol. 1787 CCIS).

TY - GEN

T1 - Depth Control of a Biomimetic Manta Robot via Reinforcement Learning

AU - Zhang, Daili

AU - Pan, Guang

AU - Cao, Yonghui

AU - Huang, Qiaogao

AU - Cao, Yong

PY - 2023

Y1 - 2023

N2 - This paper proposes a model-free biomimetic manta robot depth control method based on reinforcement learning. Different from the traditional control method, the reinforcement learning method does not need to establish a mathematical model of the control object, and autonomously learns the control law through data training. Based on the classical Q algorithm, the state space, the action space, and reward function of the depth control of the bionic manta robot are designed. The state-action function is trained offline using the experience replay mechanism and random sampling strategy. Finally, the trained function is transplanted to the biomimetic manta robot prototype to establish a controller. The effectiveness of the proposed control method is verified by experiments.

AB - This paper proposes a model-free biomimetic manta robot depth control method based on reinforcement learning. Different from the traditional control method, the reinforcement learning method does not need to establish a mathematical model of the control object, and autonomously learns the control law through data training. Based on the classical Q algorithm, the state space, the action space, and reward function of the depth control of the bionic manta robot are designed. The state-action function is trained offline using the experience replay mechanism and random sampling strategy. Finally, the trained function is transplanted to the biomimetic manta robot prototype to establish a controller. The effectiveness of the proposed control method is verified by experiments.

KW - Autonomous underwater vehicle

KW - Biomimetic manta robot

KW - Depth control

KW - Reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85149893062&partnerID=8YFLogxK

U2 - 10.1007/978-981-99-0617-8_5

DO - 10.1007/978-981-99-0617-8_5

M3 - Conference contribution

AN - SCOPUS:85149893062

SN - 9789819906161

T3 - Communications in Computer and Information Science

SP - 59

EP - 69

BT - Cognitive Systems and Information Processing - 7th International Conference, ICCSIP 2022, Revised Selected Papers

A2 - Sun, Fuchun

A2 - Cangelosi, Angelo

A2 - Zhang, Jianwei

A2 - Yu, Yuanlong

A2 - Liu, Huaping

A2 - Fang, Bin

PB - Springer Science and Business Media Deutschland GmbH

T2 - 7th International Conference on Cognitive Systems and Information Processing, ICCSIP 2022

Y2 - 17 December 2022 through 18 December 2022

ER -

Zhang D, Pan G, Cao Y, Huang Q, Cao Y. Depth Control of a Biomimetic Manta Robot via Reinforcement Learning. In Sun F, Cangelosi A, Zhang J, Yu Y, Liu H, Fang B, editors, Cognitive Systems and Information Processing - 7th International Conference, ICCSIP 2022, Revised Selected Papers. Springer Science and Business Media Deutschland GmbH. 2023. p. 59-69. (Communications in Computer and Information Science). doi: 10.1007/978-981-99-0617-8_5
