A Multi-robot Lunar Area Coverage Method Based on Deep Reinforcement Learning

Yufei Guo; Zixuan Zheng; Qiming Liang; Jianping Yuan

A Multi-robot Lunar Area Coverage Method Based on Deep Reinforcement Learning

Yufei Guo, Zixuan Zheng, Qiming Liang, Jianping Yuan

School of Astronautics

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Conference article › peer-review

Abstract

When exploring celestial bodies like the Moon, lunar surface coverage becomes a critical task, necessitating multiple robots to cover as much area as possible on the lunar surface. However, the intricate lunar terrain and environmental uncertainties render traditional area coverage methods ineffective. Therefore, this paper proposes a multi-robot lunar surface area coverage approach based on deep reinforcement learning (DRL). The approach consists of two phases: the training phase and the execution phase. During the training phase, robots learn a strategy to maximize lunar surface area coverage, employing the multi-agent deep deterministic policy gradient (MADDPG) algorithm for deep reinforcement learning. In the execution phase, robots execute movements based on their current state and learned strategy to achieve lunar surface coverage. Furthermore, a series of simulation experiments were conducted, with coverage rate during the execution phase serving as the evaluation metric. Experimental results demonstrate that our approach significantly enhances lunar surface area coverage compared to traditional methods. In the future, we will further optimize the algorithm to accomplish more efficient and intelligent lunar exploration missions.

Original language	English
Journal	Proceedings of the International Astronautical Congress, IAC
Volume	2023-October
State	Published - 2023
Event	74th International Astronautical Congress, IAC 2023 - Baku, Azerbaijan Duration: 2 Oct 2023 → 6 Oct 2023

Keywords

area coverage
deep reinforcement learning
Multi-robot

Cite this

@article{d3505f9ac71a4e9f841bf7882c312634,

title = "A Multi-robot Lunar Area Coverage Method Based on Deep Reinforcement Learning",

abstract = "When exploring celestial bodies like the Moon, lunar surface coverage becomes a critical task, necessitating multiple robots to cover as much area as possible on the lunar surface. However, the intricate lunar terrain and environmental uncertainties render traditional area coverage methods ineffective. Therefore, this paper proposes a multi-robot lunar surface area coverage approach based on deep reinforcement learning (DRL). The approach consists of two phases: the training phase and the execution phase. During the training phase, robots learn a strategy to maximize lunar surface area coverage, employing the multi-agent deep deterministic policy gradient (MADDPG) algorithm for deep reinforcement learning. In the execution phase, robots execute movements based on their current state and learned strategy to achieve lunar surface coverage. Furthermore, a series of simulation experiments were conducted, with coverage rate during the execution phase serving as the evaluation metric. Experimental results demonstrate that our approach significantly enhances lunar surface area coverage compared to traditional methods. In the future, we will further optimize the algorithm to accomplish more efficient and intelligent lunar exploration missions.",

keywords = "area coverage, deep reinforcement learning, Multi-robot",

author = "Yufei Guo and Zixuan Zheng and Qiming Liang and Jianping Yuan",

note = "Publisher Copyright: Copyright {\textcopyright} 2023 by the International Astronautical Federation (IAF). All rights reserved.; 74th International Astronautical Congress, IAC 2023 ; Conference date: 02-10-2023 Through 06-10-2023",

year = "2023",

language = "英语",

volume = "2023-October",

journal = "Proceedings of the International Astronautical Congress, IAC",

issn = "0074-1795",

publisher = "International Astronautical Federation, IAF",

}

TY - JOUR

T1 - A Multi-robot Lunar Area Coverage Method Based on Deep Reinforcement Learning

AU - Guo, Yufei

AU - Zheng, Zixuan

AU - Liang, Qiming

AU - Yuan, Jianping

PY - 2023

Y1 - 2023

N2 - When exploring celestial bodies like the Moon, lunar surface coverage becomes a critical task, necessitating multiple robots to cover as much area as possible on the lunar surface. However, the intricate lunar terrain and environmental uncertainties render traditional area coverage methods ineffective. Therefore, this paper proposes a multi-robot lunar surface area coverage approach based on deep reinforcement learning (DRL). The approach consists of two phases: the training phase and the execution phase. During the training phase, robots learn a strategy to maximize lunar surface area coverage, employing the multi-agent deep deterministic policy gradient (MADDPG) algorithm for deep reinforcement learning. In the execution phase, robots execute movements based on their current state and learned strategy to achieve lunar surface coverage. Furthermore, a series of simulation experiments were conducted, with coverage rate during the execution phase serving as the evaluation metric. Experimental results demonstrate that our approach significantly enhances lunar surface area coverage compared to traditional methods. In the future, we will further optimize the algorithm to accomplish more efficient and intelligent lunar exploration missions.

AB - When exploring celestial bodies like the Moon, lunar surface coverage becomes a critical task, necessitating multiple robots to cover as much area as possible on the lunar surface. However, the intricate lunar terrain and environmental uncertainties render traditional area coverage methods ineffective. Therefore, this paper proposes a multi-robot lunar surface area coverage approach based on deep reinforcement learning (DRL). The approach consists of two phases: the training phase and the execution phase. During the training phase, robots learn a strategy to maximize lunar surface area coverage, employing the multi-agent deep deterministic policy gradient (MADDPG) algorithm for deep reinforcement learning. In the execution phase, robots execute movements based on their current state and learned strategy to achieve lunar surface coverage. Furthermore, a series of simulation experiments were conducted, with coverage rate during the execution phase serving as the evaluation metric. Experimental results demonstrate that our approach significantly enhances lunar surface area coverage compared to traditional methods. In the future, we will further optimize the algorithm to accomplish more efficient and intelligent lunar exploration missions.

KW - area coverage

KW - deep reinforcement learning

KW - Multi-robot

UR - http://www.scopus.com/inward/record.url?scp=85187988695&partnerID=8YFLogxK

M3 - 会议文章

AN - SCOPUS:85187988695

SN - 0074-1795

VL - 2023-October

JO - Proceedings of the International Astronautical Congress, IAC

JF - Proceedings of the International Astronautical Congress, IAC

T2 - 74th International Astronautical Congress, IAC 2023

Y2 - 2 October 2023 through 6 October 2023

ER -

A Multi-robot Lunar Area Coverage Method Based on Deep Reinforcement Learning

Abstract

Keywords

Other files and links

Fingerprint

Cite this