Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments

Fei WANG; Xiaoping ZHU; Zhou ZHOU; Yang TANG

doi:10.1016/j.cja.2023.09.033

School of Aeronautics

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

23 Scopus citations

Abstract

In some military application scenarios, Unmanned Aerial Vehicles (UAVs) need to perform missions with the assistance of on-board cameras when radar is not available and communication is interrupted, which brings challenges for UAV autonomous navigation and collision avoidance. In this paper, an improved deep-reinforcement-learning algorithm, Deep Q-Network with a Faster R-CNN model and a Data Deposit Mechanism (FRDDM-DQN), is proposed. A Faster R-CNN model (FR) is introduced and optimized to obtain the ability to extract obstacle information from images, and a new replay memory Data Deposit Mechanism (DDM) is designed to train an agent with a better performance. During training, a two-part training approach is used to reduce the time spent on training as well as retraining when the scenario changes. In order to verify the performance of the proposed method, a series of experiments, including training experiments, test experiments, and typical episodes experiments, is conducted in a 3D simulation environment. Experimental results show that the agent trained by the proposed FRDDM-DQN has the ability to navigate autonomously and avoid collisions, and performs better compared to the FR-DQN, FR-DDQN, FR-Dueling DQN, YOLO-based YDDM-DQN, and original FR output-based FR-ODQN.
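
For orientation, the baseline that the abstract builds on (a Deep Q-Network trained from a replay memory) can be sketched as follows. This is a minimal illustration of a plain uniform replay buffer and the standard DQN target only. It is not the paper's Data Deposit Mechanism, whose details the abstract does not give, and the names `Transition`, `ReplayMemory`, and `dqn_target` are hypothetical.

```python
import random
from collections import deque, namedtuple

# One step of experience: state, action, reward, next state, terminal flag.
Transition = namedtuple("Transition", "state action reward next_state done")


class ReplayMemory:
    """Plain uniform replay buffer (the generic baseline, NOT the paper's DDM)."""

    def __init__(self, capacity):
        # A deque with maxlen evicts the oldest transition once it is full.
        self.buffer = deque(maxlen=capacity)

    def push(self, *args):
        # Store one transition; arguments follow the Transition field order.
        self.buffer.append(Transition(*args))

    def sample(self, batch_size, rng=random):
        # Draw a minibatch uniformly at random, without replacement.
        return rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


def dqn_target(reward, next_q_values, done, gamma=0.99):
    """Standard DQN target: r + gamma * max_a' Q(s', a'), or just r if terminal."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)
```

In a training loop, minibatches drawn with `sample` would supply the `reward`, next-state Q-values, and `done` flag from which `dqn_target` computes the regression target for the Q-network.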

Original language	English
Pages (from-to)	237-257
Number of pages	21
Journal	Chinese Journal of Aeronautics
Volume	37
Issue number	3
DOIs	https://doi.org/10.1016/j.cja.2023.09.033
State	Published - Mar 2024

Keywords

Faster R-CNN model
Image-based Autonomous Navigation and Collision Avoidance (ANCA)
Replay memory Data Deposit Mechanism (DDM)
Two-part training approach
Unmanned Aerial Vehicle (UAV)

Cite this

@article{2b12c57c7dae4c358471c5c27153c180,

title = "Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments",

abstract = "In some military application scenarios, Unmanned Aerial Vehicles (UAVs) need to perform missions with the assistance of on-board cameras when radar is not available and communication is interrupted, which brings challenges for UAV autonomous navigation and collision avoidance. In this paper, an improved deep-reinforcement-learning algorithm, Deep Q-Network with a Faster R-CNN model and a Data Deposit Mechanism (FRDDM-DQN), is proposed. A Faster R-CNN model (FR) is introduced and optimized to obtain the ability to extract obstacle information from images, and a new replay memory Data Deposit Mechanism (DDM) is designed to train an agent with a better performance. During training, a two-part training approach is used to reduce the time spent on training as well as retraining when the scenario changes. In order to verify the performance of the proposed method, a series of experiments, including training experiments, test experiments, and typical episodes experiments, is conducted in a 3D simulation environment. Experimental results show that the agent trained by the proposed FRDDM-DQN has the ability to navigate autonomously and avoid collisions, and performs better compared to the FR-DQN, FR-DDQN, FR-Dueling DQN, YOLO-based YDDM-DQN, and original FR output-based FR-ODQN.",

keywords = "Faster R-CNN model, Image-based Autonomous Navigation and Collision Avoidance (ANCA), Replay memory Data Deposit Mechanism (DDM), Two-part training approach, Unmanned Aerial Vehicle (UAV)",

author = "Fei WANG and Xiaoping ZHU and Zhou ZHOU and Yang TANG",

note = "Publisher Copyright: {\textcopyright} 2024 Chinese Society of Aeronautics and Astronautics",

year = "2024",

month = mar,

doi = "10.1016/j.cja.2023.09.033",

language = "English",

volume = "37",

pages = "237--257",

journal = "Chinese Journal of Aeronautics",

issn = "1000-9361",

publisher = "Elsevier B.V.",

number = "3",

}

TY - JOUR

T1 - Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments

AU - WANG, Fei

AU - ZHU, Xiaoping

AU - ZHOU, Zhou

AU - TANG, Yang

PY - 2024/3

Y1 - 2024/3

N2 - In some military application scenarios, Unmanned Aerial Vehicles (UAVs) need to perform missions with the assistance of on-board cameras when radar is not available and communication is interrupted, which brings challenges for UAV autonomous navigation and collision avoidance. In this paper, an improved deep-reinforcement-learning algorithm, Deep Q-Network with a Faster R-CNN model and a Data Deposit Mechanism (FRDDM-DQN), is proposed. A Faster R-CNN model (FR) is introduced and optimized to obtain the ability to extract obstacle information from images, and a new replay memory Data Deposit Mechanism (DDM) is designed to train an agent with a better performance. During training, a two-part training approach is used to reduce the time spent on training as well as retraining when the scenario changes. In order to verify the performance of the proposed method, a series of experiments, including training experiments, test experiments, and typical episodes experiments, is conducted in a 3D simulation environment. Experimental results show that the agent trained by the proposed FRDDM-DQN has the ability to navigate autonomously and avoid collisions, and performs better compared to the FR-DQN, FR-DDQN, FR-Dueling DQN, YOLO-based YDDM-DQN, and original FR output-based FR-ODQN.

KW - Faster R-CNN model

KW - Image-based Autonomous Navigation and Collision Avoidance (ANCA)

KW - Replay memory Data Deposit Mechanism (DDM)

KW - Two-part training approach

KW - Unmanned Aerial Vehicle (UAV)

UR - http://www.scopus.com/inward/record.url?scp=85184055357&partnerID=8YFLogxK

U2 - 10.1016/j.cja.2023.09.033

DO - 10.1016/j.cja.2023.09.033

M3 - Article

AN - SCOPUS:85184055357

SN - 1000-9361

VL - 37

SP - 237

EP - 257

JO - Chinese Journal of Aeronautics

JF - Chinese Journal of Aeronautics

IS - 3

ER -
