基于强化学习的改进三维A*算法在线航迹规划

Zhi Ren; Dong Zhang; Shuo Tang

doi:10.12305/j.issn.1001-506X.2023.01.23

Translated title of the contribution: Improved three-dimensional A* algorithm of real-time path planning based on reinforcement learning

School of Astronautics

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

To meet the strict real-time and optimality requirements of online path planning, a three-dimensional A* algorithm is improved using reinforcement learning. First, a shrinkage factor is introduced to refine how heuristic information is weighted in the improved cost function, which improves time performance. Second, a measurement model is established to quantify the real-time performance and optimality of the algorithm; combined with the deterministic policy gradient method, the action-state and reward functions are designed to optimize the shrinkage factor. Finally, the improved three-dimensional A* algorithm is simulated in multiple scenarios, and the results show that it preserves the optimality of the planned track while effectively improving time performance.
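The abstract does not give the cost function or the exact role of the shrinkage factor, so the following is only a sketch of the general idea of weighting heuristic information in A*. It implements a generic weighted A* on a 26-connected 3D grid, where the hypothetical scalar `weight` scales the Euclidean heuristic. The function name, grid representation, and parameters are all assumptions made for illustration; the paper's shrinkage factor and its tuning by a deterministic policy gradient method are not reproduced here.

```python
import heapq
import math
from itertools import product


def weighted_astar_3d(grid_shape, obstacles, start, goal, weight=1.0):
    """Generic weighted A* on a 26-connected 3D grid (illustrative only).

    `weight` is a hypothetical stand-in for a heuristic weighting term;
    it is NOT the shrinkage factor defined in the paper.
    Returns (path, number_of_expanded_nodes); path is None if unreachable.
    """
    # All 26 neighbour offsets in 3D, excluding the zero move.
    moves = [m for m in product((-1, 0, 1), repeat=3) if m != (0, 0, 0)]

    def heuristic(p):
        # Straight-line distance to the goal.
        return math.dist(p, goal)

    open_heap = [(weight * heuristic(start), 0.0, start)]
    g_cost = {start: 0.0}
    parent = {start: None}
    closed = set()

    while open_heap:
        _, g, node = heapq.heappop(open_heap)
        if node in closed:
            continue  # stale heap entry
        closed.add(node)
        if node == goal:
            # Walk parent links back to the start.
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1], len(closed)
        for dx, dy, dz in moves:
            nxt = (node[0] + dx, node[1] + dy, node[2] + dz)
            if not all(0 <= c < n for c, n in zip(nxt, grid_shape)):
                continue  # outside the grid
            if nxt in obstacles or nxt in closed:
                continue
            new_g = g + math.sqrt(dx * dx + dy * dy + dz * dz)
            if new_g < g_cost.get(nxt, math.inf):
                g_cost[nxt] = new_g
                parent[nxt] = node
                # f = g + weight * h: a larger weight makes the search greedier.
                heapq.heappush(
                    open_heap, (new_g + weight * heuristic(nxt), new_g, nxt)
                )
    return None, len(closed)


if __name__ == "__main__":
    blocked = {(2, 2, 2)}
    for w in (1.0, 2.0):
        path, expanded = weighted_astar_3d(
            (5, 5, 5), blocked, (0, 0, 0), (4, 4, 4), weight=w
        )
        print(f"weight={w}: {len(path)} waypoints, {expanded} nodes expanded")
```

With `weight=1.0` this reduces to standard A*. Larger values typically expand fewer nodes at the possible cost of a longer path, which is the kind of time-versus-optimality trade-off that a learned weighting term would be tuned to balance.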

Original language	Chinese (Traditional)
Pages (from-to)	193-201
Number of pages	9
Journal	Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics
Volume	45
Issue number	1
DOIs	https://doi.org/10.12305/j.issn.1001-506X.2023.01.23
State	Published - Jan 2023

Cite this

@article{b0b7d07bc90844529809a95c30dd8326,

title = "基于强化学习的改进三维A*算法在线航迹规划",

abstract = "To meet the strict real-time and optimality requirements of online path planning, a three-dimensional A* algorithm is improved using reinforcement learning. First, a shrinkage factor is introduced to refine how heuristic information is weighted in the improved cost function, which improves time performance. Second, a measurement model is established to quantify the real-time performance and optimality of the algorithm; combined with the deterministic policy gradient method, the action-state and reward functions are designed to optimize the shrinkage factor. Finally, the improved three-dimensional A* algorithm is simulated in multiple scenarios, and the results show that it preserves the optimality of the planned track while effectively improving time performance.",

keywords = "deep deterministic policy gradient, improved A* algorithm, real-time path planning, reinforcement learning, shrinkage factor",

author = "Zhi Ren and Dong Zhang and Shuo Tang",

year = "2023",

month = jan,

doi = "10.12305/j.issn.1001-506X.2023.01.23",

language = "Chinese (Traditional)",

volume = "45",

pages = "193--201",

journal = "Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics",

issn = "1001-506X",

publisher = "Chinese Institute of Electronics",

number = "1",

}

TY - JOUR

T1 - 基于强化学习的改进三维A*算法在线航迹规划

AU - Ren, Zhi

AU - Zhang, Dong

AU - Tang, Shuo

PY - 2023/1

Y1 - 2023/1

N2 - To meet the strict real-time and optimality requirements of online path planning, a three-dimensional A* algorithm is improved using reinforcement learning. First, a shrinkage factor is introduced to refine how heuristic information is weighted in the improved cost function, which improves time performance. Second, a measurement model is established to quantify the real-time performance and optimality of the algorithm; combined with the deterministic policy gradient method, the action-state and reward functions are designed to optimize the shrinkage factor. Finally, the improved three-dimensional A* algorithm is simulated in multiple scenarios, and the results show that it preserves the optimality of the planned track while effectively improving time performance.

KW - deep deterministic policy gradient

KW - improved A* algorithm

KW - real-time path planning

KW - reinforcement learning

KW - shrinkage factor

UR - http://www.scopus.com/inward/record.url?scp=85148375676&partnerID=8YFLogxK

U2 - 10.12305/j.issn.1001-506X.2023.01.23

DO - 10.12305/j.issn.1001-506X.2023.01.23

M3 - Article

AN - SCOPUS:85148375676

SN - 1001-506X

VL - 45

SP - 193

EP - 201

JO - Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics

JF - Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics

IS - 1

ER -
