TY - JOUR
T1 - 基于强化学习的改进三维A*算法在线航迹规划
AU - Ren, Zhi
AU - Zhang, Dong
AU - Tang, Shuo
N1 - Publisher Copyright:
© 2023 Chinese Institute of Electronics. All rights reserved.
PY - 2023/1
Y1 - 2023/1
N2 - In order to address the problem of high requirements for real-time performance and optimality of real-time path planning, a three-dimensional A∗ algorithm is improved based on the reinforcement learning method. Firstly, the shrinkage factor is introduced to ameliorate the heuristic information weighting method of the improved cost function, so as to improve the time performance. Secondly, a measurement model is established to measure the real-time performance and optimality of the algorithm. Combined with the deterministic policy gradient method, the action-state and reward functions are designed to optimize the shrinkage factor. Finally, the improved three-dimensional A∗ algorithm is simulated in multiple scenarios, and the simulation results show that the improved algorithm can ensure the optimality of the track results and effectively improve the time performance of the algorithm.
AB - In order to address the problem of high requirements for real-time performance and optimality of real-time path planning, a three-dimensional A∗ algorithm is improved based on the reinforcement learning method. Firstly, the shrinkage factor is introduced to ameliorate the heuristic information weighting method of the improved cost function, so as to improve the time performance. Secondly, a measurement model is established to measure the real-time performance and optimality of the algorithm. Combined with the deterministic policy gradient method, the action-state and reward functions are designed to optimize the shrinkage factor. Finally, the improved three-dimensional A∗ algorithm is simulated in multiple scenarios, and the simulation results show that the improved algorithm can ensure the optimality of the track results and effectively improve the time performance of the algorithm.
KW - algorithm
KW - deep deterministic policy gradient
KW - improved A
KW - real-time path planning
KW - reinforcement learning
KW - shrinkage factor
UR - http://www.scopus.com/inward/record.url?scp=85148375676&partnerID=8YFLogxK
U2 - 10.12305/j.issn.1001-506X.2023.01.23
DO - 10.12305/j.issn.1001-506X.2023.01.23
M3 - 文章
AN - SCOPUS:85148375676
SN - 1001-506X
VL - 45
SP - 193
EP - 201
JO - Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics
JF - Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics
IS - 1
ER -