TY - JOUR
T1 - Space Target Pose Estimation Framework with Deep Reinforcement Learning Technique
AU - Yuan, Jing
AU - Che, Dejia
AU - Guo, Yufei
AU - Yuan, Jianping
N1 - Publisher Copyright:
Copyright © 2022 by the International Astronautical Federation (IAF). All rights reserved.
PY - 2022
Y1 - 2022
N2 - Vision-based relative navigation technology is the key technology to complete on orbit service, space debris removal, and formation flight. One of the current challenges to this technology is to estimate the attitude of uncooperative targets that do not provide any navigation assistance. Previously, the development of vision-based relative navigation technology has mostly relied on image processing and template matching techniques, such as feature-based matching and tracking methods, based on PnP or PnL methods. Line features were often adopted for pose estimation in important on-orbit experiments since the advantages of robust detection and more structural information and less likely to be affected by occlusions. When the corresponding relationship between the model lines and the image projection lines is known, an accurate relative pose could be solved by Newton and Levenberg-Marquardt methods. For unknown line correspondences case, this problem is difficult to solve. This paper explores a pose estimation framework that uses deep reinforcement learning to learn the matching relationship and relative pose. The Deep Deterministic Policy Gradient (DDPG) algorithm is adopted since it supports the continuous action space and continuous status space. While the MDP is delicated designed, with the rotations angle being included in the action space, through continuous attempts and exploration, the model would finally reaches the "correct" posture, and create a match between the model lines and image lines.
AB - Vision-based relative navigation technology is the key technology to complete on orbit service, space debris removal, and formation flight. One of the current challenges to this technology is to estimate the attitude of uncooperative targets that do not provide any navigation assistance. Previously, the development of vision-based relative navigation technology has mostly relied on image processing and template matching techniques, such as feature-based matching and tracking methods, based on PnP or PnL methods. Line features were often adopted for pose estimation in important on-orbit experiments since the advantages of robust detection and more structural information and less likely to be affected by occlusions. When the corresponding relationship between the model lines and the image projection lines is known, an accurate relative pose could be solved by Newton and Levenberg-Marquardt methods. For unknown line correspondences case, this problem is difficult to solve. This paper explores a pose estimation framework that uses deep reinforcement learning to learn the matching relationship and relative pose. The Deep Deterministic Policy Gradient (DDPG) algorithm is adopted since it supports the continuous action space and continuous status space. While the MDP is delicated designed, with the rotations angle being included in the action space, through continuous attempts and exploration, the model would finally reaches the "correct" posture, and create a match between the model lines and image lines.
KW - DDPG
KW - Deep Reinforcement Learning
KW - pose estimation
UR - http://www.scopus.com/inward/record.url?scp=85167610324&partnerID=8YFLogxK
M3 - 会议文章
AN - SCOPUS:85167610324
SN - 0074-1795
VL - 2022-September
JO - Proceedings of the International Astronautical Congress, IAC
JF - Proceedings of the International Astronautical Congress, IAC
T2 - 73rd International Astronautical Congress, IAC 2022
Y2 - 18 September 2022 through 22 September 2022
ER -