Space Target Pose Estimation Framework with Deep Reinforcement Learning Technique

Jing Yuan; Dejia Che; Yufei Guo; Jianping Yuan

Space Target Pose Estimation Framework with Deep Reinforcement Learning Technique

Jing Yuan, Dejia Che, Yufei Guo, Jianping Yuan

航天学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 会议文章 › 同行评审

摘要

Vision-based relative navigation technology is the key technology to complete on orbit service, space debris removal, and formation flight. One of the current challenges to this technology is to estimate the attitude of uncooperative targets that do not provide any navigation assistance. Previously, the development of vision-based relative navigation technology has mostly relied on image processing and template matching techniques, such as feature-based matching and tracking methods, based on PnP or PnL methods. Line features were often adopted for pose estimation in important on-orbit experiments since the advantages of robust detection and more structural information and less likely to be affected by occlusions. When the corresponding relationship between the model lines and the image projection lines is known, an accurate relative pose could be solved by Newton and Levenberg-Marquardt methods. For unknown line correspondences case, this problem is difficult to solve. This paper explores a pose estimation framework that uses deep reinforcement learning to learn the matching relationship and relative pose. The Deep Deterministic Policy Gradient (DDPG) algorithm is adopted since it supports the continuous action space and continuous status space. While the MDP is delicated designed, with the rotations angle being included in the action space, through continuous attempts and exploration, the model would finally reaches the "correct" posture, and create a match between the model lines and image lines.

源语言	英语
期刊	Proceedings of the International Astronautical Congress, IAC
卷	2022-September
出版状态	已出版 - 2022
活动	73rd International Astronautical Congress, IAC 2022 - Paris, 法国期限: 18 9月 2022 → 22 9月 2022

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{25caf238452f421fbd334fdfb560dec3,

title = "Space Target Pose Estimation Framework with Deep Reinforcement Learning Technique",

abstract = "Vision-based relative navigation technology is the key technology to complete on orbit service, space debris removal, and formation flight. One of the current challenges to this technology is to estimate the attitude of uncooperative targets that do not provide any navigation assistance. Previously, the development of vision-based relative navigation technology has mostly relied on image processing and template matching techniques, such as feature-based matching and tracking methods, based on PnP or PnL methods. Line features were often adopted for pose estimation in important on-orbit experiments since the advantages of robust detection and more structural information and less likely to be affected by occlusions. When the corresponding relationship between the model lines and the image projection lines is known, an accurate relative pose could be solved by Newton and Levenberg-Marquardt methods. For unknown line correspondences case, this problem is difficult to solve. This paper explores a pose estimation framework that uses deep reinforcement learning to learn the matching relationship and relative pose. The Deep Deterministic Policy Gradient (DDPG) algorithm is adopted since it supports the continuous action space and continuous status space. While the MDP is delicated designed, with the rotations angle being included in the action space, through continuous attempts and exploration, the model would finally reaches the {"}correct{"} posture, and create a match between the model lines and image lines.",

keywords = "DDPG, Deep Reinforcement Learning, pose estimation",

author = "Jing Yuan and Dejia Che and Yufei Guo and Jianping Yuan",

note = "Publisher Copyright: Copyright {\textcopyright} 2022 by the International Astronautical Federation (IAF). All rights reserved.; 73rd International Astronautical Congress, IAC 2022 ; Conference date: 18-09-2022 Through 22-09-2022",

year = "2022",

language = "英语",

volume = "2022-September",

journal = "Proceedings of the International Astronautical Congress, IAC",

issn = "0074-1795",

publisher = "International Astronautical Federation, IAF",

}

TY - JOUR

T1 - Space Target Pose Estimation Framework with Deep Reinforcement Learning Technique

AU - Yuan, Jing

AU - Che, Dejia

AU - Guo, Yufei

AU - Yuan, Jianping

PY - 2022

Y1 - 2022

N2 - Vision-based relative navigation technology is the key technology to complete on orbit service, space debris removal, and formation flight. One of the current challenges to this technology is to estimate the attitude of uncooperative targets that do not provide any navigation assistance. Previously, the development of vision-based relative navigation technology has mostly relied on image processing and template matching techniques, such as feature-based matching and tracking methods, based on PnP or PnL methods. Line features were often adopted for pose estimation in important on-orbit experiments since the advantages of robust detection and more structural information and less likely to be affected by occlusions. When the corresponding relationship between the model lines and the image projection lines is known, an accurate relative pose could be solved by Newton and Levenberg-Marquardt methods. For unknown line correspondences case, this problem is difficult to solve. This paper explores a pose estimation framework that uses deep reinforcement learning to learn the matching relationship and relative pose. The Deep Deterministic Policy Gradient (DDPG) algorithm is adopted since it supports the continuous action space and continuous status space. While the MDP is delicated designed, with the rotations angle being included in the action space, through continuous attempts and exploration, the model would finally reaches the "correct" posture, and create a match between the model lines and image lines.

AB - Vision-based relative navigation technology is the key technology to complete on orbit service, space debris removal, and formation flight. One of the current challenges to this technology is to estimate the attitude of uncooperative targets that do not provide any navigation assistance. Previously, the development of vision-based relative navigation technology has mostly relied on image processing and template matching techniques, such as feature-based matching and tracking methods, based on PnP or PnL methods. Line features were often adopted for pose estimation in important on-orbit experiments since the advantages of robust detection and more structural information and less likely to be affected by occlusions. When the corresponding relationship between the model lines and the image projection lines is known, an accurate relative pose could be solved by Newton and Levenberg-Marquardt methods. For unknown line correspondences case, this problem is difficult to solve. This paper explores a pose estimation framework that uses deep reinforcement learning to learn the matching relationship and relative pose. The Deep Deterministic Policy Gradient (DDPG) algorithm is adopted since it supports the continuous action space and continuous status space. While the MDP is delicated designed, with the rotations angle being included in the action space, through continuous attempts and exploration, the model would finally reaches the "correct" posture, and create a match between the model lines and image lines.

KW - DDPG

KW - Deep Reinforcement Learning

KW - pose estimation

UR - http://www.scopus.com/inward/record.url?scp=85167610324&partnerID=8YFLogxK

M3 - 会议文章

AN - SCOPUS:85167610324

SN - 0074-1795

VL - 2022-September

JO - Proceedings of the International Astronautical Congress, IAC

JF - Proceedings of the International Astronautical Congress, IAC

T2 - 73rd International Astronautical Congress, IAC 2022

Y2 - 18 September 2022 through 22 September 2022

ER -

Space Target Pose Estimation Framework with Deep Reinforcement Learning Technique

摘要

其它文件与链接

指纹

引用此