TY - GEN
T1 - SGU-SLAM
T2 - 7th International Symposium on Robotics and Intelligent Manufacturing Technology, ISRIMT 2025
AU - Zheng, Shaobo
AU - Hua, Zexi
AU - Tang, Yongchuan
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Reliable localization and mapping in highly dynamic environments are key challenges for Visual SLAM due to the violation of the static world assumption. Deep learning-based semantic VSLAM methods face a trade-off: instance segmentation is computationally costly, while object detection's coarse bounding boxes discard valid background features, degrading pose accuracy. To address this, we propose a robust RGB-D SLAM framework based on ORB-SLAM3, integrating a lightweight object detector with a probabilistic semantic-geometric fusion mechanism. An Adaptive Gaussian Mixture Model recovers static background features within dynamic bounding boxes, avoiding heavy segmentation networks. Additionally, a geometry-based probabilistic filter identifies unknown dynamic objects via epipolar constraints. Instead of hard dynamic feature elimination, we incorporate temporal belief states into an uncertainty-embedded factor graph optimization to dynamically down-weight unreliable landmarks. Extensive experiments on the TUM RGB-D dataset show our system outperforms state-of-the-art dynamic SLAM methods in high-dynamic sequences while maintaining real-time performance.
AB - Reliable localization and mapping in highly dynamic environments are key challenges for Visual SLAM due to the violation of the static world assumption. Deep learning-based semantic VSLAM methods face a trade-off: instance segmentation is computationally costly, while object detection's coarse bounding boxes discard valid background features, degrading pose accuracy. To address this, we propose a robust RGB-D SLAM framework based on ORB-SLAM3, integrating a lightweight object detector with a probabilistic semantic-geometric fusion mechanism. An Adaptive Gaussian Mixture Model recovers static background features within dynamic bounding boxes, avoiding heavy segmentation networks. Additionally, a geometry-based probabilistic filter identifies unknown dynamic objects via epipolar constraints. Instead of hard dynamic feature elimination, we incorporate temporal belief states into an uncertainty-embedded factor graph optimization to dynamically down-weight unreliable landmarks. Extensive experiments on the TUM RGB-D dataset show our system outperforms state-of-the-art dynamic SLAM methods in high-dynamic sequences while maintaining real-time performance.
KW - dynamic environments
KW - gaussian mixture model
KW - semantic-geometric fusion
KW - Visual SLAM
UR - https://www.scopus.com/pages/publications/105035091600
U2 - 10.1109/ISRIMT67769.2025.11413226
DO - 10.1109/ISRIMT67769.2025.11413226
M3 - 会议稿件
AN - SCOPUS:105035091600
T3 - 2025 7th International Symposium on Robotics and Intelligent Manufacturing Technology, ISRIMT 2025
SP - 102
EP - 107
BT - 2025 7th International Symposium on Robotics and Intelligent Manufacturing Technology, ISRIMT 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 12 December 2025 through 14 December 2025
ER -