TY - JOUR
T1 - ETV-MVS
T2 - Robust Visibility-Aware Multi-View Stereo with Epipolar Line-Based Transformer
AU - Wang, Shaoqian
AU - Ding, Xiaokun
AU - Mao, Yuxin
AU - Dai, Yuchao
N1 - Publisher Copyright:
© 2018 Tsinghua University Press.
PY - 2025
Y1 - 2025
N2 - Multi-View Stereo (MVS) is a pivotal technique in computer vision for reconstructing 3D models frommultiple images by estimating depth maps. However, the reconstruction performance is hindered by visibilitychallenges, such as occlusions and non-overlapping regions. In this paper, we propose an innovative visibility-aware framework to address these issues. Central to our method is an Epipolar Line-based Transformer (ELT)module, which capitalizes on the epipolar line correspondence and candidate matching features betweenimages to enhance the feature representation and correlation robustness. Furthermore, we propose a novelSupervised Visibility Estimation (SVE) module that estimates high-precision visibility maps, transcending theconstraints of previous methods that rely on indirect supervision. By integrating these modules, our methodachieves state-of-the-art results on the benchmarks and demonstrates its capability to perform high-qualityreconstructions even in challenging regions.
AB - Multi-View Stereo (MVS) is a pivotal technique in computer vision for reconstructing 3D models frommultiple images by estimating depth maps. However, the reconstruction performance is hindered by visibilitychallenges, such as occlusions and non-overlapping regions. In this paper, we propose an innovative visibility-aware framework to address these issues. Central to our method is an Epipolar Line-based Transformer (ELT)module, which capitalizes on the epipolar line correspondence and candidate matching features betweenimages to enhance the feature representation and correlation robustness. Furthermore, we propose a novelSupervised Visibility Estimation (SVE) module that estimates high-precision visibility maps, transcending theconstraints of previous methods that rely on indirect supervision. By integrating these modules, our methodachieves state-of-the-art results on the benchmarks and demonstrates its capability to perform high-qualityreconstructions even in challenging regions.
KW - Deep Neural Networks (DNN)
KW - epipolar geometry
KW - Multi-View Stereo (MVS)
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=105003599880&partnerID=8YFLogxK
U2 - 10.26599/BDMA.2024.9020075
DO - 10.26599/BDMA.2024.9020075
M3 - 文章
AN - SCOPUS:105003599880
SN - 2096-0654
VL - 8
SP - 520
EP - 533
JO - Big Data Mining and Analytics
JF - Big Data Mining and Analytics
IS - 3
ER -