Learning Reliable Gradients from Undersampled Circular Light Field for 3D Reconstruction

Zhengxi Song; Xue Wang; Hao Zhu; Guoqing Zhou; Qing Wang

doi:10.1109/TVCG.2022.3206207

Learning Reliable Gradients from Undersampled Circular Light Field for 3D Reconstruction

Zhengxi Song, Xue Wang, Hao Zhu, Guoqing Zhou, Qing Wang

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

8 引用（Scopus）

摘要

The paper presents a 3D reconstruction algorithm from an undersampled circular light field (LF). With an ultra-dense angular sampling rate, every scene point captured by a circular LF corresponds to a smooth trajectory in the circular epipolar plane volume (CEPV). Thus per-pixel disparities can be calculated by retrieving the local gradients of the CEPV-trajectories. However, the continuous curve will be broken up into discrete segments in an undersampled circular LF, which leads to a noticeable deterioration of the 3D reconstruction accuracy. We observe that the coherent structure is still embedded in the discrete segments. With less noise and ambiguity, the scene points can be reconstructed using gradients from reliable epipolar plane image (EPI) regions. By analyzing the geometric characteristics of the coherent structure in the CEPV, both the trajectory itself and its gradients could be modeled as 3D predictable series. Thus a mask-guided CNN+LSTM network is proposed to learn the mapping from the CEPV with a lower angular sampling rate to the gradients under a higher angular sampling rate. To segment the reliable regions, the reliable-mask-based loss that assesses the difference between learned gradients and ground truth gradients is added to the loss function. We construct a synthetic circular LF dataset with ground truth for depth and foreground/background segmentation to train the network. Moreover, a real-scene circular LF dataset is collected for performance evaluation. Experimental results on both public and self-constructed datasets demonstrate the superiority of the proposed method over existing state-of-the-art methods.

源语言	英语
页（从-至）	5194-5207
页数	14
期刊	IEEE Transactions on Visualization and Computer Graphics
卷	29
期	12
DOI	https://doi.org/10.1109/TVCG.2022.3206207
出版状态	已出版 - 1 12月 2023

访问文件

10.1109/TVCG.2022.3206207

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{e6ef08dd2f0041c9bf4c1ad54ec052e6,

title = "Learning Reliable Gradients from Undersampled Circular Light Field for 3D Reconstruction",

abstract = "The paper presents a 3D reconstruction algorithm from an undersampled circular light field (LF). With an ultra-dense angular sampling rate, every scene point captured by a circular LF corresponds to a smooth trajectory in the circular epipolar plane volume (CEPV). Thus per-pixel disparities can be calculated by retrieving the local gradients of the CEPV-trajectories. However, the continuous curve will be broken up into discrete segments in an undersampled circular LF, which leads to a noticeable deterioration of the 3D reconstruction accuracy. We observe that the coherent structure is still embedded in the discrete segments. With less noise and ambiguity, the scene points can be reconstructed using gradients from reliable epipolar plane image (EPI) regions. By analyzing the geometric characteristics of the coherent structure in the CEPV, both the trajectory itself and its gradients could be modeled as 3D predictable series. Thus a mask-guided CNN+LSTM network is proposed to learn the mapping from the CEPV with a lower angular sampling rate to the gradients under a higher angular sampling rate. To segment the reliable regions, the reliable-mask-based loss that assesses the difference between learned gradients and ground truth gradients is added to the loss function. We construct a synthetic circular LF dataset with ground truth for depth and foreground/background segmentation to train the network. Moreover, a real-scene circular LF dataset is collected for performance evaluation. Experimental results on both public and self-constructed datasets demonstrate the superiority of the proposed method over existing state-of-the-art methods.",

keywords = "3d reconstruction, circular epipolar plane volume (CEPV), circular light field, CNN+LSTM",

author = "Zhengxi Song and Xue Wang and Hao Zhu and Guoqing Zhou and Qing Wang",

note = "Publisher Copyright: {\textcopyright} 1995-2012 IEEE.",

year = "2023",

month = dec,

day = "1",

doi = "10.1109/TVCG.2022.3206207",

language = "英语",

volume = "29",

pages = "5194--5207",

journal = "IEEE Transactions on Visualization and Computer Graphics",

issn = "1077-2626",

publisher = "IEEE Computer Society",

number = "12",

}

TY - JOUR

T1 - Learning Reliable Gradients from Undersampled Circular Light Field for 3D Reconstruction

AU - Song, Zhengxi

AU - Wang, Xue

AU - Zhu, Hao

AU - Zhou, Guoqing

AU - Wang, Qing

PY - 2023/12/1

Y1 - 2023/12/1

N2 - The paper presents a 3D reconstruction algorithm from an undersampled circular light field (LF). With an ultra-dense angular sampling rate, every scene point captured by a circular LF corresponds to a smooth trajectory in the circular epipolar plane volume (CEPV). Thus per-pixel disparities can be calculated by retrieving the local gradients of the CEPV-trajectories. However, the continuous curve will be broken up into discrete segments in an undersampled circular LF, which leads to a noticeable deterioration of the 3D reconstruction accuracy. We observe that the coherent structure is still embedded in the discrete segments. With less noise and ambiguity, the scene points can be reconstructed using gradients from reliable epipolar plane image (EPI) regions. By analyzing the geometric characteristics of the coherent structure in the CEPV, both the trajectory itself and its gradients could be modeled as 3D predictable series. Thus a mask-guided CNN+LSTM network is proposed to learn the mapping from the CEPV with a lower angular sampling rate to the gradients under a higher angular sampling rate. To segment the reliable regions, the reliable-mask-based loss that assesses the difference between learned gradients and ground truth gradients is added to the loss function. We construct a synthetic circular LF dataset with ground truth for depth and foreground/background segmentation to train the network. Moreover, a real-scene circular LF dataset is collected for performance evaluation. Experimental results on both public and self-constructed datasets demonstrate the superiority of the proposed method over existing state-of-the-art methods.

AB - The paper presents a 3D reconstruction algorithm from an undersampled circular light field (LF). With an ultra-dense angular sampling rate, every scene point captured by a circular LF corresponds to a smooth trajectory in the circular epipolar plane volume (CEPV). Thus per-pixel disparities can be calculated by retrieving the local gradients of the CEPV-trajectories. However, the continuous curve will be broken up into discrete segments in an undersampled circular LF, which leads to a noticeable deterioration of the 3D reconstruction accuracy. We observe that the coherent structure is still embedded in the discrete segments. With less noise and ambiguity, the scene points can be reconstructed using gradients from reliable epipolar plane image (EPI) regions. By analyzing the geometric characteristics of the coherent structure in the CEPV, both the trajectory itself and its gradients could be modeled as 3D predictable series. Thus a mask-guided CNN+LSTM network is proposed to learn the mapping from the CEPV with a lower angular sampling rate to the gradients under a higher angular sampling rate. To segment the reliable regions, the reliable-mask-based loss that assesses the difference between learned gradients and ground truth gradients is added to the loss function. We construct a synthetic circular LF dataset with ground truth for depth and foreground/background segmentation to train the network. Moreover, a real-scene circular LF dataset is collected for performance evaluation. Experimental results on both public and self-constructed datasets demonstrate the superiority of the proposed method over existing state-of-the-art methods.

KW - 3d reconstruction

KW - circular epipolar plane volume (CEPV)

KW - circular light field

KW - CNN+LSTM

UR - http://www.scopus.com/inward/record.url?scp=85139399084&partnerID=8YFLogxK

U2 - 10.1109/TVCG.2022.3206207

DO - 10.1109/TVCG.2022.3206207

M3 - 文章

C2 - 36099223

AN - SCOPUS:85139399084

SN - 1077-2626

VL - 29

SP - 5194

EP - 5207

JO - IEEE Transactions on Visualization and Computer Graphics

JF - IEEE Transactions on Visualization and Computer Graphics

IS - 12

ER -

Learning Reliable Gradients from Undersampled Circular Light Field for 3D Reconstruction

摘要

访问文件

其它文件与链接

指纹

引用此