TY - JOUR
T1 - Generalizable Occlusion-aware Human Novel View Synthesis from Image Pairs
AU - Zhao, Kaijin
AU - Huang, Xin
AU - Zhou, Guoqing
AU - Wang, Qing
N1 - Publisher Copyright:
© 1991-2012 IEEE.
PY - 2026
Y1 - 2026
N2 - We propose a novel approach for human novel view synthesis from image pairs, achieving high-quality cross-scene rendering. Existing methods struggle with occlusions, often suffering from excessive invalid sampling and sensitivity to depth estimation errors. To address these challenges, we propose GoRF, an innovative framework built upon Neural Radiance Fields (NeRF). Specifically, we develop a geometry-guided occlusion-aware mechanism that implicitly models cross-view geometric projection discrepancies and dynamically adjusts multi-view feature blending weights, mitigating occlusion ambiguity between input views. Furthermore, to enhance detail reconstruction, we propose a hybrid sampling strategy that integrates surface-guided and global sampling, effectively compensating for inaccuracies in depth estimation. By combining these strategies, our method enables occlusion-aware, high-fidelity human novel view synthesis. Extensive experiments on diverse human datasets, including THuman2.0, THuman-Sit, and DNA-Rendering, demonstrate that our approach outperforms state-of-the-art methods in both cross-pose and cross-identity scenarios.
AB - We propose a novel approach for human novel view synthesis from image pairs, achieving high-quality cross-scene rendering. Existing methods struggle with occlusions, often suffering from excessive invalid sampling and sensitivity to depth estimation errors. To address these challenges, we propose GoRF, an innovative framework built upon Neural Radiance Fields (NeRF). Specifically, we develop a geometry-guided occlusion-aware mechanism that implicitly models cross-view geometric projection discrepancies and dynamically adjusts multi-view feature blending weights, mitigating occlusion ambiguity between input views. Furthermore, to enhance detail reconstruction, we propose a hybrid sampling strategy that integrates surface-guided and global sampling, effectively compensating for inaccuracies in depth estimation. By combining these strategies, our method enables occlusion-aware, high-fidelity human novel view synthesis. Extensive experiments on diverse human datasets, including THuman2.0, THuman-Sit, and DNA-Rendering, demonstrate that our approach outperforms state-of-the-art methods in both cross-pose and cross-identity scenarios.
KW - Human Novel View Synthesis
KW - Neural Rendering
KW - Occlusion Handling
KW - Sparse-view Synthesis
UR - https://www.scopus.com/pages/publications/105028475692
U2 - 10.1109/TCSVT.2026.3656967
DO - 10.1109/TCSVT.2026.3656967
M3 - 文章
AN - SCOPUS:105028475692
SN - 1051-8215
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
ER -