TY - JOUR
T1 - H2O-NeRF
T2 - Radiance Fields Reconstruction for Two-Hand-Held Objects
AU - Liu, Xinxin
AU - Zhang, Qi
AU - Huang, Xin
AU - Feng, Ying
AU - Zhou, Guoqing
AU - Wang, Qing
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Our work aims to reconstruct the appearance and geometry of a two-hand-held object from a sequence of color images. In contrast to traditional single-hand-held manipulation, two-hand holding allows more flexible interaction and thereby provides back views of the object, which is particularly convenient for reconstruction but generates complex view-dependent occlusions. Recent developments in neural rendering provide new potential for hand-held object reconstruction. In this paper, we propose a novel neural representation-based framework, named H2O-NeRF, to recover the radiance fields of a two-hand-held object. We first design an object-centric semantic module based on geometric signed distance function cues to predict 3D object-centric regions, and develop a view-dependent visible module based on image-related cues to label 2D occluded regions. We then combine them to obtain a 2D visible mask that adaptively guides ray sampling on the object for optimization. We also provide a newly collected H2O dataset to validate the proposed method. Experiments show that our method achieves superior performance in reconstruction completeness and view-consistent synthesis compared to state-of-the-art methods.
AB - Our work aims to reconstruct the appearance and geometry of a two-hand-held object from a sequence of color images. In contrast to traditional single-hand-held manipulation, two-hand holding allows more flexible interaction and thereby provides back views of the object, which is particularly convenient for reconstruction but generates complex view-dependent occlusions. Recent developments in neural rendering provide new potential for hand-held object reconstruction. In this paper, we propose a novel neural representation-based framework, named H2O-NeRF, to recover the radiance fields of a two-hand-held object. We first design an object-centric semantic module based on geometric signed distance function cues to predict 3D object-centric regions, and develop a view-dependent visible module based on image-related cues to label 2D occluded regions. We then combine them to obtain a 2D visible mask that adaptively guides ray sampling on the object for optimization. We also provide a newly collected H2O dataset to validate the proposed method. Experiments show that our method achieves superior performance in reconstruction completeness and view-consistent synthesis compared to state-of-the-art methods.
KW - Adaptive ray sampling
KW - Anti-occlusion
KW - Neural radiance fields
KW - Two-hand-held object reconstruction
KW - View synthesis
UR - http://www.scopus.com/inward/record.url?scp=105001105203&partnerID=8YFLogxK
U2 - 10.1109/TVCG.2025.3553975
DO - 10.1109/TVCG.2025.3553975
M3 - Article
AN - SCOPUS:105001105203
SN - 1077-2626
JO - IEEE Transactions on Visualization and Computer Graphics
JF - IEEE Transactions on Visualization and Computer Graphics
ER -