TY - GEN
T1 - A multi-camera network system for markerless 3D human body voxel reconstruction
AU - Yang, Tao
AU - Zhang, Yanning
AU - Li, Meng
AU - Shao, Dapei
AU - Zhang, Xingong
PY - 2009
Y1 - 2009
N2 - This paper presents a fully automated system for realtime 3D human visual hull reconstruction and skeleton voxels extraction. The main contributions include: (1) A novel network based system is presented, which uses AXIS network cameras as video capture device, and performs a parallel processing among data capture, 3D voxel reconstruction and display. (2) A new human visual hull reconstruction algorithm is given. This approach firstly segments the foreground accurately by an efficient Gaussian Mixture Model (GMM) and a shadow model in HSV color space, then extends the standard Shape-From-Silhouette (SFS) algorithm with online Region-of-Interest (ROI) estimation and binary searching, and finally construct skeleton probability visual hull with distance transform. Experiments with real video sequences show that the system can process eleven 640×480 video sequences at a frame rate of 15fps, and construct human body voxels reliably in complex scenarios with cast shadows, various body configurations and multiple persons.
AB - This paper presents a fully automated system for realtime 3D human visual hull reconstruction and skeleton voxels extraction. The main contributions include: (1) A novel network based system is presented, which uses AXIS network cameras as video capture device, and performs a parallel processing among data capture, 3D voxel reconstruction and display. (2) A new human visual hull reconstruction algorithm is given. This approach firstly segments the foreground accurately by an efficient Gaussian Mixture Model (GMM) and a shadow model in HSV color space, then extends the standard Shape-From-Silhouette (SFS) algorithm with online Region-of-Interest (ROI) estimation and binary searching, and finally construct skeleton probability visual hull with distance transform. Experiments with real video sequences show that the system can process eleven 640×480 video sequences at a frame rate of 15fps, and construct human body voxels reliably in complex scenarios with cast shadows, various body configurations and multiple persons.
UR - http://www.scopus.com/inward/record.url?scp=77952254285&partnerID=8YFLogxK
U2 - 10.1109/ICIG.2009.89
DO - 10.1109/ICIG.2009.89
M3 - 会议稿件
AN - SCOPUS:77952254285
SN - 9780769538839
T3 - Proceedings of the 5th International Conference on Image and Graphics, ICIG 2009
SP - 706
EP - 711
BT - Proceedings of the 5th International Conference on Image and Graphics, ICIG 2009
PB - IEEE Computer Society
T2 - 5th International Conference on Image and Graphics, ICIG 2009
Y2 - 20 September 2009 through 23 September 2009
ER -