Two- and three-dimensional deep human detection by generating orthographic top view image from dense point cloud

Fang Tan, Xiaoyi Feng, Yupeng Ma, Zhaoqiang Xia

Research output: Contribution to journal, Article, peer-reviewed

1 Citation (Scopus)

Abstract

Human detection still suffers from occlusion, complex backgrounds, and scale variation. Projecting three-dimensional (3D) points onto the ground to generate an orthographic top view (OTV) image for detection can effectively alleviate these problems. However, depth sensors may be placed arbitrarily, which makes it difficult to create OTV images from the dense point cloud converted from a depth image. We focus on the generation of OTV images and on human detection using the constructed OTV image. First, we propose a ground plane extraction method that is well suited to various camera positions and orientations in complex scenes. Next, points are converted to a uniform coordinate system using the ground parameters and encoded to generate a three-channel OTV image. Then, a mainstream two-dimensional (2D) network is employed to detect humans directly on OTV images, and the 3D bounding box is obtained by computing the mapping from the OTV image. In addition, we propose a semiautomated annotation method to address the scarcity of OTV image annotations. The proposed method is evaluated on the EPFL dataset, which includes two subsets, and achieves state-of-the-art performance compared with existing approaches. Moreover, our 2D and 3D human detection method runs at more than 26 FPS on the CPU.
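
The projection and mapping steps described in the abstract can be illustrated with a minimal sketch. It assumes the ground plane has already been estimated as ax + by + cz + d = 0 in camera coordinates, with the normal (a, b, c) pointing from the ground toward the scene. The abstract does not specify the three-channel encoding, so the channels used here (maximum height, mean height, point count), the grid parameters `cell` and `extent`, and the function names `ground_basis`, `otv_image`, and `box_to_3d` are all hypothetical choices for illustration, not the authors' implementation.

```python
import math

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def unit(a):
    length = math.sqrt(dot(a, a))
    return (a[0] / length, a[1] / length, a[2] / length)

def ground_basis(normal):
    """Return unit vectors (u, v, n); u and v span the ground plane."""
    n = unit(normal)
    # Pick a helper axis that is not (nearly) parallel to the normal.
    helper = (1.0, 0.0, 0.0) if abs(n[0]) < 0.9 else (0.0, 1.0, 0.0)
    u = unit(cross(n, helper))
    v = cross(n, u)
    return u, v, n

def otv_image(points, normal, d, cell=0.1, extent=2.0):
    """Rasterise 3D points into a square top-view grid.

    Each pixel holds [max height, mean height, point count]
    (an assumed encoding). The grid covers [-extent, extent)
    metres along both ground axes.
    """
    u, v, n = ground_basis(normal)
    offset = d / math.sqrt(dot(normal, normal))  # d rescaled for the unit normal
    size = int(round(2.0 * extent / cell))
    img = [[[0.0, 0.0, 0] for _ in range(size)] for _ in range(size)]
    for p in points:
        height = dot(n, p) + offset              # signed distance to the ground
        col = math.floor((dot(u, p) + extent) / cell)
        row = math.floor((dot(v, p) + extent) / cell)
        # Drop points below the ground or outside the grid.
        if height < 0.0 or not (0 <= row < size and 0 <= col < size):
            continue
        px = img[row][col]
        px[0] = max(px[0], height)
        px[1] += height                          # running sum, averaged below
        px[2] += 1
    for row in img:
        for px in row:
            if px[2]:
                px[1] /= px[2]
    return img

def box_to_3d(img, box, cell=0.1, extent=2.0):
    """Map a 2D pixel box (col0, row0, col1, row1), end-exclusive, to
    (x0, y0, x1, y1, height) in ground coordinates; height is the
    tallest pixel inside the box."""
    col0, row0, col1, row1 = box
    top = max(img[r][c][0] for r in range(row0, row1) for c in range(col0, col1))
    return (col0 * cell - extent, row0 * cell - extent,
            col1 * cell - extent, row1 * cell - extent, top)
```

In this sketch, a 2D detector would run on the image returned by `otv_image`, and each detected pixel box would be passed to `box_to_3d` to recover a ground footprint and a height. Because every point is expressed relative to the ground plane rather than the camera, the same rasterisation applies regardless of where the sensor is mounted.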

Original language: English
Article number: 033009
Journal: Journal of Electronic Imaging
Volume: 31
Issue number: 3
Publication status: Published - 1 May 2022
