TY - GEN
T1 - Modeling urban scenes in the spatial-temporal space
AU - Xu, Jiong
AU - Wang, Qing
AU - Yang, Jie
PY - 2011
Y1 - 2011
N2 - This paper presents a technique to simultaneously model 3D urban scenes in the spatial-temporal space using a collection of photos that span many years. We propose to use a middle level representation, building, to characterize significant structure changes in the scene. We first use structure-from-motion techniques to build 3D point clouds, which is a mixture of scenes from different periods of time. We then segment the point clouds into independent buildings using a hierarchical method, including coarse clustering on sparse points and fine classification on dense points based on the spatial distance of point clouds and the difference of visibility vectors. In the fine classification, we segment building candidates using a probabilistic model in the spatial-temporal space simultaneously. We employ a z-buffering based method to infer existence of each building in each image. After recovering temporal order of input images, we finally obtain 3D models of these buildings along the time axis. We present experiments using both toy building images captured from our lab and real urban scene images to demonstrate the feasibility of the proposed approach.
AB - This paper presents a technique to simultaneously model 3D urban scenes in the spatial-temporal space using a collection of photos that span many years. We propose to use a middle level representation, building, to characterize significant structure changes in the scene. We first use structure-from-motion techniques to build 3D point clouds, which is a mixture of scenes from different periods of time. We then segment the point clouds into independent buildings using a hierarchical method, including coarse clustering on sparse points and fine classification on dense points based on the spatial distance of point clouds and the difference of visibility vectors. In the fine classification, we segment building candidates using a probabilistic model in the spatial-temporal space simultaneously. We employ a z-buffering based method to infer existence of each building in each image. After recovering temporal order of input images, we finally obtain 3D models of these buildings along the time axis. We present experiments using both toy building images captured from our lab and real urban scene images to demonstrate the feasibility of the proposed approach.
UR - http://www.scopus.com/inward/record.url?scp=79952496059&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-19309-5_29
DO - 10.1007/978-3-642-19309-5_29
M3 - 会议稿件
AN - SCOPUS:79952496059
SN - 9783642193088
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 374
EP - 387
BT - Computer Vision, ACCV 2010 - 10th Asian Conference on Computer Vision, Revised Selected Papers
T2 - 10th Asian Conference on Computer Vision, ACCV 2010
Y2 - 8 November 2010 through 12 November 2010
ER -