VDG: Vision-Only Dynamic Gaussian for Driving Simulation

Hao Li; Jingfeng Li; Dingwen Zhang; Chenming Wu; Jieqi Shi; Chen Zhao; Haocheng Feng; Errui Ding; Jingdong Wang; Junwei Han

doi:10.1109/LRA.2025.3555938

VDG: Vision-Only Dynamic Gaussian for Driving Simulation

Hao Li, Jingfeng Li, Dingwen Zhang, Chenming Wu, Jieqi Shi, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han

School of Automation

Research output: Contribution to journal › Article › peer-review

Abstract

Recent advances in dynamic Gaussian splatting have significantly improved scene reconstruction and novel-view synthesis. However, existing methods often rely on pre-computed camera poses and Gaussian initialization using Structure from Motion (SfM) or other costly sensors, limiting their scalability. In this paper, we propose Vision-only Dynamic Gaussian (VDG), a novel method that, for the first time, integrates self-supervised visual odometry (VO) into a pose-free dynamic Gaussian splatting framework. Given the reason that estimated poses are not accurate enough to perform self-decomposition for dynamic scenes, we specifically design motion supervision, enabling precise static-dynamic decomposition and modeling of dynamic objects via dynamic Gaussians. Extensive experiments on urban driving datasets, including KITTI and Waymo, show that VDG consistently outperforms state-of-the-art dynamic view synthesis methods in both reconstruction accuracy and pose prediction with only image input. Project page:https://3daigc.github.io/VDG/.

Original language	English
Journal	IEEE Robotics and Automation Letters
DOIs	https://doi.org/10.1109/LRA.2025.3555938
State	Accepted/In press - 2025

Keywords

Computer Vision for Transportation
Intelligent Transportation Systems
Simulation and Animation

Access to Document

10.1109/LRA.2025.3555938

Cite this

@article{f27bb6a64c304217ab4067f20fa191ac,

title = "VDG: Vision-Only Dynamic Gaussian for Driving Simulation",

abstract = "Recent advances in dynamic Gaussian splatting have significantly improved scene reconstruction and novel-view synthesis. However, existing methods often rely on pre-computed camera poses and Gaussian initialization using Structure from Motion (SfM) or other costly sensors, limiting their scalability. In this paper, we propose Vision-only Dynamic Gaussian (VDG), a novel method that, for the first time, integrates self-supervised visual odometry (VO) into a pose-free dynamic Gaussian splatting framework. Given the reason that estimated poses are not accurate enough to perform self-decomposition for dynamic scenes, we specifically design motion supervision, enabling precise static-dynamic decomposition and modeling of dynamic objects via dynamic Gaussians. Extensive experiments on urban driving datasets, including KITTI and Waymo, show that VDG consistently outperforms state-of-the-art dynamic view synthesis methods in both reconstruction accuracy and pose prediction with only image input. Project page:https://3daigc.github.io/VDG/.",

keywords = "Computer Vision for Transportation, Intelligent Transportation Systems, Simulation and Animation",

author = "Hao Li and Jingfeng Li and Dingwen Zhang and Chenming Wu and Jieqi Shi and Chen Zhao and Haocheng Feng and Errui Ding and Jingdong Wang and Junwei Han",

note = "Publisher Copyright: {\textcopyright} 2016 IEEE.",

year = "2025",

doi = "10.1109/LRA.2025.3555938",

language = "英语",

journal = "IEEE Robotics and Automation Letters",

issn = "2377-3766",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - VDG

T2 - Vision-Only Dynamic Gaussian for Driving Simulation

AU - Li, Hao

AU - Li, Jingfeng

AU - Zhang, Dingwen

AU - Wu, Chenming

AU - Shi, Jieqi

AU - Zhao, Chen

AU - Feng, Haocheng

AU - Ding, Errui

AU - Wang, Jingdong

AU - Han, Junwei

PY - 2025

Y1 - 2025

N2 - Recent advances in dynamic Gaussian splatting have significantly improved scene reconstruction and novel-view synthesis. However, existing methods often rely on pre-computed camera poses and Gaussian initialization using Structure from Motion (SfM) or other costly sensors, limiting their scalability. In this paper, we propose Vision-only Dynamic Gaussian (VDG), a novel method that, for the first time, integrates self-supervised visual odometry (VO) into a pose-free dynamic Gaussian splatting framework. Given the reason that estimated poses are not accurate enough to perform self-decomposition for dynamic scenes, we specifically design motion supervision, enabling precise static-dynamic decomposition and modeling of dynamic objects via dynamic Gaussians. Extensive experiments on urban driving datasets, including KITTI and Waymo, show that VDG consistently outperforms state-of-the-art dynamic view synthesis methods in both reconstruction accuracy and pose prediction with only image input. Project page:https://3daigc.github.io/VDG/.

AB - Recent advances in dynamic Gaussian splatting have significantly improved scene reconstruction and novel-view synthesis. However, existing methods often rely on pre-computed camera poses and Gaussian initialization using Structure from Motion (SfM) or other costly sensors, limiting their scalability. In this paper, we propose Vision-only Dynamic Gaussian (VDG), a novel method that, for the first time, integrates self-supervised visual odometry (VO) into a pose-free dynamic Gaussian splatting framework. Given the reason that estimated poses are not accurate enough to perform self-decomposition for dynamic scenes, we specifically design motion supervision, enabling precise static-dynamic decomposition and modeling of dynamic objects via dynamic Gaussians. Extensive experiments on urban driving datasets, including KITTI and Waymo, show that VDG consistently outperforms state-of-the-art dynamic view synthesis methods in both reconstruction accuracy and pose prediction with only image input. Project page:https://3daigc.github.io/VDG/.

KW - Computer Vision for Transportation

KW - Intelligent Transportation Systems

KW - Simulation and Animation

UR - http://www.scopus.com/inward/record.url?scp=105001980838&partnerID=8YFLogxK

U2 - 10.1109/LRA.2025.3555938

DO - 10.1109/LRA.2025.3555938

M3 - 文章

AN - SCOPUS:105001980838

SN - 2377-3766

JO - IEEE Robotics and Automation Letters

JF - IEEE Robotics and Automation Letters

ER -

VDG: Vision-Only Dynamic Gaussian for Driving Simulation

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this