V2VFusion: Multimodal Fusion for Enhanced Vehicle-to-Vehicle Cooperative Perception

Lei Zhang, Binglu Wang, Zhaozhong Wang, Yongqiang Zhao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Current vehicle-to-vehicle (V2V) research mainly centers on either LiDAR or camera-based perception. Yet, combining data from multiple sensors offers a more complete and precise understanding of the environment. This paper presents V2VFusion, a multimodal perception framework that fuses Li-DAR and camera sensor inputs to improve the performance of V2V systems. Firstly, we implement a baseline system for multi-modal fusion in V2V scenarios, effectively integrating data from LiDAR and camera sensors. This baseline provides a comparable benchmark for subsequent research. Secondly, we explore different fusion strategies, including concatenation, element-wise summation, and transformer methods, to investigate their impact on fusion performance. Lastly, we conduct experiments and evaluation on the OPV2V dataset. The experimental results demonstrate that the multimodal perception method achieves better performance and robustness in V2V tasks, providing more accurate object detection results, thereby improving the safety and reliability of autonomous driving systems.

Original languageEnglish
Title of host publicationProceedings - 2023 China Automation Congress, CAC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3691-3696
Number of pages6
ISBN (Electronic)9798350303759
DOIs
StatePublished - 2023
Event2023 China Automation Congress, CAC 2023 - Chongqing, China
Duration: 17 Nov 202319 Nov 2023

Publication series

NameProceedings - 2023 China Automation Congress, CAC 2023

Conference

Conference2023 China Automation Congress, CAC 2023
Country/TerritoryChina
CityChongqing
Period17/11/2319/11/23

Keywords

  • autonomous driving
  • cooperative perception
  • multimodal fusion
  • vehicle-to-vehicle

Fingerprint

Dive into the research topics of 'V2VFusion: Multimodal Fusion for Enhanced Vehicle-to-Vehicle Cooperative Perception'. Together they form a unique fingerprint.

Cite this