Video Frame Prediction from a Single Image and Events

Juanjuan Zhu, Zhexiong Wan, Yuchao Dai

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Recently, the task of Video Frame Prediction (VFP), which predicts future video frames from previous ones through extrapolation, has made remarkable progress.However, the performance of existing VFP methods is still far from satisfactory due to the fixed framerate video used: 1) they have difficulties in handling complex dynamic scenes; 2) they cannot predict future frames with flexible prediction time intervals.The event cameras can record the intensity changes asynchronously with a very high temporal resolution, which provides rich dynamic information about the observed scenes.In this paper, we propose to predict video frames from a single image and the following events, which can not only handle complex dynamic scenes but also predict future frames with flexible prediction time intervals.First, we introduce a symmetrical cross-modal attention augmentation module to enhance the complementary information between images and events.Second, we propose to jointly achieve optical flow estimation and frame generation by combining the motion information of events and the semantic information of the image, then inpainting the holes produced by forward warping to obtain an ideal prediction frame.Based on these, we propose a lightweight pyramidal coarse-to-fine model that can predict a 720P frame within 25 ms.Extensive experiments show that our proposed model significantly outperforms the state-of-the-art frame-based and event-based VFP methods and has the fastest runtime.Code is available at https://npucvr.github.io/VFPSIE/.

源语言英语
主期刊名Technical Tracks 14
编辑Michael Wooldridge, Jennifer Dy, Sriraam Natarajan
出版商Association for the Advancement of Artificial Intelligence
7748-7756
页数9
版本7
ISBN(电子版)1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 1577358872, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879, 9781577358879
DOI
出版状态已出版 - 25 3月 2024
活动38th AAAI Conference on Artificial Intelligence, AAAI 2024 - Vancouver, 加拿大
期限: 20 2月 202427 2月 2024

出版系列

姓名Proceedings of the AAAI Conference on Artificial Intelligence
编号7
38
ISSN(印刷版)2159-5399
ISSN(电子版)2374-3468

会议

会议38th AAAI Conference on Artificial Intelligence, AAAI 2024
国家/地区加拿大
Vancouver
时期20/02/2427/02/24

指纹

探究 'Video Frame Prediction from a Single Image and Events' 的科研主题。它们共同构成独一无二的指纹。

引用此