HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization

Bin Zhao, Xuelong Li, Xiaoqiang Lu

科研成果: 书/报告/会议事项章节会议稿件同行评审

205 引用 (Scopus)

摘要

Although video summarization has achieved great success in recent years, few approaches have realized the influence of video structure on the summarization results. As we know, the video data follow a hierarchical structure, i.e., a video is composed of shots, and a shot is composed of several frames. Generally, shots provide the activity-level information for people to understand the video content. While few existing summarization approaches pay attention to the shot segmentation procedure. They generate shots by some trivial strategies, such as fixed length segmentation, which may destroy the underlying hierarchical structure of video data and further reduce the quality of generated summaries. To address this problem, we propose a structure-adaptive video summarization approach that integrates shot segmentation and video summarization into a Hierarchical Structure-Adaptive RNN, denoted as HSA-RNN. We evaluate the proposed approach on four popular datasets, i.e., SumMe, TVsum, CoSum and VTW. The experimental results have demonstrated the effectiveness of HSA-RNN in the video summarization task.

源语言英语
主期刊名Proceedings - 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018
出版商IEEE Computer Society
7405-7414
页数10
ISBN(电子版)9781538664209
DOI
出版状态已出版 - 14 12月 2018
活动31st Meeting of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018 - Salt Lake City, 美国
期限: 18 6月 201822 6月 2018

出版系列

姓名Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISSN(印刷版)1063-6919

会议

会议31st Meeting of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2018
国家/地区美国
Salt Lake City
时期18/06/1822/06/18

指纹

探究 'HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization' 的科研主题。它们共同构成独一无二的指纹。

引用此