A top-down approach for video summarization

Genliang Guan, Zhiyong Wang, Shaohui Mei, Max Ott, Mingyi He, David Dagan Feng

Research output: Contribution to journalArticlepeer-review

40 Scopus citations

Abstract

While most existing video summarization approaches aim to identify important frames of a video from either a global or local perspective, we propose a top-down approach consisting of scene identification and scene summarization. For scene identification, we represent each frame with global features and utilize a scalable clustering method.We then formulate scene summarization as choosing those frames that best cover a set of local descriptors with minimal redundancy. In addition, we develop a visual word-based approach to make our approach more computationally scalable. Experimental results on two benchmark datasets demonstrate that our proposed approach clearly outperforms the state-of-the-art.

Original languageEnglish
Article number4
JournalACM Transactions on Multimedia Computing, Communications and Applications
Volume11
Issue number1
DOIs
StatePublished - Aug 2014

Keywords

  • Clustering
  • Keyframe extraction
  • Keypoint
  • Local visual word
  • Scene identification

Fingerprint

Dive into the research topics of 'A top-down approach for video summarization'. Together they form a unique fingerprint.

Cite this