Quasi real-time summarization for consumer videos

Bin Zhao; Eric P. Xing

doi:10.1109/CVPR.2014.322

Quasi real-time summarization for consumer videos

Bin Zhao, Eric P. Xing

Carnegie Mellon University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

223 Scopus citations

Abstract

With the widespread availability of video cameras, we are facing an ever-growing enormous collection of unedited and unstructured video data. Due to lack of an automatic way to generate summaries from this large collection of consumer videos, they can be tedious and time consuming to index or search. In this work, we propose online video highlighting, a principled way of generating short video summarizing the most important and interesting contents of an unedited and unstructured video, costly both time-wise and financially for manual processing. Specifically, our method learns a dictionary from given video using group sparse coding, and updates atoms in the dictionary on-the-fly. A summary video is then generated by combining segments that cannot be sparsely reconstructed using the learned dictionary. The online fashion of our proposed method enables it to process arbitrarily long videos and start generating summaries before seeing the end of the video. Moreover, the processing time required by our proposed method is close to the original video length, achieving quasi real-time summarization speed. Theoretical analysis, together with experimental results on more than 12 hours of surveillance and YouTube videos are provided, demonstrating the effectiveness of online video highlighting.

Original language	English
Title of host publication	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Publisher	IEEE Computer Society
Pages	2513-2520
Number of pages	8
ISBN (Electronic)	9781479951178, 9781479951178
DOIs	https://doi.org/10.1109/CVPR.2014.322
State	Published - 24 Sep 2014
Externally published	Yes
Event	27th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2014 - Columbus, United States Duration: 23 Jun 2014 → 28 Jun 2014

Publication series

Name	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISSN (Print)	1063-6919

Conference

Conference	27th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2014
Country/Territory	United States
City	Columbus
Period	23/06/14 → 28/06/14

Access to Document

10.1109/CVPR.2014.322

Cite this

Zhao, B., & Xing, E. P. (2014). Quasi real-time summarization for consumer videos. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 2513-2520). Article 6909718 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition). IEEE Computer Society. https://doi.org/10.1109/CVPR.2014.322

@inproceedings{ea9b8ba5126648b283c36384d68b6a37,

title = "Quasi real-time summarization for consumer videos",

abstract = "With the widespread availability of video cameras, we are facing an ever-growing enormous collection of unedited and unstructured video data. Due to lack of an automatic way to generate summaries from this large collection of consumer videos, they can be tedious and time consuming to index or search. In this work, we propose online video highlighting, a principled way of generating short video summarizing the most important and interesting contents of an unedited and unstructured video, costly both time-wise and financially for manual processing. Specifically, our method learns a dictionary from given video using group sparse coding, and updates atoms in the dictionary on-the-fly. A summary video is then generated by combining segments that cannot be sparsely reconstructed using the learned dictionary. The online fashion of our proposed method enables it to process arbitrarily long videos and start generating summaries before seeing the end of the video. Moreover, the processing time required by our proposed method is close to the original video length, achieving quasi real-time summarization speed. Theoretical analysis, together with experimental results on more than 12 hours of surveillance and YouTube videos are provided, demonstrating the effectiveness of online video highlighting.",

author = "Bin Zhao and Xing, {Eric P.}",

note = "Publisher Copyright: {\textcopyright} 2014 IEEE.; 27th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2014 ; Conference date: 23-06-2014 Through 28-06-2014",

year = "2014",

month = sep,

day = "24",

doi = "10.1109/CVPR.2014.322",

language = "英语",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "2513--2520",

booktitle = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

}

Zhao, B & Xing, EP 2014, Quasi real-time summarization for consumer videos. in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition., 6909718, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, pp. 2513-2520, 27th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2014, Columbus, United States, 23/06/14. https://doi.org/10.1109/CVPR.2014.322

Quasi real-time summarization for consumer videos. / Zhao, Bin; Xing, Eric P.
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE Computer Society, 2014. p. 2513-2520 6909718 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Quasi real-time summarization for consumer videos

AU - Zhao, Bin

AU - Xing, Eric P.

PY - 2014/9/24

Y1 - 2014/9/24

N2 - With the widespread availability of video cameras, we are facing an ever-growing enormous collection of unedited and unstructured video data. Due to lack of an automatic way to generate summaries from this large collection of consumer videos, they can be tedious and time consuming to index or search. In this work, we propose online video highlighting, a principled way of generating short video summarizing the most important and interesting contents of an unedited and unstructured video, costly both time-wise and financially for manual processing. Specifically, our method learns a dictionary from given video using group sparse coding, and updates atoms in the dictionary on-the-fly. A summary video is then generated by combining segments that cannot be sparsely reconstructed using the learned dictionary. The online fashion of our proposed method enables it to process arbitrarily long videos and start generating summaries before seeing the end of the video. Moreover, the processing time required by our proposed method is close to the original video length, achieving quasi real-time summarization speed. Theoretical analysis, together with experimental results on more than 12 hours of surveillance and YouTube videos are provided, demonstrating the effectiveness of online video highlighting.

AB - With the widespread availability of video cameras, we are facing an ever-growing enormous collection of unedited and unstructured video data. Due to lack of an automatic way to generate summaries from this large collection of consumer videos, they can be tedious and time consuming to index or search. In this work, we propose online video highlighting, a principled way of generating short video summarizing the most important and interesting contents of an unedited and unstructured video, costly both time-wise and financially for manual processing. Specifically, our method learns a dictionary from given video using group sparse coding, and updates atoms in the dictionary on-the-fly. A summary video is then generated by combining segments that cannot be sparsely reconstructed using the learned dictionary. The online fashion of our proposed method enables it to process arbitrarily long videos and start generating summaries before seeing the end of the video. Moreover, the processing time required by our proposed method is close to the original video length, achieving quasi real-time summarization speed. Theoretical analysis, together with experimental results on more than 12 hours of surveillance and YouTube videos are provided, demonstrating the effectiveness of online video highlighting.

UR - http://www.scopus.com/inward/record.url?scp=84911458226&partnerID=8YFLogxK

U2 - 10.1109/CVPR.2014.322

DO - 10.1109/CVPR.2014.322

M3 - 会议稿件

AN - SCOPUS:84911458226

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 2513

EP - 2520

BT - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

PB - IEEE Computer Society

T2 - 27th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2014

Y2 - 23 June 2014 through 28 June 2014

ER -

Quasi real-time summarization for consumer videos

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this