TY - GEN
T1 - Exploring the influence of feature representation for dictionary selection based video summarization
AU - Ma, Mingyang
AU - Mei, Shaohui
AU - Ji, Jingyu
AU - Wan, Shuai
AU - Wang, Zhiyong
AU - Feng, Dagan
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/2
Y1 - 2017/7/2
N2 - Dictionary selection based video summarization (VS) algorithms, in which keyframes are treated as a dictionary to reconstruct all video frames, have been demonstrated to be effective and efficient for video summarization. It has been observed that the feature representation of video frames has a great impact on the performance of VS. In this paper, the influence of feature representation on the performance of dictionary selection based VS is investigated for the first time. In addition to the traditional hand-crafted features used in VS, such as the color histogram, deep features learned by deep neural networks are used for the first time to represent video frames for dictionary selection based VS. The impact of dimensionality reduction of the high-dimensional deep features on VS is further discussed. Experimental results on a benchmark video dataset demonstrate that deep learning features achieve better performance than traditional hand-crafted features for dictionary selection based VS. Moreover, the dimensionality of deep learning features can be reduced to lower the computational cost without degrading VS performance.
AB - Dictionary selection based video summarization (VS) algorithms, in which keyframes are treated as a dictionary to reconstruct all video frames, have been demonstrated to be effective and efficient for video summarization. It has been observed that the feature representation of video frames has a great impact on the performance of VS. In this paper, the influence of feature representation on the performance of dictionary selection based VS is investigated for the first time. In addition to the traditional hand-crafted features used in VS, such as the color histogram, deep features learned by deep neural networks are used for the first time to represent video frames for dictionary selection based VS. The impact of dimensionality reduction of the high-dimensional deep features on VS is further discussed. Experimental results on a benchmark video dataset demonstrate that deep learning features achieve better performance than traditional hand-crafted features for dictionary selection based VS. Moreover, the dimensionality of deep learning features can be reduced to lower the computational cost without degrading VS performance.
KW - Deep learning
KW - Feature representation
KW - Sparse reconstruction
KW - Video summarization
UR - http://www.scopus.com/inward/record.url?scp=85045343937&partnerID=8YFLogxK
U2 - 10.1109/ICIP.2017.8296815
DO - 10.1109/ICIP.2017.8296815
M3 - Conference contribution
AN - SCOPUS:85045343937
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 2911
EP - 2915
BT - 2017 IEEE International Conference on Image Processing, ICIP 2017 - Proceedings
PB - IEEE Computer Society
T2 - 24th IEEE International Conference on Image Processing, ICIP 2017
Y2 - 17 September 2017 through 20 September 2017
ER -