Video summarization via nonlinear sparse dictionary selection

Mingyang Ma; Shaohui Mei; Shuai Wan; Zhiyong Wang; Dagan Feng

doi:10.1109/ACCESS.2019.2891834

Video summarization via nonlinear sparse dictionary selection

Mingyang Ma, Shaohui Mei, Shuai Wan, Zhiyong Wang, Dagan Feng

School of Electronics and Information

Research output: Contribution to journal › Article › peer-review

20 Scopus citations

Abstract

Video summarization (VS) is to identify important content from a given video, which can help users quickly comprehend video content. Recently, sparse dictionary selection (SDS) has demonstrated to be an effective solution for VS problems, which generally assumes a linear relationship between keyframes and non-keyframes. However, this assumption is not always true for video frames which possess intrinsic nonlinear structures and properties. In this paper, by exploiting the nonlinearity between video frames, a nonlinear SDS model is formulated for VS, in which the nonlinearity is transformed to linearity by projecting a video to a high-dimensional feature space induced by a kernel function. We also propose two greedy optimization algorithms to solve the resulting model, namely the standard kernel SDS (KSDS) greedy algorithm and the robust KSDS greedy algorithm with a backtracking strategy. In order to achieve an intuitive and flexible configuration of the VS process, an adaptive criterion, namely energy ratio, is devised to produce video summaries with different lengths for different video contents. Experimental results on two different benchmark video datasets demonstrate that the proposed algorithm outperforms several state-of-the-art VS algorithms.

Original language	English
Article number	8606919
Pages (from-to)	11763-11774
Number of pages	12
Journal	IEEE Access
Volume	7
DOIs	https://doi.org/10.1109/ACCESS.2019.2891834
State	Published - 2019

Keywords

dictionary selection
Nonlinear representation
sparse representation
video summarization

Access to Document

10.1109/ACCESS.2019.2891834

Cite this

@article{4c98efaac11245bfad6285069706cc62,

title = "Video summarization via nonlinear sparse dictionary selection",

abstract = "Video summarization (VS) is to identify important content from a given video, which can help users quickly comprehend video content. Recently, sparse dictionary selection (SDS) has demonstrated to be an effective solution for VS problems, which generally assumes a linear relationship between keyframes and non-keyframes. However, this assumption is not always true for video frames which possess intrinsic nonlinear structures and properties. In this paper, by exploiting the nonlinearity between video frames, a nonlinear SDS model is formulated for VS, in which the nonlinearity is transformed to linearity by projecting a video to a high-dimensional feature space induced by a kernel function. We also propose two greedy optimization algorithms to solve the resulting model, namely the standard kernel SDS (KSDS) greedy algorithm and the robust KSDS greedy algorithm with a backtracking strategy. In order to achieve an intuitive and flexible configuration of the VS process, an adaptive criterion, namely energy ratio, is devised to produce video summaries with different lengths for different video contents. Experimental results on two different benchmark video datasets demonstrate that the proposed algorithm outperforms several state-of-the-art VS algorithms.",

keywords = "dictionary selection, Nonlinear representation, sparse representation, video summarization",

author = "Mingyang Ma and Shaohui Mei and Shuai Wan and Zhiyong Wang and Dagan Feng",

note = "Publisher Copyright: {\textcopyright} 2019 IEEE.",

year = "2019",

doi = "10.1109/ACCESS.2019.2891834",

language = "英语",

volume = "7",

pages = "11763--11774",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Video summarization via nonlinear sparse dictionary selection

AU - Ma, Mingyang

AU - Mei, Shaohui

AU - Wan, Shuai

AU - Wang, Zhiyong

AU - Feng, Dagan

PY - 2019

Y1 - 2019

N2 - Video summarization (VS) is to identify important content from a given video, which can help users quickly comprehend video content. Recently, sparse dictionary selection (SDS) has demonstrated to be an effective solution for VS problems, which generally assumes a linear relationship between keyframes and non-keyframes. However, this assumption is not always true for video frames which possess intrinsic nonlinear structures and properties. In this paper, by exploiting the nonlinearity between video frames, a nonlinear SDS model is formulated for VS, in which the nonlinearity is transformed to linearity by projecting a video to a high-dimensional feature space induced by a kernel function. We also propose two greedy optimization algorithms to solve the resulting model, namely the standard kernel SDS (KSDS) greedy algorithm and the robust KSDS greedy algorithm with a backtracking strategy. In order to achieve an intuitive and flexible configuration of the VS process, an adaptive criterion, namely energy ratio, is devised to produce video summaries with different lengths for different video contents. Experimental results on two different benchmark video datasets demonstrate that the proposed algorithm outperforms several state-of-the-art VS algorithms.

AB - Video summarization (VS) is to identify important content from a given video, which can help users quickly comprehend video content. Recently, sparse dictionary selection (SDS) has demonstrated to be an effective solution for VS problems, which generally assumes a linear relationship between keyframes and non-keyframes. However, this assumption is not always true for video frames which possess intrinsic nonlinear structures and properties. In this paper, by exploiting the nonlinearity between video frames, a nonlinear SDS model is formulated for VS, in which the nonlinearity is transformed to linearity by projecting a video to a high-dimensional feature space induced by a kernel function. We also propose two greedy optimization algorithms to solve the resulting model, namely the standard kernel SDS (KSDS) greedy algorithm and the robust KSDS greedy algorithm with a backtracking strategy. In order to achieve an intuitive and flexible configuration of the VS process, an adaptive criterion, namely energy ratio, is devised to produce video summaries with different lengths for different video contents. Experimental results on two different benchmark video datasets demonstrate that the proposed algorithm outperforms several state-of-the-art VS algorithms.

KW - dictionary selection

KW - Nonlinear representation

KW - sparse representation

KW - video summarization

UR - http://www.scopus.com/inward/record.url?scp=85061137777&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2019.2891834

DO - 10.1109/ACCESS.2019.2891834

M3 - 文章

AN - SCOPUS:85061137777

SN - 2169-3536

VL - 7

SP - 11763

EP - 11774

JO - IEEE Access

JF - IEEE Access

M1 - 8606919

ER -

Video summarization via nonlinear sparse dictionary selection

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this