An iteratively reweighting algorithm for dynamic video summarization

Pei Dong; Yong Xia; Shanshan Wang; Li Zhuo; David Dagan Feng

doi:10.1007/s11042-014-2126-8

An iteratively reweighting algorithm for dynamic video summarization

Pei Dong, Yong Xia, Shanshan Wang, Li Zhuo, David Dagan Feng

School of Computer Science

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Information explosion has imposed unprecedented challenges on the conventional ways of video data consumption. Hence providing condensed and meaningful video summary to viewers has been recognized as a beneficial and attractive research in the multimedia community in recent years. Analyzing both the visual and textual modalities proves essential for an automatic video summarizer to pick up important contents from a video. However, most established studies in this direction either use heuristic rules or rely on simple ways of text analysis. This paper proposes an iteratively reweighting dynamic video summarization (IRDVS) algorithm based on the joint and adaptive use of the visual modality and accompanying subtitles. The proposed algorithm takes advantage of our developed SEmantic inDicator of videO seGment (SEDOG) feature for exploring the most representative concepts for describing the video. Meanwhile, the iteratively reweighting scheme effectively updates the dynamic surrogate of the original video by combining the high-level features in an adaptive manner. The proposed algorithm has been compared to four state-of-the-art video summarization approaches, namely the speech transcript-based (STVS) algorithm, attention model-based (AMVS) algorithm, sparse dictionary selection-based (DSVS) algorithm and heterogeneity image patch index-based (HIPVS) algorithm, on different video genres, including documentary, movie and TV news. Our results show that the proposed IRDVS algorithm can produce summarized videos with better quality.

Original language	English
Pages (from-to)	9449-9473
Number of pages	25
Journal	Multimedia Tools and Applications
Volume	74
Issue number	21
DOIs	https://doi.org/10.1007/s11042-014-2126-8
State	Published - 29 Nov 2015

Keywords

Iterative weight estimation
Multimodal features
Saliency ranking
Semantic indicator of video segment (SEDOG)
Video summarization

Access to Document

10.1007/s11042-014-2126-8

Cite this

@article{05ab8c76f7644e71b21666a3915d3cd9,

title = "An iteratively reweighting algorithm for dynamic video summarization",

abstract = "Information explosion has imposed unprecedented challenges on the conventional ways of video data consumption. Hence providing condensed and meaningful video summary to viewers has been recognized as a beneficial and attractive research in the multimedia community in recent years. Analyzing both the visual and textual modalities proves essential for an automatic video summarizer to pick up important contents from a video. However, most established studies in this direction either use heuristic rules or rely on simple ways of text analysis. This paper proposes an iteratively reweighting dynamic video summarization (IRDVS) algorithm based on the joint and adaptive use of the visual modality and accompanying subtitles. The proposed algorithm takes advantage of our developed SEmantic inDicator of videO seGment (SEDOG) feature for exploring the most representative concepts for describing the video. Meanwhile, the iteratively reweighting scheme effectively updates the dynamic surrogate of the original video by combining the high-level features in an adaptive manner. The proposed algorithm has been compared to four state-of-the-art video summarization approaches, namely the speech transcript-based (STVS) algorithm, attention model-based (AMVS) algorithm, sparse dictionary selection-based (DSVS) algorithm and heterogeneity image patch index-based (HIPVS) algorithm, on different video genres, including documentary, movie and TV news. Our results show that the proposed IRDVS algorithm can produce summarized videos with better quality.",

keywords = "Iterative weight estimation, Multimodal features, Saliency ranking, Semantic indicator of video segment (SEDOG), Video summarization",

author = "Pei Dong and Yong Xia and Shanshan Wang and Li Zhuo and Feng, {David Dagan}",

note = "Publisher Copyright: {\textcopyright} 2014, Springer Science+Business Media New York.",

year = "2015",

month = nov,

day = "29",

doi = "10.1007/s11042-014-2126-8",

language = "英语",

volume = "74",

pages = "9449--9473",

journal = "Multimedia Tools and Applications",

issn = "1380-7501",

publisher = "Springer",

number = "21",

}

TY - JOUR

T1 - An iteratively reweighting algorithm for dynamic video summarization

AU - Dong, Pei

AU - Xia, Yong

AU - Wang, Shanshan

AU - Zhuo, Li

AU - Feng, David Dagan

PY - 2015/11/29

Y1 - 2015/11/29

N2 - Information explosion has imposed unprecedented challenges on the conventional ways of video data consumption. Hence providing condensed and meaningful video summary to viewers has been recognized as a beneficial and attractive research in the multimedia community in recent years. Analyzing both the visual and textual modalities proves essential for an automatic video summarizer to pick up important contents from a video. However, most established studies in this direction either use heuristic rules or rely on simple ways of text analysis. This paper proposes an iteratively reweighting dynamic video summarization (IRDVS) algorithm based on the joint and adaptive use of the visual modality and accompanying subtitles. The proposed algorithm takes advantage of our developed SEmantic inDicator of videO seGment (SEDOG) feature for exploring the most representative concepts for describing the video. Meanwhile, the iteratively reweighting scheme effectively updates the dynamic surrogate of the original video by combining the high-level features in an adaptive manner. The proposed algorithm has been compared to four state-of-the-art video summarization approaches, namely the speech transcript-based (STVS) algorithm, attention model-based (AMVS) algorithm, sparse dictionary selection-based (DSVS) algorithm and heterogeneity image patch index-based (HIPVS) algorithm, on different video genres, including documentary, movie and TV news. Our results show that the proposed IRDVS algorithm can produce summarized videos with better quality.

AB - Information explosion has imposed unprecedented challenges on the conventional ways of video data consumption. Hence providing condensed and meaningful video summary to viewers has been recognized as a beneficial and attractive research in the multimedia community in recent years. Analyzing both the visual and textual modalities proves essential for an automatic video summarizer to pick up important contents from a video. However, most established studies in this direction either use heuristic rules or rely on simple ways of text analysis. This paper proposes an iteratively reweighting dynamic video summarization (IRDVS) algorithm based on the joint and adaptive use of the visual modality and accompanying subtitles. The proposed algorithm takes advantage of our developed SEmantic inDicator of videO seGment (SEDOG) feature for exploring the most representative concepts for describing the video. Meanwhile, the iteratively reweighting scheme effectively updates the dynamic surrogate of the original video by combining the high-level features in an adaptive manner. The proposed algorithm has been compared to four state-of-the-art video summarization approaches, namely the speech transcript-based (STVS) algorithm, attention model-based (AMVS) algorithm, sparse dictionary selection-based (DSVS) algorithm and heterogeneity image patch index-based (HIPVS) algorithm, on different video genres, including documentary, movie and TV news. Our results show that the proposed IRDVS algorithm can produce summarized videos with better quality.

KW - Iterative weight estimation

KW - Multimodal features

KW - Saliency ranking

KW - Semantic indicator of video segment (SEDOG)

KW - Video summarization

UR - http://www.scopus.com/inward/record.url?scp=84942499697&partnerID=8YFLogxK

U2 - 10.1007/s11042-014-2126-8

DO - 10.1007/s11042-014-2126-8

M3 - 文章

AN - SCOPUS:84942499697

SN - 1380-7501

VL - 74

SP - 9449

EP - 9473

JO - Multimedia Tools and Applications

JF - Multimedia Tools and Applications

IS - 21

ER -

An iteratively reweighting algorithm for dynamic video summarization

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this