Learning frame relevance for video classification

Hua Wang; Feiping Nie; Heng Huang; Yi Yang

doi:10.1145/2072298.2072011

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

7 Scopus citations

Abstract

Traditional video classification methods typically require a large number of labeled training video frames to achieve satisfactory performance. However, in the real world, we usually only have sufficient labeled video clips (such as tagged online videos) but lack labeled video frames. In this paper, we formalize the video classification problem as a Multi-Instance Learning (MIL) problem, an emerging topic in machine learning in recent years, which only needs bag (video clip) labels. To solve the problem, we propose a novel Parameterized Class-to-Bag (P-C2B) Distance method to learn the relative importance of a training instance with respect to its labeled classes, such that the instance level labeling ambiguity in MIL is tackled and the frame relevances of training video data with respect to the semantic concepts of interest are given. Promising experimental results have demonstrated the effectiveness of the proposed method.
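The abstract does not give the P-C2B formula, so the following is only a minimal sketch of one plausible reading of a weighted class-to-bag distance, not the authors' formulation. It assumes each video clip is a bag of frame feature vectors, each class is represented by all frames from its training clips, and each training frame carries a non-negative relevance weight. The weights here are set by hand purely for illustration, whereas the paper learns them; all function and variable names are hypothetical.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def instance_to_bag(instance, bag):
    """Squared distance from one frame to its nearest frame in a bag (clip)."""
    return min(sq_dist(instance, other) for other in bag)


def class_to_bag(class_instances, weights, bag):
    """Weighted class-to-bag distance (illustrative assumption, not the paper's).

    class_instances: all training frames from clips labeled with the class.
    weights: one relevance weight per training frame; hand-set here.
    """
    return sum(w * instance_to_bag(x, bag)
               for x, w in zip(class_instances, weights))


def classify(classes, bag):
    """Assign the clip to the class with the smallest class-to-bag distance."""
    return min(classes, key=lambda name: class_to_bag(*classes[name], bag))


# Toy 2-D frame features. The third "sports" frame is an irrelevant frame
# (for example, a title card), so its hand-set weight is 0.
classes = {
    "sports": ([(0.0, 0.0), (0.1, 0.0), (5.0, 5.0)], [1.0, 1.0, 0.0]),
    "news": ([(3.0, 3.0)], [1.0]),
}
query_clip = [(0.0, 0.1), (0.2, 0.0)]

# Same classes with uniform weights, i.e. every frame treated as equally relevant.
uniform = {name: (frames, [1.0] * len(frames))
           for name, (frames, _) in classes.items()}

print(classify(classes, query_clip))
print(classify(uniform, query_clip))
```

In this toy setup, down-weighting the irrelevant frame changes which class is nearest to the query clip, which is the intuition behind learning frame relevance from clip-level labels only.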

Original language	English
Title of host publication	MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops
Pages	1345-1348
Number of pages	4
DOIs	https://doi.org/10.1145/2072298.2072011
State	Published - 2011
Externally published	Yes
Event	19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11 - Scottsdale, AZ, United States Duration: 28 Nov 2011 → 1 Dec 2011

Publication series

Name	MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops

Conference

Conference	19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11
Country/Territory	United States
City	Scottsdale, AZ
Period	28/11/11 → 1/12/11

Keywords

Multi-instance learning
Video classification

Access to Document

10.1145/2072298.2072011

Cite this

@inproceedings{89b287b3d3d34e888ddba56c83387161,

title = "Learning frame relevance for video classification",

abstract = "Traditional video classification methods typically require a large number of labeled training video frames to achieve satisfactory performance. However, in the real world, we usually only have sufficient labeled video clips (such as tagged online videos) but lack labeled video frames. In this paper, we formalize the video classification problem as a Multi-Instance Learning (MIL) problem, an emerging topic in machine learning in recent years, which only needs bag (video clip) labels. To solve the problem, we propose a novel Parameterized Class-to-Bag (P-C2B) Distance method to learn the relative importance of a training instance with respect to its labeled classes, such that the instance level labeling ambiguity in MIL is tackled and the frame relevances of training video data with respect to the semantic concepts of interest are given. Promising experimental results have demonstrated the effectiveness of the proposed method.",

keywords = "Multi-instance learning, Video classification",

author = "Hua Wang and Feiping Nie and Heng Huang and Yi Yang",

year = "2011",

doi = "10.1145/2072298.2072011",

language = "English",

isbn = "9781450306164",

series = "MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops",

pages = "1345--1348",

booktitle = "MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops",

note = "19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11 ; Conference date: 28-11-2011 Through 01-12-2011",

}

Wang, H, Nie, F, Huang, H & Yang, Y 2011, Learning frame relevance for video classification. in MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops. MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops, pp. 1345-1348, 19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11, Scottsdale, AZ, United States, 28/11/11. https://doi.org/10.1145/2072298.2072011

Learning frame relevance for video classification. / Wang, Hua; Nie, Feiping; Huang, Heng et al.
MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops. 2011. p. 1345-1348 (MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops).

TY - GEN

T1 - Learning frame relevance for video classification

AU - Wang, Hua

AU - Nie, Feiping

AU - Huang, Heng

AU - Yang, Yi

PY - 2011

Y1 - 2011

AB - Traditional video classification methods typically require a large number of labeled training video frames to achieve satisfactory performance. However, in the real world, we usually only have sufficient labeled video clips (such as tagged online videos) but lack labeled video frames. In this paper, we formalize the video classification problem as a Multi-Instance Learning (MIL) problem, an emerging topic in machine learning in recent years, which only needs bag (video clip) labels. To solve the problem, we propose a novel Parameterized Class-to-Bag (P-C2B) Distance method to learn the relative importance of a training instance with respect to its labeled classes, such that the instance level labeling ambiguity in MIL is tackled and the frame relevances of training video data with respect to the semantic concepts of interest are given. Promising experimental results have demonstrated the effectiveness of the proposed method.

KW - Multi-instance learning

KW - Video classification

UR - http://www.scopus.com/inward/record.url?scp=84455201982&partnerID=8YFLogxK

U2 - 10.1145/2072298.2072011

DO - 10.1145/2072298.2072011

M3 - Conference contribution

AN - SCOPUS:84455201982

SN - 9781450306164

T3 - MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops

SP - 1345

EP - 1348

BT - MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops

T2 - 19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11

Y2 - 28 November 2011 through 1 December 2011

ER -
