Learning frame relevance for video classification

Hua Wang, Feiping Nie, Heng Huang, Yi Yang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Traditional video classification methods typically require a large number of labeled training video frames to achieve satisfactory performance. However, in the real world, we usually only have sufficient labeled video clips (such as tagged online videos) but lack labeled video frames. In this paper, we formalize the video classification problem as a Multi-Instance Learning (MIL) problem, an emerging topic in machine learning in recent years, which only needs bag (video clip) labels. To solve the problem, we propose a novel Parameterized Class-to-Bag (P-C2B) Distance method to learn the relative importance of a training instance with respect to its labeled classes, such that the instance level labeling ambiguity in MIL is tackled and the frame relevances of training video data with respect to the semantic concepts of interest are given. Promising experimental results have demonstrated the effectiveness of the proposed method.

Original languageEnglish
Title of host publicationMM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops
Pages1345-1348
Number of pages4
DOIs
StatePublished - 2011
Externally publishedYes
Event19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11 - Scottsdale, AZ, United States
Duration: 28 Nov 20111 Dec 2011

Publication series

NameMM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops

Conference

Conference19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11
Country/TerritoryUnited States
CityScottsdale, AZ
Period28/11/111/12/11

Keywords

  • Multi-instance learning
  • Video classification

Fingerprint

Dive into the research topics of 'Learning frame relevance for video classification'. Together they form a unique fingerprint.

Cite this