TY - GEN
T1 - Learning frame relevance for video classification
AU - Wang, Hua
AU - Nie, Feiping
AU - Huang, Heng
AU - Yang, Yi
PY - 2011
Y1 - 2011
N2 - Traditional video classification methods typically require a large number of labeled training video frames to achieve satisfactory performance. However, in the real world, we usually only have sufficient labeled video clips (such as tagged online videos) but lack labeled video frames. In this paper, we formalize the video classification problem as a Multi-Instance Learning (MIL) problem, an emerging topic in machine learning in recent years, which only needs bag (video clip) labels. To solve the problem, we propose a novel Parameterized Class-to-Bag (P-C2B) Distance method to learn the relative importance of a training instance with respect to its labeled classes, such that the instance level labeling ambiguity in MIL is tackled and the frame relevances of training video data with respect to the semantic concepts of interest are given. Promising experimental results have demonstrated the effectiveness of the proposed method.
AB - Traditional video classification methods typically require a large number of labeled training video frames to achieve satisfactory performance. However, in the real world, we usually only have sufficient labeled video clips (such as tagged online videos) but lack labeled video frames. In this paper, we formalize the video classification problem as a Multi-Instance Learning (MIL) problem, an emerging topic in machine learning in recent years, which only needs bag (video clip) labels. To solve the problem, we propose a novel Parameterized Class-to-Bag (P-C2B) Distance method to learn the relative importance of a training instance with respect to its labeled classes, such that the instance level labeling ambiguity in MIL is tackled and the frame relevances of training video data with respect to the semantic concepts of interest are given. Promising experimental results have demonstrated the effectiveness of the proposed method.
KW - Multi-instance learning
KW - Video classification
UR - http://www.scopus.com/inward/record.url?scp=84455201982&partnerID=8YFLogxK
U2 - 10.1145/2072298.2072011
DO - 10.1145/2072298.2072011
M3 - 会议稿件
AN - SCOPUS:84455201982
SN - 9781450306164
T3 - MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops
SP - 1345
EP - 1348
BT - MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops
T2 - 19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11
Y2 - 28 November 2011 through 1 December 2011
ER -