TY - GEN
T1 - Robust and discriminative distance for Multi-Instance Learning
AU - Wang, Hua
AU - Nie, Feiping
AU - Huang, Heng
PY - 2012
Y1 - 2012
N2 - Multi-Instance Learning (MIL) is an emerging topic in machine learning, which has broad applications in computer vision. For example, by considering video classification as a MIL problem where we only need labeled video clips (such as tagged online videos) but not labeled video frames, one can lower down the labeling cost, which is typically very expensive. We propose a novel class specific distance Metrics enhanced Class-to-Bag distance (M-C2B) method to learn a robust and discriminative distance for multi-instance data, which employs the not-squared 2-norm distance to address the most difficult challenge in MIL, i.e., the outlier instances that abound in multi-instance data by nature. As a result, the formulated objective ends up to be a simultaneous 2, 1-norm minimization and maximization (minmax) problem, which is very hard to solve in general due to the non-smoothness of the 2, 1-norm. We thus present an efficient iterative algorithm to solve the general 2, 1-norm minmax problem with rigorously proved convergence. To the best of our knowledge, we are the first to solve a general 2, 1-norm minmax problem in literature. We have conducted extensive experiments to evaluate various aspects of the proposed method, in which promising results validate our new method in cost-effective video classification.
AB - Multi-Instance Learning (MIL) is an emerging topic in machine learning, which has broad applications in computer vision. For example, by considering video classification as a MIL problem where we only need labeled video clips (such as tagged online videos) but not labeled video frames, one can lower down the labeling cost, which is typically very expensive. We propose a novel class specific distance Metrics enhanced Class-to-Bag distance (M-C2B) method to learn a robust and discriminative distance for multi-instance data, which employs the not-squared 2-norm distance to address the most difficult challenge in MIL, i.e., the outlier instances that abound in multi-instance data by nature. As a result, the formulated objective ends up to be a simultaneous 2, 1-norm minimization and maximization (minmax) problem, which is very hard to solve in general due to the non-smoothness of the 2, 1-norm. We thus present an efficient iterative algorithm to solve the general 2, 1-norm minmax problem with rigorously proved convergence. To the best of our knowledge, we are the first to solve a general 2, 1-norm minmax problem in literature. We have conducted extensive experiments to evaluate various aspects of the proposed method, in which promising results validate our new method in cost-effective video classification.
UR - http://www.scopus.com/inward/record.url?scp=84866680990&partnerID=8YFLogxK
U2 - 10.1109/CVPR.2012.6248019
DO - 10.1109/CVPR.2012.6248019
M3 - 会议稿件
AN - SCOPUS:84866680990
SN - 9781467312264
T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
SP - 2919
EP - 2924
BT - 2012 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2012
T2 - 2012 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2012
Y2 - 16 June 2012 through 21 June 2012
ER -