TY - JOUR
T1 - Multiple instance learning based on positive instance selection and bag structure construction
AU - Li, Zhan
AU - Geng, Guo Hua
AU - Feng, Jun
AU - Peng, Jin Ye
AU - Wen, Chao
AU - Liang, Jun Li
PY - 2014/4/15
Y1 - 2014/4/15
N2 - Previous studies on multiple instance learning (MIL) have shown that the MIL problem holds three characteristics: positive instance clustering, bag structure and instance probabilistic influence to bag label. In this paper, combined with the advantages of these three characteristics, we propose two simple yet effective MIL algorithms, CK-MIL and ck-MIL. We take three steps to convert MIL to a standard supervised learning problem. In the first step, we perform K-means clustering algorithm on the positive and negative sets separately to obtain the cluster centers, further use them to select the most positive instances in bags. Next, we combine three distances, including the maximum, minimum and the average distances from bag to cluster centers, as bag structure. For CK-MIL, we simply compose the positive instance and bag structure to form a new vector as bag representation, then apply RBF kernel to measure bag similarity, while for ck-MIL algorithm we construct a new kernel by introducing a probabilistic coefficient to balance the influences between the positive instance similarity and bag structure similarity. As a result, the MIL problem is converted to a standard supervised learning problem that can be solved directly by SVM method. Experiments on MUSK and COREL image set have shown that our two algorithms perform better than other key existing MIL algorithms on the drug prediction and image classification tasks.
AB - Previous studies on multiple instance learning (MIL) have shown that the MIL problem holds three characteristics: positive instance clustering, bag structure and instance probabilistic influence to bag label. In this paper, combined with the advantages of these three characteristics, we propose two simple yet effective MIL algorithms, CK-MIL and ck-MIL. We take three steps to convert MIL to a standard supervised learning problem. In the first step, we perform K-means clustering algorithm on the positive and negative sets separately to obtain the cluster centers, further use them to select the most positive instances in bags. Next, we combine three distances, including the maximum, minimum and the average distances from bag to cluster centers, as bag structure. For CK-MIL, we simply compose the positive instance and bag structure to form a new vector as bag representation, then apply RBF kernel to measure bag similarity, while for ck-MIL algorithm we construct a new kernel by introducing a probabilistic coefficient to balance the influences between the positive instance similarity and bag structure similarity. As a result, the MIL problem is converted to a standard supervised learning problem that can be solved directly by SVM method. Experiments on MUSK and COREL image set have shown that our two algorithms perform better than other key existing MIL algorithms on the drug prediction and image classification tasks.
KW - K-means clustering
KW - Multiple instance learning (MIL)
KW - Multiple kernel
KW - Support vector machine (SVM)
UR - http://www.scopus.com/inward/record.url?scp=84892141917&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2013.11.013
DO - 10.1016/j.patrec.2013.11.013
M3 - 文章
AN - SCOPUS:84892141917
SN - 0167-8655
VL - 40
SP - 19
EP - 26
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
IS - 1
ER -