TY - JOUR
T1 - Unsupervised Ensemble Hashing
T2 - Boosting Minimum Hamming Distance
AU - Zha, Yufei
AU - Qiu, Zhuling
AU - Zhang, Peng
AU - Huang, Wei
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2020
Y1 - 2020
N2 - Hashing aims at learning discriminative binary codes of high-dimensional data for the approximate nearest neighbor searching. However, the distance ranking obtained by traditional methods is not optimum in the Hamming space, and it degrades the performance for retrieval tasks. To tackle the above problem, an unsupervised ensemble hashing is proposed to improve the ranking accuracy by assembling the diverse hash tables independently in this paper. We observe that the higher the accuracy is the larger diversity the base learner has, and the more effective the ensemble method is. Based on this principle, two special ensembles hashing approaches are proposed to increase diversity by bootstrap sampling with data-dependent methods. Especially, the results are better when the minimum Hamming distance is large and the variance of the Hamming distance is small. This proposed method is conducted in the experiments and the results show that it can achieve about 10%-25% performance compared with the baseline algorithm, which achieves competitive results with the state-of-the-art methods on the CIFAR-10 and LabelMe benchmarks.
AB - Hashing aims at learning discriminative binary codes of high-dimensional data for the approximate nearest neighbor searching. However, the distance ranking obtained by traditional methods is not optimum in the Hamming space, and it degrades the performance for retrieval tasks. To tackle the above problem, an unsupervised ensemble hashing is proposed to improve the ranking accuracy by assembling the diverse hash tables independently in this paper. We observe that the higher the accuracy is the larger diversity the base learner has, and the more effective the ensemble method is. Based on this principle, two special ensembles hashing approaches are proposed to increase diversity by bootstrap sampling with data-dependent methods. Especially, the results are better when the minimum Hamming distance is large and the variance of the Hamming distance is small. This proposed method is conducted in the experiments and the results show that it can achieve about 10%-25% performance compared with the baseline algorithm, which achieves competitive results with the state-of-the-art methods on the CIFAR-10 and LabelMe benchmarks.
KW - Accuracy and diversity
KW - Distance variance
KW - Ensemble method
KW - Hamming distance
KW - Unsupervised hashing
UR - http://www.scopus.com/inward/record.url?scp=85082056069&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2020.2975883
DO - 10.1109/ACCESS.2020.2975883
M3 - 文章
AN - SCOPUS:85082056069
SN - 2169-3536
VL - 8
SP - 42937
EP - 42947
JO - IEEE Access
JF - IEEE Access
M1 - 9007381
ER -