TY - JOUR
T1 - A novel visual codebook model based on fuzzy geometry for large-scale image classification
AU - Li, Yanshan
AU - Huang, Qinghua
AU - Xie, Weixin
AU - Li, Xuelong
N1 - Publisher Copyright:
© 2015 Elsevier Ltd. All rights reserved.
PY - 2015/10/1
Y1 - 2015/10/1
N2 - The codebook model has been developed as an effective means for image classification. However, the inherent operation of assigning visual words to image feature vectors in traditional codebook approaches causes serious ambiguities in image classification. In particular, the nearest word may not be the best fit to a feature, and multiple words may be equally appropriate for one specific feature. To resolve these ambiguities, we propose a novel visual codebook model based on the n-dimensional fuzzy geometry (n-D FG) theory, where all visual words and features are modeled as fuzzy points in the n-D FG space, and appropriate uncertainty is introduced to each fuzzy point to enhance the representation capacity. This n-D FG-codebook model not only inherits advantages from the fuzzy set theory, but also facilitates the analysis and determination of the relationship between visual words and features in geometric form. By explicitly taking into account the ambiguities, we propose a novel measure of similarity between the visual words and fuzzy features. Following the proposed codebook model and the novel similarity measure, we develop two useful image classification algorithms by modifying popular image coding algorithms (i.e. SPM and LLC). Finally, experimental results demonstrate that the classification accuracy of the proposed algorithms is dramatically improved for a standard large-scale image database. For example, with a codebook size of 256, the proposed algorithms achieve similar performance as traditional algorithms with a codebook size of 1024, indicating that the proposed algorithms reduce the computational cost by 75% while achieving almost identical classification accuracy to traditional algorithms. Thus, the proposed algorithms represent a more efficient and appropriate scheme for big image data.
AB - The codebook model has been developed as an effective means for image classification. However, the inherent operation of assigning visual words to image feature vectors in traditional codebook approaches causes serious ambiguities in image classification. In particular, the nearest word may not be the best fit to a feature, and multiple words may be equally appropriate for one specific feature. To resolve these ambiguities, we propose a novel visual codebook model based on the n-dimensional fuzzy geometry (n-D FG) theory, where all visual words and features are modeled as fuzzy points in the n-D FG space, and appropriate uncertainty is introduced to each fuzzy point to enhance the representation capacity. This n-D FG-codebook model not only inherits advantages from the fuzzy set theory, but also facilitates the analysis and determination of the relationship between visual words and features in geometric form. By explicitly taking into account the ambiguities, we propose a novel measure of similarity between the visual words and fuzzy features. Following the proposed codebook model and the novel similarity measure, we develop two useful image classification algorithms by modifying popular image coding algorithms (i.e. SPM and LLC). Finally, experimental results demonstrate that the classification accuracy of the proposed algorithms is dramatically improved for a standard large-scale image database. For example, with a codebook size of 256, the proposed algorithms achieve similar performance as traditional algorithms with a codebook size of 1024, indicating that the proposed algorithms reduce the computational cost by 75% while achieving almost identical classification accuracy to traditional algorithms. Thus, the proposed algorithms represent a more efficient and appropriate scheme for big image data.
KW - Codebook
KW - Fuzzy geometry
KW - Fuzzy set theory
KW - Image classification
UR - http://www.scopus.com/inward/record.url?scp=84931563771&partnerID=8YFLogxK
U2 - 10.1016/j.patcog.2015.02.010
DO - 10.1016/j.patcog.2015.02.010
M3 - 文章
AN - SCOPUS:84931563771
SN - 0031-3203
VL - 48
SP - 3125
EP - 3134
JO - Pattern Recognition
JF - Pattern Recognition
IS - 10
ER -