A novel visual codebook model based on fuzzy geometry for large-scale image classification

Yanshan Li; Qinghua Huang; Weixin Xie; Xuelong Li

doi:10.1016/j.patcog.2015.02.010

Research output: Contribution to journal › Article › peer-review

20 Scopus citations

Abstract

The codebook model has been developed as an effective means for image classification. However, the inherent operation of assigning visual words to image feature vectors in traditional codebook approaches causes serious ambiguities in image classification. In particular, the nearest word may not be the best fit to a feature, and multiple words may be equally appropriate for one specific feature. To resolve these ambiguities, we propose a novel visual codebook model based on the n-dimensional fuzzy geometry (n-D FG) theory, where all visual words and features are modeled as fuzzy points in the n-D FG space, and appropriate uncertainty is introduced to each fuzzy point to enhance the representation capacity. This n-D FG-codebook model not only inherits advantages from the fuzzy set theory, but also facilitates the analysis and determination of the relationship between visual words and features in geometric form. By explicitly taking into account the ambiguities, we propose a novel measure of similarity between the visual words and fuzzy features. Following the proposed codebook model and the novel similarity measure, we develop two useful image classification algorithms by modifying popular image coding algorithms (i.e. SPM and LLC). Finally, experimental results demonstrate that the classification accuracy of the proposed algorithms is dramatically improved for a standard large-scale image database. For example, with a codebook size of 256, the proposed algorithms achieve similar performance as traditional algorithms with a codebook size of 1024, indicating that the proposed algorithms reduce the computational cost by 75% while achieving almost identical classification accuracy to traditional algorithms. Thus, the proposed algorithms represent a more efficient and appropriate scheme for big image data.
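The ambiguity described in the abstract, and the arithmetic behind the 75% figure, can be illustrated with a small sketch. This is not the paper's n-D FG similarity measure, which the abstract does not specify. It contrasts classic nearest-word (hard) assignment with a generic Gaussian-kernel soft assignment, which is swapped in here purely for illustration. The function names, the `sigma` parameter, and the toy 2-D codebook are all assumptions, as is the premise that coding cost scales roughly linearly with codebook size.

```python
import numpy as np


def squared_distances(features, codebook):
    """Pairwise squared Euclidean distances, shape (n_features, n_words)."""
    diff = features[:, None, :] - codebook[None, :, :]
    return (diff ** 2).sum(axis=2)


def hard_assign(features, codebook):
    """Classic vector quantization: index of the nearest visual word."""
    return squared_distances(features, codebook).argmin(axis=1)


def soft_assign(features, codebook, sigma=1.0):
    """Generic Gaussian-kernel soft assignment (illustrative only,
    NOT the n-D FG measure from the paper). Each row sums to 1."""
    weights = np.exp(-squared_distances(features, codebook) / (2.0 * sigma ** 2))
    return weights / weights.sum(axis=1, keepdims=True)


# Toy codebook with two visual words (hypothetical data).
codebook = np.array([[0.0, 0.0], [2.0, 0.0]])
# First feature is equidistant from both words; second is close to word 0.
features = np.array([[1.0, 0.0], [0.1, 0.0]])

hard = hard_assign(features, codebook)
soft = soft_assign(features, codebook)

# Hard assignment silently breaks the tie in favour of word 0,
# while the soft weights expose that both words fit equally well.
print(hard)
print(soft)

# Relative saving if coding cost is assumed proportional to codebook size.
reduction = 1.0 - 256 / 1024
print(reduction)
```

For the equidistant feature, hard assignment must pick one word arbitrarily, while the soft weights split evenly between the two. Under the linear-cost assumption, shrinking the codebook from 1024 to 256 words gives 1 - 256/1024 = 0.75, which matches the 75% reduction quoted in the abstract.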

Original language	English
Pages (from-to)	3125-3134
Number of pages	10
Journal	Pattern Recognition
Volume	48
Issue number	10
DOIs	https://doi.org/10.1016/j.patcog.2015.02.010
State	Published - 1 Oct 2015
Externally published	Yes

Keywords

Codebook
Fuzzy geometry
Fuzzy set theory
Image classification

Access to Document

10.1016/j.patcog.2015.02.010

Cite this

@article{6f17cdebf4664d16b5b056522e490f6c,

title = "A novel visual codebook model based on fuzzy geometry for large-scale image classification",

abstract = "The codebook model has been developed as an effective means for image classification. However, the inherent operation of assigning visual words to image feature vectors in traditional codebook approaches causes serious ambiguities in image classification. In particular, the nearest word may not be the best fit to a feature, and multiple words may be equally appropriate for one specific feature. To resolve these ambiguities, we propose a novel visual codebook model based on the n-dimensional fuzzy geometry (n-D FG) theory, where all visual words and features are modeled as fuzzy points in the n-D FG space, and appropriate uncertainty is introduced to each fuzzy point to enhance the representation capacity. This n-D FG-codebook model not only inherits advantages from the fuzzy set theory, but also facilitates the analysis and determination of the relationship between visual words and features in geometric form. By explicitly taking into account the ambiguities, we propose a novel measure of similarity between the visual words and fuzzy features. Following the proposed codebook model and the novel similarity measure, we develop two useful image classification algorithms by modifying popular image coding algorithms (i.e. SPM and LLC). Finally, experimental results demonstrate that the classification accuracy of the proposed algorithms is dramatically improved for a standard large-scale image database. For example, with a codebook size of 256, the proposed algorithms achieve similar performance as traditional algorithms with a codebook size of 1024, indicating that the proposed algorithms reduce the computational cost by 75% while achieving almost identical classification accuracy to traditional algorithms. Thus, the proposed algorithms represent a more efficient and appropriate scheme for big image data.",

keywords = "Codebook, Fuzzy geometry, Fuzzy set theory, Image classification",

author = "Yanshan Li and Qinghua Huang and Weixin Xie and Xuelong Li",

year = "2015",

month = oct,

day = "1",

doi = "10.1016/j.patcog.2015.02.010",

language = "English",

volume = "48",

pages = "3125--3134",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd",

number = "10",

}

TY - JOUR

T1 - A novel visual codebook model based on fuzzy geometry for large-scale image classification

AU - Li, Yanshan

AU - Huang, Qinghua

AU - Xie, Weixin

AU - Li, Xuelong

PY - 2015/10/1

Y1 - 2015/10/1

N2 - The codebook model has been developed as an effective means for image classification. However, the inherent operation of assigning visual words to image feature vectors in traditional codebook approaches causes serious ambiguities in image classification. In particular, the nearest word may not be the best fit to a feature, and multiple words may be equally appropriate for one specific feature. To resolve these ambiguities, we propose a novel visual codebook model based on the n-dimensional fuzzy geometry (n-D FG) theory, where all visual words and features are modeled as fuzzy points in the n-D FG space, and appropriate uncertainty is introduced to each fuzzy point to enhance the representation capacity. This n-D FG-codebook model not only inherits advantages from the fuzzy set theory, but also facilitates the analysis and determination of the relationship between visual words and features in geometric form. By explicitly taking into account the ambiguities, we propose a novel measure of similarity between the visual words and fuzzy features. Following the proposed codebook model and the novel similarity measure, we develop two useful image classification algorithms by modifying popular image coding algorithms (i.e. SPM and LLC). Finally, experimental results demonstrate that the classification accuracy of the proposed algorithms is dramatically improved for a standard large-scale image database. For example, with a codebook size of 256, the proposed algorithms achieve similar performance as traditional algorithms with a codebook size of 1024, indicating that the proposed algorithms reduce the computational cost by 75% while achieving almost identical classification accuracy to traditional algorithms. Thus, the proposed algorithms represent a more efficient and appropriate scheme for big image data.

AB - The codebook model has been developed as an effective means for image classification. However, the inherent operation of assigning visual words to image feature vectors in traditional codebook approaches causes serious ambiguities in image classification. In particular, the nearest word may not be the best fit to a feature, and multiple words may be equally appropriate for one specific feature. To resolve these ambiguities, we propose a novel visual codebook model based on the n-dimensional fuzzy geometry (n-D FG) theory, where all visual words and features are modeled as fuzzy points in the n-D FG space, and appropriate uncertainty is introduced to each fuzzy point to enhance the representation capacity. This n-D FG-codebook model not only inherits advantages from the fuzzy set theory, but also facilitates the analysis and determination of the relationship between visual words and features in geometric form. By explicitly taking into account the ambiguities, we propose a novel measure of similarity between the visual words and fuzzy features. Following the proposed codebook model and the novel similarity measure, we develop two useful image classification algorithms by modifying popular image coding algorithms (i.e. SPM and LLC). Finally, experimental results demonstrate that the classification accuracy of the proposed algorithms is dramatically improved for a standard large-scale image database. For example, with a codebook size of 256, the proposed algorithms achieve similar performance as traditional algorithms with a codebook size of 1024, indicating that the proposed algorithms reduce the computational cost by 75% while achieving almost identical classification accuracy to traditional algorithms. Thus, the proposed algorithms represent a more efficient and appropriate scheme for big image data.

KW - Codebook

KW - Fuzzy geometry

KW - Fuzzy set theory

KW - Image classification

UR - http://www.scopus.com/inward/record.url?scp=84931563771&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2015.02.010

DO - 10.1016/j.patcog.2015.02.010

M3 - Article

AN - SCOPUS:84931563771

SN - 0031-3203

VL - 48

SP - 3125

EP - 3134

JO - Pattern Recognition

JF - Pattern Recognition

IS - 10

ER -
