TY - JOUR
T1 - Fuzzy bag of words for social image description
AU - Li, Yanshan
AU - Liu, Weiming
AU - Huang, Qinghua
AU - Li, Xuelong
N1 - Publisher Copyright:
© 2014, Springer Science+Business Media New York.
PY - 2016/2/1
Y1 - 2016/2/1
N2 - Rapid growth of social media resources brings huge challenges and opportunities for image description technologies. The performance of image description method directly affects the accuracy of image retrieval, image annotation and image recognition. Bag of Words (BoW) as an efficient approach to describing the images has been attracting more and more attention. However, in traditional BoW, the maps between the words in the codebook and the features extracted from the images are actually ambiguous. As the Fuzzy Sets Theory (FST) is a powerful means for dealing with uncertainty efficiently, we utilize the FST to solve the problem caused by the ambiguity between the features and words. Accordingly, we propose a new type of BoW named as FBoW to describe images based on FST. Firstly, the features are extracted from the images. Secondly, k-means is utilized to learn the codebook. Thirdly, a fuzzy membership function is designed to measure the similarity between the features and words. The optimal parameters of the fuzzy membership function are obtained by using a Genetic Algorithm (GA). The histogram is generated by adding up the fuzzy membership values of each word to describe the images. The experimental results show that the proposed FBoW outperforms traditional BoW for social image description.
AB - Rapid growth of social media resources brings huge challenges and opportunities for image description technologies. The performance of image description method directly affects the accuracy of image retrieval, image annotation and image recognition. Bag of Words (BoW) as an efficient approach to describing the images has been attracting more and more attention. However, in traditional BoW, the maps between the words in the codebook and the features extracted from the images are actually ambiguous. As the Fuzzy Sets Theory (FST) is a powerful means for dealing with uncertainty efficiently, we utilize the FST to solve the problem caused by the ambiguity between the features and words. Accordingly, we propose a new type of BoW named as FBoW to describe images based on FST. Firstly, the features are extracted from the images. Secondly, k-means is utilized to learn the codebook. Thirdly, a fuzzy membership function is designed to measure the similarity between the features and words. The optimal parameters of the fuzzy membership function are obtained by using a Genetic Algorithm (GA). The histogram is generated by adding up the fuzzy membership values of each word to describe the images. The experimental results show that the proposed FBoW outperforms traditional BoW for social image description.
KW - Bag of words
KW - Fuzzy sets theory
KW - Image description
KW - Social images
UR - http://www.scopus.com/inward/record.url?scp=84960396639&partnerID=8YFLogxK
U2 - 10.1007/s11042-014-2138-4
DO - 10.1007/s11042-014-2138-4
M3 - 文章
AN - SCOPUS:84960396639
SN - 1380-7501
VL - 75
SP - 1371
EP - 1390
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 3
ER -