TY - JOUR
T1 - Locality constrained encoding of frequency and spatial information for image classification
AU - Pan, Yongsheng
AU - Xia, Yong
AU - Song, Yang
AU - Cai, Weidong
N1 - Publisher Copyright:
© 2018, Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2018/10/1
Y1 - 2018/10/1
N2 - The bag-of-feature (BoF) model provides a way to construct high-level representation for image classification. Although spatial pyramid matching (SPM) has been incorporated into many of its extensions, these models intrinsically lack the mechanism to utilize frequency domain information. In this paper, we propose the locality-constrained encoding of frequency and spatial information (LEFSI) algorithm, in which an image is decomposed into multiple frequency components and each component is further decomposed into subregions using SPM. The scale-invariant feature transform (SIFT) descriptors are first calculated in each subregion, and then converted into a global descriptor by using the codebook generated on a category-by-category basis and locality-constrained linear coding (LLC). The image feature is defined as the concatenation of global descriptors constructed in all subregions. We evaluated this algorithm against several state-of-the-art models on six benchmark datasets. Our results suggest that the proposed LEFSI algorithm can describe images more effectively and provide more accurate image classification.
AB - The bag-of-feature (BoF) model provides a way to construct high-level representation for image classification. Although spatial pyramid matching (SPM) has been incorporated into many of its extensions, these models intrinsically lack the mechanism to utilize frequency domain information. In this paper, we propose the locality-constrained encoding of frequency and spatial information (LEFSI) algorithm, in which an image is decomposed into multiple frequency components and each component is further decomposed into subregions using SPM. The scale-invariant feature transform (SIFT) descriptors are first calculated in each subregion, and then converted into a global descriptor by using the codebook generated on a category-by-category basis and locality-constrained linear coding (LLC). The image feature is defined as the concatenation of global descriptors constructed in all subregions. We evaluated this algorithm against several state-of-the-art models on six benchmark datasets. Our results suggest that the proposed LEFSI algorithm can describe images more effectively and provide more accurate image classification.
KW - Bag-of-features (BoF)
KW - Image classification
KW - Image decomposition
KW - Spatial pyramid matching (SPM)
KW - Wavelet transform
UR - http://www.scopus.com/inward/record.url?scp=85052680175&partnerID=8YFLogxK
U2 - 10.1007/s11042-018-5712-3
DO - 10.1007/s11042-018-5712-3
M3 - 文章
AN - SCOPUS:85052680175
SN - 1380-7501
VL - 77
SP - 24891
EP - 24907
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 19
ER -