Learning regularized LDA by clustering

Yanwei Pang; Shuang Wang; Yuan Yuan

doi:10.1109/TNNLS.2014.2306844

Research output: Contribution to journal › Article › peer-review

105 Scopus citations

Abstract

As a supervised dimensionality reduction technique, linear discriminant analysis (LDA) overfits severely when the number of training samples per class is small. The main reason is that the between- and within-class scatter matrices computed from the limited number of training samples deviate greatly from the underlying ones. To overcome the problem without increasing the number of training samples, we propose using the structure of the given training data to regularize the between- and within-class scatter matrices simultaneously, by the between- and within-cluster scatter matrices, respectively. The within- and between-cluster matrices are computed from data clustered without supervision. The within-cluster scatter matrix helps encode the possible intraclass variations, and the between-cluster scatter matrix is useful for separating different classes. The contributions are inversely proportional to the number of training samples per class, so the advantages of the proposed method become more pronounced as the number of training samples per class decreases. Experimental results on the AR and FERET face databases demonstrate the effectiveness of the proposed method.
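The idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the additive combination, the weight `alpha = c / n_per_class`, the small ridge term `eps`, the plain k-means step, and all function names are assumptions chosen only to mirror the statement that the cluster-based contributions are inversely proportional to the number of training samples per class.

```python
import numpy as np


def scatter_matrices(X, groups):
    """Return (between, within) scatter matrices of X for a given grouping."""
    mean_all = X.mean(axis=0)
    d = X.shape[1]
    S_b = np.zeros((d, d))
    S_w = np.zeros((d, d))
    for g in np.unique(groups):
        Xg = X[groups == g]
        mean_g = Xg.mean(axis=0)
        diff = (mean_g - mean_all)[:, None]
        S_b += len(Xg) * (diff @ diff.T)      # spread of group means
        centered = Xg - mean_g
        S_w += centered.T @ centered          # spread inside each group
    return S_b, S_w


def kmeans_labels(X, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means (an assumed clustering step); returns cluster indices."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels


def cluster_regularized_lda(X, y, n_clusters, n_components, c=1.0, eps=1e-6):
    """Hypothetical sketch: LDA with class scatter regularized by cluster scatter."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    S_b, S_w = scatter_matrices(X, y)                 # supervised scatter
    clusters = kmeans_labels(X, n_clusters)           # labels are not used here
    S_bc, S_wc = scatter_matrices(X, clusters)        # unsupervised scatter
    n_per_class = len(X) / len(np.unique(y))
    alpha = c / n_per_class                           # assumed weighting rule
    S_b_reg = S_b + alpha * S_bc
    S_w_reg = S_w + alpha * S_wc + eps * np.eye(X.shape[1])
    # Leading eigenvectors of S_w_reg^{-1} S_b_reg give the projection.
    evals, evecs = np.linalg.eig(np.linalg.solve(S_w_reg, S_b_reg))
    order = np.argsort(-evals.real)[:n_components]
    return evecs[:, order].real


# Toy data: 3 classes, 3 samples each, 5 features (synthetic, for illustration).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=m, size=(3, 5)) for m in (0.0, 3.0, 6.0)])
y = np.repeat([0, 1, 2], 3)
W = cluster_regularized_lda(X, y, n_clusters=3, n_components=2)
Z = X @ W  # projected samples
```

With few samples per class, `alpha` is large and the cluster-based matrices dominate; as the number of samples per class grows, the sketch approaches ordinary LDA. The actual weighting used in the paper may differ.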

Original language	English
Article number	6799229
Pages (from-to)	2191-2201
Number of pages	11
Journal	IEEE Transactions on Neural Networks and Learning Systems
Volume	25
Issue number	12
DOIs	https://doi.org/10.1109/TNNLS.2014.2306844
State	Published - 1 Dec 2014
Externally published	Yes

Keywords

Dimensionality reduction
face recognition
feature extraction
linear discriminant analysis (LDA)

Access to Document

10.1109/TNNLS.2014.2306844

Cite this

@article{8eee06a0f354435ea312ebd7cf667e15,

title = "Learning regularized LDA by clustering",

abstract = "As a supervised dimensionality reduction technique, linear discriminant analysis has a serious overfitting problem when the number of training samples per class is small. The main reason is that the between- and within-class scatter matrices computed from the limited number of training samples deviate greatly from the underlying ones. To overcome the problem without increasing the number of training samples, we propose making use of the structure of the given training data to regularize the between- and within-class scatter matrices by between- and within-cluster scatter matrices, respectively, and simultaneously. The within- and between-cluster matrices are computed from unsupervised clustered data. The within-cluster scatter matrix contributes to encoding the possible variations in intraclasses and the between-cluster scatter matrix is useful for separating extra classes. The contributions are inversely proportional to the number of training samples per class. The advantages of the proposed method become more remarkable as the number of training samples per class decreases. Experimental results on the AR and Feret face databases demonstrate the effectiveness of the proposed method.",

keywords = "Dimensionality reduction, face recognition, feature extraction, linear discriminant analysis (LDA).",

author = "Yanwei Pang and Shuang Wang and Yuan Yuan",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2014",

month = dec,

day = "1",

doi = "10.1109/TNNLS.2014.2306844",

language = "English",

volume = "25",

pages = "2191--2201",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "12",

}

TY - JOUR

T1 - Learning regularized LDA by clustering

AU - Pang, Yanwei

AU - Wang, Shuang

AU - Yuan, Yuan

PY - 2014/12/1

Y1 - 2014/12/1

AB - As a supervised dimensionality reduction technique, linear discriminant analysis has a serious overfitting problem when the number of training samples per class is small. The main reason is that the between- and within-class scatter matrices computed from the limited number of training samples deviate greatly from the underlying ones. To overcome the problem without increasing the number of training samples, we propose making use of the structure of the given training data to regularize the between- and within-class scatter matrices by between- and within-cluster scatter matrices, respectively, and simultaneously. The within- and between-cluster matrices are computed from unsupervised clustered data. The within-cluster scatter matrix contributes to encoding the possible variations in intraclasses and the between-cluster scatter matrix is useful for separating extra classes. The contributions are inversely proportional to the number of training samples per class. The advantages of the proposed method become more remarkable as the number of training samples per class decreases. Experimental results on the AR and Feret face databases demonstrate the effectiveness of the proposed method.

KW - Dimensionality reduction

KW - face recognition

KW - feature extraction

KW - linear discriminant analysis (LDA).

UR - http://www.scopus.com/inward/record.url?scp=84913585443&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2014.2306844

DO - 10.1109/TNNLS.2014.2306844

M3 - Article

AN - SCOPUS:84913585443

SN - 2162-237X

VL - 25

SP - 2191

EP - 2201

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 12

M1 - 6799229

ER -
