TY - JOUR
T1 - Harmonic Mean Linear Discriminant Analysis
AU - Zheng, Shuai
AU - Ding, Chris
AU - Nie, Feiping
AU - Huang, Heng
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2019/8/1
Y1 - 2019/8/1
N2 - In machine learning and data mining, dimensionality reduction is one of the main tasks. Linear Discriminant Analysis (LDA) is a widely used supervised dimensionality reduction algorithm and it has attracted a lot of research interests. Classical Linear Discriminant Analysis finds a subspace to minimize within-class distance and maximize between-class distance, where between-class distance is computed using arithmetic mean of all between-class distances. However, arithmetic mean between-class distance has some limitations. First, arithmetic mean gives equal weight to all between-class distances, and large between-class distance could dominate the result. Second, it does not consider pairwise between-class distance and thus some classes may overlap with each other in the subspace. In this paper, we propose two formulations of harmonic mean based Linear Discriminant Analysis: HLDA and HLDAp, to demonstrate the benefit of harmonic mean between-class distance and overcome the limitations of classical LDA. We compare our algorithm with 11 existing single-label algorithms on seven datasets and five existing multi-label algorithms on two datasets. On some single-label experiment data, the classification accuracy absolute percentage increase can reach 39 percent compared to state-of-Art existing algorithms; on multi-label data, significant improvement on five evaluation metric has been achieved compared to existing algorithms.
AB - In machine learning and data mining, dimensionality reduction is one of the main tasks. Linear Discriminant Analysis (LDA) is a widely used supervised dimensionality reduction algorithm and it has attracted a lot of research interests. Classical Linear Discriminant Analysis finds a subspace to minimize within-class distance and maximize between-class distance, where between-class distance is computed using arithmetic mean of all between-class distances. However, arithmetic mean between-class distance has some limitations. First, arithmetic mean gives equal weight to all between-class distances, and large between-class distance could dominate the result. Second, it does not consider pairwise between-class distance and thus some classes may overlap with each other in the subspace. In this paper, we propose two formulations of harmonic mean based Linear Discriminant Analysis: HLDA and HLDAp, to demonstrate the benefit of harmonic mean between-class distance and overcome the limitations of classical LDA. We compare our algorithm with 11 existing single-label algorithms on seven datasets and five existing multi-label algorithms on two datasets. On some single-label experiment data, the classification accuracy absolute percentage increase can reach 39 percent compared to state-of-Art existing algorithms; on multi-label data, significant improvement on five evaluation metric has been achieved compared to existing algorithms.
KW - Dimensionality reduction
KW - linear discriminant analysis
KW - subspace learning
UR - http://www.scopus.com/inward/record.url?scp=85050979922&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2018.2861858
DO - 10.1109/TKDE.2018.2861858
M3 - 文章
AN - SCOPUS:85050979922
SN - 1041-4347
VL - 31
SP - 1520
EP - 1531
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 8
M1 - 8424045
ER -