Clustering and projected clustering with adaptive neighbors

Feiping Nie; Xiaoqian Wang; Heng Huang

doi:10.1145/2623330.2623726

Clustering and projected clustering with adaptive neighbors

Feiping Nie, Xiaoqian Wang, Heng Huang

University of Texas at Arlington

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

932 Scopus citations

Abstract

Many clustering methods partition the data groups based on the input data similarity matrix. Thus, the clustering results highly depend on the data similarity learning. Because the similarity measurement and data clustering are often conducted in two separated steps, the learned data similarity may not be the optimal one for data clustering and lead to the suboptimal results. In this paper, we propose a novel clustering model to learn the data similarity matrix and clustering structure simultaneously. Our new model learns the data similarity matrix by assigning the adaptive and optimal neighbors for each data point based on the local distances. Meanwhile, the new rank constraint is imposed to the Laplacian matrix of the data similarity matrix, such that the connected components in the resulted similarity matrix are exactly equal to the cluster number. We derive an efficient algorithm to optimize the proposed challenging problem, and show the theoretical analysis on the connections between our method and the K-means clustering, and spectral clustering. We also further extend the new clustering model for the projected clustering to handle the high-dimensional data. Extensive empirical results on both synthetic data and real-world benchmark data sets show that our new clustering methods consistently outperforms the related clustering approaches.

Original language	English
Title of host publication	KDD 2014 - Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Publisher	Association for Computing Machinery
Pages	977-986
Number of pages	10
ISBN (Print)	9781450329569
DOIs	https://doi.org/10.1145/2623330.2623726
State	Published - 2014
Externally published	Yes
Event	20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2014 - New York, NY, United States Duration: 24 Aug 2014 → 27 Aug 2014

Publication series

Name	Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Conference

Conference	20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2014
Country/Territory	United States
City	New York, NY
Period	24/08/14 → 27/08/14

Keywords

adaptive neighbors
block diagonal similarity matrix
clustering
clustering with dimensionality reduction

Access to Document

10.1145/2623330.2623726

Cite this

Nie, F., Wang, X., & Huang, H. (2014). Clustering and projected clustering with adaptive neighbors. In KDD 2014 - Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 977-986). (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining). Association for Computing Machinery. https://doi.org/10.1145/2623330.2623726

Nie, Feiping ; Wang, Xiaoqian ; Huang, Heng. / Clustering and projected clustering with adaptive neighbors. KDD 2014 - Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, 2014. pp. 977-986 (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).

@inproceedings{2a7aa210331849a6ba62922150984767,

title = "Clustering and projected clustering with adaptive neighbors",

abstract = "Many clustering methods partition the data groups based on the input data similarity matrix. Thus, the clustering results highly depend on the data similarity learning. Because the similarity measurement and data clustering are often conducted in two separated steps, the learned data similarity may not be the optimal one for data clustering and lead to the suboptimal results. In this paper, we propose a novel clustering model to learn the data similarity matrix and clustering structure simultaneously. Our new model learns the data similarity matrix by assigning the adaptive and optimal neighbors for each data point based on the local distances. Meanwhile, the new rank constraint is imposed to the Laplacian matrix of the data similarity matrix, such that the connected components in the resulted similarity matrix are exactly equal to the cluster number. We derive an efficient algorithm to optimize the proposed challenging problem, and show the theoretical analysis on the connections between our method and the K-means clustering, and spectral clustering. We also further extend the new clustering model for the projected clustering to handle the high-dimensional data. Extensive empirical results on both synthetic data and real-world benchmark data sets show that our new clustering methods consistently outperforms the related clustering approaches.",

keywords = "adaptive neighbors, block diagonal similarity matrix, clustering, clustering with dimensionality reduction",

author = "Feiping Nie and Xiaoqian Wang and Heng Huang",

year = "2014",

doi = "10.1145/2623330.2623726",

language = "英语",

isbn = "9781450329569",

series = "Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining",

publisher = "Association for Computing Machinery",

pages = "977--986",

booktitle = "KDD 2014 - Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining",

note = "20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2014 ; Conference date: 24-08-2014 Through 27-08-2014",

}

Nie, F, Wang, X & Huang, H 2014, Clustering and projected clustering with adaptive neighbors. in KDD 2014 - Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, pp. 977-986, 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2014, New York, NY, United States, 24/08/14. https://doi.org/10.1145/2623330.2623726

Clustering and projected clustering with adaptive neighbors. / Nie, Feiping; Wang, Xiaoqian; Huang, Heng.
KDD 2014 - Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, 2014. p. 977-986 (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Clustering and projected clustering with adaptive neighbors

AU - Nie, Feiping

AU - Wang, Xiaoqian

AU - Huang, Heng

PY - 2014

Y1 - 2014

N2 - Many clustering methods partition the data groups based on the input data similarity matrix. Thus, the clustering results highly depend on the data similarity learning. Because the similarity measurement and data clustering are often conducted in two separated steps, the learned data similarity may not be the optimal one for data clustering and lead to the suboptimal results. In this paper, we propose a novel clustering model to learn the data similarity matrix and clustering structure simultaneously. Our new model learns the data similarity matrix by assigning the adaptive and optimal neighbors for each data point based on the local distances. Meanwhile, the new rank constraint is imposed to the Laplacian matrix of the data similarity matrix, such that the connected components in the resulted similarity matrix are exactly equal to the cluster number. We derive an efficient algorithm to optimize the proposed challenging problem, and show the theoretical analysis on the connections between our method and the K-means clustering, and spectral clustering. We also further extend the new clustering model for the projected clustering to handle the high-dimensional data. Extensive empirical results on both synthetic data and real-world benchmark data sets show that our new clustering methods consistently outperforms the related clustering approaches.

AB - Many clustering methods partition the data groups based on the input data similarity matrix. Thus, the clustering results highly depend on the data similarity learning. Because the similarity measurement and data clustering are often conducted in two separated steps, the learned data similarity may not be the optimal one for data clustering and lead to the suboptimal results. In this paper, we propose a novel clustering model to learn the data similarity matrix and clustering structure simultaneously. Our new model learns the data similarity matrix by assigning the adaptive and optimal neighbors for each data point based on the local distances. Meanwhile, the new rank constraint is imposed to the Laplacian matrix of the data similarity matrix, such that the connected components in the resulted similarity matrix are exactly equal to the cluster number. We derive an efficient algorithm to optimize the proposed challenging problem, and show the theoretical analysis on the connections between our method and the K-means clustering, and spectral clustering. We also further extend the new clustering model for the projected clustering to handle the high-dimensional data. Extensive empirical results on both synthetic data and real-world benchmark data sets show that our new clustering methods consistently outperforms the related clustering approaches.

KW - adaptive neighbors

KW - block diagonal similarity matrix

KW - clustering

KW - clustering with dimensionality reduction

UR - http://www.scopus.com/inward/record.url?scp=84907031470&partnerID=8YFLogxK

U2 - 10.1145/2623330.2623726

DO - 10.1145/2623330.2623726

M3 - 会议稿件

AN - SCOPUS:84907031470

SN - 9781450329569

T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

SP - 977

EP - 986

BT - KDD 2014 - Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

PB - Association for Computing Machinery

T2 - 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2014

Y2 - 24 August 2014 through 27 August 2014

ER -

Nie F, Wang X, Huang H. Clustering and projected clustering with adaptive neighbors. In KDD 2014 - Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery. 2014. p. 977-986. (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining). doi: 10.1145/2623330.2623726

Clustering and projected clustering with adaptive neighbors

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this