T-distributed Stochastic Neighbor Network for unsupervised representation learning

Zheng Wang; Jiaxi Xie; Feiping Nie; Rong Wang; Yanyan Jia; Shichang Liu

doi:10.1016/j.neunet.2024.106520

T-distributed Stochastic Neighbor Network for unsupervised representation learning

Zheng Wang, Jiaxi Xie, Feiping Nie, Rong Wang, Yanyan Jia, Shichang Liu

光电与智能研究院

科研成果: 期刊稿件 › 文章 › 同行评审

2 引用（Scopus）

摘要

Unsupervised representation learning (URL) is still lack of a reasonable operator (e.g. convolution kernel) for exploring meaningful structural information from generic data including vector, image and tabular data. In this paper, we propose a simple end-to-end T-distributed Stochastic Neighbor Network (TsNet) for URL with clustering downstream task. Concretely, our TsNet model has three major components: (1) an adaptive connectivity distribution learning module is presented to construct a pairwise graph for preserving the local structure of generic data; (2) a T-distributed stochastic neighbor embedding based loss function is designed to learn a transformation between embeddings and original data, which improves the discrimination of representations; (3) a nonlinear parametric mapping is learned via our TsNet on an unsupervised generalized manner, which can address the “out-of-sample” issue. By combining these components, our method is able to considerably outperform previous related unsupervised learning approaches on visualization and clustering of generic data. A simple deep neural network equipped on our model respectively achieves 74.90%, 76.56% ACC and NMI, which is 8% relative improvement over previous state-of-the-art on real single-cell RNA-sequencing (scRNA-seq) datasets clustering.

源语言	英语
文章编号	106520
期刊	Neural Networks
卷	179
DOI	https://doi.org/10.1016/j.neunet.2024.106520
出版状态	已出版 - 11月 2024

访问文件

10.1016/j.neunet.2024.106520

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{a7d8fd70fa554901bdc88f5b7a8313ee,

title = "T-distributed Stochastic Neighbor Network for unsupervised representation learning",

abstract = "Unsupervised representation learning (URL) is still lack of a reasonable operator (e.g. convolution kernel) for exploring meaningful structural information from generic data including vector, image and tabular data. In this paper, we propose a simple end-to-end T-distributed Stochastic Neighbor Network (TsNet) for URL with clustering downstream task. Concretely, our TsNet model has three major components: (1) an adaptive connectivity distribution learning module is presented to construct a pairwise graph for preserving the local structure of generic data; (2) a T-distributed stochastic neighbor embedding based loss function is designed to learn a transformation between embeddings and original data, which improves the discrimination of representations; (3) a nonlinear parametric mapping is learned via our TsNet on an unsupervised generalized manner, which can address the “out-of-sample” issue. By combining these components, our method is able to considerably outperform previous related unsupervised learning approaches on visualization and clustering of generic data. A simple deep neural network equipped on our model respectively achieves 74.90%, 76.56% ACC and NMI, which is 8% relative improvement over previous state-of-the-art on real single-cell RNA-sequencing (scRNA-seq) datasets clustering.",

keywords = "Generic data dimensionality reduction, scRNA-seq clustering, Unsupervised representation learning",

author = "Zheng Wang and Jiaxi Xie and Feiping Nie and Rong Wang and Yanyan Jia and Shichang Liu",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier Ltd",

year = "2024",

month = nov,

doi = "10.1016/j.neunet.2024.106520",

language = "英语",

volume = "179",

journal = "Neural Networks",

issn = "0893-6080",

publisher = "Elsevier Ltd",

}

TY - JOUR

T1 - T-distributed Stochastic Neighbor Network for unsupervised representation learning

AU - Wang, Zheng

AU - Xie, Jiaxi

AU - Nie, Feiping

AU - Wang, Rong

AU - Jia, Yanyan

AU - Liu, Shichang

PY - 2024/11

Y1 - 2024/11

N2 - Unsupervised representation learning (URL) is still lack of a reasonable operator (e.g. convolution kernel) for exploring meaningful structural information from generic data including vector, image and tabular data. In this paper, we propose a simple end-to-end T-distributed Stochastic Neighbor Network (TsNet) for URL with clustering downstream task. Concretely, our TsNet model has three major components: (1) an adaptive connectivity distribution learning module is presented to construct a pairwise graph for preserving the local structure of generic data; (2) a T-distributed stochastic neighbor embedding based loss function is designed to learn a transformation between embeddings and original data, which improves the discrimination of representations; (3) a nonlinear parametric mapping is learned via our TsNet on an unsupervised generalized manner, which can address the “out-of-sample” issue. By combining these components, our method is able to considerably outperform previous related unsupervised learning approaches on visualization and clustering of generic data. A simple deep neural network equipped on our model respectively achieves 74.90%, 76.56% ACC and NMI, which is 8% relative improvement over previous state-of-the-art on real single-cell RNA-sequencing (scRNA-seq) datasets clustering.

AB - Unsupervised representation learning (URL) is still lack of a reasonable operator (e.g. convolution kernel) for exploring meaningful structural information from generic data including vector, image and tabular data. In this paper, we propose a simple end-to-end T-distributed Stochastic Neighbor Network (TsNet) for URL with clustering downstream task. Concretely, our TsNet model has three major components: (1) an adaptive connectivity distribution learning module is presented to construct a pairwise graph for preserving the local structure of generic data; (2) a T-distributed stochastic neighbor embedding based loss function is designed to learn a transformation between embeddings and original data, which improves the discrimination of representations; (3) a nonlinear parametric mapping is learned via our TsNet on an unsupervised generalized manner, which can address the “out-of-sample” issue. By combining these components, our method is able to considerably outperform previous related unsupervised learning approaches on visualization and clustering of generic data. A simple deep neural network equipped on our model respectively achieves 74.90%, 76.56% ACC and NMI, which is 8% relative improvement over previous state-of-the-art on real single-cell RNA-sequencing (scRNA-seq) datasets clustering.

KW - Generic data dimensionality reduction

KW - scRNA-seq clustering

KW - Unsupervised representation learning

UR - http://www.scopus.com/inward/record.url?scp=85198729332&partnerID=8YFLogxK

U2 - 10.1016/j.neunet.2024.106520

DO - 10.1016/j.neunet.2024.106520

M3 - 文章

C2 - 39024709

AN - SCOPUS:85198729332

SN - 0893-6080

VL - 179

JO - Neural Networks

JF - Neural Networks

M1 - 106520

ER -

T-distributed Stochastic Neighbor Network for unsupervised representation learning

摘要

访问文件

其它文件与链接

指纹

引用此