Graph-Regularized Non-Negative Matrix Factorization for Single-Cell Clustering in scRNA-Seq Data

Hanjing Jiang, Mei Neng Wang, Yu An Huang, Yabing Huang

科研成果: 期刊稿件文章同行评审

摘要

The advent of single-cell RNA sequencing (scRNA-seq) has brought forth fresh perspectives on intricate biological processes, revealing the nuances and divergences present among distinct cells. Accurate single-cell analysis is a crucial prerequisite for in-depth investigation into the underlying mechanisms of heterogeneity. Due to various technical noises, like the impact of dropout values, scRNA-seq data remains challenging to interpret. In this work, we propose an unsupervised learning framework for scRNA-seq data analysis (aka Sc-GNNMF). Based on the non-negativity and sparsity of scRNA-seq data, we propose employing graph-regularized non-negative matrix factorization (GNNMF) algorithm for the analysis of scRNA-seq data, which involves estimating cell-cell sparse similarity and gene-gene sparse similarity through Laplacian kernels and p-nearest neighbor graphs (p-NNG). By assuming intrinsic geometric local invariance, we use a weighted p-nearest known neighbors (p-NKN) to optimize the scRNA-seq data. The optimized scRNA-seq data then participates in the matrix decomposition process, promoting the closeness of cells with similar types in cell-gene data space and determining a more suitable embedding space for clustering. Sc-GNNMF demonstrates superior performance compared to other methods and maintains satisfactory compatibility and robustness, as evidenced by experiments on 11 real scRNA-seq datasets. Furthermore, Sc-GNNMF yields excellent results in clustering tasks, extracting useful gene markers, and pseudo-temporal analysis.

源语言英语
页(从-至)4986-4994
页数9
期刊IEEE Journal of Biomedical and Health Informatics
28
8
DOI
出版状态已出版 - 2024

指纹

探究 'Graph-Regularized Non-Negative Matrix Factorization for Single-Cell Clustering in scRNA-Seq Data' 的科研主题。它们共同构成独一无二的指纹。

引用此