TY - JOUR
T1 - A multi-view latent variable model reveals cellular heterogeneity in complex tissues for paired multimodal single-cell data
AU - Wang, Yuwei
AU - Lian, Bin
AU - Zhang, Haohui
AU - Zhong, Yuanke
AU - He, Jie
AU - Wu, Fashuai
AU - Reinert, Knut
AU - Shang, Xuequn
AU - Yang, Hui
AU - Hu, Jialu
N1 - Publisher Copyright:
© The Author(s) 2023. Published by Oxford University Press.
PY - 2023/1/1
Y1 - 2023/1/1
N2 - Motivation: Single-cell multimodal assays allow us to simultaneously measure two different molecular features of the same cell, enabling new insights into cellular heterogeneity, cell development and disease. However, most existing methods suffer from inaccurate dimensionality reduction of the joint-modality data, hindering the discovery of novel or rare cell subpopulations. Results: Here, we present VIMCCA, a computational framework based on variational-assisted multi-view canonical correlation analysis for integrating paired multimodal single-cell data. Our statistical model uses a common latent variable to explain the shared source of variance across the two data modalities. Our approach jointly learns an inference model and two modality-specific non-linear models by leveraging variational inference and deep learning. We apply VIMCCA and compare it with 10 existing state-of-the-art algorithms on four paired multimodal datasets sequenced by different protocols. Results demonstrate that VIMCCA facilitates the integration of various types of joint-modality data, leading to more reliable and accurate downstream analysis. VIMCCA improves our ability to identify novel or rare cell subtypes compared with existing widely used methods. In addition, it can facilitate inferring cell lineages from joint-modality profiles. Availability and implementation: The VIMCCA algorithm has been implemented in our toolkit package scbean (≥0.5.0), and its code has been archived at https://github.com/jhu99/scbean under the MIT license.
UR - http://www.scopus.com/inward/record.url?scp=85147046165&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btad005
DO - 10.1093/bioinformatics/btad005
M3 - Article
C2 - 36622018
AN - SCOPUS:85147046165
SN - 1367-4803
VL - 39
JO - Bioinformatics
JF - Bioinformatics
IS - 1
M1 - btad005
ER -