TY - GEN
T1 - A deconvolution method for predicting cell abundance based on single cell RNA-seq data
AU - Peng, Jiajie
AU - Han, Lu
AU - Shang, Xuequn
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - It is important to understand the cell-type composition and its proportion in tissues. Some previous experiments have shown that the variation of gene expression in certain cell types may lead to disease. Although cell type composition and proportion can be obtained by single-cell RNA-sequencing (scRNA-sq), using scRNA-sq is expensive and cannot be applied in clinical studies involving a large number of subjects currently. Therefore, it is urgent to develop a method to deconvolute the Bulk RNA-Seq data to obtain the cell type composition in the tissue. Most of the existing methods require the signature matrix, which provides the cell type-specific gene expression profile, as input. However, the signature matrix is not always available for some types of tissue, and it is not always possible to find a suitable cell type-specific gene expression profile. To solve this problem, we propose a novel method, named DCap, to predict cell abundance. Different from non-negative least squares, DCap performs weighted iterative calculation based on least squares. By weighting bulk tissue gene expression matrix and single-cell gene expression matrix, DCap minimizes the measurement error of Bulk RNA-Seq and error resulting from the difference in the amount of genes in the same cell type among different samples. DCap solves the deconvolution problem by using weighted non-negative least squares to predict cell type abundance. DCap does not need to prepare a suitable signature matrix in advance, and the evaluation test shows that DCap performs better in cell type abundance prediction than existing methods.
AB - It is important to understand the cell-type composition and its proportion in tissues. Some previous experiments have shown that the variation of gene expression in certain cell types may lead to disease. Although cell type composition and proportion can be obtained by single-cell RNA-sequencing (scRNA-sq), using scRNA-sq is expensive and cannot be applied in clinical studies involving a large number of subjects currently. Therefore, it is urgent to develop a method to deconvolute the Bulk RNA-Seq data to obtain the cell type composition in the tissue. Most of the existing methods require the signature matrix, which provides the cell type-specific gene expression profile, as input. However, the signature matrix is not always available for some types of tissue, and it is not always possible to find a suitable cell type-specific gene expression profile. To solve this problem, we propose a novel method, named DCap, to predict cell abundance. Different from non-negative least squares, DCap performs weighted iterative calculation based on least squares. By weighting bulk tissue gene expression matrix and single-cell gene expression matrix, DCap minimizes the measurement error of Bulk RNA-Seq and error resulting from the difference in the amount of genes in the same cell type among different samples. DCap solves the deconvolution problem by using weighted non-negative least squares to predict cell type abundance. DCap does not need to prepare a suitable signature matrix in advance, and the evaluation test shows that DCap performs better in cell type abundance prediction than existing methods.
KW - bioinformatics
KW - cell abundance prediction
KW - deconvolution
KW - weighted least squares
UR - http://www.scopus.com/inward/record.url?scp=85084337383&partnerID=8YFLogxK
U2 - 10.1109/BIBM47256.2019.8983123
DO - 10.1109/BIBM47256.2019.8983123
M3 - 会议稿件
AN - SCOPUS:85084337383
T3 - Proceedings - 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019
SP - 1769
EP - 1773
BT - Proceedings - 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019
A2 - Yoo, Illhoi
A2 - Bi, Jinbo
A2 - Hu, Xiaohua Tony
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019
Y2 - 18 November 2019 through 21 November 2019
ER -