TY - JOUR
T1 - Combining K Nearest Neighbor With Nonnegative Matrix Factorization for Predicting Circrna-Disease Associations
AU - Wang, Mei Neng
AU - Xie, Xue Jun
AU - You, Zhu Hong
AU - Wong, Leon
AU - Li, Li Ping
AU - Chen, Zhan Heng
N1 - Publisher Copyright:
© 2004-2012 IEEE.
PY - 2023/9/1
Y1 - 2023/9/1
N2 - Accumulating evidences show that circular RNAs (circRNAs) play an important role in regulating gene expression, and involve in many complex human diseases. Identifying associations of circRNA with disease helps to understand the pathogenesis, treatment and diagnosis of complex diseases. Since inferring circRNA-disease associations by biological experiments is costly and time-consuming, there is an urgently need to develop a computational model to identify the association between them. In this paper, we proposed a novel method named KNN-NMF, which combines K Knearest neighbors with nonnegative matrix factorization to infer associations between circRNA and disease (KNN-NMF). Frist, we compute the Gaussian Interaction Profile (GIP) kernel similarity of circRNA and disease, the semantic similarity of disease, respectively. Then, the circRNA-disease new interaction profiles are established using weight KK nearest neighbors to reduce the false negative association impact on prediction performance. Finally, Nonnegative Matrix Factorization is implemented to predict associations of circRNA with disease. The experiment results indicate that the prediction performance of KNN-NMF outperforms the competing methods under five-fold cross-validation. Moreover, case studies of two common diseases further show that KNN-NMF can identify potential circRNA-disease associations effectively.
AB - Accumulating evidences show that circular RNAs (circRNAs) play an important role in regulating gene expression, and involve in many complex human diseases. Identifying associations of circRNA with disease helps to understand the pathogenesis, treatment and diagnosis of complex diseases. Since inferring circRNA-disease associations by biological experiments is costly and time-consuming, there is an urgently need to develop a computational model to identify the association between them. In this paper, we proposed a novel method named KNN-NMF, which combines K Knearest neighbors with nonnegative matrix factorization to infer associations between circRNA and disease (KNN-NMF). Frist, we compute the Gaussian Interaction Profile (GIP) kernel similarity of circRNA and disease, the semantic similarity of disease, respectively. Then, the circRNA-disease new interaction profiles are established using weight KK nearest neighbors to reduce the false negative association impact on prediction performance. Finally, Nonnegative Matrix Factorization is implemented to predict associations of circRNA with disease. The experiment results indicate that the prediction performance of KNN-NMF outperforms the competing methods under five-fold cross-validation. Moreover, case studies of two common diseases further show that KNN-NMF can identify potential circRNA-disease associations effectively.
KW - circRNA-disease association
KW - Gaussian kernel similarity
KW - nearest neighbor
KW - nonnegative Matrix Factorization
KW - semantic similarity
UR - http://www.scopus.com/inward/record.url?scp=85131833079&partnerID=8YFLogxK
U2 - 10.1109/TCBB.2022.3180903
DO - 10.1109/TCBB.2022.3180903
M3 - 文章
C2 - 35675235
AN - SCOPUS:85131833079
SN - 1545-5963
VL - 20
SP - 2610
EP - 2618
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 5
ER -