TY - JOUR
T1 - DANE-MDA
T2 - Predicting microRNA-disease associations via deep attributed network embedding
AU - Ji, Bo Ya
AU - You, Zhu Hong
AU - Wang, Yi
AU - Li, Zheng Wei
AU - Wong, Leon
N1 - Publisher Copyright:
© 2021 The Authors
PY - 2021/6/25
Y1 - 2021/6/25
N2 - Predicting the microRNA-disease associations by using computational methods is conductive to the efficiency of costly and laborious traditional bio-experiments. In this study, we propose a computational machine learning-based method (DANE-MDA) that preserves integrated structure and attribute features via deep attributed network embedding to predict potential miRNA-disease associations. Specifically, the integrated features are extracted by using deep stacked auto-encoder on the diverse orders of matrixes containing structure and attribute information and are then trained by using random forest classifier. Under 5-fold cross-validation experiments, DANE-MDA yielded average accuracy, sensitivity, and AUC at 85.59%, 84.23%, and 0.9264 in term of HMDD v3.0 dataset, and 83.21%, 80.39%, and 0.9113 in term of HMDD v2.0 dataset, respectively. Additionally, case studies on breast, colon, and lung neoplasms related disease show that 47, 47, and 46 of the top 50 miRNAs can be predicted and retrieved in the other database.
AB - Predicting the microRNA-disease associations by using computational methods is conductive to the efficiency of costly and laborious traditional bio-experiments. In this study, we propose a computational machine learning-based method (DANE-MDA) that preserves integrated structure and attribute features via deep attributed network embedding to predict potential miRNA-disease associations. Specifically, the integrated features are extracted by using deep stacked auto-encoder on the diverse orders of matrixes containing structure and attribute information and are then trained by using random forest classifier. Under 5-fold cross-validation experiments, DANE-MDA yielded average accuracy, sensitivity, and AUC at 85.59%, 84.23%, and 0.9264 in term of HMDD v3.0 dataset, and 83.21%, 80.39%, and 0.9113 in term of HMDD v2.0 dataset, respectively. Additionally, case studies on breast, colon, and lung neoplasms related disease show that 47, 47, and 46 of the top 50 miRNAs can be predicted and retrieved in the other database.
KW - Cancer
KW - Computational bioinformatics
KW - Systems biology
UR - http://www.scopus.com/inward/record.url?scp=85105948138&partnerID=8YFLogxK
U2 - 10.1016/j.isci.2021.102455
DO - 10.1016/j.isci.2021.102455
M3 - 文章
AN - SCOPUS:85105948138
SN - 2589-0042
VL - 24
JO - iScience
JF - iScience
IS - 6
M1 - 102455
ER -