TY - GEN
T1 - Prediction of LncRNA-Disease Associations Based on Network Representation Learning
AU - Su, Xiaorui
AU - You, Zhuhong
AU - Yi, Haicheng
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/12/16
Y1 - 2020/12/16
N2 - Massive observations have indicated that long noncoding RNAs (lncRNAs) are crucial in a number of biological processes and associated with various human diseases. Developing an efficient calculation model to predict the associations between lncRNA and diseases is not only beneficial to disease diagnosis, treatment, prognosis and potential drug targets in drug discovery, but also avoid the waste of human and material resources brought by biological experiments. In this paper, we proposed a novel prediction of lncRNA-disease associations based on complex and comprehensive molecular associations network (MAN), which integrated nine kinds of interactions among five molecules, including lncRNA, miRNA, disease, drug and protein. Network embedding Node2vec method was applied to extract behavior feature from MAN to generate a low-dimension vector containing nodes and edges information. After implementing 5-fold cross validation, the proposed method yielded good prediction performance with an average Accuracy of 91.91%, Sensitivity of 94.05%, Specificity of 89.76%, Precision of 90.21%, MCC value of 83.91%, AUC value of 0.9746 and AUPR of 0.9693. Comparative experiment indicates the behavior feature extracted by Node2vec is more representative than attribute features of lncRNA adopted 3-mer and diseases extracted by semantic similarity. Moreover, breast cancer, colon cancer and lung cancer are explored in case study. As a results, more than half of top 5 interactions are successfully confirmed for each disease by other datasets. Based on these reliable results, it is anticipated that proposed model is feasible and effective to predict lncRNA-disease associations at a global molecules level, which is a new respective for future biomedical researches.
AB - Massive observations have indicated that long noncoding RNAs (lncRNAs) are crucial in a number of biological processes and associated with various human diseases. Developing an efficient calculation model to predict the associations between lncRNA and diseases is not only beneficial to disease diagnosis, treatment, prognosis and potential drug targets in drug discovery, but also avoid the waste of human and material resources brought by biological experiments. In this paper, we proposed a novel prediction of lncRNA-disease associations based on complex and comprehensive molecular associations network (MAN), which integrated nine kinds of interactions among five molecules, including lncRNA, miRNA, disease, drug and protein. Network embedding Node2vec method was applied to extract behavior feature from MAN to generate a low-dimension vector containing nodes and edges information. After implementing 5-fold cross validation, the proposed method yielded good prediction performance with an average Accuracy of 91.91%, Sensitivity of 94.05%, Specificity of 89.76%, Precision of 90.21%, MCC value of 83.91%, AUC value of 0.9746 and AUPR of 0.9693. Comparative experiment indicates the behavior feature extracted by Node2vec is more representative than attribute features of lncRNA adopted 3-mer and diseases extracted by semantic similarity. Moreover, breast cancer, colon cancer and lung cancer are explored in case study. As a results, more than half of top 5 interactions are successfully confirmed for each disease by other datasets. Based on these reliable results, it is anticipated that proposed model is feasible and effective to predict lncRNA-disease associations at a global molecules level, which is a new respective for future biomedical researches.
KW - Disease
KW - LncRNAs
KW - Molecular Associations Network
KW - Network Representation Learning
KW - Node2vec
UR - http://www.scopus.com/inward/record.url?scp=85100344572&partnerID=8YFLogxK
U2 - 10.1109/BIBM49941.2020.9313139
DO - 10.1109/BIBM49941.2020.9313139
M3 - 会议稿件
AN - SCOPUS:85100344572
T3 - Proceedings - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020
SP - 1805
EP - 1812
BT - Proceedings - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020
A2 - Park, Taesung
A2 - Cho, Young-Rae
A2 - Hu, Xiaohua Tony
A2 - Yoo, Illhoi
A2 - Woo, Hyun Goo
A2 - Wang, Jianxin
A2 - Facelli, Julio
A2 - Nam, Seungyoon
A2 - Kang, Mingon
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020
Y2 - 16 December 2020 through 19 December 2020
ER -