TY - GEN
T1 - Hypergraph-based Gene Ontology Embedding for Disease Gene Prediction
AU - Wang, Tao
AU - Xu, Hengbo
AU - Zhang, Ranye
AU - Xiao, Yifu
AU - Peng, Jiajie
AU - Shang, Xuequn
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Disease gene identification has provided valuable insights into illuminating the molecular mechanisms underlying complex diseases. And it has been shown that novel drugs with genetically supported targets were more likely to be successful in clinical trials. In recent years, multiple graph machine learning-based methods for this purpose have been proposed. However, those methods were mainly based on various well-established biological molecular networks, while seldomly considering the curated biological annotations of genes. To fill this gap, we aim to integrate the gene ontology annotations (GOA), including the biological process (BP), the cellular component (CC), and the molecular function (MF), into the process of disease gene prediction. Our method treated the GOA as a hypergraph and used the hypergraph-based embedding technique to extract the deep features underlying gene annotations. Besides, we also extracted gene features from the protein-protein interaction (PPI) network using graph representation learning methods. The convolutional neural network (CNN) framework was followed to fuse the features extracted from the two networks and make the final prediction. Experiments on a range of diseases have demonstrated the accuracy and robustness of our method. The average area under the receiver operating characteristic curve (AUROC) and area under the precision-recall curve (AUPRC) reached 0.85 and 0.79, respectively. Besides, the hypergraph-based gene ontology embedding can be generalized to other bioinformatics applications.
AB - Disease gene identification has provided valuable insights into illuminating the molecular mechanisms underlying complex diseases. And it has been shown that novel drugs with genetically supported targets were more likely to be successful in clinical trials. In recent years, multiple graph machine learning-based methods for this purpose have been proposed. However, those methods were mainly based on various well-established biological molecular networks, while seldomly considering the curated biological annotations of genes. To fill this gap, we aim to integrate the gene ontology annotations (GOA), including the biological process (BP), the cellular component (CC), and the molecular function (MF), into the process of disease gene prediction. Our method treated the GOA as a hypergraph and used the hypergraph-based embedding technique to extract the deep features underlying gene annotations. Besides, we also extracted gene features from the protein-protein interaction (PPI) network using graph representation learning methods. The convolutional neural network (CNN) framework was followed to fuse the features extracted from the two networks and make the final prediction. Experiments on a range of diseases have demonstrated the accuracy and robustness of our method. The average area under the receiver operating characteristic curve (AUROC) and area under the precision-recall curve (AUPRC) reached 0.85 and 0.79, respectively. Besides, the hypergraph-based gene ontology embedding can be generalized to other bioinformatics applications.
KW - Disease gene prediction
KW - Gene ontology
KW - Hypergraph
KW - Network modeling
KW - Network representation learning
UR - http://www.scopus.com/inward/record.url?scp=85146650254&partnerID=8YFLogxK
U2 - 10.1109/BIBM55620.2022.9995140
DO - 10.1109/BIBM55620.2022.9995140
M3 - 会议稿件
AN - SCOPUS:85146650254
T3 - Proceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
SP - 2424
EP - 2430
BT - Proceedings - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
A2 - Adjeroh, Donald
A2 - Long, Qi
A2 - Shi, Xinghua
A2 - Guo, Fei
A2 - Hu, Xiaohua
A2 - Aluru, Srinivas
A2 - Narasimhan, Giri
A2 - Wang, Jianxin
A2 - Kang, Mingon
A2 - Mondal, Ananda M.
A2 - Liu, Jin
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2022
Y2 - 6 December 2022 through 8 December 2022
ER -