Protein-Protein Interaction Prediction by Integrating Sequence Information and Heterogeneous Network Representation

Xiao Rui Su, Zhu Hong You, Zhan Heng Chen, Hai Cheng Yi, Zhen Hao Guo

科研成果: 书/报告/会议事项章节会议稿件同行评审

3 引用 (Scopus)

摘要

Protein-protein interaction (PPI) plays an important role in regulating cells and signals. PPI deregulation will lead to many diseases, including pernicious anemia or cancer. Despite the ongoing efforts of the bioassay group, continued data incompleteness limits our ability to understand the molecular roots of human disease. Therefore, it is urgent to develop a computational method that accurately and quickly detects PPIs. In this paper, a highly efficient model is proposed for predicting PPIs through heterogeneous network by combining local feature with global feature. Heterogeneous network is collected from several valuable datasets, containing five types of nodes and nine interactions among them. Local feature is extracted from protein sequence by k-mer method. Global feature is extracted from heterogeneous network by LINE (Large-scale Information Network Embedding). Protein representation is obtained from local feature and global feature by concatenation. Finally, random forest is trained to classify and predict potential protein pairs. The proposed method is demonstrated on STRING dataset and achieved an average 86.55% prediction accuracy with 0.9308 AUC. Extensive contrast experiments are performed with different protein representations and different classifiers. Obtained experiment results illustrate that proposed method is economically viable, which provides a new perspective for future research.

源语言英语
主期刊名Intelligent Computing Theories and Application - 17th International Conference, ICIC 2021, Proceedings
编辑De-Shuang Huang, Kang-Hyun Jo, Jianqiang Li, Valeriya Gribova, Vitoantonio Bevilacqua
出版商Springer Science and Business Media Deutschland GmbH
617-626
页数10
ISBN(印刷版)9783030845315
DOI
出版状态已出版 - 2021
已对外发布
活动17th International Conference on Intelligent Computing, ICIC 2021 - Shenzhen, 中国
期限: 12 8月 202115 8月 2021

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
12838 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议17th International Conference on Intelligent Computing, ICIC 2021
国家/地区中国
Shenzhen
时期12/08/2115/08/21

指纹

探究 'Protein-Protein Interaction Prediction by Integrating Sequence Information and Heterogeneous Network Representation' 的科研主题。它们共同构成独一无二的指纹。

引用此