TY - GEN
T1 - GCNSP
T2 - 16th International Conference on Intelligent Computing, ICIC 2020
AU - Wang, Lei
AU - You, Zhu Hong
AU - Yan, Xin
AU - Zheng, Kai
AU - Li, Zheng Wei
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - As an essential protein interaction, self-interacting proteins (SIPs) plays a vital role in biological processes. Identifying and confirming SIPs is of great significance for the exploration of new gene functions, protein function research and proteomics research. Although a large number of SIPs have been confirmed with the rapid development of high-throughput technology, the biological experimental method is still limited by blindness and high cost, and has a high false-positive rate. Therefore, the use of computational techniques to accurately and efficiently predict SIPs has become an urgent need. In this study, a novel SIPs prediction method GCNSP based on Graph Convolutional Networks (GCN) is proposed. Firstly, the evolution information of protein is described by Position-Specific Scoring Matrix (PSSM). Then the feature information is extracted by GCN, and finally fed into Random Forest (RF) classifier for accurate classification. In the five-fold cross-validation on Human and Yeast data sets, GCNSP achieved 93.65% and 90.69% prediction accuracy with 99.64% and 99.08% specificity, respectively. In comparison with different classifier models and other existing methods, GCNSP shows strong competitiveness. The excellent results show that the proposed method is very suitable for SIPs prediction and can provide highly reliable candidates for biological experiments.
AB - As an essential protein interaction, self-interacting proteins (SIPs) plays a vital role in biological processes. Identifying and confirming SIPs is of great significance for the exploration of new gene functions, protein function research and proteomics research. Although a large number of SIPs have been confirmed with the rapid development of high-throughput technology, the biological experimental method is still limited by blindness and high cost, and has a high false-positive rate. Therefore, the use of computational techniques to accurately and efficiently predict SIPs has become an urgent need. In this study, a novel SIPs prediction method GCNSP based on Graph Convolutional Networks (GCN) is proposed. Firstly, the evolution information of protein is described by Position-Specific Scoring Matrix (PSSM). Then the feature information is extracted by GCN, and finally fed into Random Forest (RF) classifier for accurate classification. In the five-fold cross-validation on Human and Yeast data sets, GCNSP achieved 93.65% and 90.69% prediction accuracy with 99.64% and 99.08% specificity, respectively. In comparison with different classifier models and other existing methods, GCNSP shows strong competitiveness. The excellent results show that the proposed method is very suitable for SIPs prediction and can provide highly reliable candidates for biological experiments.
KW - Graph Convolutional Networks
KW - Position-specific scoring matrix
KW - Protein-protein interactions
KW - Random forest
KW - Self-interacting protein
UR - http://www.scopus.com/inward/record.url?scp=85094133941&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-60802-6_11
DO - 10.1007/978-3-030-60802-6_11
M3 - 会议稿件
AN - SCOPUS:85094133941
SN - 9783030608019
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 109
EP - 120
BT - Intelligent Computing Theories and Application - 16th International Conference, ICIC 2020, Proceedings
A2 - Huang, De-Shuang
A2 - Jo, Kang-Hyun
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 2 October 2020 through 5 October 2020
ER -