An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information

Zhu Hong You, Wen Zhun Huang, Shanwen Zhang, Yu An Huang, Chang Qing Yu, Li Ping Li

Research output: Contribution to journalArticlepeer-review

26 Scopus citations

Abstract

Protein-protein interactions (PPIs) perform a very important function in a number of cellular processes, including signal transduction, post-translational modifications, apoptosis, and cell growth. Deregulation of PPIs will lead to many diseases, including pernicious anemia or cancers. Although a large number of high-throughput techniques are designed to generate PPIs data, they are generally expensive, inefficient, and labor-intensive. Hence, there is an urgent need for developing a computational method to accurately and rapidly detect PPIs. In this article, we proposed a highly efficient method to detect PPIs by integrating a new protein sequence sub-stitution matrix feature representation and ensemble weighted sparse representation model classifier. The proposed method is demonstrated on Saccharomyces cerevisiae dataset and achieved 99.26 percent prediction accuracy with 98.53 percent sensitivity at precision of 100 percent, which is shown to have much higher predictive accuracy than the state-of-the-art methods. Extensive contrast experiments are performed with the benchmark data set from Human and Helicobacter pylori that our proposed method can achieve outstanding better success rates than other existing approaches in this problem. Experiment results illustrate that our proposed method presents an economical approach for computational building of PPI networks, which can be a helpful supplementary method for future proteomics researches.

Original languageEnglish
Article number8540898
Pages (from-to)809-817
Number of pages9
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume16
Issue number3
DOIs
StatePublished - 1 May 2019
Externally publishedYes

Keywords

  • Ensemble learning
  • evolutionary information
  • protein sequence
  • protein-protein interactions

Fingerprint

Dive into the research topics of 'An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information'. Together they form a unique fingerprint.

Cite this