Predicting protein-protein interactions from multimodal biological data sources via nonnegative matrix tri-factorization

Research output: Contribution to journalArticlepeer-review

56 Scopus citations

Abstract

Protein interactions are central to all the biological processes and structural scaffolds in living organisms, because they orchestrate a number of cellular processes such as metabolic pathways and immunological recognition. Several high-throughput methods, for example, yeast two-hybrid system and mass spectrometry method, can help determine protein interactions, which, however, suffer from high false-positive rates. Moreover, many protein interactions predicted by one method are not supported by another. Therefore, computational methods are necessary and crucial to complete the interactome expeditiously. In this work, we formulate the problem of predicting protein interactions from a new mathematical perspective-sparse matrix completion, and propose a novel nonnegative matrix factorization (NMF)-based matrix completion approach to predict new protein interactions from existing protein interaction networks. Through using manifold regularization, we further develop our method to integrate different biological data sources, such as protein sequences, gene expressions, protein structure information, etc. Extensive experimental results on four species, Saccharomyces cerevisiae, Drosophila melanogaster, Homo sapiens, and Caenorhabditis elegans, have shown that our new methods outperform related state-of-the-art protein interaction prediction methods.

Original languageEnglish
Pages (from-to)344-358
Number of pages15
JournalJournal of Computational Biology
Volume20
Issue number4
DOIs
StatePublished - 1 Apr 2013
Externally publishedYes

Keywords

  • Algorithms
  • Biochemical networks
  • Gene clusters
  • Gene expression
  • Learning
  • Machine learning
  • Mass spectroscopy

Fingerprint

Dive into the research topics of 'Predicting protein-protein interactions from multimodal biological data sources via nonnegative matrix tri-factorization'. Together they form a unique fingerprint.

Cite this