Predicting protein-protein interactions from multimodal biological data sources via nonnegative matrix tri-factorization

Hua Wang, Heng Huang, Chris Ding, Feiping Nie

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Due to the high false positive rate in the high-throughput experimental methods to discover protein interactions, computational methods are necessary and crucial to complete the interactome expeditiously. However, when building classification models to identify putative protein interactions, compared to the obvious choice of positive samples from truly interacting protein pairs, it is usually very hard to select negative samples, because non-interacting protein pairs refer to those currently without experimental or computational evidence to support a physical interaction or a functional association, which, though, could interact in reality. To tackle this difficulty, instead of using heuristics as in many existing works, in this paper we solve it in a principled way by formulating the protein interaction prediction problem from a new mathematical perspective of view - sparse matrix completion, and propose a novel Nonnegative Matrix Tri-Factorization (NMTF) based matrix completion approach to predict new protein interactions from existing protein interaction networks. Because matrix completion only requires positive samples but not use negative samples, the challenge in existing classification based methods for protein interaction prediction is circumvented. Through using manifold regularization, we further develop our method to integrate different biological data sources, such as protein sequences, gene expressions, protein structure information, etc. Extensive experimental results on Saccharomyces cerevisiae genome show that our new methods outperform related state-of-the-art protein interaction prediction methods.

Original languageEnglish
Title of host publicationResearch in Computational Molecular Biology - 16th Annual International Conference, RECOMB 2012, Proceedings
Pages314-325
Number of pages12
DOIs
StatePublished - 2012
Externally publishedYes
Event16th Annual International Conference on Research in Computational Molecular Biology, RECOMB 2012 - Barcelona, Spain
Duration: 21 Apr 201224 Apr 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7262 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th Annual International Conference on Research in Computational Molecular Biology, RECOMB 2012
Country/TerritorySpain
CityBarcelona
Period21/04/1224/04/12

Keywords

  • Multimodal Biological Data
  • Nonnegative Matrix Factorization
  • Protein-Protein Interaction

Fingerprint

Dive into the research topics of 'Predicting protein-protein interactions from multimodal biological data sources via nonnegative matrix tri-factorization'. Together they form a unique fingerprint.

Cite this