A MapReduce based parallel SVM for large-scale predicting protein-protein interactions

Zhu Hong You, Jian Zhong Yu, Lin Zhu, Shuai Li, Zhen Kun Wen

科研成果: 期刊稿件文章同行评审

102 引用 (Scopus)

摘要

Protein-protein interactions (PPIs) are crucial to most biochemical processes, including metabolic cycles, DNA transcription and replication, and signaling cascades. Although large amount of protein-protein interaction data for different species has been generated by high-throughput experimental techniques, the number is still limited compared to the total number of possible PPIs. Furthermore, the experimental methods for identifying PPIs are both time-consuming and expensive. Therefore, it is urgent and challenging to develop automated computational methods to efficiently and accurately predict PPIs. In this article, we propose a novel MapReduce-based parallel SVM model for large-scale predicting protein-protein interactions only using the information of protein sequences. First, the local sequential features represented by autocorrelation descriptor are extracted from protein sequences. Then the MapReduce framework is employed to train support vector machine (SVM) classifiers in a distributed way, obtaining significant improvement in training time while maintaining a high level of accuracy. The experimental results demonstrate that the proposed parallel algorithms not only can tackle large-scale PPIs dataset, but also perform well in terms of the evaluation metrics of speedup and accuracy. Consequently, the proposed approach can be considered as a new promising and powerful tools for large-scale predicting PPI with excellent performance and less time.

源语言英语
页(从-至)37-43
页数7
期刊Neurocomputing
145
DOI
出版状态已出版 - 5 12月 2014
已对外发布

指纹

探究 'A MapReduce based parallel SVM for large-scale predicting protein-protein interactions' 的科研主题。它们共同构成独一无二的指纹。

引用此