TY - JOUR
T1 - Construction of reliable protein–protein interaction networks using weighted sparse representation based classifier with pseudo substitution matrix representation features
AU - Huang, Yu An
AU - You, Zhu Hong
AU - Li, Xiao
AU - Chen, Xing
AU - Hu, Pengwei
AU - Li, Shuai
AU - Luo, Xin
N1 - Publisher Copyright:
© 2016 Elsevier B.V.
PY - 2016/12/19
Y1 - 2016/12/19
N2 - Protein-protein interactions (PPIs) networks play an important role in most of biological processes. Although much effort has been devoted to using high-throughput biological technologies to identify PPIs of various kinds of organisms, the experimental methods are expensive, time-consuming, and tedious. Therefore, developing computational methods for predicting PPIs is of great significance in this post-genomic era. In recent years, the exponential increase of available protein sequence data leads to the urgent need for sequence-based prediction model. In this paper, we report a highly efficient method for constructing PPIs networks. The main improvements come from a novel protein sequence representation called pseudo-SMR, and from adopting weighted sparse representation based classifier (WSRC). When predicting the PPIs of Yeast, Human and H. pylori datasets, the 5-fold cross-validation accuracies performed by the proposed method achieve as high as 97.09%, 96.71% and 91.15% respectively, significantly better than previous methods. To further evaluate the performance of the proposed method, extensive experiments are performed to compare the proposed method with state-of-the-art Support Vector Machine (SVM) classifier. Promising results obtained show that the proposed method is feasible, robust and powerful.
AB - Protein-protein interactions (PPIs) networks play an important role in most of biological processes. Although much effort has been devoted to using high-throughput biological technologies to identify PPIs of various kinds of organisms, the experimental methods are expensive, time-consuming, and tedious. Therefore, developing computational methods for predicting PPIs is of great significance in this post-genomic era. In recent years, the exponential increase of available protein sequence data leads to the urgent need for sequence-based prediction model. In this paper, we report a highly efficient method for constructing PPIs networks. The main improvements come from a novel protein sequence representation called pseudo-SMR, and from adopting weighted sparse representation based classifier (WSRC). When predicting the PPIs of Yeast, Human and H. pylori datasets, the 5-fold cross-validation accuracies performed by the proposed method achieve as high as 97.09%, 96.71% and 91.15% respectively, significantly better than previous methods. To further evaluate the performance of the proposed method, extensive experiments are performed to compare the proposed method with state-of-the-art Support Vector Machine (SVM) classifier. Promising results obtained show that the proposed method is feasible, robust and powerful.
KW - Protein sequence
KW - Protein-protein interaction networks
KW - Substitution matrix representation
KW - Weighted sparse representation
UR - https://www.scopus.com/pages/publications/84994193771
U2 - 10.1016/j.neucom.2016.08.063
DO - 10.1016/j.neucom.2016.08.063
M3 - 文章
AN - SCOPUS:84994193771
SN - 0925-2312
VL - 218
SP - 131
EP - 138
JO - Neurocomputing
JF - Neurocomputing
ER -