TY - JOUR
T1 - Prediction of protein-protein interactions by label propagation with protein evolutionary and chemical information derived from heterogeneous network
AU - Wen, Yu Ting
AU - Lei, Hai Jun
AU - You, Zhu Hong
AU - Lei, Bai Ying
AU - Chen, Xing
AU - Li, Li Ping
N1 - Publisher Copyright:
© 2017 Elsevier Ltd
PY - 2017/10/7
Y1 - 2017/10/7
N2 - Prediction of protein-protein interactions (PPIs) is of great significance. To achieve this, we propose a novel computational method for PPIs prediction based on a similarity network fusion (SNF) model for integrating the physical and chemical properties of proteins. Specifically, the physical and chemical properties of protein are the protein amino acid mutation rate and its hydrophobicity, respectively. The amino acid mutation rate is extracted using a BLOSUM62 matrix, which puts the protein sequence into block substitution matrix. The SNF model is exploited to fuse protein physical and chemical features of multiple data by iteratively updating each original network. Finally, the complementary features from the fused network are fed into a label propagation algorithm (LPA) for PPIs prediction. The experimental results show that the proposed method achieves promising performance and outperforms the traditional methods for the public dataset of H. pylori, Human, and Yeast. In addition, our proposed method achieves average accuracy of 76.65%, 81.98%, 84.56%, 84.01% and 84.38% on E. coli, C. elegans, H. sapien, H. pylori and M. musculus datasets, respectively. Comparison results demonstrate that the proposed method is very promising and provides a cost-effective alternative for predicting PPIs. The source code and all datasets are available at http://pan.baidu.com/s/1dF7rp7N.
AB - Prediction of protein-protein interactions (PPIs) is of great significance. To achieve this, we propose a novel computational method for PPIs prediction based on a similarity network fusion (SNF) model for integrating the physical and chemical properties of proteins. Specifically, the physical and chemical properties of protein are the protein amino acid mutation rate and its hydrophobicity, respectively. The amino acid mutation rate is extracted using a BLOSUM62 matrix, which puts the protein sequence into block substitution matrix. The SNF model is exploited to fuse protein physical and chemical features of multiple data by iteratively updating each original network. Finally, the complementary features from the fused network are fed into a label propagation algorithm (LPA) for PPIs prediction. The experimental results show that the proposed method achieves promising performance and outperforms the traditional methods for the public dataset of H. pylori, Human, and Yeast. In addition, our proposed method achieves average accuracy of 76.65%, 81.98%, 84.56%, 84.01% and 84.38% on E. coli, C. elegans, H. sapien, H. pylori and M. musculus datasets, respectively. Comparison results demonstrate that the proposed method is very promising and provides a cost-effective alternative for predicting PPIs. The source code and all datasets are available at http://pan.baidu.com/s/1dF7rp7N.
KW - Label propagation algorithm
KW - Prediction
KW - Protein-protein interactions
KW - Similarity network fusion
UR - http://www.scopus.com/inward/record.url?scp=85021886854&partnerID=8YFLogxK
U2 - 10.1016/j.jtbi.2017.06.003
DO - 10.1016/j.jtbi.2017.06.003
M3 - 文章
C2 - 28625475
AN - SCOPUS:85021886854
SN - 0022-5193
VL - 430
SP - 9
EP - 20
JO - Journal of Theoretical Biology
JF - Journal of Theoretical Biology
ER -