A new hybrid approach to predict subcellular localization by incorporating protein evolutionary conservation information

Shao Wu Zhang, Yun Long Zhang, Jun Hui Li, Hui Feng Yang, Yong Mei Cheng, Guo Ping Zhou

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

The rapidly increasing number of sequences entering genome databanks has created the need for fully automated methods to analyze them. Knowing the cellular location of a protein is a key step towards understanding its function. The development of statistical predictors of protein attributes generally involves two core tasks: constructing a training dataset and formulating a predictive algorithm. The latter can be further divided into two subtasks: finding a mathematical expression that effectively represents a protein, and finding a powerful algorithm that performs the prediction accurately. Here, an improved evolutionary conservation algorithm was proposed to calculate a per-residue conservation score. Each protein can then be represented as a feature vector built from multi-scale energy (MSE). In addition, the protein can be represented by other feature vectors based on amino acid composition (AAC), the weighted auto-correlation function, and moment descriptor methods. Finally, a novel hybrid approach was developed by fusing the four kinds of feature classifiers through a product rule system to predict 12 subcellular locations. Compared with existing methods, this new approach provides better predictive performance. High success rates were obtained in both the jackknife cross-validation test and the independent dataset test, suggesting that introducing protein evolutionary information and fusing multi-feature classifiers are quite promising, and may also hold great potential as a useful tool in other areas of molecular biology.
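As an illustration of two ideas named in the abstract, the sketch below computes an amino acid composition (AAC) vector and fuses the outputs of several classifiers with a product rule. It is not the authors' implementation: the function names `aac_vector` and `product_rule_fusion`, the use of per-class probabilities as classifier outputs, and the three-class example values are all assumptions made for illustration, since the abstract does not specify the base classifiers or the exact form of the product rule.

```python
import numpy as np

# The 20 standard amino acids, in a fixed (alphabetical one-letter) order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aac_vector(sequence):
    """Return the 20-dimensional amino acid composition of a sequence.

    Each entry is the fraction of standard residues of that type;
    non-standard characters are ignored.
    """
    counts = np.array([sequence.count(a) for a in AMINO_ACIDS], dtype=float)
    total = counts.sum()
    return counts / total if total > 0 else counts


def product_rule_fusion(prob_list):
    """Fuse per-classifier class probabilities with the product rule.

    prob_list: one sequence of class probabilities per classifier,
    all of the same length (hypothetical classifier outputs).
    Returns (index of the winning class, normalized fused scores).
    """
    probs = np.asarray(prob_list, dtype=float)
    eps = 1e-12  # keeps log() finite if a classifier outputs exactly 0
    # Summing logs equals multiplying probabilities, but is numerically safer.
    log_scores = np.log(probs + eps).sum(axis=0)
    log_scores -= log_scores.max()
    fused = np.exp(log_scores)
    fused /= fused.sum()
    return int(np.argmax(fused)), fused


# Hypothetical outputs of four feature-specific classifiers over three classes
# (the paper itself predicts 12 subcellular locations).
example_outputs = [
    [0.6, 0.3, 0.1],
    [0.5, 0.4, 0.1],
    [0.2, 0.7, 0.1],
    [0.7, 0.2, 0.1],
]
predicted_class, fused_scores = product_rule_fusion(example_outputs)
```

With the product rule, a class must receive reasonable support from every classifier to win, so a single near-zero probability acts almost as a veto; this is why the sketch adds a small `eps` before taking logarithms.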

Original language: English
Title of host publication: Life System Modeling and Simulation - International Conference, LSMS 2007, Proceedings
Publisher: Springer Verlag
Pages: 172-179
Number of pages: 8
ISBN (Print): 9783540747703
DOI
Publication status: Published - 2007
Event: 2007 International Conference on Life System Modeling and Simulation, LSMS 2007 - Shanghai, China
Duration: 14 Sep 2007 to 17 Sep 2007

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 4689 LNBI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 2007 International Conference on Life System Modeling and Simulation, LSMS 2007
Country/Territory: China
City: Shanghai
Period: 14/09/07 to 17/09/07
