TY - JOUR
T1 - Using the concept of Chou's pseudo amino acid composition to predict protein subcellular localization
T2 - An approach by incorporating evolutionary information and von Neumann entropies
AU - Zhang, Shao Wu
AU - Zhang, Yun Long
AU - Yang, Hui Fang
AU - Zhao, Chun Hui
AU - Pan, Quan
PY - 2008/5
Y1 - 2008/5
N2 - The rapidly increasing number of sequence entering into the genome databank has called for the need for developing automated methods to analyze them. Information on the subcellular localization of new found protein sequences is important for helping to reveal their functions in time and conducting the study of system biology at the cellular level. Based on the concept of Chou's pseudo-amino acid composition, a series of useful information and techniques, such as residue conservation scores, von Neumann entropies, multi-scale energy, and weighted auto-correlation function were utilized to generate the pseudo-amino acid components for representing the protein samples. Based on such an infrastructure, a hybridization predictor was developed for identifying uncharacterized proteins among the following 12 subcellular localizations: chloroplast, cytoplasm, cytoskeleton, endoplasmic reticulum, extracell, Golgi apparatus, lysosome, mitochondria, nucleus, peroxisome, plasma membrane, and vacuole. Compared with the results reported by the previous investigators, higher success rates were obtained, suggesting that the current approach is quite promising, and may become a useful high-throughput tool in the relevant areas.
AB - The rapidly increasing number of sequence entering into the genome databank has called for the need for developing automated methods to analyze them. Information on the subcellular localization of new found protein sequences is important for helping to reveal their functions in time and conducting the study of system biology at the cellular level. Based on the concept of Chou's pseudo-amino acid composition, a series of useful information and techniques, such as residue conservation scores, von Neumann entropies, multi-scale energy, and weighted auto-correlation function were utilized to generate the pseudo-amino acid components for representing the protein samples. Based on such an infrastructure, a hybridization predictor was developed for identifying uncharacterized proteins among the following 12 subcellular localizations: chloroplast, cytoplasm, cytoskeleton, endoplasmic reticulum, extracell, Golgi apparatus, lysosome, mitochondria, nucleus, peroxisome, plasma membrane, and vacuole. Compared with the results reported by the previous investigators, higher success rates were obtained, suggesting that the current approach is quite promising, and may become a useful high-throughput tool in the relevant areas.
KW - Chou's pseudo-amino acid composition
KW - Multi-scale energy
KW - Residue evolutionary conservation
KW - Von Neumann entropies
KW - Weighted auto-correlation function
UR - http://www.scopus.com/inward/record.url?scp=43149104120&partnerID=8YFLogxK
U2 - 10.1007/s00726-007-0010-9
DO - 10.1007/s00726-007-0010-9
M3 - 文章
C2 - 18074191
AN - SCOPUS:43149104120
SN - 0939-4451
VL - 34
SP - 565
EP - 572
JO - Amino Acids
JF - Amino Acids
IS - 4
ER -