Prediction of protein subcellular localization with a Novel method: Sequence-segmented PseAAC

Shao Wu Zhang; Hui Fang Yang; Qi Peng Li; Yong Mei Cheng; Quan Pan

doi:10.1109/ICMLC.2008.4621106

Prediction of protein subcellular localization with a Novel method: Sequence-segmented PseAAC

Shao Wu Zhang, Hui Fang Yang, Qi Peng Li, Yong Mei Cheng, Quan Pan

自动化学院

Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

1 引用（Scopus）

摘要

Information of the subcellular localizations of proteins is important because it can provide useful insights about their functions, as well as how and in what kind of cellular environments they interact with each other and with other molecules. Facing the explosion of newly generated protein sequences in the post genomic era, we are challenged to develop an automated method tor fast and reliably annotating their subcellular localizations. To tackle the challenge, a novel method of the sequence-segmented pseudo amino acid composition (PseAAC) is introduced to represent protein samples. Based on the concept of Chou's PseAAC, a series of useful information and techniques, such as multi- scale energy and moment descriptors were utilized to generate the sequence-segmented pseudo amino acid components for representing the protein samples. Meanwhile, the multi-class SVM classifier modules were adopted for predicting 16 kinds of eukaryotic protein subcellular localizations. Compared with existing methods, this new approach provides better predictive performance. The success total accuracies were obtained in the jackknife test and independent dataset test, suggesting that the sequence-segmented PseAAC method is quite promising, and might also hold a great potential as a useful vehicle for the other areas of molecular biology.

源语言	英语
主期刊名	Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC
页	4024-4028
页数	5
DOI	https://doi.org/10.1109/ICMLC.2008.4621106
出版状态	已出版 - 2008
活动	7th International Conference on Machine Learning and Cybernetics, ICMLC - Kunming, 中国期限: 12 7月 2008 → 15 7月 2008

出版系列

姓名	Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC
卷	7

会议

会议	7th International Conference on Machine Learning and Cybernetics, ICMLC
国家/地区	中国
市	Kunming
时期	12/07/08 → 15/07/08

访问文件

10.1109/ICMLC.2008.4621106

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhang, S. W., Yang, H. F., Li, Q. P., Cheng, Y. M., & Pan, Q. (2008). Prediction of protein subcellular localization with a Novel method: Sequence-segmented PseAAC. 在 Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC (页码 4024-4028). 文章 4621106 (Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC; 卷 7). https://doi.org/10.1109/ICMLC.2008.4621106

@inproceedings{583d4fc1cb3544478ae358ed1649c756,

title = "Prediction of protein subcellular localization with a Novel method: Sequence-segmented PseAAC",

abstract = "Information of the subcellular localizations of proteins is important because it can provide useful insights about their functions, as well as how and in what kind of cellular environments they interact with each other and with other molecules. Facing the explosion of newly generated protein sequences in the post genomic era, we are challenged to develop an automated method tor fast and reliably annotating their subcellular localizations. To tackle the challenge, a novel method of the sequence-segmented pseudo amino acid composition (PseAAC) is introduced to represent protein samples. Based on the concept of Chou's PseAAC, a series of useful information and techniques, such as multi- scale energy and moment descriptors were utilized to generate the sequence-segmented pseudo amino acid components for representing the protein samples. Meanwhile, the multi-class SVM classifier modules were adopted for predicting 16 kinds of eukaryotic protein subcellular localizations. Compared with existing methods, this new approach provides better predictive performance. The success total accuracies were obtained in the jackknife test and independent dataset test, suggesting that the sequence-segmented PseAAC method is quite promising, and might also hold a great potential as a useful vehicle for the other areas of molecular biology.",

keywords = "Moment descriptor, Multi-scale energy, Sequence-segmented PseAAC, Support vector machine",

author = "Zhang, {Shao Wu} and Yang, {Hui Fang} and Li, {Qi Peng} and Cheng, {Yong Mei} and Quan Pan",

year = "2008",

doi = "10.1109/ICMLC.2008.4621106",

language = "英语",

isbn = "9781424420964",

series = "Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC",

pages = "4024--4028",

booktitle = "Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC",

note = "7th International Conference on Machine Learning and Cybernetics, ICMLC ; Conference date: 12-07-2008 Through 15-07-2008",

}

Zhang, SW, Yang, HF, Li, QP, Cheng, YM & Pan, Q 2008, Prediction of protein subcellular localization with a Novel method: Sequence-segmented PseAAC. 在 Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC., 4621106, Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC, 卷 7, 页码 4024-4028, 7th International Conference on Machine Learning and Cybernetics, ICMLC, Kunming, 中国, 12/07/08. https://doi.org/10.1109/ICMLC.2008.4621106

Prediction of protein subcellular localization with a Novel method: Sequence-segmented PseAAC. / Zhang, Shao Wu; Yang, Hui Fang; Li, Qi Peng 等.
Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC. 2008. 页码 4024-4028 4621106 (Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC; 卷 7).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Prediction of protein subcellular localization with a Novel method

T2 - 7th International Conference on Machine Learning and Cybernetics, ICMLC

AU - Zhang, Shao Wu

AU - Yang, Hui Fang

AU - Li, Qi Peng

AU - Cheng, Yong Mei

AU - Pan, Quan

PY - 2008

Y1 - 2008

N2 - Information of the subcellular localizations of proteins is important because it can provide useful insights about their functions, as well as how and in what kind of cellular environments they interact with each other and with other molecules. Facing the explosion of newly generated protein sequences in the post genomic era, we are challenged to develop an automated method tor fast and reliably annotating their subcellular localizations. To tackle the challenge, a novel method of the sequence-segmented pseudo amino acid composition (PseAAC) is introduced to represent protein samples. Based on the concept of Chou's PseAAC, a series of useful information and techniques, such as multi- scale energy and moment descriptors were utilized to generate the sequence-segmented pseudo amino acid components for representing the protein samples. Meanwhile, the multi-class SVM classifier modules were adopted for predicting 16 kinds of eukaryotic protein subcellular localizations. Compared with existing methods, this new approach provides better predictive performance. The success total accuracies were obtained in the jackknife test and independent dataset test, suggesting that the sequence-segmented PseAAC method is quite promising, and might also hold a great potential as a useful vehicle for the other areas of molecular biology.

AB - Information of the subcellular localizations of proteins is important because it can provide useful insights about their functions, as well as how and in what kind of cellular environments they interact with each other and with other molecules. Facing the explosion of newly generated protein sequences in the post genomic era, we are challenged to develop an automated method tor fast and reliably annotating their subcellular localizations. To tackle the challenge, a novel method of the sequence-segmented pseudo amino acid composition (PseAAC) is introduced to represent protein samples. Based on the concept of Chou's PseAAC, a series of useful information and techniques, such as multi- scale energy and moment descriptors were utilized to generate the sequence-segmented pseudo amino acid components for representing the protein samples. Meanwhile, the multi-class SVM classifier modules were adopted for predicting 16 kinds of eukaryotic protein subcellular localizations. Compared with existing methods, this new approach provides better predictive performance. The success total accuracies were obtained in the jackknife test and independent dataset test, suggesting that the sequence-segmented PseAAC method is quite promising, and might also hold a great potential as a useful vehicle for the other areas of molecular biology.

KW - Moment descriptor

KW - Multi-scale energy

KW - Sequence-segmented PseAAC

KW - Support vector machine

UR - http://www.scopus.com/inward/record.url?scp=57749084708&partnerID=8YFLogxK

U2 - 10.1109/ICMLC.2008.4621106

DO - 10.1109/ICMLC.2008.4621106

M3 - 会议稿件

AN - SCOPUS:57749084708

SN - 9781424420964

T3 - Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC

SP - 4024

EP - 4028

BT - Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC

Y2 - 12 July 2008 through 15 July 2008

ER -

Zhang SW, Yang HF, Li QP, Cheng YM , Pan Q. Prediction of protein subcellular localization with a Novel method: Sequence-segmented PseAAC. 在 Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC. 2008. 页码 4024-4028. 4621106. (Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC). doi: 10.1109/ICMLC.2008.4621106

Prediction of protein subcellular localization with a Novel method: Sequence-segmented PseAAC

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此