Abstract
In the Post Genome Age, there is an urgent need to develop the reliable and effective computational methods to predict the subcellular localization for the explosion of newly found proteins. Here, a novel method of pseudo amino acid (PseAA) composition, the so-called "amino acid composition distribution" (AACD), is introduced. First, a protein sequence is divided equally into multiple segments. Then, amino acid composition of each segment is calculated in series. After that, each protein sequence can be represented by a feature vector. Finally, the feature vectors of all sequences thus obtained are further input into the multi-class support vector machines to predict the subcellular localization. The results show that AACD is quite effective in representing protein sequences for the purpose of predicting protein subcellular localization.
| Original language | English |
|---|---|
| Pages (from-to) | 321-327 |
| Number of pages | 7 |
| Journal | Amino Acids |
| Volume | 35 |
| Issue number | 2 |
| DOIs | |
| State | Published - Aug 2008 |
Keywords
- Amino acid composition distribution
- Protein subcellular localization
- Pseudo amino acid composition
- Support vector machines
Fingerprint
Dive into the research topics of 'Using pseudo amino acid composition to predict protein subcellular location: Approached with amino acid composition distribution'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver