Using pseudo amino acid composition to predict protein subcellular location: Approached with amino acid composition distribution

J. Y. Shi, S. W. Zhang, Q. Pan, G. P. Zhou

Research output: Contribution to journalArticlepeer-review

59 Scopus citations

Abstract

In the Post Genome Age, there is an urgent need to develop the reliable and effective computational methods to predict the subcellular localization for the explosion of newly found proteins. Here, a novel method of pseudo amino acid (PseAA) composition, the so-called "amino acid composition distribution" (AACD), is introduced. First, a protein sequence is divided equally into multiple segments. Then, amino acid composition of each segment is calculated in series. After that, each protein sequence can be represented by a feature vector. Finally, the feature vectors of all sequences thus obtained are further input into the multi-class support vector machines to predict the subcellular localization. The results show that AACD is quite effective in representing protein sequences for the purpose of predicting protein subcellular localization.

Original languageEnglish
Pages (from-to)321-327
Number of pages7
JournalAmino Acids
Volume35
Issue number2
DOIs
StatePublished - Aug 2008

Keywords

  • Amino acid composition distribution
  • Protein subcellular localization
  • Pseudo amino acid composition
  • Support vector machines

Fingerprint

Dive into the research topics of 'Using pseudo amino acid composition to predict protein subcellular location: Approached with amino acid composition distribution'. Together they form a unique fingerprint.

Cite this