Amino acid composition distribution: A novel sequence representation for prediction of protein subcellular localization

  • Jianyu Shi
  • , Shaowu Zhang
  • , Quan Pan
  • , Guo Ping Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

A novel representation of protein sequence, amino acid composition distribution (AACD), is introduced to perform prediction of subcellular localization in this paper. First, a protein sequence is divided equally into multiple segments. Then, amino acid composition of each segment is calculated in series. After that, each protein sequence can be represented a feature vector. Finally, feature vectors of all sequences are further input into multi-class support vector machines to predict the subcellular localization. The results show that AACD is more effective to represent protein sequence and is non-sensitive to sequence similarity because of the better ability to reflect the information of protein subcellular localization.

Original languageEnglish
Title of host publication2007 1st International Conference on Bioinformatics and Biomedical Engineering, ICBBE
Pages115-118
Number of pages4
DOIs
StatePublished - 2007
Event2007 1st International Conference on Bioinformatics and Biomedical Engineering, ICBBE - Wuhan, China
Duration: 6 Jul 20078 Jul 2007

Publication series

Name2007 1st International Conference on Bioinformatics and Biomedical Engineering, ICBBE

Conference

Conference2007 1st International Conference on Bioinformatics and Biomedical Engineering, ICBBE
Country/TerritoryChina
CityWuhan
Period6/07/078/07/07

Keywords

  • Amino acid composition distribution
  • Protein subcellular localization
  • Support vector machines

Fingerprint

Dive into the research topics of 'Amino acid composition distribution: A novel sequence representation for prediction of protein subcellular localization'. Together they form a unique fingerprint.

Cite this