Effective moment feature vectors for protein domain structures

Jian Yu Shi, Siu Ming Yiu, Yan Ning Zhang, Francis Yuk Lun Chin

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

Image processing techniques have been shown to be useful in studying protein domain structures. The idea is to represent the pairwise distances between all residues of the structure in a 2D distance matrix (DM). Features and/or submatrices are then extracted from this DM to represent a domain. Existing approaches, however, may involve a large number of features (100-400) or complicated mathematical operations, so finding fewer but more effective features is desirable. In this paper, based on some key observations on DMs, we decompose a DM image into four basic binary images, each representing the structural characteristics of a fundamental secondary structure element (SSE) or a motif in the domain. Using the concept of moments in image processing, we further derive 45 structural features from the four binary images. Together with 4 features extracted from the basic images, these give 49 features that represent the structure of a domain. We show that our feature vectors represent domain structures effectively in three respects. (1) They yield a higher accuracy for domain classification. (2) They give a clear and consistent distribution of domains in our proposed structural vector space. (3) They allow us to cluster the domains according to our moment features and to demonstrate a relationship between structural variation and functional diversity.
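To make the pipeline in the abstract concrete, the following is a minimal sketch of its first steps: building a distance matrix from residue coordinates, thresholding it into a binary image, and computing image moments of that image. It is not the authors' method. The function names, the toy coordinates, and the 3.0-5.0 Å distance band are all assumptions chosen for illustration, and the paper's decomposition into four binary images and its 45 moment features are not reproduced here.

```python
import math

def distance_matrix(coords):
    """Pairwise Euclidean distances between residue coordinates (e.g. C-alpha atoms)."""
    return [[math.dist(a, b) for b in coords] for a in coords]

def binarize(dm, lo, hi):
    """Binary image: 1 where lo <= distance < hi, else 0.
    The band [lo, hi) is an illustrative assumption, not a value from the paper."""
    return [[1 if lo <= d < hi else 0 for d in row] for row in dm]

def raw_moment(img, p, q):
    """Raw image moment M_pq = sum over pixels of i^p * j^q * img[i][j]."""
    return sum((i ** p) * (j ** q) * v
               for i, row in enumerate(img)
               for j, v in enumerate(row))

def central_moment(img, p, q):
    """Central moment mu_pq, taken about the centroid of the binary image."""
    m00 = raw_moment(img, 0, 0)
    if m00 == 0:
        return 0.0  # empty image: no centroid, so define the moment as zero
    ci = raw_moment(img, 1, 0) / m00
    cj = raw_moment(img, 0, 1) / m00
    return sum(((i - ci) ** p) * ((j - cj) ** q) * v
               for i, row in enumerate(img)
               for j, v in enumerate(row))

# Toy input (hypothetical): four residues on a straight line, 3.8 Angstrom apart.
coords = [(3.8 * k, 0.0, 0.0) for k in range(4)]
dm = distance_matrix(coords)
img = binarize(dm, 3.0, 5.0)       # keeps only sequence-adjacent residue pairs
m00 = raw_moment(img, 0, 0)        # number of "on" pixels
mu11 = central_moment(img, 1, 1)   # covariance-like spread along the diagonal
```

In this toy case only the two off-diagonal bands adjacent to the main diagonal are set. A feature vector in the spirit of the abstract would concatenate several such moments across several binary images, but which moments and which images to use is specified in the paper, not here.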

Original language: English
Article number: e83788
Journal: PLoS ONE
Volume: 8
Issue number: 12
DOI
Publication status: Published - 31 Dec 2013
