TY - GEN
T1 - Using singular value decomposition and discrete Fourier transform to characterize protein structure and build fast fold recognition
AU - Shi, Jian Yu
AU - Zhang, Yan Ning
PY - 2009
Y1 - 2009
N2 - In order to extract compact and effective feature to characterize protein structure, this paper presents a feature extraction of protein fold by mapping into 2-D distance matrix which is regarded as gray level image and further analyzed by image processing techniques. Firstly, gray level co-occurrence matrix (CoM) of distance matrix image (DMI) is calculated and its singular values are taken as the first group of features. Next, DMI is transformed into frequency view by discrete Fourier transform (DFT). In succession, the magnitude of DFT coefficients is analyzed by histogram of which seven descriptors are taken as the second group of features. Last, the final feature vector is combined by the two groups of features and further standardized by calculating Z-scores before classification runs. The results compared with other methods show that the presented method can characterize effectively protein structure, and perform efficiently automatic classification of multiple types of folds with the benefit of low dimension, meaningful and compact feature, but also no need of complicated classifier system.
AB - In order to extract compact and effective feature to characterize protein structure, this paper presents a feature extraction of protein fold by mapping into 2-D distance matrix which is regarded as gray level image and further analyzed by image processing techniques. Firstly, gray level co-occurrence matrix (CoM) of distance matrix image (DMI) is calculated and its singular values are taken as the first group of features. Next, DMI is transformed into frequency view by discrete Fourier transform (DFT). In succession, the magnitude of DFT coefficients is analyzed by histogram of which seven descriptors are taken as the second group of features. Last, the final feature vector is combined by the two groups of features and further standardized by calculating Z-scores before classification runs. The results compared with other methods show that the presented method can characterize effectively protein structure, and perform efficiently automatic classification of multiple types of folds with the benefit of low dimension, meaningful and compact feature, but also no need of complicated classifier system.
KW - Fold recognition
KW - Gray level co-occurrence matrix
KW - Histogram
KW - Image processing
KW - Support vector machines
UR - http://www.scopus.com/inward/record.url?scp=72749113337&partnerID=8YFLogxK
U2 - 10.1109/ICBBE.2009.5163740
DO - 10.1109/ICBBE.2009.5163740
M3 - 会议稿件
AN - SCOPUS:72749113337
SN - 9781424429028
T3 - 3rd International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2009
BT - 3rd International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2009
T2 - 3rd International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2009
Y2 - 11 June 2009 through 13 June 2009
ER -