TY - GEN
T1 - A novel hybrid method of gene selection and its application on tumor classification
AU - You, Zhuhong
AU - Wang, Shulin
AU - Gui, Jie
AU - Zhang, Shanwen
PY - 2008
Y1 - 2008
N2 - Microarray gene expression profile data is used to accurately predict different tumor types, which has great value in providing better treatment and toxicity minimization on the patients. However, it is difficult to classify different tumor types using microarray data because the number of samples is much smaller than the number of genes. It has been proved that a small feature gene subset can improve classification accuracy, so feature gene selection and extraction algorithm is very important in tumor classification. In this paper, a novel hybrid gene selection method is proposed to find a feature gene subset so that the feature genes related to certain cancer can be kept and the redundant genes can be leave out. In the proposed method, we combine the advantages of the PCA and the LDA and proposed a novel feature gene extraction scheme. We also compared several kinds of parametric and non-parametric feature gene selection methods. We use the SVM as the classifier in the experiment and compare the performance of three common SVM kernels. Their differences are analyzed. Using the n-fold cross validation, the proposed algorithm is carried out on three published benchmark tumor datasets and experimental results show that this algorithm leads to better classification performance than other methods.
AB - Microarray gene expression profile data is used to accurately predict different tumor types, which has great value in providing better treatment and toxicity minimization on the patients. However, it is difficult to classify different tumor types using microarray data because the number of samples is much smaller than the number of genes. It has been proved that a small feature gene subset can improve classification accuracy, so feature gene selection and extraction algorithm is very important in tumor classification. In this paper, a novel hybrid gene selection method is proposed to find a feature gene subset so that the feature genes related to certain cancer can be kept and the redundant genes can be leave out. In the proposed method, we combine the advantages of the PCA and the LDA and proposed a novel feature gene extraction scheme. We also compared several kinds of parametric and non-parametric feature gene selection methods. We use the SVM as the classifier in the experiment and compare the performance of three common SVM kernels. Their differences are analyzed. Using the n-fold cross validation, the proposed algorithm is carried out on three published benchmark tumor datasets and experimental results show that this algorithm leads to better classification performance than other methods.
KW - Feature Gene Selection
KW - K-NN
KW - LDA
KW - PCA
KW - SVM
KW - Tumor Classification
UR - http://www.scopus.com/inward/record.url?scp=53149115284&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-85984-0_127
DO - 10.1007/978-3-540-85984-0_127
M3 - 会议稿件
AN - SCOPUS:53149115284
SN - 3540859837
SN - 9783540859833
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 1055
EP - 1068
BT - Advanced Intelligent Computing Theories and Applications
T2 - 4th International Conference on Intelligent Computing, ICIC 2008
Y2 - 15 September 2008 through 18 September 2008
ER -