Towards the classification of cancer subtypes by using cascade deep forest model in gene expression data

Yang Guo, Shuhui Liu, Zhanhuai Li, Xuequn Shang

科研成果: 书/报告/会议事项章节会议稿件同行评审

33 引用 (Scopus)

摘要

The classification of cancer subtypes is of great importance in cancer disease diagnosis and therapy. Many supervised learning methods have been applied to classification of cancer subtypes in the past few years, especially of deep learning based methods. Recently, a deep forest model has been proposed as an alternative of deep neural networks to learn hyper-representations by using cascade ensemble decision trees, and it has been proved that deep forest model has competitive or even better performance than deep neural networks. However, the original deep forest may face under-fitting and ensemble diversity problems when dealing with small sample size, and high-dimension biology data. It is important to improve the deep forest model to work better on small-scale biology data. In this paper, we propose a deep learning model to follow the mission of cancer subtype classification on small-scale biology data sets, which can be viewed as modification of original deep forest model. Our model distinguishes from the original deep forest model with two main contributions: First, a named multi-class-scanning method is proposed to train multiple simple binary classifiers to encourage diversity of ensemble. Meanwhile, the fitting quality of each classifier is considered in representations learning. Second, we propose a boosting strategy to emphasize more important features in cascade forests of representations learning, thus to propagate the benefits of discriminative features among layers to improve the overall classification performance. Systematical experiments on both microarray and RNA-seq data sets demonstrate that our method consistently outperforms the most state-of-the-art classification methods in application of cancer subtype classifications.

源语言英语
主期刊名Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017
编辑Illhoi Yoo, Jane Huiru Zheng, Yang Gong, Xiaohua Tony Hu, Chi-Ren Shyu, Yana Bromberg, Jean Gao, Dmitry Korkin
出版商Institute of Electrical and Electronics Engineers Inc.
1664-1669
页数6
ISBN(电子版)9781509030491
DOI
出版状态已出版 - 15 12月 2017
活动2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017 - Kansas City, 美国
期限: 13 11月 201716 11月 2017

出版系列

姓名Proceedings - 2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017
2017-January

会议

会议2017 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2017
国家/地区美国
Kansas City
时期13/11/1716/11/17

指纹

探究 'Towards the classification of cancer subtypes by using cascade deep forest model in gene expression data' 的科研主题。它们共同构成独一无二的指纹。

引用此