摘要
The identification of cancer-related genes is important towards the understanding of complex genetic diseases. Although many machine learning algorithms are proposed to identify disease-related genes, they often either have poor performance to identify locus heterogeneity cancer-related genes or are not applicable to predict individual-disease-related genes due to the lack of positive instances (imbalanced classification). To overcome these two issues, a two-step logistic regression (LR) based algorithm is proposed in this study for identifying individual-cancer-related genes. A set of high potential cancer-class-related genes is first generated in step 1, followed by a second round of LR-based algorithm conducted on this smaller dataset for identifying individual-cancer-related genes. Numerical experiments show that the proposed two-step LR-based algorithm not only works well for locus heterogeneity data, but also has good performance to handle the imbalanced classification problem. The individual-cancer-related gene identification experiments achieve AUC values of around 0.85 when the threshold of posterior probability is chosen between 0.3 and 0.6. All evaluations are conducted by using the leave-one-out cross validation method.
| 源语言 | 英语 |
|---|---|
| 主期刊名 | Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015 |
| 编辑 | lng. Matthieu Schapranow, Jiayu Zhou, Xiaohua Tony Hu, Bin Ma, Sanguthevar Rajasekaran, Satoru Miyano, Illhoi Yoo, Brian Pierce, Amarda Shehu, Vijay K. Gombar, Brian Chen, Vinay Pai, Jun Huan |
| 出版商 | Institute of Electrical and Electronics Engineers Inc. |
| 页 | 195-200 |
| 页数 | 6 |
| ISBN(电子版) | 9781467367981 |
| DOI | |
| 出版状态 | 已出版 - 16 12月 2015 |
| 活动 | IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015 - Washington, 美国 期限: 9 11月 2015 → 12 11月 2015 |
出版系列
| 姓名 | Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015 |
|---|
会议
| 会议 | IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015 |
|---|---|
| 国家/地区 | 美国 |
| 市 | Washington |
| 时期 | 9/11/15 → 12/11/15 |
联合国可持续发展目标
此成果有助于实现下列可持续发展目标:
-
可持续发展目标 3 良好健康与福祉
指纹
探究 'A two-step logistic regression algorithm for identifying individual-cancer-related genes' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver