跳到主要导航 跳到搜索 跳到主要内容

A two-step logistic regression algorithm for identifying individual-cancer-related genes

  • Bolin Chen
  • , Xuequn Shang
  • , Min Li
  • , Jianxin Wang
  • , Fang Xiang Wu

科研成果: 书/报告/会议事项章节会议稿件同行评审

10 引用 (Scopus)

摘要

The identification of cancer-related genes is important towards the understanding of complex genetic diseases. Although many machine learning algorithms are proposed to identify disease-related genes, they often either have poor performance to identify locus heterogeneity cancer-related genes or are not applicable to predict individual-disease-related genes due to the lack of positive instances (imbalanced classification). To overcome these two issues, a two-step logistic regression (LR) based algorithm is proposed in this study for identifying individual-cancer-related genes. A set of high potential cancer-class-related genes is first generated in step 1, followed by a second round of LR-based algorithm conducted on this smaller dataset for identifying individual-cancer-related genes. Numerical experiments show that the proposed two-step LR-based algorithm not only works well for locus heterogeneity data, but also has good performance to handle the imbalanced classification problem. The individual-cancer-related gene identification experiments achieve AUC values of around 0.85 when the threshold of posterior probability is chosen between 0.3 and 0.6. All evaluations are conducted by using the leave-one-out cross validation method.

源语言英语
主期刊名Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015
编辑lng. Matthieu Schapranow, Jiayu Zhou, Xiaohua Tony Hu, Bin Ma, Sanguthevar Rajasekaran, Satoru Miyano, Illhoi Yoo, Brian Pierce, Amarda Shehu, Vijay K. Gombar, Brian Chen, Vinay Pai, Jun Huan
出版商Institute of Electrical and Electronics Engineers Inc.
195-200
页数6
ISBN(电子版)9781467367981
DOI
出版状态已出版 - 16 12月 2015
活动IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015 - Washington, 美国
期限: 9 11月 201512 11月 2015

出版系列

姓名Proceedings - 2015 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015

会议

会议IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2015
国家/地区美国
Washington
时期9/11/1512/11/15

联合国可持续发展目标

此成果有助于实现下列可持续发展目标:

  1. 可持续发展目标 3 - 良好健康与福祉
    可持续发展目标 3 良好健康与福祉

指纹

探究 'A two-step logistic regression algorithm for identifying individual-cancer-related genes' 的科研主题。它们共同构成独一无二的指纹。

引用此