TY - JOUR
T1 - Uncovering prostate cancer candidate disease modules with dual constraints based on node-module confidence and local modularity
AU - Wang, Yi Bin
AU - Cheng, Yong Mei
AU - Zhang, Shao Wu
PY - 2015
Y1 - 2015
N2 - Researches on the etiology and pathogenesis of prostate cancer are helpful for disease diagnosis and treatment. However, current biochemical experimental methods for prostate cancer are both costly and time-consuming, as well as networks based methods for this disease analysis limited by the nature of gene expression profiles for its incomplete, high noise and small sample size. Therefore, we proposed a dual constraint algorithm based on the confidence of one vertices belonging to the community and local modularity, named as NMCOM, to mine the candidate disease modules of prostate cancer in the present work. The NMCOMalgorithmis gene expression independent method. It first integrated the concordance scores between the candidate genes and the causative phenotypes, as well as the semantic similarity scores between the candidate genes and the causative genes for prioritizing the candidate genes together, and then the starting node is selected with a sorting strategy. Finally, the candidate modules of prostate cancer are mined with dual constraint produces constructing on the confidence between node and module, as well as local modularity. 18 significant candidate disease gene modules were detected for the enrichment analysis of the obtained modules. Compared with the single scoring sorting methods and random walk with restart, the NMCOM fusion prioritizing strategy achieved a smaller MRR (Mean Rank Ratio) but bigger AUC value. The results are significantly better than other modules-based mining algorithms, and the biological explanations for these mined modules are more significant. More importantly, the NMCOMalgorithmcan be easily extended to mine any other diseases candidate modules.
AB - Researches on the etiology and pathogenesis of prostate cancer are helpful for disease diagnosis and treatment. However, current biochemical experimental methods for prostate cancer are both costly and time-consuming, as well as networks based methods for this disease analysis limited by the nature of gene expression profiles for its incomplete, high noise and small sample size. Therefore, we proposed a dual constraint algorithm based on the confidence of one vertices belonging to the community and local modularity, named as NMCOM, to mine the candidate disease modules of prostate cancer in the present work. The NMCOMalgorithmis gene expression independent method. It first integrated the concordance scores between the candidate genes and the causative phenotypes, as well as the semantic similarity scores between the candidate genes and the causative genes for prioritizing the candidate genes together, and then the starting node is selected with a sorting strategy. Finally, the candidate modules of prostate cancer are mined with dual constraint produces constructing on the confidence between node and module, as well as local modularity. 18 significant candidate disease gene modules were detected for the enrichment analysis of the obtained modules. Compared with the single scoring sorting methods and random walk with restart, the NMCOM fusion prioritizing strategy achieved a smaller MRR (Mean Rank Ratio) but bigger AUC value. The results are significantly better than other modules-based mining algorithms, and the biological explanations for these mined modules are more significant. More importantly, the NMCOMalgorithmcan be easily extended to mine any other diseases candidate modules.
KW - Candidate gene prioritization
KW - Disease module mining
KW - Local modularity
KW - Node-module confidence
KW - Prostate cancer
UR - http://www.scopus.com/inward/record.url?scp=84928748310&partnerID=8YFLogxK
U2 - 10.16476/j.pibb.2014.0091
DO - 10.16476/j.pibb.2014.0091
M3 - 文章
AN - SCOPUS:84928748310
SN - 1000-3282
VL - 42
SP - 375
EP - 389
JO - Progress in Biochemistry and Biophysics
JF - Progress in Biochemistry and Biophysics
IS - 4
ER -