Identifying protein complexes in protein-protein interaction networks by using clique seeds and graph entropy

Bolin Chen; Jinhong Shi; Shenggui Zhang; Fang Xiang Wu

doi:10.1002/pmic.201200336

School of Mathematics and Statistics

University of Saskatchewan

Research output: Contribution to journal › Article › Peer-reviewed

34 citations (Scopus)

Abstract

The identification of protein complexes plays a key role in understanding major cellular processes and biological functions. Various computational algorithms have been proposed to identify protein complexes from protein-protein interaction (PPI) networks. In this paper, we first introduce a new seed-selection strategy for seed-growth style algorithms. Cliques rather than individual vertices are employed as initial seeds. After that, a result-modification approach is proposed based on this seed-selection strategy. Predictions generated by higher order clique seeds are employed to modify results that are generated by lower order ones. The performance of this seed-selection strategy and the result-modification approach are tested by using the entropy-based algorithm, which is currently the best seed-growth style algorithm to detect protein complexes from PPI networks. In addition, we investigate four pairs of strategies for this algorithm in order to improve its accuracy. The numerical experiments are conducted on a Saccharomyces cerevisiae PPI network. The group of best predictions consists of 1711 clusters, with the average f-score at 0.68 after removing all similar and redundant clusters. We conclude that higher order clique seeds can generate predictions with higher accuracy and that our improved entropy-based algorithm outputs more reasonable predictions than the original one.
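The abstract does not define graph entropy or the growth procedure, so the following is only a minimal sketch under stated assumptions. It assumes the commonly used formulation in which a vertex's entropy is the binary entropy of the fraction of its neighbours that lie inside the cluster, and the graph entropy is the sum over all vertices. The greedy growth loop is a simplification, and the paper's result-modification step is not shown. All function names (`is_clique`, `vertex_entropy`, `graph_entropy`, `grow_from_clique`) and the toy network are hypothetical.

```python
import math
from itertools import combinations


def is_clique(graph, vertices):
    """Return True if every pair of the given vertices is adjacent."""
    return all(v in graph[u] for u, v in combinations(vertices, 2))


def vertex_entropy(graph, cluster, v):
    """Binary entropy of the fraction of v's neighbours inside the cluster (assumed definition)."""
    neighbours = graph[v]
    if not neighbours:
        return 0.0
    p_in = len(neighbours & cluster) / len(neighbours)
    p_out = 1.0 - p_in
    # Terms with probability 0 contribute nothing to the entropy.
    return -sum(p * math.log2(p) for p in (p_in, p_out) if p > 0)


def graph_entropy(graph, cluster):
    """Sum of vertex entropies over the whole graph for a given cluster."""
    return sum(vertex_entropy(graph, cluster, v) for v in graph)


def grow_from_clique(graph, clique):
    """Greedily add boundary vertices to a clique seed while the graph entropy decreases."""
    cluster = set(clique)
    best = graph_entropy(graph, cluster)
    improved = True
    while improved:
        improved = False
        boundary = set().union(*(graph[v] for v in cluster)) - cluster
        for v in sorted(boundary):
            candidate = graph_entropy(graph, cluster | {v})
            if candidate < best:
                cluster.add(v)
                best = candidate
                improved = True
    return cluster, best


# Toy network (made up for illustration): two triangles joined by the edge c-d.
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d"),
         ("d", "e"), ("d", "f"), ("e", "f")]
toy_graph = {}
for u, v in edges:
    toy_graph.setdefault(u, set()).add(v)
    toy_graph.setdefault(v, set()).add(u)

seed = ("a", "b", "c")
cluster, entropy = grow_from_clique(toy_graph, seed)
print(sorted(cluster), round(entropy, 3))
```

In this toy case the triangle seed is not extended, because adding `d` would raise the entropy of `e` and `f` by more than it lowers the entropy of `c`. Starting from a clique rather than a single vertex gives the growth step a seed that is already densely connected, which is the intuition the abstract describes.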

Original language	English
Pages (from-to)	269-277
Number of pages	9
Journal	Proteomics
Volume	13
Issue	2
DOI	https://doi.org/10.1002/pmic.201200336
Publication status	Published - Jan 2013

Cite this

@article{d3eee9e606ca40709a2b7531cb6cd2e0,

title = "Identifying protein complexes in protein-protein interaction networks by using clique seeds and graph entropy",

abstract = "The identification of protein complexes plays a key role in understanding major cellular processes and biological functions. Various computational algorithms have been proposed to identify protein complexes from protein-protein interaction (PPI) networks. In this paper, we first introduce a new seed-selection strategy for seed-growth style algorithms. Cliques rather than individual vertices are employed as initial seeds. After that, a result-modification approach is proposed based on this seed-selection strategy. Predictions generated by higher order clique seeds are employed to modify results that are generated by lower order ones. The performance of this seed-selection strategy and the result-modification approach are tested by using the entropy-based algorithm, which is currently the best seed-growth style algorithm to detect protein complexes from PPI networks. In addition, we investigate four pairs of strategies for this algorithm in order to improve its accuracy. The numerical experiments are conducted on a Saccharomyces cerevisiae PPI network. The group of best predictions consists of 1711 clusters, with the average f-score at 0.68 after removing all similar and redundant clusters. We conclude that higher order clique seeds can generate predictions with higher accuracy and that our improved entropy-based algorithm outputs more reasonable predictions than the original one.",

keywords = "Bioinformatics, Clique seed, Graph entropy, Protein complex, Protein-protein interaction",

author = "Bolin Chen and Jinhong Shi and Shenggui Zhang and Wu, {Fang Xiang}",

year = "2013",

month = jan,

doi = "10.1002/pmic.201200336",

language = "English",

volume = "13",

pages = "269--277",

journal = "Proteomics",

issn = "1615-9853",

publisher = "Wiley-VCH Verlag",

number = "2",

}

TY - JOUR

T1 - Identifying protein complexes in protein-protein interaction networks by using clique seeds and graph entropy

AU - Chen, Bolin

AU - Shi, Jinhong

AU - Zhang, Shenggui

AU - Wu, Fang Xiang

PY - 2013/1

Y1 - 2013/1

N2 - The identification of protein complexes plays a key role in understanding major cellular processes and biological functions. Various computational algorithms have been proposed to identify protein complexes from protein-protein interaction (PPI) networks. In this paper, we first introduce a new seed-selection strategy for seed-growth style algorithms. Cliques rather than individual vertices are employed as initial seeds. After that, a result-modification approach is proposed based on this seed-selection strategy. Predictions generated by higher order clique seeds are employed to modify results that are generated by lower order ones. The performance of this seed-selection strategy and the result-modification approach are tested by using the entropy-based algorithm, which is currently the best seed-growth style algorithm to detect protein complexes from PPI networks. In addition, we investigate four pairs of strategies for this algorithm in order to improve its accuracy. The numerical experiments are conducted on a Saccharomyces cerevisiae PPI network. The group of best predictions consists of 1711 clusters, with the average f-score at 0.68 after removing all similar and redundant clusters. We conclude that higher order clique seeds can generate predictions with higher accuracy and that our improved entropy-based algorithm outputs more reasonable predictions than the original one.

AB - The identification of protein complexes plays a key role in understanding major cellular processes and biological functions. Various computational algorithms have been proposed to identify protein complexes from protein-protein interaction (PPI) networks. In this paper, we first introduce a new seed-selection strategy for seed-growth style algorithms. Cliques rather than individual vertices are employed as initial seeds. After that, a result-modification approach is proposed based on this seed-selection strategy. Predictions generated by higher order clique seeds are employed to modify results that are generated by lower order ones. The performance of this seed-selection strategy and the result-modification approach are tested by using the entropy-based algorithm, which is currently the best seed-growth style algorithm to detect protein complexes from PPI networks. In addition, we investigate four pairs of strategies for this algorithm in order to improve its accuracy. The numerical experiments are conducted on a Saccharomyces cerevisiae PPI network. The group of best predictions consists of 1711 clusters, with the average f-score at 0.68 after removing all similar and redundant clusters. We conclude that higher order clique seeds can generate predictions with higher accuracy and that our improved entropy-based algorithm outputs more reasonable predictions than the original one.

KW - Bioinformatics

KW - Clique seed

KW - Graph entropy

KW - Protein complex

KW - Protein-protein interaction

UR - http://www.scopus.com/inward/record.url?scp=84872678695&partnerID=8YFLogxK

U2 - 10.1002/pmic.201200336

DO - 10.1002/pmic.201200336

M3 - Article

C2 - 23112006

AN - SCOPUS:84872678695

SN - 1615-9853

VL - 13

SP - 269

EP - 277

JO - Proteomics

JF - Proteomics

IS - 2

ER -
