TY - GEN
T1 - Increasing reliability of protein interactome by combining heterogeneous data sources with weighted network topological metrics
AU - You, Zhu Hong
AU - Li, Liping
AU - Yu, Hongjie
AU - Chen, Sanfeng
AU - Wang, Shu Lin
PY - 2010
Y1 - 2010
N2 - Over the last decade, the development of high-throughput techniques has resulted in a rapid accumulation of protein-protein interaction (PPI) data. However, the high-throughput experimental interaction data is prone to exhibit high level of false-positive and false-negative rates. It is therefore highly desirable to develop an approach to deal with these issues from the computational perspective. Meanwhile, as a variety of genomic and proteomic datasets become available, they provide an opportunity to study the interactions between proteins indirectly. In this paper, we introduce a novel approach that employs the Logistic Regression to integrate heterogeneous types of high-throughput biological data into a weighted biological network. Then, a weighted topological metrics of the network is devised to indicate the interacting possibility of two proteins. We evaluate our method on the Gavin's yeast interaction dataset. The experimental results show that by incorporating heterogeneous data types with weighted network topological metrics, our method improved functional homogeneity and localization coherence compared with existing approaches.
AB - Over the last decade, the development of high-throughput techniques has resulted in a rapid accumulation of protein-protein interaction (PPI) data. However, the high-throughput experimental interaction data is prone to exhibit high level of false-positive and false-negative rates. It is therefore highly desirable to develop an approach to deal with these issues from the computational perspective. Meanwhile, as a variety of genomic and proteomic datasets become available, they provide an opportunity to study the interactions between proteins indirectly. In this paper, we introduce a novel approach that employs the Logistic Regression to integrate heterogeneous types of high-throughput biological data into a weighted biological network. Then, a weighted topological metrics of the network is devised to indicate the interacting possibility of two proteins. We evaluate our method on the Gavin's yeast interaction dataset. The experimental results show that by incorporating heterogeneous data types with weighted network topological metrics, our method improved functional homogeneity and localization coherence compared with existing approaches.
KW - CD-DIST
KW - False Positive
KW - FSWeight
KW - Multiple Data Sources
KW - Protein-protein Interaction
KW - Weighted Network Topology
UR - http://www.scopus.com/inward/record.url?scp=77958490588&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-14922-1_82
DO - 10.1007/978-3-642-14922-1_82
M3 - 会议稿件
AN - SCOPUS:77958490588
SN - 3642149219
SN - 9783642149214
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 657
EP - 663
BT - Advanced Intelligent Computing Theories and Applications - 6th International Conference on Intelligent Computing, ICIC 2010, Proceedings
T2 - 6th International Conference on Intelligent Computing, ICIC 2010
Y2 - 18 August 2010 through 21 August 2010
ER -