A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences

Zhuhong You; Zhong Ming; Ben Niu; Suping Deng; Zexuan Zhu

doi:10.1007/978-3-642-39479-9_73

A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences

Zhuhong You, Zhong Ming, Ben Niu, Suping Deng, Zexuan Zhu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

12 Scopus citations

Abstract

Protein-protein interactions (PPIs) are crucial for almost all cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades. However, the experimental methods for identifying PPIs are both time-consuming and expensive. Therefore, it is important to develop computational approaches for predicting PPIs. In this article, a sequence-based method is developed by combining a novel feature representation using binary coding and Support Vector Machine (SVM). The binary-coding-based descriptors account for the interactions between residues a certain distance apart in the protein sequence, thus this method adequately takes the neighboring effect into account and mine interaction information from the continuous and discontinuous amino acids segments at the same time. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 86.93% prediction accuracy with 86.99% sensitivity at the precision of 86.90%. Extensive experiments are performed to compare our method with the existing sequence-based method. Achieved results show that the proposed approach is very promising for predicting PPI, so it can be a useful supplementary tool for future proteomics studies.

Original language	English
Title of host publication	Intelligent Computing Theories - 9th International Conference, ICIC 2013, Proceedings
Pages	629-637
Number of pages	9
DOIs	https://doi.org/10.1007/978-3-642-39479-9_73
State	Published - 2013
Externally published	Yes
Event	9th International Conference on Intelligent Computing, ICIC 2013 - Nanning, China Duration: 28 Jul 2013 → 31 Jul 2013

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	7995 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	9th International Conference on Intelligent Computing, ICIC 2013
Country/Territory	China
City	Nanning
Period	28/07/13 → 31/07/13

Keywords

binary coding
local descriptor
protein sequence
protein-protein interaction
support vector machine

Access to Document

10.1007/978-3-642-39479-9_73

Cite this

You, Z., Ming, Z., Niu, B., Deng, S., & Zhu, Z. (2013). A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences. In Intelligent Computing Theories - 9th International Conference, ICIC 2013, Proceedings (pp. 629-637). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7995 LNCS). https://doi.org/10.1007/978-3-642-39479-9_73

You, Zhuhong ; Ming, Zhong ; Niu, Ben et al. / A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences. Intelligent Computing Theories - 9th International Conference, ICIC 2013, Proceedings. 2013. pp. 629-637 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{520929fb5fc742be8cbc65e75acfad97,

title = "A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences",

abstract = "Protein-protein interactions (PPIs) are crucial for almost all cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades. However, the experimental methods for identifying PPIs are both time-consuming and expensive. Therefore, it is important to develop computational approaches for predicting PPIs. In this article, a sequence-based method is developed by combining a novel feature representation using binary coding and Support Vector Machine (SVM). The binary-coding-based descriptors account for the interactions between residues a certain distance apart in the protein sequence, thus this method adequately takes the neighboring effect into account and mine interaction information from the continuous and discontinuous amino acids segments at the same time. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 86.93% prediction accuracy with 86.99% sensitivity at the precision of 86.90%. Extensive experiments are performed to compare our method with the existing sequence-based method. Achieved results show that the proposed approach is very promising for predicting PPI, so it can be a useful supplementary tool for future proteomics studies.",

keywords = "binary coding, local descriptor, protein sequence, protein-protein interaction, support vector machine",

author = "Zhuhong You and Zhong Ming and Ben Niu and Suping Deng and Zexuan Zhu",

year = "2013",

doi = "10.1007/978-3-642-39479-9_73",

language = "英语",

isbn = "9783642394782",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "629--637",

booktitle = "Intelligent Computing Theories - 9th International Conference, ICIC 2013, Proceedings",

note = "9th International Conference on Intelligent Computing, ICIC 2013 ; Conference date: 28-07-2013 Through 31-07-2013",

}

You, Z, Ming, Z, Niu, B, Deng, S & Zhu, Z 2013, A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences. in Intelligent Computing Theories - 9th International Conference, ICIC 2013, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 7995 LNCS, pp. 629-637, 9th International Conference on Intelligent Computing, ICIC 2013, Nanning, China, 28/07/13. https://doi.org/10.1007/978-3-642-39479-9_73

A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences. / You, Zhuhong; Ming, Zhong; Niu, Ben et al.
Intelligent Computing Theories - 9th International Conference, ICIC 2013, Proceedings. 2013. p. 629-637 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7995 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences

AU - You, Zhuhong

AU - Ming, Zhong

AU - Niu, Ben

AU - Deng, Suping

AU - Zhu, Zexuan

PY - 2013

Y1 - 2013

N2 - Protein-protein interactions (PPIs) are crucial for almost all cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades. However, the experimental methods for identifying PPIs are both time-consuming and expensive. Therefore, it is important to develop computational approaches for predicting PPIs. In this article, a sequence-based method is developed by combining a novel feature representation using binary coding and Support Vector Machine (SVM). The binary-coding-based descriptors account for the interactions between residues a certain distance apart in the protein sequence, thus this method adequately takes the neighboring effect into account and mine interaction information from the continuous and discontinuous amino acids segments at the same time. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 86.93% prediction accuracy with 86.99% sensitivity at the precision of 86.90%. Extensive experiments are performed to compare our method with the existing sequence-based method. Achieved results show that the proposed approach is very promising for predicting PPI, so it can be a useful supplementary tool for future proteomics studies.

AB - Protein-protein interactions (PPIs) are crucial for almost all cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades. However, the experimental methods for identifying PPIs are both time-consuming and expensive. Therefore, it is important to develop computational approaches for predicting PPIs. In this article, a sequence-based method is developed by combining a novel feature representation using binary coding and Support Vector Machine (SVM). The binary-coding-based descriptors account for the interactions between residues a certain distance apart in the protein sequence, thus this method adequately takes the neighboring effect into account and mine interaction information from the continuous and discontinuous amino acids segments at the same time. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 86.93% prediction accuracy with 86.99% sensitivity at the precision of 86.90%. Extensive experiments are performed to compare our method with the existing sequence-based method. Achieved results show that the proposed approach is very promising for predicting PPI, so it can be a useful supplementary tool for future proteomics studies.

KW - binary coding

KW - local descriptor

KW - protein sequence

KW - protein-protein interaction

KW - support vector machine

UR - http://www.scopus.com/inward/record.url?scp=84882799517&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-39479-9_73

DO - 10.1007/978-3-642-39479-9_73

M3 - 会议稿件

AN - SCOPUS:84882799517

SN - 9783642394782

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 629

EP - 637

BT - Intelligent Computing Theories - 9th International Conference, ICIC 2013, Proceedings

T2 - 9th International Conference on Intelligent Computing, ICIC 2013

Y2 - 28 July 2013 through 31 July 2013

ER -

You Z, Ming Z, Niu B, Deng S, Zhu Z. A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences. In Intelligent Computing Theories - 9th International Conference, ICIC 2013, Proceedings. 2013. p. 629-637. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-642-39479-9_73

A SVM-based system for predicting protein-protein interactions using a novel representation of protein sequences

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this