Accurate prediction of ncRNA-protein interactions from the integration of sequence and evolutionary information

Zhao Hui Zhan, Zhu Hong You, Li Ping Li, Yong Zhou, Hai Cheng Yi

Research output: Contribution to journalArticlepeer-review

30 Scopus citations

Abstract

Non-coding RNA (ncRNA) plays a crucial role in numerous biological processes including gene expression and post-transcriptional gene regulation. The biological function of ncRNA is mostly realized by binding with related proteins. Therefore, an accurate understanding of interactions between ncRNA and protein has a significant impact on current biological research. The major challenge at this stage is the waste of a great deal of redundant time and resource consumed on classification in traditional interaction pattern prediction methods. Fortunately, an efficient classifier named LightGBM can solve this difficulty of long time consumption. In this study, we employed LightGBM as the integrated classifier and proposed a novel computational model for predicting ncRNA and protein interactions. More specifically, the pseudo-Zernike Moments and singular value decomposition algorithm are employed to extract the discriminative features from protein and ncRNA sequences. On four widely used datasets RPI369, RPI488, RPI1807, and RPI2241, we evaluated the performance of LGBM and obtained an superior performance with AUC of 0.799, 0.914, 0.989, and 0.762, respectively. The experimental results of 10-fold cross-validation shown that the proposed method performs much better than existing methods in predicting ncRNA-protein interaction patterns, which could be used as a useful tool in proteomics research.

Original languageEnglish
Article number458
JournalFrontiers in Genetics
Volume9
Issue numberOCT
DOIs
StatePublished - 8 Oct 2018
Externally publishedYes

Keywords

  • K-mers
  • LightGBM
  • NcRNA-protein Interactions
  • Pseudo-Zernike moments
  • PSSM

Fingerprint

Dive into the research topics of 'Accurate prediction of ncRNA-protein interactions from the integration of sequence and evolutionary information'. Together they form a unique fingerprint.

Cite this