RPI-SE: A stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information

Hai Cheng Yi, Zhu Hong You, Mei Neng Wang, Zhen Hao Guo, Yan Bin Wang, Ji Ren Zhou

Research output: Contribution to journalArticlepeer-review

38 Scopus citations

Abstract

Background: The interactions between non-coding RNAs (ncRNA) and proteins play an essential role in many biological processes. Several high-throughput experimental methods have been applied to detect ncRNA-protein interactions. However, these methods are time-consuming and expensive. Accurate and efficient computational methods can assist and accelerate the study of ncRNA-protein interactions. Results: In this work, we develop a stacking ensemble computational framework, RPI-SE, for effectively predicting ncRNA-protein interactions. More specifically, to fully exploit protein and RNA sequence feature, Position Weight Matrix combined with Legendre Moments is applied to obtain protein evolutionary information. Meanwhile, k-mer sparse matrix is employed to extract efficient feature of ncRNA sequences. Finally, an ensemble learning framework integrated different types of base classifier is developed to predict ncRNA-protein interactions using these discriminative features. The accuracy and robustness of RPI-SE was evaluated on three benchmark data sets under five-fold cross-validation and compared with other state-of-the-art methods. Conclusions: The results demonstrate that RPI-SE is competent for ncRNA-protein interactions prediction task with high accuracy and robustness. It's anticipated that this work can provide a computational prediction tool to advance ncRNA-protein interactions related biomedical research.

Original languageEnglish
Article number60
JournalBMC Bioinformatics
Volume21
Issue number1
DOIs
StatePublished - 18 Feb 2020
Externally publishedYes

Keywords

  • Ensemble learning
  • Legendre moments
  • ncRNA
  • Position weight matrix
  • RNA-protein interaction
  • Sequence analysis

Fingerprint

Dive into the research topics of 'RPI-SE: A stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information'. Together they form a unique fingerprint.

Cite this