Biolinguistic graph fusion model for circRNA–miRNA association prediction

Lu Xiang Guo, Lei Wang, Zhu Hong You, Chang Qing Yu, Meng Lei Hu, Bo Wei Zhao, Yang Li

Research output: Contribution to journalArticlepeer-review

11 Scopus citations

Abstract

Emerging clinical evidence suggests that sophisticated associations with circular ribonucleic acids (RNAs) (circRNAs) and microRNAs (miRNAs) are a critical regulatory factor of various pathological processes and play a critical role in most intricate human diseases. Nonetheless, the above correlations via wet experiments are error-prone and labor-intensive, and the underlying novel circRNA–miRNA association (CMA) has been validated by numerous existing computational methods that rely only on single correlation data. Considering the inadequacy of existing machine learning models, we propose a new model named BGF-CMAP, which combines the gradient boosting decision tree with natural language processing and graph embedding methods to infer associations between circRNAs and miRNAs. Specifically, BGF-CMAP extracts sequence attribute features and interaction behavior features by Word2vec and two homogeneous graph embedding algorithms, large-scale information network embedding and graph factorization, respectively. Multitudinous comprehensive experimental analysis revealed that BGF-CMAP successfully predicted the complex relationship between circRNAs and miRNAs with an accuracy of 82.90% and an area under receiver operating characteristic of 0.9075. Furthermore, 23 of the top 30 miRNA-associated circRNAs of the studies on data were confirmed in relevant experiences, showing that the BGF-CMAP model is superior to others. BGF-CMAP can serve as a helpful model to provide a scientific theoretical basis for the study of CMA prediction.

Original languageEnglish
Article numberbbae058
JournalBriefings in Bioinformatics
Volume25
Issue number2
DOIs
StatePublished - 1 Mar 2024

Keywords

  • CircRNA
  • circRNA–miRNA association
  • gradient boosting decision tree
  • graph factorization
  • large-scale information network embedding
  • MiRNA

Fingerprint

Dive into the research topics of 'Biolinguistic graph fusion model for circRNA–miRNA association prediction'. Together they form a unique fingerprint.

Cite this