
A Knowledge Graph Enhanced Pre-Trained Large Language Model for Predicting MicroRNA-circRNA Interactions

  • Northwestern Polytechnical University Xian
  • Hunan University

Research output: Contribution to journal › Article › peer-review

Abstract

Interactions between circular RNAs (circRNAs) and microRNAs are one of the key mechanisms determining the functions of non-coding RNAs (ncRNAs) in biological processes such as DNA methylation and RNA-induced silencing. Studying these interactions can deepen our understanding of the roles these RNAs play in developing cancer vaccines and designing treatments. We therefore propose a knowledge graph enhanced pre-trained Large Language Model (LLM) for predicting circRNA-microRNA interactions. Our approach employs graph contrastive learning to represent, from multiple views, a knowledge graph consisting of circRNA and microRNA entities. The features of these entities are derived by fine-tuning a sequential LLM separately on each of the two types of ncRNAs. Finally, the embeddings are fed into a classifier for prediction. We evaluate the model on an independent test set and compare it with recently reported models on two datasets. Our model achieves an improvement of approximately 3% in Area Under the Receiver Operating Characteristic Curve (AUROC), reaching 93.77% and 93.07% on the two datasets, respectively. We test the stability of the model by performing 10-fold cross-validation on the remaining training set, where it shows the best stability. In an ablation study, we comprehensively compare strategies for sequence processing and the effectiveness of each independent module. Finally, on a case study dataset derived from real-world scenarios, the model assigns scores to all candidates and ranks them accordingly. Among the 10 highest-scoring results, 7 have been validated by wet-lab experiments, highlighting the model's strong generalization capability.
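
The two evaluation steps named in the abstract, AUROC on a labeled test set and ranking candidates by score, can be illustrated with a minimal sketch. This is not the authors' code: the function names `auroc` and `top_k` and all toy values are assumptions made for illustration, and the AUROC here uses the standard pairwise (Mann-Whitney) formulation rather than whatever implementation the paper used.

```python
from typing import Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the probability that a random positive outscores a random negative.

    Ties count as half a win (the Mann-Whitney U formulation).
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUROC needs at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def top_k(candidates: Sequence[str], scores: Sequence[float], k: int = 10) -> list[str]:
    """Return the k candidate names with the highest scores, best first."""
    ranked = sorted(zip(candidates, scores), key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in ranked[:k]]


# Toy data, invented for illustration only (not taken from the paper).
toy_labels = [1, 0, 1, 0, 1]
toy_scores = [0.9, 0.8, 0.7, 0.3, 0.6]
toy_pairs = ["pair_a", "pair_b", "pair_c", "pair_d", "pair_e"]

toy_auroc = auroc(toy_labels, toy_scores)     # fraction of positive/negative pairs ranked correctly
toy_top3 = top_k(toy_pairs, toy_scores, k=3)  # highest-scoring candidates first
```

In a case study like the one described, the top-ranked candidates would then be checked against experimentally validated interactions; the pairwise loop is quadratic and only suitable for small illustrative inputs.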

Original language: English
Pages (from-to): 1405-1417
Number of pages: 13
Journal: Big Data Mining and Analytics
Volume: 8
Issue number: 6
DOI
Publication status: Published - Dec 2025

UN Sustainable Development Goals

This output contributes to the following UN Sustainable Development Goals (SDGs):

  1. SDG 3 - Good Health and Well-being
