
NSECDA: Natural Semantic Enhancement for CircRNA-Disease Association Prediction

  • Lei Wang
  • Leon Wong
  • Zhu Hong You
  • De Shuang Huang
  • Xiao Rui Su
  • Bo Wei Zhao
  • Guangxi Academy of Agricultural Sciences
  • Zaozhuang University
  • Northwestern Polytechnical University Xian
  • Xinjiang Technical Institute of Physics and Chemistry

Research output: Contribution to journal, Article, peer-reviewed

29 citations (Scopus)

Abstract

Increasing evidence suggests that circRNA, one of the most promising emerging biomarkers, is closely related to disease. Exploring the relationship between circRNAs and diseases can provide a novel perspective on disease diagnosis and pathogenesis. Existing circRNA-disease association (CDA) prediction models, however, generally treat all data attributes equally: they pay no special attention to the attributes with more significant influence, and they do not make full use of the correlation and symbiosis between attributes to mine the latent semantic information of the data. To address these problems, this paper proposes NSECDA, a natural semantic enhancement method for predicting CDA. Specifically, we first treat the circRNA sequence as a biological language and analyze its natural semantic properties using natural language understanding theory; we then integrate these properties with disease attributes and with the Gaussian Interaction Profile (GIP) kernel attributes of circRNAs and diseases, and use a Graph Attention Network (GAT) to focus on the influential attributes and mine deeply hidden features; finally, a Rotation Forest (RoF) classifier is used to accurately determine CDA. On the gold standard data set CircR2Disease, NSECDA achieved 92.49% accuracy with an AUC score of 0.9225. In comparison with the non-natural semantic enhancement model and other classifier models, NSECDA also shows competitive performance. Additionally, of the 30 CDA pairs with unknown associations that received the highest NSECDA prediction scores, 25 have been confirmed by newly reported studies. These results suggest that NSECDA is an effective model for predicting CDA that can provide credible candidates for subsequent wet-lab experiments, thus significantly reducing the scope of investigation.
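
The abstract names Gaussian Interaction Profile (GIP) kernel attributes but does not give a formula. As a minimal sketch, assuming the commonly used definition K(i, j) = exp(-γ‖IP(i) - IP(j)‖²) with γ set to γ' divided by the mean squared norm of the interaction profiles, the similarity could be computed as follows. The function name `gip_kernel`, the parameter `gamma_prime`, and the toy association matrix are illustrative assumptions, not taken from the paper.

```python
import math


def gip_kernel(profiles, gamma_prime=1.0):
    """Gaussian Interaction Profile kernel over binary interaction profiles.

    `profiles` is a list of equal-length rows; row i is the interaction
    profile of entity i (1 = known association, 0 = unknown).
    The bandwidth normalisation below is the common convention and is an
    assumption here, since the abstract does not spell it out.
    """
    n = len(profiles)
    sq_norms = [sum(x * x for x in row) for row in profiles]
    mean_sq = sum(sq_norms) / n
    if mean_sq == 0:
        raise ValueError("all interaction profiles are empty")
    gamma = gamma_prime / mean_sq

    kernel = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            # Squared Euclidean distance between the two profiles.
            d2 = sum((a - b) ** 2 for a, b in zip(profiles[i], profiles[j]))
            kernel[i][j] = kernel[j][i] = math.exp(-gamma * d2)
    return kernel


# Hypothetical toy data: 3 circRNAs (rows) x 4 diseases (columns).
adjacency = [
    [1, 0, 1, 0],
    [1, 0, 0, 0],
    [0, 1, 0, 1],
]

circ_sim = gip_kernel(adjacency)
# Disease profiles are the columns of the same matrix.
disease_sim = gip_kernel([list(col) for col in zip(*adjacency)])
```

In this toy case the first two circRNAs share a disease and so score higher similarity than the first and third, which share none. In a pipeline like the one described, such similarity rows would be one of several attribute sources concatenated before the attention step.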

Original language: English
Pages (from-to): 5075-5084
Number of pages: 10
Journal: IEEE Journal of Biomedical and Health Informatics
Volume: 26
Issue number: 10
Publication status: Published - 1 Oct 2022
Externally published: Yes
