Inferring Drug-miRNA Associations by Integrating Drug SMILES and MiRNA Sequence Information

Zhen Hao Guo, Zhu Hong You, Li Ping Li, Zhan Heng Chen, Hai Cheng Yi, Yan Bin Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

The accumulated evidences indicate that drugs not only interact with proteins, but also regulate a wide variety of biomarkers such as miRNAs. Hence, uncovering potential drug-miRNA associations plays significant roles in disease prevention, diagnosis and treatment as well as drug development. In this paper, we discuss how this problem is formulated as a link prediction task in a bipartite graph and construct a computational model to infer unknown drug-miRNA associations. Specifically, the drug SMILES (Simplified molecular input line entry specification) or miRNA sequences can be regarded as a kind of biology language described by distributed representation. The experiment verified associations are treated as positive samples and the same number unlabeled associations are randomly selected as negative samples. Finally, Random Forest classifier is applied to perform the prediction task. In the experiment, the proposed method achieves AUROC of 91.16 and AUPR of 89.21 under 5-fold cross-validation. It demonstrates the great potential of seamless integration of deep learning and biological big data. We hope that this research with great expectations can be used as a practical guidance tool to bring useful inspiration to relevant researchers.

Original languageEnglish
Title of host publicationIntelligent Computing Theories and Application - 16th International Conference, ICIC 2020, Proceedings
EditorsDe-Shuang Huang, Kang-Hyun Jo
PublisherSpringer Science and Business Media Deutschland GmbH
Pages279-289
Number of pages11
ISBN (Print)9783030608019
DOIs
StatePublished - 2020
Externally publishedYes
Event16th International Conference on Intelligent Computing, ICIC 2020 - Bari , Italy
Duration: 2 Oct 20205 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12464 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Intelligent Computing, ICIC 2020
Country/TerritoryItaly
CityBari
Period2/10/205/10/20

Keywords

  • Association prediction
  • Biology language processing
  • Drug
  • miRNA

Fingerprint

Dive into the research topics of 'Inferring Drug-miRNA Associations by Integrating Drug SMILES and MiRNA Sequence Information'. Together they form a unique fingerprint.

Cite this