Predicting Human Disease-Associated piRNAs Based on Multi-source Information and Random Forest

Kai Zheng, Zhu Hong You, Lei Wang, Hao Yuan Li, Bo Ya Ji

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

Whole genome analysis studies have shown that Piwi-interacting RNA (piRNA) play a crucial role in disease progression, diagnosis, and therapeutic target. However, traditional biological experiments are expensive and time-consuming. Thus, computational models could serve as a complementary means to provide potential disease-related piRNA candidates. In this study, we propose a novel computational model called APDA to identify piRNA-disease associations. The proposed method integrates disease semantic similarity and piRNA sequence information to construct feature vectors, and maps them to the optimal feature subspace through the stacked autoencoder to obtain the final feature vector. Finally, random forest classifier is used to infer disease-related piRNA. In five-fold cross-validation, the APDA achieved an average AUC of 0.9088 and standard deviation of 0.0126, which is significantly better than the compared method. Therefore, the proposed APDA method is a powerful and necessary tool for predicting human disease-associated piRNAs and provide new impetus to reveal the underlying causes of human disease.

Original languageEnglish
Title of host publicationIntelligent Computing Theories and Application - 16th International Conference, ICIC 2020, Proceedings
EditorsDe-Shuang Huang, Kang-Hyun Jo
PublisherSpringer Science and Business Media Deutschland GmbH
Pages227-238
Number of pages12
ISBN (Print)9783030608019
DOIs
StatePublished - 2020
Externally publishedYes
Event16th International Conference on Intelligent Computing, ICIC 2020 - Bari , Italy
Duration: 2 Oct 20205 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12464 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Intelligent Computing, ICIC 2020
Country/TerritoryItaly
CityBari
Period2/10/205/10/20

Keywords

  • Disease
  • Heterogenous information
  • Multi-source information
  • piRNA-disease associations
  • PIWI-interacting RNA

Fingerprint

Dive into the research topics of 'Predicting Human Disease-Associated piRNAs Based on Multi-source Information and Random Forest'. Together they form a unique fingerprint.

Cite this