Precise Prediction of Pathogenic Microorganisms Using 16S rRNA Gene Sequences

Yu An Huang, Zhi An Huang, Zhu Hong You, Pengwei Hu, Li Ping Li, Zheng Wei Li, Lei Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Clinical observations show that human microorganisms get involved in various human biological processes. The disruption of a symbiotic balance for host-microbiota relationship is found to cause different types of human complex diseases. Discoverying the associations between microbes and the host health statuses that they affect could provide great insights into understanding the mechanisms of diseases caused by microbes. However, experimental approaches are time-consuming and expensive. Little effort has been done to develop computational models for predicting pathogenic microbes on a large scale. The prediction results yielded by such models are anticipated to boost the identification and characterization of potential human pathogenic microbes. Based on the assumption that microbes of similar characters tend to get involved in diseases of similar symptoms forming functional clusters, in this paper, we develop a group based computational model of Bayesian disease-oriented ranking for inferring the most potential microbes associated with human diseases. It is the first attempt to predict this kind of associations by using 16S rRNA gene sequences. Based on the sequence information of genes, we use two computational approaches (BLAST+ and MEGA 7) to measure how similar each pairs of microbes are from different aspects. On the other hand, the similarity of diseases is computed based on MeSH descriptors. Using the data collected from HMDAD database, the proposed model achieved AUCs of 0.9456, 0.8266, 0.8866 and 0.8926 in leave-one-out, 2-fold, 5-fold and 10-fold cross validations, respectively. Besides, we conducted a case study on colorectal carcinoma and found that 16 out of top-20 predicted microbes can be confirmed by the published literatures. The prediction result is publicly released and anticipated to help researchers to preferentially validate these promising pathogenic microbe candidates via biological experiments.

Original languageEnglish
Title of host publicationIntelligent Computing Theories and Application - 15th International Conference, ICIC 2019, Proceedings
EditorsDe-Shuang Huang, Kang-Hyun Jo, Zhi-Kai Huang
PublisherSpringer Verlag
Pages138-150
Number of pages13
ISBN (Print)9783030269685
DOIs
StatePublished - 2019
Externally publishedYes
Event15th International Conference on Intelligent Computing, ICIC 2019 - Nanchang, China
Duration: 3 Aug 20196 Aug 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11644 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th International Conference on Intelligent Computing, ICIC 2019
Country/TerritoryChina
CityNanchang
Period3/08/196/08/19

Keywords

  • 16S rRNA sequence analysis
  • Computational prediction model
  • Microbe–disease associations
  • Microflora
  • Pathogenic microorganisms

Fingerprint

Dive into the research topics of 'Precise Prediction of Pathogenic Microorganisms Using 16S rRNA Gene Sequences'. Together they form a unique fingerprint.

Cite this