A Machine Learning Based Method to Identify Differentially Expressed Genes

Bolin Chen, Li Gao, Xuequn Shang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Detecting differentially expressed genes (DEGs) under two biological conditions is an essential step and is one of the most common reasons for statistical analysis of Microarray and RNA-seq data. There are various methods developed to detect DEGs either originate from a sophisticated statistical model based on fold-change (FC) strategy or from an analysis of biological reasoning. In this paper, we present a machine learning based method called Fusion for identifying DEGs based on an ensemble strategy, it provides a straightforward stringent way to determine the significance level for each gene. We use the Fusion technique on two biological datasets, the results show that in each case it performs more reliably and consistently than the Limma as well as other methods. A validation on the Platinum Spike dataset indicates that the proposed approach is more reliable with high confidence in identifying DEGs. An analysis of the biological function of the identified genes illustrates that the designed ensemble technique is powerful for identifying biologically relevant expression changes.

Original languageEnglish
Title of host publicationIntelligent Computing Theories and Application - 16th International Conference, ICIC 2020, Proceedings
EditorsDe-Shuang Huang, Kang-Hyun Jo
PublisherSpringer Science and Business Media Deutschland GmbH
Pages21-31
Number of pages11
ISBN (Print)9783030608019
DOIs
StatePublished - 2020
Event16th International Conference on Intelligent Computing, ICIC 2020 - Bari , Italy
Duration: 2 Oct 20205 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12464 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Intelligent Computing, ICIC 2020
Country/TerritoryItaly
CityBari
Period2/10/205/10/20

Keywords

  • Differentially expressed genes
  • Ensemble strategy
  • Gene expression

Fingerprint

Dive into the research topics of 'A Machine Learning Based Method to Identify Differentially Expressed Genes'. Together they form a unique fingerprint.

Cite this