A Unified Deep Biological Sequence Representation Learning with Pretrained Encoder-Decoder Model

Hai Cheng Yi, Zhu Hong You, Xiao Rui Su, De Shuang Huang, Zhen Hao Guo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Machine learning methods are increasingly being applied to model and predict biomolecular interactions, while efficient feature representation plays a vital role. To this end, a unified biological sequence deep representation learning framework BioSeq2vec is proposed to extract discriminative features of any type of biological sequence. For arbitrary-length sequence input, the BioSeq2vec produces fixed-length efficient feature representation, which can be applied to various learning models. The performance of BioSeq2vec is evaluated on lncRNA-protein interaction prediction tasks. Experimental results reveal the superior performance of BioSeq2vec in biological sequence feature representation and broad prospects in various genome informatics and computational biology studies.

Original languageEnglish
Title of host publicationIntelligent Computing Theories and Application - 16th International Conference, ICIC 2020, Proceedings
EditorsDe-Shuang Huang, Kang-Hyun Jo
PublisherSpringer Science and Business Media Deutschland GmbH
Pages339-347
Number of pages9
ISBN (Print)9783030608019
DOIs
StatePublished - 2020
Externally publishedYes
Event16th International Conference on Intelligent Computing, ICIC 2020 - Bari , Italy
Duration: 2 Oct 20205 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12464 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Intelligent Computing, ICIC 2020
Country/TerritoryItaly
CityBari
Period2/10/205/10/20

Keywords

  • Deep learning
  • Pre-trained model
  • Representation learning
  • Seq2Seq
  • Sequence analysis

Fingerprint

Dive into the research topics of 'A Unified Deep Biological Sequence Representation Learning with Pretrained Encoder-Decoder Model'. Together they form a unique fingerprint.

Cite this