Advances in the compression of high-throughput DNA sequencing data

Zexuan Zhu, Yongpeng Zhang, Zhuhong You, Liang Jiang, Zhen Ji

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

With the development of high-throughput DNA sequencing technology, DNA sequencing data grows rapidly. The use of compression techniques provides an important candidate solution for the storage and transmission challenges of high-throughput DNA sequencing data. In this paper, the traditional DNA sequences compression methods, including substitutionary and statistical methods, and the reference-genome-based compression method for high-throughput DNA sequencing data are surveyed. The state-of-the-art algorithms of re-sequencing data compression, de novo sequencing data compression, quality score compression, and compressed data indexing are introduced and compared. The challenges and future prospects of high-throughput DNA sequencing data compression are also discussed.

Original languageEnglish
Pages (from-to)409-415
Number of pages7
JournalShenzhen Daxue Xuebao (Ligong Ban)/Journal of Shenzhen University Science and Engineering
Volume30
Issue number4
DOIs
StatePublished - Jul 2013
Externally publishedYes

Keywords

  • Computer application
  • Data compression
  • De novo sequencing
  • DNA sequencing
  • High-throughput sequencing
  • Next generation sequencing
  • Resequencing

Fingerprint

Dive into the research topics of 'Advances in the compression of high-throughput DNA sequencing data'. Together they form a unique fingerprint.

Cite this