학술논문

CoLoRd: compressing long reads
Document Type
Article
Source
Nature Methods; 20220101, Issue: Preprints p1-4, 4p
Subject
Language
ISSN
15487091; 15487105
Abstract
The cost of maintaining exabytes of data produced by sequencing experiments every year has become a major issue in today’s genomic research. In spite of the increasing popularity of third-generation sequencing, the existing algorithms for compressing long reads exhibit a minor advantage over the general-purpose gzip. We present CoLoRd, an algorithm able to reduce the size of third-generation sequencing data by an order of magnitude without affecting the accuracy of downstream analyses.