학술논문

LocoGSE, a sequence-based genome size estimator for plants
Document Type
article
Source
Frontiers in Plant Science, Vol 15 (2024)
Subject
genome size estimation
genome size
ploidy
genome-skimming
environmental DNA
plant genomics
Plant culture
SB1-1110
Language
English
ISSN
1664-462X
Abstract
Extensive research has focused on exploring the range of genome sizes in eukaryotes, with a particular emphasis on land plants, where significant variability has been observed. Accurate estimation of genome size is essential for various research purposes, but existing sequence-based methods have limitations, particularly for low-coverage datasets. In this study, we introduce LocoGSE, a novel genome size estimator designed specifically for low-coverage datasets generated by genome skimming approaches. LocoGSE relies on mapping the reads on single copy consensus proteins without the need for a reference genome assembly. We calibrated LocoGSE using 430 low-coverage Angiosperm genome skimming datasets and compared its performance against other estimators. Our results demonstrate that LocoGSE accurately predicts monoploid genome size even at very low depth of coverage (