학술논문

ExpansionHunter Denovo: a computational method for locating known and novel repeat expansions in short-read sequencing data
Document Type
article
Source
Genome Biology, Vol 21, Iss 1, Pp 1-14 (2020)
Subject
Repeat expansions
Short tandem repeats
Whole-genome sequencing data
Genome-wide analysis
Friedreich ataxia
Myotonic dystrophy type 1
Biology (General)
QH301-705.5
Genetics
QH426-470
Language
English
ISSN
1474-760X
Abstract
Abstract Repeat expansions are responsible for over 40 monogenic disorders, and undoubtedly more pathogenic repeat expansions remain to be discovered. Existing methods for detecting repeat expansions in short-read sequencing data require predefined repeat catalogs. Recent discoveries emphasize the need for methods that do not require pre-specified candidate repeats. To address this need, we introduce ExpansionHunter Denovo, an efficient catalog-free method for genome-wide repeat expansion detection. Analysis of real and simulated data shows that our method can identify large expansions of 41 out of 44 pathogenic repeats, including nine recently reported non-reference repeat expansions not discoverable via existing methods.