학술논문

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program
Document Type
article
Author
Taliun, DanielHarris, Daniel NKessler, Michael DCarlson, JedidiahSzpiech, Zachary ATorres, RaulTaliun, Sarah A GaglianoCorvelo, AndréGogarten, Stephanie MKang, Hyun MinPitsillides, Achilleas NLeFaive, JonathonLee, Seung-beenTian, XiaowenBrowning, Brian LDas, SayantanEmde, Anne-KatrinClarke, Wayne ELoesch, Douglas PShetty, Amol CBlackwell, Thomas WSmith, Albert VWong, QuennaLiu, XiaomingConomos, Matthew PBobo, Dean MAguet, FrançoisAlbert, ChristineAlonso, AlvaroArdlie, Kristin GArking, Dan EAslibekyan, StellaAuer, Paul LBarnard, JohnBarr, R GrahamBarwick, LucasBecker, Lewis CBeer, Rebecca LBenjamin, Emelia JBielak, Lawrence FBlangero, JohnBoehnke, MichaelBowden, Donald WBrody, Jennifer ABurchard, Esteban GCade, Brian ECasella, James FChalazan, BrandonChasman, Daniel IChen, Yii-Der IdaCho, Michael HChoi, Seung HoanChung, Mina KClish, Clary BCorrea, AdolfoCurran, Joanne ECuster, BrianDarbar, DawoodDaya, Michellede Andrade, MarizaDeMeo, Dawn LDutcher, Susan KEllinor, Patrick TEmery, Leslie SEng, CelesteFatkin, DianeFingerlin, TashaForer, LukasFornage, MyriamFranceschini, NoraFuchsberger, ChristianFullerton, Stephanie MGermer, SorenGladwin, Mark TGottlieb, Daniel JGuo, XiuqingHall, Michael EHe, JiangHeard-Costa, Nancy LHeckbert, Susan RIrvin, Marguerite RJohnsen, Jill MJohnson, Andrew DKaplan, RobertKardia, Sharon LRKelly, TanikaKelly, ShannonKenny, Eimear EKiel, Douglas PKlemmer, RobertKonkle, Barbara AKooperberg, CharlesKöttgen, AnnaLange, Leslie ALasky-Su, JessicaLevy, DanielLin, XihongLin, Keng-HanLiu, ChunyuLoos, Ruth JF
Source
Nature. 590(7845)
Subject
Genetics
Biotechnology
Human Genome
Aetiology
2.1 Biological and endogenous factors
Generic health relevance
Good Health and Well Being
Cytochrome P-450 CYP2D6
Genetic Variation
Genome
Human
Genomics
Haplotypes
Heterozygote
Humans
INDEL Mutation
Loss of Function Mutation
Mutagenesis
National Heart
Lung
and Blood Institute (U.S.)
Phenotype
Polymorphism
Single Nucleotide
Population Density
Precision Medicine
Quality Control
Sample Size
United States
Whole Genome Sequencing
NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium
General Science & Technology
Language
Abstract
The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.