학술논문

The impact of rare variation on gene expression across tissues
Document Type
article
Author
Aguet, FrançoisArdlie, Kristin GCummings, Beryl BGelfand, Ellen TGetz, GadHadley, KaneHandsaker, Robert EHuang, Katherine HKashin, SevaKarczewski, Konrad JLek, MonkolLi, XiaoMacArthur, Daniel GNedzel, Jared LNguyen, Duyen TNoble, Michael SSegrè, Ayellet VTrowbridge, Casandra ATukiainen, TaruAbell, Nathan SBalliu, BrunildaBarshir, RuthBasha, OmerBattle, AlexisBogu, Gireesh KBrown, AndrewBrown, Christopher DCastel, Stephane EChen, Lin SChiang, ColbyConrad, Donald FCox, Nancy JDamani, Farhan NDavis, Joe RDelaneau, OlivierDermitzakis, Emmanouil TEngelhardt, Barbara EEskin, EleazarFerreira, Pedro GFrésard, LaureGamazon, Eric RGarrido-Martín, DiegoGewirtz, Ariel DHGliner, GennaGloudemans, Michael JGuigo, RodericHall, Ira MHan, BuhmHe, YuanHormozdiari, FarhadHowald, CedricKyung Im, HaeJo, BrianYong Kang, EunKim, YungilKim-Hellmuth, SarahLappalainen, TuuliLi, GenLi, XinLiu, BoxiangMangul, SergheiMcCarthy, Mark IMcDowell, Ian CMohammadi, PejmanMonlong, JeanMontgomery, Stephen BMuñoz-Aguirre, ManuelNdungu, Anne WNicolae, Dan LNobel, Andrew BOliva, MeritxellOngen, HalitPalowitch, John JPanousis, NikolaosPapasaikas, PanagiotisPark, YoSonParsana, PrincyPayne, Anthony JPeterson, Christine BQuan, JieReverter, FerranSabatti, ChiaraSaha, AshisSammeth, MichaelScott, Alexandra JShabalin, Andrey ASodaei, RezaStephens, MatthewStranger, Barbara EStrober, Benjamin JSul, Jae HoonTsang, Emily KUrbut, Sarahvan de Bunt, MartijnWang, GaoWen, XiaoquanWright, Fred AXi, Hualin SYeger-Lotem, EstiZappala, Zachary
Source
Nature. 550(7675)
Subject
Bayes Theorem
Female
Gene Expression Profiling
Genetic Variation
Genome
Human
Genomics
Genotype
Humans
Male
Models
Genetic
Organ Specificity
Sequence Analysis
RNA
GTEx Consortium
Laboratory
Data Analysis &Coordinating Center (LDACC)—Analysis Working Group
Statistical Methods groups—Analysis Working Group
Enhancing GTEx (eGTEx) groups
NIH Common Fund
NIH/NCI
NIH/NHGRI
NIH/NIMH
NIH/NIDA
Biospecimen Collection Source Site—NDRI
Biospecimen Collection Source Site—RPCI
Biospecimen Core Resource—VARI
Brain Bank Repository—University of Miami Brain Endowment Bank
Leidos Biomedical—Project Management
ELSI Study
Genome Browser Data Integration &Visualization—EBI
Genome Browser Data Integration &Visualization—UCSC Genomics Institute
University of California Santa Cruz
General Science & Technology
Language
Abstract
Rare genetic variants are abundant in humans and are expected to contribute to individual disease risk. While genetic association studies have successfully identified common genetic variants associated with susceptibility, these studies are not practical for identifying rare variants. Efforts to distinguish pathogenic variants from benign rare variants have leveraged the genetic code to identify deleterious protein-coding alleles, but no analogous code exists for non-coding variants. Therefore, ascertaining which rare variants have phenotypic effects remains a major challenge. Rare non-coding variants have been associated with extreme gene expression in studies using single tissues, but their effects across tissues are unknown. Here we identify gene expression outliers, or individuals showing extreme expression levels for a particular gene, across 44 human tissues by using combined analyses of whole genomes and multi-tissue RNA-sequencing data from the Genotype-Tissue Expression (GTEx) project v6p release. We find that 58% of underexpression and 28% of overexpression outliers have nearby conserved rare variants compared to 8% of non-outliers. Additionally, we developed RIVER (RNA-informed variant effect on regulation), a Bayesian statistical model that incorporates expression data to predict a regulatory effect for rare variants with higher accuracy than models using genomic annotations alone. Overall, we demonstrate that rare variants contribute to large gene expression changes across tissues and provide an integrative method for interpretation of rare variants in individual genomes.