Academic Paper

Inferring compound heterozygosity from large-scale exome sequencing data
Document Type
Original Paper
Source
Nature Genetics. 56(1):152-161
Language
English
ISSN
1061-4036
1546-1718
Abstract
Recessive diseases arise when both copies of a gene are impacted by a damaging genetic variant. When a patient carries two potentially causal variants in a gene, accurate diagnosis requires determining that these variants occur on different copies of the chromosome (that is, are in trans) rather than on the same copy (that is, in cis). However, current approaches for determining phase, beyond parental testing, are limited in clinical settings. Here we developed a strategy for inferring phase for rare variant pairs within genes, leveraging genotypes observed in the Genome Aggregation Database (v2, n = 125,748 exomes). Our approach estimates phase with 96% accuracy, both in trio data and in patients with Mendelian conditions and presumed causal compound heterozygous variants. We provide a public resource of phasing estimates for coding variants and counts per gene of rare variants in trans that can aid interpretation of rare co-occurring variants in the context of recessive disease.
A strategy for inferring phase for rare variant pairs is applied to exome sequencing data for 125,748 individuals from the Genome Aggregation Database (gnomAD). This resource will aid interpretation of rare co-occurring variants in the context of recessive disease.
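Illustrative sketch
The abstract does not spell out how phase is estimated from population genotypes. As a hedged illustration only, one standard way to approach this kind of problem is to estimate two-variant haplotype frequencies with an expectation-maximization (EM) procedure and then compare the trans and cis haplotype combinations. The function name `estimate_p_trans`, the 3x3 count layout, and the example counts below are assumptions made for this sketch; they are not taken from the record and should not be read as the authors' implementation.

```python
def estimate_p_trans(counts, n_iter=200, tol=1e-12):
    """Estimate the probability that two variants are in trans.

    counts[i][j] is the (hypothetical) number of individuals carrying
    i alternate alleles at variant 1 and j alternate alleles at variant 2.
    Haplotypes are labelled by allele at (variant 1, variant 2):
    "00" = ref/ref, "01" = ref/alt, "10" = alt/ref, "11" = alt/alt.
    Returns None when neither arrangement is supported by the data.
    """
    n = counts
    total_haps = 2 * sum(sum(row) for row in n)
    if total_haps == 0:
        return None

    # Haplotype counts from every genotype whose phase is unambiguous
    # (everything except the double heterozygotes in n[1][1]).
    base = {
        "00": 2 * n[0][0] + n[0][1] + n[1][0],
        "01": 2 * n[0][2] + n[0][1] + n[1][2],
        "10": 2 * n[2][0] + n[1][0] + n[2][1],
        "11": 2 * n[2][2] + n[1][2] + n[2][1],
    }
    double_hets = n[1][1]

    freqs = {h: 0.25 for h in base}  # uniform starting point (an assumption)
    for _ in range(n_iter):
        # E-step: split double heterozygotes between cis (00 + 11)
        # and trans (01 + 10) according to current frequencies.
        cis = freqs["00"] * freqs["11"]
        trans = freqs["01"] * freqs["10"]
        p_cis = cis / (cis + trans) if cis + trans > 0 else 0.5

        expected = dict(base)
        expected["00"] += double_hets * p_cis
        expected["11"] += double_hets * p_cis
        expected["01"] += double_hets * (1 - p_cis)
        expected["10"] += double_hets * (1 - p_cis)

        # M-step: renormalise expected counts into frequencies.
        new_freqs = {h: c / total_haps for h, c in expected.items()}
        delta = max(abs(new_freqs[h] - freqs[h]) for h in new_freqs)
        freqs = new_freqs
        if delta < tol:
            break

    cis = freqs["00"] * freqs["11"]
    trans = freqs["01"] * freqs["10"]
    if cis + trans == 0:
        return None
    return trans / (cis + trans)


# Hypothetical counts: the variants are seen in different carriers only.
seen_apart = [[980, 10, 0], [10, 0, 0], [0, 0, 0]]
# Hypothetical counts: the variants are only ever seen together.
seen_together = [[990, 0, 0], [0, 10, 0], [0, 0, 0]]

p_apart = estimate_p_trans(seen_apart)
p_together = estimate_p_trans(seen_together)
```

In this toy setup, variants that only appear in different carriers yield a trans estimate near 1, while variants that always co-occur yield an estimate near 0. With very few double heterozygotes and sparse counts, an EM estimate of this kind can be sensitive to the starting point, so the result is best treated as a rough prior rather than a diagnosis.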