학술논문

A powerful approach to identify replicable variants in genome-wide association studies.
Document Type
Article
Source
American Journal of Human Genetics. May2024, Vol. 111 Issue 5, p966-978. 13p.
Subject
*GENOME-wide association studies
*HIDDEN Markov models
*LINKAGE disequilibrium
*FALSE discovery rate
*SINGLE nucleotide polymorphisms
Language
ISSN
0002-9297
Abstract
Replicability is the cornerstone of modern scientific research. Reliable identifications of genotype-phenotype associations that are significant in multiple genome-wide association studies (GWASs) provide stronger evidence for the findings. Current replicability analysis relies on the independence assumption among single-nucleotide polymorphisms (SNPs) and ignores the linkage disequilibrium (LD) structure. We show that such a strategy may produce either overly liberal or overly conservative results in practice. We develop an efficient method, ReAD, to detect replicable SNPs associated with the phenotype from two GWASs accounting for the LD structure. The local dependence structure of SNPs across two heterogeneous studies is captured by a four-state hidden Markov model (HMM) built on two sequences of p values. By incorporating information from adjacent locations via the HMM, our approach provides more accurate SNP significance rankings. ReAD is scalable, platform independent, and more powerful than existing replicability analysis methods with effective false discovery rate control. Through analysis of datasets from two asthma GWASs and two ulcerative colitis GWASs, we show that ReAD can identify replicable genetic loci that existing methods might otherwise miss. Replicability is one of the central issues in scientific research. We develop a robust and powerful method to assess replicability in GWAS via the repeated significance of SNPs. By accounting for the linkage disequilibrium structure, our method can identify replicable genetic loci that existing methods might otherwise miss. [ABSTRACT FROM AUTHOR]