학술논문

Comparative evaluation of the heterozygous variant standard deviation as a quality measure for next-generation sequencing.
Document Type
Academic Journal
Author
Høy Hansen M; Haematology-Pathology Research Laboratory, Haematology Research Unit and Pathology Research Unit, Department of Haematology, University of Southern Denmark and Odense University Hospital, Odense, Denmark. Electronic address: marcus.hoy.hansen@rsyd.dk.; Steensboe Lang C; Haematology-Pathology Research Laboratory, Haematology Research Unit and Pathology Research Unit, Department of Haematology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Clinical Pathology, Odense University Hospital, Odense, Denmark.; Abildgaard N; Haematology-Pathology Research Laboratory, Haematology Research Unit and Pathology Research Unit, Department of Haematology, University of Southern Denmark and Odense University Hospital, Odense, Denmark.; Nyvold CG; Haematology-Pathology Research Laboratory, Haematology Research Unit and Pathology Research Unit, Department of Haematology, University of Southern Denmark and Odense University Hospital, Odense, Denmark.
Source
Publisher: Elsevier Country of Publication: United States NLM ID: 100970413 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1532-0480 (Electronic) Linking ISSN: 15320464 NLM ISO Abbreviation: J Biomed Inform Subsets: MEDLINE
Subject
Language
English
Abstract
Next-generation sequencing holds unprecedented throughput in terms of informational content to cost. The technology has entered the scene in laboratory diagnostics and offers flexible workflows in biomedical research. However, the rapid acquisition of genomic data also gives rise to a substantial fraction of sequencing artifacts, causing the detection of false-positive germline variants or erroneous somatic mutations. Consequently, there is a pressing need for efficient and practical quality assessment in sequencing projects. In this study, we investigate using heterozygous variant allele frequency (VAF) standard deviation (σ) for supplementary quality control. Whereas several proposed quality metrics are based on empirical assessments, the dispersion of the allele frequencies reflects a direct approximation of the inherent and discrete features of a diploid genome. Consequently, homologous chromosomes display heterozygous VAF of approximately 1/2. Based on the meta-analysis of 152 whole-exome sequencing data sets, we found that σ reflects both sequencing coverage and noise and can be effectively modeled. It is concluded that the relative comparison of heterozygous VAF σ provides a practical handle for quality assessment, even for samples afflicted with copy-number alterations. The approach can be implemented when performing whole-exome, whole-genome, or targeted panel sequencing and helps identify problematic samples, such as those retrieved from archived formalin-fixed paraffin-embedded tissue.
Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
(Copyright © 2022 The Author(s). Published by Elsevier Inc. All rights reserved.)