학술논문

Evaluation of computational programs to predict HLA genotypes from genomic sequencing data.
Document Type
Article
Source
Briefings in Bioinformatics. Mar2018, Vol. 19 Issue 2, p179-187. 9p.
Subject
*HLA histocompatibility antigens
*POLYMERASE chain reaction
*EXOMES
*GENOMES
*GENE expression
Language
ISSN
1467-5463
Abstract
Motivation: Despite being essential for numerous clinical and research applications, high-resolution human leukocyte antigen (HLA) typing remains challenging and laboratory tests are also time-consuming and labour intensive.With next-generation sequencing data becoming widely accessible, on-demand in silico HLA typing offers an economical and efficient alternative. Results: In this study we evaluate the HLA typing accuracy and efficiency of five computational HLA typingmethods by comparing their predictions against a curated set of>1000 published polymerase chain reaction-derived HLA genotypes on three different data sets (whole genome sequencing, whole exome sequencing and transcriptomic sequencing data). The highest accuracy at clinically relevant resolution (four digits) we observe is 81% on RNAseq data by PHLAT and 99% accuracy by OPTITYPE when limited to Class I genes only.We also observed variability between the tools for resource consumption, with runtime ranging from an average of 5h (HLAMINER) to 7min (SEQ2HLA) and memory from 12.8GB (HLA-VBSEQ) to 0.46GB (HLAMINER) per sample. While a minimal coverage is required, other factors also determine prediction accuracy and the results between tools do not correlate well. Therefore, by combining tools, there is the potential to develop a highly accurate ensemble method that is able to deliver fast, economical HLA typing from existing sequencing data. [ABSTRACT FROM AUTHOR]