학술논문

Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications.
Document Type
Article
Source
Bioinformatics. 3/15/2020, Vol. 36 Issue 6, p1681-1688. 8p.
Subject
*GASES
*PIPELINES
*PANDEMICS
*INFLUENZA
*INFLUENZA viruses
*RNA viruses
*INFLUENZA A virus, H1N1 subtype
Language
ISSN
1367-4803
Abstract
Motivation Influenza viruses represent a global public health burden due to annual epidemics and pandemic potential. Due to a rapidly evolving RNA genome, inter-species transmission, intra-host variation, and noise in short-read data, reads can be lost during mapping, and de novo assembly can be time consuming and result in misassembly. We assessed read loss during mapping and designed a graph-based classifier, VAPOR, for selecting mapping references, assembly validation and detection of strains of non-human origin. Results Standard human reference viruses were insufficient for mapping diverse influenza samples in simulation. VAPOR retrieved references for 257 real whole-genome sequencing samples with a mean of > 99.8 % identity to assemblies, and increased the proportion of mapped reads by up to 13.3% compared to standard references. VAPOR has the potential to improve the robustness of bioinformatics pipelines for surveillance and could be adapted to other RNA viruses. Availability and implementation VAPOR is available at https://github.com/connor-lab/vapor. Supplementary information Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]