학술논문

BisQC: an operational pipeline for multiplexed bisulfite sequencing.
Document Type
Article
Source
BMC Genomics. 2014, Vol. 15 Issue 1, p1-39. 39p. 3 Diagrams, 7 Charts, 7 Graphs.
Subject
*SINGLE nucleotide polymorphisms
*GENOMES
*METHYLATION
*SULFITES
*BIOINFORMATICS
Language
ISSN
1471-2164
Abstract
Background Bisulfite sequencing is the most efficient single nucleotide resolution method for analysis of methylation status at whole genome scale, but improved quality control metrics are needed to better standardize experiments. Results We describe BisQC, a step-by-step method for multiplexed bisulfite-converted DNA library construction, pooling, spike-in content, and bioinformatics. We demonstrate technical improvements for library preparation and bioinformatic analyses that can be done in standard laboratories. We find that decoupling amplification of bisulfite converted (bis) DNA from the indexing reaction is an advantage, specifically in reducing total PCR cycle number and preselecting high quality bis-libraries. We also introduce a progressive PCR method for optimal library amplification and size-selection. At the sequencing stage, we thoroughly test the benefits of pooling non-bis DNA library with bis-libraries and find that BisSeq libraries can be pooled with a high proportion of non-bis DNA libraries with minimal impact on BisSeq output. For informatics analysis, we propose a series of optimization steps including the utilization of the mitochondrial genome as a QC standard, and we assess the validity of using duplicate reads for coverage statistics. Conclusion We demonstrate several quality control checkpoints at the library preparation, presequencing, post-sequencing, and post-alignment stages, which should prove useful in determining sample and processing quality. We also determine that including a significant portion of non-bisulfite converted DNA with bisulfite converted DNA has a minimal impact on usable bisulfite read output. [ABSTRACT FROM AUTHOR]