학술논문

Highly-multiplexed and efficient long-amplicon PacBio and Nanopore sequencing of hundreds of full mitochondrial genomes
Document Type
Report
Source
BMC Genomics. May 2, 2023, Vol. 24 Issue 1
Subject
United States
Language
English
ISSN
1471-2164
Abstract
Author(s): Benjamin R. Karin[sup.1,2] , Selene Arellano[sup.1] , Laura Wang[sup.1] , Kayla Walzer[sup.1] , Aaron Pomerantz[sup.1] , Juan Manuel Vasquez[sup.1] , Kamalakar Chatla[sup.1] , Peter H. Sudmant[sup.1,3] , Bryan H. [...]
Background Mitochondrial genome sequences have become critical to the study of biodiversity. Genome skimming and other short-read based methods are the most common approaches, but they are not well-suited to scale up to multiplexing hundreds of samples. Here, we report on a new approach to sequence hundreds to thousands of complete mitochondrial genomes in parallel using long-amplicon sequencing. We amplified the mitochondrial genome of 677 specimens in two partially overlapping amplicons and implemented an asymmetric PCR-based indexing approach to multiplex 1,159 long amplicons together on a single PacBio SMRT Sequel II cell. We also tested this method on Oxford Nanopore Technologies (ONT) MinION R9.4 to assess if this method could be applied to other long-read technologies. We implemented several optimizations that make this method significantly more efficient than alternative mitochondrial genome sequencing methods. Results With the PacBio sequencing data we recovered at least one of the two fragments for 96% of samples (~ 80-90%) with mean coverage ~ 1,500x. The ONT data recovered less than 50% of input fragments likely due to low throughput and the design of the Barcoded Universal Primers which were optimized for PacBio sequencing. We compared a single mitochondrial gene alignment to half and full mitochondrial genomes and found, as expected, increased tree support with longer alignments, though whole mitochondrial genomes were not significantly better than half mitochondrial genomes. Conclusions This method can effectively capture thousands of long amplicons in a single run and be used to build more robust phylogenies quickly and effectively. We provide several recommendations for future users depending on the evolutionary scale of their system. A natural extension of this method is to collect multi-locus datasets consisting of mitochondrial genomes and several long nuclear loci at once. Keywords: mtDNA, DNA barcoding, MinION, LongAmp, Third generation sequencing, Long read sequencing, Plasmid