Academic Article

FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.
Document Type
Article
Source
PLoS Computational Biology. 1/22/2024, Vol. 20 Issue 1, p1-11. 11p.
Subject
*NON-coding RNA
*RNA sequencing
*PYTHON programming language
*ADAPTERS (Telecommunication)
Language
ISSN
1553-734X
Abstract
Adapter trimming is an essential step for analyzing small RNA sequencing data, where reads are generally longer than the target RNAs, which range from 18 to 30 bp. Most adapter trimming tools require adapter information as input. However, adapter information is often hard to access, incorrectly specified, or not provided with publicly available datasets, hampering their reproducibility and reusability. Manual identification of adapter patterns from raw reads is labor-intensive and error-prone. Moreover, the use of randomized adapters to reduce ligation biases during library preparation makes adapter detection even more challenging. Here, we present FindAdapt, a Python package for fast and accurate detection of adapter patterns without relying on prior information. We demonstrated that FindAdapt was far superior to existing approaches. It identified adapters successfully in 180 simulation datasets with diverse read structures and 3,184 real datasets covering a variety of commercial and customized small RNA library preparation kits. FindAdapt is stand-alone software that can be easily integrated into small RNA sequencing analysis pipelines. [ABSTRACT FROM AUTHOR]
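The abstract describes the underlying problem: because reads are longer than the 18 to 30 bp inserts, each read runs into the 3' adapter, and that shared sequence can in principle be recovered from the reads themselves. The following is a minimal sketch of that idea using a naive k-mer count; it is not FindAdapt's algorithm or API, which this record does not describe. The adapter, the insert sequences, and the names `guess_adapter_seed` and `trim_read` are hypothetical, chosen only for illustration, and the sketch ignores sequencing errors and randomized adapters.

```python
from collections import Counter

# Hypothetical 3' adapter and insert sequences, invented for this illustration.
ADAPTER = "ACGTTGCAAGCTTCGGATCCA"
INSERT_SOURCES = [
    ("TGAGGTAGTAGGTTGTATAGTTCC", 18),
    ("TAGCTTATCAGACTGATGTTGACA", 20),
    ("TGGAGTGTGACAATGGTGTTTGCA", 22),
    ("CATTGCACTTGTCTCGGTCTGACA", 24),
]
INSERTS = [seq[:length] for seq, length in INSERT_SOURCES]
READ_LENGTH = 36

# Simulate fixed-length reads: insert followed by adapter, cut at READ_LENGTH.
READS = [(insert + ADAPTER)[:READ_LENGTH] for insert in INSERTS]


def guess_adapter_seed(reads, k=12, min_insert=18):
    """Return the k-mer found in the most reads at or after position min_insert.

    Assumption: inserts are at least min_insert long, so the adapter can only
    start at or after that position. Returns None if no k-mer is available.
    """
    counts = Counter()
    for read in reads:
        # Count each k-mer once per read so repeats within a read do not dominate.
        kmers = {read[i:i + k] for i in range(min_insert, len(read) - k + 1)}
        counts.update(kmers)
    if not counts:
        return None
    return counts.most_common(1)[0][0]


def trim_read(read, seed, min_insert=18):
    """Cut the read at the first occurrence of seed at or after min_insert."""
    pos = read.find(seed, min_insert)
    return read if pos == -1 else read[:pos]


seed = guess_adapter_seed(READS)
trimmed = [trim_read(read, seed) for read in READS]
```

In this toy data the start of the adapter is the only k-mer present in every read, so it wins the count and trimming at it recovers the inserts. Real data would need error tolerance and handling of random bases adjacent to the adapter, which is the harder case the abstract mentions.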