Academic Article

ShapeShifter: a novel approach for identifying and quantifying stable lariat intronic species in RNAseq data
Document Type
Article
Source
Quantitative Biology; September 2018, Vol. 6, Issue 3, pp. 267-274 (8 pages)
ISSN
2095-4689; 2095-4697
Abstract
Most intronic lariats are rapidly turned over after splicing. However, new research suggests that some introns may have additional post-splicing functions. Current bioinformatics methods for identifying lariats require a sequencing read that traverses the lariat branchpoint. These methods provide precise branchpoint sequence and position information, but they are limited in their ability to quantify the abundance of stabilized lariat species in a given RNAseq sample. Bioinformatic tools are needed to better address these emerging biological questions. We applied an unsupervised machine learning approach to sequencing reads from publicly available ENCODE data to learn to identify and quantify lariats based on the shape of RNAseq read coverage. We developed ShapeShifter, a novel approach for identifying and quantifying stable lariat species in RNAseq datasets. We learned a characteristic “lariat” curve from ENCODE RNAseq data and were able to estimate intron abundances from read coverage. Using this method, we discovered new stable introns in these samples that were not detected by the older method based on branchpoint-traversing reads. ShapeShifter provides a robust approach to detecting and quantifying stable lariat species.