학술논문

SimSpliceEvol: alternative splicing-aware simulation of biological sequence evolution.
Document Type
Article
Source
BMC Bioinformatics. 12/17/2019 Supplement 20, Vol. 20, p1-13. 13p. 2 Diagrams, 4 Charts, 4 Graphs.
Subject
*RNA splicing
*BIOLOGICAL evolution
*RNA sequencing
*ANTISENSE DNA
*SEQUENCE analysis
*SOURCE code
Language
ISSN
1471-2105
Abstract
Background: It is now well established that eukaryotic coding genes have the ability to produce more than one type of transcript thanks to the mechanisms of alternative splicing and alternative transcription. Because of the lack of gold standard real data on alternative splicing, simulated data constitute a good option for evaluating the accuracy and the efficiency of methods developed for splice-aware sequence analysis. However, existing sequence evolution simulation methods do not model alternative splicing, and so they can not be used to test spliced sequence analysis methods. Results: We propose a new method called SimSpliceEvol for simulating the evolution of sets of alternative transcripts along the branches of an input gene tree. In addition to traditional sequence evolution events, the simulation also includes gene exon-intron structure evolution events and alternative splicing events that modify the sets of transcripts produced from genes. SimSpliceEvol was implemented in Python. The source code is freely available at https://github.com/UdeS-CoBIUS/SimSpliceEvol. Conclusions: Data generated using SimSpliceEvol are useful for testing spliced RNA sequence analysis methods such as methods for spliced alignment of cDNA and genomic sequences, multiple cDNA alignment, orthologous exons identification, splicing orthology inference, transcript phylogeny inference, which requires to know the real evolutionary relationships between the sequences. [ABSTRACT FROM AUTHOR]