학술논문

Comparative genomic signature representations of the emerging COVID-19 coronavirus and other coronaviruses: High identity and possible recombination between Bat and Pangolin coronaviruses.
Document Type
Article
Source
Genomics. Nov2020, Vol. 112 Issue 6, p4189-4202. 14p.
Subject
*CORONAVIRUSES
*COVID-19
*DIGITAL signal processing
*SARS-CoV-2
*ANIMAL diseases
Language
ISSN
0888-7543
Abstract
Coronaviruses are responsible on respiratory diseases in animal and human. The combination of numerical encoding techniques and digital signal processing methods are becoming increasingly important in handling large genomic data. In this paper, we propose to analyze the SARS-CoV-2 genomic signature using the combination of different nucleotide representations and signal processing tools in the aim to identify its genetic origin. The sequence of SARS-CoV-2 was compared with 21 relevant sequences including Bat, Yak and Pangolin coronavirus sequences. In addition, we developed a new algorithm to locate the nucleotide modifications. The results show that the Bat and Pangolin coronaviruses were the most related to SARS-CoV-2 with 96% and 86% of identity all along the genome. Within the S gene sequence, the Pangolin sequence presents local highest nucleotide identity. Those findings suggest genesis of SARS-Cov-2 through evolution from Bat and Pangolin strains. This study offers new ways to automatically characterize viruses. • We propose to analyze the SARS-CoV-2 genomic signature using the combination of different nucleotide representations in the aim to identify its genetic origin. • The SARS-CoV-2 sequence was compared with 21 relevant sequences including Bat, Yak and Pangolin coronavirus sequences. • the Bat and Pangolin coronaviruses were the most related to SARS-CoV-2 • Within the S gene sequence, the Pangolin sequence presents local highest nucleotide identity. • This study suggests genesis of SARS-Cov-2 through evolution from bat and pangolin strains. [ABSTRACT FROM AUTHOR]