학술논문

Multiple Lines of Evidence Support 199 SARS-CoV-2 Positively Selected Amino Acid Sites
Document Type
article
Source
International Journal of Molecular Sciences, Vol 25, Iss 4, p 2428 (2024)
Subject
SARS-CoV-2
coronaviruses
positively selected amino acid sites
Biology (General)
QH301-705.5
Chemistry
QD1-999
Language
English
ISSN
1422-0067
1661-6596
Abstract
SARS-CoV-2 amino acid variants that contribute to an increased transmissibility or to host immune system escape are likely to increase in frequency due to positive selection and may be identified using different methods, such as codeML, FEL, FUBAR, and MEME. Nevertheless, when using different methods, the results do not always agree. The sampling scheme used in different studies may partially explain the differences that are found, but there is also the possibility that some of the identified positively selected amino acid sites are false positives. This is especially important in the context of very large-scale projects where hundreds of analyses have been performed for the same protein-coding gene. To account for these issues, in this work, we have identified positively selected amino acid sites in SARS-CoV-2 and 15 other coronavirus species, using both codeML and FUBAR, and compared the location of such sites in the different species. Moreover, we also compared our results to those that are available in the COV2Var database and the frequency of the 10 most frequent variants and predicted protein location to identify those sites that are supported by multiple lines of evidence. Amino acid changes observed at these sites should always be of concern. The information reported for SARS-CoV-2 can also be used to identify variants of concern in other coronaviruses.