학술논문

Computational scoring and experimental evaluation of enzymes generated by neural networks

Document Type

Article

Author

Johnson, Sean R.; Fu, Xiaozhi; Viknander, Sandra; Goldin, Clara; Monaco, Sarah; Zelezniak, Aleksej; Yang, Kevin K.

Source

Nature Biotechnology; 20240101, Issue: Preprints p1-10, 10p

Subject

Language

ISSN

10870156; 15461696

Abstract

In recent years, generative protein sequence models have been developed to sample novel sequences. However, predicting whether generated proteins will fold and function remains challenging. We evaluate a set of 20 diverse computational metrics to assess the quality of enzyme sequences produced by three contrasting generative models: ancestral sequence reconstruction, a generative adversarial network and a protein language model. Focusing on two enzyme families, we expressed and purified over 500 natural and generated sequences with 70–90% identity to the most similar natural sequences to benchmark computational metrics for predicting in vitro enzyme activity. Over three rounds of experiments, we developed a computational filter that improved the rate of experimental success by 50–150%. The proposed metrics and models will drive protein engineering research by serving as a benchmark for generative protein sequence models and helping to select active variants for experimental testing.

Online Access

Full Text (Nature Journals) Web of Science JCR 저널정보 Scopus Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송