학술논문

Optimizing Granulocyte Colony-Stimulating Factor Transcript for Enhanced Expression in Escherichia coli
Document Type
article
Author
Source
Frontiers in Bioengineering and Biotechnology, Vol 9 (2021)
Subject
messenger RNA engineering
G-CSF
optimizing transcript for recombinant protein expression
stable secondary structures in mRNA
translation efficiency
Biotechnology
TP248.13-248.65
Language
English
ISSN
2296-4185
11166606
Abstract
The human granulocyte colony-stimulating factor (G-CSF) is a hematopoietic growth factor used to prevent and treat neutropenia. G-CSF stimulates the bone marrow to produce infection-fighting granulocytes. Food and Drug Administration of the United States approved G-CSF in 1991 and its PEGylated version in 2002 as a prophylactic and therapeutic measure against neutropenia. Recombinant human G-CSF is produced in surrogate host Escherichia coli and is PEGylated at N-terminal. Besides neutropenia, G-CSF is also used in bone marrow transplantation for the mobilization and maturation of peripheral blood stem cells. Considering the requirement of producing G-CSF therapeutic in large quantities, construct designing for high expression is critical for the biopharmaceutical and industrial application. Earlier studies have employed approaches such as codon optimization, use of strong promoters, employment of protein tags, secretion signals, optimization of protein folding, etc., for increasing expression and yield of therapeutic proteins. In this study, it was observed that mRNA transcribed from the native human cDNA of G-CSF and the codon-optimized variant leads to low protein expression in E. coli. To understand the underlying reasons, the mRNA secondary structure of the 5′ end of the G-CSF transcript was analyzed. This analysis revealed the presence of stable secondary structures at the 5′ end of the G-CSF transcript, arising from the native human gene and even from the codon-optimized sequence. These secondary structures were disrupted through translationally silent mutations within the first 24 nucleotides of the transcript without affecting the protein sequence. Interestingly, through this approach, the G-CSF protein expression was increased 60 folds as compared to native G-CSF construct. We believe that these findings create a roadmap for optimization of G-CSF transcript for enhanced expression in E. coli and could be employed to increase the expression of other therapeutic proteins.