A new look at the fluorescent protein‐based approach for identifying optimal coding sequence for recombinant protein expression in E. coli
Artur I. Zabolotskii, Natalia S. Riabkova- Molecular Medicine
- Applied Microbiology and Biotechnology
- General Medicine
Abstract
Due to the degeneracy of the genetic code, most amino acids are encoded by several codons. The choice among synonymous codons at the N‐terminus of genes has a profound effect on protein expression in Escherichia coli. This is often explained by the different contributions of synonymous codons to mRNA secondary structure formation. Strong secondary structures at the 5'‐end of mRNA interfere with ribosome binding and affect the process of translation initiation. In silico optimization of the gene 5’‐end can significantly increase the level of protein expression; however, this method is not always effective due to the uncertainty of the exact mechanism by which synonymous substitutions affect expression; thus, it may produce nonoptimal variants as well as miss some of the best producers. In this paper, an alternative approach is proposed based on screening a partially randomized library of expression constructs comprising hundreds of selected synonymous variants. The effect of such substitutions was evaluated using the gene of interest fused to the reporter gene of the fluorescent protein with subsequent screening for the most promising candidates according to the reporter's signal intensity. The power of the approach is demonstrated by a significant increase in the prokaryotic expression of three proteins: canine cystatin C, human BCL2‐associated athanogene 3 and human cardiac troponin I. This simple approach was suggested which may provide an efficient, easy, and inexpensive optimization method for poorly expressed proteins in bacteria.