DOI: 10.1002/ange.7768514 ISSN: 0044-8249

Enhancing Enzyme Activity With Mutation Combinations Guided by Few‐Shot Learning and Causal Inference

Lin Guo, Xiaoguang Yan, Yali Lu, Shengxin Nie, Mingyue Ge, Yukun Li, Weiguo Li, Xiaochun Zhang, Dongmei Liang, Yihan Zhao, Hongxiao Tan, Xiling Chen, Shilong Fan, Yefeng Tang, Jianjun Qiao, Boxue Tian

ABSTRACT

Designing enzyme sequences to enhance product yield represents a fundamental challenge in metabolic engineering. Here, we established a workflow that integrates computational predictions with efficient experimental iteration to obtain outsized gains in product yield. Based on causal inference and examination of published datasets, we realized and ultimately experimentally confirmed that in vivo unit yield (yield/expression) can serve as an attractive surrogate for aqueous k cat / K m when optimizing for activity. In our workflow, we initially predict activity‐enhancing single mutants by calculating the binding affinities of reactive intermediates, followed by experimental investigations of unit yield. Subsequently, we predict activity‐enhancing mutation combinations using a few‐shot learning model we developed called Physics‐Inspired Feature Selection of Protein Language Models (PIFS‐PLM), which requires only 60–100 experimentally examined mutation combinations as input. In a case study of a bicyclogermacrene (BCG) synthase, we achieve a 73‐fold increase in BCG yield or a 15% increase in BCG selectivity based on combinations of 12 individual mutations, and provide extensive crystallographic and biochemical evidence for impacts from specific mutations. Thus, optimizing for unit yield is highly efficient as an alternative to optimizing for thermostability, and our study provides a powerful workflow for the efficient engineering of high‐yield enzyme variants.

More from our Archive