蛋白质工程
序列空间
计算机科学
定向进化
计算生物学
序列(生物学)
蛋白质设计
蛋白质测序
生成模型
人工智能
能源景观
生成语法
蛋白质结构
生物
酶
生物化学
肽序列
数学
突变体
基因
离散数学
巴拿赫空间
作者
Evgenii Lobzaev,Michael A. Herrera,Martyna Kasprzyk,Giovanni Stracquadanio
标识
DOI:10.1038/s41467-024-54814-w
摘要
Abstract Engineering proteins is a challenging task requiring the exploration of a vast design space. Traditionally, this is achieved using Directed Evolution (DE), which is a laborious process. Generative deep learning, instead, can learn biological features of functional proteins from sequence and structural datasets and return novel variants. However, most models do not generate thermodynamically stable proteins, thus leading to many non-functional variants. Here we propose a model called PRotein Engineering by Variational frEe eNergy approximaTion (PREVENT), which generates stable and functional variants by learning the sequence and thermodynamic landscape of a protein. We evaluate PREVENT by designing 40 variants of the conditionally essential E. coli phosphotransferase N-acetyl-L-glutamate kinase ( Ec NAGK). We find 85% of the variants to be functional, with 55% of them showing similar growth rate compared to the wildtype enzyme, despite harbouring up to 9 mutations. Our results support a new approach that can significantly accelerate protein engineering.
科研通智能强力驱动
Strongly Powered by AbleSci AI