Zero-shot prediction of mutation effects with multimodal deep representation learning guides protein engineering

蛋白质工程 突变体 计算生物学 突变 人工智能 定向进化 合成生物学 计算机科学 生物信息学 生物 遗传学 生物化学 基因
作者
Peng Cheng,Cong Mao,Jin Tang,Sen Yang,Yu Cheng,Wuke Wang,Qiuxi Gu,Wei Han,Hao Chen,Sihan Li,Chen Yaofeng,Jianglin Zhou,Wuju Li,Aimin Pan,Suwen Zhao,Xingxu Huang,Shiqiang Zhu,Jun Zhang,Wenjie Shu,Shengqi Wang
出处
期刊:Cell Research [Springer Nature]
被引量:1
标识
DOI:10.1038/s41422-024-00989-2
摘要

Abstract Mutations in amino acid sequences can provoke changes in protein function. Accurate and unsupervised prediction of mutation effects is critical in biotechnology and biomedicine, but remains a fundamental challenge. To resolve this challenge, here we present Pro tein M utational E ffect P redictor (ProMEP), a general and multiple sequence alignment-free method that enables zero-shot prediction of mutation effects. A multimodal deep representation learning model embedded in ProMEP was developed to comprehensively learn both sequence and structure contexts from ~160 million proteins. ProMEP achieves state-of-the-art performance in mutational effect prediction and accomplishes a tremendous improvement in speed, enabling efficient and intelligent protein engineering. Specifically, ProMEP accurately forecasts mutational consequences on the gene-editing enzymes TnpB and TadA, and successfully guides the development of high-performance gene-editing tools with their engineered variants. The gene-editing efficiency of a 5-site mutant of TnpB reaches up to 74.04% (vs 24.66% for the wild type); and the base editing tool developed on the basis of a TadA 15-site mutant (in addition to the A106V/D108N double mutation that renders deoxyadenosine deaminase activity to TadA) exhibits an A-to-G conversion frequency of up to 77.27% (vs 69.80% for ABE8e, a previous TadA-based adenine base editor) with significantly reduced bystander and off-target effects compared to ABE8e. ProMEP not only showcases superior performance in predicting mutational effects on proteins but also demonstrates a great capability to guide protein engineering. Therefore, ProMEP enables efficient exploration of the gigantic protein space and facilitates practical design of proteins, thereby advancing studies in biomedicine and synthetic biology.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
氧气邮递员完成签到,获得积分10
刚刚
刚刚
高强完成签到,获得积分10
1秒前
科研通AI2S应助水博士采纳,获得10
1秒前
高强发布了新的文献求助10
3秒前
3秒前
3秒前
阿俞应助Ben采纳,获得20
3秒前
4秒前
酷波er应助倩倩采纳,获得10
5秒前
welcome发布了新的文献求助10
6秒前
FYYY发布了新的文献求助10
6秒前
科研通AI6.2应助搞一篇SCI采纳,获得10
6秒前
高强发布了新的文献求助10
7秒前
37关闭了37文献求助
8秒前
cong发布了新的文献求助10
8秒前
Orange应助聪明小丸子采纳,获得10
9秒前
阿北完成签到,获得积分10
10秒前
简啦啦完成签到,获得积分10
10秒前
无极微光应助大佬帮帮我采纳,获得20
13秒前
mofan完成签到,获得积分10
13秒前
小鱼完成签到,获得积分10
14秒前
15秒前
15秒前
高高的乌冬面完成签到,获得积分10
16秒前
17秒前
17秒前
17秒前
17秒前
20秒前
历坷小梦发布了新的文献求助10
21秒前
orixero应助duxh123采纳,获得10
21秒前
水博士发布了新的文献求助10
21秒前
vc发布了新的文献求助50
22秒前
珍妮发布了新的文献求助10
22秒前
22秒前
无极微光发布了新的文献求助10
23秒前
23秒前
加油少年发布了新的文献求助10
23秒前
smile发布了新的文献求助10
23秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Introduction to Helicopter and Tiltrotor Flight Simulation, Second Edition 2500
卤化钙钛矿人工突触的研究 2000
History of U.S. Space Surveillance and Satellite Cataloging 1000
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Materials selection in mechanical design 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6504544
求助须知:如何正确求助?哪些是违规求助? 8298901
关于积分的说明 17714893
捐赠科研通 5603957
什么是DOI,文献DOI怎么找? 2919895
邀请新用户注册赠送积分活动 1897274
关于科研通互助平台的介绍 1759121