背景(考古学)
序列(生物学)
蛋白质设计
计算生物学
生物
遗传学
蛋白质结构
生物化学
古生物学
作者
Justas Dauparas,Gyu Rie Lee,Robert Pecoraro,Linna An,Ivan Anishchenko,Cameron J. Glasscock,David Baker
出处
期刊:Nature Methods
[Springer Nature]
日期:2025-03-28
卷期号:22 (4): 717-723
被引量:60
标识
DOI:10.1038/s41592-025-02626-1
摘要
Protein sequence design in the context of small molecules, nucleotides and metals is critical to enzyme and small-molecule binder and sensor design, but current state-of-the-art deep-learning-based sequence design methods are unable to model nonprotein atoms and molecules. Here we describe a deep-learning-based protein sequence design method called LigandMPNN that explicitly models all nonprotein components of biomolecular systems. LigandMPNN significantly outperforms Rosetta and ProteinMPNN on native backbone sequence recovery for residues interacting with small molecules (63.3% versus 50.4% and 50.5%), nucleotides (50.5% versus 35.2% and 34.0%) and metals (77.5% versus 36.0% and 40.6%). LigandMPNN generates not only sequences but also sidechain conformations to allow detailed evaluation of binding interactions. LigandMPNN has been used to design over 100 experimentally validated small-molecule and DNA-binding proteins with high affinity and high structural accuracy (as indicated by four X-ray crystal structures), and redesign of Rosetta small-molecule binder designs has increased binding affinity by as much as 100-fold. We anticipate that LigandMPNN will be widely useful for designing new binding proteins, sensors and enzymes.
科研通智能强力驱动
Strongly Powered by AbleSci AI