蛋白质设计
蛋白质工程
序列(生物学)
计算生物学
蛋白质结构
计算机科学
蛋白质测序
蛋白质结构预测
生物系统
化学
生物信息学
肽序列
人工智能
生物化学
生物
酶
基因
作者
Jiale Liu,Zheng Guo,Hantian You,Changsheng Zhang,Luhua Lai
标识
DOI:10.1002/anie.202411461
摘要
Designing sequences for specific protein backbones is a key step in creating new functional proteins. Here, we introduce GeoSeqBuilder, a deep learning framework that integrates protein sequence generation with side chain conformation prediction to produce the complete all‐atom structures for designed sequences. GeoSeqBuilder uses spatial geometric features from protein backbones and explicitly includes three‐body interactions of neighboring residues. GeoSeqBuilder achieves native residue type recovery rate of 51.6%, comparable to ProteinMPNN and other leading methods, while accurately predicting side chain conformations. We first used GeoSeqBuilder to design sequences for thioredoxin and a hallucinated three‐helical bundle protein. All the 15 tested sequences expressed as soluble monomeric proteins with high thermal stability, and the 2 high‐resolution crystal structures solved closely match the designed models. The generated protein sequences exhibit low similarity (minimum 23%) to the original sequences, with significantly altered hydrophobic cores. We further redesigned the hydrophobic core of glutathione peroxidase 4, and 3 of the 5 designs showed improved enzyme activity. Although further testing is needed, the high experimental success rate in our testing demonstrates that GeoSeqBuilder is a powerful tool for designing novel sequences for predefined protein structures with atomic details. GeoSeqBuilder is available at https://github.com/PKUliujl/GeoSeqBuilder
科研通智能强力驱动
Strongly Powered by AbleSci AI