生成语法
蛋白质二级结构
扩散
计算生物学
生成模型
计算机科学
生成设计
人工智能
生物
工程类
物理
生物化学
化学工程
热力学
相容性(地球化学)
作者
Bo Ni,David L. Kaplan,Markus J. Buehler
出处
期刊:Chem
[Elsevier]
日期:2023-04-20
卷期号:9 (7): 1828-1849
被引量:82
标识
DOI:10.1016/j.chempr.2023.03.020
摘要
We report two generative deep learning models that predict amino acid sequences and 3D protein structures based on secondary structure design objectives via either overall content or per-residue structure. Both models are robust regarding imperfect inputs and offer de novo design capacity as they can discover new protein sequences not yet discovered from natural mechanisms or systems. The residue-level secondary structure design model generally yields higher accuracy and more diverse sequences. These findings suggest unexplored opportunities for protein designs and functional outcomes within the vast amino acid sequences beyond known proteins. Our models, based on an attention-based diffusion model and trained on a dataset extracted from experimentally known 3D protein structures, offer numerous downstream applications in conditional generative design of various biological or engineering systems. Future work may include additional conditioning, and an exploration of other functional properties of the generated proteins for various properties beyond structural objectives.
科研通智能强力驱动
Strongly Powered by AbleSci AI