计算机科学
推论
马尔可夫链
人工智能
蛋白质结构预测
水准点(测量)
蛋白质结构
机器学习
物理
大地测量学
核磁共振
地理
作者
Diego E. Kleiman,Jiangyan Feng,Zhengyuan Xue,Diwakar Shukla
标识
DOI:10.1101/2025.08.20.671365
摘要
Understanding conformational dynamics is essential for elucidating protein function, yet most deep learning models in structural biology predict only static structures. Here, we introduce ESMDynamic, a deep learning model that predicts dynamic residue-residue contact probability maps directly from protein sequence. Built on the ESMFold architecture, ESMDynamic is trained on contact fluctuations from experimental structure ensembles and molecular dynamics (MD) simulations, enabling it to capture diverse modes of structural variability without requiring multiple sequence alignments. We benchmark ESMDynamic on two large-scale MD datasets (mdCATH and ATLAS), showing that it matches or outperforms state-of-the-art ensemble prediction models (AlphaFlow, ESMFlow, BioEmu) for transient contact prediction while offering orders-of-magnitude faster inference. We demonstrate the model on the ASCT2 and SWEET2b transporters, a de novo troponin C design, and the HIV-1 protease homodimer, illustrating generalization to unseen systems and recovery of experimentally validated dynamic contacts. Furthermore, we present an automated pipeline using ESMDynamic predictions to select collective variables for Markov State Model construction, producing high-quality kinetic models from unbiased MD simulations of SWEET2b. Overall, ESMDynamic provides a compact and interpretable sequence-based description of conformational dynamics, with broad applications in protein engineering, functional analysis, and simulation-guided discovery.
科研通智能强力驱动
Strongly Powered by AbleSci AI