理论(学习稳定性)
反向
蛋白质折叠
折叠(DSP实现)
蛋白质稳定性
计算机科学
计算生物学
化学
数学
生物
工程类
生物化学
机器学习
几何学
电气工程
作者
Hong‐Zhou Tan,Xiaowei Wei,Shenggeng Lin,Xueying Mao,Junwei Chen,Heqi Sun,Yufang Zhang,Zhenghong Zhou,Dong‐Qing Wei,Shuangjun Lin,Yi Xiong
标识
DOI:10.1101/2025.08.11.669595
摘要
Abstract Predicting protein stability change upon mutation is critical for protein engineering, yet remains limited by the modeling assumptions of physics-based methods and the generalization bottlenecks of data-driven approaches. We present ProStab, a deep learning framework that integrates sequence- and structure-based information, including the mutation-aware sequence embeddings from protein language models and the geometric features extracted via an inverse folding model. Trained on the large-scale Megascale dataset, ProStab demonstrates strong performance across diverse test sets and robust generalization across distribution shifts between the training and test sets. In head-to-head comparisons, ProStab outperforms all state-of-the-art methods with consistently higher Spearman correlation and precision. To evaluate its practical utility, we experimentally validated ProStab-predicted mutations on the model enzyme transaminase. Among the 16 successfully expressed variants, 4 exhibited improved thermal stability. Remarkably, the 1st top-ranked predicted mutation yielded the highest observed enzymatic activity, retaining three-fold that of the wild type after 10 minutes at 40 °C. To facilitate broader application, a publicly accessible web server has been developed. We envisage that ProStab provides a scalable and accurate platform for intelligent protein stability design.
科研通智能强力驱动
Strongly Powered by AbleSci AI