理论(学习稳定性)
突变
计算机科学
人工智能
深度学习
蛋白质稳定性
零(语言学)
语言模型
机器学习
化学
生物
遗传学
生物化学
语言学
哲学
基因
作者
Pan Tan,Mingchen Li,Liang Zhang,Zhiqiang Hu,Hong Liang
出处
期刊:Cornell University - arXiv
日期:2023-01-01
标识
DOI:10.48550/arxiv.2304.03780
摘要
We introduce TemPL, a novel deep learning approach for zero-shot prediction of protein stability and activity, harnessing temperature-guided language modeling. By assembling an extensive dataset of 96 million sequence-host bacterial strain optimal growth temperatures (OGTs) and {\Delta}Tm data for point mutations under consistent experimental conditions, we effectively compared TemPL with state-of-the-art models. Notably, TemPL demonstrated superior performance in predicting protein stability. An ablation study was conducted to elucidate the influence of OGT prediction and language modeling modules on TemPL's performance, revealing the importance of integrating both components. Consequently, TemPL offers considerable promise for protein engineering applications, facilitating the design of mutation sequences with enhanced stability and activity.
科研通智能强力驱动
Strongly Powered by AbleSci AI