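Generative model
Matching (statistics)
Computer science
Generative grammar
Flow (mathematics)
Artificial intelligence
Machine learning
Mathematics
Statistics
Geometry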
Authors
Yaowei Jin, Qi Huang, Ziyang Song, Mingyue Zheng, Dan Teng, Qian Shi
Identifier
DOI:10.1021/acs.jctc.4c01620
Abstract
Biological processes, functions, and properties are intricately linked to the ensemble of protein conformations rather than being solely determined by a single stable conformation. In this study, we developed P2DFlow, a generative model based on SE(3) flow matching, to predict the structural ensembles of proteins. We specifically designed a valuable prior for the flow process and enhanced the model's ability to distinguish each intermediate state by incorporating an additional dimension to describe the ensemble data, which can reflect the physical laws governing the distribution of ensembles so that the prior knowledge can effectively guide the generation process. When trained and evaluated on the MD data sets of ATLAS, P2DFlow outperforms other baseline models on extensive experiments, successfully capturing the observable dynamic fluctuations as evidenced in crystal structure and MD simulations. As a potential proxy agent for protein molecular simulation, the high-quality ensembles generated by P2DFlow could significantly aid in understanding protein functions across various scenarios. Code is available at https://github.com/BLEACH366/P2DFlow.