内在无序蛋白质
相(物质)
分子间力
化学物理
序列(生物学)
生物系统
化学
计算机科学
分子
生物
生物化学
有机化学
作者
Sören von Bülow,Giulio Tesei,Fatima Zaidi,Tanja Mittag,Kresten Lindorff‐Larsen
标识
DOI:10.1073/pnas.2417920122
摘要
Phase separation is one possible mechanism governing the selective cellular enrichment of biomolecular constituents for processes such as transcriptional activation, mRNA regulation, and immune signaling. Phase separation is mediated by multivalent interactions of macromolecules including intrinsically disordered proteins and regions (IDRs). Despite considerable advances in experiments, theory, and simulations, the prediction of the thermodynamics of IDR phase behavior remains challenging. We combined coarse-grained molecular dynamics simulations and active learning to develop a fast and accurate machine learning model to predict the free energy and saturation concentration for phase separation directly from sequence. We validate the model using computational and previously measured experimental data, as well as new experimental data for six proteins. We apply our model to all 27,663 IDRs of chain length up to 800 residues in the human proteome and find that 1,420 of these (5%) are predicted to undergo homotypic phase separation with transfer free energies < −2 k B T . We use our model to understand the relationship between single-chain compaction and phase separation and find that changes from charge- to hydrophobicity-mediated interactions can break the symmetry between intra- and intermolecular interactions. We also provide proof of principle for how the model can be used in force field refinement. Our work refines and quantifies the established rules governing the connection between sequence features and phase-separation propensities, and our prediction models will be useful for interpreting and designing cellular experiments on the role of phase separation, and for the design of IDRs with specific phase-separation propensities.
科研通智能强力驱动
Strongly Powered by AbleSci AI