Improved protein structure prediction using potentials from deep learning

计算机科学 蛋白质结构预测 梯度下降 蛋白质结构 构造(python库) 人工神经网络 人工智能 简单(哲学) 算法 机器学习 蛋白质超家族 功能(生物学) 计算生物学 生物系统 卡斯普 数据挖掘 生物 遗传学 认识论 基因 哲学 程序设计语言 生物化学
作者
Andrew Senior,Richard Evans,John Jumper,James Kirkpatrick,Laurent Sifre,Tim Green,Chongli Qin,Augustin Žídek,Alexander Nelson,Alex Bridgland,Hugo Penedones,Stig Petersen,Karen Simonyan,Steve Crossan,Pushmeet Kohli,David T. Jones,David Silver,Koray Kavukcuoglu,Demis Hassabis
出处
期刊:Nature [Nature Portfolio]
卷期号:577 (7792): 706-710 被引量:3500
标识
DOI:10.1038/s41586-019-1923-7
摘要

Protein structure prediction can be used to determine the three-dimensional shape of a protein from its amino acid sequence1. This problem is of fundamental importance as the structure of a protein largely determines its function2; however, protein structures can be difficult to determine experimentally. Considerable progress has recently been made by leveraging genetic information. It is possible to infer which amino acid residues are in contact by analysing covariation in homologous sequences, which aids in the prediction of protein structures3. Here we show that we can train a neural network to make accurate predictions of the distances between pairs of residues, which convey more information about the structure than contact predictions. Using this information, we construct a potential of mean force4 that can accurately describe the shape of a protein. We find that the resulting potential can be optimized by a simple gradient descent algorithm to generate structures without complex sampling procedures. The resulting system, named AlphaFold, achieves high accuracy, even for sequences with fewer homologous sequences. In the recent Critical Assessment of Protein Structure Prediction5 (CASP13)—a blind assessment of the state of the field—AlphaFold created high-accuracy structures (with template modelling (TM) scores6 of 0.7 or higher) for 24 out of 43 free modelling domains, whereas the next best method, which used sampling and contact information, achieved such accuracy for only 14 out of 43 domains. AlphaFold represents a considerable advance in protein-structure prediction. We expect this increased accuracy to enable insights into the function and malfunction of proteins, especially in cases for which no structures for homologous proteins have been experimentally determined7. AlphaFold predicts the distances between pairs of residues, is used to construct potentials of mean force that accurately describe the shape of a protein and can be optimized with gradient descent to predict protein structures.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
词词完成签到,获得积分10
2秒前
Lucas应助zhonyi采纳,获得10
2秒前
我是老大应助TWOFAT采纳,获得10
3秒前
无极微光应助俭朴不平采纳,获得20
6秒前
音玥完成签到,获得积分10
7秒前
8秒前
水波不兴完成签到 ,获得积分10
8秒前
11秒前
12秒前
Ava应助wangweicai采纳,获得10
12秒前
13秒前
欢喜的南露完成签到,获得积分10
13秒前
谨慎乌完成签到,获得积分10
13秒前
无奈的忆梅完成签到,获得积分20
14秒前
14秒前
14秒前
秋秋发布了新的文献求助10
15秒前
15秒前
17秒前
aaa发布了新的文献求助10
17秒前
18秒前
Alvienan完成签到 ,获得积分10
18秒前
18秒前
zhonyi发布了新的文献求助10
18秒前
缓慢的可乐完成签到,获得积分10
18秒前
上官若男应助ash_alice采纳,获得10
19秒前
xglake发布了新的文献求助10
19秒前
冻干粉发布了新的文献求助10
21秒前
TWOFAT发布了新的文献求助10
23秒前
23秒前
大尾巴白发布了新的文献求助10
24秒前
岸部完成签到,获得积分10
24秒前
24秒前
六沉完成签到 ,获得积分10
25秒前
明月落乌江完成签到,获得积分10
26秒前
我爱夏日长完成签到,获得积分10
26秒前
吱吱吱吱发布了新的文献求助10
27秒前
hchnb1234完成签到 ,获得积分10
27秒前
如意葶完成签到 ,获得积分10
28秒前
星星赶路完成签到,获得积分10
29秒前
高分求助中
GL 2 A method for assessing the in-place cleanability of food processing equipment, Fourth Edition, December 2023 3000
Annie Ernaux: De la perte au corps glorieux 600
Developing Solid Oral Dosage Forms Pharmaceutical Theory and Practice (3rd Edition) 500
Writing Systems 500
类器官构建与应用:从基础到前沿 500
Thermodynamics of Natural Systems 400
Electric Vehicle Powertrains Design Fundamentals, Components, and Applications 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6812174
求助须知:如何正确求助?哪些是违规求助? 8527764
关于积分的说明 18153331
捐赠科研通 6139150
什么是DOI,文献DOI怎么找? 3030213
邀请新用户注册赠送积分活动 2006884
关于科研通互助平台的介绍 2005934