Learning from Docked Ligands: Ligand-Based Features Rescue Structure-Based Scoring Functions When Trained on Docked Poses

化学 机器学习 对接(动物) 功能(生物学) 人工智能 蛋白质配体 集合(抽象数据类型) 计算机科学 训练集 化学 生物信息学 药物发现 生物 生物化学 程序设计语言 护理部 进化生物学 医学
作者
Fergus Boyles,Charlotte M. Deane,Garrett M. Morris
出处
期刊:Journal of Chemical Information and Modeling [American Chemical Society]
卷期号:62 (22): 5329-5341 被引量:11
标识
DOI:10.1021/acs.jcim.1c00096
摘要

Machine learning scoring functions for protein–ligand binding affinity have been found to consistently outperform classical scoring functions when trained and tested on crystal structures of bound protein–ligand complexes. However, it is less clear how these methods perform when applied to docked poses of complexes. We explore how the use of docked rather than crystallographic poses for both training and testing affects the performance of machine learning scoring functions. Using the PDBbind Core Sets as benchmarks, we show that the performance of a structure-based machine learning scoring function trained and tested on docked poses is lower than that of the same scoring function trained and tested on crystallographic poses. We construct a hybrid scoring function by combining both structure-based and ligand-based features, and show that its ability to predict binding affinity using docked poses is comparable to that of purely structure-based scoring functions trained and tested on crystal poses. We also present a new, freely available validation set─the Updated DUD-E Diverse Subset─for binding affinity prediction using data from DUD-E and ChEMBL. Despite strong performance on docked poses of the PDBbind Core Sets, we find that our hybrid scoring function sometimes generalizes poorly to a protein target not represented in the training set, demonstrating the need for improved scoring functions and additional validation benchmarks.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Echo1发布了新的文献求助10
1秒前
ZJL发布了新的文献求助10
1秒前
希望天下0贩的0应助bai采纳,获得10
1秒前
Hello应助小小杨采纳,获得10
1秒前
七里香发布了新的文献求助10
1秒前
1秒前
小白发布了新的文献求助10
2秒前
洁净硬币关注了科研通微信公众号
2秒前
magiczhu完成签到,获得积分10
3秒前
乐融融完成签到,获得积分10
4秒前
4秒前
5秒前
无花果应助fxxin采纳,获得10
5秒前
执着的过客完成签到,获得积分20
5秒前
小蘑菇应助嗯嗯采纳,获得10
5秒前
南枝瑾发布了新的文献求助10
6秒前
6秒前
苏苏发布了新的文献求助10
7秒前
雪晴发布了新的文献求助10
7秒前
小蘑菇应助蒯秀燕采纳,获得20
7秒前
7秒前
乐融融发布了新的文献求助10
8秒前
科研通AI2S应助简单幻珊的采纳,获得20
9秒前
Leo应助我爱学习采纳,获得20
9秒前
原野完成签到,获得积分10
10秒前
10秒前
蕾蕾完成签到 ,获得积分10
10秒前
大方万仇发布了新的文献求助10
11秒前
11秒前
11秒前
Freya发布了新的文献求助10
12秒前
12秒前
13秒前
科研通AI6.3应助panhaoyu采纳,获得10
13秒前
Echo1完成签到,获得积分10
14秒前
14秒前
15秒前
科研通AI2S应助LHP采纳,获得30
16秒前
科研牛马发布了新的文献求助10
16秒前
填空发布了新的文献求助10
16秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
The Resilient Mindset 400
Impact of Storage Orientation and Duration on Prefilled Syringe Performance: Break-Loose and Glide Forces, and Injection Time Across Multiple Time Points 360
Programming for Chemical Engineers Using C, C++, and MATLAB 300
Upland Kenya wild flowers and ferns: a flora of the flowers, ferns, grasses, and sedges of highland Kenya 300
Disturbing the Quiet Life? Competition and CEO Incentives 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6652721
求助须知:如何正确求助?哪些是违规求助? 8406550
关于积分的说明 17975079
捐赠科研通 5848202
什么是DOI,文献DOI怎么找? 2971802
邀请新用户注册赠送积分活动 1947301
关于科研通互助平台的介绍 1867864