Most Ligand-Based Classification Benchmarks Reward Memorization Rather than Generalization

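Overfitting, Computer science, Artificial intelligence, Virtual screening, Redundancy (engineering), Generalization, Similarity (geometry), Machine learning, Metric (data warehouse), Training set, Classifier (UML), Data mining, Fingerprint (computing), Similarity measure, Pattern recognition (psychology), Mathematics, Drug discovery, Bioinformatics, Biology, Artificial neural network, Mathematical analysis, Image (mathematics), Operating system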

Authors

Izhar Wallach, Abraham Heifets

Source

Journal: Journal of Chemical Information and Modeling [American Chemical Society]
Date: 2018-04-26 Volume (issue): 58 (5): 916-932 Citations: 204

Identifier

DOI: 10.1021/acs.jcim.7b00403

Abstract

Undetected overfitting can occur when there are significant redundancies between training and validation data. We describe AVE, a new measure of training-validation redundancy for ligand-based classification problems that accounts for the similarity amongst inactive molecules as well as active. We investigated seven widely-used benchmarks for virtual screening and classification, and show that the amount of AVE bias strongly correlates with the performance of ligand-based predictive methods irrespective of the predicted property, chemical fingerprint, similarity measure, or previously-applied unbiasing techniques. Therefore, it may be that the previously-reported performance of most ligand-based methods can be explained by overfitting to benchmarks rather than good prospective accuracy.
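The abstract does not spell out how AVE is computed. The sketch below follows the definition as commonly summarized from the paper: for each validation set, take the fraction of molecules whose nearest training neighbor lies within a distance threshold, average that fraction over a range of thresholds, and combine the four active/inactive pairings into one bias value. The function names, the use of binary fingerprints with Tanimoto distance, and the 101 evenly spaced thresholds are assumptions made for illustration, not details confirmed by the text above.

```python
import numpy as np

def tanimoto_distance(a, b):
    """Pairwise Tanimoto (Jaccard) distance between two sets of binary fingerprints."""
    a = np.asarray(a, dtype=bool)
    b = np.asarray(b, dtype=bool)
    inter = (a[:, None, :] & b[None, :, :]).sum(axis=2)
    union = (a[:, None, :] | b[None, :, :]).sum(axis=2)
    # Assumption: two all-zero fingerprints count as identical (distance 0).
    sim = np.where(union > 0, inter / np.maximum(union, 1), 1.0)
    return 1.0 - sim

def nn_fraction(validation, training, thresholds):
    """Mean over thresholds of the fraction of validation molecules whose
    nearest training neighbor is closer than the threshold."""
    nearest = tanimoto_distance(validation, training).min(axis=1)
    return float(np.mean([(nearest < d).mean() for d in thresholds]))

def ave_bias(val_act, val_inact, train_act, train_inact, thresholds=None):
    """Sketch of an AVE-style bias: (AA - AI) + (II - IA)."""
    if thresholds is None:
        # Assumed threshold grid; other grids are possible.
        thresholds = np.linspace(0.0, 1.0, 101)
    aa = nn_fraction(val_act, train_act, thresholds)
    ai = nn_fraction(val_act, train_inact, thresholds)
    ii = nn_fraction(val_inact, train_inact, thresholds)
    ia = nn_fraction(val_inact, train_act, thresholds)
    return (aa - ai) + (ii - ia)

# Toy split (made-up fingerprints): every validation molecule duplicates a
# training molecule of the same class, so nearest-neighbor lookup alone
# would classify the validation set perfectly.
train_act = [[1, 1, 0, 0]]
train_inact = [[0, 0, 1, 1]]
val_act = [[1, 1, 0, 0]]
val_inact = [[0, 0, 1, 1]]

bias = ave_bias(val_act, val_inact, train_act, train_inact)
# Swapping the validation labels puts each molecule next to the wrong class.
swapped = ave_bias(val_inact, val_act, train_act, train_inact)
```

Under this reading, a strongly positive value indicates that validation molecules sit close to training molecules of the same class, so a model can score well by memorizing neighbors. A value near zero suggests the split does not reward that shortcut, and a negative value means nearest neighbors tend to carry the opposite label.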
