Differential Multimolecule Fingerprint for Similarity Search─Making Use of Active and Inactive Compound Sets in Virtual Screening

虚拟筛选 指纹(计算) 公共化学 化学 相似性(几何) 计算机科学 人工智能 生物系统 模式识别(心理学) 立体化学 生物化学 药效团 生物 图像(数学)
作者
Michael Hutter
出处
期刊:Journal of Chemical Information and Modeling [American Chemical Society]
卷期号:62 (11): 2726-2736 被引量:2
标识
DOI:10.1021/acs.jcim.2c00242
摘要

In conventional fingerprint methods, the similarity between two molecules is calculated using the Tanimoto index as a numerical criterion. Thus, the query molecules in virtual screening should be most representative of the wanted compound class at hand. In the concept introduced here, all available active molecules form a multimolecule fingerprint in which the appearing features are weighted according to their respective frequency. The features of inactive molecules are treated likewise and the resulting values are subtracted from those of the active ones. The obtained differential multimolecule fingerprint (DMMFP) is thus specific for the respective class of compounds. To account for the noninteger representation within this fingerprint, a modified Sørensen-Dice coefficient is used to compute the similarity. Potentially active molecules yield positive scores, whereas presumably inactive ones are denoted by negative values. The concept was applied to Angiotensin-converting enzyme (ACE) inhibitors, β2-adrenoceptor ligands, leukotriene A4 hydrolase inhibitors, dopamine D3 antagonists, and cytochrome CYP2C9 substrates, for which experimental binding affinities are known and was tested against decoys from DUD-E and a further background database consisting of molecules from the dark chemical matter, which comprises compounds that appear as frequent hitters across multiple assays. Using the 166 publicly available keys of the MACCS fingerprint and the larger PubChem fingerprint, actives were recovered with very high sensitivity. Furthermore, three marketed ACE inhibitors as well as the carbonic anhydrase II inhibitor dorzolamide were detected in the dark chemical matter data set. For comparison, the DMMFP was also used with a Bayesian classifier, for which the specificity (correctly classified inactives) and likewise the accuracy was superior. Conversely, the similarity score produced by the Sørensen-Dice coefficient showed its potential for the early recognition of (potentially) active molecules.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
ttt完成签到 ,获得积分10
1秒前
打打应助晓晓晓采纳,获得10
3秒前
隐形曼青应助jsw采纳,获得10
4秒前
EE完成签到 ,获得积分10
12秒前
拓小八完成签到,获得积分10
13秒前
鹏826完成签到 ,获得积分10
16秒前
yk完成签到 ,获得积分10
17秒前
晓晓晓完成签到,获得积分10
17秒前
平常从蓉完成签到,获得积分10
21秒前
33秒前
chenyan完成签到,获得积分10
40秒前
Hululu完成签到 ,获得积分10
47秒前
风中茈完成签到 ,获得积分10
47秒前
星寒完成签到 ,获得积分10
54秒前
58秒前
jsw发布了新的文献求助10
1分钟前
Ive完成签到,获得积分10
1分钟前
Ding完成签到,获得积分10
1分钟前
l老王完成签到 ,获得积分10
1分钟前
爱爱完成签到 ,获得积分10
1分钟前
云鹤完成签到 ,获得积分10
1分钟前
luffy完成签到 ,获得积分10
1分钟前
CY.X完成签到 ,获得积分10
1分钟前
zhu完成签到 ,获得积分10
1分钟前
jsw发布了新的文献求助10
1分钟前
Akim应助科研通管家采纳,获得10
2分钟前
科目三应助科研通管家采纳,获得10
2分钟前
samuel完成签到,获得积分10
2分钟前
小灰灰完成签到 ,获得积分10
2分钟前
吴邪完成签到,获得积分10
2分钟前
老宇126完成签到,获得积分10
2分钟前
shame完成签到 ,获得积分10
2分钟前
多克特里完成签到 ,获得积分10
2分钟前
xiao完成签到 ,获得积分10
2分钟前
接accept完成签到 ,获得积分10
2分钟前
曾欢完成签到 ,获得积分10
2分钟前
轻松思枫完成签到 ,获得积分10
2分钟前
想吃芝士焗饭完成签到 ,获得积分10
2分钟前
清净163完成签到,获得积分10
2分钟前
xiaoruixue完成签到,获得积分10
2分钟前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Sport in der Antike 800
Aspect and Predication: The Semantics of Argument Structure 666
De arte gymnastica. The art of gymnastics 600
少脉山油柑叶的化学成分研究 530
Electronic Structure Calculations and Structure-Property Relationships on Aromatic Nitro Compounds 500
Berns Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2413018
求助须知:如何正确求助?哪些是违规求助? 2106947
关于积分的说明 5324473
捐赠科研通 1834469
什么是DOI,文献DOI怎么找? 913982
版权声明 560972
科研通“疑难数据库(出版商)”最低求助积分说明 488751