化学
计算机科学
深度学习
人工智能
集合(抽象数据类型)
训练集
药物靶点
数据集
药物发现
人工神经网络
机器学习
化学信息学
二进制数
数据挖掘
卷积神经网络
药物数据库
二元分类
计算生物学
深层神经网络
NIST公司
小分子
互操作性
作者
Maedeh Darsaraee,Sacha Javor,Jean-Louis Reymond
标识
DOI:10.1021/acs.jcim.6c00299
摘要
Drug-like molecules often interact with multiple biological targets. Assessing this polypharmacology is essential for drug development. Here, we trained deep neural networks to associate bioactive molecules up to 80 non-hydrogen atoms reported in ChEMBL 34, represented as binary substructure fingerprints, with lists of targets on which the molecules are ≥50% active at ≤10 μM. We included 2,496,555 interactions between 1,187,089 molecules and 7546 targets having at least five reported active molecules, including single proteins, protein complexes, protein families, cell lines, organisms, and further target types. This represents a much larger data set than in previously reported models, which were mostly limited to protein targets. Our models achieve good performances in terms of recall and precision per molecule and per target, as illustrated by overall statistics and by a case study in comparison with other online prediction tools. PPB3 predictions can be performed online at https://ppb3.gdb.tools/.
科研通智能强力驱动
Strongly Powered by AbleSci AI