虚拟筛选
Crystal(编程语言)
计算机科学
现存分类群
可药性
人工智能
机器学习
共晶
集合(抽象数据类型)
数据集
灵敏度(控制系统)
数据挖掘
算法
化学
分子
分子动力学
计算化学
工程类
氢键
有机化学
基因
生物
进化生物学
生物化学
程序设计语言
电子工程
作者
Dezhi Yang,Li Wang,Penghui Yuan,Qi An,Bin Su,Mingchao Yu,Ting Chen,Kun Hu,Li Zhang,Yang Lu,Guanhua Du
标识
DOI:10.1016/j.cclet.2022.107964
摘要
Co-crystal formation can improve the physicochemical properties of a compound, thus enhancing its druggability. Therefore, artificial intelligence-based co-crystal virtual screening in the early stage of drug development has attracted extensive attention from researchers. However, the complexity of developing and applying algorithms hinders it wide application. This study presents a data-driven co-crystal prediction method based on the XGBoost machine learning model of the scikit-learn package. The simplified molecular input line entry specification (SMILES) information of two compounds is simply inputted to determine whether a co-crystal can be formed. The data set includs the co-crystal records presented in the Cambridge Structural Database (CSD) and the records of no co-crystal formation from extant literature and experiments. RDKit molecular descriptors are adopted as the features of a compound in the data set. The developed model shows excellent performance in the proposed co-crystal training and validation sets with high accuracy, sensitivity, and F1 score. The prediction success rate of the model exceeds 90%. The model therefore provides a simple and feasible scheme for designing and screening co-crystal drugs efficiently and accurately.
科研通智能强力驱动
Strongly Powered by AbleSci AI