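Quantitative structure–activity relationship
Applicability domain
Partition coefficient
Partial least squares regression
Support vector machine
Mathematics
Octanol
Correlation coefficient
Training set
Chemistry
Statistics
Chromatography
Artificial intelligence
Computer science
Stereochemistry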
Authors
Jianbing Wang,Dongsheng Cao,Minfeng Zhu,Yong‐Huan Yun,Nan Xiao,Yi‐Zeng Liang
Abstract
Lipophilicity, evaluated by either the n‐octanol/water partition coefficient or the n‐octanol/buffer solution distribution coefficient, is of high importance in pharmacology, toxicology, and medicinal chemistry. A quantitative structure–property relationship study was carried out to predict distribution coefficients at pH 7.4 (logD7.4) for a large data set of 1130 organic compounds. Partial least squares and support vector machine (SVM) regressions were employed to build prediction models with 30 molecular descriptors selected by a genetic algorithm. The results demonstrated that the SVM model is more reliable and has better prediction performance than the partial least squares model. The squared correlation coefficients of fitting, cross validation, and prediction are 0.92, 0.90, and 0.89, respectively. The corresponding root mean square errors are 0.52, 0.59, and 0.56, respectively. The robustness, reliability, and generalization ability of the model were assessed by a Y‐randomization test and an applicability domain analysis. When compared with logD7.4 values calculated by five existing methods from Discovery Studio and ChemAxon, our SVM model outperforms them. The results indicated that our model could give a reliable and robust prediction of logD7.4. Copyright © 2015 John Wiley & Sons, Ltd.
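The validation statistics named in the abstract (squared correlation coefficient, root mean square error, and the Y‐randomization test) can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes synthetic data, a single hypothetical descriptor, and an ordinary least-squares line standing in for the PLS and SVM models. All function names (`fit_line`, `r2_and_rmse`, `y_randomization`) are illustrative only.

```python
import math
import random


def fit_line(x, y):
    """Ordinary least-squares fit y ~ a*x + b (stand-in for PLS/SVM)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    a = sxy / sxx
    return a, mean_y - a * mean_x


def r2_and_rmse(y_true, y_pred):
    """Return the squared correlation coefficient (as 1 - SSres/SStot) and the RMSE."""
    n = len(y_true)
    mean_y = sum(y_true) / n
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot, math.sqrt(ss_res / n)


def y_randomization(x, y, n_rounds=100, seed=0):
    """Refit on shuffled responses; a sound model should beat every shuffled fit."""
    rng = random.Random(seed)
    scores = []
    for _ in range(n_rounds):
        y_perm = list(y)
        rng.shuffle(y_perm)  # break the descriptor-response link
        a, b = fit_line(x, y_perm)
        r2, _ = r2_and_rmse(y_perm, [a * xi + b for xi in x])
        scores.append(r2)
    return scores


# Synthetic data (assumption): one descriptor, noisy linear response.
data_rng = random.Random(42)
x = [data_rng.uniform(-2.0, 6.0) for _ in range(200)]
y = [0.8 * xi + 0.5 + data_rng.gauss(0.0, 0.5) for xi in x]

a, b = fit_line(x, y)
r2_real, rmse_real = r2_and_rmse(y, [a * xi + b for xi in x])
r2_shuffled = y_randomization(x, y)

print(f"fitted R2 = {r2_real:.2f}, RMSE = {rmse_real:.2f}")
print(f"max R2 over shuffled responses = {max(r2_shuffled):.3f}")
```

If the fitted R2 were no higher than the R2 values obtained on shuffled responses, the apparent fit would be attributable to chance correlation rather than to a real structure–property relationship, which is the failure mode a Y‐randomization test is meant to detect.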