亲脂性
数量结构-活动关系
偏最小二乘回归
保留时间
分子描述符
化学
相似性(几何)
时间点
适用范围
生物系统
数学
人工智能
色谱法
统计
计算机科学
立体化学
哲学
图像(数学)
美学
生物
作者
Shilpayan Ghosh,Mainak Chatterjee,Kunal Roy
标识
DOI:10.1021/acs.jafc.3c01438
摘要
The retention time (log tR) of pesticidal compounds in a reverse-phase high-performance liquid chromatography (HPLC) analysis has a direct relationship with lipophilicity, which could be related to the ecotoxicity potential of the compounds. The novel quantitative read-across structure-property relationship (q-RASPR) modeling approach uses similarity-based descriptors for predictive model generation. These models have been shown to enhance external predictivity in previous studies for several end points. The current study describes the development of a q-RASPR model using experimental retention time data (log tR) in the HPLC experiments of 823 environmentally significant pesticide residues collected from a large compound database. To model the retention time (log tR) end point, 0D-2D descriptors have been used along with the read-across-derived similarity descriptors. The developed partial least squares (PLS) model was rigorously validated by various internal and external validation metrics as recommended by the Organization for Economic Co-operation and Development (OECD). The final q-RASPR model is proven to be a good fit, robust, and externally predictive (ntrain = 618, R2 = 0.82, Q2LOO = 0.81, ntest = 205, and Q2F1 = 0.84) that literally outperforms the external predictivity of the previously reported quantitative structure-property relationship (QSPR) model. From the insights of modeled descriptors, lipophilicity is found to be the most important chemical property, which positively correlates with the retention time (log tR). Various other characteristics, such as the number of multiple bonds (nBM), graph density (GD), etc., have a substantial and inversely proportionate relationship with the retention time end point. The software tools utilized in this study are user-friendly, and most of them are free, which makes our methodology quite cost-effective when compared to experimentation. In a nutshell, to obtain better external predictivity, interpretability, and transferability, q-RASPR is an efficient technique that has the potential to be employed as a good alternative approach for retention time prediction and ecotoxicity potential identification.
科研通智能强力驱动
Strongly Powered by AbleSci AI