致癌物
计算机科学
接收机工作特性
分子描述符
交叉验证
曲线下面积
可靠性(半导体)
预测能力
人工智能
机器学习
数量结构-活动关系
生物信息学
化学
功率(物理)
生物
药代动力学
认识论
量子力学
物理
哲学
有机化学
作者
Wenguo Zhu,Li Zhang,Jianqiang Sun,Rui Ma,Shu Yin,Qi Zhao
摘要
The carcinogenicity of drugs can have a serious impact on human health, so carcinogenicity testing of new compounds is very necessary before being put on the market. Currently, many methods have been used to predict the carcinogenicity of compounds. However, most methods have limited predictive power and there is still much room for improvement. In this study, we construct a deep learning model based on capsule network and attention mechanism named DCAMCP to discriminate between carcinogenic and non-carcinogenic compounds. We train the DCAMCP on a dataset containing 1564 different compounds through their molecular fingerprints and molecular graph features. The trained model is validated by fivefold cross-validation and external validation. DCAMCP achieves an average accuracy (ACC) of 0.718 ± 0.009, sensitivity (SE) of 0.721 ± 0.006, specificity (SP) of 0.715 ± 0.014 and area under the receiver-operating characteristic curve (AUC) of 0.793 ± 0.012. Meanwhile, comparable results can be achieved on an external validation dataset containing 100 compounds, with an ACC of 0.750, SE of 0.778, SP of 0.727 and AUC of 0.811, which demonstrate the reliability of DCAMCP. The results indicate that our model has made progress in cancer risk assessment and could be used as an efficient tool in drug design.
科研通智能强力驱动
Strongly Powered by AbleSci AI