亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Prediction of Drug-Induced Nephrotoxicity Using Chemical Information and Transcriptomics Data

肾毒性 药品 转录组 计算机科学 化学 计算生物学 药理学 医学 生物 内科学 生物化学 基因表达 基因
作者
Hemanth Chenga,Ayush Garg,Shyam Sundar Das,Narayanan Ramamurthi
出处
期刊:Journal of Chemical Information and Modeling [American Chemical Society]
标识
DOI:10.1021/acs.jcim.5c00141
摘要

Prediction of drug-induced nephrotoxicity is an important task in the drug discovery and development pipeline. Chemical information-based machine learning models are used in general for nephrotoxicity prediction as a part of computational modeling. Currently, gene expression data are being considered increasingly for prediction of different toxicities, as they can provide mechanistic understanding by which the drug causes specific organ toxicity. Here, we demonstrate the use of gene expression data for nephrotoxicity prediction using multiple machine learning methods such as LightGBM, random forest, support vector machine, and XGBoost. Apart from the models built with all the gene expression profiles for selected compounds, the sample selection technique is used to select three different subsets of gene expression profiles of sizes 6000, 9000, and 12,000 and models are generated using them also. Considering the imbalanced class distribution in gene expression data, different techniques such as optimal probability thresholds determination, data balancing, and cost-sensitive learning are considered during model generation. We have also generated chemical information-based models to compare the performance of gene expression-based models. Multiple data division techniques are applied to enhance the performance of chemical information-based models. The best chemical information-based model (CIM19) and best gene expression-based model (GEM9) (generated without any data balancing techniques) have similar AUC values of 0.89 and 0.9, respectively. To further enhance the performance of gene expression-based models, we have developed a model GEM20 with all the 6162 toxic gene expression profiles and the same number of nontoxic profiles selected using the SPXY method from 18,825 nontoxic profiles. This model provides the highest AUC score of 0.94 among all of the chemical information- and gene expression-based models. Additionally, SHAP analysis has been performed on a gene expression-based model and identified several genes such as cell division cycle 20, RPS6, DNA damage-inducible transcript 4, GAPDH, CCNF, and MRPL12, which could be associated with nephrotoxicity.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
哈比人linling完成签到,获得积分10
1秒前
2秒前
可乐wutang发布了新的文献求助10
6秒前
8秒前
14秒前
15秒前
可乐wutang发布了新的文献求助10
27秒前
慕青应助科研努力版采纳,获得10
27秒前
华仔应助科研通管家采纳,获得10
28秒前
28秒前
我是老大应助可乐wutang采纳,获得10
41秒前
55秒前
1分钟前
梦梦梦发布了新的文献求助10
1分钟前
万邦德完成签到,获得积分10
1分钟前
乐正亦寒完成签到 ,获得积分10
1分钟前
搞怪建辉完成签到,获得积分10
1分钟前
翁遥发布了新的文献求助20
2分钟前
赘婿应助科研通管家采纳,获得10
2分钟前
丘比特应助科研通管家采纳,获得10
2分钟前
attention完成签到,获得积分10
3分钟前
3分钟前
yang发布了新的文献求助10
3分钟前
mark163完成签到,获得积分10
3分钟前
arbitmomo完成签到,获得积分10
4分钟前
4分钟前
任性天晴发布了新的文献求助10
4分钟前
4分钟前
4分钟前
S1mple发布了新的文献求助10
4分钟前
5分钟前
Dr发布了新的文献求助10
5分钟前
CodeCraft应助任性天晴采纳,获得10
5分钟前
5分钟前
Dainlyn发布了新的文献求助10
5分钟前
李健的粉丝团团长应助Dr采纳,获得10
5分钟前
5分钟前
Dr完成签到,获得积分10
5分钟前
5分钟前
5分钟前
高分求助中
液晶指向矢仿真分析数据集 8888
GL 2 A method for assessing the in-place cleanability of food processing equipment, Fourth Edition, December 2023 3000
Ideology and Meaning-Making under the Putin Regime 750
Annie Ernaux: De la perte au corps glorieux 600
Petrology and Plate Tectonics 500
Writing Systems 500
A Handbook of User Experience Research & Design in Libraries 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6850295
求助须知:如何正确求助?哪些是违规求助? 8556634
关于积分的说明 18198792
捐赠科研通 6207713
什么是DOI,文献DOI怎么找? 3043628
关于科研通互助平台的介绍 2038144
邀请新用户注册赠送积分活动 2021050