自编码
癌细胞系
深度学习
均方误差
灵敏度(控制系统)
人工智能
人工神经网络
计算生物学
计算机科学
基因组学
模式识别(心理学)
机器学习
癌症
数学
癌细胞
生物
基因
基因组
遗传学
统计
工程类
电子工程
作者
Min Li,Yake Wang,Ruiqing Zheng,Xinghua Shi,Yaohang Li,Fang‐Xiang Wu,Jianxin Wang
标识
DOI:10.1109/tcbb.2019.2919581
摘要
High-throughput screening technologies have provided a large amount of drug sensitivity data for a panel of cancer cell lines and hundreds of compounds. Computational approaches to analyzing these data can benefit anticancer therapeutics by identifying molecular genomic determinants of drug sensitivity and developing new anticancer drugs. In this study, we have developed a deep learning architecture to improve the performance of drug sensitivity prediction based on these data. We integrated both genomic features of cell lines and chemical information of compounds to predict the half maximal inhibitory concentrations (IC 50 )(IC50) on the Cancer Cell Line Encyclopedia (CCLE) and the Genomics of Drug Sensitivity in Cancer (GDSC) datasets using a deep neural network, which we called DeepDSC. Specifically, we first applied a stacked deep autoencoder to extract genomic features of cell lines from gene expression data, and then combined the compounds' chemical features to these genomic features to produce final response data. We conducted 10-fold cross-validation to demonstrate the performance of our deep model in terms of root-mean-square error (RMSE) and coefficient of determination R 2 R2. We show that our model outperforms the previous approaches with RMSE of 0.23 and R 2 R2 of 0.78 on CCLE dataset, and RMSE of 0.52 and R 2 R2 of 0.78 on GDSC dataset, respectively. Moreover, to demonstrate the prediction ability of our models on novel cell lines or novel compounds, we left cell lines originating from the same tissue and each compound out as the test sets, respectively, and the rest as training sets. The performance was comparable to other methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI