不可用
计算机科学
插补(统计学)
稳健性(进化)
可扩展性
贝叶斯概率
数据挖掘
聚类分析
缺少数据
辍学(神经网络)
人工智能
机器学习
生物
基因
数学
统计
数据库
生物化学
作者
Lihua Zhang,Shihua Zhang
标识
DOI:10.1109/tcbb.2018.2848633
摘要
Single-cell RNA-sequencing (scRNA-seq) is a recent breakthrough technology, which paves the way for measuring RNA levels at single cell resolution to study precise biological functions. One of the main challenges when analyzing scRNA-seq data is the presence of zeros or dropout events, which may mislead downstream analyses. To compensate the dropout effect, several methods have been developed to impute gene expression since the first Bayesian-based method being proposed in 2016. However, these methods have shown very diverse characteristics in terms of model hypothesis and imputation performance. Thus, large-scale comparison and evaluation of these methods is urgently needed now. To this end, we compared eight imputation methods, evaluated their power in recovering original real data, and performed broad analyses to explore their effects on clustering cell types, detecting differentially expressed genes, and reconstructing lineage trajectories in the context of both simulated and real data. Simulated datasets and case studies highlight that there are no one method performs the best in all the situations. Some defects of these methods such as scalability, robustness, and unavailability in some situations need to be addressed in future studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI