计算机科学
计算生物学
系统生物学
生物网络
表达数量性状基因座
基因表达
基因
贝叶斯网络
多元统计
数据集成
作者
Thalia E. Chan,Michael P. H. Stumpf,Ann C. Babtie
出处
期刊:Cell systems
[Elsevier BV]
日期:2017-09-27
卷期号:5 (3): 251-
被引量:246
标识
DOI:10.1016/j.cels.2017.08.014
摘要
While single-cell gene expression experiments present new challenges for data processing, the cell-to-cell variability observed also reveals statistical relationships that can be used by information theory. Here, we use multivariate information theory to explore the statistical dependencies between triplets of genes in single-cell gene expression datasets. We develop PIDC, a fast, efficient algorithm that uses partial information decomposition (PID) to identify regulatory relationships between genes. We thoroughly evaluate the performance of our algorithm and demonstrate that the higher-order information captured by PIDC allows it to outperform pairwise mutual information-based algorithms when recovering true relationships present in simulated data. We also infer gene regulatory networks from three experimental single-cell datasets and illustrate how network context, choices made during analysis, and sources of variability affect network inference. PIDC tutorials and open-source software for estimating PID are available. PIDC should facilitate the identification of putative functional relationships and mechanistic hypotheses from single-cell transcriptomic data.
科研通智能强力驱动
Strongly Powered by AbleSci AI