生物
混乱
表达式(计算机科学)
计算生物学
核糖核酸
RNA序列
基因表达
转录组
遗传学
基因
计算机科学
心理学
精神分析
程序设计语言
作者
Abhishek Sarkar,Matthew Stephens
出处
期刊:Nature Genetics
[Nature Portfolio]
日期:2021-05-24
卷期号:53 (6): 770-777
被引量:176
标识
DOI:10.1038/s41588-021-00873-4
摘要
The high proportion of zeros in typical single-cell RNA sequencing datasets has led to widespread but inconsistent use of terminology such as dropout and missing data. Here, we argue that much of this terminology is unhelpful and confusing, and outline simple ideas to help to reduce confusion. These include: (1) observed single-cell RNA sequencing counts reflect both true gene expression levels and measurement error, and carefully distinguishing between these contributions helps to clarify thinking; and (2) method development should start with a Poisson measurement model, rather than more complex models, because it is simple and generally consistent with existing data. We outline how several existing methods can be viewed within this framework and highlight how these methods differ in their assumptions about expression variation. We also illustrate how our perspective helps to address questions of biological interest, such as whether messenger RNA expression levels are multimodal among cells.
科研通智能强力驱动
Strongly Powered by AbleSci AI