选择性拼接
计算生物学
RNA剪接
转录组
生物
基因表达谱
基因
cDNA文库
深度测序
标识符
计算机科学
基因表达
遗传学
信使核糖核酸
基因组
核糖核酸
程序设计语言
作者
Yuguang Xiong,Magali Soumillon,Jie Wu,Jens Hansen,Bin Hu,J. G. Coen van Hasselt,Gomathi Jayaraman,Ryan G. Lim,Mehdi Bouhaddou,Loren Ornelas,Jim Bochicchio,Lindsay Lenaeus,Jennifer Stocksdale,Jaehee V. Shim,Emilda Gomez,Dhruv Sareen,Clive N. Svendsen,Leslie M. Thompson,Milind Mahajan,Ravi Iyengar
标识
DOI:10.1038/s41598-017-14892-x
摘要
Creating a cDNA library for deep mRNA sequencing (mRNAseq) is generally done by random priming, creating multiple sequencing fragments along each transcript. A 3'-end-focused library approach cannot detect differential splicing, but has potentially higher throughput at a lower cost, along with the ability to improve quantification by using transcript molecule counting with unique molecular identifiers (UMI) that correct PCR bias. Here, we compare an implementation of such a 3'-digital gene expression (3'-DGE) approach with "conventional" random primed mRNAseq. Given our particular datasets on cultured human cardiomyocyte cell lines, we find that, while conventional mRNAseq detects ~15% more genes and needs ~500,000 fewer reads per sample for equivalent statistical power, the resulting differentially expressed genes, biological conclusions, and gene signatures are highly concordant between two techniques. We also find good quantitative agreement at the level of individual genes between two techniques for both read counts and fold changes between given conditions. We conclude that, for high-throughput applications, the potential cost savings associated with 3'-DGE approach are likely a reasonable tradeoff for modest reduction in sensitivity and inability to observe alternative splicing, and should enable many larger scale studies focusing on not only differential expression analysis, but also quantitative transcriptome profiling.
科研通智能强力驱动
Strongly Powered by AbleSci AI