转录组
选择性拼接
计算生物学
蛋白质组
基因亚型
RNA序列
纳米孔测序
RNA剪接
核糖核酸
生物
基因组
遗传学
基因
基因表达
作者
Meng Wang,Yumei Li,Jun Wang,Soo Hwan Oh,Rui Chen
标识
DOI:10.1101/2024.02.20.581234
摘要
Abstract The vast majority of protein-coding genes in the human genome produce multiple mRNA isoforms through alternative splicing, significantly enhancing the complexity of the transcriptome and proteome. To establish an efficient method for characterizing transcript isoforms within tissue samples, we conducted a systematic comparison between single-cell long-read and conventional short-read RNA sequencing techniques. The transcriptome of approximately 30,000 mouse retina cells was profiled using 1.54 billion Illumina short reads and 1.40 billion Oxford Nanopore long reads. Consequently, we identified 44,325 transcript isoforms, with a notable 38% previously uncharacterized and 17% expressed exclusively in distinct cellular subclasses. We observed that long-read sequencing not only matched the gene expression and cell-type annotation performance of short-read sequencing but also excelled in the precise identification of transcript isoforms. While transcript isoforms are often shared across various cell types, their relative abundance shows considerable cell-type-specific variation. The data generated from our study significantly enhance the existing repertoire of transcript isoforms, thereby establishing a foundational resource for future research into the mechanisms and implications of alternative splicing within retinal biology and its links to related diseases.
科研通智能强力驱动
Strongly Powered by AbleSci AI