生物
转录组
仿形(计算机编程)
计算生物学
视网膜
RNA序列
遗传学
基因表达谱
核糖核酸
基因
基因表达
神经科学
计算机科学
操作系统
作者
Meng Wang,Yumei Li,Jun Wang,Soo Hwan Oh,Ye‐Xuan Cao,Rui Chen
标识
DOI:10.1101/gr.279167.124
摘要
The vast majority of protein-coding genes in the human genome produce multiple mRNA isoforms through alternative splicing, significantly enhancing the complexity of the transcriptome and proteome. To establish an efficient method for characterizing transcript isoforms within tissue samples, we conducted a systematic comparison between single-cell long-read and conventional short-read RNA sequencing techniques. The transcriptome of approximately 30,000 mouse retina cells was profiled using 1.54 billion Illumina short reads and 1.40 billion Oxford Nanopore Technologies long reads. Consequently, we identify 44,325 transcript isoforms, with a notable 38% previously uncharacterized and 17% expressed exclusively in distinct cellular subclasses. We observe that long-read sequencing not only matches the gene expression and cell-type annotation performance of short-read sequencing but also excel in the precise identification of transcript isoforms. While transcript isoforms are often shared across various cell types, their relative abundance shows considerable cell type-specific variation. The data generated from our study significantly enhance the existing repertoire of transcript isoforms, thereby establishing a resource for future research into the mechanisms and implications of alternative splicing within retinal biology and its links to related diseases.
科研通智能强力驱动
Strongly Powered by AbleSci AI