聚腺苷酸
计算生物学
基因亚型
生物
数据库
基因
选择(遗传算法)
管道(软件)
基因表达
基因表达谱
遗传学
仿形(计算机编程)
转录组
单细胞分析
R包
高分辨率
序列分析
基因组学
基因表达调控
作者
Peihong Zhang,Hua Feng,Xu-Kai Ma,Nan Fang,Yang Li
标识
DOI:10.1093/gpbjnl/qzaf089
摘要
Polyadenylation site (PAS) selection plays important roles in gene expression regulation and function. RNA-seq data derived from 3' tag sequencing contain intrinsic information about PAS usage and have been analyzed for alternative polyadenylation (APA) isoform expression in both bulk and single cell samples. Here, we upgraded our previously developed deep learning-based PAS analysis pipeline SCAPTURE v2 to profile PASs from 1330 published 3' tag-based scRNA-seq datasets across seven species, resulting in a comprehensive PAS landscape across species. Validation with long-read sequencing data from matched human tissues showed high accuracy of single-cell PAS profiling by SCAPTURE, including previously unannotated ones. Further comparisons revealed distinct PAS usage preferences in different species, such as human versus mouse, independent of conservation of gene expression. Finally, we present PASSpedia, a comprehensive database for PAS analysis and comparison across seven species at single cell resolution, which is freely accessible online at https://bits.fudan.edu.cn/PASSpedia/.
科研通智能强力驱动
Strongly Powered by AbleSci AI