染色质
转座酶
计算生物学
计算机科学
标识符
脚印
管道(软件)
RNA序列
转录因子
生物
数据挖掘
遗传学
基因组
转录组
基因
转座因子
基因表达
程序设计语言
作者
Tao Zhu,Keyan Liao,Rongfang Zhou,Chunjiao Xia,Weibo Xie
标识
DOI:10.1038/s42003-020-01403-4
摘要
Abstract ATAC-seq (Assay for Transposase-Accessible Chromatin with high-throughput sequencing) provides an efficient way to analyze nucleosome-free regions and has been applied widely to identify transcription factor footprints. Both applications rely on the accurate quantification of insertion events of the hyperactive transposase Tn5. However, due to the presence of the PCR amplification, it is impossible to accurately distinguish independently generated identical Tn5 insertion events from PCR duplicates using the standard ATAC-seq technique. Removing PCR duplicates based on mapping coordinates introduces increasing bias towards highly accessible chromatin regions. To overcome this limitation, we establish a UMI-ATAC-seq technique by incorporating unique molecular identifiers (UMIs) into standard ATAC-seq procedures. UMI-ATAC-seq can rescue about 20% of reads that are mistaken as PCR duplicates in standard ATAC-seq in our study. We demonstrate that UMI-ATAC-seq could more accurately quantify chromatin accessibility and significantly improve the sensitivity of identifying transcription factor footprints. An analytic pipeline is developed to facilitate the application of UMI-ATAC-seq, and it is available at https://github.com/tzhu-bio/UMI-ATAC-seq .
科研通智能强力驱动
Strongly Powered by AbleSci AI