Computer science
Bisulfite sequencing
Biology
DNA methylation
Methylation
CpG site
Genetics
Computational biology
Gene
DNA
Database
Gene expression
Authors
Zhiqiang Zhang,Yuhao Dan,Yaochen Xu,Jiarui Zhang,Xiaoqi Zheng,Jiantao Shi
Source
Journal: Bioinformatics
[Oxford University Press]
Date: 2021-06-19
Volume (issue): 37 (24): 4892-4894
Citations: 5
Identifier
DOI:10.1093/bioinformatics/btab458
Abstract
Bisulfite sequencing (BS-seq) is currently the gold standard for measuring genome-wide DNA methylation profiles at single-nucleotide resolution. Most analyses focus on mean CpG methylation and ignore methylation states on the same DNA fragments [DNA methylation haplotypes (mHaps)]. Here, we propose mHap, a simple DNA mHap format for storing DNA BS-seq data. This format reduces the size of a BAM file by 40- to 140-fold while retaining complete read-level CpG methylation information. It is also compatible with the Tabix tool for fast and random access. We implemented a command-line tool, mHapTools, for converting BAM/SAM files from existing platforms to mHap files as well as post-processing DNA methylation data in mHap format. With this tool, we processed all publicly available human reduced representation bisulfite sequencing data and provided these data as a comprehensive mHap database.
https://jiantaoshi.github.io/mHap/index.html
Supplementary data are available at Bioinformatics online.
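To illustrate why storing methylation haplotypes rather than full alignments can shrink the data while keeping read-level CpG states, the following is a minimal Python sketch. It is not the mHapTools implementation: the `ReadCpG` type, the `collapse_reads` function, and the six-field record layout (chromosome, first CpG, last CpG, haplotype string, read count, strand) are assumptions made for illustration only, and the actual mHap specification should be consulted at the project URL above.

```python
from collections import Counter
from typing import Iterable, List, NamedTuple, Tuple


class ReadCpG(NamedTuple):
    """Hypothetical per-read summary: CpG states already extracted from an alignment."""
    chrom: str
    positions: Tuple[int, ...]  # CpG coordinates covered by the read, ascending
    states: str                 # one character per CpG: "1" methylated, "0" unmethylated
    strand: str                 # "+" or "-"


def collapse_reads(reads: Iterable[ReadCpG]) -> List[tuple]:
    """Merge reads that share span, CpG pattern, and strand into counted records.

    Assumed record layout (illustrative, not the official format):
    (chrom, first_cpg, last_cpg, haplotype, count, strand)
    """
    counts: Counter = Counter()
    for read in reads:
        if not read.positions or len(read.positions) != len(read.states):
            raise ValueError("each read needs one state per covered CpG")
        key = (read.chrom, read.positions[0], read.positions[-1],
               read.states, read.strand)
        counts[key] += 1
    # Sort by chromosome and coordinates so the output could be indexed by position.
    return [key[:4] + (n, key[4]) for key, n in sorted(counts.items())]


if __name__ == "__main__":
    demo = [
        ReadCpG("chr1", (100, 120, 150), "110", "+"),
        ReadCpG("chr1", (100, 120, 150), "110", "+"),
        ReadCpG("chr1", (100, 120, 150), "010", "+"),
    ]
    for record in collapse_reads(demo):
        print("\t".join(str(field) for field in record))
```

In this sketch, sequence bases, quality scores, and alignment tags are discarded and identical haplotypes are stored once with a count, which is one plausible source of the size reduction the abstract reports. Keeping the records sorted by chromosome and position is what would make a tab-delimited file of this kind amenable to position-based indexing.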