Base-Resolution Analysis of 5-Hydroxymethylcytosine in the Mammalian Genome

生物 5-羟甲基胞嘧啶 计算生物学 基因组 遗传学 基础(拓扑) 分辨率(逻辑) 进化生物学 基因 DNA甲基化 基因表达 数学分析 数学 人工智能 计算机科学
作者
Miao Yu,Gary C. Hon,Keith E. Szulwach,Chun-Xiao Song,Lei Zhang,Audrey S Kim,Xuekun Li,Qing Dai,Yin Shen,Beom-Seok Park,Jung Joon Min,Peng Jin,Bing Ren,Chuan He
出处
期刊:Cell [Elsevier]
卷期号:149 (6): 1368-1380 被引量:869
标识
DOI:10.1016/j.cell.2012.04.027
摘要

The study of 5-hydroxylmethylcytosines (5hmC) has been hampered by the lack of a method to map it at single-base resolution on a genome-wide scale. Affinity purification-based methods cannot precisely locate 5hmC nor accurately determine its relative abundance at each modified site. We here present a genome-wide approach, Tet-assisted bisulfite sequencing (TAB-Seq), that when combined with traditional bisulfite sequencing can be used for mapping 5hmC at base resolution and quantifying the relative abundance of 5hmC as well as 5mC. Application of this method to embryonic stem cells not only confirms widespread distribution of 5hmC in the mammalian genome but also reveals sequence bias and strand asymmetry at 5hmC sites. We observe high levels of 5hmC and reciprocally low levels of 5mC near but not on transcription factor-binding sites. Additionally, the relative abundance of 5hmC varies significantly among distinct functional sequence elements, suggesting different mechanisms for 5hmC deposition and maintenance. The study of 5-hydroxylmethylcytosines (5hmC) has been hampered by the lack of a method to map it at single-base resolution on a genome-wide scale. Affinity purification-based methods cannot precisely locate 5hmC nor accurately determine its relative abundance at each modified site. We here present a genome-wide approach, Tet-assisted bisulfite sequencing (TAB-Seq), that when combined with traditional bisulfite sequencing can be used for mapping 5hmC at base resolution and quantifying the relative abundance of 5hmC as well as 5mC. Application of this method to embryonic stem cells not only confirms widespread distribution of 5hmC in the mammalian genome but also reveals sequence bias and strand asymmetry at 5hmC sites. We observe high levels of 5hmC and reciprocally low levels of 5mC near but not on transcription factor-binding sites. Additionally, the relative abundance of 5hmC varies significantly among distinct functional sequence elements, suggesting different mechanisms for 5hmC deposition and maintenance. Genome-wide Tet-assisted bisulfite sequencing for single-base detection of 5hmC Single-base resolution maps of 5hmC versus 5mC in human and mouse ESCs 5hmC is enriched at distal-regulatory elements and promoters with low CpG content 5hmC deposition is asymmetric and strand biased toward G-rich sequences 5-methylcytosine (5mC) in mammalian genomic DNA is essential for normal development and impacts a variety of biological functions. In 2009, 5-hydroxymethylcytosine (5hmC) was discovered as another relatively abundant form of cytosine modification in embryonic stem cells (ESCs) and Purkinje neurons (Kriaucionis and Heintz, 2009Kriaucionis S. Heintz N. The nuclear DNA base 5-hydroxymethylcytosine is present in Purkinje neurons and the brain.Science. 2009; 324: 929-930Crossref PubMed Scopus (2105) Google Scholar, Tahiliani et al., 2009Tahiliani M. Koh K.P. Shen Y. Pastor W.A. Bandukwala H. Brudno Y. Agarwal S. Iyer L.M. Liu D.R. Aravind L. Rao A. Conversion of 5-methylcytosine to 5-hydroxymethylcytosine in mammalian DNA by MLL partner TET1.Science. 2009; 324: 930-935Crossref PubMed Scopus (4229) Google Scholar). The TET proteins, which are responsible for conversion of 5mC to 5hmC, have been shown to function in ESC regulation, myelopoiesis, and zygote development (Dawlaty et al., 2011Dawlaty M.M. Ganz K. Powell B.E. Hu Y.C. Markoulaki S. Cheng A.W. Gao Q. Kim J. Choi S.W. Page D.C. Jaenisch R. Tet1 is dispensable for maintaining pluripotency and its loss is compatible with embryonic and postnatal development.Cell Stem Cell. 2011; 9: 166-175Abstract Full Text Full Text PDF PubMed Scopus (396) Google Scholar, Gu et al., 2011Gu T.P. Guo F. Yang H. Wu H.P. Xu G.F. Liu W. Xie Z.G. Shi L. He X. Jin S.G. et al.The role of Tet3 DNA dioxygenase in epigenetic reprogramming by oocytes.Nature. 2011; 477: 606-610Crossref PubMed Scopus (859) Google Scholar, Iqbal et al., 2011Iqbal K. Jin S.G. Pfeifer G.P. Szabó P.E. Reprogramming of the paternal genome upon fertilization involves genome-wide oxidation of 5-methylcytosine.Proc. Natl. Acad. Sci. USA. 2011; 108: 3642-3647Crossref PubMed Scopus (550) Google Scholar, Ito et al., 2010Ito S. D'Alessio A.C. Taranova O.V. Hong K. Sowers L.C. Zhang Y. Role of Tet proteins in 5mC to 5hmC conversion, ES-cell self-renewal and inner cell mass specification.Nature. 2010; 466: 1129-1133Crossref PubMed Scopus (1925) Google Scholar, Ko et al., 2010Ko M. Huang Y. Jankowska A.M. Pape U.J. Tahiliani M. Bandukwala H.S. An J. Lamperti E.D. Koh K.P. Ganetzky R. et al.Impaired hydroxylation of 5-methylcytosine in myeloid cancers with mutant TET2.Nature. 2010; 468: 839-843Crossref PubMed Scopus (1023) Google Scholar, Koh et al., 2011Koh K.P. Yabuuchi A. Rao S. Huang Y. Cunniff K. Nardone J. Laiho A. Tahiliani M. Sommer C.A. Mostoslavsky G. et al.Tet1 and Tet2 regulate 5-hydroxymethylcytosine production and cell lineage specification in mouse embryonic stem cells.Cell Stem Cell. 2011; 8: 200-213Abstract Full Text Full Text PDF PubMed Scopus (608) Google Scholar, Wossidlo et al., 2011Wossidlo M. Nakamura T. Lepikhov K. Marques C.J. Zakhartchenko V. Boiani M. Arand J. Nakano T. Reik W. Walter J. 5-Hydroxymethylcytosine in the mammalian zygote is linked with epigenetic reprogramming.Nat Commun. 2011; 2: 241Crossref PubMed Scopus (609) Google Scholar). 5hmC was found to be widespread in many tissues and cell types, although with diverse levels of abundance (Globisch et al., 2010Globisch D. Münzel M. Müller M. Michalakis S. Wagner M. Koch S. Brückl T. Biel M. Carell T. Tissue distribution of 5-hydroxymethylcytosine and search for active demethylation intermediates.PLoS ONE. 2010; 5: e15367Crossref PubMed Scopus (655) Google Scholar, Münzel et al., 2010Münzel M. Globisch D. Brückl T. Wagner M. Welzmiller V. Michalakis S. Müller M. Biel M. Carell T. Quantification of the sixth DNA base hydroxymethylcytosine in the brain.Angew. Chem. Int. Ed. Engl. 2010; 49: 5375-5377Crossref PubMed Scopus (318) Google Scholar, Song et al., 2011Song C.X. Szulwach K.E. Fu Y. Dai Q. Yi C. Li X. Li Y. Chen C.H. Zhang W. Jian X. et al.Selective chemical labeling reveals the genome-wide distribution of 5-hydroxymethylcytosine.Nat. Biotechnol. 2011; 29: 68-72Crossref PubMed Scopus (806) Google Scholar, Szwagierczak et al., 2010Szwagierczak A. Bultmann S. Schmidt C.S. Spada F. Leonhardt H. Sensitive enzymatic quantification of 5-hydroxymethylcytosine in genomic DNA.Nucleic Acids Res. 2010; 38: e181Crossref PubMed Scopus (353) Google Scholar). Proteins that can recognize 5hmC-containing DNA have also been investigated (Frauer et al., 2011Frauer C. Hoffmann T. Bultmann S. Casa V. Cardoso M.C. Antes I. Leonhardt H. Recognition of 5-hydroxymethylcytosine by the Uhrf1 SRA domain.PLoS ONE. 2011; 6: e21306Crossref PubMed Scopus (151) Google Scholar, Yildirim et al., 2011Yildirim O. Li R. Hung J.H. Chen P.B. Dong X. Ee L.S. Weng Z. Rando O.J. Fazzio T.G. Mbd3/NURD complex regulates expression of 5-hydroxymethylcytosine marked genes in embryonic stem cells.Cell. 2011; 147: 1498-1510Abstract Full Text Full Text PDF PubMed Scopus (353) Google Scholar). In addition, 5hmC can be further oxidized to 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC) by TET proteins (He et al., 2011He Y.F. Li B.Z. Li Z. Liu P. Wang Y. Tang Q. Ding J. Jia Y. Chen Z. Li L. et al.Tet-mediated formation of 5-carboxylcytosine and its excision by TDG in mammalian DNA.Science. 2011; 333: 1303-1307Crossref PubMed Scopus (2014) Google Scholar, Ito et al., 2011Ito S. Shen L. Dai Q. Wu S.C. Collins L.B. Swenberg J.A. He C. Zhang Y. Tet proteins can convert 5-methylcytosine to 5-formylcytosine and 5-carboxylcytosine.Science. 2011; 333: 1300-1303Crossref PubMed Scopus (2463) Google Scholar, Pfaffeneder et al., 2011Pfaffeneder T. Hackner B. Truss M. Münzel M. Müller M. Deiml C.A. Hagemeier C. Carell T. The discovery of 5-formylcytosine in embryonic stem cell DNA.Angew. Chem. Int. Ed. Engl. 2011; 50: 7008-7012Crossref PubMed Scopus (391) Google Scholar), and demethylation pathways through these modified cytosines have been shown (Cortellino et al., 2011Cortellino S. Xu J. Sannai M. Moore R. Caretti E. Cigliano A. Le Coz M. Devarajan K. Wessels A. Soprano D. et al.Thymine DNA glycosylase is essential for active DNA demethylation by linked deamination-base excision repair.Cell. 2011; 146: 67-79Abstract Full Text Full Text PDF PubMed Scopus (610) Google Scholar, Guo et al., 2011Guo J.U. Su Y. Zhong C. Ming G.L. Song H. Hydroxylation of 5-methylcytosine by TET1 promotes active DNA demethylation in the adult brain.Cell. 2011; 145: 423-434Abstract Full Text Full Text PDF PubMed Scopus (1069) Google Scholar, He et al., 2011He Y.F. Li B.Z. Li Z. Liu P. Wang Y. Tang Q. Ding J. Jia Y. Chen Z. Li L. et al.Tet-mediated formation of 5-carboxylcytosine and its excision by TDG in mammalian DNA.Science. 2011; 333: 1303-1307Crossref PubMed Scopus (2014) Google Scholar, Maiti and Drohat, 2011Maiti A. Drohat A.C. Thymine DNA glycosylase can rapidly excise 5-formylcytosine and 5-carboxylcytosine: potential implications for active demethylation of CpG sites.J. Biol. Chem. 2011; 286: 35334-35338Crossref PubMed Scopus (638) Google Scholar, Zhang et al., 2012Zhang L. Lu X. Lu J. Liang H. Dai Q. Xu G.L. Luo C. Jiang H. He C. Thymine DNA glycosylase specifically recognizes 5-carboxylcytosine-modified DNA.Nat. Chem. Biol. 2012; 8: 328-330Crossref PubMed Scopus (247) Google Scholar). Together, these studies provide an emerging paradigm in which 5mC oxidation plays important roles in sculpting a cell's epigenetic landscape and developmental potential through the regulation of dynamic DNA methylation states. Strategies to label and/or enrich 5hmC in genomic DNA have been developed to investigate the distribution and function of 5hmC in the genome (Ficz et al., 2011Ficz G. Branco M.R. Seisenberger S. Santos F. Krueger F. Hore T.A. Marques C.J. Andrews S. Reik W. Dynamic regulation of 5-hydroxymethylcytosine in mouse ES cells and during differentiation.Nature. 2011; 473: 398-402Crossref PubMed Scopus (901) Google Scholar, Pastor et al., 2011Pastor W.A. Pape U.J. Huang Y. Henderson H.R. Lister R. Ko M. McLoughlin E.M. Brudno Y. Mahapatra S. Kapranov P. et al.Genome-wide mapping of 5-hydroxymethylcytosine in embryonic stem cells.Nature. 2011; 473: 394-397Crossref PubMed Scopus (645) Google Scholar, Robertson et al., 2011Robertson A.B. Dahl J.A. Vågbø C.B. Tripathi P. Krokan H.E. Klungland A. A novel method for the efficient and selective identification of 5-hydroxymethylcytosine in genomic DNA.Nucleic Acids Res. 2011; 39: e55Crossref PubMed Scopus (90) Google Scholar, Robertson et al., 2012Robertson A.B. Dahl J.A. Ougland R. Klungland A. Pull-down of 5-hydroxymethylcytosine DNA using JBP1-coated magnetic beads.Nat. Protoc. 2012; 7: 340-350Crossref PubMed Scopus (50) Google Scholar, Song et al., 2011Song C.X. Szulwach K.E. Fu Y. Dai Q. Yi C. Li X. Li Y. Chen C.H. Zhang W. Jian X. et al.Selective chemical labeling reveals the genome-wide distribution of 5-hydroxymethylcytosine.Nat. Biotechnol. 2011; 29: 68-72Crossref PubMed Scopus (806) Google Scholar, Stroud et al., 2011Stroud H. Feng S. Morey Kinney S. Pradhan S. Jacobsen S.E. 5-Hydroxymethylcytosine is associated with enhancers and gene bodies in human embryonic stem cells.Genome Biol. 2011; 12: R54Crossref PubMed Scopus (346) Google Scholar, Williams et al., 2011Williams K. Christensen J. Pedersen M.T. Johansen J.V. Cloos P.A. Rappsilber J. Helin K. TET1 and hydroxymethylcytosine in transcription and DNA methylation fidelity.Nature. 2011; 473: 343-348Crossref PubMed Scopus (800) Google Scholar, Wu et al., 2011Wu H. D'Alessio A.C. Ito S. Wang Z. Cui K. Zhao K. Sun Y.E. Zhang Y. Genome-wide analysis of 5-hydroxymethylcytosine distribution reveals its dual function in transcriptional regulation in mouse embryonic stem cells.Genes Dev. 2011; 25: 679-684Crossref PubMed Scopus (452) Google Scholar, Xu et al., 2011Xu Y. Wu F. Tan L. Kong L. Xiong L. Deng J. Barbera A.J. Zheng L. Zhang H. Huang S. et al.Genome-wide regulation of 5hmC, 5mC, and gene expression by Tet1 hydroxylase in mouse embryonic stem cells.Mol. Cell. 2011; 42: 451-464Abstract Full Text Full Text PDF PubMed Scopus (494) Google Scholar). Although 5hmC is more enriched in gene bodies than transcription starting sites in mouse cerebellum (Song et al., 2011Song C.X. Szulwach K.E. Fu Y. Dai Q. Yi C. Li X. Li Y. Chen C.H. Zhang W. Jian X. et al.Selective chemical labeling reveals the genome-wide distribution of 5-hydroxymethylcytosine.Nat. Biotechnol. 2011; 29: 68-72Crossref PubMed Scopus (806) Google Scholar, Szulwach et al., 2011bSzulwach K.E. Li X. Li Y. Song C.X. Wu H. Dai Q. Irier H. Upadhyay A.K. Gearing M. Levey A.I. et al.5-hmC-mediated epigenetic dynamics during postnatal neurodevelopment and aging.Nat. Neurosci. 2011; 14: 1607-1616Crossref PubMed Scopus (615) Google Scholar), all genome-wide maps of 5hmC in human ESCs (hESCs) and mouse ESCs (mESCs) indicate that 5hmC tends to exist in gene bodies, promoters, and enhancers (Ficz et al., 2011Ficz G. Branco M.R. Seisenberger S. Santos F. Krueger F. Hore T.A. Marques C.J. Andrews S. Reik W. Dynamic regulation of 5-hydroxymethylcytosine in mouse ES cells and during differentiation.Nature. 2011; 473: 398-402Crossref PubMed Scopus (901) Google Scholar, Pastor et al., 2011Pastor W.A. Pape U.J. Huang Y. Henderson H.R. Lister R. Ko M. McLoughlin E.M. Brudno Y. Mahapatra S. Kapranov P. et al.Genome-wide mapping of 5-hydroxymethylcytosine in embryonic stem cells.Nature. 2011; 473: 394-397Crossref PubMed Scopus (645) Google Scholar, Stroud et al., 2011Stroud H. Feng S. Morey Kinney S. Pradhan S. Jacobsen S.E. 5-Hydroxymethylcytosine is associated with enhancers and gene bodies in human embryonic stem cells.Genome Biol. 2011; 12: R54Crossref PubMed Scopus (346) Google Scholar, Szulwach et al., 2011aSzulwach K.E. Li X. Li Y. Song C.X. Han J.W. Kim S. Namburi S. Hermetz K. Kim J.J. Rudd M.K. et al.Integrating 5-hydroxymethylcytosine into the epigenomic landscape of human embryonic stem cells.PLoS Genet. 2011; 7: e1002154Crossref PubMed Scopus (219) Google Scholar, Williams et al., 2011Williams K. Christensen J. Pedersen M.T. Johansen J.V. Cloos P.A. Rappsilber J. Helin K. TET1 and hydroxymethylcytosine in transcription and DNA methylation fidelity.Nature. 2011; 473: 343-348Crossref PubMed Scopus (800) Google Scholar, Wu et al., 2011Wu H. D'Alessio A.C. Ito S. Wang Z. Cui K. Zhao K. Sun Y.E. Zhang Y. Genome-wide analysis of 5-hydroxymethylcytosine distribution reveals its dual function in transcriptional regulation in mouse embryonic stem cells.Genes Dev. 2011; 25: 679-684Crossref PubMed Scopus (452) Google Scholar, Xu et al., 2011Xu Y. Wu F. Tan L. Kong L. Xiong L. Deng J. Barbera A.J. Zheng L. Zhang H. Huang S. et al.Genome-wide regulation of 5hmC, 5mC, and gene expression by Tet1 hydroxylase in mouse embryonic stem cells.Mol. Cell. 2011; 42: 451-464Abstract Full Text Full Text PDF PubMed Scopus (494) Google Scholar). However, in all cases, the resolution of these maps was restricted by the size of the immunoprecipitated or chemically captured DNA, which varied from several hundred to over a thousand bases. The study of 5mC has been facilitated by the development of whole-genome bisulfite sequencing methods that can resolve the genomic location of methylcytosine at single-base resolution (Cokus et al., 2008Cokus S.J. Feng S. Zhang X. Chen Z. Merriman B. Haudenschild C.D. Pradhan S. Nelson S.F. Pellegrini M. Jacobsen S.E. Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning.Nature. 2008; 452: 215-219Crossref PubMed Scopus (1717) Google Scholar, Lister et al., 2008Lister R. O'Malley R.C. Tonti-Filippini J. Gregory B.D. Berry C.C. Millar A.H. Ecker J.R. Highly integrated single-base resolution maps of the epigenome in Arabidopsis.Cell. 2008; 133: 523-536Abstract Full Text Full Text PDF PubMed Scopus (1856) Google Scholar, Lister et al., 2009Lister, R., Pelizzola, M., Dowen, R.H., Hawkins, R.D., Hon, G., Tonti-Filippini, J., Nery, J.R., Lee, L., Ye, Z., Ngo, Q.M., et al. (2009). Human DNA methylomes at base resolution show widespread epigenomic differences. Nature 462, 315–322.Google Scholar). However, current bisulfite sequencing methods cannot distinguish between 5mC and 5hmC (Huang et al., 2010Huang Y. Pastor W.A. Shen Y. Tahiliani M. Liu D.R. Rao A. The behaviour of 5-hydroxymethylcytosine in bisulfite sequencing.PLoS ONE. 2010; 5: e8888Crossref PubMed Scopus (567) Google Scholar, Jin et al., 2010Jin S.G. Kadam S. Pfeifer G.P. Examination of the specificity of DNA methylation profiling techniques towards 5-methylcytosine and 5-hydroxymethylcytosine.Nucleic Acids Res. 2010; 38: e125Crossref PubMed Scopus (352) Google Scholar). Therefore, the genome-wide bisulfite sequencing maps generated in recent years may not accurately capture the true abundance of 5mC at each base in the genome. A more detailed understanding of the function of 5hmC as well as 5mC has, therefore, been hampered by the lack of a single-base resolution sequencing technology capable of detecting the relative abundance of 5hmC per cytosine. Here we present a Tet-assisted bisulfite sequencing (TAB-Seq) strategy, which provides a method for single-base resolution detection of 5hmC amenable to both genome-wide and loci-specific sequencing. Applying this method, we have generated genome-wide, single-base resolution maps of 5hmC in ESCs. Distinct classes of functional elements exhibit variable abundance of 5hmC, with promoter-distal regulatory elements harboring the highest levels of 5hmC. High levels of 5hmC and reciprocally low levels of 5mC can be found near binding sites of transcription factors. In contrast to 5mC, 5hmC sites display strand asymmetry and sequence bias. Finally, the base-resolution maps of 5hmC provide more accurate estimates of both 5hmC and 5mC levels at each modified cytosine than previous whole-genome bisulfite sequencing approaches. Our results support a dynamic DNA methylation process at distal-regulatory elements and suggest that different mechanisms of DNA modification may be involved at distinct classes of functional sequences in the genome. Traditional bisulfite sequencing cannot discriminate 5mC from 5hmC because both resist deamination by bisulfite treatment (Huang et al., 2010Huang Y. Pastor W.A. Shen Y. Tahiliani M. Liu D.R. Rao A. The behaviour of 5-hydroxymethylcytosine in bisulfite sequencing.PLoS ONE. 2010; 5: e8888Crossref PubMed Scopus (567) Google Scholar, Jin et al., 2010Jin S.G. Kadam S. Pfeifer G.P. Examination of the specificity of DNA methylation profiling techniques towards 5-methylcytosine and 5-hydroxymethylcytosine.Nucleic Acids Res. 2010; 38: e125Crossref PubMed Scopus (352) Google Scholar). We have recently found that TET proteins not only oxidize 5mC to 5hmC but also further oxidize 5hmC to 5caC, and that 5caC exhibits behavior similar to that of unmodified cytosine after bisulfite treatment (He et al., 2011He Y.F. Li B.Z. Li Z. Liu P. Wang Y. Tang Q. Ding J. Jia Y. Chen Z. Li L. et al.Tet-mediated formation of 5-carboxylcytosine and its excision by TDG in mammalian DNA.Science. 2011; 333: 1303-1307Crossref PubMed Scopus (2014) Google Scholar, Ito et al., 2011Ito S. Shen L. Dai Q. Wu S.C. Collins L.B. Swenberg J.A. He C. Zhang Y. Tet proteins can convert 5-methylcytosine to 5-formylcytosine and 5-carboxylcytosine.Science. 2011; 333: 1300-1303Crossref PubMed Scopus (2463) Google Scholar). This deamination difference between 5caC and 5mC/5hmC under standard bisulfite conditions inspired us to explore TAB-Seq. In this approach, we use β-glucosyltransferase (βGT) to introduce a glucose onto 5hmC, generating β-glucosyl-5-hydroxymethylcytosine (5gmC) to protect 5hmC from further TET oxidation. After blocking of 5hmC, all 5mC is converted to 5caC by oxidation with an excess of recombinant Tet1 protein. Bisulfite treatment of the resulting DNA then converts all C and 5caC (derived from 5mC) to uracil or 5caU, respectively, whereas the original 5hmC bases remain protected as 5gmC. Thus, subsequent sequencing will reveal 5hmC as C, which, when combined with traditional bisulfite sequencing results, will provide an accurate assessment of abundance of this modification at each cytosine (Figure 1A ). We first confirmed that 5gmC is read as C in traditional bisulfite sequencing (data not shown). We cloned and expressed the catalytic domain of mouse Tet1 (mTet1) (Figure S1A available online), as previously reported (Ito et al., 2010Ito S. D'Alessio A.C. Taranova O.V. Hong K. Sowers L.C. Zhang Y. Role of Tet proteins in 5mC to 5hmC conversion, ES-cell self-renewal and inner cell mass specification.Nature. 2010; 466: 1129-1133Crossref PubMed Scopus (1925) Google Scholar). We tested a double-stranded DNA (dsDNA) with site-specifically incorporated 5mC or 5hmC modification (Figure 1B). Application of our method with Sanger sequencing of the PCR-amplified products showed that the original 5mC was completely converted into T after treatment, indicating efficient oxidation of 5mC to 5caC by mTet1 (Figure 1B). However, the original 5hmC was sequenced as C, confirming that the protected 5gmC is resistant to deamination under bisulfite treatment (Figure 1B). The products of each step were confirmed by MALDI-TOF/TOF using a shorter model duplex DNA (Figure 1C). Full conversion of 5mC in the context of genomic DNA was also confirmed by conventional bisulfite, PCR, and both Sanger and semiconductor sequencing (Figures S1B and S1C). Additionally, application to genomic DNA confirmed conversion of 5mC to 5caC and protection of 5hmC, and that 5fC is undetectable by immunoblot on the final reaction products (Figure 1D). Thus, coupling βGT-mediated transfer of glucose to 5hmC with mTet1-catalyzed oxidation of 5mC to 5caC enables the distinction of 5hmC from both C and 5mC after sodium bisulfite treatment.Figure S1TAB-Seq of Specific Loci and 5mC Conversion Rate Test in the Context of Genomic DNA, Related to Figure 1Show full caption(A) Purified mTet1 catalytic domain used for oxidation of genomic DNA.(B) Sanger sequencing of M.SssI treated lambda DNA spiked into a genomic DNA background at 0.5% before (− mTet1) and after (+ mTet1) subjecting the DNA to TAB-Seq.(C) Semiconductor sequencing of M.SssI-treated lambda DNA spiked into a genomic DNA background at 0.5% before (− mTet1) and after (+ mTet1) subjecting the DNA to TAB-Seq. The left y axis shows the percentage of bases read as C, and the right y axis shows the depth of sequencing at each C position in the targeted amplicon, which is plotted on the x axis. For reference, a dotted line is plotted at 98% on the left y axis.(D) Several loci in mouse cerebellum were tested by both traditional bisulfite sequencing and TAB-Seq. Genuine 5hmC is read as C in both methods (left) whereas genuine 5mC is read as C in traditional bisulfite sequencing but display as T in TAB-Seq (right).View Large Image Figure ViewerDownload Hi-res image Download (PPT) (A) Purified mTet1 catalytic domain used for oxidation of genomic DNA. (B) Sanger sequencing of M.SssI treated lambda DNA spiked into a genomic DNA background at 0.5% before (− mTet1) and after (+ mTet1) subjecting the DNA to TAB-Seq. (C) Semiconductor sequencing of M.SssI-treated lambda DNA spiked into a genomic DNA background at 0.5% before (− mTet1) and after (+ mTet1) subjecting the DNA to TAB-Seq. The left y axis shows the percentage of bases read as C, and the right y axis shows the depth of sequencing at each C position in the targeted amplicon, which is plotted on the x axis. For reference, a dotted line is plotted at 98% on the left y axis. (D) Several loci in mouse cerebellum were tested by both traditional bisulfite sequencing and TAB-Seq. Genuine 5hmC is read as C in both methods (left) whereas genuine 5mC is read as C in traditional bisulfite sequencing but display as T in TAB-Seq (right). The ability to distinguish 5hmC at base resolution offers a significant opportunity to further parse DNA methylation/hydroxymethylation states at specific genomic loci. We applied traditional bisulfite sequencing and TAB-Seq to known 5hmC-enriched loci in mouse cerebellum that were identified previously (Song et al., 2011Song C.X. Szulwach K.E. Fu Y. Dai Q. Yi C. Li X. Li Y. Chen C.H. Zhang W. Jian X. et al.Selective chemical labeling reveals the genome-wide distribution of 5-hydroxymethylcytosine.Nat. Biotechnol. 2011; 29: 68-72Crossref PubMed Scopus (806) Google Scholar, Szulwach et al., 2011bSzulwach K.E. Li X. Li Y. Song C.X. Wu H. Dai Q. Irier H. Upadhyay A.K. Gearing M. Levey A.I. et al.5-hmC-mediated epigenetic dynamics during postnatal neurodevelopment and aging.Nat. Neurosci. 2011; 14: 1607-1616Crossref PubMed Scopus (615) Google Scholar). Comparing the sequencing results, we were able to identify genuine 5hmC and 5mC sites (Figure S1D). We next applied TAB-Seq to genomic DNA from H1 hESCs and E14Tg2a mESCs and sequenced to an average depth of 26.5× and 17× per cytosine, respectively. Successful detection of 5hmC is governed by three key parameters: (1) efficient conversion of unmodified cytosine to uracil; (2) efficient conversion of 5mC to 5caU/U; and (3) efficient protection of 5hmC. To directly assess these conversion rates in the context of genomic DNA, sequenced samples were spiked in with fragments of lambda DNA amplified by PCR to contain three distinct domains having either unmodified cytosine, 5mC, or 5hmC. We observe low nonconversion rates for unmodified cytosine (0.38%) and 5mC (2.21%), contrasted to a high nonconversion rate of 5hmC (84.4%) (Figure S2B ). Further analysis indicates that this latter value is an underestimate of the true 5hmC protection rate in H1, which is close to 92.0% (Figures S2D and S2E). These data further confirm the capability of TAB-Seq for robust distinction of 5hmC from 5mC and unmodified cytosine in the context of genomic DNA. We next focused our analysis on the map of H1 hESCs. To confidently identify 5hmC-modified bases, we took advantage of the highly annotated H1 methylome generated with methylC-Seq, which identifies both 5mC as well as 5hmC. Accordingly, we restricted our search for 5hmC to the subset of methylated bases previously identified by methylC-Seq (Lister et al., 2009Lister R. Pelizzola M. Dowen R.H. Hawkins R.D. Hon G. Tonti-Filippini J. Nery J.R. Lee L. Ye Z. Ngo Q.M. et al.Human DNA methylomes at base resolution show widespread epigenomic differences.Nature. 2009; 462: 315-322Crossref PubMed Scopus (3352) Google Scholar). The probability that a cytosine can be confidently identified as 5hmC is governed by the sequencing depth at the cytosine and abundance of the modification (Figure S2C). Modeling this probabilistic event with a binomial distribution (Lister et al., 2009Lister R. Pelizzola M. Dowen R.H. Hawkins R.D. Hon G. Tonti-Filippini J. Nery J.R. Lee L. Ye Z. Ngo Q.M. et al.Human DNA methylomes at base resolution show widespread epigenomic differences.Nature. 2009; 462: 315-322Crossref PubMed Scopus (3352) Google Scholar) with N as the depth of sequencing at the cytosine and p as the 5mC nonconversion rate, we identified a total of 691,414 5hmCs with a false discovery rate of 5% (Figure S2F; see Extended Experimental Procedures). Given an average sequencing depth of 26.5, our assay can on average resolve 5hmC having an abundance of 20% or higher (Figure S2C). Genomic profiles of absolute 5hmC levels are comparable to a map previously generated with an affinity-based approach (Szulwach et al., 2011aSzulwach K.E. Li X. Li Y. Song C.X. Han J.W. Kim S. Namburi S. Hermetz K. Kim J.J. Rudd M.K. et al.Integrating 5-hydroxymethylcytosine into the epigenomic landscape of human embryonic stem cells.PLoS Genet. 2011; 7: e1002154Crossref PubMed Scopus (219) Google Scholar) (Figure 2A ). As sequenced fragments are equally distributed among the population of cells, TAB-Seq provides a steady-state glimpse of 5hmC in the entire population. This is in contrast to affinity-based approaches, which bias sequencing toward 5hmC-enriched DNA fragments. By TAB-Seq, identified 5hmCs are highly clustered, unlike 5mCs (Figure S3A ), and track well with peaks of 5hmC enrichment previously identified by affinity sequencing (Figure 2A). There are 7.6 times as many 5hmCs overlapping affinity-identified regions as expected by chance (Figure 2B, Z-score = 1,579). Furthermore, 81.5% of these 82,221 affinity-identified regions were recovered by at least one 5hmC. In contrast, only 35.6% of 5hmCs are recovered by affinity-based approaches, suggesting an increased sensitivity of TAB-Seq. Using semiconductor sequencing, we verified the presence/absence of 5hmC at 57 out of 59 individual cytosines (9 out of 11 hydroxymethylated CpGs, with depth ≥ 30) within regions that previously escaped detection by 5hmC affinity capture (Figure S2A), underscoring the sensitivity and specificity of our approach.Figure S3Related to Figure 3Show full caption(A) The distribution of pairwise distances between all 5hmCs identified in H1 (red), compared to the same number of randomly selected 5mCs (black).(B) The distribution of base-level phastCons conservation scores (Siepel et al., 2005Siepel, A., Bejerano, G., Pedersen, J.S., Hinrichs, A.S., Hou, M., Rosenbloom, K., Clawson, H., Spieth, J., Hillier, L.W., Richards, S., et al. (2005). Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 15, 1034–1050.Google Scholar) for several tiers of 5hmC abundance.(C) Total methylation level measured by methylC-Seq (left) and the 5hmC abundance measured by TAB-Seq (right) for DNase I hypersensitive elements ranked by sig
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
完美的幻悲完成签到 ,获得积分10
1秒前
机灵谷蓝发布了新的文献求助10
2秒前
2秒前
3秒前
xiaofei666完成签到,获得积分10
4秒前
rocky15应助直率的芷采纳,获得10
5秒前
悲回风发布了新的文献求助10
7秒前
Ya完成签到 ,获得积分10
8秒前
疯子完成签到,获得积分10
8秒前
活在当下完成签到,获得积分10
9秒前
夜如雨完成签到,获得积分10
10秒前
xiaofei666发布了新的文献求助100
10秒前
Dexterzzzzz发布了新的文献求助10
14秒前
Ava应助悲回风采纳,获得10
14秒前
15秒前
16秒前
魔幻无极完成签到,获得积分10
17秒前
qqz发布了新的文献求助10
20秒前
4652376完成签到,获得积分10
27秒前
小白关注了科研通微信公众号
27秒前
29秒前
35秒前
niaoniao完成签到,获得积分10
39秒前
SciGPT应助俭朴的猫咪采纳,获得10
40秒前
41秒前
克林完成签到,获得积分10
42秒前
小白发布了新的文献求助10
45秒前
47秒前
48秒前
50秒前
Qi齐发布了新的文献求助10
51秒前
52秒前
judy123完成签到,获得积分20
54秒前
lewe完成签到,获得积分10
55秒前
55秒前
rocky15应助大鱼采纳,获得10
55秒前
1分钟前
pante完成签到,获得积分10
1分钟前
Leisure_Lee完成签到,获得积分10
1分钟前
July发布了新的文献求助10
1分钟前
高分求助中
Sustainable Land Management: Strategies to Cope with the Marginalisation of Agriculture 1000
Corrosion and Oxygen Control 600
Python Programming for Linguistics and Digital Humanities: Applications for Text-Focused Fields 500
Heterocyclic Stilbene and Bibenzyl Derivatives in Liverworts: Distribution, Structures, Total Synthesis and Biological Activity 500
重庆市新能源汽车产业大数据招商指南(两链两图两池两库两平台两清单两报告) 400
Division and square root. Digit-recurrence algorithms and implementations 400
行動データの計算論モデリング 強化学習モデルを例として 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2547280
求助须知:如何正确求助?哪些是违规求助? 2176211
关于积分的说明 5602907
捐赠科研通 1896983
什么是DOI,文献DOI怎么找? 946495
版权声明 565383
科研通“疑难数据库(出版商)”最低求助积分说明 503744