+ All Categories
Home > Documents > New roles for DNA cytosine modification, eRNA, anchors, and … · New roles for DNA cytosine...

New roles for DNA cytosine modification, eRNA, anchors, and … · New roles for DNA cytosine...

Date post: 15-Jul-2020
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
6
New roles for DNA cytosine modification, eRNA, anchors, and superanchors in developing B cell progenitors Christopher Benner a,1,2 , Takeshi Isoda b,1 , and Cornelis Murre b,2 a The Integrative Genomics and Bioinformatics Core, The Salk Institute for Biological Studies, La Jolla, CA 92037; and b Department of Molecular Biology, University of California at San Diego, La Jolla, CA 92093 Edited by Frederick W. Alt, Program in Cellular and Molecular Medicine, Boston Childrens Hospital, Harvard Medical School, Howard Hughes Medical Institute, Boston, MA, and approved September 1, 2015 (received for review July 1, 2015) B-cell fate is orchestrated by a series of well-characterized de- velopmental regulators. Here, we found that the onset of B-cell development was accompanied by large-scale changes in DNA cy- tosine modifications associated with promoters, enhancers, and anchors. These changes were tightly linked to alterations in trans- cription factor occupancy and nascent RNA (eRNA) transcription. We found that the prepro-B to the proB-cell transition was associated with a global exchange of DNA cytosine modifications for poly- comb-mediated repression at CpG islands. Hypomethylated regions were found exclusively in the active/permissive compartment of the nucleus and were predominantly associated with regulatory elements or anchors that orchestrate the folding patterns of the genome. We identified superanchors, characterized by clusters of hypomethylated CCCTC-binding factor (CTCF)-bound elements, which were predominantly located at boundaries that define topological associated domains. A particularly prominent hypomethylated superanchor was positioned down-stream of the Ig heavy chain (Igh) locus. Analysis of global formaldehydecross-linking studies indicated that the Igh locus superanchor interacts with the V H region repertoire across vast genomic distances. We propose that the Igh locus superanchor sequesters the V H and D H J H regions into a spatial confined geometric environment to promote rapid first-passage times. Collectively, these studies demonstrate how, in developing B cells, DNA cytosine modifications associated with regulatory and architectural elements affect patterns of gene expression, folding patterns of the genome, and antigen recep- tor assembly. DNA modification | superanchor | nuclear architecture | immunoglobulin heavy chain locus | superinsulator T he onset of B-cell development is initiated in the fetal liver or adult bone marrow at the common lymphoid progenitor cell (CLP) cell stage (1, 2). Specification to the B-cell lineage is es- tablished by a spectrum of transcriptional regulators that act collaboratively to prime and ultimately activate a B-lineagespecific program of gene expression (3, 4). Conspicuous among the activators that establish B-cell identity are the E2A, EBF1, and FOXO1 proteins (58). In CLPs, the E2A proteins induce the expression of FOXO1, which, in turn, activates EBF1 tran- scription (6). EBF1 and FOXO1 subsequently act in a regulatory feedback loop to induce an active enhancer repertoire and ac- tivate a B-lineage specific program of gene expression (9, 10). It is now well established that the chromatin fiber is marked by DNA methylation (1113). Methylation of cytosines primarily occurs at CpG residues, although methylation in CHG and CHH nucleotides has also been observed (14). The vast majority of CpG nucleotides are methylated. Unmethylated CpG residues and CpG islands are closely associated with transcriptionally active promoters. Other demethylated regions, found distal to gene promoters, tend to be cell-type specific, with changes in DNA methylation state often correlating with alterations in patterns of gene expression (15). Unlike many other epigenetic modifications, mCpG has a well- understood maintenance mechanism that copies methylation states during the cell cycle mediated by DNMT1 (16). Recent studies have shown that patterns of DNA methylation are dynamic, with specific enzymes involved in de novo DNA methylation (DNMT3) and demethylation (TET1-3) (17). TET proteins ox- idize 5mC to 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC), and 5-carboxylcytosine (5caC) (1820). Here, we describe how DNA cytosine modifications associ- ated with regulatory and architectural elements affects patterns of gene expression, folding patterns of the genome, and antigen receptor assembly. Results DNA Methylation Landscape in B Cells. To determine whether al- terations in DNA cytosine modifications are associated with the onset of B-cell development, pro-B cells were isolated from RAG1-deficient mice and cultured in the presence of IL-7 and SCF. Purified DNA was treated with sodium bisulfite by using standard procedures. Bisulfite sequencing does not distinguish 5mC from 5hmC and C from 5fC and 5caC (20, 21). Hence, we will refer to unconverted cytosines as mC (includes 5hmC/5mC), whereas converted cytosines will be referred to as C (includes 5fC/ 5caC/C). Consistent with previous observations, most CpGs across the proB-cell genome were heavily methylated (81.8% on aver- age). Profiles of mCpG% at genes revealed hypomethylated Significance B cells are destined to produce a wide spectrum of antibodies or immunoglobulins in response to the invading pathogens. Here, we found that the onset of B-cell development was accompanied by large-scale changes in DNA cytosine modifi- cations associated with DNA elements that control gene ex- pression and the folding patterns of genomes. We identify novel DNA elements that function as superanchors and/or superinsulators. A prominent super-anchor is located in the immunoglobulin heavy chain locus where it acts to facilitate the interactions among variable, diversity, and joining DNA segments. We propose that changes in DNA cytosine modifi- cations at regulatory DNA elements orchestrate a B cell-spe- cific transcription signature and genome structure to enable the production of a diverse antibody repertoire. Author contributions: C.B., T.I., and C.M. designed research; C.B. and T.I. performed re- search; C.B. analyzed data; and C.B. and C.M. wrote the paper. The authors declare no conflict of interest. This article is a PNAS Direct Submission. Data deposition: The sequence reported in this paper has been deposited in the GenBank database (accession no. GSE72670). 1 C.B. and T.I. contributed equally to this work. 2 To whom correspondence may be addressed. Email: [email protected] or [email protected]. This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10. 1073/pnas.1512995112/-/DCSupplemental. 1277612781 | PNAS | October 13, 2015 | vol. 112 | no. 41 www.pnas.org/cgi/doi/10.1073/pnas.1512995112 Downloaded by guest on August 30, 2020
Transcript
Page 1: New roles for DNA cytosine modification, eRNA, anchors, and … · New roles for DNA cytosine modification, eRNA, anchors, and superanchors in developing B cell progenitors Christopher

New roles for DNA cytosine modification, eRNA,anchors, and superanchors in developing Bcell progenitorsChristopher Bennera,1,2, Takeshi Isodab,1, and Cornelis Murreb,2

aThe Integrative Genomics and Bioinformatics Core, The Salk Institute for Biological Studies, La Jolla, CA 92037; and bDepartment of Molecular Biology,University of California at San Diego, La Jolla, CA 92093

Edited by Frederick W. Alt, Program in Cellular and Molecular Medicine, Boston Children’s Hospital, Harvard Medical School, Howard Hughes MedicalInstitute, Boston, MA, and approved September 1, 2015 (received for review July 1, 2015)

B-cell fate is orchestrated by a series of well-characterized de-velopmental regulators. Here, we found that the onset of B-celldevelopment was accompanied by large-scale changes in DNA cy-tosine modifications associated with promoters, enhancers, andanchors. These changes were tightly linked to alterations in trans-cription factor occupancy and nascent RNA (eRNA) transcription. Wefound that the prepro-B to the pro–B-cell transition was associatedwith a global exchange of DNA cytosine modifications for poly-comb-mediated repression at CpG islands. Hypomethylated regionswere found exclusively in the active/permissive compartment ofthe nucleus and were predominantly associated with regulatoryelements or anchors that orchestrate the folding patterns of thegenome. We identified superanchors, characterized by clusters ofhypomethylated CCCTC-binding factor (CTCF)-bound elements, whichwere predominantly located at boundaries that define topologicalassociated domains. A particularly prominent hypomethylatedsuperanchor was positioned down-stream of the Ig heavy chain(Igh) locus. Analysis of global formaldehyde–cross-linking studiesindicated that the Igh locus superanchor interacts with the VH

region repertoire across vast genomic distances. We proposethat the Igh locus superanchor sequesters the VH and DHJH regionsinto a spatial confined geometric environment to promote rapidfirst-passage times. Collectively, these studies demonstrate how,in developing B cells, DNA cytosine modifications associated withregulatory and architectural elements affect patterns of geneexpression, folding patterns of the genome, and antigen recep-tor assembly.

DNA modification | superanchor | nuclear architecture | immunoglobulinheavy chain locus | superinsulator

The onset of B-cell development is initiated in the fetal liver oradult bone marrow at the common lymphoid progenitor cell

(CLP) cell stage (1, 2). Specification to the B-cell lineage is es-tablished by a spectrum of transcriptional regulators that actcollaboratively to prime and ultimately activate a B-lineage–specific program of gene expression (3, 4). Conspicuous amongthe activators that establish B-cell identity are the E2A, EBF1,and FOXO1 proteins (5–8). In CLPs, the E2A proteins inducethe expression of FOXO1, which, in turn, activates EBF1 tran-scription (6). EBF1 and FOXO1 subsequently act in a regulatoryfeedback loop to induce an active enhancer repertoire and ac-tivate a B-lineage specific program of gene expression (9, 10).It is now well established that the chromatin fiber is marked

by DNA methylation (11–13). Methylation of cytosines primarilyoccurs at CpG residues, although methylation in CHG and CHHnucleotides has also been observed (14). The vast majority of CpGnucleotides are methylated. Unmethylated CpG residues and CpGislands are closely associated with transcriptionally active promoters.Other demethylated regions, found distal to gene promoters, tend tobe cell-type specific, with changes in DNA methylation state oftencorrelating with alterations in patterns of gene expression (15).Unlike many other epigenetic modifications, mCpG has a well-

understood maintenance mechanism that copies methylation statesduring the cell cycle mediated by DNMT1 (16). Recent studieshave shown that patterns of DNA methylation are dynamic,with specific enzymes involved in de novo DNA methylation(DNMT3) and demethylation (TET1-3) (17). TET proteins ox-idize 5mC to 5-hydroxymethylcytosine (5hmC), 5-formylcytosine(5fC), and 5-carboxylcytosine (5caC) (18–20).Here, we describe how DNA cytosine modifications associ-

ated with regulatory and architectural elements affects patternsof gene expression, folding patterns of the genome, and antigenreceptor assembly.

ResultsDNA Methylation Landscape in B Cells. To determine whether al-terations in DNA cytosine modifications are associated with theonset of B-cell development, pro-B cells were isolated fromRAG1-deficient mice and cultured in the presence of IL-7 andSCF. Purified DNA was treated with sodium bisulfite by usingstandard procedures. Bisulfite sequencing does not distinguish5mC from 5hmC and C from 5fC and 5caC (20, 21). Hence, wewill refer to unconverted cytosines as mC (includes 5hmC/5mC),whereas converted cytosines will be referred to as C (includes 5fC/5caC/C). Consistent with previous observations, most CpGs acrossthe pro–B-cell genome were heavily methylated (81.8% on aver-age). Profiles of mCpG% at genes revealed hypomethylated

Significance

B cells are destined to produce a wide spectrum of antibodiesor immunoglobulins in response to the invading pathogens.Here, we found that the onset of B-cell development wasaccompanied by large-scale changes in DNA cytosine modifi-cations associated with DNA elements that control gene ex-pression and the folding patterns of genomes. We identifynovel DNA elements that function as superanchors and/orsuperinsulators. A prominent super-anchor is located in theimmunoglobulin heavy chain locus where it acts to facilitatethe interactions among variable, diversity, and joining DNAsegments. We propose that changes in DNA cytosine modifi-cations at regulatory DNA elements orchestrate a B cell-spe-cific transcription signature and genome structure to enablethe production of a diverse antibody repertoire.

Author contributions: C.B., T.I., and C.M. designed research; C.B. and T.I. performed re-search; C.B. analyzed data; and C.B. and C.M. wrote the paper.

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

Data deposition: The sequence reported in this paper has been deposited in the GenBankdatabase (accession no. GSE72670).1C.B. and T.I. contributed equally to this work.2To whom correspondence may be addressed. Email: [email protected] [email protected].

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1512995112/-/DCSupplemental.

12776–12781 | PNAS | October 13, 2015 | vol. 112 | no. 41 www.pnas.org/cgi/doi/10.1073/pnas.1512995112

Dow

nloa

ded

by g

uest

on

Aug

ust 3

0, 2

020

Page 2: New roles for DNA cytosine modification, eRNA, anchors, and … · New roles for DNA cytosine modification, eRNA, anchors, and superanchors in developing B cell progenitors Christopher

promoter regions and elevated levels of methylation at genebodies (Fig. S1A).We next performed an unbiased search for hypomethylated re-

gions across the pro–B-cell genome, finding 43,363 regions with anaverage mCpG% less than 30%. The majority of hypomethylatedregions mapped to promoter-distal regions (Fig. S1B). Clustering ofthe hypomethylated domains revealed distinct groups that wereidentified based on the deposition of H3K4me3 (promoters),H3K4me1/2 (enhancers), H3K27me3 (polycomb bodies), andCCCTC-binding factor (CTCF) occupancy (Fig. 1 A and B and Fig.S1D). We found that the cistromes for all relevant transcriptionfactors were hypomethylated (Fig. S1C). Factors closely associatedwith promoter binding (i.e., Yy1) showed wider windows of hypo-methylation, whereas factors typically binding at enhancer elements(i.e., E2A) displayed a narrower pattern of decreased methyl-ation (Fig. S1C). CTCF occupancy was closely associated withnucleosome-sized fluctuations in the mCpG/CpG ratios (Fig. 1C).Using Global Run-On sequencing data from pro-B cells

(GRO-Seq), we assessed the differences in methylation profiles atintergenic E2A-bound sites associated with transcriptionally activeregions producing nascent RNA (eRNA) versus E2A-bound sitesthat were transcriptionally inactive (Fig. 1D). We found that E2Aoccupancy was associated with hypomethylation at bound sites,regardless of transcriptional activity (Fig. 1D and Fig. S1E).However, E2A bound sites at transcriptionally active enhancersshowed broader regions of hypomethylated DNA (Fig. 1D andFig. S1E). A similar pattern of mCpG depletion was observed

when considering gene transcription start sites with unidirec-tional versus bidirectional transcription. Hypomethylation waspredominately associated with promoters and genomic regionspositioned immediately downstream of active transcription, whichstrongly correlates with levels of H3K4me3 (Fig. 1E and Fig. S1F).Taken together, these data suggest a mechanistic link betweennoncoding RNA transcription and modulating DNA cytosinemodifications associated with enhancer regions.

Global Changes in Enhancer Methylation During B-Cell Development.To determine whether the pre–pro-B to pro–B-cell transition isaccompanied by changes in DNA methylation, DNA was extractedfrom E2A-deficient prepro-B cells. The overall pattern of methyl-ation was similar as described for pro-B cells (80.7% mCpG over-all), including depletion of mCpG at transcriptional start sites andincreased levels of DNA cytosine modifications associated withgene bodies (Fig. S2A). More than 14,000 genomic regions wereidentified that displayed large-scale changes in DNA cytosine mod-ifications, termed DMRs (differential methylated regions) (Fig.S3A). A large fraction of these changes could be segregated into twogroups that exchanged mCpG for either deposition of H3K4me2or H3K27me3 (Fig. S3A). In contrast, a mere 1,299 genomic re-gions displayed the reverse pattern (Fig. S3A). Prepro-B–specificDMRs were predominantly associated with prepro-B–specificH3K4me2 levels (Fig. S3A). Many of these changes in patterns ofmethylation occurred at the promoters of prominent B-lineagegenes (Fig. S2 B and C).

mC

pG

%

A B

C

43

,36

3 H

ypo

me

thyl

ate

d R

eg

ion

s in

pro

-B c

ells

Pro

mo

ters

Poly

com

bE

nh

an

cers

CTC

F

Relative mCpG% or ChIP-seq Reads

H3

K4

me

3

H3

K4

me

2

H3

K4

me

1

H3

K9

ac

H3

K2

7m

e3

DN

ase

CTC

F

Inp

ut

-2k +2k

mCpG%

hypomethylatedregions

umCpG%

H3K4me2

H3K4me1

H3K9ac

H3K4me3

CTCF

DNase

H3K27me3

GRO-Seq_

+

chr4:50 kb

44,700,000 44,750,000

Pax5 Zcchc7

0% 10% 20% 30% 40% 50% 60% 70% 80%

-1500 -1000 -500 0 500 1000 1500

mC

pG%

Distance from CTCF Peak

-2000 -1000 0 1000 2000 Distance from TSS (bp)

0% 10% 20% 30% 40% 50% 60% 70% 80% 90%

100%

-3000 -2000 -1000 0 1000 2000 3000

mC

pG%

Distance from intergenic E2A Peak (bp)

Enhancer Transcription at E2A ChIP-Seq Peaks

0 0.002 0.004 0.006 0.008 0.01

0.012 0.014

Nor

mal

ized

GR

O-S

eq re

ads

+-+-

0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%

mC

pG%

Unidirectional

Promoters

Bidirectional

Promoters

E

D

E2A (high eRNA)E2A (low eRNA)

E2A-high eRNAE2A-low eRNA

Fig. 1. Global patterns of hypomethylation acrossregulatory elements in the pro–B-cell genome. (A) Un-biased hierarchical clustering of hypomethylatedgenomic regions with respect to mCpG%, histonemodification ChIP-Seq, DNase hypersensitivity, andCTCF ChIP-Seq in pro-B cells. Active promoters werecharacterized by enrichment for H3K4me3. En-hancer regions were assigned by deposition ofH3K4me1/2. Polycomb bodies were identified bythe deposition of H3K27me3. (B) Hypomethylatedgenomic regions associated with the Pax5 locus.Hypomethylated regions were observed across thePax5 promoter, enhancer, and CTCF binding sites.(C ) CpG methylation metaprofile associated withall CTCF-bound sites in pro-B cells revealing nucle-osomal fluctuations. (D) Mechanistic link involvingeRNA transcription and DNA hypomethylation.Differences in methylation profiles at intergenic E2Apeaks associated with transcriptionally active regions(>5 GRO-Seq reads per locus) versus intergenic E2A-bound sites that were not linked with eRNA tran-scription (<2 GRO-Seq reads per locus). Both sets ofbinding sites were hypomethylated at the bindingsites, but transcriptionally active E2A enhancers dis-played broader domains of hypomethylated DNA.The + and – symbols discriminate between nascentRNA transcription originating from the + and –

strand. (E) CpG methylation profile at active unidi-rectional and bidirectional promoters. Directionalityof promoters was determined by strand-specificGRO-Seq reads found upstream and downstream ofthe transcription start site (Fig. S1E).

Benner et al. PNAS | October 13, 2015 | vol. 112 | no. 41 | 12777

IMMUNOLO

GYAND

INFLAMMATION

Dow

nloa

ded

by g

uest

on

Aug

ust 3

0, 2

020

Page 3: New roles for DNA cytosine modification, eRNA, anchors, and … · New roles for DNA cytosine modification, eRNA, anchors, and superanchors in developing B cell progenitors Christopher

To identify regulatory signals in DMRs associated with thedevelopmental progression of B cells in an unbiased manner, weapplied HOMER, a de novo motif finding algorithm (Fig. S3B)(22). This analysis revealed transcription factor motifs closelylinked with a B-lineage–specific transcription signature, includingPU.1, EBF1, and E2A (Fig. S3B). MEF2, AP-1, RUNX, and PU.1binding sites were associated with genomic regions that acquiredmCpG (Fig. S3B) (23). Consistent with this analysis, we found thatE2A, EBF1, and PU.1 bound sites were highly enriched in ge-nomic regions that altered their DNA cytosine modification status(Fig. S3C). Analysis of intergenic DMRs specific to either cell typerevealed robust, bidirectional nascent RNA production indicativeof active eRNA transcription specific for each cell type (Fig. S3D).To further explore the relationship between transcription

factor binding and DNA cytosine modifications, we examinedthe distribution mCpG across genomic regions associated withE2A bound sites. We found that nearly half of the genomic re-gions bound by E2A in pro-B cells were hypermethylated(5hmC/5mC) in prepro-B cells (Fig. S2D). Notably, in pro-Bcells, hypomethylated genomic regions that displayed E47binding were associated with elevated levels of H3K4me2 (Fig.S2E). To determine how E2A binding modulates the epigeneticstate, we analyzed E2A cistromes derived from prepro-B cellstransduced with virus-expressing E47-ER (23). One hour aftertreatment with tamoxifen, E2A binding was associated withmore than 40,000 sites (Fig. S2F) (23). We found that more thanhalf of genomic regions associated with E2A occupancy werehypermethylated in prepro-B cells (Fig. S2G). Six hours aftertreatment with tamoxifen, E2A occupancy across these highlymethylated regions was closely associated with robust depositionof H3K4me1 (Fig. S2F) (23). Taken together, these data indicatethat commitment to the B-cell fate is closely associated withthe demethylation of a B-lineage–specific enhancer repertoireand that before activating a B-lineage–specific enhancer reper-toire, E47 is capable of binding to genomic regions that overallare hypermethylated.

Demethylated CpG Islands Are Predominantly Recruited to PolycombBodies. In contrast to the relatively balanced gain and loss ofDNA methylation at enhancers during the prepro-B to pro–B-cell transition, cytosine demethylation at regions that becomeassociated with H3K27me3 is specific to pro-B cells (Fig. S3A).The H3K27me3 histone modification is closely associated withpolycomb-mediated repression, representing an alternative mecha-nism to DNA methylation to silence gene expression. We foundthat more than 70% of pro-B DMRs that displayed H3K27me3deposition during the prepro-B to pro–B-cell transition were asso-ciated with CpG islands (Fig. S3E). Analysis of the entire spectrumof CpG Islands revealed a genome-wide pattern of cytosine de-methylation in pro-B cells. CpG islands that were inactive in prepro-B cells were methylated, but the methylation was lost in pro-B cellsand replaced with the acquisition of H3K4me2 (active), H3K27me3(repressed), or both (bivalent) (Fig. S3F, group II). Interestingly,two groups of CpG islands remained methylated in developingB-cell progenitors (groups III and IV) (Fig. S3F). Group III wasfound in the gene bodies of highly expressed genes because it wasclosely associated with H3K36me3, indicative of active transcriptionand possibly reflecting the presence of 5hmC (Fig. S3F). Group IVCpG islands also remained methylated (5hmC/5mC) during theprepro-B to pro–B-cell transition (Fig. S3F). Notably, group IVCpG islands were located at promoters that were highly enrichedfor genes associated with meiosis and reproduction, which mayrequire mCpG-mediated silencing rather than being localized inpolycomb bodies (Fig. S2H). Collectively, these data indicate thatduring the prepro-B to pro–B-cell transition, demethylated CpGislands are primarily recruited to polycomb bodies.

DNA Cytosine Modifications Across Heterochromatic CompartmentsAre Not Well Maintained. To determine how DNA methylationrelates to nuclear architecture, we examined the distribution ofmCpG across the pro–B-cell interactome (23). We found that the

vast majority of hypomethylated regions (>80%) were associatedwith the transcriptionally permissive compartment (Fig. S4A).To assess whether DNA cytosine modifications were maintainedduring developmental progression, we plotted the average per-centages of mCpG. We found that on average, the proportion ofmCpGs for genomic regions associated with the transcriptionallypermissive compartment was similar for prepro-B and pro-Bcells (Fig. S4B). However, we found lower mCpG rates in genomicregions associated with the heterochromatic compartment (Fig. S4B)(23). The difference in mCpG rates was derived from higher vari-ability in mCpG rates at individual CpG dinucleotides throughoutthe repressive compartment in prepro-B cells, rather than arisingfrom coordinately demethylated regions (Fig. S5A). A striking ex-ample of this was found at the Igκ locus (Fig. S5B). The proportionsof mCpGs across genomic regions spanning the Igκ locus appearedto fluctuate to a much higher degree in prepro-B cells where thelocus was associated with the heterochromatic compartment (Fig.S5A). This analysis indicates that the heterochromatic compartmentin prepro-B cells is associated with substantial allelic and/or indi-vidual cell variation across the population, whereas genomic regionsacross the euchromatic compartment are well maintained as it re-lates to the degree of cytosine modifications.

Identifying Superanchors. On a genic scale (100 kb to 1 Mb), wefound that hypomethylated genomic regions were significantlyenriched at putative anchors (Fig. S4C) (24). Consistent with theseobservations, anchors participating in multiple long-range geno-mic interactions, for example transcriptionally active hubs, weremore likely to be associated with hypomethylated than methylatedgenomic regions (Fig. S4D). Likewise, we found that putative cell-type specific anchors were highly enriched for genomic regionsthat displayed significant levels of demethylation (Fig. S4 E andF). Analysis of boundaries that flank densely interconnected as-sociated domains revealed enrichment for hypomethylated regionsand CTCF occupancy. In contrast, transcriptional active regionsdefined by the deposition of H3K4me2 or E2A occupancy werepredominantly found inside topological-associated domains (Fig.2A). We found that CTCF bound sites at boundaries containedmotifs in specific orientations, consistent with previous findingsdemonstrating that CTCF motif orientation dictates the directionof looping (Fig. 2B) (25). Furthermore, we found that boundariesassociated with topological-associated domains often revealed clus-ters of CTCF-binding sites (Fig. 2 C and D). Borrowing from theconcept that clusters of enhancers form superenhancers (26), weused the same computational framework to identify clusters ofCTCF sites, termed hereafter as “superanchors” or “superinsulators”(Fig. 2C) (Datasets S1–S3). We identified 645 superanchors in pro-Bcells. For example, we found a prominent superanchor positioned4 kb upstream of the Foxo1 locus (Fig. 2D). The Foxo1 superanchor,spanning 27 kb, is composed of eight CTCF-binding sites bound inboth pro-B and prepro-B cells and positioned on a boundary seg-regating two topological-associated domains (Fig. 2D). In prepro-Bcells, the upstream topological-associated domain harboringMaml3 was transcriptionally active, but the downstream topo-logical-associated domain encompassing Foxo1 is insulated fromthat activity and transcriptionally inactive (Fig. 2D). We found thatupon establishing B-cell identity, the Foxo1 promoter underwentdemethylation and enhancers associated with the Foxo1 locus wereactivated as characterized by the deposition of H3K4me2 (Fig. 2D).Global analysis of superanchors compared with “typical anchors”

showed that they are more enriched at topological-associateddomain boundaries, whereas superenhancers were found pre-dominantly inside topological-associated domains (Fig. 2E). Con-sistent with their placement at topological-associated domainboundaries, the distribution of CTCF binding within superanchorsrevealed that the CTCF motifs found at the 5′ and 3′ ends of thesuperanchors were oriented to form interactions either upstreamor downstream, respectively (Fig. 2F). Unlike superenhancers,which are usually cell type-specific, superanchors appeared to belargely invariant between the two cell types (Datasets S1–S3).More than 70% of superanchors identified in pro-B cells were also

12778 | www.pnas.org/cgi/doi/10.1073/pnas.1512995112 Benner et al.

Dow

nloa

ded

by g

uest

on

Aug

ust 3

0, 2

020

Page 4: New roles for DNA cytosine modification, eRNA, anchors, and … · New roles for DNA cytosine modification, eRNA, anchors, and superanchors in developing B cell progenitors Christopher

identified in prepro-B cells (Fig. S6A). Levels of CTCF bindingwithin superanchors were similar between the two cell types,consistent with the relatively invariant structure of topological-associated domains across different cell types (Fig. S6B) (27). Onenotable exception was the superanchor associated with thex-linked Firre locus (Fig. S6C), which displayed substantially moreCTCF binding and was associated with DNA hypomethylation infemale prepro-B cells compared with male pro-B cells. In othercases where superanchors were associated with developmentalprogression, developmental stage-specific CTCF occupancy bind-ing was accompanied by DNA demethylation and specific patterns

of gene expression (Fig. 2G and Fig. S6D). Collectively, theseobservations indicate a novel architectural element: superanchoror superinsulator consisting of clusters of CTCF sites.

The Ig Heavy Chain Locus Superanchor. Conspicuous among theidentified ensemble of superanchors was an unusual configurationof 10 densely bound, hypomethylated CTCF motifs that displayedthe same convergent orientation within an 8.5-kb genomic regionlocated 3′ of the Ig heavy chain (Igh) locus (Fig. 3 A and B) (28).CTCF-bound sites located across the Igh locus were hypomethy-lated and often found at a fixed distance from VH regions, whereas

Hypomethylated

proB

Hi-C

+/-

GRO-Seq

Rel

ativ

e P

eak

Den

sity

H3K4me2

mCG/CG

CTCF

GRO-Seq

mCG/CG

CTCF

0 1 2 3 4 5

0

1

2

3

0

1

2

0 1 2 3 4 5

0

1

A

E

B

D

F G

CC

TC

FE

2A

H3

K4

me

2H

3K

27

me

3h

ypo

me

thyl

ate

d

Pe

aks

or

Re

gio

ns

pe

r 1

00

kb

pe

r T

AD

-300kb TAD

Super Anchor

+300kb

Topological Domain

-200kb 0 200kb

CTCF+ CTCF-

0 1 2 3 4 5 6

-200kb 0

Mot

ifs p

er 1

00kb

pe

r TA

D B

ound

ary

Distance fromTAD 5’ Boundary

TGCA

A

GCT

CTGA

A

TCG

CT

T

AG

ACCC

ATCCC

TC

TAG

CG

C

ATG

C

A

T

GGATC

GTAC

CTGA

GACT

CATG

CTAG

G

ACC

T

G

AGC

G

ATC

G

AGAG

GTGT

GATCG

AAGC

GACT

CGA

ACGT

CTCF Motifs within Super Anchors

TADTAD

0 10 20 30 40 50 60 70 80 90

100

-5kb 0% 50% 100%+5kb

Mot

ifs p

er 1

00 k

b pe

r Sup

er A

ncho

r

CTCF+

CTCF Motif Sites

CTCF-

0

0.2

0.4

0.6

0.8

1

0 0.2 0.4 0.6 0.8 1 Nor

mal

ized

CTC

F C

hIP-

Seq

Sign

al

Super Anchor Rank

Igh

FirreFoxo1

100 kb

Maml3 AK144834 Foxo1

Super AnchorTypical Anchor

Super Enhancer Typical Enhancer

-300kb TAD +300kb

10 kb

Sstr4 Thbd

Super Anchor(CTCF motifs +/-)

pre-pro-Bpro-B

pre-pro-B

pro-B

pre-pro-B

pro-B

pre-pro-B

pro-B

pre-pro-Bpro-B

pre-pro-Bpro-B

pre-pro-Bpro-B

pre-pro-Bpro-B

pre-pro-Bpro-B

Distance fromTAD 3’ Boundary

Super Anchor (CTCF motifs )

7

H3K27me3

H3K4me2

Fig. 2. Superanchors are closely associated with hypomethylated CTCF-binding sites at boundaries flanking topological associated domains. (A) Hypo-methylated regions are frequently associated with boundary elements that define topological-associated domains. Metaplot of hypomethylated regions andChIP-Seq peaks density across topological-associated domains. (B) Distribution of CTCF motifs based on matching strand at the 5′ and 3′ edges of topological-associated domains. Primary orientation of CTCF motif at each boundary is displayed above each plot. (C) Super enhancer-style plot used to identifysuperanchors depicting the ranked signal for CTCF at putative CTCF clusters. Superanchors and typical anchors were identified in the same manner thatsuperenhancers and typical enhancers are identified, the only difference being that superanchors were found by using CTCF occupancy. As a result, typicalanchors are associated with single CTCF peaks found in the genome. The cutoff for superanchors was defined where the slope of the graph equals one. Levelsof representative superanchors indicated. (D) Superanchor that separates the Malm3 and Foxo1 loci with levels of CTCF, DNA methylation, H3K4me2, andnascent RNA levels indicated. Note the cluster of CTCF bound sites across the putative superinsulator. (E) Distribution of superanchors versus typical anchorsand superenhancers versus typical enhancers across topological domains. Levels are normalized to the total number of elements in each category to comparethe relative levels for each feature. (F) Distribution of CTCF motifs and their orientation along superanchor elements. (G) Differentially bound superanchor atthe Thbd locus with differential CTCF recruitment, DNA methylation, and histone modifications.

Benner et al. PNAS | October 13, 2015 | vol. 112 | no. 41 | 12779

IMMUNOLO

GYAND

INFLAMMATION

Dow

nloa

ded

by g

uest

on

Aug

ust 3

0, 2

020

Page 5: New roles for DNA cytosine modification, eRNA, anchors, and … · New roles for DNA cytosine modification, eRNA, anchors, and superanchors in developing B cell progenitors Christopher

predicted CTCF motifs found with the opposite orientation withinthe Igh locus were hypermethylated and often found near pseudo-VHregions (Fig. 3 A and B and Fig. S7A). The only exception to thistrend was the IGCR1 element, found immediately adjacent to DHregions. The IGCR1 contains two CTCF sites with opposing orien-tations, one of which is oriented to interact with the Igh superanchorand the other pairing with one of the VH regions, and disrupts V(D)Jrecombination when knocked out (28). We note that a similar trendwas observed for the TCRα locus (Fig. S7A).The orientation of VH region-associated CTCF motifs suggests

that they all form interactions with the Igh superanchor or theIGCR1 control region (28). To explore this possibility, we examinedthe pro–B-cell interactome by using meta Hi–C interactions identi-fied in B-cell progenitors (24, 29). This analysis revealed substantialenrichment for a spectrum of long-range genomic interactions,spanning vast genomic distances involving the VH region repertoireand the Igh superanchor as well as weak enrichment for genomicinteractions involving distal V region gene segments and the IGCR1control region (Fig. 3 A and C and Fig. S7 B–D).

DiscussionB-lineage development is initiated by the combined activities oflineage-specific transcriptional regulators that act in concert to alterthe epigenetic landscape of B-lineage regulatory elements. Here wefound that specification of the B-cell fate was closely associated withlarge-scale changes in DNA cytosine modifications across a widespectrum of regulatory elements. How are such sweeping changesacross the epigenetic landscape established? The induction of a Blineage-specific program of gene expression requires the activationof an enhancer repertoire. The activation of such enhancer reper-toires is established by the combined activities of B lineage-specificregulators. Although in prepro-B cells hypermethylated CpGswere associated with B-lineage specific enhancers, the majority oftranscription factor binding sites lack CpG residues. Consistentwith these observations, we found that in prepro-B cells, forcedE2A expression readily binds to a hypermethylated enhancerrepertoire. We suggest that in progenitor cells, E2A by itself or inconjunction with other transcription factors promotes demethyla-tion across an enhancer repertoire. Although detailed mechanistic

A

B

C

Igh Locus: chr12:113,500,000-118,200,000

PredictedCTCF motifs +/-

BoundCTCF motifs +/-

1 MbSuper Anchors

16x

Hi-C

Inte

ract

ions

Obs

erve

d/E

xpec

ted

1x

1/16x

PredictedCTCF motifs +/-

BoundCTCF motifs +/-

Super Anchors CTCF+ (unbound )

pro-BCTCF

pro-BH3K4me2

pro-BCTCF

pro-BH3K4me2

pro-BmCpG/CpG

GenesC J D V / Vpseudo

Ighv11-1Ighd1-1Ighd6-1 Ighv1-13

SuperAnchor CTCF +

CTCF -

IGCR1

C

JD

V

(bound CTCF+)Intergenic Control Region 1 Typical V pseudo regionTypical CTCF Bound V region

(IGCR1)The Igh Super Insulator

Fig. 3. Characterization of the Ig heavy chain (Igh) locus superanchor. (A) Schematic of the Igh locus depicting the locations of predicted CTCF motifs, boundCTCF motifs, DNA methylation, and ChIP-Seq signal across the locus. The normalized Hi-C interaction frequencies are shown above the locus. (B) Zoomed inrepresentation of the Igh superanchor, IGCR1 region, and representative VH and pseudo VH regions. (C) Model depicting looping patterns involving thesuperanchor and an ensemble of variable regions associated with the Igh locus.

12780 | www.pnas.org/cgi/doi/10.1073/pnas.1512995112 Benner et al.

Dow

nloa

ded

by g

uest

on

Aug

ust 3

0, 2

020

Page 6: New roles for DNA cytosine modification, eRNA, anchors, and … · New roles for DNA cytosine modification, eRNA, anchors, and superanchors in developing B cell progenitors Christopher

insight is lacking, it has been proposed that demethylation of highlytranscribed genes is accomplished by recruitment of TET proteinsby the RNA polymerase II complex (30). Here we find that eRNAtranscription is closely associated with the demethylation of aB lineage-specific enhancer repertoire. Thus, we propose that inB-cell progenitors, B lineage-specific transcription factors bindto a spectrum of hypermethylated enhancers to activate eRNAtranscription, which, in turn, leads to demethylation, convertinga poised into an active enhancer repertoire capable of formingloops with other regulatory DNA elements.We found considerable noise in the degree of methylation across

the heterochromatic compartment in multipotent progenitors. Whydoes variation in DNA cytosine modification to such a degree exist?We speculate that noise in DNA methylation permits the branchingout of different hematopoietic cell lineages from common pro-genitor cells. For example, it is well established that in multipotentprogenitor cells, the EBF1 locus is localized in the heterochromaticcompartment (24, 31). However, nuclear localization of the EBF1locus to the lamina is not absolute but variable, and we would like tosuggest that the decision of multipotent progenitors to differentiateinto a distinct cell lineage is controlled by changes in DNAmethylation at sites that control the nuclear positioning of keydevelopmental regulators.It is now well established the Igκ VJ rearrangements and allelic

exclusion are controlled at least in part by DNA cytosine modifica-tions (11, 32–34). Here, we find that the relocation of the Igκ ge-nomic region upon establishing B-cell fate is closely allied withincreased levels of CpG methylation. Why are elevated levels ofDNA cytosine modifications associated with the Igκ locus in com-mitted pro-B cells? We suggest that the increase in DNA cytosinemodifications ensures that Igκ VJ locus rearrangement is not pre-maturely initiated. As cells differentiate into small pre-B cells, theIgκ locus would then undergo allele-specific demethylation as de-monstrated in previous studies (11, 31, 32).Our data indicates that the Igh superanchor serves as a

platform to assemble variable regions separated by vast ge-nomic distances within close spatial proximity of the DHJH

elements. How does such an elaborate and intricate foldingpattern relate to the process of VDJ recombination? Recentstudies have indicated that spatial confinement is the domi-nant parameter that controls the frequencies of genomic en-counters, and we would like to propose that the Igh locussuperanchor spatially constrains subsets of VH elements andDHJH gene segments to promote rapid pairing of antigenreceptor coding elements (33, 35).

Experimental ProceduresPreviously published sequencing data used in this study: GSE40173 (Hi-C,GRO-seq, H3K4me2, H3K36me3, CTCF), GSE40173 (H3K4me1/3, H3K9ac, E2A,EBF, Foxo1), GSE21512 (PU.1), GSE35519 (Hi-C), GSE44288 (Med1), GSE38046(Pax5, DNase-Seq), GSE43008 (YY1), GSE53595 (Ikaros, Irf4), GSE21614(Smad3), and GSE45377 (Runx1).

All ChIP-Seq datawere reanalyzed from FASTQ files bymapping reads to themm9 genome using Bowtie2, and all downstream processing using HOMER.DNase-Seq data from GSE38046 was treated as ChIP-Seq data. Superanchorswere identified by using HOMER as described (22), which mimics the definitionof superenhancers described (26). Briefly, peaks were initially found in CTCFChIP-Seq data by using input as background. Peaks within 12.5 kb were thenmerged into putative superanchor regions. The combined CTCF signal fromthese regions was then normalized to input and ranked. The region where therank and signal normalized curve achieved a slope of 1 was used to determinethe cutoff for superanchors. Pro-B Hi-C data were pooled from two studies toboost the total coverage (GSE35519, GSE40173) (24, 36). Hi-C reads weretrimmed at HindIII sites and mapped to the mm9 genome by using Bowtie2.Interaction normalization, matrix creation, PCA analysis of compartments, andsignificant interaction identification at 10-kb resolution were performed byusing HOMER as described (24). Topological associated domain identificationat 10-kb resolution was also performed by using HOMER in the manner de-scribed (27) (SI Experimental Procedures).

ACKNOWLEDGMENTS. We thank Vivek Chandra and Yin Lin for help ingenerating the libraries. T.I. is the recipient of a research fellowship from theUehara Memorial Foundation. This work was supported by the NationalInstitutes of Health (C.M.).

1. Adolfsson J, et al. (2005) Identification of Flt3+ lympho-myeloid stem cells lackingerythro-megakaryocytic potential a revised road map for adult blood lineage com-mitment. Cell 121(2):295–306.

2. Inlay MA, et al. (2009) Ly6d marks the earliest stage of B-cell specification and identifiesthe branchpoint between B-cell and T-cell development. Genes Dev 23(20):2376–2381.

3. Nutt SL, Kee BL (2007) The transcriptional regulation of B cell lineage commitment.Immunity 26(6):715–725.

4. Mercer EM, Lin YC, Murre C (2011) Factors and networks that underpin early hema-topoiesis. Semin Immunol 23(5):317–325.

5. Mercer EM, et al. (2011) Multilineage priming of enhancer repertoires precedescommitment to the B and myeloid cell lineages in hematopoietic progenitors.Immunity 35(3):413–425.

6. Welinder E, et al. (2011) The transcription factors E2A and HEB act in concert to in-duce the expression of FOXO1 in the common lymphoid progenitor. Proc Natl AcadSci USA 108(42):17402–17407.

7. Beck K, Peak MM, Ota T, Nemazee D, Murre C (2009) Distinct roles for E12 and E47 inB cell specification and the sequential rearrangement of immunoglobulin light chainloci. J Exp Med 206(10):2271–2284.

8. Decker T, et al. (2009) Stepwise activation of enhancer and promoter regions of the Bcell commitment gene Pax5 in early lymphopoiesis. Immunity 30(4):508–520.

9. Bossen C, et al. (2015) The chromatin remodeler Brg1 activates enhancer repertoiresto establish B cell identity and modulate cell growth. Nat Immunol 16(7):775–784.

10. Mansson R, et al. (2012) Positive intergenic feedback circuitry, involving EBF1 andFOXO1, orchestrates B-cell fate. Proc Natl Acad Sci USA 109(51):21028–21033.

11. Cedar H, Bergman Y (2012) Programming of DNA methylation patterns. Annu RevBiochem 81:97–117.

12. Bird A (2002) DNA methylation patterns and epigenetic memory. Genes Dev 16(1):6–21.13. Gifford CA, et al. (2013) Transcriptional and epigenetic dynamics during specification

of human embryonic stem cells. Cell 153(5):1149–1163.14. Schmitz RJ, et al. (2013) Epigenome-wide inheritance of cytosine methylation variants

in a recombinant inbred population. Genome Res 23(10):1663–1674.15. Deaton AM, Bird A (2011) CpG islands and the regulation of transcription. Genes Dev

25(10):1010–1022.16. Ooi SK, O’Donnell AH, Bestor TH (2009) Mammalian cytosine methylation at a glance.

J Cell Sci 122(Pt 16):2787–2791.17. Tahiliani M, et al. (2009) Conversion of 5-methylcytosine to 5-hydroxymethylcytosine

in mammalian DNA by MLL partner TET1. Science 324(5929):930–935.18. Ito S, et al. (2011) Tet proteins can convert 5-methylcytosine to 5-formylcytosine and

5-carboxylcytosine. Science 333(6047):1300–1303.

19. Wu H, Zhang Y (2014) Reversing DNA methylation: Mechanisms, genomics, and bi-ological functions. Cell 156(1-2):45–68.

20. Hayatsu H, Shiragami M (1979) Reaction of bisulfite with the 5-hydroxymethyl groupin pyrimidines and in phage DNAs. Biochemistry 18(4):632–637.

21. Huang Y, et al. (2010) The behaviour of 5-hydroxymethylcytosine in bisulfite se-quencing. PLoS One 5(1):e8888.

22. Heinz S, et al. (2010) Simple combinations of lineage-determining transcription fac-tors prime cis-regulatory elements required for macrophage and B cell identities. MolCell 38(4):576–589.

23. Lin YC, et al. (2010) A global network of transcription factors, involving E2A, EBF1 andFoxo1, that orchestrates B cell fate. Nat Immunol 11(7):635–643.

24. Lin YC, et al. (2012) Global changes in the nuclear positioning of genes and intra- andinterdomain genomic interactions that orchestrate B cell fate. Nat Immunol 13(12):1196–1204.

25. Rao SS, et al. (2014) A 3D map of the human genome at kilobase resolution revealsprinciples of chromatin looping. Cell 159(7):1665–1680.

26. Whyte WA, et al. (2013) Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell 153(2):307–319.

27. Dixon JR, et al. (2012) Topological domains in mammalian genomes identified byanalysis of chromatin interactions. Nature 485(7398):376–380.

28. Lin SG, Guo C, Su A, Zhang Y, Alt FW (2015) CTCF-binding elements 1 and 2 in the Ighintergenic control region cooperatively regulate V(D)J recombination. Proc Natl AcadSci USA 112(6):1815–1820.

29. Zhang Y, et al. (2012) Spatial organization of the mouse genome and its role in re-current chromosomal translocations. Cell 148(5):908–921.

30. Huang Y, et al. (2014) Distinct roles of the methylcytosine oxidases Tet1 and Tet2 inmouse embryonic stem cells. Proc Natl Acad Sci USA 111(4):1361–1366.

31. Miyazaki K, Miyazaki M, Murre C (2014) The establishment of B versus T cell identity.Trends Immunol 35(5):205–210.

32. Cedar H, Bergman Y (2008) Choreography of Ig allelic exclusion. Curr Opin Immunol20(3):308–317.

33. Jhunjhunwala S, van Zelm MC, Peak MM, Murre C (2009) Chromatin architecture andthe generation of antigen receptor diversity. Cell 138(3):435–448.

34. Bossen C, Mansson R, Murre C (2012) Chromatin topology and the regulation ofantigen receptor assembly. Annu Rev Immunol 30:337–356.

35. Lucas JS, Zhang Y, Dudko OK, Murre C (2014) 3D-Trajectories adopted by coding andregulatory DNA elements: First-passage times for genomic interactions. Cell 158:339–352.

36. Hah N, et al. (2015) Inflammation-sensitive super enhancers form domains of co-ordinately regulated enhancer RNAs. Proc Natl Acad Sci USA 112(3):E297–E302.

Benner et al. PNAS | October 13, 2015 | vol. 112 | no. 41 | 12781

IMMUNOLO

GYAND

INFLAMMATION

Dow

nloa

ded

by g

uest

on

Aug

ust 3

0, 2

020


Recommended