Dynamic nucleosome landscape elicits a noncanonical gata2 pioneer model

feature-image

Play all audios:

Loading...

ABSTRACT Knowledge gaps remain on how nucleosome organization and dynamic reorganization are governed by specific pioneer factors in a genome-wide manner. In this study, we generate over


three billons of multi-omics sequencing data to exploit dynamic nucleosome landscape governed by pioneer factors (PFs), FOXA1 and GATA2. We quantitatively define nine functional nucleosome


states each with specific characteristic nucleosome footprints in LNCaP prostate cancer cells. Interestingly, we observe dynamic switches among nucleosome states upon androgen stimulation,


accompanied by distinct differential (gained or lost) binding of FOXA1, GATA2, H1 as well as many other coregulators. Intriguingly, we reveal a noncanonical pioneer model of GATA2 that it


initially functions as a PF binding at the edge of a nucleosome in an inaccessible crowding array. Upon androgen stimulation, GATA2 re-configures an inaccessible to accessible nucleosome


state and subsequently acts as a master transcription factor either directly or recruits signaling specific transcription factors to enhance WNT signaling in an androgen receptor


(AR)-independent manner. Our data elicit a pioneer and master dual role of GATA2 in mediating nucleosome dynamics and enhancing downstream signaling pathways. Our work offers structural and


mechanistic insight into the dynamics of pioneer factors governing nucleosome reorganization. SIMILAR CONTENT BEING VIEWED BY OTHERS IDENTIFYING SYNERGISTIC HIGH-ORDER 3D CHROMATIN


CONFORMATIONS FROM GENOME-SCALE NANOPORE CONCATEMER SEQUENCING Article 30 May 2022 DECODING THE EPIGENETICS AND CHROMATIN LOOP DYNAMICS OF ANDROGEN RECEPTOR-MEDIATED TRANSCRIPTION Article


Open access 03 November 2024 A PIONEER FACTOR LOCALLY OPENS COMPACTED CHROMATIN TO ENABLE TARGETED ATP-DEPENDENT NUCLEOSOME REMODELING Article 19 December 2022 INTRODUCTION Nucleosome


organization (positioning, spacing, and regularity) plays a central role in gene regulation1,2. The dynamic nucleosome reorganization is the interplay among nucleosome, wrapped DNA, and


nucleosome-binding factors such that nucleosomes sterically occlude their wrapped DNA from DNA-binding factors and ATP-dependent chromatin remodelers unwrap nucleosomal DNA or slide


nucleosomes to reposition along DNA3,4. Recent genome-wide nucleosome mapping highlighted functionally important regular nucleosomal arrays as their array regularity is often aligned at


biological features5,6. In contrast, impaired genic arrays are correlated with increased cryptic transcription with suppressive activities7. High-resolution mapping studies suggested that


DNA contains information specifying the position of nucleosomes called positioning code which facilitates the shunting of nucleosomes from one array to another by chromatin remodeling


machines8,9,10. Some studies also showed nucleosomes are subject to extensive reversible post-translational modifications that can alter the local chromatin structure poised for activation


and transcription11,12. Other studies further elucidated coordinated and antagonistic functional relationships between nucleosome remodeling and modifying machineries13,14,15. A group of


special transcription factors (TFs) called pioneer factors (PFs) such as FOXA families16,17, GATA families18,19,20, PAX715, HOXB1321, and P5322 can access target DNA sequences on


nucleosomes. A paradigm of competence for transcription is thus established that nucleosome-binding properties of PFs can engage in the assembly of regulatory factors on the DNA by either


opening the chromatin locally, positioning nucleosomes, or enabling intrinsic cooperative binding effects among other TFs23,24, or directly recruiting other chromatin modifiers and


coregulators25. Despite many advances in the understanding of nucleosome dynamics, there still remain knowledge gaps on how nucleosome organization and dynamic reorganization are governed by


specific pioneer factors in a genome-wide manner. Many studies26,27,28, including ours19,29, have found both FOXA1 and GATA2 act as PFs to trigger the androgen-induced androgen receptor


(AR) signaling pathway. It is believed that both PFs can recruit chromatin modifiers, chromatin remodelers, and chaperone molecules to establish an “open” chromatin environment or a


nucleosome-free region (NFR) to facilitate the accessibility for other factors, then initiate subsequent regulatory events. However, the pioneer capacity of GATA2 is inconclusive. Several


studies illustrated that GATA2 is simply a non-pioneer TF30 but others demonstrated it is indeed a pioneer TF19. To fully understand the detailed pioneer capacities of GATA2 in regulating


nucleosome organization in androgen stimulated prostate cancer cells, it is critical to use integrative approaches combining high-resolution genomic techniques (ChIP-exo and MNase-ChIP-seq)


and computational analyses to examine the relationship among GATA2 and nucleosome organization. In this study, we conducted high-resolution ChIP-exo, MNase-seq, and MNase-ChIP-seq in LNCaP


cell model under Vehicle (Veh) and 5α-dihydroxytestosterone (DHT)-treated conditions and generated over three billions multi-omics sequencing data. Here, we exploit the landscape of dynamic


nucleosome footprints and quantitatively define functional nucleosome states based on histone marks, genomic regions, nucleosome positioning, spacing, and regularity. We further identify


GATA2-associated dynamic nucleosome state switching upon DHT treatment. Using various in vitro assays, we demonstrate that GATA2 initially functions as a PF binding at the edge of a


nucleosome in an inaccessible crowding array. Under the DHT-treated condition, GATA2 reconfigures inaccessible to accessible nucleosome state and subsequently, it acts as a master


transcription factor either directly or to recruit signaling-specific TFs to enhance oncogenic Wnt/β-catenin signaling in an AR-independent manner. Our work thus elicits a noncanonical GATA2


pioneer model, providing a structural and mechanistic insight into the dynamics of pioneer factors governed by nucleosome reorganization. RESULTS GENOME-WIDE IDENTIFICATION OF NUCLEOSOME


POSITIONING AND SPACING We conducted multi-omics sequencing profiling in LNCaP cells in Veh and DHT-treated conditions, including MNase-seq for identifying genome-wide nucleosome


positioning, spacing, and regularity, MNase-ChIP-seq of H3K4me1, H3K4me2, H3K4me3, H3K27ac, H3K27me3, H3K36me3, and H3K79me2 for detecting enriched histone marks at a nucleosome level, as


well as ChIP-ePENS of FOXA1 and GATA2 for detecting 1 bp binding resolution, each with biological replicates (Fig. 1a). In total, we generated over three billons of multi-omics sequencing


data to investigate the landscape of nucleosome organization and dynamic reorganization (Supplementary Table 1). A Pearson correlation coefficient between two MNase-seq replicates in LNCaP


cells was very high with an _r_ value of 0.95 (Fig. 1b), demonstrating the high reproducibility and good quality of the data. We applied a nucleosome positioning tool iNPS31 on MNase-seq


data, and identified ~12.6 million nucleosomes, in equivalent to 48.6% of the whole genome DNA wrapping on the nucleosomes, which were proportionally distributed in each of 23 chromosomes


(Supplementary Table 2). We then plotted a down-sampling saturation curve and showed the data sequencing depth was sufficient to capture whole genome-wide nucleosomes (Supplementary Fig. 1).


We further used a nucleosome density map to illustrate the robust correlations of detected nucleosomes in each of 23 chromosomes between two replicates (Fig. 1c), with all _r_ values larger


than 0.93. The accumulation of nucleosome dyads with the enrichment of Mono-nucleosome, Di-nucleosomes, Tri-nucleosomes, and Penta-nucleosomes showed relatively high robustness of


nucleosome detection with adjacent three nucleosomes in a genome-wide scale. (Fig. 1d). We also observed a range of 150-350 bp with a peak of 187 bp nucleosome spacing, i.e., the distance


between two neighboring nucleosome dyads (Fig. 1e). QUANTITATIVELY DEFINING FUNCTIONAL NUCLEOSOME STATES To understand the relationship between the nucleosome positioning, spacing, and


regularity with eight histone marks which characterize genomic regulatory elements as a promoter, enhancer, repressor, and others, we examined the enrichment of histone marks on positioned


nucleosomes in three genomic regions, the Promoter (−1 Kb to 1 Kb away from TSS), the Proximal (−5 Kb to −1 Kb away from TSS), and the Distal (−50 Kb to −5 Kb away from TSS). We observed a


distinct distribution of nucleosomes enriched with different histone marks in three genomic regions (Fig. 2a). For example, H3K4me1-nucleosomes were mostly located in far the Proximal and


Distal regions and H3K27me3-nucleosomes were in the far Distal region. When looking into MNase-seq read signal distribution of histone marks in different genomic regions, we found the


characteristic spacing between nucleosomes. For example, we found a clear spacing between nucleosomes of H3K4me1 and H3K27ac in all three regions, of H3K4me3 in the Promoter/Proximal region,


of H3K27me3 in the Proximal/Distal, of H3K4me2 in the Promoter, and of H3K36me3 in the Distal region respectively (Fig. 2b). Further, we observed histone marks and genomic region-specific


nucleosome spacing patterns (Fig. 2c). For instance, H3K4me3, H3K27ac, and H3K36me3 have the shortest nucleosome spacing ranging from 170–180 bp in the Promoter region, H3K27me3 has two


peaks at 188 bp and 215 bp in the Proximal region, and H3K9me3 and H3K27me3 have a wide peak in the Distal region. Furthermore, we compared the density of histone mark-enriched nucleosomes


and gene expression, i.e., No. of nucleosomes per 1000 bp, in three regions, and found active marks, H3K4me1/2/3 and H3K27ac had a higher density than repressive marks, H3K9me3 and H3K27me3


in all three regions (Fig. 2d and Supplementary Fig. 2). Finally, we developed an empirical formula to quantitatively define the nucleosome states (Methods), which takes into account


nucleosome positioning, spacing, and histone marks, including the similarity score γ between nucleosome αn +1 and α1 to αn, where γ is calculated by nucleosome position factor λ, the peak


area of nucleosome S, the width of nucleosome position W and spacing between nucleosomes d and histone mark factor β. After iteratively running the formula to optimize the parameters, we


were able to determine the trajectory of optimized parameters used to define the number of nucleosome states. Indeed, when further incorporating various genomic features with the trajectory,


including histone marks, the number of the grouped nucleosomes, genomic location, degree of positioning, regularity score, and average spacing, we were able to define nine functional


states, S1–S9 (Fig. 2e, f and Supplementary Fig. 3). S1 was defined as transcriptional initial due to its location around the up-promoter or 5′TSS region with active H3Kme3/K27ac marks. S2


was defined as an accessible edge due to its mainly located in proximal/distal regions and having 1–4 nucleosomes, a shorter spacing (180.43 bp shortest except S1), and a relatively low


regularity score of 9.85. S3 was defined as Alternative primed due to its locating in promoter/proximal regions and having primed or poised marks H3K27me3/H3K4me1/H3K4me2. S4 was defined as


Crowding array because of its higher number of 5–20 positioned nucleosomes in an array with the highest average spacing of 210.64 bp and 75.4% of states enriched with H3K9me3 and/or H3K27me3


marks. S5 was identified as well-organized due to its location in the down-promoter region with the highest regularity score of 21.00. S6 was defined as restricted accessible since a


majority (54.1%) of S6 were in the lower average spacing and in proximal/distal regions with H3K4me2/H3K27me3 marks. S7 was defined as a Steady structure due to 69.8% of states were in a


distal region with various marks. S8 was defined as Fuzzy due to its lower degree of positioning and the corresponding low regularity score and S9 was defined as Unknown due to its unclear


features. S1 and S5 have been extensively studied in previous work32,33,34, therefore, we focused on thoroughly examining the functionality of S2, S3, S4, S6, and S7 and their relationship


with FOXA1, GATA2, and other TFs and coregulators in the downstream analyses. DYNAMIC NUCLEOSOME STATES SWITCHING We extended our quantitative modeling of nucleosome states on MNase-seq and


MNase-ChIP-seq data in LNCaP cells under DHT-treatment conditions. Intriguingly, we obtained the same functional nucleosome states, demonstrating the validity and broadness of our


quantitative definition. When comparing the changes in nucleosome states before and after DHT treatment, we found the numbers of S2, S3, and S6 increase while the numbers of S4 and S7


decrease (Fig. 3a). Interestingly, we found dynamic switches among different nucleosome states upon DHT treatment. Sankey's diagram showed over 72.1% of S4 have been switched to other


states including 45.4% of them turning into S3, while 42.3% of S3 have been changed to S2, and the number of S2 increases to 202.4% in the DHT-treatment condition (Fig. 3b). Next, we wanted


to investigate what transcription factors (TFs) and coregulators could potentially instruct these dynamic nucleosome state switches particularly from two condensed states, S4 (Inaccessible)


or S3 (Alternative primed) to others. We downloaded many publicly available ChIP-seq data of PFs, TFs, and coregulators and examined their differential binding patterns35,36,37,38 (Fig. 3c


and Supplementary Fig. 4). As expected, we found FOXA1 showed a significantly lost binding from S4 to S3 or S2 accompanied by a lost linker H1. This finding is consistent with many other


studies39,40, where FOXA1 competes with its canonical binding motif with H1 to enhance nucleosome accessibility (Supplementary Fig. 5). We also found the bindings of GATA2, HOXB13, RUNX1,


and TLE3 were lost, implicating their potential pioneer capacities of opening condensed nucleosomes. We then particularly examined the expression level of genes associated with nucleosome


states switching from S4 to relatively accessible states (RAS) including S2, S3, S7, and nucleosome-free region (NFR) named as RAS1; and from S3 to relatively accessible states including S2,


S6, and NFR named as RAS2. Interestingly, we found a majority (>80%) of the genes associated with differential binding of FOXA1, GATA2, and H1 were upregulated, indicating both FOXA1 and


GATA2 play pioneer roles capable of dynamically reprogramming nucleosome accessibility, resulting in gene activation (Fig. 3d and Supplementary Figs. 6, 7). Furthermore, we found there were


257 and 293 unique GATA2 genes with only GATA2 binding but no other analyzed TF bindings from S4 to RAS1 and S3 to RAS2 respectively, suggesting that this subset of GATA2 genes can


independently exert a pioneer function upon androgen stimulation (Fig. 3e). GATA2-ASSOCIATED DYNAMIC NUCLEOSOME STATES SWITCHING To elucidate the pioneer capacity of GATA2, we conducted


ChIP-ePENS of GATA2 in both Vehicle (Veh) and DHT-treated LNCaP cells and used ePEST to identify GATA2 binding borders at one-base resolution21 (Supplementary Fig. 8 and Supplementary Table 


3). We identified a total of 32,342 and 27,613 border composed sites (BCSs) as binding footprint boundaries in Veh and DHT-treatment respectively. Interestingly, we found paired border sites


(PBSs) of GATA2 borders followed a bimodal distribution with a 13–14 bp gap, similar to the distribution of FOXA1 borders with an 11 bp in our previous study41 (Fig. 4a, Supplementary Fig. 


9, and Supplementary Data 1). About 42.1% of GATA2 borders were associated with the nucleosome states in Veh and dropped to 36.2% in DHT-treated cells (Fig. 4b, Supplementary Figs. 10, 11,


and Supplementary Data 2, 3). GATA2 borders on S2 showed a dramatic decrease at 45.5% upon the DHT treatment (Fig. 4c). By plotting the accumulating distribution of borders around the


nucleosome dyad, we found more than 80% of GATA2 borders were located ~50–60 bp on the edge of the nucleosome for S4 and S3, and the peaks tended to spread wider in the DHT treatment, while


a majority of GATA2 borders were located in the middle of nucleosomes for S6 and S7 (Fig. 4d and Supplementary Fig. 12). Two examples demonstrated GATA2-associated nucleosomes switching from


inaccessible to accessible states while maintaining its binding (Fig. 4e and Supplementary Fig. 13). We further examined the open chromatin changes for both GATA2-associated nucleosome


states switching by ATAC-seq data and observed a great increase from 11.2 to 37.1% for S4-RAS1 and 38.6 to 72.3% for S3-RAS2, respectively (Fig. 5a and Supplementary Fig. 14). Further, only


21 (7.6%) of 278 genes associated with unique GATA2-governed S4-RAS1 switching and only 9 (6.7%) of 135 genes associated with unique GATA2-governed S3-RAS2 switching were overlapped with


1036 AR-dependent DHT-treated differentially expressed genes, respectively, suggesting that the vast majority of GATA2-governed dynamic nucleosome states switching are independent of AR


signaling (Fig. 5b and Supplementary Figs. 15, 16). KEGG pathway analysis further identified WNT signaling and nuclear receptor meta pathways were the top pathways for S4-RAS1 and for


S3-RAS2 respectively (Fig. 5c). GATA2 strongly preferred binding at the edge of nucleosomes and were almost equally distributed on both sides (Fig. 5d). Remarkably, ZNF700, IRF protein


family, ZNF569, and SOX protein family were found as the top enriched motifs in GATA2-governed S4-RAS1 switching (Fig. 5e) and SOX9 was also identified as a potential co-binding TF on WNT


signaling genes by a publicly available database collected all ChIP-seq of TFs from ENCODE and ChEA42 (Fig. 5f and Supplementary Fig. 17). Taken together, our data suggested a noncanonical


pioneer model of GATA2 that it initially functions as a PF binding at the edge of a nucleosome in an inaccessible crowding array; under the DHT-treated condition, it reconfigures


inaccessible to accessible nucleosome state; subsequently, it acts as a master transcription factor either directly or to recruit other signaling-specific TFs to enhance WNT signaling in an


AR-independent manner. GATA2 IN MEDIATING NUCLEOSOME DYNAMICS AND ENHANCING WNT SIGNALING PATHWAY To substantiate this model, we conducted various in vitro assays on 20 Wnt/β-catenin


signaling genes selected from GATA2-governed S4-RAS1 switching. Competitive nucleosome-binding assays including in vitro nucleosome-binding and electrophoretic mobility shift assays were


designed to detect a binding range of nucleosome position with GATA2 binding border at 0, 41, 65, and 85 bp. The 65 bp DNA showed the highest supershift of the others, confirming that GATA2


prefers binding at the edge of nucleosomes (Fig. 6a and Supplementary Fig. 18). Open chromatin assays further demonstrated that 17 of 20 Wnt/β-catenin signaling genes showed an increase in


chromatin opening upon DHT treatment (Supplementary Fig. 19). Our genome-wide ATAC-seq data showed that a majority of 413 GATA2-governed S4/3-RAS1/2 switching genes significantly reduced the


chromatin accessibility and overall 29.6% of open chromatin regions on a genome-wide scale were lost after knockdown GATA2 gene (Fig. 6b and Supplementary Figs. 20,21). Together, we


provided several lines of evidence to support the notion that GATA2 was involved in regulating chromatin accessibility and nucleosome reorganization. Furthermore, we used siRNA to knock down


GATA2 in LNCaP cells to create a siGATA2 subline and measured the gene expression changes by RT-qPCR. We found that 14 of 20 genes in siGATA2 vs siCtrl LNCaP cells were downregulated under


the DHT-treated condition (Fig. 6c), suggesting that GATA2 regulates Wnt/β-catenin signaling gene expression levels. Collectively, our results revealed a dual pioneer and master role of


GATA2 in mediating nucleosome dynamics and enhancing downstream Wnt/β-catenin signaling in an AR-independent manner (Fig. 6d). DISCUSSION In this study, we systematically defined genome-wide


functional nucleosome states with a quantitative method (Fig. 2e). One major advantage of our method is to fully utilize the high-resolution nucleosome level genomic data including


MNase-seq and MNase-ChIP-seq. Intriguingly, we were able to obtain the same functional nucleosome states in DHT-treated LNCaP cells as in untreated cells when applying the method to the


data, demonstrating the validity and broadness of our quantitative definition. Although numerous studies have revealed the basic principles of nucleosome organization and its dynamics3,4,


our work clearly filled a knowledge gap in the field since most of the previous work were focused on qualitatively defining the nucleosome states without systematically providing the


trajectory of clear quantitative cutoff thresholds38, or on examining the nucleosome landscape in specific genomic regions4. More importantly, these functional nucleosome states could be


further used to elicit the pioneer capacity of any TFs including known PFs by integrating with one-base resolution ChIP-ePENS or ChIP-exo data. In theory, our integrative approach can


accurately define the pioneer capacity of any known PFs or distinguish the pioneer factors from non-pioneer factors by comparing two or more different biological conditions. This statement


is attested by the following four foundations of our approach: (1) we identify the PF/TF-associated condensed nucleosome states; (2) we identify the PF/TF binding borders within the


nucleosomes; (3) we determine whether the PF/TF is accompanying the nucleosome switches under at least two biological conditions; and (4) we perform competitive nucleosome-binding assays to


validate the pioneering capacity. Although our approach is able to define the pioneer functionality of any TFs, we are cautious that the PF functionality and capacity should be interpreted


tightly with the specific biological context. Nevertheless, our approach highlights the importance of utilizing the high resolution of high throughput genomic data in elucidating the pioneer


function of TFs. Remarkably, we found almost half (42.1%) of GATA2 was bound on nucleosomes (combined all states) in untreated LNCaP cells and 51.8% of these GATA2-associated nucleosomes


were switched to more accessible nucleosomes or free nucleosome regions in DHT-treated cells (Fig. 4b), suggesting that this subset of GATA2 might function as a pioneer factor in the


hormone-induced context. We unexpectedly found this GATA2 pioneer action exerts in an AR-independent manner and regulates specific downstream signaling pathways (Fig. 5b, c). It seems that


GATA2 further acts as a master transcription factor either directly or to recruit other signaling-specific TFs to the chromatin to regulate the Wnt/β-catenin pathway upon androgen


stimulation (Fig. 6). This data is in stark contrast with the archetypical pioneer function of FOXA1 such that FOXA1 opens the condensed chromatin to mainly serve for an AR binding activity


under DHT-treated conditions (Supplementary Figs. 22, 23). Collectively, our data support a noncanonical pioneer GATA2 model and elicit the pioneer capacity of GATA2 action in


hormone-induced prostate cancer cells. Despite that previous studies19,25,26,27 have demonstrated the pioneer functionality of GATA2 in hormone-induced prostate cancer cells, all of these


studies emphasized on the pioneering role of GATA2 in activating or enhancing AR-dependent gene transcription. By contrast, our results illustrated a pioneer function for GATA2 regulation in


which it regulates oncogenic Wnt/β-catenin signaling by circumventing AR signaling. Our finding that GATA2 exerts an AR-independent functionality in promoting aggressive prostate cancer is


consistent with a previous study that GATA2 regulates a core subset of clinically relevant genes in an AR-independent manner43. In summary, we provided a quantitative model of defining


functional nucleosome states to the community. We also conducted a detailed examination of the pioneer capacity of GATA2 in regulating dynamical nucleosome reorganization in hormone-induced


prostate cancer cells, and further implicated GATA2-mediated Wnt/β-catenin signaling in conferring aggressiveness in prostate cancer. Our work may provide a rationale for targeting GATA2


downstream signaling as a therapeutic strategy to treat advanced prostate cancer. Our work also offers a structural and mechanistic insight into the dynamics of pioneer factors governed by


nucleosome reorganization. METHODS MNASE-CHIP-SEQ AND MNASE-SEQ MNase-ChIP-seq and MNase-seq protocols were performed according to previous studies44. In brief, LNCaP cells were exposed to


10 nM DHT or DMSO (Veh) for 4 h. Mono-nucleosomes with solubilized chromatin was achieved by MNase digestion of 2 min at 37 °C, then immunoprecipitated with antibody-conjugated magnetic


beads. DNA is phenol extracted and ethanol precipitated. Libraries were prepared from isolated DNA and sent for sequencing on the Illumina HiSeq3000 at the UTHSA sequencing core. All samples


were performed in biological replicates. Antibodies include: H3K4me1 (ab8895) 1:500 dilution, H3K4me2 (ab7766) 1:250 dilution, H3K27ac (ab4729) 1:500 dilution, H3K27me3 (ab6002) 1:250


dilution, H3K36me3 (ab9050) 1:250 dilution, H3K79me2 (ab8898) 1:250 dilution from Abcam (Cambridge, MA). H3K4me3 (07-473) 1:500 dilution, H3K9me3 (17-10242) 1:250 dilution from Millipore


(Upstate). CHIP-EPENS A modified ChIP-exo protocol for TFs was performed as following steps21: Cells were fixed with 1% formaldehyde for 10 min at room temperature and chromatin was


sonicated and incubated overnight with 2–4 μg antibodies against GATA2(sc-9008, Santa Cruz) 1:250 dilution with biological replicates. T4 DNA polymerase, T4 PNK, and Klenow DNA Polymerase


were used together for end polishing. The ligation step was performed with 1 mM dithiothreitol. Protein A Dynal magnetic beads were washed using modified RIPA buffer (50 mM Tris-HCl pH 8.0,


1 mM EDTA, 0.25% sodium deoxycholate, 1% NP-40, 0.5 M LiCl) followed by Tris pH 8.0 twice during each step. The library was amplified with only 10–12 cycles and prepared without gel-based


size selection. Paired-end sequencing (50 bp) was performed by Illumina HiSeq2500. ATAC-SEQ About 50,000 cells were resuspended in cold ATAC-seq resuspension buffer (10 mM Tris-HCl pH 7.4,


10 mM NaCl, and 3 mM MgCl2). Cell nuclei were then prepared by incubation in 50 μl of ATAC-seq resuspension buffer containing 0.1% NP-40, 0.1% Tween-20, and 0.01% digitonin on ice for 3 min.


After centrifugation, nuclei were resuspended in 50 μl of transposition mix (25 μl 2× TD buffer, 2.5 μl Nextera Tn5 transposase (Illuminar), 16.5 μl PBS, 0.5 μl 1% digitonin, 0.5 μl 10%


Tween-20, and 5 μl water), and incubated at 37 oc for 30 min in a thermomixer with shaking at 1000 rpm. Transposed fragments were then purified with a Zymo DNA Clean and Concentrator-5 Kit.


All libraries showed sufficient amplification after the five pre-amplification cycles and were quantified using the KAPA Library Quantification Kit. Libraries were then sequenced using


Illumina Novaseq 6000 at the Duke sequencing core. All samples were performed in biological replicates. DATA MAPPING MNase-seq, MNase-ChIP-seq, and ChIP-ePENS sequencing datasets were


generated in LNCaP cells with both Veh and DHT-treated conditions. Raw sequence reads were aligned against the human genomic sequence (hg19) using bowtie2 (version 2.2.8) with -v 3 -k 2 -m 1


-I 20 -X 400 for ChIP-seq data. Only uniquely mapped reads were used for further downstream analysis. DETECTION OF NUCLEOSOME POSITIONING We applied iNPS31, which uses a Laplacian of


Gaussian convolution model to obtain smooth estimates of the stringency of nucleosome positioning, on MNase-seq data to detect nucleosome centers (dyads) and robustly estimate the degree of


positioning. The kernel bandwidth w is a key parameter to control the smoothness of the stringency profile. We initially chose w = 30 as suggested in ref. 39 to conduct the calculation, then


adjust it to make sure that it provides sufficient smoothing for the particular data without sacrificing the sharpness of the positioning estimate. We assigned nucleosomes to 21,319genes


with their longest isoforms in which each gene was divided into three genomic regions: Promoter region of −1 Kb to 1 Kb around the transcriptional start site (TSS), Proximal region of −5 Kb


to −1 Kb upstream of TSS and Distal region of −50 Kb to −5 Kb upstream of TSS. We used MACS244 to identify enriched peaks of various histone marks using nonmodel and shift options in order


to remove a technical bias and peak shifting for ChIP-seq data. NUCLEOSOMESGROUPING AND STATES CLASSIFICATION The degree of positioning describes how well the nucleosome is positioned in the


cells’ population and the regularity score indicates the periodical feature of a nucleosome array, which is measured by calculating power spectral density with an interpolating method and


Welch’s method for the nucleosome states’ array. For quantitatively defining the nucleosome states, we utilized the following features related to nucleosomes, nucleosome positioning and


spacing, histone marks, and the similarity ratio between continuous nucleosomes. $${{{{{{\rm{\lambda


}}}}}}}_{{{{{{\rm{i}}}}}}}={{{{{{\rm{S}}}}}}}_{{{{{{\rm{i}}}}}}}/{{{{{{\rm{W}}}}}}}_{{{{{{\rm{i}}}}}}}+\omega \,(\mu \,-\,{{{{{{\rm{d}}}}}}}_{{{{{{\rm{i}}}}}}})$$ (1) $${{{{{{\rm{\beta


}}}}}}}_{{{{{{\rm{i}}}}}}}=\mathop{\sum}\limits_{k=1}^{n}\frac{{dk}}{{dx}}{a}^{k}{b}_{{{{{{\rm{i}}}}}}}$$ (2) where λi represents the nucleosome positioning and spacing, as Si is the peak


area of nucleosome Ni, Wi is the width of nucleosome position, ω is the weight for the spacing factor, μ is the average spacing in a specific area and di is the actual spacing between


nucleosomes. βi represents the histone mark state factor, as \(\frac{{dk}}{{dx}}\) is the weight of a specific histone mark k, while \({a}^{k}\) is the relative number of reads of histone


mark k and \(b\)i is the relative number of reads of nucleosome Ni $${{{{{\rm{\gamma }}}}}}=({{{{{{\rm{\lambda }}}}}}}_{{{{{{\rm{i}}}}}}}{{{{{{\rm{\beta


}}}}}}}_{{{{{{\rm{i}}}}}}}-{{{{{{\rm{\lambda }}}}}}}_{{{{{{\rm{i}}}}}}+1}{{{{{{\rm{\beta }}}}}}}_{{{{{{\rm{i}}}}}}+1})/\mathop{\sum}\limits_{n=1}^{i}{{{{{{\rm{\lambda


}}}}}}}_{{{{{{\rm{i}}}}}}}{{{{{{\rm{\beta }}}}}}}_{{{{{{\rm{i}}}}}}}$$ (3) where γ calculates the similarity ratio between nucleosome Ni+1 and N1 to Ni. If γ among calculated nucleosomes is


lower than 10%, the nucleosomes were merged into the same group. We defined nucleosome dyad position, degree of positioning, regularity score, and histone peak signals as grouped nucleosome


profiles. We then performed K-means clustering on grouped nucleosome profiles to obtain distinct classes of nucleosome states. IDENTIFICATION OF GATA2 BORDERS Border-calling of ChIP-ePENS


data was conducted by ePEST (version 1.0) with the parameter of -D True -p 1e-8 -R 25 -c 0.05 -k 2.0 -o. The ePEST algorithm was specifically designed for ChIP-ePENS40 and depending on a


statistical evaluation of Chernoff inequity on exo-5′-end reads and r-scan statistic method for peak-calling on son-3′-end reads, Border-calling was conducted specifically within these


binding regions and borders were finally assigned into each individual binding site by a graph-based strategy. A GATA2-associated gene was defined as the closest gene of GATA2 border


bindings and each GATA2 border pair was assigned to only one gene according to the order of the following criterion: gene body region (TSS~ TES of a gene), promoter region (TSS~ −1 Kb


upstream of a gene), proximal region (−5 Kb ~ −1 Kb upstream of a gene), then distal region (−50 Kb ~ −5 Kb upstream of a gene) and no associated gene. DIFFERENTIALLY EXPRESSED GENE


ANDATAC-SEQ ANALYSIS RNA-seq data were aligned by STAR (version 2.5.3) with default parameters. The differentially expressed genes were performed by HTseq-count (version 0.9.1) and DESeq2


(version 1.10.1) with thresholds of log2(|folder change | ) >1 and _p_ values < 0.05. ATAC-seq data of LNCaP cells were downloaded from the Gene Expression Omnibus (GSE105116).


ATAC-seq peaks were called using HOMER (version 4.10) find peaks localSize 50000 -size 150 -minDist 50 –fragLength 0 -style dnase. Differential accessibility was called using DESeq2 and


hyper- and hypo-accessible peaks were defined with a | log2 FC | > 1 and an adjusted _p_ value < 0.01. SIRNA ASSAY Silencer® Select siRNAs of GATA2 were obtained from Thermo Fisher


Scientific (Catalog #4392420, Santa Clara, CA). Each sample was performed in triplicates with siRNA of different targets. For transfection of siRNA oligos, cells were seeded in six-cell


plates with Lipofectamine® RNAiMAX Transfection Reagent for 48 h. The knockdown efficiency was detected by qPCR. IN VITRO NUCLEOSOME-BINDING AND ELECTROPHORETIC MOBILITY SHIFT ASSAYS The


nucleosome-binding assay was performed as following22: In vitro nucleosomes were generated from H2A/H2B dimer and H3.1/H4 tetramer (NEB). Synthesized double-stranded DNA sequences were mixed


at equal molar amounts and then added to histones at octamer/DNA molar ratios of 1.5:1 in 2 M NaCl. Nucleosomes were reconstituted through salt gradient dialysis and further purified by


7–20% sucrose gradient centrifuge and concentrated by 50,000 centrifugal filter units (Millipore, Amicon ultra). The protein-nucleosome-binding assays were carried out with the purified


nucleosomes mentioned above and human full-length recombinant GATA2 protein (Abcam catalog no. ab134866) in a 7 μL DNA binding buffer and incubated for 30 min. Protein binding was analyzed


by non-denaturing polyacrylamide gel electrophoresis with ethidium bromide staining. OPEN CHROMATIN ASSAY Open chromatin assay was performed using a Chromatin Accessibility Assay Kit


(ab185901) from Abcam (Cambridge, MA)45. Briefly, LNCaP cells were lysed and chromatin was extracted by adding lysis buffer to cell pellets and incubating for 10 min. Chromatin pellets were


centrifuged and resuspended in a nuclease reaction mix and then added to a stop solution and incubated with Proteinase K. Accessible fragments were then purified by DNA binding columns and


gene targets were analyzed with PCR by comparing the nuclease-treated condition and the no-nuclease control. DATA AVAILABILITY The data that support this study are available from the


corresponding authors upon reasonable request. The MNase-seq, MNase-ChIP-seq, ChP-ePENS of GATA2, and ChIP-seq of H1 data generated in this study have been deposited in the GEO database


under accession code GSE148935 and GSE182529. Source data are provided with this paper. CODE AVAILABILITY All codes used in this study are available on Github


(www.github.com/tianbao365/Nuc-PF) and Zenodo (https://zenodo.org/badge/latestdoi/296872753). REFERENCES * Jiang, C. & Pugh, B. F. Nucleosome positioning and gene regulation: advances


through genomics. _Nat. Rev. Genet._ 10, 161–172 (2009). Article  CAS  PubMed  PubMed Central  Google Scholar  * Zhou, K., Gaullier, G. & Luger, K. Nucleosome structure and dynamics are


coming of age. _Nat. Struct. Mol. Biol._ 26, 3–13 (2019). Article  CAS  PubMed  Google Scholar  * Struhl, K. & Segal, E. Determinants of nucleosome positioning. _Nat. Struct. Mol. Biol._


20, 267–273 (2013). Article  CAS  PubMed  PubMed Central  Google Scholar  * Schones, D. E. et al. Dynamic regulation of nucleosome positioning in the human genome. _Cell_ 132, 887–898


(2008). Article  CAS  PubMed  Google Scholar  * Lieleg, C., Krietenstein, N., Walker, M. & Korber, P. Nucleosome positioning in yeasts: methods, maps, and mechanisms. _Chromosoma_ 124,


131–151 (2015). Article  CAS  PubMed  Google Scholar  * Lieleg, C. et al. Nucleosome spacing generated by ISWI and CHD1 remodelers is constant regardless of nucleosome density. _Mol. Cell.


Biol._ 35, 1588–1605 (2015). Article  CAS  PubMed  PubMed Central  Google Scholar  * Zaret, K. S. et al. Pioneer factors, genetic competence, and inductive signaling: programming liver and


pancreas progenitors from the endoderm. _Cold Spring Harb. Symp. Quant. Biol._ 73, 119–126 (2008). Article  CAS  PubMed  PubMed Central  Google Scholar  * Hu, G. et al. Regulation of


nucleosome landscape and transcription factor targeting at tissue-specific enhancers by BRG1. _Genome Res._ 21, 1650–1658 (2011). Article  CAS  PubMed  PubMed Central  Google Scholar  *


Iwafuchi-Doi, M. & Zaret, K. S. Pioneer transcription factors in cell reprogramming. _Genes Dev._ 28, 2679–2692 (2014). Article  PubMed  PubMed Central  CAS  Google Scholar  * Rhee, H.


S. & Pugh, B. F. Comprehensive genome-wide protein-DNA interactions detected at single-nucleotide resolution. _Cell_ 147, 1408–1419 (2011). Article  CAS  PubMed  PubMed Central  Google


Scholar  * Chen, Z., Wang, L., Wang, Q. & Li, W. Histone modifications and chromatin organization in prostate cancer. _Epigenomics_ 2, 551–560 (2010). Article  CAS  PubMed  Google


Scholar  * Maxouri, S., Taraviras, S. & Lygerou, Z. Visualizing the dynamics of histone variants in the S-phase nucleus. _Genome Biol._ 19, 182 (2018). Article  PubMed  PubMed Central 


CAS  Google Scholar  * Petty, E. & Pillus, L. Balancing chromatin remodeling and histone modifications in transcription. _Trends Genet._ 29, 621–629 (2013). Article  CAS  PubMed  PubMed


Central  Google Scholar  * Mayran, A. et al. Pioneer factor Pax7 deploys a stable enhancer repertoire for specification of cell fate. _Nat. Genet._ 50, 259–269 (2018). Article  CAS  PubMed 


Google Scholar  * Rondelet, G., Dal Maso, T., Willems, L. & Wouters, J. Structural basis for recognition of histone H3K36me3 nucleosome by human de novo DNA methyltransferases 3A and 3B.


_J. Struct. Biol._ 194, 357–367 (2016). Article  CAS  PubMed  Google Scholar  * Carroll, J. S. et al. Chromosome-wide mapping of estrogen receptor binding reveals long-range regulation


requiring the forkhead protein FoxA1. _Cell_ 122, 33–43 (2005). Article  CAS  PubMed  Google Scholar  * Lee, C. S., Sund, N. J., Behr, R., Herrera, P. L. & Kaestner, K. H. Foxa2 is


required for the differentiation of pancreatic alpha-cells. _Dev. Biol._ 278, 484–495 (2005). Article  CAS  PubMed  Google Scholar  * Zhu, F. et al. The interaction landscape between


transcription factors and the nucleosome. _Nature_ 562, 76–81 (2018). Article  ADS  CAS  PubMed  PubMed Central  Google Scholar  * Wu, D. et al. Three-tiered role of the pioneer factor GATA2


in promoting androgen-dependent gene expression in prostate cancer. _Nucleic Acids Res._ 42, 3607–3622 (2014). Article  CAS  PubMed  PubMed Central  Google Scholar  * Cirillo, L. A. et al.


Opening of compacted chromatin by early developmental transcription factors HNF3 (FoxA) and GATA-4. _Mol. Cell_ 9, 279–289 (2002). Article  CAS  PubMed  Google Scholar  * Chen, Z. et al.


Diverse AR-V7 cistromes in castration-resistant prostate cancer are governed by HoxB13. _Proc. Natl Acad. Sci. USA_ 115, 6810–6815 (2018). Article  CAS  PubMed  PubMed Central  Google


Scholar  * Yu, X. & Buck, M. J. Defining TP53 pioneering capabilities with competitive nucleosome binding assays. _Genome Res._ 29, 107–115 (2019). Article  CAS  PubMed  PubMed Central 


Google Scholar  * Soufi, A. et al. Pioneer transcription factors target partial DNA motifs on nucleosomes to initiate reprogramming. _Cell_ 161, 555–568 (2015). Article  CAS  PubMed  PubMed


Central  Google Scholar  * Tewari, A. K. et al. Chromatin accessibility reveals insights into androgen receptor activation and transcriptional specificity. _Genome Biol._ 13, R88 (2012).


Article  CAS  PubMed  PubMed Central  Google Scholar  * Fernandez Garcia, M. et al. Structural features of transcription factors associating with nucleosome binding. _Mol. Cell_ 75,


921–932.e6 (2019). Article  CAS  PubMed  Google Scholar  * Magnani, L., Eeckhoute, J. & Lupien, M. Pioneer factors: directing transcriptional regulators within the chromatin environment.


_Trends Genet._ 27, 465–474 (2011). Article  CAS  PubMed  Google Scholar  * Voss, T. C. & Hager, G. L. Dynamic regulation of transcriptional states by chromatin and transcription


factors. _Nat. Rev. Genet._ 15, 69–81 (2014). Article  CAS  PubMed  Google Scholar  * Zaret, K. S. & Carroll, J. S. Pioneer transcription factors: establishing competence for gene


expression. _Genes Dev._ 25, 2227–2241 (2011). Article  CAS  PubMed  PubMed Central  Google Scholar  * Hankey et al. Shaping chromatin states in prostate cancer by pioneer transcription


factors. _Cancer Res._ 80, 2427–2436 (2020). Article  CAS  PubMed  PubMed Central  Google Scholar  * Zhao, J. C. et al. FOXA1 acts upstream of GATA2 and AR in hormonal regulation of gene


expression. _Oncogene_ 35, 4335–4344 (2016). Article  CAS  PubMed  PubMed Central  Google Scholar  * Chen, W. et al. Improved nucleosome-positioning algorithm iNPS for accurate nucleosome


positioning from sequencing data. _Nat. Commun._ 5, 4909 (2014). Article  ADS  CAS  PubMed  Google Scholar  * He, H. H. et al. Nucleosome dynamics define transcriptional enhancers. _Nat.


Genet._ 42, 343–347 (2010). Article  CAS  PubMed  PubMed Central  Google Scholar  * Liu, Q., Bonneville, R., Li, T. & Jin, V. X. Transcription factor-associated combinatorial epigenetic


pattern reveals higher transcriptional activity of TCF7L2-regulated intragenic enhancers. _BMC Genomics_ 18, 375 (2017). Article  PubMed  PubMed Central  CAS  Google Scholar  * Robinson, J.


L. et al. Elevated levels of FOXA1 facilitate androgen receptor chromatin binding resulting in a CRPC-like phenotype. _Oncogene_ 33, 5666–5674 (2014). Article  CAS  PubMed  Google Scholar  *


Kim, J. Y. et al. A role for WDR5 in integrating threonine 11 phosphorylation to lysine 4 methylation on histone H3 during androgen signaling and in prostate cancer. _Mol. Cell_ 54, 613–625


(2014). Article  CAS  PubMed  PubMed Central  Google Scholar  * Ma, F. et al. SOX9 drives WNT pathway activation in prostate cancer. _J. Clin. Invest_ 126, 1745–1758 (2016). Article  PubMed


  PubMed Central  Google Scholar  * Malinen, M., Niskanen, E. A., Kaikkonen, M. U. & Palvimo, J. J. Crosstalk between androgen and pro-inflammatory signaling remodels androgen receptor


and NF-kappaB cistrome to reprogram the prostate cancer cell transcriptome. _Nucleic Acids Res._ 45, 619–630 (2017). Article  CAS  PubMed  Google Scholar  * Stelloo, S. et al. Endogenous


androgen receptor proteomic profiling reveals genomic subcomplex involved in prostate tumorigenesis. _Oncogene_ 37, 313–322 (2018). Article  CAS  PubMed  Google Scholar  * Valouev, A. et al.


Determinants of nucleosome organization in primary human cells. _Nature_ 474, 516–520 (2011). Article  CAS  PubMed  PubMed Central  Google Scholar  * Caravaca, J. M. et al. Bookmarking by


specific and nonspecific binding of FoxA1 pioneer factor to mitotic chromosomes. _Genes Dev._ 27, 251–260 (2013). Article  CAS  PubMed  PubMed Central  Google Scholar  * Ye, Z. et al.


Genome-wide analysis reveals positional-nucleosome-oriented binding pattern of pioneer factor FOXA1. _Nucleic Acids Res._ 44, 7540–7554 (2016). Article  PubMed  PubMed Central  Google


Scholar  * Rouillard, A. D. et al. The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins. _Database_ 2016, baw100 (2016). Article


  PubMed  PubMed Central  CAS  Google Scholar  * Vidal, S. J. et al. A targetable GATA2-IGF2 axis confers aggressiveness in lethal prostate cancer. _Cancer Cell_ 27, 223–239 (2015). Article


  CAS  PubMed  PubMed Central  Google Scholar  * Wal, M. & Pugh, B. F. Genome-wide mapping of nucleosome positions in yeast using high-resolution MNase ChIP-Seq. _Meth. Enzymol._ 513,


233–250 (2012). Article  CAS  Google Scholar  * Gaspar-Maia, A. et al. Chd1 regulates open chromatin and pluripotency of embryonic stem cells. _Nature_ 460, 863–868 (2009). Article  ADS  CAS


  PubMed  PubMed Central  Google Scholar  Download references ACKNOWLEDGEMENTS We thank UTHSA Next Generation Sequencing Facilities (1S10OD021805) for services rendered for the production of


ChIP-ePENS, MNase-seq, MNase-ChIP-seq, and thank Duke sequencing core for services rendered for the production of ATAC-seq and ChIP-seq data. We are grateful to Dr. B Frank Pugh of the


Department of Molecular Biology and Genetics at Cornell University for reading the manuscript and providing suggestive comments. This project was partially supported by grants from NIH


R01GM114142 (V.X.J.), U54CA217297 (V.X.J. and Q.W.), and R01GM120221 (Q.W. and V.X.J.). AUTHOR INFORMATION Author notes * These authors contributed equally: Tianbao Li, Qi Liu, Zhong Chen. *


These authors jointly supervised this work: Qianben Wang, Victor X. Jin. AUTHORS AND AFFILIATIONS * Department of Molecular Medicine, University of Texas Health Science Center at San


Antonio, San Antonio, TX, 78229, USA Tianbao Li, Qi Liu, Kun Fang & Victor X. Jin * Department of Pathology and Duke Cancer Institute, Duke University School of Medicine, Durham, NC,


27710, USA Zhong Chen, Furong Huang & Qianben Wang * Edmond H. Fischer Signal Transduction Laboratory, College of Life Sciences, Jilin University, Changchun, 130012, China Xueqi Fu


Authors * Tianbao Li View author publications You can also search for this author inPubMed Google Scholar * Qi Liu View author publications You can also search for this author inPubMed 


Google Scholar * Zhong Chen View author publications You can also search for this author inPubMed Google Scholar * Kun Fang View author publications You can also search for this author


inPubMed Google Scholar * Furong Huang View author publications You can also search for this author inPubMed Google Scholar * Xueqi Fu View author publications You can also search for this


author inPubMed Google Scholar * Qianben Wang View author publications You can also search for this author inPubMed Google Scholar * Victor X. Jin View author publications You can also


search for this author inPubMed Google Scholar CONTRIBUTIONS V.X.J. conceived the project. V.X.J. and Q.W. conceived the functional validations. Q.W. provided the critical inputs to the


project. T.L, Q.L., Z.C., and F.H. conducted the experiments. T.L. and Q.L. performed the data analyses. V.X.J., T.L., Q.L., Z.C., and Q.W. wrote the manuscript, with all authors including


K.F., X.F., and F.H. contributing to the writing and providing the feedback. CORRESPONDING AUTHORS Correspondence to Qianben Wang or Victor X. Jin. ETHICS DECLARATIONS COMPETING INTERESTS


The authors declare no competing interests. PEER REVIEW PEER REVIEW INFORMATION _Nature Communications_ thanks Tobias Straub, Gerhard Coetzee, and the other, anonymous, reviewer(s) for their


contribution to the peer review of this work. Peer reviewer reports are available. ADDITIONAL INFORMATION PUBLISHER’S NOTE Springer Nature remains neutral with regard to jurisdictional


claims in published maps and institutional affiliations. SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION PEER REVIEW FILE DESCRIPTION OF ADDITIONAL SUPPLEMENTARY FILE SUPPLEMENTARY DATA


1 SUPPLEMENTARY DATA 2 SUPPLEMENTARY DATA 3 SOURCE DATA SOURCE DATA RIGHTS AND PERMISSIONS OPEN ACCESS This article is licensed under a Creative Commons Attribution 4.0 International


License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source,


provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons


license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by


statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit


http://creativecommons.org/licenses/by/4.0/. Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Li, T., Liu, Q., Chen, Z. _et al._ Dynamic nucleosome landscape elicits a


noncanonical GATA2 pioneer model. _Nat Commun_ 13, 3145 (2022). https://doi.org/10.1038/s41467-022-30960-x Download citation * Received: 20 May 2020 * Accepted: 24 May 2022 * Published: 07


June 2022 * DOI: https://doi.org/10.1038/s41467-022-30960-x SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get shareable link Sorry, a


shareable link is not currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative