A chromatin immunoprecipitation (X-ChIP) assay against the histone mark H3K4me3 for the model organism Exaiptasia diaphana is presented. The specificity and effectiveness of the assay are confirmed by quantitative PCR and next-generation sequencing. This protocol enables the increased investigation of protein-DNA interactions in the sea anemone E. diaphana.
Histone post-translational modifications (PTMs) and other epigenetic modifications regulate the chromatin accessibility of genes to the transcriptional machinery, thus affecting an organism's capacity to respond to environmental stimuli. Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has been widely utilized to identify and map protein-DNA interactions in the fields of epigenetics and gene regulation. However, the field of cnidarian epigenetics is hampered by a lack of applicable protocols, partly due to the unique features of model organisms such as the symbiotic sea anemone Exaiptasia diaphana, whose high water content and mucus amounts obstruct molecular methods. Here, a specialized ChIP procedure is presented, which facilitates the investigation of protein-DNA interactions in E. diaphana gene regulation. The cross-linking and chromatin extraction steps were optimized for efficient immunoprecipitation and then validated by performing ChIP using an antibody against the histone mark H3K4me3. Subsequently, the specificity and effectiveness of the ChIP assay were confirmed by measuring the relative occupancy of H3K4me3 around several constitutively activated gene loci using quantitative PCR and by next-generation sequencing for genome-wide scale analysis. This optimized ChIP protocol for the symbiotic sea anemone E. diaphana facilitates the investigation of the protein-DNA interactions involved in organismal responses to environmental changes that affect symbiotic cnidarians, such as corals.
The 2022 report by the Intergovernmental Panel on Climate Change (IPCC) highlights that despite growing awareness and mitigation efforts, increasingly more intense and frequent marine heatwaves are putting coral reefs at high risk of extensive bleaching and mass mortality within the next decades1. In order to inform coral reef conservation and restoration efforts, the current and projected effects of changing environmental conditions on benthic cnidarians are being investigated on multiple biological levels to understand the underlying mechanisms of response and resilience2.
The availability of investigative tools applicable to benthic cnidarians is crucial to meet this challenge, and developing these tools requires an active effort to transfer knowledge and technologies established in other fields to marine organisms3. The obstacles to working with many coral species are partly alleviated by using model systems, such as the sea anemone Exaiptasia diaphana (commonly referred to as Aiptasia)4. These fast-growing, facultatively symbiotic sea anemones are relatively easy to keep in laboratory conditions, reproduce both sexually and asexually, and lack the calcium carbonate skeleton5. The open-access reference genome5 of E. diaphana facilitates the use of epigenetic methods requiring sequencing. However, features such as a high water content, mucus production, and low tissue amounts per individual are challenges to the establishment of replicable protocols, thus curbing epigenetic research on E. diaphana and other cnidarians with similar features.
Epigenetic modifications can alter the phenotype without changing the genomic nucleotide sequence of an organism by regulating chromatin-associated processes6. Cnidarian epigenetic regulation is mostly investigated in the contexts of evolutionary history and development7,8,9,10, symbiosis establishment and maintenance11,12,13, and response to environmental changes14,15. Specifically, variations in patterns of DNA methylation, which, in most cases, involve the addition of a methyl group to a cytosine base, have been observed in response to changing environmental conditions such as warming13 and ocean acidification16. DNA methylation patterns have also been shown to be heritable intergenerationally, emphasizing the role of epigenetics in coral acclimatization to environmental stressors17. Compared to DNA methylation, there have been relatively few studies on other important epigenetic regulators, such as non-coding RNAs11,18,19, transcription factors9,10, or histone post-translational modifications (PTMs) in cnidarians20. The investigation of DNA-associated proteins is especially demanding as the available methods require access to a reference genome of the study organism and are expensive because of the large sample sizes and high-specificity antibodies needed3. With a wide variety of chemical groups that form PTMs at specific histone residues, understanding chromatin modification landscapes in cnidarians, especially in the context of impending environmental stresses, remains a big challenge.
The aim of this work is to advance the investigation of histone PTMs, histone variants, and other chromatin-associated proteins in cnidarians by presenting an optimized chromatin-immunoprecipitation (ChIP) protocol for the coral model E. diaphana (original protocol by Bodega et al.21). ChIP can be combined with quantitative PCR to characterize locus-specific protein-DNA interactions or next-generation sequencing (NGS) to map these interactions across the entire genome. In general, proteins and DNA are reversibly cross-linked so that the protein of interest (POI) remains bound to the same locus that it is associated with in vivo. While the common cross-linking method widely used in the mammalian model system is usually kept to 15 min or below at room temperature, the cross-linking approach was optimized to allow the formaldehyde to penetrate through the mucus produced by E. diaphana more effectively. The tissue is then flash-frozen in liquid nitrogen, homogenized, and lysed to extract the nuclei from the cells. The loss of material in these steps is avoided by using only one lysis buffer and then directly moving on to sonication, which fragments the chromatin into ~300 bp long fragments. These fragments are incubated with an antibody specific to the POI down to PTM-level precision. The antibody-protein-DNA immunocomplex is precipitated using magnetic beads that bind to the primary antibodies, thereby selecting only the DNA segments that are associated with the POI. After cross-link reversal and clean-up of the precipitate, the yielded DNA segments can be used for qPCR or DNA library construction for sequencing to map the segments to a reference genome and, thus, identity the loci the POI is associated with. More detail on considerations for each step can be found in Jordán-Pla and Visa22.
A procedural overview is provided in Figure 1. The ChIP-Seq data are available in the NCBI Sequence Read Archive (SRA) under BioProject code PRJNA931730 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA931730).
Figure 1: ChIP protocol workflow. Overview of the ChIP protocol workflow, including the estimated duration of each step and optional stop points. Please click here to view a larger version of this figure.
1. Animal collection
2. Cross-linking
Table 1: Solutions and buffers used in the ChIP protocol. The ingredients and their respective concentrations are listed for each buffer used in the protocol. Please click here to download this Table.
3. Homogenization and lysis
4. Sonication
5. DNA extraction and fragment size check
NOTE: Before moving on to the IP, the size of the fragments needs to be checked. If the fragments are too large (>500 bp), the number of sonication cycles and/or the sonication intensity must be increased; if they are too small (<150 bp), the sonication time and/or intensity must be reduced.
Figure 2: Fragment size check. Following the sonication, a subset of the sample is de-crosslinked, purified, and run on an agarose gel to ensure that the chromatin has been sheared to fragment sizes between 150-500 bp. Please click here to view a larger version of this figure.
6. Immunoprecipitation (IP)
7. Recovery of the immunocomplexes with magnetic beads
NOTE: Always avoid drying out the magnetic beads; keep them covered with liquid, or replenish the solutions as quickly as possible. Always work on one sample after the other.
8. Wash, elution, and cross-link reversal
9. Protein and RNA digestion and DNA purification
Following the above protocol, DNA associated with the trimethylation of histone 3 lysine 4 (H3K4me3) was immunoprecipitated. The ChIP-seq grade antibody was previously used on the sea anemone Nematostella vectensis7 and was validated here by the immunofluorescent staining of E. diaphana tissue sections (Figure 3). While the DNA yield depends on the amount of input material, it was regularly around 100 ng/µL. The obtained DNA fragments were analyzed by sequencing and qPCR (primer list in Supplementary File 1).
A sequencing depth of 40 million reads yielded ~17.6 million uniquely mapped reads. After quality checks, the raw reads were trimmed and mapped to the E. diaphana genome using Bowtie24 (see Table of Materials). The model-based analysis of the ChIP-Seq data with MACS (see Table of Materials) identified a total of 19,107 peaks25. As expected for H3K4me3, most peaks were located near the transcriptional start site (TSS), and the peak count frequency declined sharply on both sides of the TSS, but especially toward the gene body (Figure 4).
Three genes with high peaks around their TSSs were identified from the sequencing data, and qPCR primers were designed to target several loci of high peaks within these genes. The qPCR data were normalized using the percentage of input method (Figure 5). High enrichment of H3K4me3 relative to the input and mock controls was observed. The percentage of input varied between 2.7% to 10.7%, with differences in enrichment observed across genes and between different loci within the same gene.
Figure 3: Immunofluorescent staining of H3K4me3. An E. diaphana tissue section was stained with (A) blue Hoechst nucleic acid stain and (B) a yellow fluorophore-labeled secondary antibody against the primary antibody against H3K4me3. (C) An overlap of A and B showing co-localization in the nucleus. Scale bar = 20 µm. Please click here to view a larger version of this figure.
Figure 4: High peak count frequency around the transcriptional start site. The distribution of the peak count frequency of H3K4me3 modification spanning upstream (−) and downstream (+) 2,000bp around the transcriptional start site (TSS). Please click here to view a larger version of this figure.
Figure 5: H3K4me3 ChIP-qPCR in Exaiptasia diaphana. The results are represented as a percentage (%) of input. Loci near the transcriptional start site of the respective genes (AIPGENE12312, AIPGENE26042, and AIPGENE5950) were chosen for ChIP-qPCR. Error bars indicate the standard deviation between replicates, n = 3. Please click here to view a larger version of this figure.
Supplementary File 1: List of primers used for qPCR. Please click here to download this File.
Following the above protocol, the obtained DNA was successfully used for ChIP-qPCR and ChIP-Seq. The same general peak profile for H3K4me3 previously reported from other organisms26,27,28 was obtained here, with the highest peak around the TSS and high enrichment at the expected sites in the ChIP-qPCR. The relatively high number of multiply mapped reads in the ChIP-Seq data might have been caused by PCR duplicates. The amount of uniquely mapped reads could be increased by changes to the DNA library preparation protocol to reduce PCR duplicates. There are several normalization methods for ChIP-qPCR data, with the percentage of input method presented here being more commonly used. This method normalizes the IP and mock samples directly to the input, with the disadvantage that the input is processed differently from the IP and mock samples during the ChIP, which may introduce errors. The alternative fold enrichment method normalizes the signal of the IP based on the signal of the mock using the same primer sets, thus giving a signal-over-background ratio. However, the signal intensity of the mock sample can vary strongly, which, in turn, has large effects on the data. A detailed discussion of ChIP-qPCR and data normalization can be found in Haring et al.29.
A major adjustment in the method presented above compared to common ChIP protocols is the much longer cross-linking time, from around 15 min for histone proteins to an overnight incubation. The major aim of this adjustment is to facilitate the penetration of the fixative formaldehyde through the mucus layer into the deeper tissue of E. diaphana to preserve the protein-DNA interactions inside the nucleus. Optimization of this step is critical to ensure sufficient cross-linking to preserve the protein-DNA interactions without cross-linking so much that the shearing of the chromatin by sonication becomes ineffective. Adding detergents such as SDS can increase the sonication efficiency as well29. In addition, a long incubation with formaldehyde was found to ease the homogenization step and resulted in a more finely and evenly ground sample compared to fresh or frozen anemones that were homogenized before the cross-linking step. The recommended ranges of fragment lengths vary between 100 bp and 1,000 bp29,30 and may depend on the target protein. It is critical that each user optimizes the sonication conditions to reach the desired fragment length with as little sonication power as possible, as over-sonication may denature the proteins and, thus, impact the IP31. Another limitation of ChIP is the amount of material required, which particularly affects relatively small organisms such as anemones; this issue was addressed by reducing the loss of sample between steps. After homogenizing the sample, it was lysed immediately, thereby omitting several common nuclei preparation steps. The sample pool obtained from 20 anemones generally contained enough chromatin for three IPs and both mock and input controls. The amount of starting material (i.e., the number of anemones) could be further reduced in the future, especially when performing a ChIP with only one target protein. Depending on the intended downstream method, the controls should be adjusted; for ChIP-qPCR, it should be considered to include additional controls29, while for ChIP-seq, the mock can be omitted in favor of an input control.
While antibody validation is outside of the scope of this protocol and was, thus, only briefly touched on here, it is a critical step before performing ChIP. Especially when using commercial antibodies on invertebrate species, the availability of specific antibodies for the desired targets can be a limitation. In the first step, the H3 N-terminal tail sequence of E. diaphana and other model organisms, including zebrafish and mice, was compared and found to be highly conserved12. The antibody specificity was then tested using immunofluorescence, which co-localized the signals of the nucleic acid and antibody. The peak profile of H3K4me3 around the TSS obtained from the sequencing data gives further confidence regarding the specificity of the antibody.
Another consideration regarding antibody specificity is the possibility of any interaction with the symbiotic dinoflagellates that E. diaphana, as well as many other anthozoans, host in their cells. Transcriptome analyses in dinoflagellates of the genera Lingulodinium32 and Symbiodinium33 have found histone-encoding genes, including core histone H3 and several H3 variants at low expression levels, and the extent of functional conservation of the histone code is unclear34. Marinov and Lynch34 compared the sequence conservation of H3 variant N-terminal tails within and between dinoflagellate species, and Symbiodiniaceae species showed a high divergence around lysine 4 in the tail sequence, especially when considering the congruence of adjacent amino acids to lysine 4 as a factor. This region has also been shown to diverge from other model species such as Arabidopsis thaliana, Drosophila melanogaster, and Saccharomyces cerevisiae, which match the sequence of E. diaphana12. Therefore, the risk of unintended interaction of the antibody against H3K4me3 with dinoflagellate H3 is low. In addition, the sequences are only aligned to the E. diaphana genome, and the qPCR primers should be specific to E. diaphana sequences, providing an extra layer of filtration of any unintendedly precipitated sequences.
The presented ChIP protocol yields sufficient DNA for qPCR as well as next-generation sequencing, and while individual optimization by each user will likely be required, it provides a starting point for the increased investigation of protein-DNA interactions in benthic cnidarians, possibly in the context of symbiosis and environmental changes.
The authors have nothing to disclose.
None.
1 kb Plus DNA Ladder | Invitrogen | 10787018 | |
100% Ethanol | |||
15 mL falcon tubes | |||
16% Formaldehyde Solution (w/v), Methanol-free | Thermo scientific | 28906 | |
5 mL tube | |||
5PRIME Phase Lock tubes | Quantabio | 2302820 | Phase lock gel – light |
AGANI needle 25 G | Terumo | AN*2516R1 | |
Agarose | |||
Bovine Serum Albumin | Sigma-Aldrich | A7030 | |
Bowtie | open source, available from https://bowtie-bio.sourceforge.net/index.shtml | ||
Branson sonifier 250 | Branson | Sonicator | |
Centrifuge | Eppendorf | 5430 R | With rotors for 15 mL and 1.5 mL tubes |
ChIP-grade antibody (here polyclonal H3K4me3 antibody) | Diagenode | C15410003 | |
Complete EDTA-free Protease Inhibitor Cocktail | Roche | 11873580001 | 2 tablets/mL water for 100x stock |
Cover glass | vwr | 48393 194 | |
Disposable Spatula | vwr | 80081-188 | |
DNA purification kit (here QIAGEN QIAquick PCR purification kit) | QIAGEN | 28104 | |
Dounce tissue grinder | Wheaton | tight pestle | |
Dulbecco's Phosphate Buffered Saline (DPBS) 1x | gibco | 14190-094 | |
Dynabeads Protein G | Thermo Fisher Scientific | 10004D | |
EDTA | Sigma-Aldrich | 3609 | |
EGTA | Sigma-Aldrich | 324626 | |
Electrophoresis system | |||
Ethidium bromide | |||
FASTQC v0.11.9 | software | ||
Gel Doc EZ Documentation System | Bio-Rad | 1708270 | Gel imaging system |
Glycine | Sigma-Aldrich | 50046 | |
Injekt-F Tuberculin syringe 1 mL | B. Braun | 9166017V | |
Linear acrylamide (5 mg/mL) | Invitrogen | AM9520 | |
Low-retention 1.5 mL tube | |||
Magnetic separation rack | for 1.5 mL tubes | ||
Microscope | |||
Microscope slide | |||
Model-based Analysis of ChIP-Seq (MACS) | open source | ||
Mortar and pestle, porcelain | |||
NanoDrop 2000c spectrophotometer | Thermo scientific | ND-2000C | |
N-lauroylsarcosine | Sigma-Aldrich | 61739 | |
Nuclease-free water | |||
Pasteur pipette | |||
Phenol:Chloroform:Isoamyl Alcohol Mixture (25:24:1) | Sigma-Aldrich | 77617 | |
Proteinase K (20 mg/mL) | Ambion | AM2546 | |
Purple Loading Dye 6x | New England BioLabs | B7024S | |
Qubit 2.0 Fluorometer | Invitrogen | ||
RNase Cocktail Enzyme Mix | Invitrogen | AM2286 | RNase A = 500 U/mL, RNase T1 = 20000 U/mL |
Sodium acetate | Sigma-Aldrich | S2889 | |
Sodium chloride | Sigma-Aldrich | S9888 | |
Sodium deoxycholate | Sigma-Aldrich | 30970 | |
SYBR Safe DNA Gel Stain | Invitrogen | S33102 | |
TAE buffer | |||
Thermomixer | Eppendorf | 5384000012 | |
Tris base | Sigma-Aldrich | 10708976001 | |
Triton X-100 solution | Sigma-Aldrich | 93443 | |
Trypan Blue Stain (0.4%) | Gibco | 15250-061 | Danger: may cause cancer. Suspected of damagin fertility or the unborn child |
Tube rotator SB3 | Stuart | ||
UltraPure SDS Solution, 10% | Invitrogen | 15553027 | |
Vacuum pump |