We describe here a relatively fast and simple approach for mapping genome-wide mammalian replication timing, from cell isolation to the basic analysis of the sequencing results. A genomic map of a representative replication program will be provided following the protocol.
Replication of the genome occurs during S phase of the cell cycle in a highly regulated process that ensures the fidelity of DNA duplication. Each genomic region is replicated at a distinct time during S phase through the simultaneous activation of multiple origins of replication. Time of replication (ToR) correlates with many genomic and epigenetic features and is linked to mutation rates and cancer. Comprehending the full genomic view of the replication program, in health and disease is a major future goal and challenge.
This article describes in detail the "Copy Number Ratio of S/G1 for mapping genomic Time of Replication" method (herein called: CNR-ToR), a simple approach to map the genome wide ToR of mammalian cells. The method is based on the copy number differences between S phase cells and G1 phase cells. The CNR-ToR method is performed in 6 steps: 1. Preparation of cells and staining with propidium iodide (PI); 2. Sorting G1 and S phase cells using fluorescence-activated cell sorting (FACS); 3. DNA purification; 4. Sonication; 5. Library preparation and sequencing; and 6. Bioinformatic analysis. The CNR-ToR method is a fast and easy approach that results in detailed replication maps.
Mammalian DNA replication is tightly regulated to ensure the precise replication of each chromosome exactly once during the cell cycle. Replication occurs according to a highly regulated order — multiple large genomic regions (~Mb) replicate at the beginning of S phase (early replicating domains) whereas other genomic regions replicate later at middle or late S phase (middle and late replicating domains)1. Most of the genome replicates at the same time in all tissues (constitutive ToR domains), whereas 30% – 50% of the genome, changes its ToR between tissues2, during differentiation3,4 and to a lesser extent also during cancer transformation5. Moreover, certain genomic regions replicate asynchronously6,7,8, namely there is a difference in the ToR between the two alleles.
ToR correlates with many genomic and epigenomic features including transcription levels, GC content, chromatin state, gene density, etc.1,9. ToR is also associated with mutation rates and types10,11 and therefore unsurprisingly, perturbations of the replication program are linked to cancer12,13. The causal relationship between ToR and chromatin structure is not yet understood. It is possible that open chromatin facilitates early replication. However, an alternative model suggests that the chromatin is assembled during replication and the different chromatin regulators present at the beginning and end of S phase lead to differential packaging of early and late replicating regions1,14. We have recently shown that the ToR shapes the GC content by affecting the type of mutations that occur in different genomic regions11.
Fluorescence in situ hybridization (FISH) is the main method for measuring ToR at individual loci. It is performed simply by counting the percentage of S phase cells that exhibit single FISH signals vs. the percentage of doublets for a given allele15,16. An alternative method, consists of pulse labeling the DNA with BrdU, sorting cells according to their DNA content to multiple time points along S, immunoprecipitating DNA containing BrdU, and checking the abundance of precipitated DNA with qPCR17.
Genomic ToR mapping can be achieved by two methods. The first method is a genomic version of the BrdU-IP based method described above, in which the quantification of the amount of precipitated DNA in each fraction is done simultaneously for the entire genome through hybridization to microarrays or by deep sequencing. The second method, CNR-ToR, is based on measuring the copy number of each genomic region of S phase cells and normalizing by the DNA content in G1 cells. In this method, cells are sorted by FACS into non-replicating (G1 phase) and replicating (S phase) groups (Figure 1). Cells in G1 have the same copy number in all genomic regions and thus their DNA content should be the same. On the other hand, the DNA copy number in S depends on the ToR, since early replicating regions underwent replication in most cells and therefore their DNA content is doubled, whereas late replicating regions have not replicated yet in most cells and therefore their DNA content will be similar to that of G1 cells. Hence the S to G1 ratio of DNA content is indicative of the ToR. The amount of DNA for each genomic region is measured either by hybridization to microarrays or by deep sequencing2,8. The advantages of the CNR-ToR method will be further discussed.
This paper describes the CNR-ToR method for genomic ToR mapping as described in Figure 2. The paper discusses the fine details of the entire process from collecting cells until the basic analysis of the results and the creation of genomic ToR maps. The protocol described in this paper has been successfully performed on various cell types grown in culture. Future improvements of this protocol can lead to the mapping of the ToR in vivo and in rare cell types.
Note: ToR can be measured only on growing, unsynchronized cells. The procedure should begin with at least 1 – 2 x 106 fast growing cells, which will usually result in ~1 x 105 cells in S phase (the rate limiting step). It is recommended to conduct each experiment using two or three replicates. The entire process of CNR-ToR can be completed within one week — two days should be dedicated to all steps up to library preparation, one to two days are needed for sequencing and an additional day is necessary for the initial data analysis.
1. Collection of Cells from Culture
NOTE: The protocol is written for cells growing in culture in 10 cm plates (containing approximately 2 – 5 x 106 cells), but can be easily adjusted to other platforms.
2. Fixation
NOTE: For this part, all steps should be done at 4 °C.
3. PI Staining
4. Sort
Figure 1. Cell cycle phase determination based on PI intensity. Histogram showing the distribution of the cellular DNA content (measured by PI-Area) of mouse embryonic fibroblast (MEF) population. The DNA content is used to sort the population into two sub-populations i) G1 cells (2N DNA content) and ii) S phase cells (2N – 4N DNA content), using the marked regions. Please click here to view a larger version of this figure.
NOTE: The purpose of the collection of G1 cells is to account for biases in the sequencing efficiency between different genomic regions. An alternative approach is to use G1 arrested cells from the same cell type. This approach gives cleaner results (since it minimizes S phase contamination) but it may introduce biases stemming from genetic differences between the arrested cells and the measured cells.
5. DNA Purification
6. Sonication
7. Library Preparation, and Sequencing
NOTE: Many library preparation kits and different sequencing platforms should work similarly to the ones used by us and mentioned in the materials section. Actually in the past, ToR maps were generated using a very similar method with microarray platforms2.
8. Analysis
NOTE: Data analysis is based on the method used by A. Koren et al.19.
A typical ToR map is shown in Figure 3 for mouse embryonic fibroblasts (MEFs). This figure demonstrates the analysis process since it shows both the dots, which are the normalized S/G1 ratio for individual windows (step 8.3), as well as the line which results from the cubic smoothing and interpolation (step 8.5).
Such maps capture the organization of the replication program, which is a patchwork of two types of ToR domains: i) large regions (in the order of a megabase) which replicate simultaneously (CTRs = constant ToR regions) at early, middle or late S; and ii) temporal transition regions (TTRs) in which the ToR changes gradually. CTRs are connected to each other by TTRs and together these two types of replication organization cover the entire genome.
Despite the relatively low sequencing coverage (one read every 300 bp) used for ToR mapping, the resulting maps are quite robust. Figure 4 shows the reproducibility of ToR maps between triplicates of MEF cells compared to triplicates of mouse pre-B cells in the same region. Comparison between such maps allows the identification of regions with differential ToR which are defined as regions in which the differences in the ToR maps between tissues are significantly larger than the differences between replicates.
Figure 2: Protocol scheme. Schematic diagram describing the procedure of the CNR-ToR method. Please click here to view a larger version of this figure.
Figure 3: Representative genomic ToR maps. Shown is the ToR of the entire Chromosome 1 of MEF cells and a closer look at a ~50 Mb region. Shown are the Z scores of the S/G1 values in varying size windows (dots) and the smoothed data (solid line). High S/G1 values correspond to early replication whereas low values correspond to late replication. Arrows point to examples of constant ToR regions (CTRs; red) and temporal transition regions (TTRs; green). Please click here to view a larger version of this figure.
Figure 4: Reproducibility of the repeats. (A) Smoothed ToR of MEF triplicates (blue) compared to pre-B triplicates (red) in an 18 Mb region on mouse chromosome 1. Orange arrows point to examples of differential regions between the cell lines. (B) Heat map of the spearman correlations between the 6 samples. Please click here to view a larger version of this figure.
CNR-ToR can be performed in principle on any eukaryotic proliferating cell population that can be divided by FACS to S and G1 phases (reviewed by Rhind N. and Gilbert DM20). The method described here has been adjusted to mammalian cells with a genome size of ~3 Gb such as human and mouse. Small changes in the CNR-ToR protocol (in cell preparation and sequencing depth) are needed, in order to adjust it to other eukaryotes. Attention must be paid to the collection of sufficient amounts of S phase cells since it is the rate-limiting step. Thus, a preliminary FACS for cell cycle should be performed to confirm that the experiment can be done and to validate the amount of cells in S phase. For cells which exhibit irregular cell cycles, it is recommended to prestain the cells with BrdU to detect dividing cells, and follow the Anti-BrdU manufacturer protocol until the FACS step. Usually cells are stained with 10-20 µM BrdU for 30 min.
It is recommended to perform each experiment in biological triplicates to detect inconsistencies that may stem from differences in cell growth and in the sorting process. This is especially important for the identification of small changes in the ToR maps between cell types and conditions. For fast growing cells, a single 10 cm tissue culture plate/flask (containing approximately 2-5 x 106 cells) is recommended for each replicate in order to get >1 x 105 cells in S phase. This amount should be increased when working with slow growing cells, since the percentage of cells in S phase is lower and thus one has to sort more cells in order to get a sufficient amount of cells in S.
When sorting cells, it is important to achieve the best possible separation of the cells according to their DNA content and thus a low flow rate is recommended. In order to maximize the temporal resolution, it is important to collect cells from the entire S phase. This is achieved by gating S as wide as possible (Figure 1). On the other hand, G1 gating should be narrow (starting from the G1 peak and to the left; Figure 1), in order to avoid both sub-G1 and early S-phase contaminations.
Sonication can be performed by various methods, but it is recommended to use a focused ultrasonicator which enables a robust and easy sonication without the need for calibration of each experiment. Nevertheless, when first establishing the protocol in the laboratory, it is recommended to calibrate the parameters for obtaining a size distribution of 200-700 bp.
As explained in the introduction, there are two main methods for genome-wide ToR mapping — BrdU-IP and CNR-ToR. Both methods give similar ToR maps (data not shown) despite the differences between them. The CNR-ToR method is limited in its enrichment range since the maximum difference between early and late regions is twofold, whereas with the BrdU-IP much higher enrichment is achieved, since early replicating regions will contain BrdU almost only in the early fraction. On the other hand, the BrdU-IP method resolution is limited by the number of S phase fractions collected. In its most common application, only two fractions (early versus late S) are compared, which results in a compromise of the fine temporal resolution. The CNR-ToR method, however, gives a continuous signal along the entire S phase. Moreover, the BrdU-IP method is based on immuno-precipitation which usually gives a much higher background than the CNR-ToR method. Another advantage of the CNR-ToR method is that it can be used retrospectively to extract ToR information from the sequencing data of unsynchronized cultures as recently described21. Finally, the CNR-ToR method is preferable due to its simplicity and due to the fact that it can be down-scaled since it is not based on immuno-precipitation.
The role of ToR in the complex network of genomic and epigenomic features remains to be deciphered. The relative ease of the CNR-ToR method allows for the expansion of the existing ToR data to include many of the natural conditions that cells experience and should be explored thoroughly and genome wide. This includes various replication stress situations, endoreduplication as well as various cancer transformations. The use of SNP data and higher sequencing depth will also allow measuring the ToR of each allele separately. Moreover, the further decrease in sequencing cost will enable increasing of the resolution of ToR maps, which may further aid in allow the identification of subtle changes between conditions. Other future applications may be achieved by improving the current methods and applying ToR measurement on small numbers of cells which will enable measuring ToR in vivo.
The authors have nothing to disclose.
We thank Oriya Vardi for assistance in generating figures. Work in the IS group was supported by the Israel Science Foundation (grant No. 567/10) and the European Research Council Starting Grant (#281306).
PBS | BI (Biological Industries) | 02-023-1A |
Trypsin-EDTA | BI (Biological Industries) | 03-052-1B |
15ml conical tube | Corning | 430790 |
5ml Polystyrene round Bottom tube with cell strainer cap | BD-Falcon | 352235 |
Ethanol | Gadot | 64-17-5 |
RNAse-A 10mg/ml | Sigma | R4875 |
Propidiom iodide 1mg/ml | Sigma | P4170 |
parafilm | Parafilm | PM-996 |
1.5ml DNA LoBind Eppendorf tubes | Eppendorf | 22431021 |
BSA | Sigma | A7906 |
1.7ml MaxyClear tube | Axygen | MCT-175-C |
magnetic beads – Agencourt AMPure XP | Beckman Coulter | A63881 |
Ultrasonicator | Covaris | M-series -530092 |
50 µl microTUBE AFA Fiber Screw-Cap 6x16mm | Covaris | 520096 |
Qubit fluorometer | Invitrogen | |
Qubit dsDNA High Sensitivity (HS) Assay Kit | Invitrogen | Q32854 |
Electrophoresis.2200 Tape station system | Agilent | D1000 ScreenTape |
Seqeuncing – Illumina NextSeq system | Illumina | SY-415-1001 |
Dneasy kit for DNA purification | Qiagen | 69504 |
PureProteom Magnetic Stand | Millipore | LSKMAGS08 |
Anti-BrdU/FITC | DAKO | F7210 |
FACS sorter | BD | FACSARIA III |
FACS software | BD | FACSDiva v 8.0.1 |