Here, we outline the procedure for analyzing replication progression through pathogenic, structure-prone repeats using 2-dimensional gel electrophoresis.
Two-dimensional neutral/neutral gel-electrophoresis (2DGE) emerged as a benchmark technique to analyze DNA replication through natural impediments. This protocol describes how to analyze replication fork progression through structure-prone, expandable DNA repeats within the simian virus 40 (SV40)-based episome in human cells. In brief, upon plasmid transfection into human cells, replication intermediates are isolated by the modified Hirt protocol and treated with the DpnI restriction enzyme to remove non-replicated DNA. Intermediates are then digested by appropriate restriction enzymes to place the repeat of interest within the origin-distal half of a 3-5 kb-long DNA fragment. The replication intermediates are separated into two perpendicular dimensions, first by size and then by shape. Following Southern blot hybridization, this approach allows researchers to observe fork stalling at various structure-forming repeats on the descending half of the replication Y-arc. Furthermore, this positioning of the stall site allows the visualization of various outcomes of repeat-mediated fork stalling, such as fork reversal, the advent of a converging fork, and recombinational fork restart.
Short tandem repeats (STR) are small, typically 2-9 base pairs (bp), repetitive sequences of DNA that constitute around 3% of the human genome1. STR play an important role in gene regulation2; however, their repetitive composition leaves them prone to non-canonical DNA secondary structure formation and subsequent genetic instability3,4. From left-handed helices to hairpins/cruciforms, to three and four-stranded helices, these alternative DNA structures cause intrinsic challenges for the replisome. A natural prerequisite for secondary structure formation is DNA unwinding, which is a prerequisite for DNA replication. This presents a unique conundrum for genome functioning as many of these structures can form during replication, hindering replisome progression and ultimately causing replication fork stalling5,6,7, or in severe cases, fork collapse and DNA breakage8,9. Both the restart of stalled forks and DNA repair pathways have been shown to lead to repeat instability, such as repeat expansions10,11 and complex genome rearrangements (CGR)12,13. These events can result in the development of roughly 60 human diseases known as repeat expansion disorders, including Fragile X syndrome, Huntington's disease, Friedreich's ataxia, and others14,15 as well as CGR diseases, such as Emmanuel syndrome16. Therefore, to better understand the mechanisms of human disease driven by repeat instability, it is imperative to study the details of replication fork progression through those repeats.
A technique for studying replication progression emerged in the mid-1980s when Brewer and Fangman sought to provide direct evidence that replication initiation in Saccharomyces cerevisiae occurs at autonomous replication sequence (commonly known as ARS) elements17. In doing so, they separated structures of yeast replication intermediates in agarose, adapting an earlier method from Bell and Byers known as 2-dimensional Neutral/Neutral gel-electrophoresis (2DGE)18. This technique utilized the fact that nonlinear DNA travels differently in agarose gel than its linear equivalent of the same mass. More specifically, in 2DGE, isolated DNA is separated in two perpendicular dimensions, first primarily by size and then primarily by shape, to create a comprehensive map of replication at a particular region of interest. In their original paper, Brewer and Fangman demonstrated this as an arc composed of "simple Y" structures or replication forks bridging unreplicated DNA to their replicated counterparts. They further describe other observed intermediates as "bubbles" and "double Ys," representing replication origins and converging forks, respectively.
2DGE can be used to study the relative populations of DNA replication intermediates at a given time. Therefore, if one population of intermediates is more prevalent than another, this would be evident upon visualization. This makes 2DGE an especially useful tool for studying replication progression through challenging sequences, like structure-forming repeats. For example, if the region analyzed contains a sequence capable of inducing replication fork stalling, this would present as a bulge on the arc (Figure 1A), indicating an accumulation of replication forks at that locus. This can be seen with the replication of both hairpin-forming repeat sequences in yeast19,20,21 and triplex-forming repeats in human cells22,23,24. In addition to stalling, 2DGE can be used to observe DNA structures that do not conform to the standard simple Ys formed during replication, as in the case of recombinant intermediates25. These intermediates have a heavier and more branched X-shaped structure and, therefore, travel slower in both the first- and second dimensions than standard replication forks. Similar results can also be observed with regards to replication fork reversal20,24,26. In response to strong replication stress, eukaryotic cells have been shown to utilize replication fork reversal to rescue stalled forks. These reversed forks have similar molecular weight to stalled forks; yet their chicken-foot structure results in slower electrophoretic mobility in the second dimension relative to their Y-shaped complements, resulting in an extension up and out from the arc.
Figure 1: 2D Gel electrophoresis analysis of DNA replication. (A) Schematic of a typical 2DGE depicting replication through a structure-forming repeat capable of inducing fork stalling. Intermediate size and structure will influence electrophoretic mobility. (B) Sample Y-arc with ascending and descending arms respectively labeled. Abbreviation: 2DGE = Two-dimensional neutral/neutral gel-electrophoresis. Please click here to view a larger version of this figure.
Naturally, one of the most important aspects of 2DGE concerns the quality and quantity of replication intermediates. However, the resolution of 2DGE analysis of replication through endogenous loci in mammalian cells is insufficient for a single copy target sequence within the 6 × 109 bp diploid human genome, though it has been done for multi-copy genes, such as heavily amplified DHFR locus27 or ribosomal RNA28. SV40-based replication is an efficient and well-characterized means of studying replication in eukaryotic cells29. It provides a reliable model of eukaryotic replication that utilizes most of the host replisome machinery to replicate the viral genome, which is parceled into nucleosomes upon infection30,31. Two notable exceptions from the mammalian replisome are that the T-antigen (Tag), instead of the host CMG complex, serves as replicative DNA helicase, and DNA polymerase delta synthesizes both leading and lagging DNA strands32. We have taken advantage of this system by placing pathogenic stretches of structure-forming repeats downstream from an SV40 origin of replication within a plasmid that was originally created in the Massimo Lopes lab22. Importantly, this plasmid also contains the gene encoding for Tag itself, thus resulting in its constitutive and extremely potent replication upon transfection into a variety of cultured human cells. This feature gives rise to a large quantity of products, ideal for 2DGE analysis of the intermediates formed during and in response to the replication of pathogenic repeats in human cells. Here, we describe a detailed method of visualizing the replication of structure-forming repeats within the SV40-based human episome using 2-dimensional gel electrophoresis.
NOTE: The plasmid designed for our outlined 2DGE analysis in mammalian cells should contain an SV40 origin of replication several kb upstream of structure-prone repeats (Figure 2). Leading and lagging synthesis should be kept in mind when choosing what orientation relative to the origin the repeats should be cloned into the plasmid.
Figure 2: Digestion of repeat-containing plasmid for 2DGE analysis. Structure-prone repeats are depicted several kb downstream from the right-moving replication fork. Digestion with unique cutters 1 and 2 will place the repeat sequence on the descending arm of the Y-arc, given the sequence beyond the halfway point of the digested fragment. Abbreviation: 2DGE = Two-dimensional neutral/neutral gel-electrophoresis. Please click here to view a larger version of this figure.
1. Plasmid transfection into mammalian cells
2. Isolation of replication intermediates
3. Sample preparation and 2-dimensional gel electrophoresis
Figure 3: Excision of first-dimension intermediates prior to second-dimension separation. Following visualization of the ladder, the mobility of unreplicated fragments can be estimated. (I) This value can then be used to determine appropriate cut sites (a and b) to excise it and its replicated counterparts (II). The section of the gel should then be rotated and placed in the position of wells for the second-dimension separation. Abbreviation: CW = clockwise. Please click here to view a larger version of this figure.
4. Southern blotting and hybridization with radiolabeled probe
Figure 4: Assembly of Southern blot. Comprehensive schematic of a typical apparatus used for Southern blot transfer of intermediates from the second dimension onto a nylon membrane. Please click here to view a larger version of this figure.
If successful, upon visualization, a sharp arc of replication forks can be observed extending up and out from the massive 1n spot (Figure 5A). The size of a fragment, or percentage that is replicated, determines the fragment's mobility in the first dimension. As the intermediates develop a more jointed structure, they will begin to travel more slowly in the second dimension. Therefore, if an intermediate has traveled slowly in both dimensions, it can be asserted that it is a highly replicated molecule with significant junctional variation from conventional replication forks. In the case of lagging-strand synthesis of the (GAA)100 repeat, fork stalling is indicated by a darkened defined spot on the arc (I), representing an accretion of replication forks upon encounter of the repeat (Figure 5B), likely due to triplex formation. This is accompanied by heavier, more branched intermediates (II and III) above the stall site. We have described the nature of these intermediates in more depth in Rastokina et al. 202324. In short, these likely represent a combination of forks reversing in response to the stall (II), in addition to the advent of the converging fork (Figure 5B) (III).
One observation that may be made upon visualization is a less-than ideal, broad arc. At worst, this may present as a complete doubling of the arc. Typically, this represents poor separation of intermediates in the first dimension. This may be attributed to several factors, the most likely of which is either salt or protein contamination in the replication intermediate sample. A phenol:chloroform purification after digesting intermediates, followed by extensive washing with 70% ethanol, can minimize this risk. However, it should be noted that additional phenol exposure can increase the chance of denaturing intermediates, which may present as a darkened intensity of linear DNA under the arc, representing collapsed replication intermediates (Figure 5C).
In the worst-case scenario, visualization may result in a complete absence of replication intermediates, evidenced by the lack of arc (Figure 5D). It is unlikely that the absence of an arc is attributed to denatured replication intermediates, as there is no darkened spot under the expected area of the arc. Given the presence of a particularly small 1n spot (IV), the overall quantity of DNA present in this depicted example is minimal, and, therefore, this issue may be caused by under-replication of the plasmid or an overall poor transfection or isolation of DNA.
Figure 5: Potential outcomes of 2DGE analysis. (A) Successful 2DGE analysis of replication through the structure-forming (GAA)100 repeat in HEK293T cells. (I) refers to forks stalled at the repeat, whereas (II) and (III) refer to more structured intermediates that arise in response to the stall. An illustration of the 2.7 kb fragment is depicted above. (B) Representative intermediates that arise during replication of the (GAA)100 repeat. Using siRNA knockdowns of replication fork reversal and restart genes, genetic controls were established to identify the nature of these intermediates. (C) Depiction of unresolved replication intermediates that may appear if forks are denatured during isolation. (D) An unsuccessful 2DGE experiment as shown by a complete absence of replication intermediates. IV represents linear, unreplicated fragments of the digested plasmid. Abbreviation: 2DGE = Two-dimensional neutral/neutral gel-electrophoresis. Please click here to view a larger version of this figure.
2DGE provides a semi-quantitative and comprehensive image of the relative populations of intermediates that arise during the replication of a particular sequence. Given that the fragile molecular structures of replication forks must be maintained throughout this procedure, great care should be implemented to prevent physical shearing and chemical denaturation. Therefore, it is highly recommended that any alkaline treatment be avoided during plasmid isolation. To avoid this, we and others have implemented a modified form of the Hirt isolation of DNA, established by Hirt in 196733. Here, DNA is separated from proteins, RNA, and other cellular debris by treating cell lysate with 1 M NaCl at 4 °C overnight. While this method has proven exceptional for preserving the quality of replication intermediates, it does not appear to fully separate plasmid DNA from genomic DNA. This should be kept in mind when designing the sequence for radiolabeled probing, as the designed probe should not have high sequence similarity with any genomic DNA to best avoid off-target binding. Further, the integrity of replication intermediates may be greatly increased through UV-psoralen crosslinking. Here, cells can be incubated with psoralen for several minutes in the dark before being UV irradiated at 366 nm34, thus creating covalent bonds between the DNA molecules. In addition to reinforcing the cohesiveness of intermediate structures, this may also alleviate any concern of structures forming during isolation rather than in vivo.
One aspect that should be considered when designing this experiment is repeat placement on the resolved final arc. The first half of the arc extending out from the 1n spot is denoted as the ascending arm, while the second half is known as the descending arm (Figure 1B). To observe replication fork stalling, repeat placement on either arm of the arc should be sufficient. However, to observe more structured intermediates that arise at repeat sequences, such as postreplicative junctions, we recommend placing the repeat sequence on the descending arm. This is largely due to the slower electrophoretic mobility of these intermediates in both dimensions, which may result in co-migration with simple Ys structures farther along in their progression if the repeat-containing sequence were placed on the ascending arm, thus hindering any detailed analysis. For the best resolution, we recommend placing the repeat sequence somewhere between 60% and 90% within the fragment of interest relative to the origin of replication.
Attention to detail should be utilized during the DNA transfer step and subsequent treatment of the membrane. It is extremely important that the Southern blot is assembled in a manner that facilitates an even and tight transfer of DNA to the membrane. This means that all components on the transfer are free of air bubbles and that the apparatus is uniform across its surface. To assist in this, it is standard to flip the second-dimension gel over prior to adding it to the Southern blot. The bottom of the gel is assumed to be flatter than the top; therefore, flipping the gel over ensures the smoothest possible transfer of DNA to the membrane.
If a strong background is present in the final resolution of the blot, this may be indicative of the non-specific binding of the radiolabeled probe. This can be alleviated in a number of ways. First, it should be checked that the probe does not contain high sequence similarity to any genomic DNA. This can be easily verified using the Nucleotide Basic Local Alignment Search Tool (BLAST), which is readily available through the NIH website35. Otherwise, nonspecifically bound probes can be removed through additional stringent washes. We recommend beginning with two additional washes of buffer 1 (0.1x SSC, 0.1% SDS) at 42 °C in 15 min intervals each; however, more washes may be needed. Finally, the temperature at which hybridization occurs may be altered as well. For most probes containing a G/C content of 40-60%, 65 °C tends to be sufficient, though this is not a static value. Efficient binding of the probe is dependent on its melting temperature and, therefore, may require alterations to the hybridization temperature.
Given that 2DGE provides an overview of the relative populations of replication intermediates at a given time, one of its greatest drawbacks is that it does not provide a strong measure of replication progression timing. For this purpose, we recommend using single-molecule analysis like DNA combing. Further, while 2DGE allows for analysis of the replication intermediates that arise in response to stalling, genetic controls must be established to uncover their exact identity, which can be timely. Although costly, electron microscopy remains superior in identifying the exact structure of DNA molecules.
The authors have nothing to disclose.
We thank Jorge Cebrian and Anastasia Rastokina who started developing this approach in our lab, Massimo Lopes for providing us with pML113 plasmid and invaluable advice, Ylli Doksani for insightful discussions, and members of the Mirkin lab for their support. The work in the Mirkin lab is supported by the National Institute of General Medical Sciences [R35GM130322] and NSF-BSF [2153071].
10x TBE Buffer | Bio Rad | 1610733 | |
20x SSC Buffer | Fisher Scientific | BP1325-1 | |
293T cells | ATCC | CRL-3216 | |
a-32P dATP, 3000 Ci/mmol | Revvity | BLU512H250UC | |
Agarose | Fisher Scientific | BP160-500 | |
Amersham Hybond-N+ | Fisher Scientific | RPN303B | |
BAS Storage Phosphor Screens | Fisher Scientific | 28956482 | |
Church and Gibert's hybriddization buffer | Fisher Scientific | 50-103-5408 | |
DecaLabel DNA labeling kit | ThermoFisher Scientific | K0622 | |
DMEM, high gluctose, GltaMAX Supplement, pyruvate | ThermoFisher Scientific | 10569010 | |
DpnI | New England Biolabs | R0176S | Additional restriction enzymes will need to be purchased as well |
EDTA 0.5 M, pH 8 | Fisher Scientific | BP2482500 | |
Ethanol, 70% | Fisher Scientific | BP82031GAL | |
Fetal Bovine Serum | VWR | 97068-085 | |
Hydrochloric acid solution, 12 M | Millipore Sigma | 13-1683 | |
Isopropanol | Fisher Scientific | BP26184 | |
jetPRIME DNA and siRNA Transfection Reagent with Buffer | VWR | 101000027 | |
MycoZap Plus-CL | VWR | 75870-448 | |
NaCl | Millipore Sigma | 746398-500G | |
Nalgene Oak Ridge High-Speed Centrifuge Tubes | ThermoFisher Scientific | 3139-0050 | |
Phosphate Buffer Saline, pH 7.4 | ThermoFisher Scientific | 10010023 | |
Phosphate Buffer Saline, pH 7.5 | ThermoFisher Scientific | 10010024 | |
Proteinase K | ThermoFisher Scientific | EO0491 | |
Proteinase K | ThermoFisher Scientific | EO0492 | |
Pure Cellulose Chromatography Paper | Fisher Scientific | 05-714-4 | |
Pure Cellulose Chromatography Paper | Fisher Scientific | 05-714-5 | |
Ruler | Fisher Scientific | 09-016 | |
Scalpel | Fisher Scientific | 12-460-451 | |
Sodium dodecyl sulfate | Millipore Sigma | 436143-25G | |
Sodium hydroxide | Fisher Scientific | S25548 | |
Sorval LYNX 4000 Superspeed Centrifuge | ThermoFisher Scientific | 75006580 | |
Sub-cell Horizontal Electrophoresis System | Bio Rad | 1704401 | |
TH13-6 x 50 Swinging Bucket Rotor | ThermoFisher Scientific | 75003010 | |
Tris-HCl 1 M, pH 7.5 | Fisher Scientific | BP1757-500 | |
Trypsin-EDTA (0.25%), phenol red | ThermoFisher Scientific | 25200056 |
.