This protocol provides the steps from DNA extraction to experimental set-up for digital droplet PCR (ddPCR), including analysis for the identification and quantification of non-homologous end-joining (NHEJ) events at target sites following gRNA-induced Cas9 cleavage and DNA repair. Other uses of this method include applications such as polymorphism detection and gene-editing variant verification.
Recent advances in mosquito genomics and genetic engineering technologies have fostered a need for quick and efficient methods for detecting targeted DNA sequence variation on a large scale. Specifically, detecting insertions and deletions (indels) at gene-edited sites generated by CRISPR guide RNA (gRNA)/Cas9-mediated non-homologous end-joining (NHEJ) is important for assessing the fidelity of the mutagenesis and the frequency of unintended changes. We describe here a protocol for digital-droplet PCR (ddPCR) that is well-suited for high-throughput NHEJ analysis. While this method does not produce data that identifies individual sequence variation, it provides a quantitative estimate of the sequence variation within a population. Additionally, with appropriate resources, this protocol can be implemented in a field-site laboratory setting more easily than next-generation or Sanger sequencing. ddPCR also has a faster turn-around time for results than either of those methods, which allows a more quick and complete analysis of genetic variation in wild populations during field trials of genetically-engineered organisms.
Gene drives have immense potential to control insect populations of medical and agricultural relevance1,2,3,4,5. For example, gene-drive systems based on CRISPR Cas nucleases and guide RNAs (gRNAs) can be used to modify vector mosquito populations by introducing traits that confer refractoriness to malaria parasites leading to reduced transmission and less disease1,4,5. The gene-drive system copies itself and the associated trait from one homologous chromosome to another in the pre-meiotic germ cells, and this ensures that the majority of the offspring inherit the drive and create the potential for long-lasting and sustainable population modification in the field. However, one disadvantage of the Cas/gRNA-based methods is the possibility of generating insertion and deletion (indel) mutations through non-homologous end-joining (NHEJ) DNA repair, resulting in the generation of drive-resistant alleles, which when accumulated to a high enough frequency in the population, can stop the drive system from spreading1,2,3,4. This protocol details a high-throughput and reliable method that can determine the prevalence and relative quantity of indel mutations, at both the population and individual level, during Cas/gRNA-based gene drive.
Next-generation sequencing (NGS) methods provide unparalleled sequencing resolution. However, the cost and technical requirements associated with NGS prohibit routine testing and limit its use as a high-throughput method to assess indels6,7,8. Traditional PCR quantification methods have long been used as the standard evaluation procedure for genome indels; however, these methods are labor-intensive, take a long time to procure data, and have a high degree of variability. Digital-Droplet PCR (ddPCR) has been proven to be more sensitive at detecting mutations than Sanger sequencing in some applications and has a lower detection limit than NGS in others6,7,8,9. Moreover, the cost to assess a sample set and turn-around time for obtaining results is less expensive and faster, respectively, for ddPCR than either Sanger sequencing or NGS9. Using a dual-probe system, the Drop-Off assay identifies NHEJ alleles based on the absence of wild-type (WT) sequence at the gRNA-directed target Cas9 cut site. In this assay, a short amplicon including the predicted cut site of the Cas/gRNA-based system is amplified with a specific primer pair. One fluorescent probe is designed to bind to a conserved region of the amplicon and another fluorescent probe recognizes the WT sequence of the cut site. In the presence of an NHEJ allele, the latter will not bind to the amplicon.
The use of ddPCR provides the ability to design primers to target deletions, single base-pair differences and insertions, which will allow for NHEJ profiling in mosquito population analyses9. Given these attractive features, we created a protocol for ddPCR for high-throughput detection of indels generated from a Cas/gRNA-based gene-drive system in mosquitoes.
1. DNA extraction
2. ddPCR reactions and droplet generation preparation
3. PCR reaction preparation
4. Droplet generation
5. PCR
6. Droplet reading
7. Analysis
An application of this procedure appears in Carballar-Lejarazú et al.9. The ddPCR Drop-off assay utilizes two fluorescent probes to discern WT and indel sequences: A FAM probe binds to a conserved sequence within the amplicon, whereas the HEX probe targets the WT sequence of the targeted site (Figure 4A). In the presence of an indel, the HEX probe will not bind. Representative results can be found in Figure 2, Table 1, and Table 2 of Carballar-Lejarazú et al.9. Using this protocol, ddPCR has been proven to detect a wide variety of CRISPR-Cas9 induced NHEJ events and quantify the NHEJ frequency in an individual or pooled sample. Fifteen different pooled samples of 10 mosquitoes each contained various NHEJ alleles (Table 2 of Carballar-Lejarazú et al.9). These were analyzed with ddPCR using the protocol and parameters presented here. Results from Table 19 show that all 15 samples carried 100% indel alleles as identified by the Drop-off assay (Figure 4B). In another experiment, 11 pooled samples of WT mosquitoes and NHEJ mosquitoes with different NHEJ percentages (0%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, and 100%) were examined with this ddPCR protocol, and the results (Figure 2; Carballar- Lejarazú et al.9) showed that the identified percentage is close to the compared technique of Indel Detection by Amplicon Analysis (Figure 4C).
Figure 1: Experimental set-up and procedure. (A) Cartridge preparation for Droplet generation. (Top) Samples are filled in the middle row of the cartridge, while oil is filled in the bottom row. (Bottom) Top row filled with emulsified droplets after droplet generation. (B) Droplet generator with a cartridge filled with sample and covered by a gasket in place. (C) 96-well plate covered with foil seal in a Thermo-Cycler. (D) Droplet Reader with 96-well-plate in place with a metal cover latched over the plate to secure it. Please click here to view a larger version of this figure.
Figure 2: Droplet reading. (A) Software interface for droplet reading. Orange boxes show wells with samples. Gray boxes are empty wells. Experimental parameters are set up in the Edit Tools panel (right-hand side). Each sample can be edited by clicking on the respective sample box. Select Drop Off (DOF) for Experimental Type. In sample information, fill in the appropriate information for the sample's Name and Type, as well as SuperMix. Choose the Basic Drop-Off for the Assay Information. For the WT sample, choose WT for the Target Name, Ref for Target Type, and both FAM and HEX for Signal Ch1 and Ch2, respectively. For NHEJ samples, fill in the appropriate name for the Target Name, choose Unknown for the Target Type, and choose FAM for Signal Ch1. Leave Signal Ch2 at None. (B) Droplet count results for multiple samples. Please click here to view a larger version of this figure.
Figure 3: Drop-Off assay analysis. (A) Cluster 2D plot for the droplet count of the WT and NHEJ alleles. In the 2D Amplitude tab, all droplets are unclassified by default. In this figure, colors are manually assigned for distinguishing. The orange dots cluster are WT allele counts obtained by binding of both FAM and HEX probes at the reference sequence and target site sequence, respectively. Blue dots represent scores of droplets that have FAM binding to the reference sequence but no HEX binding at the target site sequence (hence drop-off of HEX). Gray dots are empty droplets that don't have either FAM or HEX binding. (B) Ratio/Abundance graphs of NHEJ events. Under the Ratio tab, select Fractional Abundance for a graph with the correspondent percentage of NHEJ events. Please click here to view a larger version of this figure.
Figure 4: Application of Drop-Off assay with ddPCR for non-homologous end-joining identification and quantification in the transgenic Anopheles stephensi line, AsMCRkh1. (A) Schematic presentation of the ddPCR Drop-Off assay to detect mutations at a targeted DNA site with a dual-probe system. An amplicon of 150-400 bp is amplified with the forward and reverse primers. A FAM-labeled probe is designed to bind to a conserved sequence of the amplicon, whereas a HEX-labeled probe is designed to bind to the WT gRNA targeted site. (B) Detection of various types of indels with ddPCR. Fifteen pools of 10 AsMCRkh1 mosquitoes each containing various types of indel, including insertion, deletion, and substitution, were analyzed with the ddPCR Drop-Off assay. Details of mutations and sequences can be found in Table 2 and Table S3 of Carballar et al.9. (C) Quantification of NHEJ in mixed samples of AsMCRkh1 and WT mosquitoes with various ratios (10:0, 9:1, 8:2, 7:3, 6:4, 5:5, 4:6, 3:7, 2:8, 1:9, and 0:10) using ddPCR and a compared technique of Indel Detection by Amplicon Analysis9. Images adapted from Carballar-Lejarazú et al. Biotechniques. 68(4):172-179 (2020)9. Please click here to view a larger version of this figure.
Primer/Probe | Sequence (5’ » 3’) |
ddPCR Forward Primer | ATGATCAAATGTCGACCG |
ddPCR Reverse Primer | ACCGTACTGGTTGAACA |
ddPCR HEX Probe (BHQ1) | [HEX]-TTCTACGGGCAGGGC-[BHQ1] |
ddPCR FAM Probe (BHQ1) | [6FAM]-CCACGTGGGATCGAAGG-[BHQ1] |
HEX: Hexachloro-fluorescein, FAM: 6-carboxyfluorescein, BHQ: Black Hole Quencher |
Table 1: Sequences of primers and probes.
Digital-droplet PCR is an efficient method to determine the presence of indel alleles resulting from NHEJ events in a Cas/gRNA-based gene-drive system and allows for quantification of the frequency of these alleles in individuals or populations. Some steps of the protocol need to be followed with special care to achieve reliable results. Firstly, the genomic DNA extraction needs to be performed carefully to ensure high quality and sufficient quantity. A good extraction will allow accurate determination of the haploid genome copies per reaction. In our experience, a commercially-available kit (see Table of Materials) provided consistently high-quality DNA extractions. However, extractions of individual mosquitoes can prove to be particularly challenging as the DNA pellet becomes hard to visualize and can easily be sucked up with the supernatant if not careful. Secondly, primers and probes must be designed carefully. Before completing the ddPCR experiment, ensure that the designed primers result in a single PCR product by first performing a traditional PCR and visualizing a single product via gel electrophoresis. The reference FAM probe also must be designed so that it is complementary to a highly-conserved sequence. This will ensure accurate detections of WT alleles throughout a diverse population. The primer/probe combinations for each unique experiment will have different thermocycler conditions, and it is recommended to optimize those conditions using a thermal gradient.
Other methods for identifying indels exist, such as Sanger sequencing or NGS. Sanger sequencing is limited because it has a lower limit of detection and low discovery power to identify novel variants. Sanger sequencing also is labor-intensive and is not high-throughput. Compared to Sanger sequencing, NGS does not have the same limitations of low sensitivity, discovery power, and throughput. Another benefit of NGS is its ability to detect a variety of mutations from single nucleotide polymorphism (SNPs) to rearrangements. However, NGS is a more costly and time-consuming method in the application of determining Cas9/gRNA-associated indels because there is only one target region of interest, and it is best suited for larger genome-wide analyses. Compared to the aforementioned methods, ddPCR is high throughput and has a quick turn-around time. If the ddPCR materials and instruments are available in-house, 96 samples can be processed within 1-2 days, making it well suited for quick analysis of large trials of Cas9/gRNA -modified organisms.
While many benefits exist for ddPCR there are also limitations. Firstly, the ddPCR equipment is not frequently available in an independent laboratory environment. ddPCR equipment may be available communally at larger research institutions, but this does not allow for ease of data generation and analysis outside of the institution. Secondly, unlike the alternatives, ddPCR does not provide the individual unique sequences of identified indel mutations. Digital-droplet PCR will provide the frequency of indel mutations within a population, but without the sequence, one cannot determine whether the indels present are more likely to conserve or inhibit the function of the gene of interest. The ddPCR method is perhaps suited best to analyze wild populations after a field release trial of a Cas9/gRNA-based drive organism because it can efficiently determine the frequency of introduction of the transgene into the native population and the generation of indels within the population in close to real-time. Due to the ddPCR quick turn-around time, it would be feasible to perform sampling and analyze the population in a field trial region weekly if materials were available locally. The start-up costs to purchase, import, and set up the ddPCR equipment would be high in remote laboratories but the benefits of being able to assess rigorously a wild population as it is undergoing modification from a drive system would justify the costs.
The authors have nothing to disclose.
Funding was provided by the University of California Irvine Malaria Initiative. AAJ is a Donald Bren Professor at the University of California, Irvine.
Reagents | |||
ddPCR Super Mix for Probes (no dUTP) | Bio-Rad | 1863024 | |
DNA extraction reagent (e.g. Wizard Genomic DNA Purification kit) | Promega | A1120 | |
EDTA (pH 8.0) | Invitrogen | AM9260G | |
Ethanol, 200 Proof | Thermo Fisher Scientific | A4094 | |
Isopropanol (Certified ACS) | Thermo Fisher Scientific | A416-500 | |
Nuclei Lysis Solution (NLS) (Wizard Genomic DNA Purification kit) | Promega | A1120 | |
PCR-grade Water | Any certified PCR-grade water can be used | ||
Protein Precipitation Solution (Wizard Genomic DNA Purification kit) | Promega | A1120 | |
Proteinase K 20 mg/mL | Thermo Fisher Scientific | AM2546 | |
Materials | |||
ddPCR 96-Well Plate | Bio-Rad | 12001925 | |
Droplet Generator DG8 Cartridge and Gaskets | Bio-Rad | 1864007 | |
Droplet Generation Oil for probes | Bio-Rad | 1863005 | |
Fluorescent probes (e.g. FAM/HEX probes) | Sigma-Aldrich | N/A | Probes are experiment specific and can be purchased from any certified seller available. |
Forward and Reverse oligonucleotide primers | Sigma-Aldrich | N/A | Primers are experiment specific and can be purchased from any certified seller available. |
Equipment | |||
C1000 Touch Thermal Cycler | Bio-Rad | 1851148 | Can use other Thermo cycler with gradient function and deep well |