Here, we present a method for generating tissue-specific binary transcription systems in Drosophila by replacing the first coding exon of genes with transcription drivers. The CRISPR/Cas9-based method places a transactivator sequence under the endogenous regulation of a replaced gene, and consequently facilitates transctivator expression exclusively in gene-specific spatiotemporal patterns.
Binary transcription systems are powerful genetic tools widely used for visualizing and manipulating cell fate and gene expression in specific groups of cells or tissues in model organisms. These systems contain two components as separate transgenic lines. A driver line expresses a transcriptional activator under the control of tissue-specific promoters/enhancers, and a reporter/effector line harbors a target gene placed downstream to the binding site of the transcription activator. Animals harboring both components induce tissue-specific transactivation of a target gene expression. Precise spatiotemporal expression of the gene in targeted tissues is critical for unbiased interpretation of cell/gene activity. Therefore, developing a method for generating exclusive cell/tissue-specific driver lines is essential. Here we present a method to generate highly tissue-specific targeted expression system by employing a "Clustered Regularly Interspaced Short Palindromic Repeat/CRISPR-associated" (CRISPR/Cas)-based genome editing technique. In this method, the endonuclease Cas9 is targeted by two chimeric guide RNAs (gRNA) to specific sites in the first coding exon of a gene in the Drosophila genome to create double-strand breaks (DSB). Subsequently, using an exogenous donor plasmid containing the transactivator sequence, the cell-autonomous repair machinery enables homology-directed repair (HDR) of the DSB, resulting in precise deletion and replacement of the exon with the transactivator sequence. The knocked-in transactivator is expressed exclusively in cells where the cis-regulatory elements of the replaced gene are functional. The detailed step-by-step protocol presented here for generating a binary transcriptional driver expressed in Drosophila fgf/branchless-producing epithelial/neuronal cells can be adopted for any gene- or tissue-specific expression.
The genetic toolbox for targeted gene expression has been well developed in Drosophila, making it one of the best model systems to investigate the function of genes involved in a wide variety of cellular processes. Binary expression systems, such as yeast Gal4/UAS (upstream activation sequence), was first adopted for tissue-specific enhancer trapping and gene misexpression in the Drosophila genetic model1 (Figure 1). This system facilitated the development of a large number of techniques such as spatiotemporal regulation of gene overexpression, misexpression, knockout in selected groups of cells as well as in cell ablation, cell marking, live tracing of cellular and molecular processes in embryo and tissues, lineage tracing and mosaic analyses during development. A number of binary transcription system, such as the bacterial LexA/LexAop system (Figure 1) and Neurospora Q-system, are powerful genetic tools that are now widely used in Drosophila, in addition to the original Gal4/UAS system for targeted gene expression1,2,3.
Here, we present a method to generate highly reliable tissue-specific binary expression system by employing a genome-editing technique. The recent advancements in CRISPR/Cas9 genome editing technology have allowed unprecedented opportunities to make directed genome changes in a broad range of organisms. Compared to the other available genome editing techniques, the CRISPR/Cas9 system is inexpensive, efficient, and reliable. This technology utilizes components of the bacterial adaptive immune system: a Cas9 endonuclease of Streptococcus pyogenes that creates a double-strand break (DSB), and a chimeric guide RNA (gRNA), which guides the Cas9 to a particular genome site for targeted DSB4. The cells contain the machinery to repair the DSB using different pathways. Non-homologous end joining (NHEJ) leads to small insertions or deletions to disrupt gene function, while homology-directed repair (HDR) introduces a defined directed/desirable genomic knock-in/knock-out by using an exogenous HDR donor as a template. The HDR-based replacement strategy can efficiently be utilized to generate a highly reliable tissue-specific binary expression system, which can overcome all the limitations of the traditional enhancer trap methods. We describe a step-by-step procedure for utilization of CRISPR/Cas9-based HDR repair in generating a binary transcription driver line that is expressed under the control of endogenous transcriptional and post-transcriptional regulation of a Drosophila gene. In this protocol, we demonstrate the generation of a driver line specific for branchless (bnl) gene encoding an FGF family protein that regulates branching morphogenesis of tracheal airway epithelium5. In this example, the first coding exon of the bnl gene was replaced by the sequence of a bacterial LexA transactivator sequence without altering any endogenous cis-regulatory sequences of the bnl gene. We show that the strategy generated a bnl-LexA driver line that spatiotemporally controls the expression of a reporter gene placed downstream of LexAoperator (LexAop or LexO) exclusively in bnl-expressing epithelial/mesenchymal/neuronal cells.
1. Designing and Constructing the gRNA Expression Vector
2. Designing and Constructing the HDR Donor
3. Embryo Injection, Fly Genetics and Screening for Genome Editing
This protocol was successfully used to generate a targeted binary expression reporter system specific for bnl expressing cells5. The cis-regulatory elements (CREs) that control complex spatiotemporal bnl expression are not characterized. Therefore, to achieve spatiotemporal expression under the control of the endogenous bnl regulatory sequence, only the first coding exon of bnl was designed to be replaced with the sequence of the bacterial LexA transactivator. An improved nls-LexA::p65 cassette known to provide optimal expression in Drosophila was selected.
The replacement strategy aimed to generate a chimeric nls-LexA:p65-bnl mRNA under endogenous transcriptional and post-transcriptional control. This chimeric mRNA contained an intact bnl 5' UTR, part of the 5' end and the 3' end of the edited bnl coding exon (first exon), and the complete downstream bnl introns and exons. To preserve the bnl RNA-specific splicing of the replaced exon, small 5' and 3' ends of the replaced coding exon were retained. However, to prevent synthesis of a chimeric Bnl protein fused to nls-LexA:p65, a T2A self-cleaving peptide sequence was incorporated between the residual 5' bnl coding region and the ATG of nls-LexA:p65. Also, a translation stop codon was added after the nls-LexA:p65 sequence to avoid co-translation of a truncated Bnl protein5 (Figure 4 and Figure 5A).
A replacement donor containing all these features was used, which had the T2A–nls-LexA:p65 sequence and 2 kb 5'- and 1.8 kb 3'- homology arms. Long homology arms were used for efficient HDR and insertion of a large DNA fragment at the site of repair (T2A-nls-LexA:p65 is 1.8 kb) (Figure 5A).
Two gRNAs were used to create DSBs at the defined positions to delete most of the first coding exon of bnl. Two gRNAs that matched all criteria described in section 1.3 (PAM sequence underlined) were:
gRNA1: TGTATCTGCGATGCCCCTCATGG
gRNA2: ATCCTTCAGATATTGCGGGATGG
Both gRNA recognition sites were disrupted in the replacement donor by the exogenous T2A–nls-LexA:p65 sequence, which is the easiest way to avoid retargeting of the gRNAs to the engineered locus. The disrupted gRNA recognition sequences in the replacement donor (the exogenous sequences that disrupted gRNA recognition sites in italics) were:
gRNA1: TGTATCTGCG-GGCTCCGGCGAAGGAC…
gRNA2: …AAAAACTCGTTTAGA-CGGGATGG
The pCFD4 gRNA expression vector containing these gRNAs, along with the replacement donor, was co-injected into the germline of nos-Cas9 embryos. Subsequently, the protocol described here was followed to screen for the intended replacement (Figure 5B, C). About 67% of injected G0 animals were fertile. And 56% of the fertile G0s were the founders, which gave rise to HDR-positive progeny. Among the F1 progeny checked, about 23% were positive for successful HDR, and 73% of the positive HDR were "ends-out" (Table 2). Since the first coding exon of bnl was "knocked out", all the HDR lines were found to be homozygous lethal, and the stocks were maintained over balancers.
The generation of a chimeric LexA-bnl mRNA in the cells was validated by RT-PCR analyses (Figure 5D). To verify if the obtained bnl–LexA lines were expressed in the endogenous bnl expression pattern, each "ends-out" HDR line was crossed to LexO-mCherryCAAX transgenic flies5, and the reporter expression pattern was examined in the progeny embryos and larvae. 42 out of 46 lines showed accurate spatiotemporal expression consistent with the previously reported bnl patterns5 (Figure 6). Four lines showed either weak or non-specific expression patterns. We predict that these lines might have accumulated mutations, which we need to verify with thorough sequencing analyses. Together these results confirmed that the HDR-mediated exon replacement strategy could successfully be employed to generate a targeted binary expression system for a gene. The tool can successfully be used to express reporter or ectopic genes in the spatiotemporal patterns of the bnl gene.
Figure 1: Scheme of a binary transcription system for targeted gene expression. A transactivator (GAL4 or LexA), expressed under the control of cis-regulatory elements (CREs) of a gene, drives an effector transgene placed downstream of the specific transcription binding sites (UAS for Gal4 or LexAop for LexA). Please click here to view a larger version of this figure.
Figure 2: An overview of workflow for CRISPR/Cas9-mediated genome editing to generate a binary transcription system. The approximate time duration required for each step is indicated. Please click here to view a larger version of this figure.
Figure 3: An illustration of the CRISPR screening and genetic cross scheme for establishing genome-edited fly lines. In genetic crosses, R stands for the edited allele that is on the 3rd chromosome. MKRS (Tp(3;3)MRS, M(3)76A[1] kar[1] ry[2] Sb[1]), is a 3rd chromosome marker; TM6B (In(3LR)TM6B, Antp[Hu] e[1] Tb[1]) is a 3rd chromosome balancer. Genome edited fly stocks are verified by PCR amplifying the target regions of interest from the genomic DNA extracted from either the F2 or F3 generation flies and sequence determining the PCR products. Please click here to view a larger version of this figure.
Figure 4: Generation of the replacement cassette by DNA assembly. A schematic drawing depicting PCR amplification and assembly of different PCR products (a, b, and c) into the donor vector (d) using a DNA assembly method. Please click here to view a larger version of this figure.
Figure 5: Generation of bnl-LexA by CRISPR/Cas9-mediated exon replacement. (A) Schematic drawing depicting the strategy of the CRISPR/Cas9 mediated HDR for exon replacement in the bnl locus. Box – exon; line- intron; replacement donor (pDonor-bnl:LexA) and two possible outcomes of the HDR were shown. The pDonor-bnl:LexA had the following features: (1)T2A-nls-LexA:p65 (~1.8kb) sequence flanked by 2 kb and 1.8 kb long homology arms (dashed lines), (2) a T2A self-cleaving peptide between the residual N terminal bnl exon and the nls-LexA:p65, and (3) a translation stop codon (red *) after the nls-LexA:p65 sequence. The HDR product retained all the transcriptional and post-transcriptional control of bnl,and the LexA:p65 protein is expected to be produced in the same pattern as endogenous Bnl. Small black arrows show the relative binding sites (not in scale) of the PCR primers (Table 1) used for 3-step screening or RT-PCR validation. (B) Agarose gel pictures showing results of the 3-step PCR screening. PCR products amplified from the genomic DNA of four successful ends-out HDR lines are shown; negative control, the genomic DNA of nos-Cas9 parental line; positive control, pDonor-bnl:LexA plasmid; M, Marker (SL2K DNA ladder). (C) An example of the screening gel showing the expected PCR product using primers M13F and rev3 for ends-in lines; M8-7 and M9-6 are two ends-in lines; negative and positive controls, the same as in B; M, Marker (NEB 1 kb DNA ladder). (D) RT-PCR analysis on total RNA from bnl-LexA and the nos-Cas9 control flies. Forward primer binds to a LexA specific region, reverse primer binds to a downstream bnl exon region; ~440 bp (base-pair) amplification band (*) was detected from RT-PCR on bnl-LexA mRNA, but not from the control RNA. M, 100 bp Marker (NEB). Adapted from Figures 2 and S1 in Du et al. 20175. Please click here to view a larger version of this figure.
Figure 6: Validation of tissue-specific and conditional bnl-LexA expression in different tissues. (A-D) bnl expression (red) in different larval tissues, with the btl expressing cells shown in green. (A,A') Wing disc bnl source ahead of the growing air sac primordium (ASP). (B, C) bnl expression in TR5 transverse connective (TC) (B) and TR2 dorsal branch (DB) (C). (D) bnl expression in genital discs. (E-J) Dynamic bnl expression (red) during embryonic tracheal branch (green) patterning; Small arrows, five bnl sources surrounding the tracheal placode at stage 10. Genotypes: A-J; btl-Gal4, UAS-CD8GFP/+; bnl-LexA,LexO-CherryCAAX/+. (K-L") Hypoxia–induced bnl-LexA expression profile in wing discs and associated TR2 tracheal metamere (TC, DB, dorsal trunk-DT). Genotype: bnl-LexA,LexO-CherryCAAX/+. (K) control discs from ex vivo cultured organs without CoCl2. (L-L") wing discs (L, L') and trachea (DT, (L'')) from cultured ex vivo organs with CoCl2 induced hypoxia. Star, ectopic expression induced by hypoxia. Scale bars = 30 µm; 50 µm (K-L"). Adapted from Figure 3 in Du et al. 20175. Please click here to view a larger version of this figure.
A. gRNA cloning and sequencing | |
bnl-lexA gRNA fwd | TATATAGGAAAGATATCCGGGTGAACTTCgTGTATCTGCGAT GCCCCTCAGTTTTAGAGCTAGAAATAGCAAG |
bnl-lexA gRNA rev | ATTTTAACTTGCTATTTCTAGCTCTAAAACTCCCGCAATATCTGAAGG ATcGACGTTAAATTGAAAATAGGTC |
T3 primer used for sequencing | CAATTA ACCCTCACTAAAGG-3' |
Note: nucleotides underlined anneal to U6 promoter or gRNA core on pCFD4 vector, the lowercase g/c was added to aid U6 promoter-dependent transcription | |
B. HDR donor construction | |
bnl N-F_pUC19 | AATTCGAGCTCGGTACtgtggtctttgaggctggaac |
bnl-lexA-N-R | tCCGcaagtCagtAGgctgccgcgtccttcgccggaGCCCGCAGATACAAGGCCC C |
lexA-F | CTactGacttgCGGaGAtGTcGAaGAGAACCCtGGCCCtATGCCACCCAA GAAGAAGC |
lexA-R | CTAAACGAGTTTTTAAGCAAACTCACTC |
bnl lexA-C Fwd | TAAAAACTCGTTTAGACGGGATGGCGTTGTCAAC |
bnl C-R_pUC19 | GCCAAGCTTGCATGCCtcgcataattgccgcctgg |
Note: nucleotides in capital overlap with pUC19 vector for Gibson Assembly, nucleotides underlined were sequence overhang for T2A peptide addition. | |
C. HDR screening and sequencing | |
bnl-lexA scr fwd1 | GTGGCGCACGCCCAATAAAC |
bnl-lexA scr rev1 | GATCCCAGCCAATCTCCGTTG |
bnl-lexA scr fwd2 | CAACGGAGATTGGCTGGGATC |
bnl-lexA scr rev2 | CTGGCCAACTGTAGGGAAGTC |
ends-in check rev3 | GCAATGTTATGCAATGCGTTGAC |
bnl-lexA seq fwd3 | CACTTGTCGCCCATATTGATACAATTG |
NOTE: These primers were used for PCR screening and sequencing, the approximate locations of primer binding sites are shown in Figure 5A as fwd1-2 and rev1-3. | |
D. RT-PCR analysis | |
RT-f | GATATGGATTTCTCCGCTTTGCTG |
RT-r | CCATGCAGAGATACAGGCAAGTG |
Table 1: Primers used in this study. (A) Primers for cloning gRNA expression vector and sequencing. (B) Primers for cloning HDR donor template. (C) Primers for screening and sequencing of the HDR products. (D) Primers used for RT-PCR verification of the chimeric LexA-bnl mRNA product.
genotype | HDR transmission rate % (# HDR-yielding G0/# fertile G0) | # HDR-positive F1/# total F1 checked (%) | # "ends-out" HDR/# total HDR (%) |
bnl-LexA | 56 (15/27) | 23 (60/259) | 73 (46/60) |
Table 2: The efficiency of CRISPR/Cas9-mediated HDR. The HDR transmission rate is calculated as the # of HDR-yielding G0/# of total fertile G0. Successful HDR% is calculated as the # of HDR-positive F1/# of total F1 screened. "Ends-out"% is calculated as the # of "ends-out" HDR/# of total positive HDR.
Traditionally, Drosophila enhancer traps were generated by two different methods. One of the ways includes random insertion of a driver (eg., Gal4) sequence in the genome by transposition (e.g., P-element transposition)1 . Alternatively, the driver sequences can be placed under the transcriptional control of a putative enhancer/promoter region in a plasmid construct, which would then be integrated into an ectopic site of the genome3,11. Although these methods have generated a large pool of Gal4 or LexA enhancer traps for Drosophila, they have several limitations. The P-element-mediated random insertion in the genome may disrupt the regulatory activity of critical cis-regulatory sequences. Due to the random transposition in the genome, transactivator expression may report patterns of multiple neighboring genes. On the other hand, a prior knowledge of the cis-regulatory sequences of a gene is necessary for an enhancer-trap plasmid construct. There is also a limit to the length of a sequence that can be cloned and tested in a plasmid construct. Isolating and cloning only a partial fragment of DNA out of the endogenous genomic context might not reproduce a complete gene-specific expression patterns. For instance, the bnl-Gal4 (NP2211) line, which was generated by P-element insertion of an enhancer trap Gal4 element just ahead of the bnl gene, does not completely reproduce the gene-specific expression patterns in the embryo5. Therefore, under such contexts, the repurposing techniques, such as HACK (homology assisted CRISPR knock-in), that can convert the existing Gal4 lines into another LexA or Q-system based driver line12 might not be very useful. To overcome these limitations, here, we described an efficient and alternative method for generating binary expression systems by genome editing. In this method, the first coding exon of a gene is swapped with the transactivator (eg., LexA or Gal4) sequence so that the expression of the transcription driver exclusively mimics the spatiotemporal patterns of the gene expression. Although the strategy and protocols described here were optimized for bnl-expression, the method can easily be adopted for other genes.
Determining an insertion site within the targeted gene region is the most critical step in this experimental strategy. The method we presented was designed to retain all the original transcriptional as well as the post-transcriptional regulations of the bnl gene5. Therefore, we planned to delete most of the first coding exon of the bnl gene and replace it with the LexA cassette. The replacement was planned in such a way that the edited allele expresses a hybrid LexA–bnl mRNA from the endogenous transcription and translation start site of bnl while retaining all the intronic regions. However, the hybrid mRNA was engineered to produce only LexA protein, but not Bnl. We showed that the strategy had successfully generated a bnl-LexA /LexO-based transcription system and that the system could accurately report the dynamic, spatiotemporal bnl expression patterns (Figure 5)5. A limitation of this process is the homozygous lethality in the Drosophila lines if the targeted gene is essential for survival. Further, use of a dual gRNA strategy may increase the chance of off-target editing. An alternative strategy can be employed to avoid these problems. In this strategy, a LexA or Gal4 cassette can be placed right at the ATG start site of the replaced gene and a T2A self-cleaving peptide sequence can be placed between the LexA/Gal4 cassette and the start of the intact coding exon of the target gene. This strategy is expected to produce a chimeric mRNA that can be translated into both the transctivator and the functional gene product. Such a design could possibly reduce the chance of homozygous lethality. In this strategy, a single gRNA is sufficient for genome editing. However, in the presence of two copies of functional genes, expression of the transgenic protein variants driven by the Gal4/LexA driver (eg., bnl-LexA driven expression of LexO-Bnl:GFP fusions, or Bnl proteins with deletions) will not provide information of the functionality of the variant proteins.
A limitation of the genome-editing strategy is the laborious process of targeting and editing of a single gene at a time. However, the simplicity and efficiency of CRISPR/Cas9-mediated genome editing method have revolutionized the targeted genomic knock-in/knock-out and replacement technology that can easily be adopted in any standard laboratory set up. In the last several years, Drosophila researchers have developed convenient toolkits for genome editing that can be employed to expand the genetic toolsets essential for understanding fundamental biological processes7,13. The genome editing and screening processes described here is relatively faster and takes about two months in comparison to any other homologous recombination-based replacement. For successful and efficient genome-editing results, it is important to follow several technically critical steps: 1) each gRNA is selected by the maximum stringency and activity without potential off-target; 2) the HDR donor contains ~1.5 kb homology arms flanking the gRNA targeting sites. Longer homology arms (1.8–2 kb) are used to increase the efficiency of HDR when a large exogenous DNA fragment needs to be inserted; 3) the HDR donor is designed to be free of the gRNA recognition sites to avoid retargeting of the gRNA to the engineered locus; and 4) a foolproof elaborate plan and coordination of the timing of genetic crosses with 3-step PCR-based screening to select an "ends-out" HDR line.
Unlike random transposition or cloned enhancer constructs, the strategy and protocols we described here ensures generation of an exclusively target gene-specific binary transcription system. Since the expression of the driver replaces a native gene exon, gene-specific expression of the driver is also versatile and is expected to mimic gene expression patterns under all developmental, metabolic, and physiological conditions5. Gene/cell-specific expression is the most desirable criteria of all the binary driver lines when they are used in various cell and developmental biology methods. For instance, gene-specific expression of reporter genes by a transcription driver facilitates reliable lineage analyses of the cells expressing the target gene. It also enables live imaging and tracking of the spatiotemporal expression, migration, and reorganization of cells during development. To investigate the cellular and molecular events during tissue morphogenesis, Gal4 or LexA driven overexpression/misexpression of various transgenes and RNAi-mediated gene knock-down in specific sets of cells are essential. Highly specific targeted expression system provides an unbiased analyses and interpretation of the results. Therefore, CRISPR/Cas9-based targeting of a gene exon and its replacement with a transactivator sequence provides a better alternative for generating highly specific targeted gene expression systems.
The authors have nothing to disclose.
We thank Dr. F. Port, Dr. K. O'Connor-Giles, and Dr. S. Feng for discussions on CRISPR strategy; Dr. T.B. Kornberg, and the Bloomington Stock Center for reagents; UMD imaging core facility; and funding from NIH: R00HL114867 and R35GM124878 to SR.
X-Gal/IPTG | Gentrox (Genesee Scientific) | 18-218 | cloning |
LB-Agar | BD Difco | BD 244520 | cloning |
Tris-HCl | Sigma Aldrich | T3253 | Biologie moléculaire |
EDTA | Sigma Aldrich | E1161 | Biologie moléculaire |
NaCl | Sigma Aldrich | S7653 | Biologie moléculaire |
UltraPure DNase/RNase-Free Water | ThermoFisher Scientific | 10977-023 | Biologie moléculaire |
10%SDS | Sigma Aldrich | 71736 | Biologie moléculaire |
KOAc | Fisher-Scientific | P1190 | Biologie moléculaire |
EtOH | Fisher-Scientific | 04-355-451 | Biologie moléculaire |
GeneJET Miniprep | ThermoFisher Scientific | K0503 | Miniprep |
PureLink HiPure Plasmid Maxipep kits | ThermoFisher Scientific | K210006 | Maxiprep |
BbsI | NEB | R0539S | Restriction enzyme |
Primers | IDT-DNA | PCR | |
pCFD4 | Kornberg Lab | DNA template and vector for gRNA | |
KAPA HiFi Hot Start- (Kapa Biosystems) | Kapa biosystems | KK2601 | PCR |
Q5-high fidelity Taq | NEB | NEB #M0491 | PCR |
Gibson Assembly Master Mix | NEB | NEB #E2611 | DNA assembly |
pBPnlsLexA:p65Uw | Addgene | DNA template for LexA amplification | |
Proteinase K | ThermoFisher Scientific | 25530049 | Biologie moléculaire |
2x PCR PreMix, with dye (red) | Sydlab | MB067-EQ2R | Biologie moléculaire |
Gel elution kit | Zymo Research (Genesee Scientific) | 11-300 | Biologie moléculaire |
TRI reagent | Sigma-Aldrich | Biologie moléculaire | |
Direct-zol RNA purification kits | Zymo Research (Genesee Scientific) | 11-330 | Biologie moléculaire |
OneTaq One-Step RT-PCR Kit | NEB | E5315S | Biologie moléculaire |
lexO-CherryCAAX | Kornberg Lab | Fly line | |
UAS-CD8:GFP | Kornberg lab | Fly line | |
btl-Gal4 | Kornberg lab | Fly line | |
MKRS/TB6B | Kornberg lab | Fly line | |
Confocal Microscope SP5X | Leica | Imaging expression pattern | |
CO2 station | Genesee Scientific | 59-122WCU | fly pushing |
Stereo microscope | Olympus | SZ-61 | fly pushing |
Microtube homogenizing pestles | Fisher-Scientific | 03-421-217 | genomic DNA isolation |
NanoDrop spectrophotometer | ThermoFisher Scientific | ND-1000 | DNA quantification |