We clone and analyze reporter genes generating circular RNAs. These reporter genes are larger than constructs to analyze linear splicing and contain Alu elements. To investigate the circular RNAs, the constructs are transfected into cells and resulting RNA is analyzed using RT-PCR after removal of linear RNA.
In addition to linear mRNAs, many eukaryotic genes generate circular RNAs. Most circular RNAs are generated by joining a 5' splice site with an upstream 3' splice site within a pre-mRNA, a process called back-splicing. This circularization is likely aided by secondary structures in the pre-mRNA that bring the splice sites into close proximity. In human genes, Alu elements are thought to promote these secondary RNA structures, as Alu elements are abundant and exhibit base complementarities with each other when present in opposite directions in the pre-mRNA. Here, we describe the generation and analysis of large, Alu element containing reporter genes that form circular RNAs. Through optimization of cloning protocols, reporter genes with up to 20 kb insert length can be generated. Their analysis in co-transfection experiments allows the identification of regulatory factors. Thus, this method can identify RNA sequences and cellular components involved in circular RNA formation.
Circular RNAs
Circular RNAs (circRNAs) are covalently closed single stranded RNAs that are expressed in most organisms. They are generated by joining a downstream 5' splice site to an upstream 3' splice site, a process called back-splicing (Figure 1A)1. Sequences in the pre-mRNA that exhibit base complementary as short as 30-40 nt bring back-splice sites into proper alignment for circRNA formation2. In humans, Alu elements1, representing about 11% of the genome3, form extensive double stranded RNA structures in pre-mRNA due to their self-complementarity4,5 and thus promote the formation of circRNAs1.
Currently, three major functions of circRNAs have been described. Some circRNAs bind microRNAs (miRNAs) and through sequestration act like miRNA sponges6. CircRNAs have been implicated in transcriptional and post transcriptional regulation, through competition with linear splicing7 or modulation of transcription factor activity8. Finally, circRNAs contain short open reading frames and proof of principle studies show that they can be translated9,10. However, the function of most circRNAs remains enigmatic. The majority of circular RNAs have been detected using next-generation sequencing methods11. Detailed analyses of individual genes using targeted RT-PCR approaches reveal that a large number of circular RNAs remains to be discovered12.
Use of reporter genes to analyze pre-mRNA processing
The analysis of mRNA derived from DNA reporter constructs transfected into cells is a well-established method to study alternative pre-mRNA splicing, which can be applied to circular RNAs. In general, the alternative exon, its surrounding introns, and constitutive exons are amplified and cloned into a eukaryotic expression vector. Frequently, the introns are shortened. The constructs are transfected into eukaryotic cells and usually analyzed by RT-PCR13,14. This approach has been extensively used to map regulatory splice sites and trans-acting factors in co-transfection experiments13,15,16,17,18. In addition, the generation of protein-expressing minigenes allowed for screening of substances that change alternative splicing19,20.
The method has been applied to circular RNAs. Currently, at least 12 minigene backbones have been described in the literature and are summarized in Table 1. With the exception of the tRNA based expression system21,22, they are all dependent on polymerase II promoters. Here, we describe a method to generate human reporter minigenes to determine cis and trans-acting factors involved in the generation of circular RNAs. An overview of the method using sequences of a published reporter gene23 is shown in Figure 1.
1. Design of the constructs
2. Select the sequence to be cloned in an expression vector
3. Design primers for cloning
4. PCR and amplicon detection
5. Optimization for longer DNA fragments for use of different polymerases
6. Purification of PCR products for cloning
7. DNA assembly and clone detection
NOTE: Cloning is done using an enzymatic DNA assembly kit, with minor modifications. The assembly is performed for 60 min at 50 °C and generally the lower range of DNA is used for assembly (20-100 fmol/20 µL reaction). The whole reaction mix is added to chemical competent cells and the whole cells plated out on a 6 cm agar plate.
8. Validation of clones
9. Analysis of circular RNA expressing reporter genes
10. RNase R treatment to remove linear RNAs
11. RT-PCR analysis
Reporter genes allow determination of regulatory factors that influence circular RNA formation. However, these reporter genes are large and contain repetitive elements that often make DNA constructs unstable. Due to their large size, it is often necessary to delete parts of the introns, which is achieved by amplifying genomic pieces containing the exons and smaller flanking intronic parts. These DNA pieces are enzymatically assembled, allowing construction without restriction enzymes.
The example of a circular RNA generated from the microtubule associated protein tau (MAPT) shows an application of the minigene approach to analyze circular RNAs. The tau 9→12 minigene used in this example was co-transfected with different splicing factors and the effect of these splicing factors was detected by RT-PCR (Figure 6). Different trans-acting factors influence both circular RNA and linear pre-mRNA formation. The experiment also shows that all the sequence elements necessary for circular RNA formation are localized in the cloned fragment.
Figure 1: Overview of the technique. (A) A hypothetical gene is shown. Introns are lines, exons are boxes, Alu elements are smaller striped boxes. Backsplicing from exon C to A creates a circular RNA. The structure of this circular RNA is shown in panel (E). (B) To create a reporter gene, exons and surrounding introns (at least 500 nt on each side) are amplified. The constructs should contain repetitive elements, which are usually Alu elements in humans. An exon upstream of exon A was included to provide an additional Alu element. The genomic fragments will overlap with their flanking 25 nts. (C) Fragments are cloned into an expression vector, driven by a CMV promoter. The successful recombination is detected by detection primers and validated by sequencing. (D) Cells are transfected with this construct. (E) Circular RNA is isolated and (F) amplified using circular RNA specific primers, preferable exon junction primers. During PCR amplification, linear RNA can also be amplified (G). (G) Orientation of the primers used to detect circular RNAs. The forward primer is in sense orientation (i.e., has the same sequence as the RNA) and the reverse primer is in antisense orientation (i.e., is the reverse complement of the RNA). Note that different from RT-PCR for linear mRNAs, the reverse primer is upstream of the forward primer. Please click here to view a larger version of this figure.
Figure 2: Selection of the sequence for minigene construction. (A) Browser display after the sequence shown in Supplemental Figure 1 is run against the human genomic database using BLAT. Literature exon numbers36 are indicated in the gene display, they are different from the numbers given by the browser.
1. The aligned sequences are shown under 'YourSeq'
2-4. Note that due to the circularity of the RNA, BLAT does not connect all exons with lines as it does in linear RNA. Exons 10 and 11 (corresponding to 2 and 3) are connected, but exon 12 (corresponding to 4) is not connected to exon 11.
5. Alu elements are shown in the repetitive element track.
(B) Sequence alignment between the planned construct and genomic DNA.
6. The planned construct was run against the database using BLAT.
7. Note the inclusion of several Alu elements in the construct. Please click here to view a larger version of this figure.
Figure 3: Example of the amplicons prior to cloning. (A) Optimized PCR products separated on a 1% agarose gel containing 1x GelGreen. The individual bands represent the PCR products that will be used in enzymatic DNA assembly. (B) The bands from (A) were cut out from the gel and purified. The purified PCR products were separated on a 1% agarose gel, which was subsequently stained with ethidium bromide. Please click here to view a larger version of this figure.
Figure 4: Restriction analysis of reporter genes. The tau 9-12 minigene used as an example was cut with restriction enzymes indicated to rule out major recombinations. Lane 1: cut with NcoI expected sizes 735 bp, 3345 bp, 6266 bp, lane 2 cut with XbaI expected size 10346 bp, lane 3 cut with HindIII expected sizes 3951 bp, 6395 bp, lane 4 cut with SmaI expected sizes 1168 bp, 1688 bp, 2708 bp, 4782 bp. Please click here to view a larger version of this figure.
Figure 5: Effect of primer multiplexing and RNase R treatment on circular RNA detection. (A) cDNA from samples A and B derived from human brain tissues was amplified with circular RNA primers circTau exon12_10 Reverse and circTau exon10_11 Forward. The reverse transcription for the cDNA was performed with the primers for linear and circular tau RNA. The expected band corresponding to tau circular RNA is shown by a triangle. The other strong bands are artifacts that did not match the human genome. (B) The experiment was repeated with identical PCR conditions, but the reverse transcription was performed only with the circTau exon12_10 Reverse primer. Only the expected band was amplified and validated through sequencing. (C) The RNA was treated with RNase R that removes linear RNA. The circular RNA is detectable after the treatment (left), whereas linear RNA gives no longer a detectable signal (right) Please click here to view a larger version of this figure.
Figure 6: Example of an analysis of a circRNA reporter gene. 1 µg of the tau 9→1237 reporter gene was transfected with 1 µg of splicing factors indicated. RNA was isolated 24 h post transfection and analyzed by RT-PCR. (A) Amplification of the linear tau mRNA. Due to alternative splicing of exon 10 two bands are observed. Their ratio changes due to the overexpression of splicing factors38,39. (B) Amplification of the circular 12→10 tau RNA23. Note the dependency of tau circRNA expression on expression of some splicing factors, especially the cdc2 like kinase clk2 and the SR protein 9G8. (C) The circular RNA of HIPK3 was used as a positive control indicating equal loading. Please click here to view a larger version of this figure.
Table 1: List of current minigenes expressing circular RNAs. Please click here to download this file.
Circular HIPK3 Control Primers | |
HIPK3 Reverse HIPK3 Forward |
TGCTTGGCTCTACTTTGAGTTTC TCGGCCAGTCATGTATCAAA |
Linear Primers | |
Tau Exon 12 Reverse Tau Exon 9 Forward |
CCCAATCTTCGACTGGACTC TGTCAAGTCCAAGATCGGCT |
Circular Primers | |
circTau exon12_10 Reverse circTau exon10_11 Forward |
CAGCTTCTTATTAATTATCTGCACCTTTT GAGGCGGCAGTGTGCAA |
Table 2: List of Primers.
Supplemental Figure 1: Tau circular RNA test sequence. Test sequence corresponding to a circular RNA from the MAPT locus. Different exons are indicated by underline, small caps and large caps. Please click here to view a larger version of this figure.
Supplemental Figure 2: Genomic sequence containing the planned minigene. Exons are highlighted in color and repetitive elements are underlined, italic and bold. Gray shading indicates flanking regions of low complexity that can be used to generate primers. Please click here to download this file.
Supplemental Figure 3: Sequences of the planned reporter gene. The vector sequence and the planned genomic fragments are shown. Please click here to download this file.
Supplemental Figure 4: Primers design for assembly. The sequence from Supplemental Figure 3 was entered into the builder tool. Please click here to view a larger version of this figure.
Supplemental Figure 5: Sequence of the tau 9->12 reporter gene used as an example. Please click here to download this file.
In general, circular RNAs are low abundant1, which complicates the study of their function and formation. Similar to linear RNAs13, the use of reporter minigenes allows the identification of cis and trans-acting factors that regulate the formation of circular RNAs. Thus, this approach generates hypotheses that can be further tested using the endogenous genes.
The most critical step is the design of the reporter gene. The enzymatic assembly of DNA fragments ("Gibson cloning"27) facilitates this design, as it allows construction of large reporter genes independent of restriction sites.
The back-splicing sites are brought together through flanking inverted repeats, which should be taken into account in reporter gene construction. The repeats are annotated in the genome browser 'repeat track' and selecting them shows their orientation. Keep in mind that proteins can also force the back-splicing sites into a secondary structure needed for circular RNA expression28 and for an unbiased analysis 1-2 kb of flanking intronic regions should be investigated.
To ensure stability of the constructs, an important consideration is the type of bacterial strains and their growth conditions. For shorter, simple constructs standard cloning bacteria are used, which are almost identical to DH5-alpha (huA2 Δ(argF-lacZ)U169 phoA glnV44 Φ80 Δ(lacZ)M15 gyrA96 recA1 relA1 endA1 thi-1 hsdR17). For longer fragments, containing more than 6 Alu elements, "stable" competent cells are used that lack a recombinase (recA) and endonuclease (endA1) (F' proA+B+ lacIq ∆(lacZ)M15 zzf::Tn10 (TetR) ∆(ara-leu) 7697 araD139 fhuA ∆lacX74 galK16 galE15 e14- Φ80dlacZ∆M15 recA1 relA1 endA1 nupG rpsL (StrR) rph spoT1 ∆(mrr-hsdRMS-mcrBC). If problems appear with recombination, indicated by low transformation counts, plate the transformed bacteria on two plates and let them grow at 30 ºC and 37 ºC, respectively. Due to the presence of numerous repetitive elements in the minigenes, they need to be fully sequenced using next generation sequencing, which is commercially available for around $150 per plasmid at 2019 rates. The sequence of the example is shown in Supplemental Figure 5. In addition, restriction fragment length polymorphism analysis for new larger preparation of the constructs is routinely performed. For example, using sites that cut 1-4 times results in a characteristic band pattern that rules out recombinations (Figure 4). Enzymes should be selected that give a characteristic band pattern of fragments that can be separated on an agarose gel.
Circular RNAs are analyzed with RT-PCR using exon junction primers that overlap with the backsplicing event (Figure 1F). Due to the circular nature of the RNA, the reverse (i.e. antisense) primer is upstream of the forward (i.e. sense) primer (Figure 1G). Primers detecting the abundantly expressed homeodomain-interacting protein kinase 3 (HIPK3) circular RNA1 are used as a positive control. HIPK3 and minigene specific reverse primers are reverse transcribed in the same tube, which allows their comparison. PCR reactions are performed with primers amplifying the linear mRNAs to compare processing patterns of circular and linear pre-mRNAs. We frequently observed aberrant bands when primers for linear and circular RNAs were mixed (Figure 5), and thus keep the reverse transcription of these samples separate.
RT-PCR analysis of circular RNAs is challenging and needs to be carefully controlled. While sensitive and convenient, it can produce artifacts unique for circular RNAs29. The reverse transcriptase can move several times around the RNA circle, which generates concatemers. Most circular RNA reporter genes generate both circular and linear RNA, which can cross-hybridize, leading to more PCR artifacts21,30,31. It is thus imperative to sequence the PCR products and validate findings using different techniques using Northern blots32 or RNase protections23.
Unexplained bands can also originate from aberrant amplification of linear RNA. Linear RNA can be removed using the exonuclease RNase R, which enriches circular RNAs33 (Figure 5C). RNase R treatment helps in the initial optimization of detection primers and can often be omitted once primers are optimized.
Alternative back-splicing can also contribute to unexplained bands as multiple circular RNAs can be formed from a genomic locus34. This alternative back-splicing is often the result of competing pre-mRNA structures formed by more than two inverted repeat elements. In addition, cryptic back-splice sites can occur32,35. Depending on the experimental goal, Alu elements can be repeated or added to the constructs. The complementary regions flanking back-splicing sites can be as short as 30-40 nt35 and replacement of Alu elements with shorter complementary regions can increase circular RNA formation2, which can be tested to improve circular RNA formation. Once the pre-mRNA sequences that cause back-splicing have been identified, it is thus possible to shorten circular RNA expressing constructs, which can improve transfection efficiency in some cases.
The authors have nothing to disclose.
This work was supported by the Department of Defense DoD grant AZ180075. Stefan Stamm thanks Jacqueline Noonan Endowment. Anna Pawluchin was supported by the DAAD, German academic exchange program, Justin R. Welden was a recipient of the University of Kentucky Max Steckler Award.
(PEI) Hydrochloride | Polysciences | 24765-1 | |
Builder tool | NEB | https://nebuilder.neb.com/#!/ | |
Dark Reader Transilluminator. | Clare Chemical Research | ||
Enzymatic DNA assembly kit | NEB | E2621S | |
Gel and PCR cleanup kit | Promega | A9282 | |
Glyco Blue | Thermo Fisher | AM9516 | |
pcDNA3.1 cloning site | Polycloning site | https://assets.thermofisher.com/TFS-Assets/LSG/manuals/pcdna3_1_man.pdf | |
Polymerase 1 | NEB | M0491L | Q5 DNA polymerase |
Polymerase 2 | Biorad | 1725310 | Long range polymerase (NEB), iproof (BioRad) |
Polymerase 2 | Qiagen | 206402 | Qiagen long range polymerase kit |
Reverse Transcriptase | Thermo Fisher | 18080044 | |
RNA isolation kit | Life Technologies | 12183025 | Ambion by Life Technologies |
RNAse R | Lucigen | RNR07250 | Epicenter/Lucigen |
Stable competent cells | NEB | C3040H | NEB stable cells |
Standard cloning bacteria | NEB | C2988J | NEB5-alpha competent |
Web tool to design primers | NEB | https://nebuilder.neb.com/#!/ | |
Web-based temperature calculations | NEB | https://tmcalculator.neb.com/#!/main |