Summary

Identification of Circular RNAs using RNA Sequencing

Published: November 14, 2019
doi:

Summary

Circular RNAs (circRNAs) are non-coding RNAs that may have roles in transcriptional regulation and mediating interactions between proteins. Following assessment of different parameters for construction of circRNA sequencing libraries, a protocol was compiled utilizing stranded total RNA library preparation with RNase R pre-treatment and is presented here.

Abstract

Circular RNAs (circRNAs) are a class of non-coding RNAs involved in functions including micro-RNA (miRNA) regulation, mediation of protein-protein interactions, and regulation of parental gene transcription. In classical next generation RNA sequencing (RNA-seq), circRNAs are typically overlooked as a result of poly-A selection during construction of mRNA libraries, or are found at very low abundance, and are therefore difficult to isolate and detect. Here, a circRNA library construction protocol was optimized by comparing library preparation kits, pre-treatment options and various total RNA input amounts. Two commercially available whole transcriptome library preparation kits, with and without RNase R pre-treatment, and using variable amounts of total RNA input (1 to 4 µg), were tested. Lastly, multiple tissue types; including liver, lung, lymph node, and pancreas; as well as multiple brain regions; including the cerebellum, inferior parietal lobe, middle temporal gyrus, occipital cortex, and superior frontal gyrus; were compared to evaluate circRNA abundance across tissue types. Analysis of the generated RNA-seq data using six different circRNA detection tools (find_circ, CIRI, Mapsplice, KNIFE, DCC, and CIRCexplorer) revealed that a stranded total RNA library preparation kit with RNase R pre-treatment and 4 µg RNA input is the optimal method for identifying the highest relative number of circRNAs. Consistent with previous findings, the highest enrichment of circRNAs was observed in brain tissues compared to other tissue types.

Introduction

Circular RNAs (CircRNAs) are endogenous, non-coding RNAs that have gained attention given their pervasive expression in the eukaryotic transcriptome1,2,3. They are formed when exons back-splice to each other and hence were initially considered to be splicing artifacts4,5. However, recent studies have demonstrated that circRNAs exhibit cell type, tissue, and developmental stage specific expression3,6 and are evolutionarily conserved2,3. Furthermore, they are involved in mediation of protein-protein interactions7, micro-RNA (miRNA) binding3,8,9,10, and regulation of parental gene transcription11.

In classical RNA sequencing (RNA-seq), circRNAs may be completely lost during library construction as a result of poly-A selection for mRNAs or may be difficult to isolate given their low abundance. However, recent circRNA characterization studies have incorporated a pre-treatment step using RNase R in order to enrich for circRNAs2,12,13. RNase R is an exoribonuclease that digests linear RNAs, leaving behind circular RNA structures. CircRNA enrichment protocols were optimized by generating and comparing data from two commercially available whole transcriptome library construction kits, with and without an RNase R pre-treatment step, and using varying amounts of total RNA input (1 to 4 µg). The optimized protocol was next used to evaluate the abundance of circRNAs across five different brain regions (cerebellum [BC], inferior parietal lobe [IP], middle temporal gyrus [MG], occipital cortex [OC] and superior frontal gyrus [SF]) and four other tissue types (liver [LV], lung [LU], lymph node [LN] and pancreas [PA]). RNA-seq libraries were paired end sequenced and data was analyzed using six different circRNA prediction algorithms: find_circ3, CIRI14, Mapsplice15, KNIFE16, DCC17, and CIRCexplorer18. Based on our analysis, the highest number of unique circRNAs was detected when using a stranded total RNA library preparation kit with RNase R pre-treatment and 4 µg total input RNA. The optimized protocol is described here. As previously reported19,20, the highest enrichment of circRNAs was observed in the brain compared to other tissue types.

Protocol

This research has been performed in compliance with all institutional, national and international guidelines for human welfare. Brain tissues were obtained from the Banner Sun Health Research Institute Brain and Body Donation Program in Sun City, AZ. The operations of the Brain and Body Donation Program are approved by the Western Institutional Review Board (WIRB protocol #20120821). All subjects or their legal representatives signed the informed consent. Commercial (non-brain) biospecimens were purchased from Proteogenex.

1. RNase R Treatment

NOTE: In the following steps, the reaction volume is adjusted to a total volume of 50 µL. This is the minimum sample volume to be used in the RNA cleanup & concentrator kit (see Table of Materials). Additionally, the optimized protocol described here is for an input amount of 4 µg of total RNA. A longer incubation time for RNase R treatment is recommended for an input amount >4 µg.

  1. Dilute total RNA to 4 µg in 39 µL RNase-free water in a microcentrifuge tube and mix well by pipetting.
  2. In a separate tube, dilute the RNase R to a working concentration of 2 U/µL with 1x RNase R Reaction Buffer. Make only enough for immediate use.
  3. Pipette 39 µL of total RNA and 5 µL of 10x RNase R Reaction Buffer into a 1.5 mL reaction tube and mix well by pipetting (50 µL will be the total reaction volume). Next, add 6 µL of RNase R (2 U/µL).
  4. Adjust the pipette to the full reaction volume (50 µL) and mix well by pipetting up and down 10 times.
  5. Place the tube in a 37 °C water bath for 10 min. Make sure that the full reaction volume is immersed in the water bath.
  6. Place the tube on ice and immediately proceed with RNA cleanup & concentration (section 2).

2. Purifying RNA Using an RNA Cleanup and Concentrator Kit

NOTE: When using high quality RNA (RIN>8, DV200>80%), RNase R treatment may result in loss of approximately 60% of RNA. Using a 4 µg input, it is estimated that 2–2.5 µg of treated RNA is left after section 1.

  1. Before starting, prepare the RNA Wash Buffer by adding 48 mL of 100% ethanol to the buffer concentrate and mix well by pipetting. Place purification columns into collection tubes (see Table of Materials) and place in a tube rack.
    NOTE: Use the following centrifugation settings for all following steps: 10,000–16,000 x g. If DNase I treatment has already been performed, skip DNase I treatment at this stage.
  2. Add 2 volumes of RNA Binding Buffer to the RNase R treated sample, and mix well by pipetting (total volume: 150 µL).
  3. Add 1 volume of 100% ethanol to the RNA Binding Buffer and RNase R treated sample mixture, and mix well by pipetting (total volume: 300 µL).
  4. Transfer the entire volume to the column and centrifuge the column for 30 s. Discard flow through.
  5. Add 400 µL of RNA Prep Buffer directly to the column, centrifuge the column for 30 s, and discard flow through.
  6. Add 700 µL of RNA Wash Buffer directly to the column, centrifuge the column for 30 s, and discard flow through.
  7. Add 400 µL of RNA Wash Buffer directly to the column, centrifuge the column for 2 min, and transfer the column to a fresh RNase-free 1.5 mL tube.
  8. Add 11 µL of RNase-free water directly to the column by holding the pipette tip right above the column filter and ensuring that water lands only on the column filter.
  9. Incubate the column for 1 min at room temperature and centrifuge for 1 min.
  10. Before discarding the column, check for flow through in the RNase-free tube. If elution was successful, store sample at -80 °C or immediately proceed with library preparation. The final total elution volume of approximately 10 µL is used for library construction.
    NOTE: Stopping point: Leave RNA at -80 °C for up to 7 days before continuing with library preparation.

3. circRNA Library Prep

NOTE: See Table of Materials for kit, which contains most reagents used in this section.

  1. rRNA depletion and fragmentation
    1. Transfer 10 µL of purified RNA from step 2.10 to a clean well in a new 96-well 0.3 mL PCR plate. To the well, add 5 µL of rRNA Binding Buffer followed by 5 µL rRNA Removal Mix. Gently pipette up and down 10 times to mix.
    2. Seal plate and incubate for 5 min at 68 °C on a pre-programmed, pre-heated thermocycler block. After completion of the 5 min incubation, place plate on bench and incubate at room temperature for 1 min.
    3. Remove seal from plate. Add 35 µL of vortexed room temperature rRNA removal beads to sample. Adjust the pipette to 45 µL and pipette up and down 10–20x to mix thoroughly. Incubate plate for 1 min at room temperature.
    4. Transfer plate to a magnetic stand and incubate on the stand for 1 min or until the solution clears. Transfer all of the supernatant (~45 µL) to new well on the same plate, or new plate (depending on how many samples you are working with).
    5. Vortex the RNA cleanup beads (see Table of Materials) until well dispersed, and add 99 µL of beads to each sample. Pipette up and down 10x to mix. Incubate the plate at room temperature for 10 min.
    6. Transfer the plate to the magnetic stand and incubate an additional 5 min or until the solution clears. Remove and discard all of the supernatant from the well.
    7. With the plate still on the magnetic stand, add 200 µL of freshly prepared 80% EtOH to the well without disrupting the beads. Incubate for 30 s, then remove and discard ethanol. Repeat for a total of 2 washes.
    8. Add 11 µL of Elution Buffer to each well and pipette up and down 10 times to mix. Incubate at room temperature for 2 min, and then transfer to the magnetic stand until the solution clears (1–5 min).
    9. Transfer 8.5 µL of the supernatant from the well to a new well on the same plate or to a new plate. Add 8.5 µL of the Elute, Primer, Fragment High mix to each well containing sample. Pipette up and down 10 times to mix thoroughly.
    10. Seal plate and incubate for 8 min at 94 °C on a pre-programmed, pre-heated thermocycler block. Remove from thermocycler when it reaches 4 °C and centrifuge briefly.
      NOTE: Proceed immediately to the Synthesize First Strand cDNA protocol.
  2. Synthesize cDNA
    1. For each sample being prepared, mix 9 µL First Strand Synthesis Mix with 1 µL of reverse transcriptase (see Table of Materials). Add 8 µL of the mixture to the sample. Pipette up and down 6 times to mix.
      1. Seal plate and incubate on a pre-programmed, pre-heated thermocycler block using the following parameters: 25 °C for 10 min, 42 °C for 15 min, 70 °C for 15 min, 4 °C hold. Proceed immediately to second strand synthesis.
    2. Add 5 µL of Resuspension buffer to each sample followed by 20 µL of Second Strand Marking master mix. Pipette the entire volume up and down 6 times.
    3. Seal plate and incubate on a pre-programmed, pre-heated thermocycler block set to 16 °C for 1 h. Following incubation, remove the plate from the thermocycler and let it equilibrate to room temperature.
    4. Vortex PCR purification beads (see Table of Materials) and add 90 µL beads to each well of sample. Pipette up and down 10 times to mix thoroughly. Incubate at room temperature for 10 min.
    5. Transfer the bead/sample mix to the magnetic stand and incubate for 5 min or until liquid clears. Remove and discard supernatant.
    6. Add 200 µL of 80% EtOH to each sample. Incubate samples on the magnetic stand at room temperature for 30 s. Discard supernatant. Repeat 1x.
    7. Allow beads to dry at room temperature for 6 min, and then remove from magnetic stand.
    8. Resuspend beads in 19.5 µL of Resuspension buffer. Pipette up and down 10 times to mix thoroughly. Incubate at room temperature for 2 min, then transfer to the magnetic stand and incubate for an additional 1 min or until the liquid clears.
    9. Transfer 17.5 µL of supernatant to new well/new plate.
      NOTE: If not proceeding immediately, samples can be stored at -20 °C for up to 7 days.
  3. Library preparation
    1. Add 12.5 µL of A-Tailing Mix to each well containing supernatant. Pipette the entire volume up and down 10 times to mix.
    2. Incubate the reaction on a pre-programmed, pre-heated thermocycler block set to 37 °C using the following parameters: 37 °C for 30 min, 70 °C for 5 min, 4 °C hold. When samples reach 4 °C, proceed immediately to adapter ligation.
    3. To each sample add 2.5 µL of Resuspension buffer, 2.5 µL of a unique RNA adapter, and 2.5 µL of Ligation Mix. Pipette up and down 10x to mix.
    4. Incubate samples on a pre-programmed, pre-heated thermocycler block at 30 °C for 10 min.
    5. Add 5 µL of Stop Ligation buffer to each sample and pipette up and down to mix.
    6. Add 42 µL of mixed PCR purification beads to each sample and mix thoroughly. Follow steps 3.2.6 through 3.2.10, but change the resuspension volume to 52 µL and final elution volume to 50 µL.
    7. Repeat the PCR purification bead protocol again with the 50 µL elution from step 3.3.6, but change the resuspension volume to 22 µL and final elution volume to 20 µL.
      NOTE: If not proceeding immediately, samples can be stored at -20 °C for up to 7 days.
    8. Add 5 µL of PCR Primer Cocktail and 25 µL of PCR Master Mix to each sample. Mix by pipetting up and down 10 times. Incubate the reaction on a pre-programmed, pre-heated thermocycler block using the following parameters: 98 °C for 30 s; then 8 cycles of 98 °C for 10 s, 60 °C for 30 s, and 72 °C for 30 s; then 72 °C for 5 min, then 4 °C hold.
      NOTE: Optimization of the total number of PCR cycles may be needed to generate sufficient amounts of library for sequencing.
    9. Follow the protocol for PCR bead purification (steps 3.2.4 through 3.2.9), except add 50 µL of well-mixed PCR purification beads and change the resuspension volume to 32.5 µL with a final elution volume of 30 µL.
      NOTE: Samples should be stored at -20 °C.
  4. Quantification and Quality Control using a nucleic acid analyzer
    1. Allow tapes and reagents to equilibrate at room temperature for 30 min.
    2. Mix 2 µL of library with 2 µL of HS D1000 buffer, and add to a compatible well plate.
    3. Seal tightly with compatible foil seal, and vortex for 1 min at 2,000 rpm.
    4. Spin down and load plate on the analyzer following software prompts.
      NOTE: Libraries should be approximately 260 bp in size.

4. Data Analysis Workflow

  1. Sequence RNA-seq libraries (see Table of Materials) to generate 82 bp paired-end reads. Convert raw sequencing data in the form of basecall files (.bcl) to FASTQs using the bcl2fastq tool (v0.2.19).
  2. Detect circRNAs.
    NOTE: Based on previously reported evidence that an ensemble circRNA detection approach performs better compared to using a single detection tool21,22, we suggest using multiple tools for circRNA detection. Here, circRNAs were identified using six existing circRNA prediction algorithms: find_circ, CIRI, CIRCexplorer, Mapsplice, KNIFE, and DCC, applying the recommended parameter settings for each algorithm.
    1. Download and install each circRNA detection algorithm on a Linux high performance computing cluster using the instructions provided by the developers.
    2. Align RNA-seq FASTQs against the reference genome (GRCh37), utilizing the aligner recommended for each tool.
    3. Following alignment, execute circRNA detection algorithms by applying their respective recommended parameter settings.
    4. Each tool will output a multi-column results file with the list of detected circRNAs, extract the circRNA co-ordinates and the number of supporting reads from this in order to quantify the number of candidates detected in each sample/test condition.
  3. Convert the circRNA coordinates output by CIRI, Mapsplice, and DCC to 0-based coordinates to be consistent with the other three algorithms.
  4. Select circRNAs with two or more supporting reads or downstream analyses and comparisons. Table 1 summarizes all the parameters evaluated in our study along with the total number of sequencing reads generated for each sample.
  5. For each sample/test condition, count the number of detected circRNAs normalized to the number of mapped reads generated for that library, per million. Summarize the results across the various tools/samples in box plots, as detailed in the Representative Results.

Representative Results

Data generated using a commercially available universal control RNA (UC) and using two library preparation kits, both of which include a ribo-depletion step in their protocols, was first assessed. Using an analytical workflow (Data analysis workflow, section 4), overall, a higher number of circRNAs was detected in the TruSeq datasets compared to the Kapa ones (Figure 1). Although the ribosomal RNA (rRNA) percentages were below 5% in datasets from both kits for lower input amounts (1, 2 ug), Kapa datasets had higher rRNA content for 4, 5, and 10 ug inputs (Table 2). Hence, based on the number of detected circRNAs and rRNA depletion efficiency, further experiments were performed using the TruSeq kit.

Next, the significance of RNase R pre-treatment was tested by comparing the data generated from RNase R pre-treated and non-pre-treated libraries. To this end, total RNA was extracted from the MG of healthy elderly individuals and sequencing data generated from libraries with (N = 3) and without (N = 3) pre-treatment using RNase R23 was compared. A higher number of circRNAs was consistently identified in the pre-treated libraries compared to the non-pre-treated ones (Figure 2). This is expected since pre-treatment removes linear RNAs, thus enriching for circRNA species.

Thirdly, the amount of input RNA that would be optimal for detecting a higher diversity of circRNAs was tested. Libraries were prepared using 1, 2, and 4 µg of total input RNA which was extracted from MG, OC, and SF brain regions, and as well as UC RNA. Comparing the abundance of circRNAs detected from each library, the highest diversity of circRNA species was observed when using 4 µg input RNA compared to 2 and 1 µg (Figure 3), as reflected by the number of unique circRNAs identified. One caveat to note is that although various incubation times during RNase R treatment were not tested, a trend whereby an increasing number of circRNAs were detected across total RNA inputs of 1 to 4 µg was observed when controlling for all other parameters.

This optimized protocol was then applied across multiple tissue types to compare circRNA abundances. Five brain regions, including BC, MG, OC, IP, and SF, from four healthy elderly individuals, were tested, along with four other tissue types, including LV, LU, LN and PA, from six healthy donors. Overall, a higher abundance of circRNAs was observed in the brain compared to other tissue types (Figure 4), as has been previously reported19,20.

Figure 1
Figure 1: CircRNA detection using TruSeq vs. Kapa total RNA kits. Sequencing data was generated for UC RNA using two separate total RNA library preparation kits, each with 1, 2, 4, 5, and 10 µg input RNA and ribonuclease R (RNase R) pre-treatment. The number of circRNAs detected by the tools in each sample was normalized to the number of mapped reads, per million (Y-axis). Please click here to view a larger version of this figure.

Figure 2
Figure 2: CircRNA detection with and without RNase R pre-treatment. Sequencing data generated using the TruSeq kit was used to compare the impact of RNase R pre-treatment. RNA was extracted from the middle temporal gyrus (MG) of healthy elderly controls for this analysis. The normalized number of circRNAs detected (Y-axis) was calculated similarly to Figure 1. RNase R+ = pre-treated with RNase R, RNase R- = not pre-treated with RNase R. Please click here to view a larger version of this figure.

Figure 3
Figure 3: CircRNA detection using varying amount of input RNA. Using RNA extracted from MG, occipital cortex (OC), and superior frontal gyrus (SF), as well as UC RNA, the number of unique circRNAs detected when using 1, 2, and 4 µg of input RNA, each whose library was constructed using the TruSeq kit and RNase R pre-treatment, was compared. The normalized number of circRNAs detected (Y-axis) was calculated similarly to Figure 1. Please click here to view a larger version of this figure.

Figure 4
Figure 4: CircRNA detection in brain vs. other tissue types. CircRNA enriched datasets using RNA extracted from various brain regions including cerebellum (BC), inferior parietal lobe (IP), MG, OC, and SF, as well as four other tissue types including liver (LV), lung (LU), lymph node (LN), and pancreas (PA) were generated. CircRNA enrichment was carried out using the Illumina TruSeq kit with RNase R pre-treatment and 4 µg of total input RNA. Box plots represent the number of circRNAs detected by at least three of the six tools across the samples from each brain region/tissue type. Please click here to view a larger version of this figure.

Test # Parameter evaluated Test conditions Sample source Input amounts/conditions/samples tested Total number of sequencing reads
1 Library preparation kit Illumina TruSeq Stranded Total RNA vs. the Roche Kapa Total RNA kits UC TruSeq: 1 µg 8,91,46,128
TruSeq: 2 µg 7,93,90,202
TruSeq: 4 µg 6,66,12,238
TruSeq: 5 µg 7,88,56,902
TruSeq: 10 µg 6,61,06,874
Kapa: 1 µg 8,83,95,496
Kapa: 2 µg 10,66,59,272
Kapa: 4 µg 10,62,34,954
Kapa: 5 µg 7,47,75,914
Kapa: 10 µg 11,00,68,504
2 Pre-treatment RNase R pre-treated vs. non pre-treated MG Pair1: MG_1 (RNase R+) 10,76,09,934
Pair1: MG_5 (RNase R-) 9,62,15,516
Pair2: MG_2 (RNase R+) 9,68,40,790
Pair2: MG_6 (RNase R-) 10,16,09,754
Pair3: MG_3 (RNase R+) 11,15,76,344
Pair3: MG_7 (RNase R-) 11,13,14,114
3 Total RNA input 1 µg vs. 2 µg vs. 4 µg MG, OC, SF, UC MG: 1 µg 12,00,94,758
MG: 2 µg 11,64,75,728
MG: 4 µg 12,13,15,232
OC: 1 µg 11,11,18,120
OC: 2 µg 11,53,25,492
OC: 4 µg 11,49,13,266
SF: 1 µg 12,27,24,142
SF: 2 µg 9,39,33,288
SF: 4 µg 12,33,31,474
UC: 1 µg 9,24,48,120
UC: 2 µg 12,58,15,354
UC: 4 µg 12,56,92,534
4 Tissue types Brain regions vs. other tissue types BC, MG, OC, SF, IP, LU, LV, LN, PA BC_1 10,72,08,904
BC_2 11,18,33,362
BC_3 9,61,25,856
BC_4 9,62,77,094
IP_1 9,86,56,506
IP_2 11,35,95,746
IP_3 12,87,81,536
IP_4 9,59,81,446
MG_1 10,76,09,934
MG_2 9,68,40,790
MG_3 11,15,76,344
MG_4 10,05,39,028
OC_1 8,85,47,042
OC_2 12,09,83,142
OC_3 10,26,55,452
OC_4 10,84,49,330
SF_1 8,76,21,824
SF_2 14,50,57,894
SF_3 11,01,52,030
SF_4 9,67,24,472
LN_1 15,02,94,816
LN_2 8,33,30,187
LN_3 11,30,96,032
LN_4 10,78,38,278
LU_1 16,05,27,595
LU_2 8,94,30,799
LU_3 9,14,51,858
LV_1 9,72,18,369
LV_2 10,54,16,880
LV_3 8,86,53,148
LV_4 8,61,02,943
LV_5 12,87,88,483
LV_6 11,87,76,622
PA_1 8,79,20,160
PA_2 7,82,36,741
PA_3 10,21,24,209
PA_4 11,53,22,926

Table 1: Sample and test conditions summary. Summary of all parameters and test conditions evaluated in this study, along with the total number of sequencing reads generated for each sample. UC = universal control, MG = middle temporal gyrus, OC = occipital cortex, BC = cerebellum, IP = inferior parietal lobe, SF = superior frontal gyrus, LU = lung, LV = liver, LN = lymph node, PA = pancreas, RNase R = ribonuclease R, RNase R+ = pre-treated with RNase R, RNase R- = not pre-treated with RNase R.

Sample source Input amounts/samples tested Percent rRNA
UC TruSeq: 1 µg 5.53%
TruSeq: 2 µg 4.11%
TruSeq: 4 µg 4.38%
TruSeq: 5 µg 3.21%
TruSeq: 10 µg 3.74%
Kapa: 1 µg 5.57%
Kapa: 2 µg 4.56%
Kapa: 4 µg 9.67%
Kapa: 5 µg 12.69%
Kapa: 10 µg 15.59%

Table 2: rRNA percentages in TruSeq vs. Kapa libraries.

Discussion

In this study, two commercially available library preparation kits, pre-treatment options, and input RNA amounts were tested in order to optimize a circRNA enrichment protocol for construction of circRNA sequencing libraries. Based on this study’s assessments, a number of key aspects and critical steps in creating circRNA sequencing libraries are apparent. Our evaluation confirms the utility of RNase R pre-treatment, as reflected by the increased number of circRNAs detected. Overall, a higher diversity of circRNAs when using the Illumina TruSeq library kit with RNase R pre-treatment and 4 µg of input RNA was observed. These results align with previous findings that the RNase R enrichment step is beneficial for detection of circRNAs2.

Additional key aspects of circRNA library construction include the amount of total RNA that is available for sequencing as well as the type of tissue that the RNA extracted from. Although a 4 µg input of total RNA was found to yield the highest number of detected circRNAs, the majority of RNAseq studies utilize <=1 µg of total RNA such that obtaining higher amounts may be challenging, particularly for analysis of human specimens. Identification of circRNAs remains feasible for lower input amounts, but it is relevant to acknowledge that the specificity of the analysis may be impacted. This study further highlights the higher number of circRNAs that are detected in human brain compared to other tissues, as previously reported19,20. It is thus critical to acknowledge the differential expression of circRNAs across different tissue types. Furthermore, additional research in the context of disease will be important for shedding light into how circRNAs may be involved in pathogenic processes.

The performance of the two assessed RNA library preparation kits also highlights that although different commercially-available kits may demonstrate significant similarities, differences are still observed when analyzing circRNAs. Two major findings from this comparison include decreased rRNA depletion and a lower number of circRNAs identified using one approach. While one possibility is that a higher abundance of rRNA in a sample may interfere with creating sequence-able circRNA library molecules, this finding emphasizes the need to assess seemingly similar kits, particularly when reagents are proprietary.

Although the data presented here provides insights into the existence and abundance of circRNAs in various tissue types, this study has a few technical limitations. Firstly, while RNase R treatment reduces the population of linear RNAs in a sample, it is not well understood if this depletion step introduces any biases in circRNA detection and whether it may deplete circRNAs. Previous studies have reported that in some cases, circRNAs are sensitive to RNase R2,24,25. Secondly, it is unclear if increasing the total RNA input above 4 µg will result in a linear increase in the number of identified circRNAs. As previously mentioned, available total RNA is often limited in research studies so lower input amounts were considered here. Of note, circRNAs can still be detected when using lower inputs but it is important to acknowledge that lower inputs are associated with detection of a lower number of circRNAs. Thirdly, the optimized protocol presented here utilizes RNAs extracted from a specific set of tissues. Given the variable distribution of circRNA expression across different tissue types, the association between total RNA input amounts and the number of identified circRNAs may differ across tissues.

With increasing interests in understanding the biological role of circRNAs, new strategies are also being developed to better enable characterization and identification of circRNAs. One new bioinformatics approach enables identification of circRNAs that may be lowly expressed through reconstruction of full-length circRNAs, and also enables quantification of expression of specific circRNA isoforms26. This approach takes advantage of features described as reverse overlap (RO) reads that may occur on the 3’ or 5’ ends of circRNA library molecules. Development of new strategies for identifying circRNAs, encompassing both laboratory approaches and bioinformatics tools, will contribute to the field’s understanding of the function and impact of circRNAs.

Offenlegungen

The authors have nothing to disclose.

Acknowledgements

We are grateful to the Banner Sun Health Research Institute Brain and Body Donation Program (BBDP) of Sun City, Arizona for the provision of human brain tissues. The BBDP has been supported by the National Institute of Neurological Disorders and Stroke (U24 NS072026 National Brain and Tissue Resource for Parkinson’s Disease and Related Disorders), the National Institute on Aging (P30AG19610 Arizona Alzheimer’s Disease Core Center), the Arizona Department of Health Services (contract 211002, Arizona Alzheimer’s Research Center), the Arizona Biomedical Research Commission (contracts 4001, 0011, 05-901 and 1001 to the Arizona Parkinson's Disease Consortium) and the Michael J. Fox Foundation for Parkinson’s Research27. This study was also supported by the DHS and the State of Arizona (ADHS grant # ADHS14-052688). We also thank Andrea Schmitt (Banner Research) and Cynthia Lechuga (TGen) for administrative support.

Materials

1000 µL pipette tips Rainin GP-L1000F
20 µL pipette tips Rainin SR L 10F
200 µL pipette tips Rainin SR L 200F
2200 TapeStation Accessories (foil covers) Agilent Technologies 5067-5154
2200 TapeStation Accessories (tips) Agilent Technologies 5067-5153
Adhesive Film for Microplates VWR 60941-064
AMPure XP Beads 450 mL Beckman Coulter A63882 PCR purification
Eppendorf twin.tec 96-Well PCR Plates VWR 951020401
High Sensitivity D1000 reagents Agilent Technologies 5067-5585
High Sensitivity D1000 ScreenTape Agilent Technologies 5067-5584
HiSeq 2500 Sequencing System Illumina SY-401-2501
HiSeq 3000/4000 PE Cluster Kit Illumina PE-410-1001
HiSeq 3000/4000 SBS Kit (150 cycles) Illumina FC-410-1002
HiSeq 4000 Sequencing System Illumina SY-401-4001
HiSeq PE PE Rapid Cluster Kit v2 Illumina PE-402-4002
HiSeq Rapid SBS Kit v2 (50 cycle) Illumina FC-402-4022
Kapa Total RNA Kit Roche KK8400
Molecular biology grade ethanol Fisher Scientific BP28184
Qubit Assay Tubes Supply Center by Thermo Fischer Q32856
Qubit dsDNA High Sense Assay Kit Supply Center by Thermo Fischer Q32854
RNA cleanup and concentrator – 5 Zymo RCC-100 Contains purification columns, collection tubes
RNAClean XP beads Beckman Coulter Genomics RNA Cleanup beads
Rnase R Lucigen RNR07250
SuperScript II Reverse Transcriptase 10,000 units ThermoFisher (LifeTech) 18064014
TapeStation 2200 Agilent Technologies Nucleic Acid analyzer
TElowE VWR 10128-588
TruSeq Stranded Total RNA Library Prep Kit Illumina 20020596 Kit used in section 3
Two-Compartment Divided Tray VWR 3054-1004
UltraPure Water Supply Center by Thermo Fischer 10977-015
Universal control RNA Agilent 740000

Referenzen

  1. Salzman, J., Gawad, C., Wang, P. L., Lacayo, N., Brown, P. O. Circular RNAs are the predominant transcript isoform from hundreds of human genes in diverse cell types. PloS One. 7 (2), e30733 (2012).
  2. Jeck, W. R., et al. Circular RNAs are abundant, conserved, and associated with ALU repeats. RNA. 19 (2), 141-157 (2013).
  3. Memczak, S., et al. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature. 495 (7441), 333-338 (2013).
  4. Sanger, H. L., Klotz, G., Riesner, D., Gross, H. J., Kleinschmidt, A. K. Viroids are single-stranded covalently closed circular RNA molecules existing as highly base-paired rod-like structures. Proceedings of the National Academy of Sciences of the United States of America. 73 (11), 3852-3856 (1976).
  5. Nigro, J. M., et al. Scrambled exons. Cell. 64 (3), 607-613 (1991).
  6. Salzman, J., Chen, R. E., Olsen, M. N., Wang, P. L., Brown, P. O. Cell-type specific features of circular RNA expression. PLoS Genetics. 9 (9), e1003777 (2013).
  7. Du, W. W., et al. Foxo3 circular RNA promotes cardiac senescence by modulating multiple factors associated with stress and senescence responses. European Heart Journal. 38 (18), 1402-1412 (2016).
  8. Capel, B., et al. Circular transcripts of the testis-determining gene Sry in adult mouse testis. Cell. 73 (5), 1019-1030 (1993).
  9. Hansen, T. B., et al. miRNA-dependent gene silencing involving Ago2-mediated cleavage of a circular antisense RNA. The EMBO Journal. 30 (21), 4414-4422 (2011).
  10. Hansen, T. B., et al. Natural RNA circles function as efficient microRNA sponges. Nature. 495 (7441), 384-388 (2013).
  11. Li, Z., et al. Exon-intron circular RNAs regulate transcription in the nucleus. Nature Structural & Molecular Biology. 22 (3), 256-264 (2015).
  12. Tan, W. L., et al. A landscape of circular RNA expression in the human heart. Cardiovascular Research. 113 (3), 298-309 (2016).
  13. Zhong, Z., Lv, M., Chen, J. Screening differential circular RNA expression profiles reveals the regulatory role of circTCF25-miR-103a-3p/miR-107-CDK6 pathway in bladder carcinoma. Scientific Reports. 6, 30919 (2016).
  14. Gao, Y., Wang, J., Zhao, F. CIRI: an efficient and unbiased algorithm for de novo circular RNA identification. Genome Biology. 16, 4-014-0571-0573 (2015).
  15. Wang, K., et al. MapSplice: accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Research. 38 (18), e178 (2010).
  16. Szabo, L., et al. Statistically based splicing detection reveals neural enrichment and tissue-specific induction of circular RNA during human fetal development. Genome Biology. 16, 126-015-0690-0695 (2015).
  17. Cheng, J., Metge, F., Dieterich, C. Specific identification and quantification of circular RNAs from sequencing data. Bioinformatics. 32 (7), 1094-1096 (2016).
  18. Zhang, X. O., et al. Complementary sequence-mediated exon circularization. Cell. 159 (1), 134-147 (2014).
  19. Rybak-Wolf, A., et al. Circular RNAs in the mammalian brain are highly abundant, conserved, and dynamically expressed. Molecular Cell. 58 (5), 870-885 (2015).
  20. Ji, P., et al. Expanded Expression Landscape and Prioritization of Circular RNAs in Mammals. Cell Reports. 26 (12), 3444-3460 (2019).
  21. Hansen, T. B., Venø, M. T., Damgaard, C. K., Kjems, J. Comparison of circular RNA prediction tools. Nucleic Acids Research. 44 (6), e58 (2016).
  22. Zeng, X., Lin, W., Guo, M., Zou, Q. A comprehensive overview and evaluation of circular RNA detection tools. PLoS Comput Biol. 13 (6), e1005420 (2017).
  23. Sekar, S., et al. ACValidator: a novel assembly-based approach for in silico validation of circular RNAs. bioRxiv. , (2019).
  24. Westholm, J. O., et al. Genome-wide analysis of drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation. Cell Reports. 9 (5), 1966-1980 (2014).
  25. Szabo, L., Salzman, J. Detecting circular RNAs: bioinformatic and experimental challenges. Nature Reviews Genetics. 17 (11), 679-692 (2016).
  26. Zheng, Y., Ji, P., Chen, S., Hou, L., Zhao, F. Reconstruction of full-length circular RNAs enables isoform-level quantification. Genome Medicine. 11 (1), 2 (2019).
  27. Beach, T. G., et al. Arizona study of aging and neurodegenerative disorders and brain and body donation program. Neuropathology. 35 (4), 354-389 (2015).

Play Video

Diesen Artikel zitieren
Sekar, S., Geiger, P., Cuyugan, L., Boyle, A., Serrano, G., Beach, T. G., Liang, W. S. Identification of Circular RNAs using RNA Sequencing. J. Vis. Exp. (153), e59981, doi:10.3791/59981 (2019).

View Video