The spatial arrangement of RNA-binding proteins on a transcript is a key determinant of post-transcriptional regulation. Therefore, we developed individual-nucleotide resolution UV crosslinking and immunoprecipitation (iCLIP) that allows precise genome-wide mapping of the binding sites of an RNA-binding protein.
The unique composition and spatial arrangement of RNA-binding proteins (RBPs) on a transcript guide the diverse aspects of post-transcriptional regulation1. Therefore, an essential step towards understanding transcript regulation at the molecular level is to gain positional information on the binding sites of RBPs2.
Protein-RNA interactions can be studied using biochemical methods, but these approaches do not address RNA binding in its native cellular context. Initial attempts to study protein-RNA complexes in their cellular environment employed affinity purification or immunoprecipitation combined with differential display or microarray analysis (RIP-CHIP)3-5. These approaches were prone to identifying indirect or non-physiological interactions6. In order to increase the specificity and positional resolution, a strategy referred to as CLIP (UV cross-linking and immunoprecipitation) was introduced7,8. CLIP combines UV cross-linking of proteins and RNA molecules with rigorous purification schemes including denaturing polyacrylamide gel electrophoresis. In combination with high-throughput sequencing technologies, CLIP has proven as a powerful tool to study protein-RNA interactions on a genome-wide scale (referred to as HITS-CLIP or CLIP-seq)9,10. Recently, PAR-CLIP was introduced that uses photoreactive ribonucleoside analogs for cross-linking11,12.
Despite the high specificity of the obtained data, CLIP experiments often generate cDNA libraries of limited sequence complexity. This is partly due to the restricted amount of co-purified RNA and the two inefficient RNA ligation reactions required for library preparation. In addition, primer extension assays indicated that many cDNAs truncate prematurely at the crosslinked nucleotide13. Such truncated cDNAs are lost during the standard CLIP library preparation protocol. We recently developed iCLIP (individual-nucleotide resolution CLIP), which captures the truncated cDNAs by replacing one of the inefficient intermolecular RNA ligation steps with a more efficient intramolecular cDNA circularization (Figure 1)14. Importantly, sequencing the truncated cDNAs provides insights into the position of the cross-link site at nucleotide resolution. We successfully applied iCLIP to study hnRNP C particle organization on a genome-wide scale and assess its role in splicing regulation14.
1. UV cross-linking of tissue culture cells
2. Bead preparation
3. Cell lysis and partial RNA digestion
4. Immunoprecipitation
5. Dephosphorylation of RNA 3'ends
6. Linker ligation to RNA 3′ ends
7. RNA 5' end labelling
8. SDS-PAGE and membrane transfer
9. RNA isolation
10. Reverse transcription
11. Gel purification of cDNA
12. Ligation of primer to the 5’end of the cDNA
13. PCR amplification
14. Linker and primer sequences
Pre-adenylated 3′ linker DNA:
[We order the DNA adapter from IDT and then make aliquots of 20μM.]
15. Representative Results:
Prior to sequencing of the iCLIP library, the success of the experiment can be monitored at two steps: the autoradiograph of the protein-RNA complex after membrane transfer (step 8.5) and the gel image of the PCR products (step 13.4). In the autoradiograph of the low-RNase samples, diffuse radioactivity should be seen above the molecular weight of the protein (Figure 2, sample 4). For high-RNase samples, this radioactivity is focused closer to the molecular weight of the protein (Figure 2, sample 3). When no antibody is used in the immunoprecipitation, no signal should be detected (Figure 2, samples 1 and 2). Further important controls for specificity of the immunoprecipitation either omit UV irradiation or use cells that do not express the protein of interest14.
The gel image of the PCR products (step 13.4) should show a size range that corresponds to the cDNA fraction (high, medium or low) purified in step 11.4 (Figure 4, lanes 4-6). Note that the PCR primers P3Solexa and P5Solexa introduce an additional 76 nt to the size of the cDNA. If no antibody is used during the immunoprecipitation, no corresponding PCR products should be detected (Figure 4, lanes 1-3). Primer dimer product can appear at about 140 nt.
For representative results of high-throughput sequencing and subsequent bioinformatic analyses see14.
Figure 1. Schematic representation of the iCLIP protocol. Protein-RNA complexes are covalently cross-linked in vivo using UV irradiation (step 1). The protein of interest is purified together with the bound RNA (steps 2-5). To allow for sequence-specific priming of reverse transcription, an RNA adapter is ligated to the 3′ end of the RNA, whereas the 5′ end is radioactively labelled (steps 6 and 7). Cross-linked protein-RNA complexes are purified from free RNA using SDS-PAGE and membrane transfer (step 8). The RNA is recovered from the membrane by digesting the protein with proteinase K leaving a polypeptide remaining at the cross-link nucleotide (step 9). Reverse transcription (RT) truncates at the remaining polypeptide and introduces two cleavable adapter regions and barcode sequences (step 10). Size selection removes free RT primer before circularization. The following linearization generates suitable templates for PCR amplification (steps 11-15). Finally, high-throughput sequencing generates reads in which the barcode sequences are immediately followed by the last nucleotide of the cDNA (step 16). Since this nucleotide locates one position upstream of the cross-linked nucleotide, the binding site can be deduced with high resolution.
Figure 2. Autoradiograph of cross-linked hnRNP C-RNA complexes using denaturing gel electrophoresis and membrane transfer. hnRNP C-RNA complexes were immuno-purified from cell extracts using an antibody against hnRNP C (α hnRNP C, samples 3 and 4). RNA was partially digested using low (+) or high (++) concentration of RNase. Complexes shifting upwards from the size of the protein (40 kDa) can be observed (sample 4). The shift is less pronounced when high concentrations of RNase were used (sample 3). The radioactive signal disappears when no antibody was used in the immunoprecipitation (samples 1 and 2).
Figure 3. Schematic 6% TBE-urea gel (Invitrogen) to guide the excision of iCLIP cDNA products. The gel is run for 40 min at 180 V leading to a reproducible migration pattern of cDNAs and dyes (light and dark blue) in the gel. Use a razor blade to cut (red line) the high (H), medium (M), and low (L) cDNA fractions. Start by cutting in the middle of the light blue dye and immediately above the mark on the plastic gel cassette. Divide the medium and low fractions and trim the high fraction about 1 cm above the light blue dye. Use vertical cuts guided by the pockets and the dye to separate the different lanes (in this example 1-4). The marker lane (m) can be stained and imaged to control sizes after the cutting. Fragment sizes are indicated on the right.
Figure 4. Analysis of PCR-amplified iCLIP cDNA libraries using gel electrophoresis. RNA recovered from the membrane (Figure 1) was reverse transcribed and size-purified using denaturing gel electrophoresis (Figure 2). Three size fractions of cDNA (high [H]: 120-200 nt, medium [M]: 85-120 nt and low [L]: 70-85 nt) were recovered, circularized, re-linearized and PCR-amplified. PCR products of different size distribution can be observed as a result of the different sizes of the input fractions. Since the PCR primer introduces 76 nt to the cDNA, sizes should range between 196-276 nt for high, 161-196 nt for medium and 146-161 nt for low size fractions. PCR products are absent when no antibody was used for the immunoprecipitation (lanes 1-3).
Since the iCLIP protocol contains a diverse range of enzymatic reactions and purification steps, it is not always easy to identify a problem when an experiment fails. In order to control for the specificity of identified RNA cross-link sites, one or more negative controls should be maintained throughout the complete experiment and subsequent computational analyses. These controls can be the no-antibody sample, the non-cross-linked cells, or immunoprecipitation from knockout cells or tissue. Ideally, these control experiments should not purify any protein-RNA complexes, and therefore should give no signal on the SDS-PAGE gel, and no detectable products after PCR amplification. High-throughput sequencing of these control libraries should return very few unique sequences. Knockdown cells are not recommended as a sequencing control, since the resulting sequences still correspond to cross-link sites of the same protein, which is purified from knockdown cells in smaller quantities.
Precautions should also be taken to avoid contamination with PCR products from previous experiments. The best way to minimize this problem is to spatially separate pre- and post-PCR steps. Ideally, the analysis of the PCR products and all subsequent steps should be performed in a separate room. Moreover, each member of the laboratory should use their own set of buffers and other reagents. In this way, sources of contamination can be easier identified.
The authors have nothing to disclose.
The authors thank all members of the Ule, Luscombe and Zupan laboratories for discussion and experimental assistance. We thank James Hadfield and Nik Matthews for high-throughput sequencing. We would like to point out that the iCLIP method described here shares several steps with the original CLIP protocol, developed by Kirk Jensen and J.U. in the laboratory of Robert Darnell. This work was supported by the European Research Council grant 206726-CLIP to J.U. and a Long-term Human Frontiers Science Program fellowship to J.K.
For gel electrophoresis and membrane transfer we recommend t he use of XCell SureLock® Mini-Cell and XCell II™ Blot Module Kit CE Mark (Invitrogen, EI0002), which is compatible with the use of the different precast minigels that are specified throughout the protocol. The brand and order number of all materials used is mentioned during the protocol. The list of enzymes used in the protocol is shown in the table below.
Name of reagent | Company | Catalogue # | Comments |
Protein A Dynabeads | Invitrogen | 10001D | use protein G for mouse or goat antibody |
RNase I | Ambion | AM2295 | activity can change from batch to batch |
T4 RNA ligase I | NEB | M0204S | |
PNK | NEB | M0201S | |
proteinase K | Roche | 03115828001 | |
Superscript III reverse transcriptase | Invitrogen | 18080044 | |
Circligase II | Epicentre | CL9021K | |
FastDigest® BamHI | Fermentas | FD0054 | |
AccuPrime™ SuperMix I | Invitrogen | 12342010 | this PCR mix gives the best results in our hands |