Capture Compounds are trifunctional small molecules to reduce the complexity of the proteome by functional reversible small molecule-protein interaction followed by photo-crosslinking and purification. Here we use a Capture Compound with S-adenosyl-L-homocysteine-binding as selectivity function to isolate methyltransferases from an Escherichia coli whole cell lysate and identify them by MS.
There is a variety of approaches to reduce the complexity of the proteome on the basis of functional small molecule-protein interactions such as affinity chromatography 1 or Activity Based Protein Profiling 2. Trifunctional Capture Compounds (CCs, Figure 1A) 3 are the basis for a generic approach, in which the initial equilibrium-driven interaction between a small molecule probe (the selectivity function, here S-adenosyl-L-homocysteine, SAH, Figure 1A) and target proteins is irreversibly fixed upon photo-crosslinking between an independent photo-activable reactivity function (here a phenylazide) of the CC and the surface of the target proteins. The sorting function (here biotin) serves to isolate the CC – protein conjugates from complex biological mixtures with the help of a solid phase (here streptavidin magnetic beads). Two configurations of the experiments are possible: “off-bead” 4 or the presently described “on-bead” configuration (Figure 1B). The selectivity function may be virtually any small molecule of interest (substrates, inhibitors, drug molecules).
S-Adenosyl-L-methionine (SAM, Figure 1A) is probably, second to ATP, the most widely used cofactor in nature 5, 6. It is used as the major methyl group donor in all living organisms with the chemical reaction being catalyzed by SAM-dependent methyltransferases (MTases), which methylate DNA 7, RNA 8, proteins 9, or small molecules 10. Given the crucial role of methylation reactions in diverse physiological scenarios (gene regulation, epigenetics, metabolism), the profiling of MTases can be expected to become of similar importance in functional proteomics as the profiling of kinases. Analytical tools for their profiling, however, have not been available. We recently introduced a CC with SAH as selectivity group to fill this technological gap (Figure 1A).
SAH, the product of SAM after methyl transfer, is a known general MTase product inhibitor 11. For this reason and because the natural cofactor SAM is used by further enzymes transferring other parts of the cofactor or initiating radical reactions as well as because of its chemical instability 12, SAH is an ideal selectivity function for a CC to target MTases. Here, we report the utility of the SAH-CC and CCMS by profiling MTases and other SAH-binding proteins from the strain DH5α of Escherichia coli (E. coli), one of the best-characterized prokaryotes, which has served as the preferred model organism in countless biochemical, biological, and biotechnological studies. Photo-activated crosslinking enhances yield and sensitivity of the experiment, and the specificity can be readily tested for in competition experiments using an excess of free SAH.
1) Preparation of E. coli DH5α Cell Lysate
2) Capture Assay (A), Competition Control (C), Pulldown (PD), Competition Control of Pulldown (CPD), and Combined Capture Assay plus Pulldown (A+PD)
3) SDS-PAGE of Captured Proteins
4) In-gel Tryptic Digest of Proteins and Peptide Extraction from Gel Bands
5) Tryptic Digest of Captured Proteins and Preparation of Peptides for LC-MS/MS
6) NanoLC-MS/MS Analysis
7) Peptide and Protein Identification via Automated Sequence Database Search
8) Representative Results
Table 1:
Protein | ORF | MW/kDa | Description | Substrate | A | C | PD | CPD | A+PD |
Dcm | b1961 | 53.5 | DNA-cytosine MTase | DNA (m5C) | 1 | 0 | 0 | 0 | 1 |
RlmI | b0967 | 44.4 | 23S rRNA m5C1962 MTase | rRNA (m5C) | 17 | 0 | 17 | 0 | 20 |
RlmL | b0948 | 78.9 | 23S rRNA m2G2445 MTase | rRNA (m2G) | 12 | 0 | 0 | 0 | 10 |
TrmB | b2960 | 27.3 | tRNA (guanine-N(7)-)-MTase | tRNA (m7G) | 11 | 0 | 0 | 0 | 13 |
CmoA | b1870 | 27.8 | tRNA (cmo5U34)-MTase | tRNA (mcmo5U) | 7 | 0 | 0 | 0 | 4 |
RsmG | b3740 | 23.4 | 16S rRNA m7G MTase | rRNA (m7G) | 6 | 0 | 1 | 0 | 5 |
RsmH | b0082 | 34.9 | 16S rRNA m4C1402 MTase | rRNA (m4C) | 5 | 0 | 0 | 0 | 7 |
RsmD | b3465 | 21.7 | 16S rRNA m2G966 MTase | rRNA (m2G) | 2 | 0 | 0 | 0 | 2 |
RsmB | b3289 | 48.3 | 16S rRNA m5C967 MTase | rRNA (m5C) | 1 | 0 | 0 | 0 | 0 |
MnmC | b2324 | 74.4 | Bifunctional protein includes tRNA (mnm(5)s(2)U34)-MTase | tRNA (mnm5s2U) | 1 | 0 | 0 | 0 | 0 |
PrmB | b2330 | 35.0 | 50S ribosomal protein L3 Gln150 MTase | protein (Gln) | 13 | 0 | 0 | 0 | 15 |
CheR | b1884 | 32.8 | Chemotaxis protein MTase | protein (Glu) | 0 | 0 | 0 | 0 | 1 |
Cfa | b1661 | 44.9 | Cyclopropane-fatty-acyl-phospholipid synthase | small molecule | 15 | 0 | 0 | 0 | 14 |
Tam | b1519 | 29.0 | Trans-aconitate 2-MTase | small molecule | 2 | 0 | 0 | 0 | 3 |
CysG | b3368 | 50.0 | Siroheme synthase includes uroporphyrinogen-III C-MTase | small molecule | 1 | 0 | 0 | 0 | 2 |
SmtA | b0921 | 29.8 | Protein smtA | (?a) | 7 | 1 | 0 | 0 | 8 |
MtnN | b0159 | 24.4 | 5′-Methylthioadenosine/SAH nucleosidase | small moleculeb | 36 | 0 | 0 | 0 | 39 |
GlnA | b3870 | 51.9 | Glutamine synthetase | small moleculec | 90 | 0 | 0 | 0 | 97 |
RplK | b3983 | 14.9 | 50S ribosomal protein L11 | of protein MTase PrmAd | 2 | 0 | 0 | 0 | 2 |
aNot (fully) characterized
bNo methylation but cleavage of the glycosidic bond of SAH
cNo methylation but binding of SAH into the ATP binding site as shown by CCMS experiments with ATP as competitor (data not shown)
dSubstrate of the 50S ribosomal protein L11 MTase PrmA; reproducible specific identification by CCMS (data not shown)
Table 1: MTases and other selected proteins identified by CCMS experiments. The given numbers denote the unweighted peptide spectral count per protein. Samples are duplicates of those analyzed by SDS-PAGE/silver stain in Figure 2. Much more MTases and other SAH binding proteins are identified in the CCMS assay (A) compared to the pulldown (PD) and SAH specificity is shown by the almost complete absence of these proteins in the competition control (C).
Table 2:
A | C | PD | CPD | A+PD | |
A | 111 (64) | ||||
C | 65 (41) | 107 (46) | |||
PD | 25 (15) | 23 (13) | 61 (17) | ||
CPD | 23 (13) | 22 (12) | 20 (14) | 47 (14) | |
A+PD | 87 (61) | 64 (41) | 23 (14) | 22 (12) | 124 (67) |
Table 2: Total number of identified proteins in the CCMS runs and protein overlap between the runs. The number of proteins identified with at least 2 peptides are given in parentheses. The high reproducibility of the method can be inferred from the high protein overlap (of mainly unspecific proteins) between comparable experiments (A vs. C and especially A vs. A+PD but also PD vs. CPD) especially with the proteins robustly identified with at least 2 peptides. See also Figure 3 for Venn diagrams and Supplementary Table S1 for a list of all identified proteins.
Figure 1A: Chemical structure of the trifunctional Capture Compound (CC). The selectivity function is framed with a droplet, the reactivity function with a star, and the sorting function with a half-moon. The chemically stable S-adenosyl-L-homocysteine (SAH) is the cofactor product of S-adenosyl-L-methionine (SAM) after methyl group transfer by SAM-dependent MTases, for which SAH acts as a product inhibitor.
Figure 1B: CCMS “on-bead” workflow. The CC is bound on the magnetic beads by its sorting function (a), the so formed caproBeads are incubated with the complex protein mixture (b), where a reversible binding equilibrium (c) is established between the selectivity function of the CC and the target proteins. Upon UV irradiation (d), the reactivity function forms a covalent crosslink. After washing the magnetic beads bearing the captured proteins (e), cleavage of the crosslinked CC-protein complexes from the magnetic beads (f), and tryptic digest (g), the captured proteins can be identified by MS analysis of the tryptic peptides.
Figure 2: SDS-PAGE/silver stain analysis of captured proteins (after step f in Figure 1B). The lane description is given on top of the gel (MW: molecular weight marker with the corresponding molecular weights of the marker bands given to the very right; L: 0.25% sample drawn from the E. coli DH5a whole cell lysate before adding the caproBeads in step b in Figure 1B; A: assay with addition of an excess of free SAH after UV irradiation step d in Figure 1B; C: control of assay including an excess of free SAH as competitor during steps c and d in Figure 1B (essential to determine any non-specifically captured proteins); PD: pulldown meaning no UV irradiation step d in Figure 1B and no addition of free SAH; CPD: control of pulldown using SAH as competitor; A+PD: combined assay plus pulldown meaning no addition of free SAH during the workflow). Proteins identified by MS from cut-out protein gel bands after in-gel tryptic digest are given to the very leftt. It is evident that photo-crosslinking enhances yield and sensitivity of the experiment, and the specificity can be readily tested for in competition experiments using an excess of free SAH. See Table 1 for MTases and other selected proteins identified by CCMS experiments of duplicate samples of those shown in the present figure.
Figure 3: Venn diagrams explaning the overlap of identified proteins in CCMS assay (A), competition control (C), and pulldown (PD). Left: Number of MTases and SAH nucleosidase, only, referring to Table 1. Right: Number of all identified proteins referring to Table 2 and Supplementary Table S1. The number of proteins identified with at least 2 peptides are given in parentheses.
The following precautions and comments may be useful when following the described protocol: a) A major advantage of CCs lies in the formation of a covalent bond between the CC and the MTase, as this permits subsequent stringent washing conditions. The covalent crosslink is achieved by a photoreaction triggered by UV light (310 nm max.). Normal overhead light contains only a small fraction of UV, however, protect the SAH-CC from longer exposure to overhead or even sun light up to the controlled and cooled irradiation in the caproBox. b) The biological samples from which the MTases are to be isolated may contain proteins prone to denaturation, consequently it is mandatory to keep the samples cool and to avoid frothing at all times. c) The caproBox cools the samples to 0-4 °C, the lamps emitting the UV light, however, also emit heat. Therefore it is necessary to briefly centrifuge the vials before irradiation, so proteins adhering to the caps or vial walls cannot form precipitation seeds. d) If re-suspension of the magnetic beads (e.g. in the wash solutions) is not possible by hand, shortly apply ultrasound by placing the samples into an ultrasound bath. e) Freshly prepare the 0.2% TFA/60 %ACN solution. We found that otherwise the captured proteins may not be cleaved from the beads. f) The final analysis of captured proteins is carried out by LC-MS/MS. Mass spectrometry is a highly sensitive method. It is necessary to use exclusively LC-MS grade reagents in the final steps (step 2.11 and further). Avoid contamination of the experiments by external protein sources, e.g. keratin originating from dust or from the experimenter. Particularly during the final digestion steps, it is recommended to pay attention to a clean work space, to wear gloves and a lab coat and possibly a hair net or ideally perform the final steps under a clean bench. Prepare the 50 mM ammonium bicarbonate buffer used for tryptic digest in LC MS grade water, filter through 0.22 μm filter, aliquot, store at -20°C, and use each aliquot only once to avoid contaminations. The trypsin solution (sequencing grade, Roche, prepare a 0.5 μg/μl solution by adding 1 mM HCl to lyophilized trypsin) can be stored at 4°C for several weeks. g) To obtain reliable mass spectra it is essential to have a stable spray in ESI-MS/MS analysis. h) For other LC-MS/MS systems than the one used in the present study, measurement parameters and peptide identification algorithms must be adjusted individually.
The following modifications are possible with respect to the described protocol: a) Alternatively to releasing the proteins from the beads by 60% ACN/0.2% TFA (step 2.11), the proteins can be directly tryptically digested within a bead suspension (same volumes as in step 5.1) or, for SDS-PAGE, the proteins can be released by suspending and heating the beads collected after step 2.9 to 95°C for 10 min in SDS sample buffer (both, the whole suspension or only the supernatant can be loaded into the gel pocket). We found that the beads slowly release small amounts of polymer into the aqueous tryptic digest solution during on-bead tryptic digest even after several aqueous wash steps. The polymer contamination interferes with MS peptide identification and can be washed off the beads using 80% ACN (at least three times). After the 80% ACN wash steps, the beads should be washed once with water prior to on-bead tryptic digest. b) Western blots using streptavidin-horseraddish peroxidase and ECL substrate can also be used to visualize successful crosslinking of the biotin containing CC to the proteins. Therefore, either the gel obtained after step 3.2 can be blotted or, because the sensitivity is about 10-fold higher than silver staining of gels, 10 μl samples after step 2.7 may also be analyzed by western blot. Mind that in the latter case endogeneously biotinylated proteins will also be detected besides the proteins artificially biotinylated by the SAH-CC. c) We found that it depends on the special system (lysate, addressed target proteins, selectivity and reactivity function of he CC), whether the “off-bead” configuration, where the crosslinking reaction takes place between free CC and protein in solution 4 or the presently described “on-bead” configuration (Figure 1B) performs better.
In general, the method should also be compatible with any state-of-the art stable isotope protein or peptide labelling technology, or assessment of the capture sample by 2D gel electrophoresis. In the “off-bead” configuration, it is also possible to capture proteins within whole cells (unpublished results). Furthermore, the drug or cofactor binding site of a protein can be outlined by determining the close-by crosslinking position of the CC within the protein sequence by MS peptide sequencing. The binding mode of a small molecule to a protein can be explored by using different chemical attachment positions at the selectivity function and different linker lengths. As shown in the present study, also protein binding partners of the proteins addressed by the selectivity function (RplK as substrate of PrmA) or unknown small molecule protein interactions (SAH to GlnA) can be identified. Summarized, the additional feature of the CCs, the photo-crosslinking reactivity, allows for the isolation and identification of low abundant proteins or functional protein families from complex protein mixtures with high sensitivity and provides scientists with an additional tool for studying small molecule – protein interactions.
The authors have nothing to disclose.
This work was supported by the Human Frontier Science Program Organization (HFSP Award 2007, RGP0058/2007-C). We thank Prof. Richard Roberts for initiating the project and for fruitful discussions.
Material Name | Type | Company | Catalogue Number | Comment |
---|---|---|---|---|
SAH caproKit™ | caprotec bioanalytics GmbH | 1-1010-050 (50 reactions) 1-1010-010 (10 reactions) |
Includes the SAH-CC, SAH competitor, streptavidin coated magnetic beads, capture buffer, and wash buffer | |
caproBox™ | caprotec bioanalytics GmbH | 1-5010-003 (110 V) 1-5010-004 (230 V) |
For reproducible photo-activation while cooling the samples | |
caproMag™ | caprotec bioanalytics GmbH | 1-5100-001 | For easy handling of magnetic particles without pipetting |