Proteins interact with each other and these interactions determine in a large part their functions. Protein interaction partners can be identified at high-throughput in vivo using a yeast fitness assay based on the dihydrofolate reductase protein-fragment complementation assay (DHFR-PCA).
Proteins are the building blocks, effectors and signal mediators of cellular processes. A protein’s function, regulation and localization often depend on its interactions with other proteins. Here, we describe a protocol for the yeast protein-fragment complementation assay (PCA), a powerful method to detect direct and proximal associations between proteins in living cells. The interaction between two proteins, each fused to a dihydrofolate reductase (DHFR) protein fragment, translates into growth of yeast strains in presence of the drug methotrexate (MTX). Differential fitness, resulting from different amounts of reconstituted DHFR enzyme, can be quantified on high-density colony arrays, allowing to differentiate interacting from non-interacting bait-prey pairs. The high-throughput protocol presented here is performed using a robotic platform that parallelizes mating of bait and prey strains carrying complementary DHFR-fragment fusion proteins and the survival assay on MTX. This protocol allows to systematically test for thousands of protein-protein interactions (PPIs) involving bait proteins of interest and offers several advantages over other PPI detection assays, including the study of proteins expressed from their endogenous promoters without the need for modifying protein localization and for the assembly of complex reporter constructs.
Protein interaction networks (PINs) offer a low resolution map of how proteins are functionally organized in the cell1. Each physical connection between two proteins, or protein-protein interaction (PPI), may represent an association that is stable in time, such as those found within protein complexes and that contribute to the structural organization of the cell. These connections may also represent transient associations that regulate the activity, stability, localization and interactions of the two partners. Identifying the physical interaction partners of a given protein therefore provides rich information on the function and regulation of that protein2,3. For these reasons, large efforts have been put towards the mapping of PINs in model organisms, including Escherichia coli4-6, Arabidopsis thaliana7, Saccharomyces cerevisiae8-12, Drosophila melanogaster13,Caenorhabditis elegans14 and Homo sapiens15. These studies have provided important insights into how proteins are organized in the cell and thus key information on proteins with previously unknown functions.
Several strategies have been developed over the years to study PINs. These technologies can be broadly grouped in three categories based on the kind of information they provide on PPIs (reviewed in16-18) . The first one is based on yeast two-hybrid and its derivatives19. These technologies provide information on the direct association between pairs of proteins, which allows constructing binary networks. The second family is based on the affinity purification of bait proteins and the identification of their associated partners, such as affinity purification followed by mass spectrometry20. These approaches identify groups of proteins that are directly or indirectly associated, generally in a stable manner, and are extremely powerful to identify protein complexes. The third approach is based on protein-fragment complementation assays (PCAs)11,21. This approach provides an intermediate level of resolution between the two former approaches, as it allows detecting direct and proximal associations between proteins. Each technique has its own strengths and weaknesses, as recently reviewed18.
The best-described eukaryotic PIN is by far the one of the budding yeast Saccharomyces cerevisiae, in part because its proteome is relatively less complex than those of other model eukaryotes and because high-throughput assays to detect PPIs have first been assayed and are more efficiently implemented in this model organism9-12. A particularly powerful method for the yeast system is the dihydrofolate reductase protein-fragment complementation assay (DHFR-PCA), an assay that has been used in different contexts to study the yeast PIN in standard and perturbed conditions11,22-26. This method relies on a survival assay that enables the detection of direct and near-direct PPIs for a given bait protein at both endogenous expression levels and native subcellular localizations of the interaction partners11,21 in a quantitative manner27. The signal obtained using this assay (i.e. colony size on high-density colony arrays) thus reflects the amount of protein complexes formed between the bait and prey in a cellular environment almost equivalent to the one of wild-type cells. The assay is based on the reconstitution of a reporter enzyme involved in folate metabolism, the dihydrofolate reductase (DHFR), whereby two complementary fragments of the DHFR that are fused to the two proteins of interest are brought into proximity when the two proteins interact, which in turn leads to the reversible reconstitution of the enzyme activity11 and growth of the strain on a medium containing methotrexate (MTX; Figure 1). This compound inhibits the endogenous DHFR enzyme, but not the mutated one used in the assay28. Two collections of PCA strains, one containing ~4,300 MATa strains with an ORF fused to the DHFR F[1,2] fragment and one containing ~4,800 MATα strains with an ORF fused to the DHFR [3] fragment, can be purchased to implement DHFR-PCA at small or large scale in any laboratory. Here, we describe a general but detailed protocol to screen for PPIs between one bait protein and ~4,800 prey proteins using this assay.
1. Construction/verification of Bait Strains
2. Pin-tool Sterilization and Printing Procedures
NOTE: The sterilization procedure described below was optimized for the pin-tools manipulated by the BM3-BC (S&P Robotics) robotic platform, but can be adapted to other platforms as well. This section describes the Pin-tool sterilization and printing procedures that are used to transfer cells from one medium to another for the rest of the protocol. In-house scripts used to perform these routines can be obtained upon request. Note that all steps can be performed without the need of a robotic platform using a manual pin-tool30.
3. Condensation of the DHFR F[3] Collection in 1,536 Arrays Using an Automated Robotic Platform
4. High-throughput DHFR-PCA Procedure
5. Image Analysis
6. Data Analysis
NOTE: Results from image analysis can be processed in a tabulator such as excel or using a scripting language such as R34. The following steps describe the procedure using a custom ImageJ31 script.
7. Validation of Physical Interactors Using Small-scale Experiments
NOTE: Any PPI of particular interest having a score above or close to the applied threshold can be validated using the DHFR-PCA assay in a small-scale experimental design using a growth assay on solid or liquid MTX medium. The steps below show the procedure to manually construct diploid PCA strains and perform spot assays on MTX medium. The experimenter should perform these steps for all necessary controls (Bait-DHFR F[1,2] x L-DHFR F[3], Zipper-linker-DHFR diploid strains and linker-DHFR diploid strain).
Supplementary Table 2 is an example of representative results obtained using the yeast protein Nup82 fused to the DHFR F[1,2] fragment as a bait. The threshold defined with the L-DHFR F[3] controls can be used as an empirical threshold to determine high confidence hits (Figures 3D & 3E). Alternatively, the score ranking can be used to perform Gene Ontology enrichments or other functional analyses36 based on gold standards37. The known physical interactors of the bait can be retrieved from databases like BioGRID35 and overlaid on the data (Figures 3E & 3F). In this example, five out of eight high confidence hits have been previously reported as Nup82 interactors and two are part of the Nup82 subcomplex, Nup116 and Nup159 (Figure 3F & 3G). The other member of the complex, Nsp1, does not show any interaction in our experiment. Two preys, Ade17 and Tef2 (not shown in Figure 3F), had scores above the hard threshold applied, but these are likely to be false positives as they interact with almost any bait protein in PCA screens we have performed (unpublished results). On the other hand, Pex30 may represent a novel physical interactor of Nup82 and we were able to confirm this interaction using DHFR-PCA at low-throughput (Figure 3G). Pex30 is a peroxisomal membrane protein and a few direct interactions have been reported between the nuclear pore complex (NPC) and this organelle. A two-hybrid screen identified two other NPC protein, Nup53 and Asm4 (Nup59), as physical interactors of Pex3038, and a genetic interaction between Pex30 and Nup170 has been reported39. Two other interaction partners detected, Nup120 and Nup85 (Figure 3F & 3G), are not part of the Nup82 sub-complex, illustrating the ability of the DHFR-PCA to detect interactions within and between subcomplexes in larger complexes11.
Figure 1: Engineering yeast strains for high-throughput DHFR-PCA (figure adapted from Leducq et al. 201233). (A) Construction of haploid MATa and MATα strains to fuse Gene1 (G1) and Gene2 (G2) with the DHFR F[1,2]-NatMX and the DHFR F[3]-HPH cassettes, respectively. Cassettes are amplified from plasmids pAG25-DHFR F[1,2] and pAG32-DHFR F[3] with forward primers G1-5’ and G2-5’, and reverse primers G1-3’ and G2-3’, and then inserted in the genome at the 3’ end of the target gene by homologous recombination. The resulting proteins, P1 and P2, are respectively fused to the DHFR F[1,2] fragment (MATa) and the DHFR F[3] fragment (MATα) via a flexible linker. (B) Verification of the construction in (A) is performed by sequencing the junctions between Gene1’s and Gene2’s ORFs and the DHFR cassettes. The constructed PCA strains of opposing mating types are then mated to form a diploid. Diploid strains grow on MTX medium if the two complementary DHFR fragments are brought into proximity by an interaction between P1 and P2, which reconstitutes the activity of the DHFR enzyme. (C) Construction of control diploid PCA strains for DHFR-PCA screens. The negative controls (L-DHFR) are constructed by transforming separately haploid MATa and MATα strains with plasmids p41-linker-DHFR F[1,2] and p41-linker-DHFR F[3]11, respectively. The two strains are mated resulting in a negative control diploid strain in which the DHFR fragments are unable to complement each other (top). Positive controls are constructed using the same approach as for the negative controls, but the plasmids transformed in haploid strains (p41-zipper-linker-DHFR F[1,2] (p41-ZL-DHFR F[1,2]) and p41-zipper-linker-DHFR F[3] (p41-ZL-DHFR F[3])) contain two GCN4 parallel leucine zipper fragments fused to the complementary DHFR fragments, which leads to a strong and constitutive interaction that reconstitutes DHFR activity (bottom). Please click here to view a larger version of this figure.
Figure 2: High-throughput DHFR-PCA procedure (figure adapted from Leducq et al.33). (A) The MATα DHFR F[3] collection is condensed in a 1536 format through two successive rounds of condensation. First, glycerol stock plates are combined by groups of four on selective YPD + HygB medium in a 384 format using the 96 pin-tool. Second, 384 arrays are combined by groups of four on selective YPD + HygB medium in a 1536 format using the 384 pin-tool. (B) Cell lawns of the MATa PCA bait strain are prepared by growing a saturated culture of the bait strain in selective YPD + Nat medium and plating the culture on a YPD + Nat omnitray. (C) These lawns are used to mate the bait strain with the DHFR F[3] collection on YPD medium. Cells are successively transferred twice on YPD + HygB + Nat medium to select for diploids and twice on MTX medium to perform PCA. Growth will only be observed on MTX medium if the DHFR fragments complement each other following an interaction between the bait and the prey proteins. Please click here to view a larger version of this figure.
Figure 3: Data analyses through steps of normalization, determination of a significance threshold and identification of positive interactions. (A) Density distribution of colony size on each plate (log2). (B) Normalization by the median of each image corrects for biases associated with plate-to-plate effects. (C) Heatmap showing the Spearman correlation coefficient among the plates, confirming the reproducibility of the procedure. (D) Distribution of scores for the tested PPIs and the L-DHFR F[3] controls. A hard threshold can be set to the 95th percentile of the L-DHFR F[3] distribution to identify high-confidence PPIs (represented by a dotted vertical line). (E) Rank order distribution of the average score of each prey. Previously reported physical interaction partners of Nup82p in the BioGRID35 are identified by circles and those reported in this study are identified by red dots. The threshold defined in (D) is shown as a dotted line. (F) Network showing the high-confidence PPIs identified in this study. Blue edges show previously reported physical interactors (BioGRID35) and yellow edges show a previously unreported interaction with Pex30. (G) Spot-dilution assay of PCA diploid strains involving Nup82-DHFR F[1,2] and prey-DHFR F[3] pairs identified as physical interactors of Nup82p in the present study. Growth assay was performed in DMSO medium (MTX solvent, left panel) and MTX medium (right panel). Negative controls consisting of Nup82-DHFR F[1,2] – Linker-DHFR F[3] and Linker-DHFR F[1,2] – Linker-DHFR F[3] and a positive control consisting of the strong interaction between two leucine zipper moieties (zipper-DHFR F[1,2] – zipper-DHFR F[3]) were included. Cell growth superior to the negative controls in MTX medium should be interpreted as a physical interaction. Please click here to view a larger version of this figure.
Amino acid | Quantity (g) |
Adenine sulfate* | 0.2 |
L-Tryptophane | 0.4 |
L-Tyrosine | 0.3 |
L-Phenylalanine | 0.5 |
L-Glutamic acid (monosodium salt) | 1.0 |
L-Asparagine | 1.0 |
L-Valine | 1.5 |
L-Threonine | 2.0 |
L-Serine | 3.75 |
Uracile | 0.2 |
L-Histidine HCl | 0.2 |
L-Arginine HCl | 0.2 |
L-Methionine* | 0.2 |
L-Lysine* | 0.2 |
L-Leucine | 0.6 |
* Withdrawn when performing standard PCA (as in this protocol) but can be added for other purposes. |
Table 1: Composition of the 10X -lys/met/ade drop-out in the MTX medium. Quantities are for one liter of 10x drop-out. Asterisks denote the compounds that should be withdrawn for the standard MTX medium, but that can be added for other purposes.
Supplementary Table 1: Combined data from the 12 test plates. Table 1 contains the concatenated data from the ImageJ analysis performed using a custom ImageJ script with each row corresponding to a single position on each 1,536 array. In addition, each row has been annotated with the information about the image file, DHFR collection plate, replicate, ORF and protein name.
Supplementary Table 2: Average normalized growth per strain. Table 2 contains the average Log2 normalized growth for each strain of the collection along with the standard deviation. Filtered preys, hits and known physical interactors are identified in separate columns.
We describe a protocol based on the DHFR-PCA assay enabling the systematic identification of physical interactors for any given bait protein at high-throughput. This protocol can be adapted by screening for more baits, and this at any desired level of replication. We demonstrate the reliability of this protocol by the identification of physical interaction partners for a bait protein involved in the Nuclear Pore Complex: Nup82. Our analysis enabled to find five previously reported interactors and one previously unreported interactor (Figures 3F & 3G), highlighting the ability of the method to study the yeast protein interactome.
The protocol described here includes several critical steps to which the experimenter should pay attention. We recommend to 1) Make sure that the bait DHFR F[1,2] fusion is correct (Figure 1B); this can be achieved by sequencing the construction and measuring proper protein expression using an anti-DHFR F[1,2] or anti-DHFR F[3] antibody; 2) Prior to beginning the screen, it is recommended to verify if any bait of interest exhibits promiscuous interactions in PCA screens. This can be done by performing control screens with baits crossed with the appropriate L-DHFR control or by manually mating the bait with the appropriate L-DHFR control and performing a growth assay in MTX medium. 3) Plates should be poured the day before they are used so that moisture is optimal for cell adherence on the agar surface during the printing process; 4) Source plates should not be used more than four times to transfer enough cells on the destination plate. Increasing the number of copies of the destination plate can be done by successive steps of expansion (e.g. 4 copies -> 16 copies -> 64 copies). Alternatively, cells can be picked at different positions on the lawns or in the colony between different rounds of replication; 5) If several positions are missing after the diploid selection(s), make sure that the source plates were not used too many times in the mating step (steps 4.5 to 4.7); 6) Ensure that the MTX medium contains all essential ingredients at the right concentrations. Indeed, if no growth at all is observed on the MTX medium, it can be either because no interaction is detectable by PCA for the proteins of interest or because the MTX medium was not prepared properly. To ensure that the medium allows growth of strains showing DHFR fragments complementation, a constitutive interaction can be added at empty positions of the collection and used as a positive control such as DHFR-fragments fused to leucine zipper moieties33 (Figure 1C). Parallel tests using the linker-DHFR fragments or the zipper-linker-DHFR fragments will allow to discriminate between conditions that allow all cells to grow (low MTX concentration or bait protein that tend to make false positive interactions, as described below) and conditions that prevent growth of all strains (MTX concentration too high or essential ingredient missing in the medium); 7) Given that PCA is performed through successive rounds of replications from one medium to another, cross-contamination among strains between different plates may occur if, for instance, the pin-tool is not sterilized properly between replication rounds and/or the last water bath (i.e. wet station) in the sterilization procedure is contaminated by colonies of previous replication rounds. Several positions on the arrays are empty and can thus be used as control positions where no growth should be observed to detect cross-contaminations.
Image analysis can be performed using several published softwares (see section 5 of the protocol) or any custom script. In this study, the custom script executes the following steps: 1) The script subtracts pixel values of an empty plate to pixel values of each plate in order to correct for illumination biases. 2) The script converts each background-corrected picture to binary using a pixel value threshold of 10. 3) For each 1,536 positions of each plate, determined by overlaying a rectangle on the edge colonies, the script runs ImageJ “Analyze particles …” function in a circular selection. The circular selection is set with a radius equal to the interval between two positions minus 10 pixels. 4) The script selects the closest particle from the center of the selection and confirms it as a colony if its location is not more than half the interval between two colonies away from the center of the selection. 5) The script measures pixel values of the selected particle on the background-corrected picture. 6) To further correct any remaining background illumination biases, the script subtracts the mean value of all the pixels from the circular selection that were not part of a particle to pixel values of the colony. The sum of these corrected pixel values, stored in the column “IntDenBackSub” of Supplementary Table 1, are used as a measure of colony size.
A critical step within the analysis part is the choice of the significance threshold. Here, we chose a threshold based on the distribution of the negative L-DHFR F[3] controls, but depending on the objective of the screen, such threshold may be too stringent. Indeed, L-DHFR F[3] controls are overexpressed (strong TEF promoter) such that the complementary fragments may spontaneously complement each other and these are thus not representative of the expression of most proteins. This is highlighted by the fact that the distribution of the L-DHFR F[3] controls is higher than the average of the background growth (Figure 3D). Thus, some interactions having scores below this stringent threshold but that are clearly outside of the background growth distribution can be considered as putative hits that may represent, for instance, transient or weak interactions. These can be further studied and cross-validated if, for instance, the two proteins are not expressed at levels that can allow the spontaneous complementation of the DHFR fragments like the L-DHFR controls. As an alternative, one could set a significance threshold based on the proportion of overlap with reported physical interactors in databases like BioGRID35 in order to maximize the proportion of true positives over false positives. However, unlike the use of the L-DHFR distribution, this alternative may not always be feasible if, for instance, the number of known physical interactors is not sufficiently high. Moreover, the choice of the significance threshold has an impact on the proportion of false positives and false negatives in the final data set. Indeed, like any other PPI detection assay, false positives can result from unspecific interaction of a protein with the DHFR-fusion protein if, for example, the protein is highly abundant as mentioned earlier. This is exemplified by the fact that some preys systematically interact with all bait proteins in PCA screens and, thus, need to be removed from the analysis11 (e.g. Tef2 and Ade17 and Supplementary Table 2). To circumvent this problem, a control PCA screen of the two collections against the appropriate L-DHFR control (F[1,2] or F[3]) to identify baits and preys exhibiting spontaneous DHFR fragments complementation can be performed in the specific conditions of each screen. Moreover, performing a Gene Ontology enrichment analysis can increase the confidence in the data if the function of a given bait is known. On the other hand, DHFR-PCA can give rise to false negatives for several reasons: 1) not all proteins can be fused to the DHFR fragments as these may destabilize the proteins or modify their localization if, for instance, the DHFR fusion to the C-terminus interferes with a localization signal; 2) DHFR reconstitution in some cellular compartments may not produce folate if, for instance, an essential precursor for folate synthesis is not available; 3) C-termini need to be within a distance of 8 nm for DHFR complementation to occur11. Thus, a well-known interaction may not be detected if their C-termini are not close enough in space. This is exemplified here by the fact that a large fraction of Nup82 physical interactions reported in databases, most of which are indirect, were not detected in our assay. Similarly, interactions between membrane proteins for which the C-termini are in trans relative to the membrane will not lead to DHFR fragments complementation and will not be detected11. Limitations 1) and 3) can be circumvented relatively simply by fusing the DHFR fragment to the N-termini of the protein. Doing so may prevent to interfere with a localization signal near the C-termini and may allow to detect an interaction between membrane proteins whose N and C-terminus are in cis relative to the membrane.
Several challenges remain in the study of PINs (reviewed in2,3). The maps of PINs produced so far have largely been described in a single experimental condition for each species and thus offer a single snapshot of how protein networks might be organized. There is therefore a need for the exploration of other experimental conditions to see how PINs may be reorganized in response to environmental changes, specific stimuli, across development or following mutations. These challenges will be overcome by the development of new technologies for interrogating PPIs in real-time, in living cells and by adapting current techniques so that they can be used by a larger community of laboratories. As a quantitative technique that can detect changes in the amount of DHFR complementation complexes27, DHFR-PCA can be adapted to overcome these challenges and has been used to study how PPIs are affected by a DNA damaging agent22, chemical agents25, gene deletions23,26 or in other yeast species and their hybrids33. Exploring these new dimensions will become more and more important to reveal the dynamic of the PIN.
The authors have nothing to disclose.
This work was supported by Canadian Institute of Health Research (CIHR) Grants 191597, 299432 and 324265, a Natural Sciences and Engineering Research Council of Canada Discovery grant and a Human Frontier Science Program grant to CRL. CRL is a CIHR New Investigator. Guillaume Diss is supported by a PROTEO fellowship. Samuel Rochette is supported by NSERC and FRQNT fellowships.
Name of Material/Equipement | Company | Catalog Number | Comments/Description |
BioMatrix Robot, Bench-top Configuration | S&P Robotics Inc. | BM5-BC | |
96-format Pin-tool | S&P Robotics Inc. | PH-96-10 | Standard 96-format Pin-tool with 96 high-precision floating pins |
384-format Pin-tool | S&P Robotics Inc. | PH-384-10 | Standard 384-format Pin-tool with 384 high-precision floating pins |
1536-format Pin-tool | S&P Robotics Inc. | PH-1536-05 | Custom 1536-format Pin-tool with 0.5mm high-precision floating pins |
Automated imaging module | S&P Robotics Inc. | IMG-02 | |
Methotrexate | Bioshop Canada Inc. | MTX440 | CAUTION: toxic compound |
Hygromycin B | Bioshop Canada Inc. | HYG003 | |
Nourseothricin dihydrogen sulfate | Werner BioAgents | 5010000 | |
Yeast-Interactome Collection | Thermo Scientific | YSC5849 | |
Omni Tray w/lid sterile | Thermo Scientific | 242811 | |
Anti-DHFR F[1,2] antibody | Sigma-Aldrich | D1067 | |
Anti-DHFR F[3] antibody | Sigma-Aldrich | D0942 |