The study details the methodology of FRET mapping including the selection of labeling sites, choice of dyes, acquisition, and data analysis. This methodology is effective at determining binding sites, conformational changes, and dynamic motions in protein systems and is most useful if performed in conjunction with existing 3-D structural information.
Förster resonance energy transfer (FRET) is an established fluorescence-based method used to successfully measure distances in and between biomolecules in vitro as well as within cells. In FRET, the efficiency of energy transfer, measured by changes in fluorescence intensity or lifetime, relates to the distance between two fluorescent molecules or labels. Determination of dynamics and conformational changes from the distances are just some examples of applications of this method to biological systems. Under certain conditions, this methodology can add to and enhance existing X-ray crystal structures by providing information regarding dynamics, flexibility, and adaptation to binding surfaces. We describe the use of FRET and associated distance determinations to elucidate structural properties, through the identification of a binding site or the orientations of dimer subunits. Through judicious choice of labeling sites, and often employment of multiple labeling strategies, we have successfully applied these mapping methods to determine global structural properties in a protein-DNA complex and the SecA-SecYEG protein translocation system. In the SecA-SecYEG system, we have used FRET mapping methods to identify the preprotein-binding site and determine the local conformation of the bound signal sequence region. This study outlines the steps for performing FRET mapping studies, including identification of appropriate labeling sites, discussion of possible labels including non-native amino acid residues, labeling procedures, how to perform measurements, and interpreting the data.
For proteins, elucidation of dynamics along with 3-dimensional (3-D) structural knowledge leads to an enhanced understanding of structure-function relationships of biomolecular systems. Structural methods, such as X-ray crystallography and cryogenic electron microscopy, capture a static structure and often require the determination of multiple structures to elucidate aspects of biomolecule binding and dynamics1. This article discusses a solution-based method for mapping global structural elements, such as binding sites or binding interactions, that are potentially more transient and less easily captured by static methods. Strong candidate systems for this methodology are ones in which a 3-D structure has been previously determined by X-ray crystallography, NMR spectroscopy, or other structural methods. In this case, we take advantage of the X-ray crystal structure of the SecA-SecYEG complex, a central player in the protein general secretory pathway, to map the location of a signal peptide binding site using Förster resonance energy transfer (FRET) prior to the transport of the preprotein across the membrane2. Manipulation of the biological system through genetic modifications coupled with our knowledge of the 3-D structure enabled the determination of the conformation of the signal sequence and early mature region immediately prior to insertion into the channel 3.
FRET involves the radiation-less transfer of energy from one molecule (donor) to another (acceptor) in a distance-dependent fashion that is through space4,5. The efficiency of this transfer is monitored through either a decrease in donor or an increase in acceptor fluorescence intensity. The efficiency of energy transfer can be described as
$E = \frac{R_0^6}{R_0^6 + R^6}$
in which the R0 value is the distance at which the transfer is 50% efficient6. The technique has previously been described as a molecular ruler and is effective at determining distances in the 2.5-12 nm range, depending on the identity of the donor-acceptor dyes4,7,8,9. The donor fluorescence intensities and lifetimes with or without acceptor allow determination of transfer efficiencies and consequently, distances5,8. Due to the availability of the technology, sensitivity of the method, and ease of use, FRET has also found broad application in such areas as single-molecule fluorescence spectroscopy and confocal microscopy6. The advent of fluorescent proteins such as green fluorescent protein has made the observation of intracellular dynamics and live-cell imaging relatively facile10,11. Many FRET applications such as these are discussed in detail in this virtual issue.
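For reference, the efficiency expression can be inverted to give the donor-acceptor distance R directly from a measured efficiency E. The short derivation below uses only the symbols already defined (E, R0, R); the worked values are simple arithmetic added for illustration and are not taken from the study.

```latex
\begin{align}
E &= \frac{R_0^6}{R_0^6 + R^6} = \frac{1}{1 + (R/R_0)^6} \\
\Rightarrow \quad R &= R_0 \left( \frac{1 - E}{E} \right)^{1/6}
\end{align}
% Worked values (illustrative):
%   R = 0.5 R_0  gives  E = 64/65 \approx 0.985
%   R = R_0      gives  E = 0.5
%   R = 1.5 R_0  gives  E = 1/(1 + 1.5^6) \approx 0.081
```

The sixth-power dependence means the efficiency changes most steeply for distances near R0, which is why dye pairs are generally chosen so that R0 is close to the expected separation.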
In this study, we particularly focus on the use of FRET measurements to yield distance values to determine structural details. Previously, FRET measurements have been effectively used to determine the conformation of DNA molecules when bound to protein12,13,14, the internal dynamics of proteins, and protein binding interactions15,16,17. The advantages of this method lie in the ability to determine flexible and dynamic structural elements in a solution with relatively low amounts of material. Significantly, this method is particularly effective when used in conjunction with existing structural information and cannot be used as a means of 3-D structure determination. The method provides the best insight and refinement of structure if the work builds on existing structural information often coupled with computational simulation18,19. Here, the use of distances obtained from steady-state and time-resolved FRET measurements is described to map a binding site, the location of which was not known, on an existing crystallographic structure of the SecA-SecYEG complex, major proteins in the general secretory pathway3.
The general secretory pathway, a highly conserved system from prokaryotes to eukaryotes to archaea, mediates the transport of proteins either across or into the membrane to their functional location in the cell. For Gram-negative bacteria, such as E. coli, the organism used in our study, proteins are inserted into or translocated across the inner membrane to the periplasm. The bacterial SecY channel complex (termed the translocon) coordinates with other proteins to translocate the newly synthesized protein, which is directed to its correct location in the cell through a signal sequence typically located at the N-terminus20,21. For proteins bound for the periplasm, the ATPase SecA protein associates with the exit tunnel of the ribosome, and with the preprotein after approximately 100 residues have been translated22. Along with the SecB chaperone protein, it maintains the preprotein in an unfolded state. SecA binds to the SecYEG translocon, and through many cycles of ATP hydrolysis, facilitates protein transport across the membrane23,24.
SecA is a multi-domain protein that exists in cytosolic and membrane-bound forms. A homodimeric protein in the cytosol, SecA consists of a preprotein binding or cross-linking domain25, two nucleotide-binding domains, a helical wing domain, a helical scaffold domain, and the two helix finger (THF)26,27,28,29 (Figure 1). In previous crystallographic studies of the SecA-SecYEG complex, the location of the THF suggested that it was actively involved in protein translocation and subsequent cross-linking experiments with the signal peptide further established the significance of this region in protein translocation30,31. Previous studies, using the FRET mapping methodology, demonstrated that exogenous signal peptides bind to this region of SecA2,32. To fully understand the conformation and location of the signal sequence and early mature region of the preprotein prior to insertion into the SecYEG channel, a protein chimera in which the signal sequence and residues of the early mature region were attached to SecA through a Ser-Gly linker was created (Figure 1). Using this biologically viable construct, it was further demonstrated that the signal sequence and early mature region of the preprotein bind to the THF in a parallel fashion2. Subsequently, the FRET mapping methodology was used to elucidate the conformation and location of the signal sequence and early mature region in the presence of SecYEG as described below3.
Knowledge of the 3-D structure of the SecA-SecYEG complex33,34,35 and the possible location of the binding site allowed us to judiciously place donor-acceptor labels in locations where the intersection of individual FRET distances identifies the binding site location. These FRET mapping measurements revealed that the signal sequence and the early mature region of the preprotein form a hairpin with the tip located at the mouth of the SecYEG channel, demonstrating that the hairpin structure is templated prior to channel insertion.
1. Selection of labeling sites
2. Labeling the protein
3. Determine the R0 values
4. Perform FRET spectral measurements
5. Analysis of FRET data
6. Mapping the distances
This study focused on determining the location of the preprotein binding site on SecA prior to insertion of the preprotein into the SecYEG channel. To map the binding site, FRET experiments were performed between different regions of the preprotein and three distinct locations on the SecA and SecYEG proteins (Figure 1A–D). From the distances obtained and the three-dimensional structures of SecA, SecYEG, and the preprotein, the location of the preprotein binding site was predicted. Rather than employing three separate entities (SecA, SecYEG, and preprotein) to perform these measurements, the PhoA signal sequence was attached to SecA through genetic modification after incorporation of a Ser-Gly linker2,3. For facile labeling with dyes, Cys residues or amber mutations were introduced at residues 2, 22, 37, and 45 in the PhoA preprotein (Figure 1E).
Identification of sites and labeling
A putative binding site for the signal sequence had been previously identified using similar FRET mapping methods2,32. These and other studies had identified the two-helix finger (THF) and the preprotein cross-linking domain as possible binding sites of the signal peptide with a suggested orientation in a parallel position to the THF33,51,52,53,54 (Figure 1F). Thus, identification of potential labeling sites on the SecA and SecYEG proteins was done based on the location of the putative binding site in the SecA and SecYEG crystal structure (shown in green in Figure 1A–D). Three sites were chosen to triangulate the position of the putative binding site, where essentially the three sites form a triangle around the putative binding site. As shown in Figure 1, the three sites were within the FRET range of the binding site (50-70 Å). The dye pair of Alexa Fluor 488 (AF488) and Alexa Fluor 647 (AF647) was chosen, as the R0 value of 55.7 Å36 corresponds well with the expected distances between the labeled sites and the putative binding site ensuring measurement accuracy.
The three sites chosen for labeling, SecA37, SecA321, and SecY292 (shown as magenta, violet, and cyan spheres in Figure 1A–D), are distributed across the protein complex and form a triangle around the putative binding site. The three sites were separately mutated to Cys residues in a Cys-less mutant to ensure that only the intended position was labeled2,37. For the SecY292 experiments, the PhoA preprotein sites were labeled with AF647 and SecY residue 292 was labeled with AF488 using maleimide chemistry. In the chimeric protein, sites SecA37 and SecA321 were labeled with AF647 and the preprotein was labeled with AF488. In the SecA-PhoA chimeric protein, residues 2, 22, 37, and 45 of the PhoA preprotein segment were each mutated to an amber codon in individual proteins. The amber codon mutations allowed the introduction of the unnatural amino acid p-azidophenylalanine at those positions, which was subsequently labeled with AF488 using click chemistry39,40. Each mutation was generated and labeled independently to ensure correct, differential labeling of the protein components. The degree of labeling was determined for all proteins and generally needed to be 50% or better in order to proceed with the sample.
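The degree-of-labeling check can be sketched with the commonly used absorbance-ratio calculation. This is a minimal sketch rather than the procedure used in the study: the function name, the argument names, and the numbers in the example are hypothetical, and the dye extinction coefficient and 280 nm correction factor must be taken from the dye supplier's documentation.

```python
def degree_of_labeling(a_dye, a_280, eps_dye, eps_protein, cf_280):
    """Return moles of dye per mole of protein from two absorbance readings.

    a_dye       -- absorbance at the dye absorption maximum
    a_280       -- absorbance at 280 nm (protein plus dye contribution)
    eps_dye     -- dye extinction coefficient (M^-1 cm^-1), from the supplier
    eps_protein -- protein extinction coefficient at 280 nm (M^-1 cm^-1)
    cf_280      -- dye correction factor, A280(dye) / Amax(dye), from the supplier

    Both readings are assumed to share the same path length, which cancels.
    """
    # Remove the dye's own contribution to the 280 nm reading.
    protein_a280 = a_280 - cf_280 * a_dye
    if protein_a280 <= 0:
        raise ValueError("corrected A280 is not positive; check the inputs")
    dye_conc = a_dye / eps_dye
    protein_conc = protein_a280 / eps_protein
    return dye_conc / protein_conc


# Hypothetical readings and constants, for illustration only.
dol = degree_of_labeling(a_dye=0.355, a_280=0.50,
                         eps_dye=71000, eps_protein=75000, cf_280=0.11)
# Apply the 50% labeling criterion stated in the text.
proceed = dol >= 0.5
```

A sample that falls below the 50% criterion would be relabeled or discarded rather than carried into the FRET measurements.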
Determination of transfer efficiencies and distances
Prior to performing the energy transfer measurements, the donor quantum yield, overlap integral and R0 values were determined (steps 3.4-3.6). The donor quantum yield was measured relative to the dye, fluorescein, which has a quantum yield of 0.79 in 0.1 M NaOH47. Absorption and fluorescence emission spectra were obtained at a series of concentrations to generate a linear plot of absorption relative to fluorescence emission intensity to determine the quantum yield. In these measurements, it is critical to measure the absorption in the linear range (0.1-1.0) and all emission measurements need to be generated with the same slit settings. As these values are used to determine overlap integrals and R0 values, they should be measured under FRET conditions. Protein local environment profoundly affects dye emission and consequently, donor quantum yields should be measured for each of the sites investigated. We note that sites on SecA and SecYEG influence the R0 values more strongly than those on the PhoA portion of the chimera. For dye pairs with the same SecA or SecYEG site, the R0 values are typically within 5 Å of each other; whereas the R0 values can differ by as much as 20 Å for the two different SecA locations (residues 37 vs. 321), underscoring the importance of determining R0 values for each dye pair (Table 1).
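The two calculations described above can be sketched as follows. This is a hedged illustration, not the study's analysis code: it assumes the standard comparative (slope-ratio) method for the relative quantum yield and the standard Förster expression R0 = 0.211 (κ² n⁻⁴ Q_D J)^(1/6) in Å, with the overlap integral J in M⁻¹ cm⁻¹ nm⁴. The function names, the default refractive index of 1.33, the default κ² of 2/3 (freely rotating dyes), and all example readings are assumptions.

```python
def slope_through_origin(x, y):
    """Least-squares slope of y = m * x (line forced through the origin)."""
    return sum(a * b for a, b in zip(x, y)) / sum(a * a for a in x)


def relative_quantum_yield(abs_sample, fl_sample, abs_ref, fl_ref,
                           q_ref, n_sample=1.33, n_ref=1.33):
    """Quantum yield of the sample relative to a reference dye.

    abs_*  -- absorbances at the excitation wavelength for a dilution series
    fl_*   -- integrated emission intensities for the same dilutions
    q_ref  -- quantum yield of the reference (0.79 for fluorescein in the text)
    n_*    -- solvent refractive indices (assumed equal by default)
    """
    m_sample = slope_through_origin(abs_sample, fl_sample)
    m_ref = slope_through_origin(abs_ref, fl_ref)
    return q_ref * (m_sample / m_ref) * (n_sample ** 2 / n_ref ** 2)


def forster_radius(q_donor, overlap_j, kappa2=2.0 / 3.0, n=1.33):
    """Förster radius R0 in angstroms; overlap_j in M^-1 cm^-1 nm^4."""
    return 0.211 * (kappa2 * n ** -4 * q_donor * overlap_j) ** (1.0 / 6.0)


# Hypothetical dilution series: the sample emits half as strongly as the
# reference at every absorbance, so its yield is half the reference value.
absorbance = [0.1, 0.2, 0.3]
q_sample = relative_quantum_yield(absorbance, [100.0, 200.0, 300.0],
                                  absorbance, [200.0, 400.0, 600.0],
                                  q_ref=0.79)
```

Because R0 depends on the sixth root of the donor quantum yield and overlap integral, moderate changes in either quantity shift R0 only modestly; differences as large as those reported between the two SecA sites therefore reflect substantial changes in the dye environment.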
The calculation of R0 assumes that the donor and acceptor dyes are freely rotating, and the degree to which the dyes do not rotate contributes to the overall uncertainty in the measurement. To appropriately take into account the relative motion of the dyes and their orientations, steady-state fluorescence anisotropy measurements were performed on all the donor and acceptor dyes in the different labeling positions. These values, which were in the 0.10-0.21 range, were used to calculate the error associated with the distance measurements2,3,48. The relatively high anisotropy values observed for both the donor and acceptor dyes correspond to a reduction in dye rotation, which is inconsistent with the assumption of free rotation. The lack of free rotation generates an error of 19%-25% in the distance calculations. As shown in Table 1, this led to uncertainties in the measured distances of at most ±15 Å. When mapping the FRET distances, these uncertainties in the distance measurements are an important consideration, as discussed below.
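As an illustration of the quantities involved, the sketch below computes a steady-state anisotropy from polarized intensities using the standard definition r = (I_VV - G I_VH)/(I_VV + 2 G I_VH) and converts a fractional distance error into an absolute one. It does not reproduce the study's orientation-error analysis, which derives the fractional error from the anisotropies; the function names, the intensity readings, and the 22% value (chosen from within the 19%-25% range quoted above) are assumptions.

```python
def g_factor(i_hv, i_hh):
    """Instrument correction factor G from horizontally excited intensities."""
    return i_hv / i_hh


def anisotropy(i_vv, i_vh, g=1.0):
    """Steady-state anisotropy from vertically excited intensities.

    i_vv -- emission through a vertical polarizer
    i_vh -- emission through a horizontal polarizer
    g    -- instrument G factor (1.0 assumes an ideal instrument)
    """
    return (i_vv - g * i_vh) / (i_vv + 2.0 * g * i_vh)


def distance_uncertainty(distance, fractional_error):
    """Absolute uncertainty (same units as distance) for a fractional error."""
    return distance * fractional_error


# Hypothetical polarized intensity readings (arbitrary units).
r = anisotropy(i_vv=1500.0, i_vh=1000.0)

# A 67 angstrom distance with an assumed 22% orientation-related error.
err = distance_uncertainty(67.0, 0.22)
```

With these assumed inputs the anisotropy falls inside the 0.10-0.21 range reported for the labeled sites, and the resulting uncertainty is of the same size as the largest errors listed in Table 1.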
The calculated distance between donor-acceptor pairs is based upon the relationship between efficiency and distance, in which a higher efficiency is indicative of donor-acceptor pairs separated by a shorter distance. To determine FRET efficiencies, fluorescence emission spectra excited at the donor excitation wavelength (488 nm) are obtained on donor-only and donor-acceptor samples. Typically, a reduction in donor emission intensity signifies the presence of energy transfer (step 5.1). Figure 1G depicts the donor-acceptor spectra for the SecA37 residue with either residue 2 or 22 of the PhoA chimera. The SecA37 residue is labeled with the acceptor dye (AF647), and the PhoA residues are labeled with the donor dye (AF488). At either position, the donor fluorescence is reduced and a small increase in acceptor fluorescence intensity can be seen in the donor-acceptor samples. Since excitation is done at the donor excitation wavelength of 488 nm, which does not directly excite the acceptor, any acceptor fluorescence observed results from energy transfer. Thus, the decrease in donor intensity and concomitant increase in acceptor intensity result from energy transfer between the two dyes. Significantly, the donor fluorescence intensity is higher for the PhoA2 position (blue) relative to the PhoA22 position (yellow) in the presence of the acceptor. This relative difference in the donor intensity decrease indicates that energy transfer between the PhoA2 residue and the SecA37 residue is weaker than the transfer between the PhoA22 and SecA37 residues, which implies that the PhoA2 residue is located farther from SecA37 than PhoA22. Distances are determined from the relationship between efficiency and R0 values (step 5.1.4).
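The conversion from donor quenching to distance can be sketched as follows. This is a minimal illustration assuming complete acceptor labeling and no correction for labeling efficiency; the function names and the intensity values are hypothetical, while the efficiencies and R0 value used in the check come from the SecA37 rows of Table 1.

```python
def fret_efficiency(f_donor_only, f_donor_acceptor):
    """Efficiency from donor emission without and with the acceptor.

    Both intensities must be measured at the same donor emission wavelength
    and normalized to the same donor concentration.
    """
    if f_donor_only <= 0:
        raise ValueError("donor-only intensity must be positive")
    return 1.0 - f_donor_acceptor / f_donor_only


def distance_from_efficiency(efficiency, r0):
    """Donor-acceptor distance, in the same units as r0."""
    if not 0.0 < efficiency < 1.0:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    return r0 * ((1.0 - efficiency) / efficiency) ** (1.0 / 6.0)


# Hypothetical intensities chosen to give the Table 1 efficiency of 0.27.
e_phoa2 = fret_efficiency(f_donor_only=100.0, f_donor_acceptor=73.0)

# SecA37 to PhoA2 and PhoA22, using E and R0 (57 angstroms) from Table 1.
d_phoa2 = distance_from_efficiency(0.27, 57.0)
d_phoa22 = distance_from_efficiency(0.52, 57.0)
```

The lower efficiency for PhoA2 maps to the longer distance, in line with the qualitative reading of the spectra in Figure 1G.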
Since the steady-state fluorescence measurements could represent an average of two or more distances, we also performed time-resolved fluorescence measurements. For these experiments, the lifetime of the donor dye is measured in the presence and absence of the acceptor (Figure 1H). If there were two distinct energy transfer processes contributing to the measured steady-state fluorescence efficiency, they would be observed as discrete lifetimes, provided they were resolvable within the time resolution of the instrument. To enhance the ability to resolve the lifetimes, 10,000 counts or more should be collected at the peak; however, the peak channel counts need to be balanced against the time of acquisition and potential damage to the sample. Time-resolved measurements yielded single lifetimes for each donor-acceptor pair, consistent with only one orientation or distance between the dyes. We note that small differences in distance, such as those within the areas revealed by our FRET mapping technique, would not lead to resolvable lifetimes in our system. Moreover, the efficiencies determined by the time-resolved fluorescence measurements were in good agreement with those determined from the steady-state measurements, providing further support that the measured efficiencies arise from only one distance between the dye pairs3.
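The lifetime-based efficiency follows the same logic as the intensity-based one, with lifetimes replacing intensities (E = 1 - τ_DA/τ_D). The sketch below is illustrative only: the function names and lifetime values are hypothetical, and the amplitude-weighted average is included as the usual way to reduce a multi-exponential fit to one lifetime, which was not required for the single-lifetime decays reported here.

```python
def amplitude_weighted_lifetime(amplitudes, lifetimes):
    """Average lifetime of a multi-exponential decay, weighted by amplitude."""
    total = sum(amplitudes)
    return sum(a * t for a, t in zip(amplitudes, lifetimes)) / total


def efficiency_from_lifetimes(tau_donor, tau_donor_acceptor):
    """FRET efficiency from donor lifetimes without and with the acceptor."""
    if tau_donor <= 0:
        raise ValueError("donor-only lifetime must be positive")
    return 1.0 - tau_donor_acceptor / tau_donor


# Hypothetical single-exponential lifetimes in nanoseconds.
e_lifetime = efficiency_from_lifetimes(tau_donor=4.0, tau_donor_acceptor=2.0)

# Hypothetical two-component fit reduced to one average lifetime.
tau_avg = amplitude_weighted_lifetime([0.75, 0.25], [4.0, 2.0])
```

Comparing the lifetime-derived efficiency with the steady-state value for the same dye pair is the consistency check described in the paragraph above.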
Mapping the FRET distances onto the 3-dimensional structure
The resonance energy transfer measurements yield sufficient distance information to identify the binding site and orientation of the signal sequence on SecA. The three locations on SecA and SecYEG along with the four positions on the PhoA region of the SecA-PhoA chimera provide the 12 different distances used to map the binding site (Table 1). The twelve distances were mapped onto the three-dimensional X-ray cocrystal structure of the Thermotoga maritima SecA-SecYEG complex (PDBID: 3DIN) to identify the binding site of the signal sequence33. The structure of the SecA-SecYEG complex is similar to that observed in E. coli as evidenced by an in vivo photocrosslinking study55.
We use the beginning (PhoA2) and end (PhoA22) residues of the PhoA signal sequence in the SecA-PhoA chimera to illustrate how residue locations were identified on the SecA-SecYEG complex. As energy transfer can occur in all directions, the FRET distances and associated errors describe a spherical shell, with one of the dye locations from the donor-acceptor pair designated as the center. In this study, the residues SecA37, SecA321, and SecY292 form the centers of three spherical shells that describe the location of the PhoA2 residue of the signal sequence. The overlapping regions arising from the three separate locations, SecA37 (magenta), SecA321 (violet), and SecY292 (cyan), are shown in Figure 3. Only a portion of each FRET shell intersects with the protein structure, and the residues and backbone that fall within that shell are highlighted. Thus, the protein regions within the shell defined by the SecA37-PhoA2 distance are shown in magenta (Figure 3A,E), while the regions defined by the SecA321-PhoA2 and SecY292-PhoA2 shells are shown in violet (Figure 3B,F) and cyan (Figure 3C,G), respectively. The putative binding site, consisting mainly of the two-helix finger, is shown in green.
As shown in Figure 3, each of these FRET shells defines a relatively large section of the protein complex. For all three locations, the FRET shell does intersect with the putative binding site; however, for the SecA321 residue, for example, the intersected area is smaller and lies towards the ends of the fingers with significant overlap with the helical scaffold domain. The intersection or the common area of all three FRET shells (Figure 3D,H), defines the location of the PhoA2 residue. This area is considerably smaller than each FRET shell and includes only a small portion of the THF with a large contribution from the helical scaffold. The scripts used for generating the FRET shells and the intersected areas for the molecular visualization program, PyMOL, are given in the Supplemental Information. Portions of the shells are visualized as pink dots on the SecA-SecYEG complex in Supplementary Figures 1–3.
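The shell-intersection idea can be sketched independently of any visualization program. The code below is not the PyMOL script from the Supplemental Information; it is a minimal geometric illustration in which the function names and all coordinates are hypothetical toy values (not taken from PDB 3DIN), while the distances and uncertainties are the PhoA2 values from Table 1.

```python
import math


def in_shell(point, center, distance, error):
    """True if point lies between distance - error and distance + error of center."""
    d = math.dist(point, center)
    return distance - error <= d <= distance + error


def shell_intersection(atoms, shells):
    """Names of atoms that fall inside every FRET shell.

    atoms  -- dict mapping an atom or residue name to (x, y, z) in angstroms
    shells -- list of (center_xyz, distance, error) tuples in angstroms
    """
    return sorted(
        name for name, xyz in atoms.items()
        if all(in_shell(xyz, c, r, e) for c, r, e in shells)
    )


# Toy coordinates standing in for the three labeled sites.
shells = [
    ((0.0, 0.0, 0.0), 67.0, 15.0),    # SecA37 to PhoA2 (Table 1)
    ((100.0, 0.0, 0.0), 53.0, 11.0),  # SecA321 to PhoA2 (Table 1)
    ((50.0, 90.0, 0.0), 68.0, 15.0),  # SecY292 to PhoA2 (Table 1)
]

# Toy candidate positions standing in for backbone atoms of the complex.
atoms = {
    "A": (50.0, 30.0, 0.0),  # inside all three shells
    "B": (10.0, 0.0, 0.0),   # too close to the first center
    "C": (50.0, 60.0, 0.0),  # outside the second shell
}

common = shell_intersection(atoms, shells)
```

In practice the atom dictionary would be filled from the coordinates of the crystal structure, and the surviving names would be the residues highlighted as the common area.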
A similar strategy was used to identify the location of the PhoA22 residue. The FRET shells defined by the PhoA22 FRET distances (Figure 4A–C,E–G) describe a smaller area relative to the PhoA2 residue (Figure 4D,H vs. Figure 3D,H). We interpret this difference to suggest that the PhoA2 residue and associated region are more flexible and labile than PhoA22. Significantly, the area ascribed to the PhoA22 residue is located closer to the tip of the THF and the mouth of the SecYEG channel, with regions of SecY identified in the common area (Figure 4D,H). All three dye pairings identify regions along the putative binding site; however, the intersected common areas center the PhoA22 location (Figure 4D,H) at the opposite end of the THF relative to the PhoA2 location (Figure 3D,H). These findings suggest that the signal sequence of the preprotein, which extends from residues 2-22, lies along the THF in a relatively unstructured state. This result is consistent with earlier studies suggesting that the signal sequence binds to the protein along the THF in an extended state and that the C-terminal end of SecA in the B. subtilis crystal structure essentially models the structure of the signal peptide and occupies the same location (shown in red in Figure 1F)2,26. We employed a similar approach to identify two locations in the early mature region (residues 37 and 45) of the SecA-PhoA preprotein chimera to further define the binding and orientation of the signal sequence and early mature region as discussed below. In other studies, FRET distances have been used effectively to refine an existing structure or molecular dynamics simulation-derived model18,19,56; we were not able to do this, as no structure for the signal sequence bound to SecA exists.
Figure 1: Labeling sites in SecA-SecYEG complex with representative FRET spectra. (A–D) Four different views of the SecA-SecYEG co-crystal structure (PDBID: 3DIN)33 in which the labeling sites of SecA37, SecA321, and SecY292 are shown as magenta, violet, and cyan spheres, respectively. Panels A–C are side views of the complex and D is a top view. SecA is shown in light grey, SecYEG is shown in dark grey, and the putative binding site, the THF, is shown in green. (E) Schematic of the SecA-PhoA chimera construct, which connects the SecA protein to the PhoA preprotein through a Ser-Gly linker (not drawn to scale). The labeling sites on the PhoA portion of the chimera are given in blue, green, yellow, and red, corresponding to residues 2, 22, 37, and 45. (F) Ribbon diagram of the crystal structure of B. subtilis SecA protein (PDBID: 1M6N) colored by domain, where nucleotide-binding domains 1 and 2 are shown in blue and light blue, respectively, the preprotein cross-linking domain is shown in gold, the central helix in green, the two-helix finger in cyan, the helical wing domain in dark green, and the C-terminal linker in red26. The unstructured C-terminus serves as a model of the bound PhoA signal peptide based on a previous FRET mapping study2. (G) Steady-state fluorescence spectra of the donor-only and donor-acceptor samples of the SecA37-AF647 and PhoA2-AF488 FRET pair and the SecA37-AF647 and PhoA22-AF488 FRET pair. The reduction in donor intensity for the donor-acceptor sample is indicative of energy transfer. Greater energy transfer occurs from PhoA22 relative to the PhoA2 site based on the decrease in donor intensity. (H) Time-resolved fluorescence donor-only (magenta) and donor-acceptor (light magenta) decay spectra of the SecA37-AF647 and PhoA22-AF488 FRET pair. The instrument response function is shown in grey. The donor-acceptor complex gives a faster decay and consequently a shorter lifetime, consistent with energy transfer.
All molecular structures were generated with the indicated PDB file and PyMOL50. Figure 1E-H has been modified from Zhang et al.3.
Figure 2: User interface of the FluorEssence program. (A) The opening window is shown with a circle around the red M. This must be clicked to connect the program with the fluorometer. (B) The experiment set-up window illustrates the different areas (monos, detectors, accessories) where scan relevant parameters are entered.
Figure 3: FRET distance shells determined for the SecA-PhoA2 position. FRET distance shells constructed from the FRET distances and the associated uncertainties (Table 1) are depicted on the SecA-SecYEG complex (PDBID: 3DIN). (A–C) FRET distance shells for the PhoA2 location constructed with SecA37 (magenta), SecA321 (violet), and SecY292 (cyan), respectively at the center position. The shells are colored according to the center residue. (D) The intersection of the three FRET shells defines the location of PhoA2, shown in blue. (E–H) Views are rotated approximately 180° from A-D. All molecular structures were generated with the indicated PDB file and PyMOL50.
Figure 4: FRET distance shells determined for the SecA-PhoA22 position. FRET distance shells constructed from the FRET distances and associated uncertainties (Table 1) are depicted on the SecA-SecYEG complex (PDBID: 3DIN). (A–C) FRET distance shells for the PhoA22 location constructed with SecA37 (magenta), SecA321 (violet) and SecY292 (cyan), respectively at the center position. The shells are colored according to the center residue. (D) The intersection of the three FRET shells defines the location of PhoA22, shown in yellow. (E–H) Views are rotated approximately 180° from A-D. All molecular structures were generated with the indicated PDB file and PyMOL50.
Figure 5: FRET-mapped locations of PhoA2, PhoA22, PhoA37, and PhoA45 projected onto the B. subtilis SecA-Geobacillus thermodenitrificans SecYE cocrystal structure (PDBID: 5EUL). (A) Coloring of SecA-SecYE as in Figure 1. The OmpA peptide substrate inserted at the end of the THF is shown in pink. FRET-mapped regions were generated in the presence of ATP-γS, with PhoA2 shown in blue, PhoA22 in green, PhoA37 in yellow, and PhoA45 in red. Overlap regions are shown in olive (PhoA22 and PhoA37) and orange (PhoA37 and PhoA45). The peptide substrate (residues 749-791, cyan) was excised from the original structure and modeled into the putative binding region without any alterations to the structure (circled in red). (B) Enlarged view of the modeled peptide substrate. Residues 2 (Lys), 22 (Tyr), and 37 (Gly) of the OmpA peptide substrate are depicted in stick form in blue, green, and yellow, respectively. These residues in the modeled peptides exhibit excellent agreement with the predicted FRET mapped locations. For clarity, the nanobody in the original structure has been omitted. This figure has been modified from Zhang et al.3.
| Labeled site on SecA-PhoA-SecYEG complex | Labeled site on PhoA peptide portion of SecA-PhoA chimera | | | |
| | PhoA2-AF488 | PhoA22-AF488 | PhoA37-AF488 | PhoA45-AF488 |
| SecA37-AF647-PhoA | | | | |
| R0 (Å) | 57 | 57 | 57 | 60 |
| FRET efficiency | 0.27 ± 0.01 | 0.52 ± 0.02 | 0.39 ± 0.03 | 0.36 ± 0.03 |
| Distance (Å) | 67 ± 15 | 56 ± 12 | 62 ± 13 | 66 ± 15 |
| SecA321-AF647-PhoA | | | | |
| R0 (Å) | 40 | 38 | 37 | 38 |
| FRET efficiency | 0.16 ± 0.04 | 0.62 ± 0.01 | 0.39 ± 0.02 | 0.43 ± 0.02 |
| Distance (Å) | 53 ± 11 | 34.9 ± 7.3 | 39.7 ± 7.9 | 39.7 ± 7.5 |
| | PhoA2-AF647 | PhoA22-AF647 | PhoA37-AF647 | PhoA45-AF647 |
| SecY292-AF488 EG | | | | |
| R0 (Å) | 57 | 50 | 53 | 54 |
| FRET efficiency | 0.25 ± 0.06 | 0.46 ± 0.06 | 0.24 ± 0.05 | 0.30 ± 0.01 |
| Distance (Å) | 68 ± 15 | 51.3 ± 9.2 | 64 ± 14 | 62 ± 13 |
Adapted from reference 3.
R0 values, given in angstroms, were calculated as described in the text.
The FRET efficiency was calculated from the decrease in donor fluorescence intensity in the presence of the acceptor as described. The error is reported as the SD from three independent measurements.
Donor-acceptor distances (R) are given in angstroms and were calculated as described in the text. The reported error combines the experimental error and the error arising from the orientation of the dyes. Dye orientation is estimated from the steady-state fluorescence anisotropy.
Table 1: Transfer Efficiencies and Distances Determined for the SecA-PhoA-SecYEG complex. FRET efficiencies, distances, and R0 values are given for the 12 distances used for mapping the preprotein binding site.
Supplementary Figure 1: FRET distance shell, shown in pink dots, determined for the SecA37 residue and the PhoA37 residue on the SecA-SecYEG complex (PDBID: 3DIN). SecA is shown in light grey, SecYEG is shown in dark grey, and the SecA37 residue is shown in magenta.
Supplementary Figure 2: FRET distance shell, shown in pink dots, determined for the SecA321 residue and the PhoA37 residue on the SecA-SecYEG complex (PDBID: 3DIN). SecA is shown in light grey, SecYEG is shown in dark grey, and the SecA321 residue is shown in violet.
Supplementary Figure 3: FRET distance shell, shown in pink dots, determined for the SecY292 residue and the PhoA37 residue on the SecA-SecYEG complex (PDBID: 3DIN). SecA is shown in light grey, SecYEG is shown in dark grey, and the SecY292 residue is shown in cyan.
Through the use of the FRET mapping methodology, we identified the signal sequence binding site on the SecA protein. Importantly, the presence of a 3-D crystal structure of the complex greatly facilitated our study. The strength of this mapping methodology lies in the ability to use an existing structure to identify locations for labeling. This methodology cannot be used to determine a 3-D structure; however, determination of structural elements56, refinement of an existing structure49, determination of a binding site location2,32, or elucidation of dynamic motion57, are all possible applications of this method.
In the SecYEG-SecA-PhoA complex, the three labeling sites form a triangle around the putative binding site (Figure 1). The use of multiple distance measurements from the vertices of this triangle to the same residue refines the location information, similar to GPS navigation methods. Three sites, SecA37, SecA321, and SecY292, were identified on SecA and SecY along with four sites, PhoA2, PhoA22, PhoA37, and PhoA45, in the signal sequence and early mature region of the preprotein to give a total of 12 distances for mapping the location of the signal peptide (Table 1). Importantly, to improve the accuracy of the distance measurements, labeling sites should be located in relatively static regions of the protein, such as in secondary structure elements rather than loops. Furthermore, sites should also be in locations that are relatively accessible to solvent for ease and increased efficiency of labeling. Within the SecA-PhoA-SecYEG complex, distance measurements from the triangle vertices to residues in the binding substrate were sufficient to localize those residues to a relatively small area (Figure 3D,H and Figure 4D,H). Identification of the intersected area from the multiple distance measurements significantly refines the location relative to that obtained from a single distance measurement, as shown in Figure 3 and Figure 4. Thus, when using this method, the measurement of multiple distances is strongly recommended. Although this method can identify, for example, regions of molecules involved in binding, it does not provide precise structural information; such information is best obtained from other structural methods such as X-ray crystallography, NMR, and cryo-EM. FRET distances can be used to effectively refine an existing structure or model18,19; in this case, that was not possible because no model of the signal sequence bound to SecA exists.
To ensure that dye labeling occurs only at the desired locations and that FRET distance measurements are accurate, the use of mutagenesis for labeling is preferred. Generation of a Cys-less mutant requires relatively conservative mutation of Cys residues to Ser or similar residues in the otherwise wild-type protein using site-directed mutagenesis methods. In the current study, Cys mutations were introduced into Cys-free versions of SecA and SecYEG for labeling37. The activity of the mutant protein was verified with a growth assay followed by an in vitro malachite green ATPase assay41,42. Relevant activity assays depend on the function of the protein; for example, for DNA-binding proteins, a DNA binding assay would be appropriate58. Failure to ensure labeling at only one location can lead to labeling of more than one site with the same dye, which significantly complicates the distance determinations. When a second label is needed on the same protein, it can be introduced through the incorporation of an unnatural amino acid by site-directed mutagenesis. We employed this methodology to label the SecA-PhoA chimera at locations distinct from Cys residues by introducing the unnatural amino acid p-azidophenylalanine and labeling with click chemistry39,40.
An additional important consideration of this method is the choice of dyes and the associated R0 value. After identification of the potential labeling sites, the distances to be measured can be estimated from the 3-D structure. With this information, investigators can choose dye pairs with R0 values that span the desired range of expected distances. For example, the AF488-AF647 dye pair has a calculated R0 of 55.7 Å, which provides a good range for labeling sites located an estimated 40-75 Å away from the putative binding site. Although the R0 values calculated in step 2.2 are useful for choosing which dye pair to use for a given system, attachment of the dyes to the protein can alter their properties significantly. For greater accuracy, R0 values should be calculated from in situ experiments performed with labeled protein (steps 3.1 – 3.6.5).
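To make the link between R0 and the usable distance range concrete, the sketch below evaluates the standard Förster relation, E = 1 / (1 + (r/R0)^6), at the limits of the 40-75 Å range quoted above for the AF488-AF647 pair. The function name `transfer_efficiency` is illustrative, and the calculation assumes that the calculated R0 of 55.7 Å applies unchanged to the labeled protein, which, as noted above, may not hold.

```python
def transfer_efficiency(distance, r0):
    """FRET efficiency from the Förster relation E = 1 / (1 + (r / R0)**6).

    `distance` and `r0` must be in the same units (here, Å).
    """
    if distance <= 0 or r0 <= 0:
        raise ValueError("distance and r0 must be positive")
    return 1.0 / (1.0 + (distance / r0) ** 6)


R0_AF488_AF647 = 55.7  # Å, calculated value quoted in the text

# Efficiency at the near limit, at R0 itself, and at the far limit.
for r in (40.0, 55.7, 75.0):
    e = transfer_efficiency(r, R0_AF488_AF647)
    print(f"r = {r:5.1f} Å  ->  E = {e:.2f}")
```

Under these assumptions, the efficiency falls from roughly 0.88 at 40 Å to roughly 0.14 at 75 Å, so distances across this window produce clearly measurable changes in efficiency. Distances far outside the window push E toward 1 or 0, where small errors in the measured efficiency translate into large errors in the distance.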
Transfer efficiency can be measured by monitoring steady-state fluorescence emission and observing either a decrease in donor emission or an increase in acceptor emission. Although observation of both effects is desirable, the efficiency can be calculated from either as described5,8. Efficiency can also be calculated from the decrease in donor lifetime in the donor-acceptor sample relative to the donor-only sample. Determination of efficiency by more than one method is recommended, particularly the use of time-resolved methods to establish the relative homogeneity of the efficiencies measured.
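The donor-based calculations described above reduce to a few lines of arithmetic. The Python sketch below is illustrative only: the function names and the numerical readings are hypothetical, the intensity version assumes complete acceptor labeling, the lifetime version assumes a single (or amplitude-averaged) donor lifetime, and the acceptor-enhancement calculation, which requires additional correction terms, is omitted.

```python
def efficiency_from_intensity(donor_only, donor_with_acceptor):
    """E = 1 - F_DA / F_D, from donor emission without and with acceptor."""
    if donor_only <= 0:
        raise ValueError("donor-only intensity must be positive")
    return 1.0 - donor_with_acceptor / donor_only


def efficiency_from_lifetime(tau_donor_only, tau_donor_with_acceptor):
    """E = 1 - tau_DA / tau_D, from donor lifetimes without and with acceptor."""
    if tau_donor_only <= 0:
        raise ValueError("donor-only lifetime must be positive")
    return 1.0 - tau_donor_with_acceptor / tau_donor_only


def distance_from_efficiency(efficiency, r0):
    """Invert E = 1 / (1 + (r / R0)**6) to obtain r = R0 * (1/E - 1)**(1/6)."""
    if not 0.0 < efficiency < 1.0:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    return r0 * (1.0 / efficiency - 1.0) ** (1.0 / 6.0)


R0 = 55.7  # Å, calculated AF488-AF647 value quoted in the text

# Hypothetical readings, chosen only to show the workflow.
e_steady_state = efficiency_from_intensity(1000.0, 620.0)
e_lifetime = efficiency_from_lifetime(4.1, 2.6)  # lifetimes in ns

for label, e in (("steady-state", e_steady_state), ("lifetime", e_lifetime)):
    r = distance_from_efficiency(e, R0)
    print(f"{label:12s}: E = {e:.3f}, r = {r:.1f} Å")
```

Comparing the distances obtained from the two routes gives a quick consistency check of the kind recommended above; a large discrepancy would suggest incomplete labeling or a heterogeneous population.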
The mapping method also allowed us to determine the orientation of the signal sequence and the early mature region of the PhoA preprotein relative to SecA and the putative binding site. A SecA-SecYEG X-ray crystal structure and subsequent cryo-EM study provided clarity regarding the structure of the signal sequence and the early mature region of the preprotein with respect to the channel and SecA34,35. In the X-ray structure, residues 1-41 of the OmpA preprotein were attached to the tip of the two-helix finger and visualized in a hairpin structure in the channel (shown in pink, Figure 5). The locations of the four PhoA residues in the SecA-PhoA chimera were mapped onto this structure using the same protocol as described above. As shown in Figure 5, the PhoA37 and PhoA45 residues (yellow, orange, red) lie between PhoA2 and PhoA22, with PhoA45 closer to PhoA2. These findings, particularly the location of PhoA45, suggested that the PhoA preprotein was forming a hairpin structure.
To further validate our FRET-identified binding site, we compared our mapped locations with those of the OmpA preprotein by excising the 41-residue preprotein structure from the channel and modeling it into the regions defined by FRET mapping (Figure 5, cyan). Without any alteration of the preprotein X-ray structure, we find that the locations of residues 2, 22, and 37 (shown in blue, green, and yellow) on the excised OmpA preprotein fragment agree remarkably well with the FRET-mapped locations (Figure 5B) and suggest that the hairpin forms prior to channel entry. The OmpA preprotein ends at residue 41 in the X-ray crystal structure; however, the C-terminus of SecY, which is unstructured, provides an indication of the possible location of PhoA45. In our modeled structure, the hairpin loop sits at the mouth of the channel, poised to facilitate the translocation of the preprotein across the membrane. Thus, in this example, the FRET mapping methodology adds to the existing information on the static SecA-SecYEG structure by providing a hint of the dynamic motion needed for protein translocation across the membrane. Although not suitable for de novo structure determination, the FRET mapping methodology can, when 3-dimensional structural information is available, further the current understanding of structure-function relationships through elucidation of binding sites and dynamic motions.
The authors have nothing to disclose.
This work was supported by National Institutes of Health grant R15GM135904 (awarded to IM) and National Institutes of Health Grant GM110552 (awarded to DBO).
Name | Company | Catalog Number | Comments |
490 nm LED laser | Horiba | 1684-LED | |
Alexa Fluor 647 C2 Maleimide/DIBO Alkyne | Life Technologies | A20347 | |
Agar | Difco | DF0812 | |
Alexa Fluor 488 C5 Maleimide/DIBO Alkyne | Life Technologies | A10254 | |
Alexa Fluor 488 DIBO Alkyne | Life Technologies | S10904 | |
Alexa Fluor 647 DIBO Alkyne | Life Technologies | S10906 | |
Amicon Ultra4 Centrifugal filter (50kDa MWCO) | Sigma | UFC805008 | |
Dodecylmaltoside (DDM) | Anatrace | D310 | |
E. coli alkaline phosphatase signal peptide SP22 | Biomolecules Midwest | N/A | Synthesized custom item |
extended signal peptide SP41 | Biomolecules Midwest | N/A | Synthesized custom item |
FluorEssence | Horiba | version 2.4 | spectral acquisition program for Fluoromax4 spectrofluorometer |
Fluoromax 4 spectrofluorometer | Horiba | N/A | |
GlobalsWE | Laboratory for Fluorescence Dynamics, University of California, Irvine | | spectral analysis program for time-resolved decays |
H4AzidoPheOH | BACHEM | 4020250.0001 | |
LB (Miller) Broth | Fisher Scientific | BP9723 | |
Ludox HS-40 colloidal silica (40 wt.% suspension in H2O) | Sigma-Aldrich | 420816 | dilution is needed to make a proper scattering solution |
PTI Felix GX | Horiba | version 4.1.0.4096 | spectral acquisition program for PTI Time Master Instrument |
PTI Time Master Instrument | Horiba | N/A | |
Pymol Molecular Graphics Program | Schrodinger | version 2.4 | |
Water bath | Thermo Scientific | NESLAB RTE 10 | |