An approach is presented for determining structures of viral membrane glycoprotein complexes using a combination of electron cryo-tomography and sub-tomogram averaging with the computational package Jsubtomo.
Enveloped viruses utilize membrane glycoproteins on their surface to mediate entry into host cells. Three-dimensional structural analysis of these glycoprotein ‘spikes’ is often technically challenging but important for understanding viral pathogenesis and in drug design. Here, a protocol is presented for viral spike structure determination through computational averaging of electron cryo-tomography data. Electron cryo-tomography is a technique in electron microscopy used to derive three-dimensional tomographic volume reconstructions, or tomograms, of pleomorphic biological specimens such as membrane viruses in a near-native, frozen-hydrated state. These tomograms reveal structures of interest in three dimensions, albeit at low resolution. Computational averaging of sub-volumes, or sub-tomograms, is necessary to obtain higher resolution detail of repeating structural motifs, such as viral glycoprotein spikes. A detailed computational approach for aligning and averaging sub-tomograms using the Jsubtomo software package is outlined. This approach enables visualization of the structure of viral glycoprotein spikes to a resolution in the range of 20-40 Å and study of the study of higher order spike-to-spike interactions on the virion membrane. Typical results are presented for Bunyamwera virus, an enveloped virus from the family Bunyaviridae. This family is a structurally diverse group of pathogens posing a threat to human and animal health.
Electron cryo-tomography is an electron cryo-microscopy imaging technique allowing the calculation of a three-dimensional (3D) reconstruction of complex biological specimens. Suitable specimens range from purified macromolecular complexes1, filaments2, coated vesicles3, and pleomorphic membrane viruses4 to whole prokaryotic cells5 and even thin areas of whole eukaryotic cells6. Following the data collection of a tilt-series, 3D tomographic volumes, or tomograms, may be calculated using several established software packages, including Bsoft7 and IMOD8.
Two aspects inherent to the study of biological specimens by electron cryo-tomography limit the biological interpretation of the corresponding tomographic volumes. First, due to the limited electron dose that can be applied to biological materials before introducing significant radiation damage, signal-to-noise ratios in tomographic data are typically very low. Second, as a result of limited sample tilt geometry during data collection, some views of the object remain absent, leading to a so-called ‘missing wedge’ artifact in the tomographic volume. However, both of these limitations can be overcome if the tomographic volume contains repeating identical structures, such as macromolecular complexes, that can be successfully averaged9-12.
Prior to averaging structures from tomogram reconstructions, objects of interest must be found and aligned to the same orientation. Locating such structures may be achieved by cross-correlation of a template structure in the tomographic volume using an approach often referred to as template matching13. The template used in this matching process can be derived from electron cryo-microscopy or electron cryo-tomography combined with 3D reconstruction, or it can be a density map simulated from an atomic structure. Several computational packages have been developed to carry out these tasks11.
Averaging of glycoprotein spikes of membrane viruses, such as HIV-1, has been a particularly successful approach for studying their structure14-16. An understanding of the structure is integral for revealing both the molecular basis of virus–host interactions and guiding antiviral and vaccine design development. While macromolecular crystallography is the technique of choice for high-resolution (usually better than 4 Å) structural analysis of individual viral glycoproteins and their complexes, the X-ray structures resulting from this method are of proteins isolated from the natural membranous environment on the virion. Thus, important details such as the higher order architecture of viral glycoproteins, in the context of the virion, remain lacking. On the other hand, electron cryo-microscopy and single particle reconstruction of entire enveloped viruses is restricted to virions with icosahedral symmetry17,18. Electron cryotomography combined with sub-volume alignment has thus emerged as a complementary technique allowing the study of glycoprotein spikes of irregularly shaped, pleomorphic viruses in situ.
We have developed software named Jsubtomo (www.opic.ox.ac.uk/jsubtomo) for the detection, alignment, and averaging of tomographic sub-volumes. Jsubtomo has been utilized in the structure determination of a number of cellular and viral structures19-26. Here, we outline a detailed protocol, which enables the determination viral-surface spike structures. To circumvent over-refinement of averaged structures by correlating noise, the ‘gold-standard’ refinement scheme is adopted10,27. Finally, strategies for visualization and interpretation of typical results are discussed.
A detailed protocol for the computational alignment and subsequent averaging of viral glycoprotein spikes is outlined. The protocol follows the workflow illustrated in Figure 1 and combines an automated search for the spikes using initial template structures and a gold-standard structure refinement.
Input data for this protocol is a set of tomographic reconstructions of the virions. One tomogram contains one or more virions. Initially, a small subset of spikes is manually picked and used to average and refine two independent models. These models are used to automatically locate spikes on all of the virions. Finally, two independent refinements are run and the resulting averages are compared and combined to produce the final structure.
The refinement approach is demonstrated by using programs from the Jsubtomo package. Programs from the Bsoft package28 are used for general image processing tasks and molecular graphics package UCSF Chimera29 is used to visualize results. The names of individual programs are given in italics and file formats are denoted with uppercase filename extensions.
Figure 1: General strategy for determining structures of glycoprotein spike complexes from pleomorphic membrane viruses. The numbers correspond to different sections in the Protocol. Please click here to view a larger version of this figure.
1. Extracting Virus Sub-volumes from Full-size Tomograms
2. Generation of Two Independent Initial Models
3. Gold-standard Iterative Alignment and Averaging of the Two Initial Spike Models
Iteratively align and average the two initial models generated in section 2 with jsubtomo_iterate_gold.py.
4. Generation and Alignment of Seeds to the Virus Surface for Template Matching
5. Gold-standard Iterative Alignment and Averaging of the Spike Structure
Automatically locate all the spikes in the virion sub-volumes using local template matching around the refined seeds and align and average the located spikes. Use the averages generated from a subset of manually picked spikes as initial templates.
6. Visualization of the Results
We demonstrate the application of the sub-tomogram averaging workflow outlined above for the envelope glycoprotein complex of Bunyamwera virus (Orthobunyavirus, Bunyaviridae) using a previously published data set24. Data collection and refinement parameters are listed in Table 1. One representative tomogram is shown in Figure 2.
Parameter (unit) | Value |
Data collection | |
Voltage (kV) | 300 |
Calibrated magnification (X) | 111,000 |
Pixel size (Å) | 5.4 |
Estimated dose (e–/ Å2) | 100 |
Underfocus (μm) | 4.0–4.5 |
First CTF zero (Å)a | 26–30 |
Tilt range (°) | –60–60 |
Tilt sampling (°) | 3 |
Data and refinement | |
Tomograms | 11 |
Virions | 29 |
Seeds per virion | 106 |
Total number of seeds | 3,074 |
Spikes detected | 1,346 |
Spikes after removing overlaps | 1,401 |
Spikes after cross-correlation based selection | 1,022 |
Spikes included in the final average | 1,022 |
Symmetry | C3 |
Angular sampling (°) | 8 |
Resolution range used in the refinement (Å) | 42–334 |
Final resolution estimate (Å)b | 35 |
Table 1: Bunyamwera data collection and refinement statistics.
a CTF, contrast transfer function.
b Calculated using Fourier shell correlation between two independently refined structures at a threshold of 0.143.
Figure 2: Slice through a tomogram of Bunyamwera virions. Several spike side views evident in the periphery of each virion are indicated with arrowheads. The tomogram has been low-pass filtered to 60 Å. Scale bar 100 nm.
First, we refined an initial model using 205 manually picked spikes (Figure 3). Three-fold symmetry of the center-most spike was evident without applying any symmetry (Figure 3B) and was imposed in the subsequent rounds of refinement (Figure 3C). For detecting all the spikes automatically on the virion surfaces, we generated 106 seeds for each virion at the radius of 43 nm and spacing of 20 degrees (Figure 4A) and iteratively refined their positions relative to the membrane (Figure 4B).
Figure 3: Refinement of the initial template structure. (A) Cylindrically averaged (C100) template constructed from manually defined positions of spikes. (B) Averaged density after five rounds of refinement without any symmetry (C1) imposed displays a spike with three-fold symmetric features. The resolution of the model is 48 Å. (C) Average of the spike was resolved at 41 Å after five rounds of refinement with three-fold symmetry.
Figure 4: Refinement of the seeds. A-B) A subset of seeds before (A) and after (B) refinement are shown on one virion density from Figure 2. Seeds in (B) have been color-coded based on the respective cross-correlation coefficients (blue, low correlation; red, high correlation).
The best correlating glycoprotein spike patches (top 75% after removing overlaps; ~1,000 spikes) were used to calculate the final average. The average was resolved to 35-Å resolution (Figure 5). It revealed a trimeric spike structure in the middle, in addition to some contribution from six neighboring spikes. Composite models of the virions, calculated by placing the structure in the known positions, revealed placement of the spikes on the virion surface (Figure 6A). Occasionally, locally ordered patches of spikes were evident (Figure 6B).
Figure 5: Refined structure of a patch of the glycoprotein spike layer after template matching. (A-B) Two maps ‘even’ and ‘odd’, reconstructed from two halves of the data are shown. The two maps show a remarkable degree of similarity, verifying the validness of the approach. The orientation around the spike long axis is different as the two maps have been reconstructed fully independently from each other. (C) Average of the two maps is shown at the final resolution of 35 Å.
Figure 6: Placement of the spikes on the Bunyamwera virion. (A) Composite model of a virion. View vectors (sticks) indicate the orientations of the spikes. Colors indicate the cross-correlation between each spike and the template structure (blue, low correlation; red, high correlation). (B) A close-up of an ordered patch of spikes.
Knowledge of viral glycoprotein spike structure on the virion membrane is essential for understanding virus replication and developing therapeutics to treat and prevent infection. Electron cryo-microscopy combined with single particle averaging has emerged as the most utilized method to solve the structures of enveloped virus particles, including the glycoprotein spikes. However, this method is limited to icosahedrally symmetric viruses. Here, through the application of electron cryo-tomography and subtomogram averaging in Jsubtomo, we have outlined a general protocol for determining glycoprotein spikes on pleomorphic enveloped viruses that are not amenable to other current structural biology methods. Our representative results demonstrate that the resolution of this method is sufficient to reveal insights into domain architecture, oligomerization, and higher order organization of glycoprotein spikes on intact virions.
The most critical step within this protocol is constructing two reliable starting models that are statistically independent from one another. Successful execution of this step assumes that glycoprotein spikes are sufficiently large and not packed against each other too tightly, so that individual spikes can be visually recognized and manually picked in the tomograms, and two independent models averaged. If this is not feasible, two modifications to the protocol can be attempted. First, two independent random models can be constructed by first defining two random subsets of subtomograms and then averaging the subtomograms within these subsets30. Second, if a structure of the isolated spike has been derived by other means, for example by X-ray crystallography, it can be used as a starting model. However, care must be taken to low-pass filter this model using a low-resolution cut-off (50-70 Å), as the two resulting models in the next round of refinement will be statistically independent only beyond this resolution. Due to this caveat, the former approach is recommended.
The obtainable resolution from this protocol depends on four major factors: i. data collection strategy and the quality of the input data, ii. number of the subtomograms, iii. alignment accuracy of the subtomograms, and iv. heterogeneity of the structures. While the first and second limitation can be largely overcome by using high signal-to-noise direct electron detectors combined with CTF corrected tomography and automated data collection, the alignment accuracy is further affected by the size and shape of the structure of interest itself. When applying this protocol on small spikes lacking prominent features, it may be advantageous to bind Fab fragments to the spike to improve the alignment accuracy and thus resolution31. Finally, if the structures to be averaged exhibit multiple conformations, sub-tomogram classification methods may be used to average different conformations separately. To that end, Jsubtomo integrates with the Dynamo package, offering powerful subtomogram classification9.
The above protocol is complementary to X-ray crystallography of isolated viral glycoproteins. Crystallographic structures can be fitted into sub-tomogram averages to obtain the precise orientation of the glycoprotein with respect to the virion membrane. Application of this methodology will undoubtedly continue to shed light onto enveloped virus structure and pathobiology.
The authors have nothing to disclose.
This work was supported by the Academy of Finland (130750 and 218080 to J.T.H.), the Wellcome Trust (090532/Z/09/Z; 089026/Z/09/Z to T.A.B.), and by the MRC (MR/J007897/1 to J.T.H and T.A.B; MR/L009528/1 to T.A.B.).
Name of Material/ Equipment | Company | Catalog Number | Comments/Description |
Jsubtomo (ver 1.3.1) | University of Oxford | n/a | www.opic.ac.uk/jsubtomo |
Bsoft (ver 1.8.7) | NIAMS, NIH | n/a | bsoft.ws |
UCSF Chimera | UCSF | n/a | www.cgl.ucsf.edu/chimera |