Plant intercellular connections, the plasmodesmata (Pd), play central roles in plant physiology and plant-virus interactions. Critical to Pd transport are sorting signals that direct proteins to Pd. However, our knowledge about these sequences is still in its infancy. We describe a strategy to identify Pd localization signals in Pd-targeted proteins.
Plasmodesmata (Pd) are cell-to-cell connections that function as gateways through which small and large molecules are transported between plant cells. Whereas Pd transport of small molecules, such as ions and water, is presumed to occur passively, cell-to-cell transport of biological macromolecules, such proteins, most likely occurs via an active mechanism that involves specific targeting signals on the transported molecule. The scarcity of identified plasmodesmata (Pd) localization signals (PLSs) has severely restricted the understanding of protein-sorting pathways involved in plant cell-to-cell macromolecular transport and communication. From a wealth of plant endogenous and viral proteins known to traffic through Pd, only three PLSs have been reported to date, all of them from endogenous plant proteins. Thus, it is important to develop a reliable and systematic experimental strategy to identify a functional PLS sequence, that is both necessary and sufficient for Pd targeting, directly in the living plant cells. Here, we describe one such strategy using as a paradigm the cell-to-cell movement protein (MP) of the Tobacco mosaic virus (TMV). These experiments, that identified and characterized the first plant viral PLS, can be adapted for discovery of PLS sequences in most Pd-targeted proteins.
Plasmodesmata (Pd) function as conduits for intercellular transport of key regulators of plant development and morphogenesis, ranging from transcription factors to mRNA and small RNA molecules. Furthermore, this macromolecular transport capacity of Pd is utilized by most plant viruses for their intercellular spread during infection; to move through Pd, plant viruses have evolved specialized proteins, termed movement proteins (MPs), that specifically target to Pd1,2,3,4,5,6,7. Molecular pathways of Pd transport most likely are intimately interconnected with the specific sequences that target the transported proteins into these pathways. Thus, identification of these Pd localization signals (PLSs) may be diagnostic of the corresponding Pd transport pathway. This is by analogy of Pd transport8, for example, to different nuclear import pathways, which can be specific for different nuclear localization signal (NLS) sequences9,10. Conceptually, both NLSs and PLSs represent non-cleavable subcellular targeting sequences that are necessary and sufficient for targeting. However, unlike NLSs11, the sequence information about PLSs is severely limited. Specifically, only four protein sequences involved in Pd targeting have been reported, with all of them derived from endogenous plant proteins. The first one is represented by a homeobox domain of KN112 – a transcription factor that moves from inner cell layers to epidermis of the plant leaf13 – and its KNOX homologs14. The second one also is from a transcription factor, Dof, which contains a putative PLS described as the intercellular trafficking (IT) motif15. The third sequence is from the PDLP1 plasmodesmata-resident type 1 membrane protein, and it is represented by a transmembrane domain16. Finally, the fourth Pd targeting sequence was recently reported for glycosylphosphatidylinositol (GPI)-anchored proteins and it is represented by the glycosylphosphatidylinositol (GPI) modification signal17.
Interestingly, until very recently, no PLSs have been reported for viral MPs. Previous studies indicated the presence of putative PLS sequences in plant viral MPs18,19, but no true PLS, i.e., a minimal amino acid sequence both necessary and sufficient for Pd targeting of an unrelated cargo molecule (e.g., CFP) has been identified in a viral MP. Yet one of these proteins, MP of the Tobacco mosaic virus (TMV), was the first for which Pd localization and transport have been demonstrated20.
To address this gap, we developed an experimental strategy to identify TMV MP PLS. This strategy was based on three concepts. (i) We defined PLS as a minimal amino acid sequence that is both necessary and sufficient for protein targeting to Pd21. (ii) Because TMV MP first targets Pd and then translocates through these channels22, we aimed at uncoupling these two activities and identifying the bona fide PLS, which functions only for Pd targeting, and not for the subsequent transport. (iii) We analyzed the identified PLS for amino acid residues important for its Pd targeting activity, whether structurally or functionally. Using this approach, we delineated a 50-amino acid residue sequence at the amino-terminus of TMV MP that acts as bona fide PLS. This was done by producing a series of TMV MP fragments that saturated the entire length of the protein, tagging their carboxyl-termini with CFP and transiently expressing them in plant tissues. Pd localization of each of the tested fragments was determined by coexpressing them with a Pd marker protein, PDCB1 (Pd callose binding protein 1)23. The smallest fragment that still localized to Pd, but did not traverse Pd, was considered to represent PLS. Finally, the PLS was alanine-scanned to determine the key amino acid residues required for its structure and/or function.
Whereas here we illustrate this approach by describing identification of TMV MP PLS, it may be employed to discover PLSs in any other Pd-targeted proteins, whether encoded by plant pathogens or by the plants themselves; this is because our method does not take advantage of any unique features of viral MPs with regards to their ability to target to Pd.
1. Plant Material
2. Expression Vector Construction
3. Agroinfiltration
4. Confocal Microscopy
5. Identification of PLS
6. Identification of Key PLS Residues using Alanine Scanning42
The representative data, which faithfully illustrate the results expected from the described protocols and identify the TMV MP PLS, are adapted from Yuan et al.21. Figure 1A first summarizes major constructs expressing the full-length TMV MP (1-268), TMV MP PLS (comprising the first 50 amino acid residues of the protein, 1-50), and its alanine scanning V4A derivatives fused to CFP (generated as described in Steps 2.2, 5.2 and 6) whereas Figure 1B summarizes and quantifies the subcellular localization of these tagged proteins. Figure 1C illustrates the characteristic peripheral punctate Pd localization patterns observed for TMV MP-CFP and for TMV MP PLS-CFP as well as for the coexpressed Pd marker, PDCB1-DsRed2 (see Step 5.1). Figure 2A illustrates Pd targeting of TMV MP-CFP and for TMV MP PLS-CFP and nucleocytoplasmic distribution of coexpressed free DeRed2 (see Step 5.1). Finally, Figure 2B illustrates the loss of Pd targeting by both TMV MP and TMV MP PLS with a single alanine substitution V4A, indicating that this residue is important for the PLS structure or function; in the same cells the coexpressed Pd marker PDCB1 still localizes to Pd (see Step 6).
These data show that this conceptually and technically simple procedure of systematic production of protein fragments, their fusion to an autofluorescent marker-cargo protein, and testing for the ability to localize to Pd can identify a minimal protein sequence necessary and sufficient for Pd targeting, i.e., PLS. For correct interpretation of these localization data, it is critical to perform colocalization experiments with a known Pd protein, e.g., PDLP or PDCB family members16,23 or one of the plant viral MPs that sort to Pd44. Obviously, the colocalization does not have to be complete, as not all P-localizing proteins may be targeted to the same Pd, but a statistically significant degree of colocalization with at least one of Pd marker proteins is necessary for defining the PLS activity.
Figure 1: Identification of TMV MP PLS. (A) Summary of main CFP-fusion constructs used to identify TMV MP PLS. TMV MP sequences, green boxes; CFP, blue boxes; P, promoter; T, terminator. Numbers on the left indicate the amino acid residues included in each TMV MP fragment. (B) Summary of subcellular localization of fusion proteins shown in (A). Pd localization or nucleocytoplasmic localization was determined based on the localization of the corresponding subcellular markers, PDCB1-DsRed2 and free DsRed2, respectively. The percentage of cells exhibiting each localization pattern is shown based on scoring 100 expressing cells per construct in three independent experiments. Data are mean ± standard errors (SE). (C) Pd targeting of TMV MP-CFP, TMV MP PLS-CFP, and the Pd marker PDCB1-DsRed2. CFP signal is in blue; DsRed2 signal is in purple; plastid autofluorescence was filtered out. Images are single confocal sections. Scale bars = 10 µm. DIC, Differential interference contrast micrograph. This figure is adapted from Yuan et al.21. Please click here to view a larger version of this figure.
Figure 2: Pd targeting of TMV MP, TMV MP PLS and their alanine-scanning mutants. (A) Subcellular localization of TMV MP-CFP, TMV MP PLS-CFP, and the nucleocytoplasmic marker DsRed2. (B) Loss of Pd targeting ability of the V4A alanine-scanning mutants of TMV MP-CFP and TMV MP PLS-CFP. Pd were visualized by coexpression of the Pd marker PDCB1-DsRed2. CFP signal is in blue; DsRed2 signal is in purple; plastid autofluorescence was filtered out. Images are single confocal sections. Scale bars = 10 µm. DIC, Differential interference contrast micrograph. This figure is adapted from Yuan et al.21. Please click here to view a larger version of this figure.
This protocol has four core constituents: the concept of identifying a sequence that is both necessary and sufficient for targeting to Pd, systematic division of the protein of interest into fragments that are progressively reduced in length, fusing the tested fragments to an autofluorescent protein that serves both as tag and as macromolecular cargo, and functional assay for Pd targeting in living plant tissues following transient expression of the tested fusion proteins. Note that Agrobacterium-mediated transient expression generates data within a relatively short period of time, i.e., several days, as compared to months required for production of transgenic plants often used in similar studies15.
Coexpression of the protein of interest with subcellular localization markers, e.g., PDCB1 or free DsRed2, usually is performed from a multigene expression vector. However, if transfer of several expression cassettes into a single vector is technically problematic, coexpression can also be performed by transferring each cassette into a separate binary construct and combining equivalent volumes of liquid cultures of the resulting Agrobacterium strains for agroinfiltration. Because this protocol utilizes transient expression, the binary vector need not carry selection markers for stable transformation, although their presence is not detrimental to the expression efficiency.
This protocol is optimized for protein expression and targeting in N. benthamiana; however, it can be used in other plant species, including Arabidopsis45,46, amenable to genetic transformation by Agrobacterium. For other plant species, or if otherwise desired, this protocol can be adapted to utilize biolistic delivery41 for transient expression directly from pSAT-based vectors, avoiding the need to transfer the expression cassettes to binary vectors.
A critical point in this protocol is the consistent physiological and developmental condition of the plant material. Specifically, all plants must be healthy throughout the entire experiment, and, for agroinfiltration, the N. benthamiana plants must be less than 4-weeks old and at the 4-8-leaf stage with the largest leaf being 4-6 cm in diameter. If the plants are too young, the effects of agroinfiltration will be too severe, interfering with expression and localization of the tagged proteins. On the other hand, the older plants often exhibit variable architecture of Pd24,25, also affecting the consistency of Pd-targeting patterns.
Still, due to varying physiological conditions of the plant as a whole, and of its transformed cells in particular, the efficiency of Pd targeting in each experiment may somewhat vary. Thus, it is important to analyze a relatively large number of cells (>100) per experiment, and perform at least three biological replicates of each experiment. Generally, we consider a tested fragment to contain Pd targeting activity, if it exhibits Pd localization in at least 70% of the transformed cells. Differences in Pd targeting efficiency of each fragment, e.g., percent of transformed cells showing the Pd-specific punctate accumulation pattern, should be analyzed by the Student's t-test, and p-values <0.05-0.01, corresponding to the statistical probability of greater than 95-99%.
Another important issue is the position of the fluorescent tag/cargo in the fusion molecule. Some Pd-targeted proteins also associate with the endoplasmic reticulum (ER), and this association might require the unobstructed amino-terminal sequence. In these cases, amino-terminal fusions, in which the carboxyl-terminus of the protein fragment of interest is fused to the amino-terminus of the fluorescent tag, e.g., CFP or DsRed2, should be used. This default placement of the tag can be altered in the following cases. (i) If the amino-terminal fusions are not usable – i.e., they do not exhibit meaningful subcellular localization patterns, are poorly expressed, or do not generate fluorescent signal sufficient for reliable detection – and if the protein of interest does not contain amino-terminal signal sequences or carboxyl-terminal fusions in which the amino-terminus of the tested fragment is fused to the carboxyl-terminus of the tag, they should be attempted. (ii) The carboxyl-terminal fusions also should be used if the protein of interest contains signal sequences at its carboxyl-terminus. (iii) If the protein of interest has both amino- and carboxyl-terminal signals, such as those found in GPI-anchored proteins, or if it contains an amino-terminal signal peptide yet its amino-terminal fusion is not usable, the tag should be placed internally, (either downstream of the amino-terminal signal or upstream of the carboxyl-terminal signal). Some useful procedures for internal fluorescent tagging of proteins are described in our earlier publications47,48.
At present, there is no known consensus sequence for PLS. As such (in step 5.2) we recommend, initially subdividing the protein into contiguous ~100-200 residue-long fragments. These can then be made progressively shorter based on the observed Pd-targeting activity or the lack thereof. One should be mindful of other potential targeting signals present in the protein (e.g., NLS, signal peptides, etc.). However, as the potential overlap of such sequences with PLS sequences is unknown, these protein regions should be included in this mutational analysis.
So far, this protocol was used to identify a single PLS in a protein molecule21. Potentially, however, a protein can contain more than one signal sequence; indeed multiple NLSs have been described in many nuclear proteins, including GATA transcription factors49. Thus, when analyzing different fragments of the tested protein for Pd targeting, it is possible that more than one such fragment will be identified, indicating the presence of multiple PLSs which may act independently or synergistically.
The main limitation of this protocol is that, following agroinfiltration, visualization of subcellular localization of transiently expressed proteins of interest is best achieved in epidermal cells and at relatively narrow growth stage. When necessary, this limitation can be overcome by producing transgenic plants that stably express each of the tested proteins in most of their tissues and cell types. Also, this protocol would not be useful for studies of viral MPs or other Pd-targeted proteins that function only in plants species that are not amenable to transient genetic transformation.
While we developed this technique for identification of a PLS of TMV MP, it can be used for the discovery of PLSs in any other plant viral MPs or in plant endogenous proteins that localize to Pd, transiently or permanently. Furthermore, if combined with appropriate subcellular localization markers47, this protocol is suitable for identification of many other targeting signals within proteins of interest in plants.
The authors have nothing to disclose.
For the lack of space, we cited mostly review articles, and we apologize to our colleagues whose original work was not cited. The work in the V.C. laboratory is supported by grants from NIH, NSF, USDA/NIFA, BARD, and BSF to V.C., and the S.G.L. laboratory is supported by NIH and funds from the Departments of Plant Pathology and Plant-Microbe Biology to S.G.L.
Confocal laser scanning microscope (CLSM) | Zeiss | LSM5 | Any CLSM with similar capabilities is appropriate |
Zen software for confocal microscope imaging | Zeiss | 2009 version | The software should be compatible with the CLSM used |
Quickchange II site-directed mutagenesis kit | Agilent | 200523 | |
Acetosyringone | Sigma-Aldrich | D134406 | |
MES | Sigma-Aldrich | 69892 | |
Syringes without needles | BD | 309659 | |
MgCl2 | FisherScientific | M33-500 | |
Spectinomycin | Sigma-Aldrich | S4014 | |
Rifampicin | Sigma-Aldrich | R3501 | |
Ampicillin | Sigma-Aldrich | A0166 |