This protocol enables the impact of prophages on their hosts to be revealed. Bacterial cultures are synchronized using conditions that best support the lysogenic state, limiting spontaneous induction. RT-qPCR unequivocally distinguishes prophage-restricted genes and those uncoupled from phage control from those that are expressed during the lytic replication cycle.
Temperate phages are found integrated as prophages in the majority of bacterial genomes. Some prophages are cryptic and fixed in the bacterial chromosome, but others are active and can be triggered into a replicative form either spontaneously or by exposure to inducing factors. Prophages are commonly associated with the ability to confer toxin production or other virulence-associated traits on their host cell. More recent studies have shown they can play a much bigger role in altering the physiology of their hosts. The technique described here has enabled us to investigate how prophages affect gene expression in the opportunistic bacterium Pseudomonas aeruginosa.
In this work, the growth of the wild-type P. aeruginosa strain PAO1 was compared with that of isogenic lysogens carrying different combinations of prophages from the Liverpool Epidemic Strain (LES) LESB58. In a lysogen culture, a proportion of bacterial cells will be supporting lytic bacteriophage replication (spontaneous induction) with a high level of expression per cell of late phage genes, such as those associated with the assembly of phage particles, thus masking the low-level gene expression associated with lysogen-restricted gene expression. The impact of spontaneous induction can thus obscure prophage gene expression across a lysogen population.
Growth profiling experiments were used to identify spontaneous induction, which was minimal during the early exponential growth phase. This study reports how to prepare sample cultures during the early exponential growth phase and how to set up adequate controls despite low cell numbers. These protocols ensure the reliable and reproducible comparison of wild-type and lysogenic bacteria under various conditions, thus improving the transcriptomic profiling of prophage genomes and aiding in the identification of previously unrecognized prophage functions.
Recently, phage therapy for tackling antimicrobial resistance [1] and CRISPR-Cas-based gene editing [2] have generated renewed interest in bacteriophage research. In addition, advancements in biotechnology have enabled deeper investigation of the interactions between bacteria and phages [3]. However, the therapeutic use of phages ("phage therapy") is hampered by concerns about phages acting as mobile genetic elements with the capacity to transfer virulence and resistance genes horizontally [4]. The expanse of "dark matter" [5] (genes with unknown functions) is both troubling and enticing. Dark matter is considered a gap in our understanding of phage biology and a largely untapped resource for molecular tools and potential novel therapeutics [6]. The development of high-throughput sequencing techniques, along with improved gene annotation [7,8,9] and new peptide-folding algorithms [10], is improving the detection, description, and functional prediction of phage genes. However, the field is still far from validating the functions of most phage genes in culture or in the real world.
RNA sequencing (RNA-Seq) can globally map gene expression during phage infection and has significantly improved the understanding of both the phage and bacterial elements involved in lytic and lysogenic cycles [11,12]. During lysogenic processes, temperate phage genomes are integrated into bacterial DNA to become prophages [13]. Global gene expression profiling experiments can be used to identify prophage-restricted genes that are encoded on temperate phage genomes but only expressed during the lysogenic state [11]. Such genes do not encode phage structural proteins and are not involved in any phage infection processes. RNA-Seq can be used to identify those genes that are more likely to influence the biology of the bacterial host, either by inducing a gain of function or regulating the existing bacterial genes, thus often enabling the bacteria to adapt to changing environments. Therefore, the ability of prophages to act as microbial puppet masters, controlling a range of bacterial functions, can be studied.
There are two major barriers to the effective analysis of prophage-restricted gene expression. Firstly, the availability of susceptible hosts is a key issue. By definition, prophages are already incorporated into their specific host genome, so it is challenging to find a susceptible wild-type host to compare the global gene expression in the presence and absence of the prophage. This can be achieved either through the de novo infection of another susceptible host or the deletion of the prophage from the original wild-type isolate, without disrupting the rest of the host genome. The second barrier lies in the heterogeneous nature of lysogenic populations. Some prophages degrade through mutation or recombination to become "cryptic", meaning they are fixed in a specific location of the bacterial genome. However, other prophages are "active" and can be induced into a replicative, lytic cycle spontaneously or after exposure to inducing factors. In many lysogenic cultures, the rate of spontaneous induction means that a proportion of the bacterial cells are always undergoing lytic phage replication [14,15,16]. A high level of expression of late phage genes in these populations masks the low-level gene expression associated with lysogen-restricted gene expression [11,17]. The proportion of lysogens undergoing spontaneous prophage induction may vary with the growth state, growth conditions, or other triggers. Therefore, to study the impacts of prophages upon the lysogen, spontaneous prophage induction events must be minimized as much as possible by optimizing the growth conditions to favor the lysogenic state.
This study reports the preparatory work done to investigate the influence of a set of cohabiting prophages from the Liverpool Epidemic Strain (LES) of Pseudomonas aeruginosa. Active prophages were induced and isolated from LES and used to infect the model P. aeruginosa host strain, PAO1 [16,18,19]. The whole genomes of the wild-type P. aeruginosa strain, PAO1, and its lysogen, PAO1Φ2, were sequenced (at a depth of 30x coverage) to ensure the identity of the wild-type strain and to confirm that the lysogen was isogenic. The LES has been associated with increased morbidity and mortality in cystic fibrosis patients, and LES phages [19] have been suggested to aid adaptation to the cystic fibrosis lung environment [16,19,20]. Despite strong evidence that these prophages affect the biology of their host [20,21], the majority of their gene functions are yet to be characterized, and the specific mechanisms of interaction are poorly understood. A transcriptomics approach can empirically uncover the prophage gene functions in a controlled host background. Since spontaneous induction can affect expression profiles, this article describes how to optimize the growth conditions to favor the lysogenic state. Such synchronization of cultures can be validated by real-time PCR to quantify the expression levels of key genetic markers that are associated with crucial stages of LES phage replication in PAO1. The same approach has been used previously to identify the prophage-restricted functions of Shiga-toxigenic phages that affect motility, acid resistance, and antimicrobial resistance in Escherichia coli [11,17,21,22].
1. Create a selectable indicator host (Figure 1)
NOTE: Phage culture lysates can contain contaminating cells from the original bacterial host. Having an antibiotic-resistant indicator strain allows for the discrimination between the indicator strain and the original bacterial host of the prophage. Using a selectable indicator strain enables the accurate enumeration of the infective phage particles without requiring centrifugation or filtration steps to remove the phage from the lysogen cells following the phage amplification steps. The selectable indicator host strain also reduces the time and number of steps for phage enumeration so that multiple conditions can be trialed simultaneously.
2. Temporal direct enumeration of spontaneous induction (Figure 2)
3. Preparation of un-induced and induced lysogen cultures for RNA extraction (Figure 3)
4. Isolation of RNA from un-induced and induced lysogen cultures
CRITICAL: All these steps should be performed in an RNase-free environment [28]. The workbenches should be wiped with 10% NaClO or proprietary RNase inactivators. The labware should be treated to inactivate RNases (for example, by DEPC treatment), and nuclease-free water should be used in all the reactions.
5. Removal of contaminating DNA from the RNA by DNase treatment
6. Qualitative and quantitative analysis of the DNase-free RNA
7. First-strand cDNA synthesis
8. Standard curve and quantitative (q)-PCR to determine the expression levels of marker genes that indicate different stages of phage replication
In this work, the direct temporal enumeration of phage production from a PAO1 LESΦ2 lysogen culture grown under non-inducing conditions was used to determine the impact of spontaneous LESΦ2 induction. The phage density was at its lowest point, with a mean of ~2.61 × 10⁶ plaque-forming units (PFU)·mL⁻¹, 2 h after subculture in fresh medium during the early exponential phase of growth, suggesting that lysogeny was the dominant state. The LESΦ2 titer rapidly increased to a mean of ~2.4 × 10⁸ PFU·mL⁻¹ within 4 h and reached the highest density after 6 h (mean of ~5.83 × 10⁹ PFU·mL⁻¹; Figure 4).
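The arithmetic behind these titers can be sketched as follows. This is a minimal illustration, not the authors' analysis script: the plaque count, dilution, and plated volume are hypothetical values chosen only to show the calculation, while the three mean titers are the approximate figures quoted in the text.

```python
import math

def pfu_per_ml(plaque_count, dilution, volume_plated_ml):
    """Titer (PFU/mL) = plaques counted / (dilution x volume plated in mL)."""
    return plaque_count / (dilution * volume_plated_ml)

# Hypothetical plate (assumption): 26 plaques from 0.01 mL of a 10^-3 dilution.
example_titer = pfu_per_ml(26, 1e-3, 0.01)

# Approximate mean titers quoted in the text (PFU/mL) at 2, 4, and 6 h.
reported_titers = {2: 2.61e6, 4: 2.4e8, 6: 5.83e9}

def log10_increase(later, earlier):
    """Rise in titer between two time points, in log10 units."""
    return math.log10(later / earlier)

rise_2_to_4 = log10_increase(reported_titers[4], reported_titers[2])
rise_2_to_6 = log10_increase(reported_titers[6], reported_titers[2])

print(f"example titer: {example_titer:.2e} PFU/mL")
print(f"log10 rise, 2 h to 4 h: {rise_2_to_4:.2f}")
print(f"log10 rise, 2 h to 6 h: {rise_2_to_6:.2f}")
```

On the quoted means, the titer rises by roughly two log10 units between 2 h and 4 h and by a little over three between 2 h and 6 h, which is why the 2 h window is treated as the point of minimal spontaneous induction.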
Minimal spontaneous induction was observed during the early log phase of lysogen growth (after 2 h). However, the measurable presence of phages in the culture medium was the result of many prior events, including the following: the packaging of nucleic acids into protein heads, the assembly of proteins into phage particles, and the expression of late phage genes, middle-stage phage genes, and early regulatory phage genes. It was important to catch the infected cells prior to the expression of the phage-associated replication events; hence, 90 min was chosen to let the culture grow prior to induction. To capture the gene expression profile of the PAO1 LESΦ2 lysogen, samples from a culture were harvested pre-induction and post-induction over a 90 min period, as mentioned in step 3.4. This 90 min time point is well before high levels of spontaneous induction of the resident prophage are detected by the plaque assay from step 2.3.2. Since the bacterial cell density was low during early exponential growth, the culture volumes were scaled up to 800 mL to ensure ample material for the gene expression studies. The samples were collected from the un-induced and induced cultures every 10 min, and RNA was extracted to map the expression profile of the key markers for lysogeny and lytic replication during the bacterial growth. Total RNA was purified and validated for the absence of genomic DNA using qPCR assays targeting the 16S rRNA gene (step 6.1). The samples reaching an RNA integrity number (RIN) ≥ 9 passed quality control and were converted to cDNA.
The annotated LESΦ2 genome was examined to identify genes that are well-known players in the lysogenic and lytic replication cycles of temperate phages. These identified genes were then used to validate the qRT-PCR for the expression profiling of the lysogen cycle-restricted and lytic cycle-associated genes from induced and un-induced cultures. We quantified the absolute DNA copy number and conducted a Wilcoxon signed-rank test using R [36] to compare the expression levels in un-induced and induced cultures (Figure 5). A marked increase in the expression of the cro gene (an early marker of lytic replication) from ~2.31 × 10⁹ copies in un-induced cultures to ~3.02 × 10¹¹ copies 30 min post-induction (Wilcoxon signed-rank test: p < 0.01) was observed. Similarly, the genes encoding the O and P proteins, which are mid-stage markers of lytic replication (and are predicted to be involved in phage genome replication), also showed significant upregulation from ~1.74 × 10⁸ to ~1.25 × 10¹⁰ copies (Wilcoxon signed-rank test: p < 0.01) and from ~6.05 × 10² to ~5.68 × 10⁵ copies (Wilcoxon signed-rank test: p < 0.01), respectively. Finally, the tail-associated structural genes were used as late markers of the lytic replication cycle. Again, we observed a significant increase in expression from ~2.31 × 10⁶ copies in un-induced cultures to ~4.38 × 10⁸ copies 30 min post-induction (Wilcoxon signed-rank test: p < 0.01). Thus, the quantitative RT-PCR data confirmed that the gene expression of well-established marker genes for lytic replication followed the expected trend, with the early, mid, and late markers showing multiple-fold differential expression in the predicted order (Figure 5). Since the expression of the markers for lytic replication was upregulated 30 min post-recovery, this is considered an appropriate representative time point for studying the transcriptomic landscape of active temperate phages and their bacterial hosts during the lytic cycle.
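To make the size of these changes concrete, the fold changes implied by the copy numbers quoted above can be worked out directly. The sketch below only re-uses the approximate values from the text; the dictionary keys are illustrative labels, and the Wilcoxon test itself (run in R by the authors on the replicate data) is not reproduced here.

```python
import math

# Approximate absolute copy numbers quoted in the text:
# (un-induced, 30 min post-induction). Keys are illustrative labels.
reported_copies = {
    "cro (early lytic)": (2.31e9, 3.02e11),
    "O (mid lytic)": (1.74e8, 1.25e10),
    "P (mid lytic)": (6.05e2, 5.68e5),
    "tail (late lytic)": (2.31e6, 4.38e8),
}

def fold_change(before, after):
    """Ratio of post-induction to un-induced copy number."""
    return after / before

fold_changes = {
    gene: fold_change(before, after)
    for gene, (before, after) in reported_copies.items()
}

for gene, fc in fold_changes.items():
    print(f"{gene}: {fc:.0f}-fold ({math.log10(fc):.2f} log10 units)")
```

On these figures, every lytic marker rises by roughly two to three orders of magnitude within 30 min, with the P gene showing the largest relative change from the lowest baseline.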
We observed some expression of lytic genes in un-induced conditions, confirming that some spontaneous induction always occurs, even in optimized early log phase cultures in which the ratio of lysogen CFU to released PFU is at its highest. This means that there will always be some level of “noise” in the transcriptomics data, which reinforces the importance of carefully prepared controls, including induced and un-induced cultures. The appropriate choice of the internal control genes to determine the fold changes in expression relies on carefully examining the transcriptomics data to identify genes that are expressed at the same level in both the un-induced and induced samples. Our preliminary results suggest that rpoD was the most reliable control gene tested and had the most stable expression (~1.71 × 10⁵ copies before induction and ~3.33 × 10⁵ copies 30 min post-induction; Wilcoxon signed-rank test: p = 0.3594) compared to the 16S rRNA or proC genes (Figure 5). Because the expression of the internal controls was variable, the absolute numbers of transcripts were measured instead. Future examination of the transcriptomics data will support the choice of appropriate internal controls for further validation.
The cI gene was used in our gene profiling exercise, as it is a well-recognized marker of lysogeny. Compared to the markers for lytic replication, the expression of the cI gene was relatively stable (Figure 5), and the copy number of this gene was reassuringly high in the un-induced cultures compared to those of the markers for lytic replication. These data are in agreement with the low PFU numbers in the same samples, thus confirming that high repressor expression was associated with lower levels of phage production. The data reported here demonstrate that the expression of the cI transcript for this particular phage is not significantly downregulated post-induction, as seen in the Stx phages [11,17]. Repressor activity is normally controlled at both the transcriptional and post-translational levels, so the repressor gene can be transcribed, but the resultant protein is immediately subjected to autocleavage. Further experimentation is required to validate transcriptional and post-translational controls. Moreover, from our standard curve, the minimum detection limit of qPCR appears to be ~10² copies.
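The relationship between a qPCR standard curve, primer efficiency, and absolute copy number can be sketched as below. This is a generic illustration under stated assumptions rather than the authors' pipeline: the standard dilution series and all Cq values are invented for the example, and only the ~10² copy detection limit comes from the text. The efficiency formula, E = 10^(−1/slope) − 1, is the standard one for a curve of Cq against log10 copies.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical standards (assumption): known copies per reaction and
# invented Cq values, used only to demonstrate the calculation.
standard_copies = [1e2, 1e3, 1e4, 1e5, 1e6, 1e7]
standard_cq = [33.1, 29.8, 26.4, 23.1, 19.7, 16.4]

slope, intercept = fit_line(
    [math.log10(c) for c in standard_copies], standard_cq
)

# A slope near -3.32 corresponds to ~100% amplification efficiency.
efficiency = 10 ** (-1 / slope) - 1

DETECTION_LIMIT_COPIES = 1e2  # approximate limit quoted in the text

def copies_from_cq(cq):
    """Interpolate absolute copy number from the standard curve."""
    return 10 ** ((cq - intercept) / slope)

def quantify(cq):
    """Return copies, or None when the estimate is below the detection limit."""
    copies = copies_from_cq(cq)
    return copies if copies >= DETECTION_LIMIT_COPIES else None

print(f"slope = {slope:.3f}, efficiency = {efficiency:.1%}")
print(f"Cq 26.4 -> {copies_from_cq(26.4):.2e} copies")
print(f"Cq 36.0 -> {quantify(36.0)}")
```

Returning `None` for sub-threshold estimates is a design choice of this sketch; it keeps values extrapolated beyond the lowest standard from being reported as if they were measured.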
Together, our findings from plaque and qRT-PCR assays validate our strategy for culture and RNA sample preparation to generate a well-controlled input for RNA-Seq experiments. The un-induced cultures in the early-exponential phase exhibited low levels of spontaneous induction and lytic gene expression, suggesting the dominance of lysogeny. In contrast, the cultures isolated 30 min after induction showed significant increases in the expression of marker genes that indicate the dominance of lytic replication.
Figure 1: The protocol for creating the rifampicin-resistant indicator host. (Created with BioRender.com)
Figure 2: The experimental design for enumerating the PFU and CFU of a lysogen from the same sample. (Created with BioRender.com)
Figure 3: The experimental design for sampling induced and un-induced cultures for RNA isolation. (Created with BioRender.com)
Figure 4: Temporal enumeration of spontaneous induction. Temporal enumeration of spontaneous LES prophage production using the PFU from the PAO1 Φ2 lysogen with the concurrent CFU, n = 8 (two biological and four technical replicates); the error bars represent the standard deviation. The dark red points indicate the CFU·mL⁻¹ in LB; the dark blue points indicate the PFU·mL⁻¹ in LB. The spontaneous release of the φ2 infective phage by the lysogens is at the lowest measurable level at 2 h post-inoculation.
Figure 5: Absolute copy number of the target marker genes. The absolute copy number of phage marker genes, derived using RT-qPCR, confirms the predicted expression patterns of genes expected to play important roles in lysogeny and lytic cycles. The dots represent three biological replicates, each with three technical replicates (n = 9). (A) The red box represents the lysogeny marker, cI; (B) green represents the early lytic marker, cro; (C,D) blue represents the mid lytic markers, DNA replication genes; (E) magenta represents the late lytic marker, tail structural genes; (F–H) gray represents the host markers that were used as internal controls, and (I) white represents the DNA gyrase B, which was used as an induction control. The solid horizontal lines show the median of the distribution.
Table 1: Primers designed in this study. The sequences of specific primers for the marker genes and internal controls used in this study are provided, along with their corresponding NCBI accession IDs.
Table 2: Efficiency of the primers used in this study calculated using the qPCR standard curve.
The creation of a selectable indicator host, previously used in plaque assays to more accurately quantify the spontaneous induction of Stx phage from E. coli MC1061 [37,38,39], has been described here for P. aeruginosa phage LESΦ2. This intervention has the added benefit of reducing the sample processing steps and time, thus enabling the simultaneous assessment of spontaneous induction rates in multiple culture conditions. There is a risk of generating other mutations during the creation of rifampicin-resistant variants [40]; however, in this work, the evolved strain was only used as an indicator host for the enumeration of plaques from cultures of interest and was not included in the transcriptomic analysis. As long as the selectable indicator strain remains equally susceptible to infection by the phage of interest, there is no concern about other acquired mutations. Nevertheless, no differences in the restriction fragment length polymorphism profiles were detected by the pulsed-field gel electrophoresis (PFGE) analysis of PAO1WT and PAO1RIF (data not shown).
When choosing host cells, it is rare to find an indicator strain that does not already harbor prophages. As a case in point, PAO1 harbors the filamentous prophage Pf4. The experimental controls for this study were designed to be able to directly examine the gene expression of specific phages (in this case, LES prophage 2) and the effects this phage has on bacterial gene expression. The comparison is between transcripts from PAO1 carrying LES prophage 2 and PAO1 lacking it; because both the lysogen and the non-lysogen carry the endogenous Pf4, they serve as internal controls that exclude the impact of Pf4 on the host. Additionally, it has been demonstrated that Pf4 usually does not cause lysis in its host cell [41] and is, therefore, unlikely to confound the results of these experiments.
It is well-established that careful quality control is crucial in sample preparation for producing meaningful omics data [42]. However, as previously described [11], the careful characterization of prophage activity in the preparation of lysogen cultures for such studies is rarely performed. Here, we detail our systematic protocols for preparing a well-controlled and optimized set of cultures for transcriptomic studies to better explore the interactions between bacteria and temperate phages. The synchronicity of the population was controlled by bringing the culture through at least four doublings before treating it with the inducing antibiotic norfloxacin. By determining the MIC of norfloxacin for the strain in the study, we could ensure that the concentration of the inducing agent was just above the MIC for the “induction” treatment. The treated cells were then diluted 1:10 to lower the norfloxacin concentration below the MIC after the 1 h treatment in order to allow the cells to recover and complete the phage replication process, ending in the lysis of the cell and the release of infective phage progeny. The cells only enter the lytic replication cycle following the induction stimulus once the concentration of norfloxacin has been brought below the MIC during the recovery period. In this case, going above 1 µg·mL⁻¹ norfloxacin means that the drug could not be effectively diluted below the MIC, as the MIC for norfloxacin for PAO1 is 0.19 µg·mL⁻¹. The level of inducer dilution must be balanced with the need for lysogen recovery and the retention of the culture density for harvesting the RNA. The data discussed here demonstrate that it is possible to synchronize cultures to create samples in which lysogeny dominates, thus reducing the noise from spontaneous induction and enabling the detection of true lysogeny-driven changes in gene expression.
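The constraint on the inducing concentration can be checked with simple dilution arithmetic. The sketch below assumes that "1:10" means a ten-fold dilution (one part culture in ten parts total) and uses the MIC quoted in the text; the function names and the 2 µg·mL⁻¹ test value are illustrative only.

```python
MIC_UG_PER_ML = 0.19   # norfloxacin MIC for PAO1, as quoted in the text
DILUTION_FACTOR = 10   # assumption: "1:10" is read as a ten-fold dilution

def residual_concentration(inducing_conc, dilution_factor=DILUTION_FACTOR):
    """Norfloxacin concentration (ug/mL) left after the recovery dilution."""
    return inducing_conc / dilution_factor

def is_valid_inducing_dose(inducing_conc):
    """True if the dose exceeds the MIC during treatment and
    falls below the MIC once the culture is diluted for recovery."""
    induces = inducing_conc > MIC_UG_PER_ML
    recovers = residual_concentration(inducing_conc) < MIC_UG_PER_ML
    return induces and recovers

# Theoretical ceiling for a ten-fold dilution; the text works with a
# more conservative limit of about 1 ug/mL.
max_inducing_conc = MIC_UG_PER_ML * DILUTION_FACTOR

for dose in (1.0, 2.0):
    print(f"{dose} ug/mL -> residual {residual_concentration(dose):.2f} ug/mL, "
          f"valid: {is_valid_inducing_dose(dose)}")
```

Under these assumptions, a 1 µg·mL⁻¹ treatment leaves 0.1 µg·mL⁻¹ after dilution (below the MIC), whereas a 2 µg·mL⁻¹ treatment leaves 0.2 µg·mL⁻¹ (above it), so recovery would require a larger dilution at the cost of culture density.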
Since the lysogenic state is predominant in the early-exponential phase of growth when the bacterial cell density is low, we suggest scaling up the cultures to harvest enough RNA for subsequent gene expression studies such as RNA-Seq.
The use of norfloxacin as an inducing agent to force cultures into the lytic cycle is well-reported [43,44]; however, this will also affect the expression of other bacterial genes in the process [45,46]. To mitigate this, RNA libraries from control wild-type cultures grown under the same inducing and non-inducing conditions should be included in RNA-Seq experiments. The use of internal controls and key marker genes to validate the stages of phage replication by qRT-PCR is also crucial for accurate comparisons. Quantitative RT-PCR profiling cannot be interpreted by comparing the absolute numbers of transcripts for each gene at various time points; it is the shape of the profile that matters. First, only one small region in the transcript for any gene has been sampled, so whether it is a short-lived or longer-lived element is unknown [27]. Certainly, RNA-Seq mapping of transcripts shows that the density of the mapping data varies significantly over the length of a gene. Second, it is the shape of the gene expression profile that should be interpreted for a marker gene, whether it is associated with the lytic cycle or the lysogenic lifestyle or is even uncoupled from the phage regulatory circuits [11]. Spontaneous induction is a real issue in lysogen culture and will always result in the expression of lytic cycle-associated genes. However, profiling does show that the genes associated with the lytic replication cycle are suppressed in their expression pre-induction (by at least two logs) and up-regulated post-induction.
The previously conducted transcriptomic analyses of Stx phage interactions with E. coli support a thorough understanding of the phage genes involved in maintaining lysogeny and triggering the lytic cycle [11,17]. Currently, the LES phages of P. aeruginosa have been annotated, but their key gene functions are less well understood. Transcriptomic studies will enable the re-annotation of the LES prophages and improve our understanding of the genes involved in the lysogenic and lytic cycles. Linking gene sequence to function represents a major challenge in the study of novel prophages, which further highlights the need for more studies to confirm the phage gene functions for the production of better annotation tools [47]. The wider application and adaptation of the protocols and extra quality control measures detailed in this video article could help in unveiling various prophage functions and, thus, improving annotation pipelines and transforming our understanding of phage and bacterial biology.
Name | Company | Catalog Number | Comments |
PAO1 | 6 | | |
LESB58 | 6 | | |
LES phages | Induced and purified from LESB58 using norfloxacin. | This study | |
Lysogeny Broth (LB) | Merck | 1.10285.500 | |
LB Agar | Merck | 1.10283.500 | |
Agar Agar | Fisher | A/1080/53 | |
Top Agar | 0.4 g Agar Agar + 2.5 g LB Broth in 100 mL water; autoclave and use. | – | |
Rifampicin | Sigma (Stock: 50 mg/mL in methanol; mix well, sterilize with a 0.22 µm filter, and store at −20 °C until use) | R3501 | |
Glacial Acetic Acid | Fisher (1% (v/v) in water) | 10060000 | |
Norfloxacin | Sigma (Stock: 25 mg/mL in 1% glacial acetic acid; mix well, sterilize with a 0.22 µm filter, and store at −20 °C until use; to avoid freeze-thaw cycles, store as small aliquots) | N9890 | |
Phenol saturated with citrate buffer pH 4.3 | Sigma | P-4682 | |
Molecular Biology grade Ethanol | Fisher | 16695992 | |
TRIzol | Invitrogen | 12044977 | |
Chloroform | Fisher | 11398187 | |
Isopropanol | Fisher | 17150576 | |
Nuclease-free H2O | Invitrogen | 10526945 | |
10X TURBO DNase | Ambion | AM1907 | |
Qubit RNA HS, BR Kit | Invitrogen | Q10210 | |
Agilent RNA 6000 Nano Kit | Agilent | 5067-1511 | |
SuperScript III first-strand synthesis kit | Invitrogen | 18080051 | |
PCR Reagents | Bioline MyTaq Red 2X | BIO-25043 | |
qPCR Reagents | SensiFAST SYBR Hi-ROX | BIO-92020 | |
PCR purification kit | Isolate II PCR and Gel Kit | BIO-52060 | |
TA cloning kit | TA Cloning Kit, with pCR 2.1 Vector, without competent cells | K202040 | |
StepOne Real Time PCR system | Thermo Fisher Scientific | 4376600 | |