This protocol aims to quantify SARS-CoV-2 RNA in wastewater and air samples to be used for wastewater-based epidemiology studies and to assess the exposure risk to SARS-CoV-2 in indoor and outdoor aerosols. This protocol also describes a tiled amplicon long-template sequencing approach for SARS-CoV-2 whole genome characterization.
Wastewater-based epidemiology has emerged as a promising and efficacious surveillance system for SARS-CoV-2 and other infectious diseases in many nations. The process typically involves wastewater concentration, nucleic acid extraction, amplification of selected genomic segments, and detection and quantification of the amplified genomic segment. This methodology can similarly be leveraged to detect and quantify infectious agents, such as SARS-CoV-2, in air samples. Initially, SARS-CoV-2 was presumed to spread primarily through close personal contact with droplets generated by an infected individual while speaking, sneezing, coughing, singing, or breathing. However, a growing number of studies have reported the presence of SARS-CoV-2 RNA in the air of healthcare facilities, establishing airborne transmission as a viable route for the virus. This study presents a composite of established protocols to facilitate environmental detection, quantification, and sequencing of viruses from both wastewater and air samples.
In December 2019, a novel disease called COVID-19 emerged, caused by a previously unknown coronavirus, SARS-CoV-21. The resulting global pandemic has presented a significant challenge to clinical and public health laboratories worldwide, as a large number of individuals require testing to accurately assess virus transmission and prevalence in the community. However, in many regions, achieving the necessary level of testing in a timely and spatially comprehensive manner is economically unfeasible2,3. Current surveillance systems based on individual clinical diagnostics rely heavily on symptom severity and individual reporting, as well as the extent to which these symptoms overlap with existing diseases circulating in the population4,5,6,7,8,9,10. Consequently, a high number of asymptomatic cases contributes to a significant underestimation of disease burden7,11.
Due to these challenges, wastewater-based epidemiology (WBE) for COVID-19 surveillance was proposed as a complementary surveillance strategy. WBE was first described in 200112, and was initially used to trace cocaine and other illegal drugs13. This approach relies on the assumption that it is possible to calculate the initial concentration of any substance that is stable in wastewater and excreted by humans8,12. WBE has been successfully implemented in many countries as a complementary and efficient surveillance system for SARS-CoV-23,8,14,15,16. The majority of methods to detect human viruses in aquatic environments follow these steps: concentration, nucleic acid extraction, amplification of the genomic segment (or segments) chosen, and detection/quantification of the amplified genomic segment3.
Another important environment for the detection and quantification of SARS-CoV-2 is in air samples. Initially, SARS-CoV-2 was thought to be transmitted mainly through close personal contact with respiratory droplets from aerosols generated by an infected person while speaking, sneezing, coughing, singing, or breathing17. However, several studies began to report the presence of SARS-CoV-2 RNA in the air, especially in healthcare facilities and other enclosed spaces18,19,20,21. Evidence of SARS-CoV-2 viability in air samples taken indoors in hospitals and other enclosed spaces has been found when the virus concentration was sufficiently high22,23,24. Outdoor studies have generally found no evidence of SARS-CoV-2, except in crowded outdoor spaces21,25,26,27,28,29. As of now, airborne transmission of SARS-CoV-2 has been recognized as a mode of transmission30,31. A recent review study shows the differences between outdoors, where risks of airborne transmission are minimal outside of crowded areas, and indoors, where larger risks could be present in poorly ventilated environments in which strong sources (i.e., number of infected people) could be present. A recent comprehensive review study has highlighted the substantial differences between the risks of airborne transmission in outdoor versus indoor environments, particularly in crowded areas with poor ventilation. The study indicates that the risk of airborne transmission is minimal in outdoor environments, where there is a larger volume of air available for the dilution and dispersion of virus particles32. These findings have important implications for public health policies and guidelines related to COVID-19. By recognizing the significant differences in transmission risks between indoor and outdoor environments, policymakers can develop more effective strategies to mitigate the spread of the virus and protect public health.
There are a variety of methods and protocols for the detection, quantification, and sequencing of SARS-CoV-2 from different environmental samples. This method article aims to present a combination of well-established protocols that allow laboratories with different capacity levels to perform environmental detection, quantification, and sequencing of viruses from wastewater and air samples.
All methods described here have been published elsewhere and contain small modifications from the original methods.
1. Wastewater collection and sample pre-processing
NOTE: Due to the low concentrations of SARS-CoV-2 RNA in environmental samples, the implementation of a concentration step is crucial for a successful detection33,34,35. Described here is the first reported method for the detection of SARS-CoV-2 in wastewater36.
2. Quantification of SARS-CoV-2 RNA by real-time-quantitative polymerase chain reaction (RT-qPCR)
NOTE: The below protocol is according to the CDC 2019-Novel Coronavirus (2019-nCoV) RT-PCR diagnostic panel37. Divide the primer/probe mix into several aliquots to avoid freezing and thawing cycles.
3. Sequencing variants in wastewater and data analysis
NOTE: The described protocol is a modified protocol created by Quick et al.39,40. It uses two sets of primers for SARS-CoV-2 genome amplification by PCR tiling methodology-ARTIC primers and VarSkip primers. A combination of primers is used to guarantee the best genome coverage and to minimize the possibility of novel mutations causing primers of one type to fail. In general, the protocol is divided into three parts: reverse transcription (RT) and amplicon generation, sequencing library preparation, and sequencing and data analysis.
The results summarized in Table 3 show examples of the detection and quantification of SARS-CoV-2 RNA in wastewater and air samples using the method described in this article. Wastewater samples were collected from wastewater treatment plants in Spain and Slovenia and were considered positive if the Ct was less than 40 in at least two of the three replicates, with quantification considered valid if the Ct had a variation of less than 5%. In Spain and Portugal, indoor and outdoor air samples were collected, and the same rules were applied. Duplicates were used for the air samples, not triplicates as for the wastewater samples, as the aim of the study was to detect SARS-CoV-2 RNA and not to quantify it. Furthermore, an RT-qPCR run was only considered valid if all controls used (as described in this protocol) behaved as expected and if no significant deviations were observed in the mengovirus RNA control. For these samples, the mengovirus recovery efficiency varied between 13.7% and 19.7%. The inhibition was assessed by diluting tenfold and 100-fold extracted RNA and by comparing the resulting RT-qPCR results, as well as using a commercial kit. The instructions for each inhibition control kit should be followed as described by the manufacturers.
Wastewater samples had a standard deviation from 1.91% to 13.98% among the triplicates and ranged from 3.05 x 103 to 2.83 x 108 gene copies/L. Air samples ranged from 6.17 x 103 to 5.48 x 109 with a standard deviation between the duplicates between 0.54% and 10.95%. This is in accordance with previous studies6,48,49,50.
As described in this protocol, the samples were sequenced and an example of three sequenced sample results is summarized in Table 4 and Figure 1. In all three samples, the lineage BA.5.x was assigned as the most prevalent by the Freyja tool (95.3%, 95.4%, and 99.8%).
Figure 1: SARS-CoV-2 lineage prevalence in selected wastewater samples. Lineages were assigned using the Freyja tool. The summary of prevalence numbers does not necessarily reach 100%, as some of the data is not of sufficient quality. Please click here to view a larger version of this figure.
Table 1: Reagents and volumes to be added of each to prepare the mastermix for SARS-CoV2 quantification by RT-qPCR. Please click here to download this Table.
Table 2: Reagents and volumes to be added of each to prepare the mastermix to amplify the complete SARS-CoV-2 genomes in the samples. Please click here to download this Table.
Table 3: Summary of RT-qPCR quantification results of SARS-CoV-2 RNA from wastewater and air samples. The N1 gene was used for quantification and the N2 was used for confirmation (positive or negative). The results are expressed as copies/L. Please click here to download this Table.
Table 4: Detected mutations and lineage prevalence. Lineages were assigned using the Freyja tool and according to the mutations of interest found using Nextclade. Genome coverage for each sample is shown. The summary of prevalence numbers does not necessarily reach 100%, as some of the data is not of sufficient quality. Please click here to download this Table.
Microbial and viral detection and quantification using (RT-)qPCR methods have garnered widespread acceptance due to their remarkable sensitivity. However, these techniques face numerous challenges when analyzing environmental samples. Wastewater samples contain an abundance of inhibitory substances that can skew measurements and generate misleading results. To tackle these limitations and enhance precision, a complex protocol was conceived, designed, and implemented. This protocol was tailored by combining protocols from the scientific literature and to specifically address the restrictions inherent in qPCR methods by incorporating a series of stringent controls at each stage of the process. By integrating these rigorous quality control measures, this protocol offers a robust solution to the challenges faced by qPCR-based detection and quantification methods in complex environmental samples51,52. The negative control of isolation (NCI) and non-template control (NTC) are essential tools to assess the likelihood of contamination during the RNA extraction and RT-qPCR reaction. Moreover, the integration of mengovirus control RNA provides a crucial means to evaluate inter-sample variability, as well as the efficiency of the concentration step and the inhibitory effects of agents present in complex environmental samples. In every RT-qPCR reaction, positive controls must be included to confirm the effectiveness of the amplification process. It is critical to carefully monitor each step of the protocol, including the storage and processing time of the samples. The degradation of the samples can be influenced by temperature and the complex composition of wastewater, making it essential to store the samples at a temperature of 4 °C and process them within 24 h. By adhering to these stringent quality control measures, researchers and public health workers can confidently and accurately analyze environmental samples to uncover vital insights into microbial and viral populations and better understand the impact of environmental factors on human health48.
The selection of RNA targets for SARS-CoV-2 detection is a critical factor that can impact the sensitivity of RT-qPCR assays. In this study, the authors evaluated the RdRp, E, N1, and N2 targets and found that the RdRp and E assays had lower sensitivity than the N1 and N2 assays, consistent with previous research53. The N1 and N2 assays use double quencher probes, which have been shown to enhance their sensitivity in (RT-)qPCR assays54. Those reasons lead to the decision of using N1 for quantification and N2 as a confirmation of the RNA presence, in order to eliminate doubts when very low concentrations are found. The observed discrepancies among RT-qPCR targets was previously reported36.
Several studies have employed similar protocols and have reported the successful detection and quantification of SARS-CoV-2 RNA in wastewater during the ongoing COVID-19 pandemic6,48,49,55,56. The reported SARS-CoV-2 RNA concentrations are in accordance with the ones reported in this study and protocol48,50,55. Some of these studies have explored the relationship between active cases and concentrations of RT-qPCR target genes, particularly N1 and N26,36,49,50,55,57,58,59. To reduce variability in the obtained gene concentrations, it has been proposed to multiply the experimental concentrations by the flow rate at the time of sampling to obtain the initial viral load36. This approach has been incorporated into the VATar COVID-19 project in Spain, a national monitoring system for COVID-19 in wastewater60.
As the next phase of COVID-19 wastewater-based epidemiology (WBE) studies, researchers have attempted to detect SARS-CoV-2 mutations in wastewater to estimate the circulation of variants of concern within communities. Previous studies have shown a strong correlation between the variants identified in wastewater and those present in the population at the same time. These findings suggest that wastewater-based monitoring could serve as a valuable tool in tracking the spread of SARS-CoV-2 variants61,62. WBE has shown great promise in monitoring SARS-CoV-2 and can be used for the developing future surveillance strategies for pandemics. This approach is particularly useful in the early stages of a pandemic, when there are limited individual tests and many cases may be asymptomatic, and has a complementary strategy to clinical surveillance. Therefore, WBE can be especially useful in places with low economic capacity that cannot afford other more expensive surveillance strategies. The described method can be easily adapted to other viruses that might be relevant to monitor in the future.
The results of air sampling in indoor and outdoor environments revealed that SARS-CoV-2 RNA is present in both environments, although in lower concentrations in outdoor environments38,63,64. These findings suggest that activities and exposures in which mask-wearing and social distancing are difficult to maintain, such as eating, drinking, or attending music festivals, could pose a higher risk for COVID-19 transmission. To mitigate such risks, it is critical to consider appropriate ventilation measures and the use of masks, especially with the emergence of new variants of concern that are more transmissible, such as the Delta (B.1.617.2) and Omicron (B.1.1.529) variants. In light of the lifting of COVID-19 restrictions in many areas, maintaining the use of masks is recommended to keep community transmission of SARS-CoV-2 at a minimum and prevent future outbreaks of the disease5,21,26,65.
The authors have nothing to disclose.
This work was performed with financial support from the Regional Government of Castilla y Leon and the FEDER program (projects CLU 2017-09, UIC315 and VA266P20).
Adapter+A25+A2:D19+A2:D20+A2+A2:D19 | Oxford Nanopore | EXP-AMII001 | Sequencing |
AllPrep PowerViral DNA/RNA Kit | Qiagen | 28000-50 | RNA extraction kit |
AMPure XP | Beckman Coulter | A63880 | PCR Purification, NGS Clean-up, PCR clean-up |
ARTIC SARS-CoV-2 Amplicon Panel | IDT | 10011442 | SARS-CoV-2 genome amplification |
Blunt/TA Ligase Master Mix | NEB | M0367S | Library preparation |
CENTRICON PLUS70 10KDA. | Fisher Scientific | 10296062 | Concentration filters |
CORIOLIS COMPACT AIR SAMPLER | Bertin Technologies | 083-DU001 | Air sampler |
Duran laboratory bottles | Merck | Z305200-10EA | Sampling Bottles |
Flow Cell (R9.4.1) | Oxford Nanopore | FLO-MIN106D | Sequencing |
General labarotory consumables (tubes, qPCR plates, etc) | |||
Ligation Sequencing Kit | Oxford Nanopore | SQK-LSK109 | Sequencing |
LunaScript RT SuperMix Kit | NEB | E3010 | cDNA synthesis |
Mengovirus extraction control Kit | Biomérieux | KMG | Concentration control |
Nalgene General Long-Term Storage Cryogenic Tubes | Thermofisher | 5011-0012 | Sample storage |
Native Barcoding Expansion 1-12 (PCR-free | Oxford Nanopore | EXP-NBD104 | Barcoding |
NEBNext Ultra II End Repair/dA-Tailing Module | NEB | E7595 | DNA repair |
NEBNext VarSkip Short SARS-CoV-2 Primer Mixes | NEB | E7658 | SARS-CoV-2 genome amplification |
NEBNext Quick Ligation Reaction Buffer | NEB | B6058S | Sequencing |
Phosphate buffered saline | Merck | P4474 | Collection buffer |
Phosphate-buffered saline (PBS, 1X), sterile-filtered | Thermofisher | J61196.AP | Elution of air samples |
Q5 Hot Start High-Fidelity 2X Master Mix | NEB | M0494S | hot start DNA polymerase |
Qubit RNA HS Assay Kit | Thermofisher | Q32852 | RNA quantitation |
SARS-CoV-2 RUO qPCR Primer & Probe Kit | IDT | 10006713 | Primer-Probe mix and qPCR positive control |
TaqPath 1-Step RT-qPCR Master Mix | Thermofisher | A15299 | RT-qPCR kit |