Described here is a method for the extraction, purification, and quality control of genomic DNA from the obligate biotrophic fungal pathogen, powdery mildew, for use in long-read genome sequencing.
The powdery mildew fungi are a group of economically important fungal plant pathogens. Relatively little is known about the molecular biology and genetics of these pathogens, in part due to a lack of well-developed genetic and genomic resources. These organisms have large, repetitive genomes, which have made genome sequencing and assembly prohibitively difficult. Here, we describe methods for the collection, extraction, purification and quality control assessment of high molecular weight genomic DNA from one powdery mildew species, Golovinomyces cichoracearum. The protocol described includes mechanical disruption of spores followed by an optimized phenol/chloroform genomic DNA extraction. A typical yield was 7 µg DNA per 150 mg conidia. The genomic DNA that is isolated using this procedure is suitable for long-read sequencing (i.e., > 48.5 kbp). Quality control measures to ensure the size, yield, and purity of the genomic DNA are also described in this method. Sequencing of the genomic DNA of the quality described here will allow for the assembly and comparison of multiple powdery mildew genomes, which in turn will lead to a better understanding and improved control of this agricultural pathogen.
Powdery mildews are a group of obligate biotrophic fungal plant pathogens that, when taken together, are the largest cause of plant disease worldwide1. There are over 900 described species of powdery mildew, which have been taxonomically grouped into five tribes within the family Erisyphaceae2. Due both to their economic importance and the intimate relationship they develop with their hosts, powdery mildew diseases have been the subject of research for > 100 years. Upon infection, powdery mildews elicit drastic changes in the cellular structure, metabolism and molecular biology of their hosts, to benefit this pathogen. However, the study of powdery mildews is particularly challenging due to their obligate lifestyle, and growth of the fungus in pure culture has not yet been described3,4,5,6,7,8. Reliable, stable genetic transformation of powdery mildews has also not yet been accomplished, although transient transformation has been reported in some species9,10.
The sequencing and assembly of powdery mildew genomes has proven difficult due to a number of features of the genome itself. Powdery mildew genomes are large (120 – 180 Mbp) relative to other fungal genomes, and consist of 60 – 90% evenly distributed repetitive elements11. These elements include non-long terminal repeats, as well as uncategorized repetitive elements. Two formae speciales of a single powdery mildew species, Blumeria graminis f. sp. hordei and f. sp. tritici (Bgh and Bgt, respectively) as well as the grape powdery mildew Erysiphe necator, have been sequenced, and draft genomes for several others have been completed12,13,14. The repetitive nature of the genomes has made assembly difficult, and the completed Bgh genome was assembled into 6,989 supercontigs with an L50 of 2 Mb12.
Despite the large genome, the powdery mildews appear to have a small number of protein coding genes, with 5,845 and 6,540 genes predicted in Bgh and Bgt, respectively. The sequenced powdery mildews also appear to lack at least 99 core genes that are essential in other fungi, which is consistent with the dependence of the fungi on their host plant for survival11,12,13,14.
Repetitive sequences near telomers, centromeres, ribosomal RNA gene arrays and regions enriched in transposable elements are poorly assembled from short-read sequencing strategies and are often under-represented in genome assemblies15. Such regions are thought to be responsible for many of the gaps that occur in genome sequences, and this applies to the powdery mildews with their extensive expansion of repetitive elements16. Highly plastic genome regions are often found in such repetitive regions3. They serve as a site of chromosome rearrangements and often encode genes under strong selective pressure, such as the genes encoding effector proteins and the genes encoding enzymes of secondary metabolism. Advances in single-molecule long-read sequencing technologies provide a potential solution for sequencing across repetitive regions of genomes15. For example, Faino et al. (2015) found that including long-sequence read technologies and optical mapping allowed them to produce a gap-less genome sequence for two strains of the fungal plant pathogen Verticillium dahlia, tripling the proportion of repetitive DNA sequences in the genome, increasing the number of gene annotations (and reducing the number of partial or missing gene annotations) and revealing genome rearrangements17.
To employ these long-read sequencing technologies, high concentrations of high quality genomic DNA, with minimal sizes > 20 kbp, are needed. Here we describe our methods for conidial collection, purification of high molecular weight DNA from conidia and our quality control assessments using the powdery mildew species Golovinomyces cichoracearum grown on cucumber18. This protocol is based on a protocol developed in the B. Keller laboratory group (University of Zürich, Zürich Switzerland)13,19 and includes several modifications that led to increased DNA yields and a higher proportion of DNA >48.5 kbp in size. The protocol also includes quality control steps based on recommendations from the United States Department of Energy Joint Genome Institute20,21,22.
The function of each reagent included in the lysis buffer, and the rationale for each purification step, are as described in Henry (2008)23. Available published protocols for isolation of high molecular weight genomic DNA from plant and microbial tissues were also consulted during the design of this protocol24,25,26,27,28.
1. Preparation of Fungal Material
2. Genomic DNA Purification
3. Quality Control
A representative example of 60ng purified genomic DNA from G. cichoracearum run on an agarose gel using gel electrophoresis and using pulsed-field gel electrophoresis are shown in Figures 1 and 2, respectively. Genomic DNA preparations that pass, marginally pass and fail quality control are represented. Genomic DNA preparations that fully pass quality control are ideal for sequencing using long-read genome sequencing approaches. Preparations that marginally pass quality control are acceptable, but preparations that fully pass are preferred. Preparations that fail quality control are not acceptable for long-read genome sequencing approaches.
Figure 1: Agarose Gel Electrophoresis of Purified Genomic DNA. Lane 1: A DNA ladder, with bands ranging from 20 kbp to 75 bp; Lane 2: genomic DNA that passed quality control; Lane 3: genomic DNA that was considered marginal; and Lane 4: genomic DNA that failed quality control. Acceptable and marginal genomic DNA samples have a single band running above 20 kbp with minimal smearing at < 20 kbp. Genomic DNA samples that fail quality control have smearing below 20 kbp, and may include low molecular weight fragments at approximately 100 bp (*). Pulsed-field gel electrophoresis is required to distinguish between acceptable and marginal DNA samples. Ethidium bromide was added to the gel prior to electrophoresis to visualize the genomic DNA. Samples were electrophoresed at 60 V for 50 min. The gel was imaged under UV light using a gel documentation system.
Figure 2: Pulsed-Field Gel Electrophoresis (PFGE) of Purified Powdery Mildew Genomic DNA. Lane 1: molecular weight standards, with the highest band at 48.5 kbp and the lowest at 8.1 kbp. Lanes 2 and 5: molecular weight standards showing bands of 20, 10, 7, 5, 4, 3, 2 and 1.5 kbp (at the bottom of the gel image). Lane 3: a marginal DNA sample, with the majority of the DNA between 20 and 48.5 kbp in size. Lane 4: an acceptable DNA sample, with the majority of the DNA > 48.5 kbp (*). Samples were run at 75 V for 15 h using a 5-80 kbp PFGE waveform program. DNA-staining dye was added to the gel after electrophoresis and the gel was imaged under UV light using a gel documentation system.
Reagent | Final Concentration |
Potassium metabisulfite (K2S2O5) | 0.25 M |
Tris buffer pH 7.5 | 0.2 M |
Ethylenediaminetetraacetic acid (EDTA) | 50 mM |
Sodium chloride (NaCl) | 2 M |
Cetyltrimethyl ammonium bromide (CTAB) | 2% v/v |
Table 1: Preparation of Lysis Buffer. Bring to a volume of 5 mL with dH2O.
Target | Primer Name | Primer Sequence | Tm (°C) | Expected Product Size (bp) |
Fungal ITS2 | FW (ITS9) | GAACGCAGCRAAIIGYGA | 56-61.3 | 336 |
RV (ITS4) | TCCTCCGCTTATTGATATGC | 52.1 | ||
Bacterial 16S V4 | FW (515F) | GTGCCAGCMGCCGCGGTAA | 63.8-66.5 | 331 |
RV (805R) | GGACTACHVGGGTWTCTAAT | 46.9-53.8 | ||
Bacterial 16S V4-V5 | FW (515F-Y) | GTGYCAGCMGCCGCGGTAA | 60.8-66.5 | 450 |
RV (926R) | CCGYCAATTYMTTTRAGTTT | 43.9-53.8 | ||
Fungal 18S V4 | FW | CCAGCASCYGCGGTAATTCC | 58.7-61.5 | 431 |
RV | ACTTTCGTTCTTGATYRA | 43.2-48.3 | ||
Cucumis sativus PDS (LOC101204524) | CsPDS F | GTGAGTAAGGTTACCGTTTGGGGCTTATCCCAAT | 63.8 | 750 |
CsPDS R | GCGTGAGCTCGGTACTCTCATCCACTCTTGCAC | 66.6 |
Table 2: Primers for Contamination Assessment.
In order to obtain pure, high molecular weight genomic DNA from the obligate biotrophic powdery mildew fungi, a modified version of previously described methods was developed30. The average yield using this optimized protocol is 7 µg DNA per 150 mg conidia, a doubling of the yield obtained with a prior protocol. Also, the average size increased from approximately 20 kbp to over 48.5 kbp. This protocol was optimized in the cucumber-G. cichoracearum system. In order to obtain the best quality and highest purity DNA in other powdery mildew or obligate biotrophic fungi, additional modification of this protocol may be required. In the future, this protocol may be used to obtain high molecular weight DNA suitable for long-read DNA sequencing from other obligate biotrophic fungi, though further modification may be required.
Balancing high DNA yield and high molecular-weight of extracted DNA was a concern during the optimization of this protocol. Steps including vortexing were eliminated in order to minimize shearing of the DNA, and were replaced by gently mixing by inversion. This change increased the quality of the extracted DNA without significantly reducing the yield. Additionally, removal of the steel balls at the beginning the cell lysis step significantly increased DNA yield when compared to leaving the balls in the solution until the end of the lysis step.
No G. cichoracearum conidia remained intact after the ball milling procedure in step 1.3. To obtain good yields of high molecular weight genomic DNA, it is crucial that the fungal cell wall is disrupted during this step. Conidial integrity can be assessed microscopically, and additional ball milling or alternative methods of disruption may be required for other fungal species.
Some fungal conidia, notably the Blumeria sp., contain pigments that can contaminate DNA extraction and purification attempts. In these cases, we recommend including the DNA filtration step as outlined in the Bourras et al. (2015) protocol19 and an additional wash step with TE after the filtration step. The addition of the filtration step resulted in a small but significant increase in DNA degradation, and should only be included if excessive pigmentation remains after the second DNA precipitation step. These pigments, if allowed to remain in solution, may interfere with later analysis of quality and quantity of DNA and could potentially interfere with sequencing reactions.
This protocol includes both a traditional agarose gel electrophoresis step and a pulsed-field gel electrophoresis step. The first is an initial assessment of the fraction of DNA that is > 20 kbp and the fraction that runs as a smear at < 20 kbp. For the purpose of the downstream sequencing reactions required for powdery mildew genomes, the fraction of DNA < 20 kbp should be minimal (see Figure 1, lane 2 versus lane 4). The pulsed-field gel electrophoresis assessment provides a better estimate of the size as DNA > 20 kbp can be separated by size. DNA of > 48.5 kbp is desirable for use in subsequent sequencing reactions (see Figure 2, lane 4).
The fungal material used in these experiments was propagated in growth chambers and contamination was limited using strict isolation protocols and personnel controls. Because of this, contamination with other, non-powdery mildew pathogens was minimal, and sequencing of the ITS region amplified using fungi-specific primers recovered only G. cichoracearum sequences. This may not be the case for powdery mildews collected from field sites or greenhouses lacking the same isolation controls, and contamination assessment in these cases will be critical for the assembly of high-quality genomes from the extracted DNA.
No one modification of the various fungal DNA isolation protocols was uniquely important. Rather slight modifications at many steps resulted in the relatively high yields of high molecular weight powdery mildew DNA reported here.
The authors have nothing to disclose.
This protocol was developed in support of a Joint Genome Institute-Community Sequencing Project (#1657). The work was supported in part by the Philomathia Foundation, the National Science Foundation (#0929226) and the Energy Biosciences Institute to S. Somerville; and by Swiss National Science Foundation (#310030_163260) to B. Keller. We would like to thank our colleagues, M. Figureroa and M. Miller (University of Minnesota), R. Panstruga (RWTH Aachen University), C. Pedersen (University of Copenhagen), P. Spanu (Imperial College), J. Taneja (University of California Berkeley), M. Wildermuth (University of California Berkeley), R. Wise (Iowa State University) and S. Xiao (University of Maryland) for their generous advice and for volunteering their protocols as we developed the protocol described here.
MM400 Ball Mill | Retsch | 20.745.0001 | Or equivalent ball mill |
2mL tubes for ball milling | Sarstedt | 72.694.007 | |
5/32" stainless steel milling balls | OPS Diagnostics | GBSS 165-5000-01 | |
Microprocessor controlled 280 Series Water Bath Preset to 65°C | Thermo Fisher Scientific | 2825 | Or equivalent water bath |
Microprocessor controlled 280 Series Water Bath Preset to 37°C | Thermo Fisher Scientific | 2825 | Or equivalent water bath |
Eppendorf 5417R refrigerated microcentrifuge | Krakeler Scientific | 38-022621807 | Or equivalent microcentrifuge |
Chlorform:IAA (24:1 v/v) | Sigma-Aldrich | C0549 | Hazardous in case of skin contact or inhalation. Wear nitrile gloves, protective eyewear, lab coat and work in chemical hood |
Phenol:Chloroform:IAA (25:24:1 v/v) | Thermo Fisher Scientific | 15593031 | Hazardous in case of skin contact or inhalation. Wear nitrile gloves, protective eyewear, lab coat and work in chemical hood |
100% Isopropanol | Sigma-Aldrich | W292907 | |
100% Ethanol | Sigma-Aldrich | 34923 | Chill to -20°C prior use |
Sodium Acetate | Sigma-Aldrich | S2889 | |
Rnase, Dnase-free (10mg/ml) | Thermo Fisher Scientific | EN0531 | |
Potassium metabisulfite | Sigma-Aldrich | P2522 | |
Sodium Lauryl Sacroinate | Sigma-Aldrich | L9150 | |
Tris base | Sigma-Aldrich | 10708976001 | |
Ethylenediaminetetraacetic acid (EDTA) | Sigma-Aldrich | EDS | |
Sodium chloride (NaCl) | Sigma-Aldrich | S9888 | |
Cetyltrimethyl ammonium bromide (CTAB) | Sigma-Aldrich | H6269 | |
Uvex by Honeywell Futura Goggles S345C, Uvextreme | Staples | 423263 | Or equivalent eyewear |
High Five COBALT nitrile gloves, LARGE | Neta Scientific | HFG-N193 | Or equivalent gloves |
GelRed Nucleic Acid Gel Stain 10,000X in water | Biotium | 41003 | |
Ethidium Bromide 10mg/ml | Biotium | 40042 | Hazardous in case of skin contact. Wear nitrile gloves and protective eyewear |
Agarose Ultrapure Bioreagent | VWR International LLC | JT4063 | |
GeneRuler 1kb Plus DNA Ladder | Thermo Fisher Scientific | FERSM1332 | |
8-48kb CHEF DNA size standards | Bio-Rad | 170-3707 | |
Pippin Prep | Life Technologies Corporation | 4472172 | Or equivalent pulse-field gel aparatus |
Whatman qualitative filter paper, Grade 1 | Sigma-Aldrich | WHA1001042 | Or equivalent growth chamber |
Percival Growth Chamber | Percival | AR66LXC9 | |
NanoDrop 8000 UV-Vis Spectrophotometer | Thermo Fisher Scientific | ND-8000-GL | Or equivalent spectrophotometer |
Quant-iT PicoGreen dsDNA Assay Kit | Thermo Fisher Scientific | P11496 | |
TAE Buffer (Tris-acetate-EDTA) (50X) | Thermo Fisher Scientific | B49 | Dilute 1/50 with water before use (to 1X) |
TE Buffer | Thermo Fisher Scientific | 12090015 | |
Tris-Borate-EDTA, 10X Solution (Electrophoresis) | Thermo Fisher Scientific | BP13334 | Dilute 1/10 with water before use (to 1X) |
Liquid Nitrogen | Liquid nitrogen (-196 °C) is a freezing hazard. It will also expand rapidly upon warming and should not keep in a tightly closed container |