This protocol provides experimental steps and information about reagents, equipment, and analysis tools for researchers who are interested in carrying out whole genome array-based comparative genomic hybridization (CGH) analysis of copy number variations in plants.
Mutants are invaluable genetic resources for gene function studies. To generate mutant collections, three types of mutagens can be utilized, including biological such as T-DNA or transposon, chemical such as ethyl methanesulfonate (EMS), or physical such as ionization radiation. The type of mutation observed varies depending on the mutagen used. For ionization radiation induced mutants, mutations include deletion, duplication, or rearrangement. While T-DNA or transposon-based mutagenesis is limited to species that are susceptible to transformation, chemical or physical mutagenesis can be applied to a broad range of species. However, the characterization of mutations derived from chemical or physical mutagenesis traditionally relies on a map-based cloning approach, which is labor intensive and time consuming. Here, we show that a high-density genome array-based comparative genomic hybridization (aCGH) platform can be applied to efficiently detect and characterize copy number variations (CNVs) in mutants derived from fast neutron bombardment (FNB) mutagenesis in Medicago truncatula, a legume species. Whole genome sequence analysis shows that there are more than 50,000 genes or gene models in M. truncatula. At present, FNB-induced mutants in M. truncatula are derived from more than 150,000 M1 lines, representing invaluable genetic resources for functional studies of genes in the genome. The aCGH platform described here is an efficient tool for characterizing FNB-induced mutants in M. truncatula.
Legumes (Fabaceae) are the third largest family of flowering plants, with many economically important species such as soybean (Glycine max) and alfalfa (Medicago sativa). Legume plants can interact with nitrogen-fixing soil bacteria, generally called Rhizobia to develop root nodules in which the atmospheric dinitrogen is reduced to ammonia for use by the host plant. As such, cultivation of legume crops requires little input of nitrogen fertilizers and thus contributes to sustainable agriculture. Legume crops produce leaves and seeds with high protein content, serving as excellent forage and grain crops. However, cultivated legume species generally have complex genome structures, making functional studies of genes that play key roles in legume-specific processes cumbersome. Medicago truncatula has been widely adopted as a model species for legume studies primarily because (1) it has a diploid genome with a relatively small haploid genome size (~550 Mbp); (2) plants can be stably transformed for gene functional studies; and (3) it is closely related to alfalfa (M. sativa), the queen of forages, and many other economically important crops for translational studies. Recently, the genome sequence of M. truncatula cv Jemalong A17 has been released1,2. Annotation of the genome shows that there are more than 50,000 predicted genes or gene models in the genome. To determine the function of most of the genes in the M. truncatula genome is a challenging task. To facilitate functional studies of genes, a comprehensive collection of mutants in the range of over 150,000 M1 lines has been generated using fast neutron bombardment (FNB) mutagenesis in M. truncatula cv Jemalong A173,4. Fast neutron, a high energy ionization mutagen, has been used in generating mutants in many plant species including Arabidopsis5,6, rice (Oryza sativa)7, tomato (Solanum lycopersicum), soybean (Glycine soja; G. max)8,9, barley (Hordeum vulgare), and Lotus japonicus10. A large portion of mutations derived from FNB mutagenesis are due to DNA deletions that range in size from a few base pairs to mega base pairs9,11. Many phenotype-associated genes have been successfully identified and characterized4,12,13,14,15,16,17,18,19. Previously, molecular cloning of the underlying genes from FNB mutants relied on a map-based approach, which is time consuming and limits the number of mutants to be characterized at the molecular level. Recently, several complimentary approaches including transcript-based methods, genome tiling array-based comparative genomic hybridization (CGH) for DNA copy number variation detection, and whole genome sequencing, have been employed to facilitate the characterization of deletion mutants in diverse organisms including animals and plants20,21,22,23,24,25,26,27,28,29,30,31.
To facilitate the characterization of FNB mutants in M. truncatula, a whole-genome array-based comparative genomic hybridization (CGH) platform has been developed and validated. As reported in animal systems, the array-based CGH platform allows detection of copy number variations (CNVs) at the whole genome level in M. truncatula FNB mutants. Furthermore, lesions can be confirmed by PCR and deletion borders can be identified by sequencing. Overall, the array CGH platform is an efficient and effective tool in identifying lesions in M. truncatula FNB mutants. Here, the array CGH procedure and PCR characterization of deletion borders in an M. truncatula FNB mutant are illustrated.
The following protocol provides experimental steps and information about reagents, equipment and analysis tools for researchers who are interested in carrying out whole genome array-based comparative genomic hybridization (CGH) analysis of copy number variations in plants. As an example, Medicago truncatula FN6191 mutant was used to identify deletion regions and candidate genes associated with mutant phenotypes. M. truncatula FN6191 mutant, originally isolated from a fast neutron bombardment-induced deletion mutant collection32 (see Table of Materials), exhibited a hyper-nodulation phenotype after inoculation with the soil bacterium, Sihorhizobium meliloti Sm1021, in contrast to wild type plants.
NOTE: Figure 1 shows the five steps for the array CGH protocol. They are: 1) Preparation of plant materials; 2) Isolation of high quality DNA samples; 3) Labeling and purification of DNA samples; 4) Hybridization, washing, and scanning of whole genome arrays; and 5) CGH data analysis. M. truncatula whole genome tiling arrays contain a total of 971,041 unique oligo probes targeting more than 50,000 genes or gene models in the genome (See Table of Materials). The unique probes are spaced approximately every 150 base pairs (bp) in exonic regions and 261 bp in intronic regions of the M. truncatula genome.
1. Preparation of Plant Materials
2. Isolation of High Quality DNA Samples
3. DNA Labeling and Purification
4. Hybridization, Washing, and Scanning of the Genome Arrays
5. CGH Data Analysis
Figure 2 shows the distribution of normalized log2 ratios of mutant versus WT signals across the whole genome. Analysis of CGH data revealed an approximate 22 kb deletion on chromosome 4 that encompasses the entire SUNN gene33 and several other annotated genes in FN6191 mutant (Figure 2, Figure 3). The candidate deleted region was covered by 73 consecutive probes on the array with the mean normalized log2 ratios less than -2.5 (Supplemental Table 1). An inspection of deletion borders suggested that the putative deletion in this mutant is flanked by probes located between coordinates 22,313,168 and 22,334,934 on chromosome 4 (Figure 3). To confirm the deletion borders, PCR primers were designed to be approximately 1,500 bp apart from the predicted deletion borders (FN6191-DB-F: GCTAGCAAGGGTCTGCGCAAGTT; FN6191-DB-R: GTATCGAGAAGGTCTTATAGCAGC) and PCR amplification was performed with these primers and the following parameters: 94 °C, 5 min; 94 °C, 45 s, 55 °C, 30 s, 72 °C, 1.5 min; for 35 cycles; 72 °C, 10 min and then 10 °C indefinitely. As expected, a single PCR product of approximately 1.5 kb was successfully amplified from the FN6191 mutant, but not WT (Figure 4A). Based on the M. truncatula genome release v3.5, the size of the deletion in FN 6191 was estimated to be 26.67 kb (Figure 4B). The CGH results showed that no other abnormal segments are present in the FN6191 mutant genome (Figure 2).
Figure 1: Workflow of array comparative genomic hybridization (aCGH) analysis of M. truncatula mutant. (A) Preparation of plant materials: M. truncatula wild type (Jemalong A17) and FN6191 mutant were grown for three to four weeks in greenhouse. (B) Isolation of high quality genomic DNA: One gram of young leaves from single plants was used to isolate genomic DNA. DNA quality was determined using a spectrophotometer and gel electrophoresis. (C) DNA labeling and purification. Genomic DNAs of reference and experimental samples were labeled with Cyanine 3 (Cy3) and Cy5 using a DNA labeling kit and purified using a purification column. (D) Hybridization, washing, and scanning of 1 x 1 M M. truncatula CGH array. (E) CGH analysis: Log2 ratios of experimental/reference signals that are less than or equal to -2.5 or greater than or equal to 2.5 are considered as putative deletions and duplications, respectively. Please click here to view a larger version of this figure.
Figure 2: Array based comparative genomic hybridization analysis of copy number variations in M. truncatula hyper nodulation mutant FN6191. Shown are log2 mutant/wild type ratios of probes on all eight chromosomes. A consecutive 73-probe region on chromosome 4 was identified as the deleted region in the mutant (arrow). Please click here to view a larger version of this figure.
Figure 3. Identification of the deleted region in M. truncatula FN6191 mutant. A close-up view of the region on chromosome 4, in which 73 microarray probes exhibited significantly reduced log2 mutant/wild type ratios, and six mapped genes including SUNN (Medtr4g070970). Arrows indicate the location and direction of primers for PCR amplification of deletion borders. Please click here to view a larger version of this figure.
Figure 4: Confirmation of deletion borders in M. truncatula FN6191 mutant. (A) PCR amplification of deletion borders: A 1.5 kb product was amplified from the FN6191 mutant using primers flanking the deletion borders (lane 1; Mu). The same set of primers did not amplify any products from WT (M. truncatula Jemalong A17) due to a size limitation of PCR (lane 2; Wt). Lane M, 1 kb ladder. (B) Sequences of the deletion junction in the FN6191 mutant. An arrow denotes the deletion junction. (C) Sequencing results showed that the deletion borders (arrows) are located at the coordinates 22309163 and 22335836 on chromosome 4. Please click here to view a larger version of this figure.
Labeling mix 1 | Per reaction x sample # (μL) | Labeling mix 2 | Per reaction x reference # (μL) |
ddH2O | 6.0 x | ddH2O | 6.0 x |
5x Buffer | 10.0 x | 5x Buffer | 10.0 x |
10x dNTPs | 5.0 x | 10x dNTPs | 5.0 x |
Cyanine 5 | 3.0 x | Cyanine 3 | 3.0 x |
Exo-Klenow | 1.0 x | Exo-Klenow | 1.0 x |
Total | 25 x | Total | 25 x |
Table 1: Labeling Mixes.
Supplemental Table 1: Probe sequences deleted in M. truncatula FN6191 mutant.
Please click here to download this file.
We have developed an array-based CGH platform for the detection and characterization of fast neutron bombardment (FNB)-induced mutants in M. truncatula cv. Jemalong A17. To demonstrate the use of the array CGH method in detecting gene mutations, we performed aCGH analysis of the mutant FN6191, which exhibited a hyper-nodulation phenotype in contrast to wild type plants, when inoculated with S. meliloti Sm1021. For segmentation analysis, a segment was deemed significant if the log2 ratio mean of the probes within the segment was above the upper threshold or below the lower threshold for that given array comparison. The upper threshold for each comparison was determined to be the log2 ratio value of the 95th percentile of all data points. The lower threshold for each comparison was determined to be the log2 ratio value of the 5th percentile of all data points24.
Our analysis showed that significant copy number changes can be determined by retrieving segments with an average normalized log2 ratio greater than the average by 2.5 SD (duplication) or less than the average by 2.5 SD (deletion). Based on CGH analysis, we detected a deletion region with an estimated size of 22 kb on chromosome 4, encompassing the SUNN gene33. Sequencing results confirmed that the deletion size was 26.67 kb based on M. truncatula genome release v3.5. It has been shown that SUNN encodes a CLV1-like leucine-rich repeat receptor kinase that controls the nodule number in M. truncatula33. Our CGH results suggest that FN6191 is a new SUNN allele. Based on these results, we reasoned that the CGH method coupled with phenotypic and genetic analysis can be successfully used to rapidly identify candidate genes in FNB deletion mutants. Over the past several years, we have used this platform in identifying and characterizing numerous FNB mutants for gene functional studies in M. truncatula34,35. We have generated a database of copy number variations (CNVs) associated with FNB mutant lines36. In this database, deleted sequences from CGH analyses of over 100 confirmed symbiotic nodulation mutants have been mapped to the M. truncatula genome. A blast server has been set up to facilitate the search for deleted sequences in the mutant collection. The development and further expansion of the deletion database would be highly valuable for functional genomics research using M. truncatula FNB mutant resources.
The aCGH protocol has five major steps. Although all steps are important and should be carried out carefully as described earlier, the followings are especially critical: (1) DNA samples should not be degraded and should not be contaminated by RNA during the preparation step; (2) Test and reference DNA samples should be equally well prepared so that the labeled samples are of the same high quality; (3) Labeled DNA samples should not be exposed to a high intensity of light and a high level of ozone; and (4) it is important to keep Washing Buffer II at 37 °C during the washing step.
The coverage of the current M. truncatula genome array is primarily on the exonic regions and less on the intronic regions, and not at all in intergenic regions. The coverage in the latter regions can be improved if lesions in these regions are important for mutant phenotypes. On the other hand, the current probe design has less power in identifying small mutations and single nucleotide polymorphisms (SNPs). This limitation can be overcome by using CGH+SNP microarray design37. Overall, the genome array-based comparative genomic hybridization (aCGH) platform is a powerful tool for the analysis of copy number and structural variations in M. truncatula FNB mutants.
The authors have nothing to disclose.
Funding of this work is provided in part by a grant from NSF Plant Genome Research (IOS-1127155).
Medicago truncatula genome array, 1 x 1 M | Agilent | G4123A | |
Medicago truncatula FN6191 (mutant) | In house | FN6191 | |
Medicago truncatula Jemalong A17 (reference) | In house | A17 | |
Sulfuric acid | Sigma-Aldrich | 320501 | |
DNeasy Plant Mini Kit | Qiagen | 69104 | |
Nanodrop Spectrophotometer | Thermo Scientific | 1000D | |
SureTag DNA Labeling Kit | Agilent | 5190-3400 | |
Random primer | Agilent | 5190-3399 | |
Acetonitrile | Sigma-Aldrich | 271004-1L | |
Thermocycler | MJ research | PTC-200 | |
Centrifuge | Labnet international Inc | Spectrafuge 24D | |
Stabilization and Drying Solution | Agilent | 5185-5979 | |
Oligo aCGH/ChIP-on-chip Hybridization Kit | Agilent | 5188-5380 | |
Hybridization Chamber gasket slides | Agilent | G2505 | |
Human Cot-1 DNA | Agilent | 5190-3393 | |
Oligo aCGH/ChIP-on-chip Wash Buffer 1 and 2 | Agilent | 5188-5221 | |
Hybridization Chamber, stainless | Agilent | G2534A | |
Hybridization oven | Agilent | G2545A | |
Purification Columns | Agilent | 5190-3391 | |
Laser scanner | Roche | MS200 | |
NimbleScan 2.6 | Roche Nimblegen | 5225035001 | |
Signal Map 1.9 | Roche Nimblegen | Signalmap1.9 |