Diagnostic fragmentation filtering, implemented into MZmine, is an elegant, post-acquisition approach to screen LC-MS/MS datasets for entire classes of both known and unknown natural products. This tool searches MS/MS spectra for product ions and/or neutral losses that the analyst has defined as being diagnostic for the entire class of compounds.
Natural products are often biosynthesized as mixtures of structurally similar compounds, rather than a single compound. Due to their common structural features, many compounds within the same class undergo similar MS/MS fragmentation and have several identical product ions and/or neutral losses. The purpose of diagnostic fragmentation filtering (DFF) is to efficiently detect all compounds of a given class in a complex extract by screening non-targeted LC-MS/MS datasets for MS/MS spectra that contain class specific product ions and/or neutral losses. This method is based on a DFF module implemented within the open-source MZmine platform that requires sample extracts be analyzed by data-dependent acquisition on a high-resolution mass spectrometer such as quadrupole Orbitrap or quadrupole time-of-flight mass analyzers. The main limitation of this approach is the analyst must first define which product ions and/or neutral losses are specific for the targeted class of natural products. DFF allows for the subsequent discovery of all related natural products within a complex sample, including new compounds. In this work, we demonstrate the effectiveness of DFF by screening extracts of Microcystis aeruginosa, a prominent harmful algal bloom causing cyanobacteria, for the production of microcystins.
Tandem mass spectrometry (MS/MS) is a widely used mass spectrometry method that involves isolating a precursor ion and inducing fragmentation via application of activation energy such as collision induced dissociation (CID)1. The manner in which an ion fragments is intimately linked to its molecular structure. Natural products are often biosynthesized as mixtures of structurally similar compounds rather than as a single unique chemical2. As such, structurally related compounds that are part of the same biosynthetic class often share key MS/MS fragmentation characteristics, including shared product ions and/or neutral losses. The ability to screen complex samples for compounds that possess class-specific product ions and/or neutral losses is a powerful strategy to detect entire classes of compounds, potentially leading to the discovery of new natural products3,4,5,6. For decades, mass spectrometry methods such as neutral loss scanning and precursor ion scanning performed on low resolution instruments have allowed ions with the same neutral loss or product ions to be detected. However, the specific ions or transitions needed to be defined prior to performing the experiments. As high-resolution mass spectrometers have become more popular in research laboratories, complex samples are now commonly screened using non-targeted, data-dependent acquisition (DDA) methods. In contrast to traditional neutral loss and precursor ion scanning, structurally related compounds can be identified by post-acquisition analysis7. In this work, we demonstrate a strategy we have developed termed diagnostic fragmentation filtering (DFF)5,6, a straight-forward and user-friendly approach to detect entire classes of compounds within complex matrices. This DFF module has been implemented into the open-source, MZmine 2 platform and available by downloading MZmine 2.38 or newer releases. DFF allows users to efficiently screen DDA datasets for MS/MS spectra which contain product ion(s) and/or neutral loss(es) that are diagnostic for entire classes of compounds. A limitation of DFF is characteristic product ions and/or neutral losses for a class of compounds must be defined by the analyst.
For example, each of the more than 60 different fumonisin mycotoxins identified8,9 possess a tricarballylic side chain, that generates a m/z 157.0142 (C6H5O5–) product ion upon fragmentation of the [M-H]– ion4. Therefore, all putative fumonisins in a sample can be detected using DFF by screening all MS/MS spectra within a DDA dataset that contain the prominent m/z 157.0142 product ion. Similarly, sulfated compounds can be detected by screening DDA datasets for MS/MS spectra that contain a diagnostic neutral loss of 79.9574 Da (SO3)3. This approach has also been successfully applied for detecting new cyclic peptides5 and natural products that contain tryptophan or phenylalanine residues6.
To demonstrate the effectiveness of DFF and its ease of use within the MZmine platform10, we have applied this approach to the analysis of microcystins (MCs); a class of over 240 structurally related toxins produced by freshwater cyanobacteria11,12,13.
The most commonly reported cyanotoxins are MCs, with the MC-LR (leucine [L]/arginine [R]) congener most frequently studied (Figure 1). MCs are monocyclic non-ribosomal heptapeptides, biosynthesized by multiple cyanobacteria genera including Microcystis, Anabaena, Nostoc, and Planktothrix12,13. MCs are composed of five common residues and two variable positions occupied by L-amino acids. Nearly all MCs possess a characteristic β-amino acid 3-amino-9-methoxy-2,6,8-trimethyl-10-phenyldeca-4,6-dienoic acid (Adda) residue at position 511. The MS/MS fragmentation pathways of MCs are well described14,15; the Adda residue is responsible for the prominent MS/MS product ion, m/z 135.0803+ (C9H11O+) as well as other product ions including m/z 163.1114+ (C11H15O+) (Figure 2). Non-targeted DDA datasets of Microcystis aeruginosa cellular extracts can be screened for all microcystins present using these diagnostic ions, granted that the microcystins have an Adda residue.
1. Preparation of non-targeted liquid chromatography (LC)-MS/MS datasets
NOTE: DFF can be performed using any high-resolution mass spectrometer and analytical method optimized for a target class of analytes. MC optimized LC-MS/MS conditions on Orbitrap mass spectrometer are listed in the Table of Materials.
2. Diagnostic fragmentation filtering of imported DDA files
3. Example use of DFF for microcystin analysis
The DFF plot generated following the analysis of M. aeruginosa CPCC300 is shown in Figure 4. The x-axis of this plot is the m/z of the precursor ions that satisfied the defined DFF criteria while the y-axis shows the m/z of all product ions within the MCs MS/MS spectra. For this analysis, the criteria for MC detection included precursor ions within the m/z range of 440-1200, retention times between 2.00–6.00 min. Most importantly, these MS/MS spectra contain both m/z 135.0803 and 163.1114 (± 3 ppm) above the defined 15% basepeak intensity threshold. Under these conditions, a total of 4116 MS/MS spectra were acquired during the LC-MS/MS DDA analysis. Of those, 26 spectra satisfied the DFF criteria were detected in the M. aeruginosa CPCC300 extract. However, multiple MS/MS spectra can be acquired on the same compound, particularly for higher intensity ions. In this extract, only 18 unique precursor m/z were found. The smallest ion (m/z 497.2746, [M+2H]2+) is the doubly charged complement of the [M+H]+ precursor m/z 993.5389, which was also detected by DFF. Based on previously published studies on this M. aeruginosa strain18, the major MCs detected can be confidently assigned as MC-LR and [D-Asp3]MC-LR. Investigating the mass spectra of the remaining putative MCs revealed that two were 13C isotopes of other detected MCs (m/z 993.5389, 1025.5343) and another was an adduct of and MC of m/z 993.5389. Of the 12 remaining putative MCs, four corresponded to the masses of known MCs, and eight were previously unreported compounds (Supplementary File. Table S1).
Figure 1: Chemical structure of MC-LR. The Adda residue is common in a large proportion of known MCs and produces diagnostic product ions at m/z 135.0803 and 163.1114. Other MC variants that contain a dimethyl-Adda and acetyldemethyl-Adda residue at position 5 are known and would not produce the same product ions. Please click here to view a larger version of this figure.
Figure 2: MS/MS spectra of MC-LR. MS/MS spectra acquired on a Orbitrap mass spectrometer showing the prominent product ion at m/z 135.0803 derived from the Adda residue. An additional product ion at m/z 163.1114 is also derived from the Adda residue and increases the selectivity of the DFF analysis. Please click here to view a larger version of this figure.
Figure 3: DFF dialogue box within MZmine. The product ions and/or neutral losses that are diagnostic for the targeted class of compounds are inputted. Retention time and precursor ion filters can be used to increase selectivity of the analysis. The minimum diagnostic ion intensity refers the threshold intensity of the diagnostic product ions and neutral losses that must be achieved in order for the spectra to satisfy the DFF criteria. Lowering this value may result in false positive hits. Please click here to view a larger version of this figure.
Figure 4: DFF plot for MC analysis of M. aeruginosa cellular extract. DFF analysis of the M. aeruginosa CPCC300 extract found 26 spectra that met the defined DFF criteria, comprising 18 unique m/z values. Right clicking the plot allows the user to “Zoom Out” the domain and/or range axes. A doubly charged precursor ion was detected at m/z 497.2746 and corresponded to an unknown MC at [M+H]+ 993.5389. The two known MCs produced by strain CPCC300 are [D-Asp3]MC-LR and MC-LR 18. In total, eight putative MCs did not correspond to the m/z of known MCs, four MCs corresponded to the m/z of multiple congeners and three were found to be isotopes/adducts of other MCs (Supplementary File. Table S1). The DFF plot shown here was generated manually in Excel from the “putative_MCs_data.csv” that was automatically made upon executing the DFF module. Please click here to view a larger version of this figure.
Supplementary File. Optimized conditions for LC-MS/MS analysis of M. aeruginosa extracts. Please click here to download this file.
DFF is a straight-forward and rapid strategy for detecting entire classes of compounds, especially relevant for natural product compound discovery. The most important aspect of DFF is defining the specific MS/MS fragmentation criteria for the targeted class of compounds. In this representative example, DFF was used to detect all Adda residue containing MCs present in an M. aeruginosa cellular extract. Although the vast majority of MCs contain an Adda residue, other residues at this position have been known, notably demethyl- and acetyldemethyl-Adda variants19. Any MCs with these residues would not be detected using the defined criteria. However, as DFF is a post-acquisition approach, additional diagnostic fragments can easily be investigated on the same dataset using the simple step-by-step protocol outlined here. This also allows the analyst to detect compounds with hypothetical modifications that would alter the diagnostic product ions and/or neutral loss.
Adducts and in-source fragments may also meet DFF criteria and be incorrectly interpreted as unique analytes. False positives may arise when other compounds present in the extract exhibit the same product ions and/or neutral losses. In both cases, this can be alleviated by using additional product ions and neutral losses that increase method selectivity.
Although precursor ions may meet all of the DFF criteria defined by the analyst and represent compounds within the targeted class, their absolute identity will still be putative. Using the identification confidence levels, proposed by Schymanski (2014), MCs detected using this MS/MS approach have a ‘level 3’ identification confidence when unequivocal molecular formula of the precursor ion can be assigned by accurate mass and the isotope profile20. In this example, eight putative MCs had masses that corresponded to multiple, isobaric MCs11. Absolute identity would have been achieved by either comparison of retention time and MS/MS spectra with an authentic standard or confirmed by NMR and other spectroscopic methods after purification. Putative compounds that do not correspond to masses of any known members of the targeted class, such as the eight putative MCs detected here, represent tangible targets for discovering new natural products.
The authors have nothing to disclose.
The authors thank Heather Roshon (Canadian Phycological Culture Centre, University of Waterloo for providing the cyanobacteria culture studied and Sawsan Abusharkh (Carleton University) for technical assistance.
Cyanobacteria | |||
Microcystis aeruginosaCPCC300 | CANADIAN PHYCOLOGICAL CULTURE CENTRE | CPCC300 | https://uwaterloo.ca/canadian-phycological-culture-centre/ |
Software | |||
Proteowizard (software) | software | http://proteowizard.sourceforge.net/ | |
Mzmine 2 | software | http://mzmine.github.io/ | |
LC-MS | |||
Q-Exactive Orbitrap | Thermo | – | Equipped with HESI ionization source |
1290 UHPLC | Agilent | Equipped with binary pump, autosampler, column compartment | |
C18 column | Agilent | 959757-902 | Eclipse Plus C18 RRHD column (2.1 × 100 mm, 1.8 μm) |
Solvents | |||
Optima LC-MS grade Methanol | Fisher | A456-4 | |
OptimaLC-MS grade Acetonitrile | Fisher | A955-4 | |
OptimaLC-MS grade Water | Fisher | W6-4 | |
LC-MS grade Formic Acid | Fisher | A11710X1-AMP | |
Vortex-Genie 2 | Scientific Industries | SI-0236 | |
Centrifuge Sorvall Micro 21 | Thermo Scientific | 75-772-436 | |
Other | |||
Amber HPLC vials 2 mL/caps | Agilent | 5182-0716/5182-0717 | |
0.2-μm PTFE syringe filters | Pall Corp. | 4521 | |
Whatman 47mm GF/A glass microfiber filters | Sigma-Aldrich | WHA1820047 | |
Media | |||
MA media (pH 8.6) ( quantity / L) | Watanabe, M. F. & Oishi, S. Effects of environmental factors on toxicity of a cyanobacterium (Microcystis aeruginosa) under culture conditions. Applied and Environmental microbiology. 49 (5), 1342-1344 (1985). | ||
Ca(NO3)·4H2O, 50 mg | Sigma-Aldrich | C2786 | |
KNO3, 100 mg | Sigma-Aldrich | P8291 | |
NaNO3, 50 mg | Sigma-Aldrich | S5022 | |
Na2SO4, 40 mg | Sigma-Aldrich | S5640 | |
MgCl2·6H20, 50 mg | Sigma-Aldrich | M2393 | |
Sodium glycerophosphate, 100 mg | Sigma-Aldrich | G9422 | |
H3BO3, 20 mg | Sigma-Aldrich | B6768 | |
Bicine, 500 mg | Sigma-Aldrich | RES1151B-B7 | |
P(IV) metal solution, 5 mL | |||
Bring the following to 1 L with ddH2O | |||
NaEDTA·2HO | Sigma-Aldrich | E6635 | |
FeCl3 ·6H2O | Sigma-Aldrich | 236489 | |
MnCl2·4H2O | Baker | 2540 | |
ZnCl2 | Sigma-Aldrich | Z0152 | |
CoCl2·6H2O | Sigma-Aldrich | C8661 | |
Na2MoO4·2H2O | Baker | 3764 | |
Cyanobacteria BG-11 50X Freshwater Solution | Sigma-Aldrich | C3061-500mL |