We describe a general protocol that integrates high-resolution mass spectrometry analysis and molecular docking in traditional Chinese medicine research.
The separation and analysis of the desired chemical components are important subjects for the fundamental research of traditional Chinese medicine (TCM). Ultra-high-performance liquid chromatography quadrupole time-of-flight tandem mass spectrometry (UPLC-Q-TOF-MS/MS) has gradually become a leading technology for the identification of TCM ingredients. Gynura bicolor DC. (BFH), a perennial stemless herb used for medicine and food in China has medicinal effects such as clearing heat, moistening the lung, relieving cough, dispersing stasis, and relieving swelling. Polyphenols and flavonoids contain numerous isomers, which hinder the identification of the complex compounds in BFH. This paper presents a systematic protocol for studying chemical constituents of BFH based on solvent extraction and integrated data via UPLC-Q-TOF-MS.
The method described here includes systematic protocols for sample pretreatment, MS calibration, MS acquisition, data processing, and analysis of results. Sample pretreatment includes collection, cleaning, drying, crushing, and extraction. MS calibration consists of multipoint and single-point correction. Data processing includes data importing, method establishment, analysis processing, and result presentation. Representative results of the typical fragmentation pattern of phenolic acids, esters, and glycosides in Gynura bicolor DC. (BFH) are presented in this paper. In addition, organic solvent selection, extraction, data integration, collision energy selection, and method improvement are discussed in detail. This universal protocol can be widely used to identify complex compounds in TCM.
Traditional Chinese medicine (TCM) has been clinically practiced in China for thousands of years, and it plays a vital role in maintaining the health of Chinese people1. The composition of TCM is diverse and complex, and TCM has been widely reported in many qualitative studies focusing on the chemical composition2. The chemical components in TCM can be roughly divided into the following categories such as alkaloids, organic acids, phenylpropanoids, coumarins, lignans, quinones, flavonoids, terpenoids, triterpenoid saponins, steroid saponins, cardiac glycosides, and tannins3. Given the large numbers of unknown components and indistinguishable isomers in TCM, the separation and analysis of the desired chemical components are important subjects for the fundamental research of TCM4.
Ultra-high-performance liquid chromatography quadrupole time-of-flight mass spectrometry (UPLC-Q-TOF-MS) has been applied to analyze substances in traditional Chinese medicine (TCM), which can be separated by ultra-high-performance liquid chromatography5,6. The high resolution of MS can provide extensive ion information, which is used for database analysis with error less than 5 ppm7. After turning on the collision energy, the secondary MS mode can obtain secondary fragment ions, whose intensity and number are affected by the magnitude of energy8.
Gynura bicolor DC. (BFH), a perennial stemless herb widely used for medicine and food (Figure 1A), is a rare and endangered plant unique to China9. BFH has abundant anthocyanins, polyphenols, flavonoids, and strong antioxidant capacity10. BFH has medicinal effects including clearing heat, cooling blood, moistening the lung, relieving cough, dispersing stasis, relieving swelling, relieving summer heat, and eliminating heat. Few studies have focused on the chemical composition of BFH11. Polyphenols and flavonoids contain numerous isomers, which make the identification of the complex compounds in BFH difficult. A universal method for the identification of chemical components must be developed, which can be applied to all kinds of TCM. This study aimed to report a systematic protocol for studying chemical constituents of BFH based on solvent extraction and integrated data via UPLC-Q-TOF-MS.
1. Sample pretreatment
2. MS calibration
3. MS acquisition
4. Data processing
5. Result analysis
The chemical composition identification of BFH was used as a model to display the representative results. Base peak chromatograms of solvent-extracted Gynura bicolor DC. are shown in Supplemental File 1-Supplemental Figure S1-S6, and the observed retention time (RT), component name, formula, mass-to-charge ratio (m/z), and mass error are listed in Tables S1-S6. In the ZBTK, all 35 kinds of compounds were identified from separated peaks in UPLC-TOF-MS. As shown in Supplemental File 1-Supplemental Table S1, the main chemicals contained acid (quinic acid, DL-malic acid, citric acid, o-coumaric acid, and 3-O-trans-coumaroylquinic acid), glycosides (gentiopicrin, picroside II, suffruticoside J, copoline, and borneol-2-O-β-D-apiofuranosyl-(1→6)-β-D-glucoside), and esters (bilobalide, trimethyl citrate, and trans-O-glucosyl-methyl-trans-cinnamate). In the ZBTK-CT, 25 kinds of extra compounds were detected (Supplemental File 1-Supplemental Table S2), and 45 kinds of total compounds were observed.
When chloroform was used as an extractant, all 50 compounds were found. Among them, the 28 kinds of extra compounds included coronaric acid, sanleng acid, agrimol A, and methyl artemisate (Supplemental File 1-Supplemental Table S3). When petroleum ether was used as an extractant, 44 kinds of total compounds were observed. We found 17 kinds of extra compounds including notopterol, onjisaponin, and cnideol B (Supplemental File 1-Supplemental Table S4). A total of 65 compounds were obtained in solvent using ethyl acetate as an extractant. Compared with the previous four extractives, 37 extra compounds were noted such as daphnetin, acetovanillone B, and stigmasterol-3-O-β-D-glucoside (Supplemental File 1-Supplemental Table S5). In the butyl alcohol extractive, we found 67 kinds of total compounds, which contained 26 kinds of extra compounds including rengyoside C, chromones, aturametelin F, and deacetylmatricarin-8-O-β-D-glucopyranoside (Supplemental File 1-Supplemental Table S6). In summary, we found 168 compounds based on all the compound results of six extractants.
To deeply understand the cleavage pathway of compound types in UPLC-TOF-MS, we selected three compounds as examples. For acids, 3-O-trans-coumaroylquinic acid with m/z = 337.09222 could lose a C7H10O5 or C9H8O2 group via the hydrolysis reaction to form intermediates with m/z = 163.04116 or 199.05557, respectively (Figure 4D). For esters, trans-O-glucosyl-methyl-trans-cinnamate with m/z = 385.11308 could be converted into intermediate with m/z = 153.07485 or 135.04679 via C-C cleavage at different sites (Figure 4E). For glycosides, citrusin C with m/z = 385.11308 transformed into intermediate with m/z = 272.09174 via C-C bond cleavage in the aliphatic chain, whereas the loss of the C7H6O2 group resulted in intermediate with m/z = 113.02534 (Figure 4F). In another pathway, the breakage of the C-O-C bond on the benzene ring led to the formation of an intermediate with m/z = 163.11245, while the hexose unit disappeared.
The representative results of molecular docking revealed 2D and 3D structures in Figure 4G,H, respectively. Gene ontology enrichment analysis showed the results of gene function in terms of biological process (BP), cellular component (CC), and molecular function (MF; Figure 4I). The Kyoto Encyclopedia of Genes and Genomes pathway annotation analysis showed the pathway results of differentially expressed genes, which could be used to further understand the function of genes (Figure 4J). The detailed results of molecular docking are shown in Supplemental File 1-Supplemental Table S7 and Supplemental File 1-Supplemental Figure S7-S9.
Figure 1: Pretreatment method of traditional Chinese medicine. (A) Gynura bicolor DC. (B) Cleaning and drying. (C) Pulverization. (D) Organic solvent addition. (E) Ultrasound extraction. (F) Centrifugation of mixture. (G) Filtration of supernatant. Please click here to view a larger version of this figure.
Figure 2: Operation procedure of UPLC-Q-TOF-MS/MS. (A) Sodium formate preparation. (B) Mass axis calibration. (C) MS tune. (D) Sample placement. (E) Mass spectrum acquisition. (F) Mass spectrum display. Please click here to view a larger version of this figure.
Figure 3: Data processing and integration. (A) Data conversion. (B) Method modification. (C) Data processing. (D) Statistical analysis of results. Please click here to view a larger version of this figure.
Figure 4: Deep result analysis. (A) Secondary mass spectrum. (B) Secondary fragment ion display. (C) Fracture site display. (D) fragmentation pattern 1. (E) Fragmentation pattern 2. (F) Fragmentation pattern 3. (G) Demonstration of the active ingredient action target. (H) 3D structure of the active ingredient action target. (I) GO enrichment analysis. (J) KEGG pathway annotation analysis. Please click here to view a larger version of this figure.
Supplemental File 1: Identification of compounds, molecular docking results, base peak chromatograms, and network analysis. Please click here to download this File.
Besides water decoction12, organic solvent extraction is another common method of TCM pretreatment13. According to the principle of similar phase dissolution, numerous components have been extracted by the combination of various organic solvents14. Ultrasonic-assisted extraction is one of the main methods used to obtain components in TCM15. Supercritical carbon dioxide extraction is good for extracting certain kinds of substances such as lignans16. Moreover, microwave-assisted extraction in ionic liquids (ILs) was investigated as an alternative to conventional organic solvent extractions17. However, the precise control of microwave energy is difficult to achieve, which may cause secondary reactions such as etherification dehydration and esterification dehydration.
Optimal settings of collision energy can yield additional second-order MS fragment information. Information from tandem MS, i.e., MSE mode, was provided by two parallel alternating scans using either low energy to obtain molecular ion information or high energy to obtain full-scan accurate mass fragment ions18. During the setting of high-energy parameters, the range of 10-35 V is usually used in ramp transfer collision energy to enhance precursor fragmentation efficiency19.
The vast amount of raw MS data was generated from high-resolution MS. Raw MS data, which included molecular ion peaks in the MS spectrum and fragment ion peaks in the MS/MS spectrum, must be thoroughly processed and integrated. MS data analysis software aimed to analyze all metabolites in a biological sample comprehensively20. The screening of sulfated metabolites was based on the data-dependent acquisition of full MS scans and multidimensional metabolite data21. On the basis of evolving technology, artificial intelligence prediction will be a feasible development direction for MS data analysis.
The majority of compounds can be identified by this method. However, the limited secondary fragments cannot fully identify all isomers including flavonoids and tannins. To achieve structure identification, researchers can employ tandem mass spectrometry to provide further information on fragment ions, which help in the identification of compound structures8. Given the limitation of electrospray ionization ion or atmosphere pressure chemical ionization source, high-resolution MS is not suitable for macromolecule analysis. Matrix-assisted laser desorption ionization can promote the ionization of large molecular compounds, such as proteins22. In summary, UPLC-Q-TOF-MS/MS is one of the most widely applicable techniques for the identification of TCM compounds. It has the potential to be applied to other fields, including clinical medicine and energy chemistry.
The authors have nothing to disclose.
This work was funded by the National Natural Science Foundation of China (82104881), Inheritance and Innovation Team of TCM Treatment of Immune Diseases, Chongqing Medical Scientific Research Project (Joint project of Chongqing Health Commission and Science and Technology Bureau) (2022DBXM007), A special project for performance incentive and guidance of Chongqing Scientific Research Institute (cstc2022jxjl120005), A special project for Chongqing Postdoctoral Science Foundation (2022CQBSHTB3035), Senior Medical Talents Program of Chongqing for Yong and Middle-aged, the Program for Scientific Institutions of Chongqing (independent research project No.2022GDRC015).
chloroform | Sinopharm Chemical ReagentCo., Ltd | CAS 67-66-3 | |
ethanol | ChuandongChemical | CAS 64-17-5 | |
ethyl acetate | ChuandongChemical | CAS 141-78-6 | |
liquid chromatograph | Waters | ACQUITY Class 1 plus | |
MassLynx | Waters | V4.2 | MS control software |
n-butyl alcohol | ChuandongChemical | CAS 71-36-3 | |
petroleum ether | ChuandongChemical | CAS 8032-32-4 | |
Quadrupole time-of-flight mass spectrometry | Waters | SYNAPT XS | |
UNIFI | Waters | Data analysis software |
.