MMHub, a database for the mulberry metabolome Open Access

Examples of metabolite identification based on database comparisons and specific fragmentation patterns in MMHub. Molecular feature m0375 (A) was identified as D-pantothenic acid by comparison with MassBank (B). Two molecular features (m1903 and m1963) were annotated as kaempferol 3-O-rutinoside (C) and rutin (D) based on flavonoid-specific fragmentation patterns. Identities of three metabolites shown in this figure were confirmed by comparison with authentic standards.

Figure 2

MS2T library construction

A segmented full MS/MS scan strategy developed from the widely targeted metabolomics approach was used to construct the mulberry MS2T library (Figure 1). This strategy comprised 18 full MS/MS runs (segmented with 50 m/z) instead of thousands of multiple ion monitoring-enhanced product ions transitions in 113 runs (22). Original chromatographic peaks (signals) and related mass spectra were manually checked. Further filtering was conducted to remove potentially redundant signals including those of isotopes, in-source fragmentation products, K⁺, Na⁺ and NH4⁺ adducts and dimerzations. The signal/noise (s/n) check and three rounds of redundancy removal were performed using the same standards with the software that we developed in-house. Supporting information such as PubChem compound information, molecular formula and main fragments were also added to the MS2T library according to Fernie’s recommendation (23).

Metabolite identification and annotation

Metabolite identification/annotation was based on the accurate m/z, retention time (RT) and fragmentation patterns. The accuracy of metabolite identification (from high to low) was divided into four levels (A–D). Level A was the most accurate identification, indicating that those metabolites had the same RT (± 0.1 min) and mass spectra as those of authentic standards. Level B indicated those metabolites showed >85% match rate when their main fragments were searched against public databases (MassBank, KNApSAcK, HMDB and METLIN), or showed specific fragmentation patterns. Metabolites only with confident m/z (|error|≤10 ppm) by comparison with references or detected in other species were defined as level C and D (relatively low accuracy). The proportions of metabolites in categories C and D were relatively low.

Quantification of metabolites

To improve the sensitivity and accuracy of quantification, 936 molecular features were analyzed in two m/z ranges (from 100 to 400 and from 400 to 1000). The quantitative calculations were conducted using Thermo Scientific™ Xcalibur™ software v. 2.2. Each metabolite was quantified with accurate mass tolerance (units, 20 ppm) and precision (decimals, 0.0001). To verify data stability and reproducibility, a mixture of 50 randomly chosen extracts was repeatedly analyzed (n = 6) as the reference control through the analytical procedure. Each set of 20 samples and the relative content of each metabolite were normalized to the average in the reference control. The relative content of each metabolite in 91 mulberry resources has been deposited in the MMHub as its log² value.

Classification of 124 identified/annotated metabolites in MMHub. The numbers of metabolites in each classification are shown in parentheses.

Figure 3

Snapshots of searches for metabolites and related information in MMHub. Researchers can retrieve general information about metabolites by basic browsing (A and B) or specific searches (C). Detailed information including available compound information (D), quantitative data (E) and mass spectrum (F) retrieved by clicking the ‘Peak no.’ link.

Figure 4

Results

Database description

We analyzed the metabolome of mulberry using a widely targeted metabolomics approach based on high performance liquid chromatography-tandem mass spectrometry (HPLC-MS/MS) (13). By applying this segmented full MS/MS scan strategy (Figure 1), metabolites with mass range from m/z 100 to 1000 were detected with high sensitivity in 18 LC-MS/MS runs. In total, 4924 chromatographic peaks (signals) with s/n > 10 were manually checked. To produce a matrix with fewer biased and redundant data, redundancies caused by signals from isotopes, in-source fragmentation products, K⁺, Na⁺ and NH4⁺ adducts and dimerzations were removed using in-house software written in Perl (22). After the first redundancy removal, 2319 signals were obtained. To produce optimal mass spectra data and remove potential redundancies, a high-concentration mixture of compounds was subjected to LC-MS/MS analysis under the data-dependent MS² (dd-MS²) mode. These new data were run through redundancy removal software (second redundancy removal) and generated a data matrix with 1577 signals.

We expected that transitions derived from the same metabolites would be strongly correlated among different samples. Thus, a pre-test quantitative analysis of 50 mulberry resources was conducted and correlation coefficients between transition pairs were calculated. We found that 641 metabolites with high correlation coefficients (>0.9 for metabolites eluted earlier than 2.5 min or > 0.8 for metabolites eluted later than 2.5 min). The same retention time (±0.1 min) were considered as another criterion of redundancy (third redundancy removal). Finally, an MS2T library containing 936 molecular features with almost no redundancy was obtained and is reported as recommended previously (Tables S1 and S2) (23).

User guide for uploading and submission of mass spectra data into MMHub.

Figure 5

Identification and annotation of metabolites in mulberry leaves

In MMHub (version 1.0), 37 commercially available standards were identified using the same profiling conditions as those used to analyze the extracts. Eighty-seven metabolites were putatively identified by querying MS/MS spectra data against the literature or databases (MassBank, KNApSAcK, HMDB and METLIN) (Figure 2A and B). A number of metabolites were identified by analyzing specific fragmentation patterns. For example, two flavonoids, m1903 (RT 7.35 min, m/z 595.1650, error − 1.18 ppm) and m1963 (RT 6.81 min, m/z 611.1590, error − 2.78 ppm) were identified as follows: the diagnostic fragment ions of kaempferol were m/z 287.0546, 153.0179 and 121.0284; the natural loss of hexoside (162 Da, 449.1068 → 287.0546) and rhamnoside (146 Da, 595.1650 → 449.1068) moieties were revealed in the mass spectrum (Figure 2A); the diagnostic fragment ions of quercetin were m/z 303.0494, 257.0440, 229.0491 and 153.0180; and the natural losses of the hexoside and rhamnoside moieties were revealed in the mass spectrum (Figure 2B). Subsequent comparative analysis of standards confirmed that m1903 and m1963 were kaempferol 3-O-rutinoside and rutin (quercetin 3-O-rutinoside), respectively. Their glycosylation sites were determined and were consistent with our predictions. MMHub contains 124 identified or tentatively annotated metabolites and 90 metabolites with associated chemical structures including 44 flavonoids, 15 alkaloids, 17 amino acids, 9 lysophosphatides, 11 vitamins, 4 polypeptides and several other kinds of metabolites (Figure 3).

User interface

To provide a user-friendly way to access all metabolomics data mentioned above, we constructed MMHub (version 1.0), which allows researchers to browse and search the data efficiently. The MMHub database is navigated by a top menu, with four major sections: Home, Browse, About and a powerful string search box (Figure 4A). We have not provided any user guidelines because it is extremely easy for users to find information. For example, by clicking on the ‘Click to find more’ hyperlink on the MMHub homepage, a new page opens with detailed information about metabolomics (Figure 4B).

A powerful string search box is located at the right top of each page in the MMHub database. Any string can be submitted as a query to retrieve corresponding records from the database. By searching for ‘flavonoid’, for example, matching hits with high scores are listed in tabular format (Figure 4C). MMHub is fully searchable with scrips to view and extract metabolomics information, including peak no., RT, structure identifiers, international chemical identifier keys, PubChem compound identifiers, metabolite name, metabolite class, molecular formula and other information supported in the MS2T library (Tables S1 and S2). Clicking on the link ‘Peak no.’ in the browsing page takes the reader to a page with detailed information about that peak, including basic qualitative data (RT, m/z, main fragments and available compound information) (Figure 4D), quantitative data (specific distribution in 91 mulberry resources with two biological duplicates) (Figure 3E) and original mass spectrum (Figure 4F). A key feature of MMHub is that it provides links for known compounds to the PubChem database, where more features are listed (2D and 3D structures).

As an open metabolome database for mulberry, we expect to accommodate more metabolite data for other mulberry tissues, which can be added by researchers from any institution. By clicking on the ‘Submit’ hyperlink, any user can submit metabolite data to MMHub (Figure 5). Essential information (marked with *) including peak no., compound name, molecular formula, main MS/MS fragments and localization in mulberry tissues should be supplied. Users can choose to submit data manually one by one or directly upload data in file format. To ensure the authenticity and reliability of the data, all uploaded data will be temporarily stored in our local server before being deposited in MMHub. New supporting data that have been contrasted and filtered against available data in MMHub will be updated regularly and displayed to all users.

Discussion

Mulberry has been identified as a potential functional nutraceutical food in recent years (24). Bioactive compounds in mulberry have been shown to prevent and treat hyperglycemia (25, 26), hyperlipidemia (3, 27), Alzheimer’s disease (28) and cancer (29). Mass spectral data are important experimental data for research on bioactive compounds. MMHub (version 1.0) is the first public repository of mass spectra of small chemical compounds (<1000 Da) in mulberry leaves. Although our ESI-MS² data were obtained under non-standardized and independent experimental conditions, Volna et al. found that the fragmentation patterns are almost identical for all tandem mass analyzers and that only the ratios of the product ions differ somewhat (16, 30). For instance, m0167 and m0375 are identified as DNJ and D-pantothenic acid, respectively, in MMHub and show similar fragmentation patterns and main fragments in the HMDB (HMDB ID: HMDB0035359) and MassBank (MassBank record: PR100400) databases (Figure 2B). Actually, 87.1% of 124 identified metabolites (108/124) in MMHub are also listed in other public databases (MassBank, METLIN, HMDB and KNApSAcK).

The widely targeted metabolic profiling method based on HPLC-MS/MS is not limited to a single mulberry species or particular tissue (Figure 1). This approach has been used to construct multiple MS2T libraries including 983 molecular features for maize kernels (22), 840 for rice leaves (31) and 2059 for qingke and barley (14). Primary and secondary metabolites like amino acids, nucleic acids, vitamins, flavonoids and vitamins can be efficiently detected using this high-throughput method. Qualitative and quantitative variations in metabolism are regarded as the ‘metabotype’ (32), which represents the bridge between the genotype and the phenotype of a plant (11). Recently, metabolomics combined with broad profiling approaches like genomics and transcriptomics has become an essential method to explore the diversity of plant metabolism and its underlying molecular mechanisms (22, 33, 34). This comprehensive metabolome database will provide a starting point for research on bioactive metabolites and will be useful for the identification of new candidate genes in mulberry.

The existing public metabolome databases are richly annotated and can meet many of the needs of biochemists, clinical chemists, physicians, medical geneticists, nutritionists and members of the metabolomics community (12). MMHub is a specialized database for mulberry research. Although many metabolites remain unidentified, MMHub provided a comprehensive collection of data for research on the mulberry-specific metabolome. The database also includes related quantitative data for 91 mulberry resources with two biological duplicates. This information may be useful for mulberry breeders and plant biochemists to screen for quality resources and specific metabolites.

MMHub was constructed to help researchers mine for metabolomics information easily and effectively. It has several features and advantages: (i) all metabolomics information, including metabolites, structure, mass spectra and validated metabolite concentrations, are strictly curated and verified; (ii) as an initial metabolomics data repository for mulberry, it can be used to provide basic data for other databases; and (iii) MMHub is a work in progress. Newly identified metabolomics data for mulberry will be integrated into MMHub as quickly as possible, and submission queries from all researchers will be encouraged. The comprehensive metabolomic studies of multiple mulberry tissue including fruit and bark are carrying out. It is particularly exciting given that the tissue-specific metabolites, such as anthocyanidins, will greatly enrich the diversity of mulberry metabolites in MMHub. In addition, the agronomic traits of mulberry resources including leaf type, fruit size and fruit color will be integrated into the next version of MMHub.

Conclusion

In summary, MMHub is a user-friendly, freely available and comprehensive metabolomics database that brings together data for thousands of endogenous mulberry metabolites. We believe that MMHub is unique. Although MMHub is a work in progress, our aim is to make it a special metabolomics repository for mulberry species. We are committed to continuously updating this database with strictly curated data, including more metabolites and more tissues. We expect that MMHub will be useful for applications in metabolomics, traditional medicines, biomarker discovery and general education.

Availability

All metabolomics data deposited in this database are freely available to any researcher without restrictions.

Database name: MMHub.

Database URL: https://biodb.swu.edu.cn/mmdb/

Conflict of interest. The authors declare no competing financial interest.

Acknowledgements

The authors are grateful to all the laboratory members who provided advice during this work.

Funding

National Key Research and Development Program (2018YFD1000602); Chongqing Research Program of Basic Research and Frontier Technology (cstc2018jcyjAX0407); Fundamental Research Funds for the Central Universities (XDJK2017C076 and SWU118040); China Postdoctoral Science Foundation (2017M612884).

References

Park

Lee

S.M.

Lee

J.E.

et al. (

2013

)

Anti-inflammatory activity of mulberry leaf extract through inhibition of NF-kappa B

J. Funct. Foods

178

–

186

Jeong

J.H.

Lee

N.K.

Cho

S.H.

et al. (

2014

)

Enhancement of 1-deoxynojirimycin content and alpha-glucosidase inhibitory activity in mulberry leaf using various fermenting microorganisms isolated from Korean traditional fermented food

Biotechnol Bioproc E

1114

–

1118

X.Q.

Thakur

Chen

G.H.

et al. (

2017

)

Metabolic effect of 1-deoxynojirimycin from mulberry leaves on db/db diabetic mice using liquid chromatography-mass spectrometry based metabolomics

J. Agric. Food Chem.

4658

–

4667

Hunyadi

Liktor-Busa

Marki

et al. (

2013

)

Metabolic effects of mulberry leaves: exploring potential benefits in type 2 diabetes and hyperuricemia

Evid-Based Compl Alt

2013

948627

Deng

M.J.

Lin

X.D.

Wen

C.W.

et al. (

2017

)

Metabolic changes in the midgut of Eri silkworm after oral administration of 1-deoxynojirimycin: a 1H-NMR-based metabonomic study

PLoS One

e0173213

Zhang

Zhu

K.L.

Zhang

et al. (

2019

)

Purification of flavonoids from mulberry leaves via high-speed counter-current chromatography

Processes

–

101

Kwon

O.C.

W.T.

Kim

H.B.

et al. (

2019

)

UPLC-DAD-QTOF/MS analysis of flavonoids from 12 varieties of Korean mulberry fruit

J. Food Qual.

2019

–

OpenURL Placeholder Text

Chen

Sheng

Qiu

et al. (

2019

)

Purification, characterization and in vitro and in vivo immune enhancement of polysaccharides from mulberry leaves

PLoS One

e0208611

Yan

F.J.

and

Zheng

X.D.

(

2017

)

Anthocyanin-rich mulberry fruit improves insulin resistance and protects hepatocytes against oxidative stress during hyperglycemia by regulating AMPK/ACC/mTOR pathway

J. Funct. Foods

270

–

281

10.

Fernie

A.R.

Trethewey

R.N.

Krotzky

A.J.

et al. (

2004

)

Metabolite profiling: from diagnostics to systems biology

Nat. Rev. Mol. Cell Biol.

763

–

769

11.

Fang

C.Y.

Fernie

A.R.

and

Luo

(

2019

)

Exploring the diversity of plant metabolism

Trends Plant Sci.

–

12.

Wishart

D.S.

Knox

Guo

A.C.

et al. (

2009

)

HMDB: a knowledgebase for the human metabolome

Nucleic Acids Res.

D603

–

D610

13.

Chen

Gong

Guo

et al. (

2013

)

A novel integrated method for large-scale detection, identification, and quantification of widely targeted metabolites: application in the study of rice metabolomics

Mol. Plant

1769

–

1780

14.

Zeng

Yuan

Dong

et al. (

2020

)

Genome-wide dissection of co-selected UV-B responsive pathways in the UV-B adaptation of qingke

Mol. Plant

112

–

127

15.

Tohge

Wendenburg

Ishihara

et al. (

2016

)

Characterization of a recently evolved flavonol-phenylacyltransferase gene provides signatures of natural light selection in Brassicaceae

Nat. Commun.

12399

16.

Horai

Arita

Kanaya

et al. (

2010

)

MassBank: a public repository for sharing mass spectral data for life sciences

J. Mass Spectrom.

703

–

714

17.

Taguchi

Nishijima

and

Shimizu

(

2007

)

Basic analytical systems for lipidomics by mass spectrometry in Japan

Methods Enzymol.

432

185

–

211

18.

Smith

C.A.

O’Maille

Want

E.J.

et al. (

2005

)

METLIN: a metabolite mass spectral database

Ther. Drug Monit.

747

–

751

19.

Zhang

et al. (

2013

)

Draft genome sequence of the mulberry tree Morus notabilis

Nat. Commun.

2445

20.

Zeng

et al. (

2014

)

MorusDB: a resource for mulberry genomics and genome biology

Database (Oxford)

2014

–

OpenURL Placeholder Text

21.

Zeng

Chen

Zhang

et al. (

2015

)

Definition of eight mulberry species in the genus Morus by internal transcribed spacer-based phylogeny

PLoS One

e0135411

22.

Wen

et al. (

2014

)

Metabolome-based genome-wide association study of maize kernel leads to novel biochemical insights

Nat. Commun.

3438

23.

Fernie

A.R.

Aharoni

Willmitzer

et al. (

2011

)

Recommendations for reporting metabolite data

Plant Cell

2477

–

2482

24.

Srivastava

Kapoor

Thathola

et al. (

2003

)

Mulberry (Morus alba) leaves as human food: a new dimension of sericulture

Int. J. Food Sci. Nutr.

411

–

416

25.

Kimura

Nakagawa

Kubota

et al. (

2007

)

Food-grade mulberry powder enriched with 1-deoxynojirimycin suppresses the elevation of postprandial blood glucose in humans

J. Agric. Food Chem.

5869

–

5874

26.

Katsube

Imawaka

Kawano

et al. (

2006

)

Antioxidant flavonol glycosides in mulberry (Morus alba L.) leaves isolated based on LDL antioxidant activity

Food Chem.

–

27.

Chen

J.J.

and

X.R.

(

2007

)

Hypolipidemic effect of flavonoids from mulberry leaves in triton WR-1339 induced hyperlipidemic mice

Asia Pac. J. Clin. Nutr.

290

–

294

PubMed

OpenURL Placeholder Text