Abstract

Phytochelatins (PyCs) are a diverse set of plant compounds that chelate metals, protect against metal toxicity and function in metal homeostasis. PyCs are present in plants consumed as food by humans and could, in principle, impact absorption and utilization of essential and toxic metals such as selenium and cadmium, respectively. PyCs vary in terminal amino acid composition and chain length, exist in multiple oxidation states and reversibly bind multiple metals; consequently, PyCs include a large set of possible structures. Although individual PyC-metal complexes have been studied, no resource exists to characterize the diversity of PyCs and PyC-metal complexes. We used the scientific literature to develop a database of elemental formulas for polymer forms varying in chain length from 2 to 11 glutamyl-cysteine repeats. Using elemental formulas, we calculated monoisotopic masses using the most abundant isotopes of each element and calculated masses for complexes with 13 metals of nutritional and toxicological significance. The resulting phytochelatin database (PyCDB) contains 46 260 unique elemental formulas for PyC and PyC-metal complexes. The database is available online for download as well as for direct mass queries for mass spectrometry using an accurate mass annotation tool for user-selected PyC types, metals and adducts of interest. We performed studies of a commonly consumed food—onion—to validate the database and test utility of the tool. Onion samples were analyzed using ultra-high resolution mass spectrometry-based metabolomics. Mass spectral features were annotated using the PyCDB web tool and the R package, xMSannotator; annotated features were further validated by collision-induced dissociation mass spectrometry. The results establish use and a workflow for PyCDB as a resource for characterization of PyCs and PyC-metal complexes.

Introduction

Phytochelatins (PyCs) function as key mediators of metal detoxification and homeostasis in plants. PyC-metal complexes protect plants from metal toxicity by chelating heavy metals and metalloids such as cadmium (Cd) and arsenic (As). They also bind required nutrients such as zinc (Zn), selenium (Se) and copper (Cu) (1). Because of this essential role, PyCs are extensively studied in agriculture and soil bioremediation (2–4). However, most studies analyze only specific PyCs and a few metals of interest. More comprehensive characterization of PyC-metal complexes would improve understanding of metal sequestration and management in plants and could clarify the role of PyCs in metal bioavailability and toxicity in humans and other animal species that consume PyC-containing foods.

PyCs are glutathione (GSH)-derived polypeptides that are formed enzymatically by dipeptidyl transfer of a donor γ-glutamyl-cysteine (γ-Glu-Cys) to GSH or a related peptide. The first PyC form identified was (γ-Glu-Cys)n-Gly (n = 2–11) (5). Other forms exist in which glycine (Gly) is replaced by β-alanine (β-Ala), Ala, glutamine (Gln), serine (Ser) or glutamate (Glu), or in which no additional amino acid is present (4,6–8). Because the Cys residues give PyCs a high thiol (-SH) content, PyCs bind metals strongly, and their metal-binding capacity increases with PyC size (3,9).

PyC synthesis increases in response to metal exposure, allowing plants to bind metals and reduce the risk of heavy metal toxicity. PyCs have been observed in a wide range of plant species (1), accumulating in different tissues depending on the plant (10). For example, in rice (Oryza sativa) seedlings exposed to a high dose of Cd, PyC2-Gly was at the highest concentration in leaves followed by roots and shoots (6). In a different study of wild basil (Clinopodium vulgare) grown with excess Cd, the roots had 4-fold and 10-fold higher concentrations of PyC2-Gly compared to leaves and shoots, respectively (11). PyC concentrations and chain lengths also vary with exposure to different metal types and concentrations (11). Functionally, within plants, PyC-metal complexes are not only sequestered in the vacuoles of plant cells but also occur at lower proportions in cytosol and phloem sap (1,12). However, comprehensive characterization of the types, concentrations and tissue locations of PyCs and PyC-metal complexes has not been completed across common plants grown for human consumption.

With the development of high-resolution metabolomics (HRM) (Figure 1) (13–16), characterization of multiple forms of metal-free PyC and PyC-metal complexes within a single analysis is possible. However, no database exists to allow characterization of PyCs in plants and food. Here we create a phytochelatin database (PyCDB) for use with metabolomics that contains elemental compositions and monoisotopic masses for a wide range of probable and experimentally detected PyC and PyC-metal complexes. We also provide the PyCDB through an accompanying web-based metabolite annotation tool. We include a metabolomics analysis of a common plant food, onion, as an example of identifying PyCs using the PyCDB in an HRM workflow. Additionally, we provide validation of PyC and PyC-metal complexes included in the PyCDB using collision-induced dissociation mass spectrometry. This database provides a resource for research on PyC and PyC-metal complexes in studies of metals with agricultural, nutritional and toxicological significance.

Figure 1. HRM workflow for PyC detection and validation. Using liquid chromatography with ultra-high resolution mass spectrometers followed by the application of data extraction algorithms, broad characterization of the metabolites (mass-to-charge, m/z; retention time, RT; and relative intensity) can be obtained. The mass spectral feature table is then searched against the database of compounds using matching criteria such as a retention time window and maximum allowable parts per million differences. Annotated features of interest are then targeted for validation by collision-induced dissociation using MS/MS.

Materials and Methods

Selection of PyCs

Many metals bind with high affinity to thiols (-SH) in Cys residues, and thousands of potential PyC-metal complexes are possible because PyCs vary in length and terminal amino acid. In addition, sulfur can exist in different oxidation states (e.g. thiol versus disulfide form), and metals can interact with multiple functional groups in the peptides (e.g. O- and N-containing groups). Because generation of non-biologically relevant structures in a database can introduce an obstacle to understanding biology, we limited the range of predicted structures to those for which evidence indicates existence or likelihood of existence under relevant biologic conditions. Therefore, to define the scope of the database, we started with thiol (-SH) forms of PyCs. As most metals have the greatest binding affinity for the ionized thiolate (-S⁻) form of thiol groups, preferential binding to thiol groups (over O- and N-containing groups) was assumed for this database iteration. At biological pH (near neutral or slightly basic), a proportion of the thiol groups is expected to be in the metal-binding thiolate form, and experimental evidence shows thiolate binding in PyC-metal complexes (17–19). On this basis, the PyC-metal complex elemental compositions were calculated with two protons removed for every divalent (2+) metal ion bound. For monoisotopic mass calculations based on PyC elemental compositions, we used the most abundant isotope for each element. Finally, predicted compounds were based on likely chemical interactions for common metal oxidation states at neutral pH, room temperature and room air. As knowledge of PyC-metal complexes increases, future versions will be updated to include other PyC-metal complexes.

Using the seven PyC forms (see ‘Base PyCs’ in Table 1), defined by C-terminal amino acid and 2–11 repeating γ-Glu-Cys peptide units, we calculated base elemental formulas from which predicted forms of PyC-metal complexes were generated (1) (Figure 2). As (γ-Glu-Cys)n-Ala and (γ-Glu-Cys)n-β-Ala are identical in elemental composition and monoisotopic mass, duplicate elemental compositions were not included. Disulfide (S-S) bonds are formed in molecules with two or more Cys under oxidizing conditions. Disulfide bonds form during food storage and preparation (20). Additionally, disulfide bonds will form during sample preparation unless specific anaerobic conditions or reducing agents are used. Disulfide PyC forms likely also exist in vivo due to normal reduction-oxidation reactions occurring as part of cellular signaling and metabolic processes. Single disulfide forms for PyC2 to PyC11 were calculated by subtracting 2H from the elemental formulas, accounting for the two protons lost from thiol groups during disulfide bond formation. For each additional disulfide, a further 2H was subtracted. For metal-bound forms, two additional Cys (-SH groups) are required to bind a metal in the 2+ oxidation state. For example, a two-disulfide form can only occur in PyC4 or longer, and 2+ metal (Me2+) binding can only occur with a two-disulfide form in PyC6 or longer. Forms with up to five disulfides were included, corresponding to known PyC lengths (Table 1).
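The formula arithmetic described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the code used to build the PyCDB. The function names `pyc_formula` and `hill_formula` are hypothetical, and the per-residue increments are assumptions derived from standard peptide condensation (each added residue loses one H2O); they reproduce the n = 2 formulas listed in Table 1.

```python
from collections import Counter

# (γ-Glu-Cys)2 with no terminal residue, as listed in Table 1.
BASE_N2 = Counter({"C": 16, "H": 26, "N": 4, "O": 9, "S": 2})

# One additional γ-Glu-Cys unit: Glu + Cys minus 2 H2O.
# Assumption from standard peptide chemistry, not quoted from the paper.
REPEAT_UNIT = Counter({"C": 8, "H": 12, "N": 2, "O": 4, "S": 1})

# Residue contributed by each C-terminal amino acid (free amino acid minus H2O).
TERMINAL = {
    "none": Counter(),
    "Gly": Counter({"C": 2, "H": 3, "N": 1, "O": 1}),
    "Ala": Counter({"C": 3, "H": 5, "N": 1, "O": 1}),  # identical for β-Ala
    "Ser": Counter({"C": 3, "H": 5, "N": 1, "O": 2}),
    "Gln": Counter({"C": 5, "H": 8, "N": 2, "O": 2}),
    "Glu": Counter({"C": 5, "H": 7, "N": 1, "O": 3}),
}


def pyc_formula(n, terminal="Gly", disulfides=0):
    """Element counts for (S-S)m(γ-Glu-Cys)n-terminal."""
    if not 2 <= n <= 11:
        raise ValueError("n must be between 2 and 11")
    if disulfides < 0 or 2 * disulfides > n:
        raise ValueError("each disulfide needs two Cys thiols")
    counts = Counter(BASE_N2)
    for _ in range(n - 2):
        counts.update(REPEAT_UNIT)      # add one γ-Glu-Cys unit
    counts.update(TERMINAL[terminal])   # add the C-terminal residue
    counts["H"] -= 2 * disulfides       # 2 H lost per disulfide bond
    return dict(counts)


def hill_formula(counts):
    """Format element counts as a string: C, H, then other elements A-Z."""
    rest = sorted(e for e in counts if e not in ("C", "H"))
    parts = []
    for element in ["C", "H"] + rest:
        k = counts.get(element, 0)
        if k > 0:
            parts.append(element + (str(k) if k > 1 else ""))
    return "".join(parts)


print(hill_formula(pyc_formula(2, "Gly")))                # C18H29N5O10S2
print(hill_formula(pyc_formula(2, "Gly", disulfides=1)))  # C18H27N5O10S2
```

Building longer chains by adding a fixed repeat unit keeps every formula traceable to the n = 2 entries, which is convenient when checking individual database rows by hand.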

Table 1. PyC structures vary by terminal amino acid, number of repeating peptide units (n = 2–11) and number of disulfide bonds (m = 1–5)

| Base PyCs | Elemental formula (e.g. n = 2) | Disulfide form |
| --- | --- | --- |
| (γ-Glu-Cys)n-Gly | C18H29N5O10S2 | (S-S)m(γ-Glu-Cys)n-Gly |
| (γ-Glu-Cys)n-β-Ala | C19H31N5O10S2 | (S-S)m(γ-Glu-Cys)n-β-Ala |
| (γ-Glu-Cys)n-Ala | C19H31N5O10S2 | (S-S)m(γ-Glu-Cys)n-Ala |
| (γ-Glu-Cys)n | C16H26N4O9S2 | (S-S)m(γ-Glu-Cys)n |
| (γ-Glu-Cys)n-Gln | C21H34N6O11S2 | (S-S)m(γ-Glu-Cys)n-Gln |
| (γ-Glu-Cys)n-Ser | C19H31N5O11S2 | (S-S)m(γ-Glu-Cys)n-Ser |
| (γ-Glu-Cys)n-Glu | C21H33N5O12S2 | (S-S)m(γ-Glu-Cys)n-Glu |

PyC-metal complexes were calculated using the elemental formulas described above as base units for construction of other forms. If two thiols were not available for binding, the PyC-metal complex was not included.

Figure 2. Formation of PyC-metal complexes. A. PyCs form complexes with metal ions. Metal ions in the 2+ oxidation state bind to the sulfurs of thiol groups on two cysteine residues. B. An example of the elemental formulas in PyCDB for the base PyC, phytochelatin2-glycine (PyC2-Gly), in metal-bound and unbound forms. C. An example of the abbreviated name for the base PyC form, number of repeating peptide units, and metal (if bound).

Selection of metals

Database metals include selected metals of either nutritional or toxicological importance. Other factors considered for metal selection included the common oxidation states, most abundant isotopes and expected Lewis acid-base chemistry. Lewis acid-base chemistry influences the ability of metals to form stable complexes, depending on whether the metal ion is a hard acid or a soft acid. Soft acids have a greater tendency to form stable complexes with thiolates (-S⁻) (21,22). Of note, but beyond the scope of the current discussion, the molecular environment of the thiol group will also affect the affinity with which it binds the metal. The molecular environment is influenced by factors such as the biological matrix and sample preparation. For example, the binding affinity of metals for the thiolate will change depending on pH. These factors must be considered when designing experiments and interpreting results for studies of PyC-metal complexes.

Many of the soft metals of nutritional and toxicological interest have a common oxidation state of 2+: cadmium (Cd), cobalt (Co), copper (Cu), iron (Fe), lead (Pb), manganese (Mn), mercury (Hg), nickel (Ni) and zinc (Zn). Some have common oxidation states of 1+: Cu, Hg and silver (Ag). Initial calculations were made for the 2+ ions of the most abundant isotope, including 114Cd, 59Co, 63Cu, 56Fe, 208Pb, 55Mn, 202Hg, 58Ni and 64Zn, for the 1+ ion (107Ag), and for negatively charged (2-) forms of selenium (80Se). This includes five nutritionally important minerals (Fe, Cu, Mn, Se and Zn) and eight metals of environmental health concern (Cd, Co, Pb, Mn, Hg, Ni, Ag and Se) (Supplementary Information, Figure S-2). Some metals, such as Se and Mn, are both nutritional and toxicological metals due to tolerable upper intake limits (i.e. Mn, adults, 11 mg/day; Se, adults, 400 μg/day) being relatively low compared to some exposure levels in humans. Although calcium (40Ca, 2+) and magnesium (24Mg, 2+) are hard acids and may not bind effectively to thiols, they are abundant in plants and could be present as PyC complexes (22). These Ca and Mg forms were included in the current database and additional forms for potassium (39K, 1+) and sodium (23Na, 1+) could be included in future database iterations.

PyC-metal complexes

PyC-metal complexes including one or two metals for 2+ ions and Se (2-) were calculated by subtracting 2H and 4H, respectively, from base elemental formulas to account for the loss of thiol protons during metal binding. For selenium, the calculations were based upon experimental results demonstrating that selenite reacts with two PyC2 to create a bound PyC2-Se and an oxidized (S-S)PyC2. The product was shown to be the selenotrisulfide, -S-Se-S- (23), where the selenium of selenite was reduced to the formal 2+ oxidation state in binding to two thiolates. Ag (1+) one-metal and two-metal forms were calculated by subtracting 1H per metal addition. For two metal (2+) bound forms, at least four thiols would be needed. Therefore, calculations for two metal (2+) forms were completed for PyC4 and larger, one-disulfide forms for PyC6 and larger, two-disulfide forms for PyC8 and larger and three-disulfide forms for PyC10 and PyC11. Two metal complexes were calculated for all possible combinations of included metals, recognizing that the precise structure of the complexes cannot be predicted based upon these calculations. A larger number of metals and metal combinations could bind to longer chain lengths (up to 5 in PyC10 or PyC11 forms). However, these were not included in this PyCDB version because of the large number of combinations and limited evidence for such complex forms.
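The length limits listed above follow from a simple thiol budget: a PyC with n repeat units has n Cys thiols, each disulfide consumes two of them, and each 2+ metal ion needs two more. The Python sketch below expresses that rule. The function names `is_allowed` and `min_length` are hypothetical, and the sketch covers only divalent binding (Ag, which displaces one proton per ion, is not modeled here).

```python
def is_allowed(n, disulfides, divalent_metals):
    """True if a PyC with n Cys has enough thiols for the requested form."""
    thiols_needed = 2 * disulfides + 2 * divalent_metals
    return thiols_needed <= n


def min_length(disulfides, divalent_metals, shortest=2, longest=11):
    """Shortest PyC length that can carry the form, or None if none can."""
    n = max(shortest, 2 * disulfides + 2 * divalent_metals)
    return n if n <= longest else None


# Two 2+ metals with 0, 1, 2 or 3 disulfides.
for m in range(4):
    print(m, "disulfide(s): PyC", min_length(m, 2), "or longer")
```

For two bound 2+ ions this gives minimum lengths of PyC4, PyC6, PyC8 and PyC10 for zero to three disulfides, in agreement with the ranges stated in the text.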

For all two-metal PyC-metal complexes, bridging sulfurs, as occur in iron-sulfur clusters, are possible for most of the metals included in this database iteration (24). We referred to these as sulfido and disulfido forms. The elemental formulas were calculated by adding either one or two sulfurs to elemental formulas for two metal complexes. As selenide may form similar complexes, these were calculated by addition of one selenium to the elemental compositions, referred to as the selenido form. Diselenido forms will be included in future database versions if disulfido forms are found.
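As a minimal sketch of the bookkeeping described in this section, the Python example below derives metal-bound and bridged elemental compositions from a base PyC composition. The function name `bind_metals` and the `PROTONS_LOST` table are hypothetical names introduced for illustration. The proton counts follow the text (2H per 2+ ion or Se, 1H per Ag). Thiol availability is not checked here.

```python
from collections import Counter

# Thiol protons displaced per bound ion, following the rules in the text.
PROTONS_LOST = {
    "Cd": 2, "Co": 2, "Cu": 2, "Fe": 2, "Pb": 2, "Mn": 2, "Hg": 2,
    "Ni": 2, "Zn": 2, "Ca": 2, "Mg": 2, "Se": 2,
    "Ag": 1,
}

# Atoms added for bridged two-metal forms.
BRIDGES = {"sulfido": ("S", 1), "disulfido": ("S", 2), "selenido": ("Se", 1)}


def bind_metals(base, metals, bridge=None):
    """Element counts for a PyC carrying one or two metals.

    base   -- dict of element counts for the metal-free PyC
    metals -- list of one or two element symbols
    bridge -- None, 'sulfido', 'disulfido' or 'selenido' (two-metal forms only)
    """
    if not 1 <= len(metals) <= 2:
        raise ValueError("only one- and two-metal forms are handled")
    counts = Counter(base)
    for metal in metals:
        counts[metal] += 1
        counts["H"] -= PROTONS_LOST[metal]
    if bridge is not None:
        if len(metals) != 2:
            raise ValueError("bridged forms require two metals")
        element, number = BRIDGES[bridge]
        counts[element] += number
    return dict(counts)


pyc2_gly = {"C": 18, "H": 29, "N": 5, "O": 10, "S": 2}
pyc4_gly = {"C": 34, "H": 53, "N": 9, "O": 18, "S": 4}

print(bind_metals(pyc2_gly, ["Cd"]))                          # 27 H, 1 Cd
print(bind_metals(pyc4_gly, ["Zn", "Zn"], bridge="sulfido"))  # 49 H, 5 S, 2 Zn
```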

Calculation of monoisotopic masses

Monoisotopic masses were calculated using the elemental formulas of the predicted compounds. Monoisotopic mass values were generated for each PyC and PyC-metal complex using a modification of the R package, OrgMassSpecR. This R package allows for automated calculations of monoisotopic masses using elemental formulas. Modifications were made to the functions ListFormula and MonoisotopicMass to include all elements of interest. Monoisotopic mass calculations were completed using the National Institute of Standards and Technology exact masses (rounded to eight decimal places) for the most abundant isotopes (25).
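The calculation itself is a weighted sum of isotope masses over the elemental formula. The Python sketch below illustrates it and is a stand-in for, not a port of, the modified OrgMassSpecR functions used for the database. The names `parse_formula` and `monoisotopic_mass` are hypothetical, and only a few elements are included. The isotope masses are NIST values rounded to eight decimal places and should be verified against the current NIST table before reuse.

```python
import re

# Most abundant isotope of each element (12C, 1H, 14N, 16O, 32S, 114Cd, 64Zn),
# masses in unified atomic mass units, rounded to eight decimals.
ISOTOPE_MASS = {
    "C": 12.00000000,
    "H": 1.00782503,
    "N": 14.00307400,
    "O": 15.99491462,
    "S": 31.97207117,
    "Cd": 113.90336509,
    "Zn": 63.92914201,
}


def parse_formula(formula):
    """Turn 'C18H27N5O10S2Cd' into {'C': 18, 'H': 27, ..., 'Cd': 1}."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


def monoisotopic_mass(formula):
    """Sum isotope masses over all atoms in the formula (neutral molecule)."""
    return sum(ISOTOPE_MASS[el] * k for el, k in parse_formula(formula).items())


print(round(monoisotopic_mass("C18H29N5O10S2"), 4))    # PyC2-Gly
print(round(monoisotopic_mass("C18H27N5O10S2Cd"), 4))  # PyC2-Gly-Cd
```

Adding a proton mass to these neutral masses gives values consistent with the predicted [M + H] entries for PyC2-Gly and PyC2-Gly-Cd in Table 2.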

PyCDB web interface

The PyCDB web tool was developed using the shiny, shinyBS and DT packages in R. The web interface is maintained on a Shiny server. Users can enter their experimental masses directly in a text box or upload them in ‘.csv’ or ‘.txt’ format to perform accurate mass matching within the PyCDB web tool. Using the get_mz_by_monoisotopicmass function in xMSannotator, the mass-to-charge ratio (m/z) of each PyC and PyC-metal complex was calculated for adducts of interest by adding the mass of the respective adduct (e.g. M + H, M + 2H) and dividing by the charge state (z = 1, 2, 3) of the adduct (26). For accurate mass matching with the web tool, there is an option to include annotations only for elemental formulas that pass the nitrogen (N), oxygen (O), phosphorus (P) and sulfur (S) to carbon (NOPS) ratio check (27). The NOPS check filters out elemental formulas that fall outside the most common ratios of N, O, P and S atoms to carbon. The user-defined input parameters are passed to the back-end R function to find PyC/PyC-metal matches. Once processing is complete, a table of PyC/PyC-metal matches is available in the web tool or for download as a .csv file.
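For protonated adducts, the m/z calculation described above reduces to one line. The Python sketch below is illustrative and does not reproduce the xMSannotator function. The name `mz_protonated` is hypothetical, and only [M + zH] adducts are covered.

```python
PROTON_MASS = 1.00727646  # mass of H+ in unified atomic mass units


def mz_protonated(monoisotopic_mass, charge=1):
    """m/z of the [M + zH] adduct: (M + z * proton mass) / z."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return (monoisotopic_mass + charge * PROTON_MASS) / charge


pyc2_gly = 539.13558457  # neutral monoisotopic mass of C18H29N5O10S2

print(round(mz_protonated(pyc2_gly, 1), 4))  # [M + H]
print(round(mz_protonated(pyc2_gly, 2), 4))  # [M + 2H]
```

The singly protonated value rounds to 540.1429, the predicted mass listed for PyC2-Gly in Table 2.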

Using the PyCDB locally

The user has the option of using the full database locally with existing tools or R packages, such as xMSannotator (26). With the xMSannotator multilevelannotation function, annotation criteria are available in addition to those available via the PyCDB web interface. The output of multilevelannotation includes a confidence level (i.e. none, low, medium, high) for annotated metabolites. Additional criteria include retention time clustering, hydrogen/carbon ratio checks, adduct requirements specified by the user for high confidence scores (e.g. M + H for high confidence), and abundance ratio checks for isotopes and multiply charged adducts.

Results

PyCDB content

The current database includes 46 260 unique elemental formulas for 240 PyC and 46 020 PyC-metal complexes. Information available for each complex includes the molecular formula, monoisotopic mass, isotope, PyC type (see Table 1), number of repeat units (e.g. PyC2 to PyC11) and information on bound metals (including type and number). With increasing PyC length, the number of compounds in the database increases due to the higher number of metal-binding sites in longer peptides (Supplementary Information, Figure S-2).

PyCDB website implementation

The database is available in a user-friendly form from https://kuppal.shinyapps.io/pycdb/. As seen in Figure 3, users can select specific search criteria after uploading their experimental masses. Search options include selecting a subset of the 13 metals, adducts of interest and specific PyC lengths or types (i.e. terminal amino acid). The ‘NOPS check’ option can help users filter out less likely matches based on the ratio of NOPS atoms to carbon in the chemical formula (27). The user can also define the mass error tolerance in parts per million (ppm; e.g. 5 ppm) for the accurate mass search. As described above, the user-defined input parameters are used by the back-end R function to find PyC matches. The full database is also available for download at https://s3.amazonaws.com/phytochelatindatabase/full_version_PyCDB_20180821.csv and can be used with other annotation tools such as xMSannotator by users interested in additional annotation criteria (26).
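The accurate mass search amounts to comparing each query m/z with each database m/z and keeping pairs whose relative difference is within the chosen ppm tolerance. The Python sketch below shows that comparison under simple assumptions. It is not the back-end R function, the names `ppm_error` and `annotate` are hypothetical, and the small reference dictionary holds three predicted [M + H] values from Table 2 rather than the full database.

```python
def ppm_error(observed, theoretical):
    """Signed mass error in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


def annotate(query_mzs, reference, tol_ppm=5.0):
    """Return (query m/z, name, ppm error) for every match within tolerance."""
    hits = []
    for mz in query_mzs:
        for name, ref_mz in reference.items():
            error = ppm_error(mz, ref_mz)
            if abs(error) <= tol_ppm:
                hits.append((mz, name, round(error, 1)))
    return hits


# Predicted [M + H] values taken from Table 2.
reference = {
    "(S-S)PyC2-Gly": 538.1272,
    "PyC2-Gly": 540.1429,
    "PyC2-Gly-Cd": 652.0306,
}

queries = [538.1254, 652.0346, 300.0000]
print(annotate(queries, reference, tol_ppm=5.0))
print(annotate(queries, reference, tol_ppm=10.0))
```

With a 5 ppm tolerance only the 538.1254 query is annotated; widening the tolerance to 10 ppm also captures 652.0346, which illustrates how the tolerance setting trades sensitivity against false matches.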

Figure 3. Screenshots of the web version of the PyCDB. Query masses of interest can be entered manually or uploaded from a .csv or .txt file. The example search shown uses the ‘Load Example’ button with the default search settings, with the output shown below. The output can be reviewed on the webpage or downloaded as a .csv file for future use. The webpage also provides an option to ‘Download Whole Database’ for use with other annotation tools such as the R package, xMSannotator.

Example: PyC detection in onions

Samples from eight onions were analyzed using HRM. Detected metabolites were defined by accurate mass (m/z), retention time (seconds) and intensity profiles of mass spectral features. Details of the sample preparation and analysis are described in the Supplementary Information. A total of 19 270 features were detected on the C18 column. The feature table was analyzed in two ways, targeting a subset of the database (Supplementary Information). First, the feature table was annotated using xMSannotator with the custom database option. The xMSannotator code used for the analysis is provided in the Supplementary Information. A total of 628 features were annotated using the PyCDB subset. The feature table was also analyzed in the PyCDB web tool using the same parameters as xMSannotator but without the NOPS check. The PyCDB web tool uses a simple annotation function, which does not provide confidence scores. This resulted in 845 annotated features.

Using selection criteria based on high confidence matches from xMSannotator and high intensity features, features were selected for further validation with collision-induced dissociation mass spectrometry (MS/MS). The feature m/z 538.1270 was matched to (S-S)PyC2-Gly (C18H27N5O10S2) [M + H]. MS/MS of this m/z target was completed to confirm the identity (see Figure S-1). Many annotated PyC-metal complexes from the onion data were at too-low intensity for collision-induced dissociation analysis. PyC-metal complexes formed using chemical standards and selected metals (see Supplementary Information) showed that experimental masses were consistent with predicted masses. Additional details of metabolite validation and PyC-metal complex collision-induced dissociation are provided in the Supplementary Information (Figure S-1 and Figure S-3 to S-8). Examples are provided along with literature data for PyCs and PyC-metal complexes in Table 2 (6,18,19,28–30).

Table 2. Comparisons of predicted masses in database with experimental masses of PyCs identified by mass spectrometry

| PyC | Predicted mass (m/z) | Experimental mass (m/z) | Mass deviation (ppm) | Adduct |
| --- | --- | --- | --- | --- |
| Unbound PyCs | | | | |
| (S-S)PyC2-Gly [a] | 538.1272 | 538.1254 [a] | 3.3 | M + H |
| PyC2-Gly [a,b] | 540.1429 | 540.1437 [a], 540.1 (18), 540.1430 (6), 540.0 (28) | 1.5, -, 0.1, - | M + H |
| PyC3-Gly [a,b] | 772.1946 | 772.1955 [a], 772.2 (18), 772.1948 (6), 772.0 (28) | 1.2, -, 0.3, - | M + H |
| PyC4-Gly [a,b] | 1004.2464 | 1004.2436 [a], 1004.2 (18), 1004.2458 (6), 1004.0 (28) | 2.8, -, 0.6, - | M + H |
| PyC5-Gly [b] | 1236.2982 | 1236.2 (18) | - | M + H |
| PyC6-Gly [b] | 1468.35 | 1468.2 (18) | - | M + H |
| PyC2-Ser [b] | 570.1534 | 570.1538 (6) | 0.7 | M + H |
| PyC3-Ser [b] | 802.2052 | 802.206 (6) | 1 | M + H |
| PyC4-Ser [b] | 1034.257 | 1034.2577 (6) | 0.7 | M + H |
| PyC2-Gln [b] | 611.18 | 611.1802 (6) | 0.3 | M + H |
| PyC3-Gln [b] | 843.2318 | 843.2328 (6) | 1.2 | M + H |
| PyC2-Glu [b] | 612.164 | 612.1648 (6) | 1.3 | M + H |
| PyC3-Glu [b] | 844.2158 | 844.2167 (6) | 1.1 | M + H |
| PyC4-Glu [b] | 1076.2675 | 1076.2686 (6) | 1 | M + H |
| PyC-metal complexes | | | | |
| PyC2-Gly-Hg [b] | 740.0979 | 740 (29), 740.1 (30) | - | M + H |
| PyC3-Gly-Hg [b] | 972.1496 | 972 (29), 972.1 (30) | - | M + H |
| PyC4-Gly-Hg [b] | 1202.1858 | 1202 (29), 1202.2 (30) | - | M + H |
| PyC4-Gly-Hg(2) [b] | 1404.1564 | 1404 (29), 1404.2 (30) | - | M + H |
| PyC2-Gly-Cd [a] | 652.0306 | 652.0346 [a] | 6.1 | M + H |
| PyC3-Gly-Cd [a,b,c] | 884.0824 | 884.0710 [a,c], 884.1 (18) | 12.9, - | M + H |
| PyC4-Gly-Cd [a,c] | 1116.1341 | 1116.1392 [a,c], 1116.1 (18) | 4.6, - | M + H |
| PyC4-Gly-Cd(2) [a] | 1228.0218 | 1228.0279 [a] | 5 | M + H |
| PyC5-Gly-Cd [b] | 1348.1859 | 1348.2 (18) | - | M + H |
| PyC2-Pb [b] | 746.1039 | 746.1034 (19) | 0.7 | M + H |
| PyC3-Pb [b] | 978.1556 | 978.1559 (19) | 0.3 | M + H |
| PyC4-Pb [b] | 1210.2074 | 1210.1986 (19) | 7.3 | M + H |
| PyC4-Pb(2) [b] | 1416.1684 | 1416.1556 (19) | 9 | M + H |
| PyC2-Zn [a,b,c] | 602.0564 | 602.0560 [a,c], 602.0544 (19) | 0.7, 3.3 | M + H |
| PyC3-Zn [a,c] | 834.1081 | 834.1136 [a,c] | 6.6 | M + H |
| PyC4-Zn [a,c] | 1066.1599 | 1066.1608 [a,c] | 0.8 | M + H |
| PyC4-Zn(2) [a,c] | 1128.0734 | 1128.0763 [a,c] | 2.6 | M + H |
| PyC2-Mn [a] | 593.0653 | 593.0665 [a] | 2 | M + H |
| PyC3-Mn [a] | 825.1170 | 825.1251 [a] | 9.8 | M + H |
| PyC4-Mn [a] | 1057.1688 | 1057.1672 [a] | 1.5 | M + H |
| PyC4-Mn(2) [a] | 1110.0912 | 1110.0913 [a] | 0.1 | M + H |

Numbers in parentheses are reference citations.

[a] Compared with standards; see Supplementary Information.

[b] Previously identified PyC derivative with mass as reported.

[c] Data not shown.

Discussion

Future directions

The database is open source and can be refined as plant and food metabolomics data are compared to the predicted complexes. Such modifications can include additional metals of nutritional and toxicological significance such as molybdenum and arsenic. Although selenocysteine (Sec) is not common in plants, Sec may form in PyCs under high-selenium conditions and may be of interest to explore. Other metals and metal oxidation states are possible, and the complex coordination chemistries of the metals will need to be considered for inclusion. To aid in confirmation of identities, additional methods, such as metal removal or ion mobility spectrometry-mass spectrometry, may be needed to address the low abundance of PyC-metal complexes in biologic materials. Although only up to two metals were considered in the current database iteration, PyC binding of up to five metals for longer forms is chemically possible and could be explored. Finally, a future iteration of the PyCDB could include an in silico fragmentation tool to predict MS/MS spectra of PyC-metal complexes. Although peptide fragmentation tools are available, predicted fragmentation for PyC-metal complexes will need to consider metal interactions with O- and N-containing carboxy and amino groups as well as the sulfido, disulfido and selenido PyC forms included in the database.

In addition to its potential applications for understanding metal homeostasis in plants and the absorption of nutritional and toxicological metals from plant-derived foods, the PyCDB could also be useful for understanding metal-dependent processes such as nutritional immunity. Nutritional immunity is the process by which a host controls access to micronutrients to protect against bacterial infections (31,32), and PyCs could contribute to mechanisms of nutritional immunity because of their metal-binding characteristics. Thus, the PyCDB has the potential to be a useful resource for diverse in vivo and in vitro investigations of PyCs and PyC-metal complexes.

Conclusions

The range of potential PyCs and PyC-metal complexes in plants and food products is extensive and diverse. The PyCDB provides a foundational resource for research efforts to characterize PyC profiles. Here we demonstrate that PyCs can be detected and validated in the edible portion of a commonly consumed plant food, onion. Additionally, PyCs and PyC-metal complexes formed in vitro are detected at the masses predicted in the database. Future database versions can include additional PyC forms and validated compounds. Given the significant role of PyCs in binding metals of toxicological and nutritional significance, this database provides a resource to improve understanding of PyCs in metal homeostasis and metal bioavailability in plants and plant-derived foods consumed by animals.

Funding

National Institute of Diabetes and Digestive and Kidney Diseases (T32 DK007734); National Institute of Environmental Health Sciences (R01 ES023485, U2C ES030163); and National Institutes of Health, Office of the Director (S10 OD018006).

Conflict of interest. None declared.

Database URL: https://kuppal.shinyapps.io/pycdb/

References

1. Cobbett, C. and Goldsbrough, P. (2002) Phytochelatins and metallothioneins: roles in heavy metal detoxification and homeostasis. Annu Rev Plant Biol, 53, 159–182.

2. Cobbett, C.S. (2000) Phytochelatins and their roles in heavy metal detoxification. Plant Physiol, 123, 825–832.

3. Hirata, K., Tsuji, N. and Miyamoto, K. (2005) Biosynthetic regulation of phytochelatins, heavy metal-binding peptides. J Biosci Bioeng, 100, 593–599.

4. Rauser, W.E. (1995) Phytochelatins and related peptides. Structure, biosynthesis, and function. Plant Physiol, 109, 1141–1149.

5. Rauser, W.E. (1990) Phytochelatins. Annu Rev Biochem, 59, 61–86.

6. Mou, R.X., Cao, Z.Y., Lin, X.Y. et al. (2016) Characterization of the phytochelatins and their derivatives in rice exposed to cadmium based on high-performance liquid chromatography coupled with data-dependent hybrid linear ion trap orbitrap mass spectrometry. Rapid Commun Mass Spectrom, 30, 1891–1900.

7. Wood, B.A. and Feldmann, J. (2012) Quantification of phytochelatins and their metal(loid) complexes: critical assessment of current analytical methodology. Anal Bioanal Chem, 402, 3299–3309.

8. Kubota, H., Sato, K., Yamada, T. et al. (2000) Phytochelatin homologs induced in hairy roots of horseradish. Phytochemistry, 53, 239–245.

9. Mehra, R.K., Kodati, V.R. and Abdullah, R. (1995) Chain length-dependent Pb(II)-coordination in phytochelatins. Biochem Biophys Res Commun, 215, 730–736.

10. Marentes, E. and Rauser, W.E. (2007) Different proportions of cadmium occur as Cd-binding phytochelatin complexes in plants. Physiol Plant, 131, 291–301.

11. Bardarov, K., Naydenov, M. and Djingova, R. (2015) HPLC-HRMS method for fast phytochelatins determination in plants. Application to analysis of Clinopodium vulgare L. Talanta, 142, 20–27.

12. Mendoza-Cozatl, D.G., Butko, E., Springer, F. et al. (2008) Identification of high levels of phytochelatins, glutathione and cadmium in the phloem sap of Brassica napus. A role for thiol-peptides in the long-distance transport of cadmium and the effect of cadmium on iron translocation. Plant J, 54, 249–259.

13. Soltow, Q.A., Strobel, F.H., Mansfield, K.G. et al. (2013) High-performance metabolic profiling with dual chromatography-Fourier-transform mass spectrometry (DC-FTMS) for study of the exposome. Metabolomics, 9, S132–S143.

14. Go, Y.M., Walker, D.I., Liang, Y. et al. (2015) Reference standardization for mass spectrometry and high-resolution metabolomics applications to exposome research. Toxicol Sci, 148, 531–543.

15. Go, Y.M., Uppal, K., Walker, D.I. et al. (2014) Mitochondrial metabolomics using high-resolution Fourier-transform mass spectrometry. Methods Mol Biol, 1198, 43–73.

16. Walker, D.I., Mallon, C.T., Hopke, P.K. et al. (2016) Deployment-associated exposure surveillance with high-resolution metabolomics. J Occup Environ Med, 58, S12–S21.

17. Chen, L.Q., Yang, L.M. and Wang, Q.Q. (2009) In vivo phytochelatins and Hg-phytochelatin complexes in Hg-stressed Brassica chinensis L. Metallomics, 1, 101–106.

18. Yen, T.Y., Villa, J.A. and DeWitt, J.G. (1999) Analysis of phytochelatin-cadmium complexes from plant tissue culture using nano-electrospray ionization tandem mass spectrometry and capillary liquid chromatography/electrospray ionization tandem mass spectrometry. J Mass Spectrom, 34, 930–941.

19. Scheidegger, C., Suter, M.J., Behra, R. et al. (2012) Characterization of lead-phytochelatin complexes by nano-electrospray ionization mass spectrometry. Front Microbiol, 3, 41.

20. Jones, D.P., Coates, R.J., Flagg, E.W. et al. (1992) Glutathione in foods listed in the National Cancer Institute's Health Habits and History Food Frequency Questionnaire. Nutr Cancer, 17, 57–75.

21. Haas, K.L. and Franz, K.J. (2009) Application of metal coordination chemistry to explore and manipulate cell biology. Chem Rev, 109, 4921–4960.

22. Pearson, R.G. (1963) Hard and soft acids and bases. J Am Chem Soc, 85, 3533–3539.

23. Spain, S.M. and Rabenstein, D.L. (2004) Characterization of the selenotrisulfide formed by reaction of selenite with end-capped phytochelatin-2. Anal Bioanal Chem, 378, 1561–1567.

24. Vahrenkamp, H. (1975) Sulfur atoms as ligands in metal-complexes. Angew Chem Int Edit, 14, 322–329.

25. Coursey, J.S., Schwab, D.J., Tsai, J.J. et al. (2015) Atomic weights and isotopic compositions. National Institute of Standards and Technology, Gaithersburg, MD, 2017.

26. Uppal, K., Walker, D.I. and Jones, D.P. (2017) xMSannotator: an R package for network-based annotation of high-resolution metabolomics data. Anal Chem, 89, 1063–1067.

27. Kind, T. and Fiehn, O. (2007) Seven golden rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry. BMC Bioinformatics, 8, 105.

28. Vacchina, V., Chassaigne, H., Oven, M. et al. (1999) Characterisation and determination of phytochelatins in plant extracts by electrospray tandem mass spectrometry. Analyst, 124, 1425–1430.

29. Spisso, A.A., Cerutti, S., Silva, F. et al. (2014) Characterization of Hg-phytochelatins complexes in vines (Vitis vinifera cv Malbec) as defense mechanism against metal stress. Biometals, 27, 591–599.

30. Dago, A., Arino, C., Diaz-Cruz, J.M. et al. (2014) Analysis of phytochelatins and Hg-phytochelatin complexes in Hordeum vulgare plants stressed with Hg and Cd: HPLC study with amperometric detection. Int J Environ Anal Chem, 94, 668–678.

31. Diaz-Ochoa, V.E., Jellbauer, S., Klaus, S. et al. (2014) Transition metal ions at the crossroads of mucosal immunity and microbial pathogenesis. Front Cell Infect Microbiol, 4, 2.

32. Hood, M.I. and Skaar, E.P. (2012) Nutritional immunity: transition metals at the pathogen-host interface. Nat Rev Microbiol, 10, 525–537.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.