Gliome database: a comprehensive web-based tool to access and analyze glia secretome data

Overview of the Gliome database. (A) Data mining step to obtain the glia secretome information. The obtained protein names are converted into UniProt IDs. (B) Information is entered according to the database format so that users can browse various types of information. At this step, various external databases were used alongside ours. (C) A diagram of navigation in the Gliome database. Protein identifiers of any type are converted into the official gene name/UniProt ID.

Figure 2

The websites of the public databases used in this work

Table 1


Database	URL	Description
Google Scholar	https://scholar.google.com/	Google Scholar provides a simple way to broadly search for scholarly literature.
Scopus	https://www.scopus.com	Scopus is an extensive, multidisciplinary database of peer-reviewed literature.
PubMed	https://www.ncbi.nlm.nih.gov/pubmed/	PubMed is a free search engine to access primarily the MEDLINE database of references and abstracts on life sciences and biomedical topics.
SignalP	http://www.cbs.dtu.dk/services/SignalP/	The SignalP 5.0 server predicts the presence of signal peptides and the location of their cleavage sites in proteins.
SecretomeP	http://www.cbs.dtu.dk/services/SecretomeP/	The SecretomeP 2.0 server performs ab initio prediction of classical vs. non-classical secretion.
DisGeNet	https://www.disgenet.org/	DisGeNet is a discovery platform containing one of the largest publicly available collections of genes and variants associated with human diseases.
DAVID	https://david.ncifcrf.gov/	DAVID provides a comprehensive set of functional annotation tools for investigators to understand the biological roles of a large list of genes.
IMEx	http://imex.sourceforge.net/	IMEx is a consortium that makes a data resource, which enables the user to download, combine, visualize and analyze data in a single format from multiple resources.
PSI-MI CV	https://www.ebi.ac.uk/ols/ontologies/mi	PSI-MI CV is a structured and controlled vocabulary for the annotation of experiments concerned with PPIs.
VerSeDa	http://genomics.cicbiogune.es/VerSeDa/index.php	VerSeDa has been developed to accelerate the prediction process for whole secretomes (the full set of secreted proteins by a given organism).
UniProtKB	https://www.uniprot.org/	The UniProtKB is the central hub for the collection of functional information on proteins, with accurate, consistent and rich annotation.


DAVID, Database for Annotation, Visualization and Integrated Discovery Bioinformatics Resources; IMEx, International Molecular Exchange; PSI, Proteomics Standards Initiative; MI, Molecular Interactions; CV, controlled vocabulary; VerSeDa, Vertebrate Secretome Database; UniProtKB, UniProt Knowledgebase.

The glial secretome refers to the full set of proteins secreted by glial cells. It is currently a topic of active interest in biomarker discovery (17, 23, 24). Accordingly, proteomics studies have identified numerous secretory proteins related to various neurological disorders (19, 25–29). For instance, amyloid peptides, their precursors and tau proteins (30) have been identified as potential biomarkers for Alzheimer’s disease, and alpha-synuclein (31) and apolipoprotein H (32) for Parkinson’s disease (33). Nevertheless, glial secretome analysis faces several hurdles. First, secretome samples are often contaminated with proteins from cell debris or culture supplements, which results in false identifications. Second, it is hard to predict whether a protein is secreted, since proteins can be secreted via one of two pathways: the canonical pathway (through an N-terminal signal peptide) or the non-canonical pathway (34, 35). To overcome these limitations, several bioinformatics tools that predict secretory proteins have been developed, such as SignalP (36), SecretomeP (37), TMHMM (38) and WoLF PSORT (39). However, these tools vary in reliability and can be biased, so considerable time and effort must be invested to confirm their results. To our knowledge, no database dedicated to the glial secretome presently exists. Thus, a systematic and curated database that can manage glial secretome data is in high demand. To fill this gap, we generated the ‘Gliome’ database, a web-based tool to access and analyze glia-derived secretory proteins. The database provides manually curated information on glia-based experiments as well as the disease associations and protein–protein interactions (PPIs) of glia-secreted proteins. It integrates scattered glial secretome data and provides a comprehensive platform for glial research.

Materials and Methods

Data collection and pre-processing

The data in the Gliome database were manually obtained through a comprehensive literature search of the Google Scholar, Scopus and PubMed databases using general keywords, namely ‘glia, astrocyte, microglia, oligodendrocyte, Schwann cell, secretome and secretory protein’. To collect all the relevant data, the literature was screened using the following selection criteria: (i) main or supplementary tables containing detailed information on glial proteins identified by secretomic analysis such as LC-MS/MS analysis, ELISA or protein array; (ii) differential expression of glia-secreted proteins under specific stimulation conditions; (iii) experiments related to specific secretory proteins; and (iv) molecular/clinical validation experiments performed for the identified biomarkers (glia-secreted proteins). These criteria were used to gather information on secretome analysis, expression level under a given stimulation, experimental design and relevance to clinical use. Applying these criteria, 58 publications were selected as the data source (Figure 1). All the glial secretory proteins found through the literature search were compiled for the generation of the Gliome database (Figure 2). For ID mapping, different types of protein IDs (such as the International Protein Index) were unified to the UniProt ID using UniProtKB (40) and the ID conversion tool of the Database for Annotation, Visualization and Integrated Discovery (DAVID) Bioinformatics Resources (41). Basic information about the glial secretory proteins (signal peptide, SecretomeP value, functions and Gene Ontology functions) was annotated using the public databases VerSeDa (42), UniProtKB, PubMed, DisGeNet, Scopus and DAVID (Table 1). The signal peptide field is based on the discrimination score obtained using SignalP: Y indicates a discrimination score greater than 0.45 and marks a secretory protein, and N indicates a discrimination score less than 0.45. The SecretomeP value is the neural network score obtained using SecretomeP; non-classically secreted proteins are predicted based on a score greater than 0.5. To search for publications on the PPIs of the glial secretory proteins, all the synonyms for each protein in UniProtKB and Entrez Gene (43) were used. For PPI detection methods, we used the standard nomenclature of two highly verified resources, International Molecular Exchange (http://imex.sourceforge.net/) and the PSI-MI controlled vocabulary (https://www.ebi.ac.uk/ols/ontologies/mi). Using various combinations of protein names, their synonyms and PPI detection methods, we searched the literature for the physical interactors of glia-secreted proteins. The PubMed and Scopus databases were manually screened for experimental information related to PPIs, whereby 137 articles were identified (Figure 1).
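The two score cutoffs described above can be illustrated with a short Python sketch. It is not the code used to build Gliome; the function and field names are hypothetical, and two points are assumptions: a SignalP score of exactly 0.45 is treated as N (the text only defines values above and below the cutoff), and the SecretomeP score is consulted only when no signal peptide is predicted.

```python
from dataclasses import dataclass

SIGNALP_CUTOFF = 0.45     # discrimination score cutoff stated in the text
SECRETOMEP_CUTOFF = 0.5   # neural network score cutoff stated in the text


@dataclass
class SecretionCall:
    signal_peptide: str   # "Y" or "N", as shown in the protein summary
    non_classical: bool   # True if predicted to be non-classically secreted
    route: str            # human-readable summary of the prediction


def classify_secretion(signalp_d_score: float, secretomep_nn_score: float) -> SecretionCall:
    """Turn the two prediction scores into the Y/N annotation (hypothetical helper)."""
    # Assumption: a score exactly equal to the cutoff counts as N.
    has_signal_peptide = signalp_d_score > SIGNALP_CUTOFF
    # Assumption: SecretomeP is only consulted for proteins without a signal peptide.
    non_classical = (not has_signal_peptide) and secretomep_nn_score > SECRETOMEP_CUTOFF
    if has_signal_peptide:
        route = "classical"
    elif non_classical:
        route = "non-classical"
    else:
        route = "not predicted to be secreted"
    return SecretionCall("Y" if has_signal_peptide else "N", non_classical, route)


if __name__ == "__main__":
    # Made-up scores, for illustration only.
    for scores in [(0.80, 0.30), (0.20, 0.70), (0.20, 0.30)]:
        print(scores, classify_secretion(*scores))
```

Keeping the cutoffs as named constants makes it easy to see how the Y/N call would shift if a different threshold were chosen.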

Database architecture and web interface

The Gliome database uses MySQL database server version 5.7.22 to store and query the collected data. The web interface was implemented with Ruby on Rails version 5.1.4, a server-side web application framework written in the Ruby programming language. The system runs on Phusion Passenger application server version 5.2.3 and Nginx HTTP server version 1.12.2, hosted on an Ubuntu Linux server version 16.04.2 LTS. HTML5, CSS and JavaScript were used to build the client-side user interfaces. We also used jQuery 3.2, Semantic UI version 2.3.1 and Font Awesome version 4.7.0 to generate a user-friendly interface.

Results

The structure of the Gliome database

The Gliome database was structured to provide the following: (i) a straightforward searchable repository of glial secretory proteins and basic information about them, such as secretory pathway and functions; (ii) information across different secretome measurements; (iii) disease relevance; and (iv) prediction of the protein interactors of each glial secretory protein. Thus, the Gliome database comprises glial secretory proteins characterized in various studies and provides annotations of these proteins and predictions about their functions.

The main page and appearance of the Gliome database. (A) Main tabs, (B) quick search window, (C) protein list, (D) protein information. When a user enters a protein name, such as ‘LCN2’, in the quick search window, the page for the LCN2 protein appears. Clicking the UniProt ID opens the relevant detailed information.

Figure 3

Gliome database navigation. The main tab bar at the top of the homepage links to four major browsers: protein information (‘Proteins’), experiments (‘Experiments’), associated diseases (‘Diseases’) and PPIs (‘Interactions’).

Figure 4

Web interface

The Gliome database home page includes the database status and a brief introduction to the database along four major lines: (i) information retrieval about glia-secreted proteins, (ii) analysis of differential protein expression under specific conditions, (iii) disease associations of glia-secreted proteins and (iv) PPIs of glia-secreted proteins (Figures 2 and 3). At the top of the home page, we implemented a query window that allows users to run a quick protein search (Figure 3). To query, users first specify a gene name or UniProt ID. Entering a search term opens a table listing the entries that contain the search term in any field. When users click a UniProt ID in the table, detailed information about the queried protein appears.
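The quick search behavior, in which a term is matched against any field of an entry, can be sketched in a few lines of Python. This is only an illustration, not the Ruby on Rails code behind the site: the record layout and the `quick_search` helper are hypothetical, the rows are examples rather than Gliome content, and case-insensitive substring matching is an assumption.

```python
# Illustrative rows only; the field names are hypothetical.
RECORDS = [
    {"gene_name": "LCN2", "uniprot_id": "P80188", "species": "Human"},
    {"gene_name": "APOE", "uniprot_id": "P02649", "species": "Human"},
    {"gene_name": "GFAP", "uniprot_id": "P14136", "species": "Human"},
]


def quick_search(records, term):
    """Return every record in which any field contains the search term."""
    needle = term.strip().lower()
    if not needle:
        return []  # an empty query matches nothing in this sketch
    return [
        record
        for record in records
        if any(needle in str(value).lower() for value in record.values())
    ]


if __name__ == "__main__":
    for hit in quick_search(RECORDS, "lcn2"):
        print(hit["gene_name"], hit["uniprot_id"])
```

Because every field is scanned, the same helper serves both gene-name and UniProt ID queries.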

The Gliome database contains four main tabs designed for different purposes: ‘Proteins’, ‘Experiments’, ‘Diseases’ and ‘Interactions’ (Figure 4). It also provides instructions in the ‘Tutorial’ section. The page that appears after clicking the ‘Proteins’ tab includes a query box and the whole protein list. Users can optionally choose a species from the drop-down menu. If users enter a gene name or UniProt ID in the query box and select a species, a result table is displayed. By clicking the UniProt ID, users can browse a summary table of basic protein information, including the presence (Y) or absence (N) of a signal peptide, SecretomeP value, general function and GO functions. From the same page, users can access the associated experiments, diseases or protein interactions using the adjacent tabs. The ‘Experiments’ tab at the top of the main page opens a page with drop-down menus and a complete list of experiments. This page was designed to allow users to browse the experiments associated with a protein of interest or to simulate an experiment. Users can select a glial cell type and an experimental stimulus and browse the glial secretory proteins reported under the selected experimental condition. Clicking the UniProt ID leads to the protein information page, and clicking References leads to the publication in the PubMed database. The ‘Diseases’ tab provides a list of the proteins associated with a queried disease. Entering a disease name (with auto-completion) displays the protein names, UniProt IDs and References. The ‘Interactions’ page offers manually curated protein interactions for each protein and the related analytical methods.
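Browsing by glial cell type and stimulus amounts to a filtered join between a protein table and an experiment table. The sketch below shows one way such a query could look. The schema, table names, rows and reference labels are all hypothetical, and an in-memory SQLite database stands in for the MySQL server so that the example runs on its own.

```python
import sqlite3


def build_demo_db():
    """Create a tiny in-memory database with a hypothetical two-table layout."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE proteins (
            uniprot_id TEXT PRIMARY KEY,
            gene_name  TEXT NOT NULL,
            species    TEXT NOT NULL
        );
        CREATE TABLE experiments (
            id         INTEGER PRIMARY KEY,
            uniprot_id TEXT NOT NULL REFERENCES proteins(uniprot_id),
            cell_type  TEXT NOT NULL,
            stimulus   TEXT NOT NULL,
            reference  TEXT NOT NULL
        );
        """
    )
    # Made-up rows for illustration; they are not taken from Gliome.
    conn.executemany(
        "INSERT INTO proteins VALUES (?, ?, ?)",
        [("P80188", "LCN2", "Human"), ("P14136", "GFAP", "Human"), ("P02649", "APOE", "Human")],
    )
    conn.executemany(
        "INSERT INTO experiments (uniprot_id, cell_type, stimulus, reference) VALUES (?, ?, ?, ?)",
        [
            ("P80188", "astrocyte", "LPS", "ref-A"),
            ("P14136", "astrocyte", "LPS", "ref-A"),
            ("P02649", "microglia", "LPS", "ref-B"),
        ],
    )
    return conn


def proteins_for_condition(conn, cell_type, stimulus):
    """List (gene name, UniProt ID, reference) for one cell type and stimulus."""
    query = """
        SELECT p.gene_name, p.uniprot_id, e.reference
        FROM experiments AS e
        JOIN proteins AS p ON p.uniprot_id = e.uniprot_id
        WHERE e.cell_type = ? AND e.stimulus = ?
        ORDER BY p.gene_name
    """
    return conn.execute(query, (cell_type, stimulus)).fetchall()


if __name__ == "__main__":
    demo = build_demo_db()
    for row in proteins_for_condition(demo, "astrocyte", "LPS"):
        print(row)
```

Parameterized queries (the `?` placeholders) keep the user-selected cell type and stimulus out of the SQL string itself.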

Discussion

The goal of the Gliome database is to answer several important questions. (i) Is the protein of interest secreted? (ii) What is the cellular origin of the secretory protein of interest? (iii) Has the secretory protein of interest been experimentally investigated? (iv) Is the protein of interest relevant to human diseases? (v) Is there any protein that interacts with the secretory protein of interest?

Our database provides well-summarized fundamental information on glia-secreted proteins. This basic information may be useful for predicting the biological functions of novel glial proteins and for determining whether a protein of interest follows a canonical or non-canonical secretory pathway. Secretory proteins are grouped according to glial cell type and stimulation condition in the ‘Experiments’ tab so that one can easily identify the glial cell type and experimental condition from which a particular protein of interest was derived. Moreover, as the glial secretome reflects various states of the nervous system under a given condition in real time (24), it is a potentially rich source of biomarkers. Accordingly, a set of glial secretory proteins associated with diseases may be a fundamental source of disease-specific biomarkers or therapeutic targets (44). Recently, a role has been emerging for glia-derived exosome or extracellular vesicle (EV) proteins in neuron–glia communication and neurological disease (45, 46). EVs from amyotrophic lateral sclerosis (ALS) patients and animal models contained many misfolded proteins mainly released by astrocytes, implying a deleterious role of astrocytes in ALS pathology (47), and the stimulation of astrocytes with proinflammatory or anti-inflammatory cytokines results in the release of neurotoxic or neuroprotective proteins in EVs, respectively (48). Our database therefore provides information about a broad range of the glial secretome.

Furthermore, our database provides information about the PPIs of glia-secreted proteins. PPIs play a fundamental role in virtually all biological processes. Identifying the proteins that interact with specific glia-secreted proteins may deepen our understanding of the mechanisms by which glial proteins act and glia regulate the nervous system. This will lead to the discovery of mechanism-based drug targets and the optimization of treatment strategies (49). In our database, the manually curated PPI information was included together with information from public PPI databases, such as IntAct, MINT and STRING, to provide high-confidence experimental information. The Gliome database is thus equipped with more accurate PPI information on glia-secreted proteins, which may be helpful for clinical use in the future.

Funding

This research was supported by the Basic Science Research Program through the National Research Foundation, funded by the Korean government (Ministry of Science, ICT and Future Planning, MSIP) (2016M3C7A1904148). DK was supported by the Collaborative Genome Program for Fostering New Post-Genome Industry of the National Research Foundation (NRF) funded by the Ministry of Science and ICT (MSIT) (NRF-2014M3C9A3064706).

Conflict of interest. The authors declare no competing interest.

Database URL: www.gliome.org

References

1. Jha M.K., Kim J.H., Song G.J. et al. (2018) Functional dissection of astrocyte-secreted proteins: implications in brain health and diseases. Prog. Neurobiol., 162–.

2. Philips and Rothstein J.D. (2017) Oligodendroglia: metabolic supporters of neurons. J. Clin. Invest., 127, 3271–3280.

3. Jakel and Dimou (2017) Glial cells and their function in the adult brain: a journey through the history of their ablation. Front. Cell. Neurosci.

4. Colonna and Butovsky (2017) Microglia function in the central nervous system during health and neurodegeneration. Annu. Rev. Immunol., 441–468.

5. Kidd G.J., Ohno and Trapp B.D. (2013) Biology of Schwann cells. Handb. Clin. Neurol., 115–.

6. Sun L.O. and Barres B.A. (2016) Glia get neurons in shape. Cell, 165, 775–776.

7. Cope E.C. and Gould (2019) Adult neurogenesis, glia, and the extracellular matrix. Cell Stem Cell, 690–705.

8. Rasband M.N. (2016) Glial contributions to neural function and disease. Mol. Cell. Proteomics, 355–361.

9. Jha M.K., Seo, Kim J.H. et al. (2013) The secretome signature of reactive glial cells and its pathological implications. Biochim. Biophys. Acta, 1834, 2418–2428.

10. Samy Z.A., Al-Abdullah, Turcani et al. (2018) Rat astrocytes during anoxia: secretome profile of cytokines and chemokines. Brain Behav., e01013.

11. Harrell C.R., Fellabaum, Jovicic et al. (2019) Molecular mechanisms responsible for therapeutic potential of mesenchymal stem cell-derived secretome. Cells, 467.

12. da Cunha B.R., Domingos, Stefanini A.C.B. et al. (2019) Cellular interactions in the tumor microenvironment: the role of secretome. J. Cancer, 4574–4587.

13. Tien W.S., Chen J.H. and K.P. (2017) SheddomeDB: the ectodomain shedding database for membrane-bound shed markers. BMC Bioinformatics.

14. Willis C.M., Nicaise A.M., Peruzzotti-Jametti et al. (2020) The neural stem cell secretome and its role in brain repair. Brain Res., 1729, 146615.

15. Xia, Minamino, Kuwabara et al. (2019) Stem cell secretome as a new booster for regenerative medicine. Biosci. Trends, 299–307.

16. Mukherjee and Mani (2013) Methodologies to decipher the cell secretome. Biochim. Biophys. Acta, 1834, 2226–2232.

17. Hathout (2007) Approaches to the study of the cell secretome. Expert Rev. Proteomics, 239–248.

18. Park S.R., Kim J.W., Jun H.S. et al. (2018) Stem cell secretome and its effect on cellular mechanisms relevant to wound healing. Mol. Ther., 606–617.

19. Schira, Heinen, Poschmann et al. (2019) Secretome analysis of nerve repair mediating Schwann cells reveals Smad-dependent trophism. FASEB J., 4703–4715.

20. Kim J.H., Afridi, Lee W.H. et al. (2020) Proteomic examination of the neuroglial secretome: lessons for the clinic. Expert Rev. Proteomics, 207–220.

21. Robinson J.L., Feizi, Uhlen et al. (2019) A systematic investigation of the malignant functions and diagnostic potential of the cancer secretome. Cell Rep., 2622–2635.

22. Hsiao Y.C., Chu L.J., Chen J.T. et al. (2017) Proteomic profiling of the cancer cell secretome: informing clinical research. Expert Rev. Proteomics, 737–756.

23. Song, Kwon, Joo J.Y. et al. (2019) Secretomics to discover regulators in diseases. Int. J. Mol. Sci., 3893.

24. Suk (2010) Combined analysis of the glia secretome and the CSF proteome: neuroinflammation and novel biomarkers. Expert Rev. Proteomics, 263–274.

25. Osman A.M., Rodhe, Shen et al. (2019) The secretome of microglia regulate neural stem cell function. Neuroscience, 405–102.

26. Pooladi, Abad S.K. and Hashemi (2014) Proteomics analysis of human brain glial cell proteome by 2D gel. Indian J. Cancer, 159–162.

27. Han, Jin, Woo et al. (2014) Proteomic analysis of mouse astrocytes and their secretome by a combination of FASP and StageTip-based, high pH, reversed-phase fractionation. Proteomics, 1604–1609.

28. Kim W.K., Kim, Cui et al. (2014) Secretome analysis of human oligodendrocytes derived from neural stem cells. PLoS One, e84292.

29. Jha M.K., Kim J.H. and Suk (2014) Proteome of brain glia: the molecular basis of diverse glial phenotypes. Proteomics, 378–398.

30. Shaw L.M., Vanderstichele, Knapik-Czajka et al. (2009) Cerebrospinal fluid biomarker signature in Alzheimer’s disease neuroimaging initiative subjects. Ann. Neurol., 403–413.

31. Eller and Williams D.R. (2009) Biological fluid biomarkers in neurodegenerative parkinsonism. Nat. Rev. Neurol., 561–570.

32. Abdi, Quinn J.F., Jankovic et al. (2006) Detection of biomarkers with a multiplex quantitative proteomic platform in cerebrospinal fluid of patients with neurodegenerative disorders. J. Alzheimers Dis., 293–348.

33. Constantinescu, Zetterberg, Holmberg et al. (2009) Levels of brain related proteins in cerebrospinal fluid: an aid in the differential diagnosis of parkinsonian disorders. Parkinsonism Relat. Disord., 205–212.

34. Nickel (2010) Pathways of unconventional protein secretion. Curr. Opin. Biotechnol., 621–626.

35. Rothman J.E. and Orci (1992) Molecular dissection of the secretory pathway. Nature, 355, 409–415.

36. Almagro Armenteros J.J., Tsirigos K.D., Sonderby C.K. et al. (2019) SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol., 420–423.

37. Bendtsen J.D., Jensen L.J., Blom et al. (2004) Feature-based prediction of non-classical and leaderless protein secretion. Protein Eng. Des. Sel., 349–356.

38. Krogh, Larsson, von Heijne et al. (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol., 305, 567–580.

39. Horton, Park K.J., Obayashi et al. (2007) WoLF PSORT: protein localization predictor. Nucleic Acids Res., W585–W587.

40. UniProt Consortium (2019) UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res., D506–D515.

41. Huang d.W., Sherman B.T., Stephens et al. (2008) DAVID gene ID conversion tool. Bioinformation, 428–430.

42. Cortazar A.R., Oguiza J.A., Aransay A.M. et al. (2017) VerSeDa: vertebrate secretome database. Database (Oxford), 2017–.