The Halophile Protein Database Open Access

pI or isoelectric point is the pH at which the net charge on the protein is zero. pI can be directly affected by the reduction of disulphide bonds in the proteins. The molecular weight is the elementary biophysical parameter and has direct correlation with the volume of the molecule. It influences the protein structure, which is functionally very important. The difference between the total number of positively (Arg + Lys) and negatively (Asp + Glu) charged amino acids in the protein gives the net charge of a protein. The pattern of hydrophobicity and net charge on the protein represents a unique structural feature of the proteins ( 20 ).

The half-life of a protein is defined as the time required for half of the total amount of protein in a cell to disappear after its synthesis. The in vivo stability of the protein is largely determined by the amino acids present at N-terminal of the protein and is given by the N-end rule ( 25–27 ). The instability index is an indicator of stability of a protein in vitro . The proteins with instability index smaller than 40 are predicted as stable, whereas, a value above 40 indicates instability of the protein ( 28 ) ( http://web.expasy.org/protparam/protparam-doc.html ). The formula of instability index (II) is as follows:

i=L-1
II = (10/L) * SumDIWV(x[i]x[i+1])
i=1
where: L is the length of sequence
DIWV(x[i]x[i+1]) is the instability weight value for the dipeptide starting in position i.

The relative volume occupied by aliphatic side chains (alanine, valine, isoleucine and leucine) is defined as the aliphatic index of a protein. The aliphatic index may influence thermostabiltiy of globular proteins. The sum of hydropathy values of all the amino acids, divided by the number of residues in the protein sequence gives the GRAVY value.

Database architecture

In order to store the information about protein properties of different strains of halophilic archaea/bacteria, open source database software MySQL (version 5.1.3.6) was utilized. The data is stored in the form of associated tables, which also follows Relational Database Management System (RDBMS) concepts. MySQL is feature-rich database software that provides speedy data access, ease of use, portability and also supports most of ANSI SQL commands. The data consistency and non-redundancy were maintained by employing normalization techniques on the developed database. HTML and PHP were used to render a dynamic web interface and the appropriate database connectivity techniques were utilized for quick and easy information retrieval. The viewing of the data is freely available along with a facility to download data. This web application has been hosted using an open source WAMP Server (version 2.0i, windows web development environment) which also provides multiuser access facility. WAMP server allows hosting web applications developed using PHP and MySQL over Apache2 web server. Figure 2 depicts the architecture of HProtDB.

Figure 2.

HProtDB architecture.

The spectrum of the database comprises of database tables for user management, protein, biochemical and biophysical properties of proteins. Besides, fields of the tables cover details of all attributes of the concerned parameter. A primary key in each table is identified for uniquely defining a record. Similarly, the foreign keys were identified from other tables for setting relationship among different entities. Some of the tables were master tables, which were meant for providing the real world values to fields in different tables, while building the queries and presenting the reports.

Figure 3 shows the Data Flow Diagram (DFD) of the HProtDB. The whole system has been depicted in such a way so that the continuity of information flow should not be lost at the next level. This DFD shows all the processes together with the data stores.

Figure 3.

Data flow diagram.

The home page of the database is depicted in Figure 4 . The different tables on the home page provide links to general information, such as protein, amino acids, microbes and other modules related to data entry and retrieval. The search facility ( Figure 5 ) enables the user to search the biochemical and physical properties of the desired protein either through accession number or protein names given in the dropdown list. The user has to select the desired protein, and subsequently all information related to the protein gets extracted from the database and displayed on the screen. The data retrieval option on the home page also provides the user to search for any specific halophilic archaea/bacteria records. This option provides the list of strains and clicking on a particular strain gives the protein and protein properties. In this way, user can access any or all 21 different strains of halophilic archaea/bacteria ( Figures 6–8 ).

Figure 4.

Screenshot of the Halophile Protein Database (HProtDB) home page.

Figure 5.

Search page of HProtDB.

Figure 6.

Snapshot of different strains list.

Figure 7.

Snapshot of protein names of specific strains.

Figure 8.

Snapshot of biochemical/biophysical properties of protein.

Results and discussion

We have constructed a database which provides biochemical/biophysical properties of the proteins from halophilic archaea/bacteria. The study of these properties may lead to elucidation of mechanisms for salt tolerance. Identifying salt-tolerant proteins in halophilic bacteria and transfer of such proteins to other agriculturally important bacteria such as Rhizobium, Azotobacter, Cyanobacteria etc. will be useful from applied point of view as the engineered microbes may be able to adapt in saline conditions. The information in our database may also be useful for designing synthetic proteins with optimal physicochemical proteins which may be of use in saline conditions.

Conclusion

The HProtDB lists various physicochemical properties of the proteins of halophilic archaea/bacteria. Halophilic archaea/bacteria are excellent models for study of osmoregulatory mechanisms that permit these organisms to grow in saline environments. The information in the database might prove useful in elucidating the fundamental mechanisms for salt tolerance and for identifying the characteristics of the genes involved in salt tolerance. These may prove useful in identifying and annotating novel salt tolerant genes ( 29 ).

Funding

Funding for open access charge: Centre for Agricultural Bioinformatics, ICAR - Indian Agricultural Statistics Research Institute.

Conflict of interest . None declared.

References

Kennedy

S.P.

W.V.

Salzberg

S.L.

et al. . (

2001

)

Understanding the adaptation of Halobacterium species NRC-1 to its extreme environment through computational analysis of its genome sequence

Genome Res.

1641

–

1650

Paul

Bag

S.K.

Das

et al. . (

2008

)

Molecular signature of hypersaline adaptation: insights from genome and proteome composition of halophilic prokaryotes

Genome Biol.

R70

Pace

C.N.

(

1990

)

Conformational stability of globular proteins

Trends Biochem. Sci.

–

Horovitz

Fersht

A.R.

(

1992

)

Co-operative interactions during protein folding

J. Mol. Biol.

224

733

–

740

Dill

K.A.

(

1990

)

Dominant forces in protein folding

Biochemistry

7133

–

7155

Winter

J.A.

Christofi

Morroll

et al. . (

2009

)

The crystal structure of Haloferax volcanii proliferating cell nuclear antigen reveals unique surface charge characteristics due to halophilic adaptation

BMC Struct. Biol.

Mevarech

Frolow

Gloss

L.M.

(

2000

)

Halophilic enzymes: proteins with a grain of salt

Biophys. Chem.

155

–

164

Marqusee

Sauer

R.T.

(

1994

)

Contribution of a hydrogen bond/salt-bridge network to the stability of secondary and tertiary structures in lambda repressor

Protein Sci.

2217

–

2225

Pfeil

(

1986

)

Unfolding of proteins

In:

Hinz

H.J.

(ed).

Thermodynamic Data for Biochemistry and Biotechnology

Springer-Verlag

Berlin

, pp.

349

–

376

Google Preview

Stickle

D.F.

Presta

L.G.

Dill

K.A.

et al. . (

1992

)

Hydrogen bonding in globular proteins

J. Mol. Biol.

226

1143

–

1159

Jencks

W.P.

(

1969

)

Catalysis in chemistry and enzymology

McGraw-Hill Book Co

New York

Google Preview

Von Hippel

P.H.

Schleich

(

1969

)

The effects of neutral salts on the structure and conformational stability of macromolecules in solution

, In:

Timasheff

S.N.

Dasman

(eds).

Structure and stability of biological macro molecules

Marcel-Dekker Inc.

New York

, pp.

416

–

574

Google Preview

Abram

Gibbons

N.E.

(

1961

)

The effect of chlorides of monovalent cations, urea, detergents and heat on morphology and the turbidity of suspensions of red halophilic bacteria

Can. J. Microbiol.

741

–

750

Brown

A.D.

(

1963

)

The peripheral structures of gram-negative bacteria cation-sensitive dissolution of the cell membrane of the halophilic bacterium, Halobacterium halobium

Biochim. Biophys. Acta.

425

–

435

Brown

A.D.

(

1964

)

Aspects of bacterial response to the ionic environment

Bacterial. Rev.

296

–

329

Brown

A.D.

(

1964

)

The development of halophilic properties in bacteriol membranes by acylation

Biochim. Biophys. Acta.

136

–

142

Mevarecha

Frolowa

Glossb

L.M.

(

2000

)

Halophilic enzymes: proteins with a grain of salt

Biophys. Chem.

155

–

164

Kyte

Doolittle

R.F.

(

1982

)

A simple method for displaying the hydropathic character of a protein

J. Mol. Biol.

157

105

–

132

Larsen

(

1967

)

Biochemical aspects of extreme halophilism

Adv. Microb. Physiol.

–

132

Dao-pin

Anderson

D.E.

Baase

W.A.

et al. . (

1991

)

Structural and thermodynamic consequences of burying a charged residue within the hydrophobic core of T4 lysozyme

Biochemistry

11521

–

11529

Kushner

D.J.

Onishi

(

1966

)

Contributions of protein and lipid components to the salt response of envelopes of an extremely halophilic bacterium

J. Bacteriol.

653

–

660

Onishi

Kushner

D.J.

(

1966

)

Mechanism of dissolution of the extreme halophile Halobacterium cutiruburm

J. Bacteriol.

646

–

652

Hochstein

L.I.

Dalton

B.P.

(

1968

)

Salt specificity of a reduced nicotinamide adenine dinucleotide oxidase prepared from a halophilic bacterium

J. Bacteriol.

–

Lanyi

J.K.

(

1969

)

Studies of the electron transport chain of extremely halophilic bacteria, Salt dependence of reduced diphosphopyridine nucleotide oxidase

J. Biol. Chem.

244

2864

–

2869

Bachmair

Finley

Varshavsky

(

1986

)

In vivo half-life of a protein is a function of its amino-terminal residue

Science

234

179

–

186

Gonda

D.K.

Bachmair

Wunning

et al. . (

1989

)

Universality and structure of the N-end rule

J. Biol. Chem.

264

16700

–

16712