IQdb: an intelligence quotient score-associated gene resource for human intelligence

The statistically significant enriched pathways of IQ-associated genes in the core dataset from different pathway databases

Pathway	Source	Corrected P-value*
Neuronal system	Reactome	4.28E-04
Cocaine addiction	KEGG PATHWAY	3.95E-03
Long-term potentiation	KEGG PATHWAY	9.04E-03
Dopamine degradation	BioCyc	1.88E-02
Developmental biology	Reactome	2.51E-02
Noradrenaline and adrenaline degradation	BioCyc	2.51E-02
Adrenaline and noradrenaline biosynthesis	PANTHER	2.76E-02
Arginine and proline metabolism	KEGG PATHWAY	3.79E-02
Serotonin neurotransmitter release cycle	PID Reactome	4.18E-02
Dopamine neurotransmitter release cycle	PID Reactome	4.18E-02
Neurotransmitter release cycle	PID Reactome	4.18E-02

Pathway	Source	Corrected P-value*
Neuronal system	Reactome	4.28E-04
Cocaine addiction	KEGG PATHWAY	3.95E-03
Long-term potentiation	KEGG PATHWAY	9.04E-03
Dopamine degradation	BioCyc	1.88E-02
Developmental biology	Reactome	2.51E-02
Noradrenaline and adrenaline degradation	BioCyc	2.51E-02
Adrenaline and noradrenaline biosynthesis	PANTHER	2.76E-02
Arginine and proline metabolism	KEGG PATHWAY	3.79E-02
Serotonin neurotransmitter release cycle	PID Reactome	4.18E-02
Dopamine neurotransmitter release cycle	PID Reactome	4.18E-02
Neurotransmitter release cycle	PID Reactome	4.18E-02

*The corrected P-value was calculated by Fisher exact test followed by Benjamini–Hochberg multiple testing correction using the Ingenuity Pathway Tool.

Table 1.

The statistically significant enriched pathways of IQ-associated genes in the core dataset from different pathway databases

Pathway	Source	Corrected P-value*
Neuronal system	Reactome	4.28E-04
Cocaine addiction	KEGG PATHWAY	3.95E-03
Long-term potentiation	KEGG PATHWAY	9.04E-03
Dopamine degradation	BioCyc	1.88E-02
Developmental biology	Reactome	2.51E-02
Noradrenaline and adrenaline degradation	BioCyc	2.51E-02
Adrenaline and noradrenaline biosynthesis	PANTHER	2.76E-02
Arginine and proline metabolism	KEGG PATHWAY	3.79E-02
Serotonin neurotransmitter release cycle	PID Reactome	4.18E-02
Dopamine neurotransmitter release cycle	PID Reactome	4.18E-02
Neurotransmitter release cycle	PID Reactome	4.18E-02

Pathway	Source	Corrected P-value*
Neuronal system	Reactome	4.28E-04
Cocaine addiction	KEGG PATHWAY	3.95E-03
Long-term potentiation	KEGG PATHWAY	9.04E-03
Dopamine degradation	BioCyc	1.88E-02
Developmental biology	Reactome	2.51E-02
Noradrenaline and adrenaline degradation	BioCyc	2.51E-02
Adrenaline and noradrenaline biosynthesis	PANTHER	2.76E-02
Arginine and proline metabolism	KEGG PATHWAY	3.79E-02
Serotonin neurotransmitter release cycle	PID Reactome	4.18E-02
Dopamine neurotransmitter release cycle	PID Reactome	4.18E-02
Neurotransmitter release cycle	PID Reactome	4.18E-02

*The corrected P-value was calculated by Fisher exact test followed by Benjamini–Hochberg multiple testing correction using the Ingenuity Pathway Tool.

Enrichment diseases for the 158 IQ-related genes in the core dataset

As a fundamental role of cognition, it is not surprising that the genes are consistently associated with a number of complex diseases. Although it is difficult to measure how much the IQ score may have contributed to certain diseases based on gene content, it might give a clue that helps to generate hypotheses to examine the potential role of IQ score as a risk factor in relevant disease. A quick disease analysis has revealed that the 158 genes in the core dataset are related to a broad spectrum of human diseases such as various cancers and mental disorders (Table 2). In total, 81 genes are related to psychotic and mental disorders. The mental disorders mainly include schizophrenia, autism, depression, bipolar, obsessive-compulsive disorder and Parkinson’s disease. Plenty of previous reports suggest that early-onset and adult-onset schizophrenia are associated with intellectual deficits (46, 47). However, the underlying common molecular mechanism between schizophrenia and IQ scores is still unknown. In IQdb, 37 genes related to schizophrenia are highly enriched in neurotransmitter metabolism pathways, including ‘Adrenaline and noradrenaline biosynthesis’, ‘Dopamine clearance from the synaptic cleft’ and ‘Arginine and proline metabolism’. These pathways suggest that the early-onset and adult-onset schizophrenia might be related to some compound metabolisms such as dopamine metabolism. Most interestingly, several IQ-related genes are associated with several mental disorders. For instance, SLC6A4 is associated with autistic disorder, schizophrenia, obsessive compulsive disorder, bipolar disorder, personality disorders, affective disorder, attention deficit hyperactivity disorder, suicide, Alzheimer’s disease and depression. Thus, the relationships between common IQ-associated genes and diseases are promising for future biological experiments or replication efforts to discover the underlying common pathways. In summary, IQdb is valuable in discovery of potential candidate genes, pathways and potential cross-talks between mental disorder and intelligence using comprehensive annotation and user-friendly interface. As a first effort to systematically collect and extend candidate IQ-associated genes, IQdb is also useful to better clarify the molecular mechanisms related to human intelligence.

Table 2.

The top 10 enriched diseases of IQ-associated genes in the core dataset with experimental supports

Disease	Source	Corrected P-value*
Behavior disease	FunDO	8.71E-09
Psychotic disorder	FunDO	1.42E-08
Autistic disorder	FunDO	2.98E-07
Cognitive function	GAD	9.38E-06
Schizophrenia	GAD	5.66E-05
Obsessive compulsive disorder	GAD	4.26E-04
Noonan syndrome	KEGG DISEASE	6.58E-04
Other congenital disorders	KEGG DISEASE	1.14E-03
Bipolar disorder	FunDO	3.95E-03
Congenital disorders of development	KEGG DISEASE	4.74E-03

Disease	Source	Corrected P-value*
Behavior disease	FunDO	8.71E-09
Psychotic disorder	FunDO	1.42E-08
Autistic disorder	FunDO	2.98E-07
Cognitive function	GAD	9.38E-06
Schizophrenia	GAD	5.66E-05
Obsessive compulsive disorder	GAD	4.26E-04
Noonan syndrome	KEGG DISEASE	6.58E-04
Other congenital disorders	KEGG DISEASE	1.14E-03
Bipolar disorder	FunDO	3.95E-03
Congenital disorders of development	KEGG DISEASE	4.74E-03

*The corrected P-value was calculated by Fisher exact test followed by Benjamini–Hochberg multiple testing correction using the Ingenuity Pathway Tool.

Table 2.

Open in new tab Download slide

The top 10 enriched diseases of IQ-associated genes in the core dataset with experimental supports

Disease	Source	Corrected P-value*
Behavior disease	FunDO	8.71E-09
Psychotic disorder	FunDO	1.42E-08
Autistic disorder	FunDO	2.98E-07
Cognitive function	GAD	9.38E-06
Schizophrenia	GAD	5.66E-05
Obsessive compulsive disorder	GAD	4.26E-04
Noonan syndrome	KEGG DISEASE	6.58E-04
Other congenital disorders	KEGG DISEASE	1.14E-03
Bipolar disorder	FunDO	3.95E-03
Congenital disorders of development	KEGG DISEASE	4.74E-03

Disease	Source	Corrected P-value*
Behavior disease	FunDO	8.71E-09
Psychotic disorder	FunDO	1.42E-08
Autistic disorder	FunDO	2.98E-07
Cognitive function	GAD	9.38E-06
Schizophrenia	GAD	5.66E-05
Obsessive compulsive disorder	GAD	4.26E-04
Noonan syndrome	KEGG DISEASE	6.58E-04
Other congenital disorders	KEGG DISEASE	1.14E-03
Bipolar disorder	FunDO	3.95E-03
Congenital disorders of development	KEGG DISEASE	4.74E-03

*The corrected P-value was calculated by Fisher exact test followed by Benjamini–Hochberg multiple testing correction using the Ingenuity Pathway Tool.

Interface Development of Database

All data and information in IQdb are stored in a free, fast and reliable open-source relational database MySQL on a Linux server. Web-based interface to the database is implemented in object-oriented Java, which is a platform-independent language and easy to deploy and update. All the Web applications run under a Tomcat + Apache Web server environment. Based on the JavaServer Pages (JSP) technology, dynamical Web pages for each gene in the database are generated. For genes with different evidence, the comprehensive annotation and links are provided (Figure 2A). Gene expression in various tissues and brain regions is represented in tabular format (Figure 2A). In addition, the original literature to support their association with IQ scores is also complied for the 158 genes in the core dataset. For other expanded genes, literature is compiled from the NCBI GeneRIF database (48), which may be useful for users to judge their potential roles with IQ or other cognitive processes.

Figure 2.

Web interface of IQdb. (A) The basic information in each IQ-associated gene page. (B) Query interface for text search. (C) BLAST search interface for comparing query against all sequences in IQdb. (D) Browser interface for genes in top 10 enriched pathways, top 10 enriched diseases and shared cytoband.

IQdb allows users to do text query (Figure 2B), or to run BLAST search against the sequences in IQdb (Figure 2C). To provide a powerful text-based query, six different user-friendly input forms are provided for Entrez Gene ID, pathway and disease annotation, genomic region, literature content and gene expression range in 22 tissues or brains regions. Moreover, a quick full-text search for GeneID, gene symbol or gene alias and publication is on the top right of each page, which is efficient for users to access any data in the database, especially literature-based annotations. In addition, users can browse the data in IQdb in a variety of ways, including significantly enriched pathway, related disease, reported linkage region and chromosome number (Figure 2D). Finally, for any advanced study, IQdb provides all downloadable genetic and population information in a plain text for all the collected 139 SNPs related to IQ.

Conclusions

IQdb is constructed as a free database and analysis server to enable users to rapidly search and retrieve summarized IQ-associated genes. Enrichment pathway analyses reveal that multiple signal events related to IQ-associated genes are involved in cognitive systems. Central questions should focus on integration of various signaling pathways to process information. In addition, comprehensive disease enrichment analyses interlink IQ-associated genes with many relevant cancers and mental disorders. IQdb is freely available at http://iqdb.cbi.pku.edu.cn.

Funding

This work was supported by the National High-tech 863 Program of China [grant numbers 2006AA02A312, 2008BAI64B01], the National Natural Science Foundation of China [grant number 31171270] and the National Science and Technology Infrastructure Program [grant number 2009FY120100].

Conflict of interest. None declared.

REFERENCES

Mortensen

Sorensen

Jensen

, et al. ,

IQ and mental disorder in young men

Br. J. Psychiatry

2005

, vol.

187

(pg.

407

415

)

Koenen

Moffitt

Roberts

, et al. ,

Childhood IQ and adult mental disorders: a test of the cognitive reserve hypothesis

Am. J. Psychiatry

2009

, vol.

166

(pg.

)

Kalbfleisch

Loughan

. ,

Impact of IQ discrepancy on executive function in high-functioning autism: insight into twice exceptionality

J. Autism Dev. Disord.

2011

, vol.

(pg.

390

400

)

Deary

Johnson

Houlihan

. ,

Genetic foundations of human intelligence

Hum. Genet.

2009

, vol.

126

(pg.

215

232

)

Plomin

Haworth

CMA

. ,

Genetics of high cognitive abilities

Behav. Genet.

2009

, vol.

(pg.

347

349

)

Plomin

Spinath

. ,

Intelligence: genetics, genes, and genomics

J. Pers. Soc. Psychol.

2004

, vol.

(pg.

112

129

)

Meyer

Zweig

Hinrichs

, et al. ,

The UCSC Genome Browser database: extensions and updates 2013

Nucleic Acids Res.

2013

, vol.

(pg.

D64

D69

)

Huang

, et al. ,

AutismKB: an evidence-based knowledgebase of autism genetics

Nucleic Acids Res.

2012

, vol.

(pg.

D1016

D1022

)

Stark

Breitkreutz

Chatr-Aryamontri

, et al. ,

The BioGRID Interaction Database: 2011 update

Nucleic Acids Res.

2011

, vol.

(pg.

D698

D704

)

Prasad

Goel

Kandasamy

, et al. ,

Human Protein Reference Database—2009 update

Nucleic Acids Res.

2008

, vol.

(pg.

D767

D772

)

Willis

Hogue

. ,

Searching, viewing, and visualizing data in the Biomolecular Interaction Network Database (BIND)

Curr. Protoc. Bioinformatics

2006

Chapter 8, Unit 8.9

Sun

Jia

Fanous

, et al. ,

A multi-dimensional evidence-based candidate gene prioritization approach for complex diseases-schizophrenia as a case

Bioinformatics

2009

, vol.

(pg.

2595

6602

)

Sayers

Barrett

Benson

, et al. ,

Database resources of the National Center for Biotechnology Information

Nucleic Acids Res.

2011

, vol.

(pg.

D38

D51

)

Magrane

Consortium

. ,

UniProt Knowledgebase: a hub of integrated protein data

Database (Oxford)

2011

, vol.

2011

pg.

bar009

Flicek

Amode

Barrell

, et al. ,

Ensembl 2011

Nucleic Acids Res.

2011

, vol.

(pg.

D800

D806

)

Gene Ontology Consortium. (2010) The Gene Ontology in 2010: extensions and refinements. Nucleic Acids Res., 38, D331–D335

Wiltshire

Batalov

, et al. ,

A gene atlas of the mouse and human protein-encoding transcriptomes

Proc. Natl Acad. Sci. USA

2004

, vol.

101

(pg.

6062

6067

)

Jones

Overly

Sunkin

. ,

The Allen Brain Atlas: 5 years and beyond

Nat. Rev. Neurosci.

2009

, vol.

(pg.

821

828

)

Wang

Sandberg

Luo

, et al. ,

Alternative isoform regulation in human tissue transcriptomes

Nature

2008

, vol.

456

(pg.

470

476

)

Habegger

Noisa

, et al. ,

Dynamic transcriptomes during neural differentiation of human embryonic stem cells revealed by short, long, and paired-end sequencing

Proc. Natl Acad. Sci. USA

2010

, vol.

107

(pg.

5254

5259

)

Langmead

Trapnell

Pop

Salzberg

. ,

Ultrafast and memory-efficient alignment of short DNA sequences to the human genome

Genome Biol.

2009

, vol.

pg.

R25

Mortazavi

Williams

McCue

, et al. ,

Mapping and quantifying mammalian transcriptomes by RNA-Seq

Nat. Methods

2008

, vol.

(pg.

621

628

)

Trapnell

Pachter

Salzberg

. ,

TopHat: discovering splice junctions with RNA-Seq

Bioinformatics

2009

, vol.

(pg.

1105

1111

)

Trapnell

Williams

Pertea

, et al. ,

Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation

Nat. Biotechnol.

2010

, vol.

(pg.

511

515

)

Karp

Ouzounis

Moore-Kochlacs

, et al. ,

Expansion of the BioCyc collection of pathway/genome databases to 160 genomes

Nucleic Acids Res.

2005

, vol.

(pg.

6083

6089

)

Kanehisa

Araki

Goto

, et al. ,

KEGG for linking genomes to life and the environment

Nucleic Acids Res.

2008

, vol.

(pg.

D480

D484

)

Schaefer

Anthony

Krupa

, et al. ,

PID: the Pathway Interaction Database

Nucleic Acids Res.

2009

, vol.

(pg.

D674

D679

)

Thomas

Campbell

Kejariwal

, et al. ,

PANTHER: a library of protein families and subfamilies indexed by function

Genome Res.

2003

, vol.

(pg.

2129

2141

)

Croft

O'Kelly

, et al. ,

Reactome: a database of reactions, pathways and biological processes

Nucleic Acids Res.

2011

, vol.

(pg.

D691

D697

)

Matthews

Gopinath

Gillespie

, et al. ,

Reactome knowledgebase of human biological pathways and processes

Nucleic Acids Res.

2009

, vol.

(pg.

D619

D622

)

Zhao

Chen

Gao

, et al. ,

RLEdb: a database of rate-limiting enzymes and their regulation in human, rat, mouse, yeast and E

coli. Cell Res.

2009

, vol.

(pg.

793

795

)

Zhao

. ,

PathLocdb: a comprehensive database for the subcellular localization of metabolic pathways and its application to multiple localization analysis

BMC Genomics

2010

, vol.

Suppl. 4

pg.

S13

Zhao

Chen

. ,

TSdb: a database of transporter substrates linking metabolic pathways and transporter systems on a genome scale via their shared substrates

Sci. China Life Sci.

2011

, vol.

(pg.

)

Becker

Barnes

Bright

Wang

. ,

The genetic association database

Nat. Genet.

2004

, vol.

(pg.

431

432

)

Kanehisa

Goto

Furumichi

, et al. ,

KEGG for representation and analysis of molecular networks involving diseases and drugs

Nucleic Acids Res.

2010

, vol.

(pg.

D355

D360

)

Osborne

Flatow

Holko

, et al. ,

Annotating the human genome with Disease Ontology

BMC Genomics

2009

, vol.

Suppl. 1

pg.

Feng

Flatow

, et al. ,

From disease ontology to disease-ontology lite: statistical methods to adapt a general-purpose ontology for the test of gene-ontology associations

Bioinformatics

2009

, vol.

(pg.

i63

i68

)

Hindorff

Sethupathy

Junkins

, et al. ,

Potential etiologic and functional implications of genome-wide association loci for human diseases and traits

Proc. Natl Acad. Sci. USA

2009

, vol.

106

(pg.

9362

9367

)

Maglott

Ostell

Pruitt

Tatusova

. ,

Entrez Gene: gene-centered information at NCBI

Nucleic Acids Res.

2011

, vol.

(pg.

D52

D57

)

Kanehisa

Goto

Hattori

, et al. ,

From genomics to chemical genomics: new developments in KEGG

Nucleic Acids Res.

2006

, vol.

(pg.

D354

D357

)

Nishimura

. ,

BioCarta

Biotech Softw. Internet Rep.

2001

, vol.

(pg.

117

120

)

Hermjakob

Fleischmann

Apweiler

. ,

Swissknife—'lazy parsing' of SWISS-PROT entries

Bioinformatics

1999

, vol.

(pg.

771

772

)

Stein

. ,

Using the Reactome database

Curr. Protoc. Bioinformatics

2004

Chapter 8, Unit 8.7

Feramisco

Sadreyev

Murray

, et al. ,

Phenotypic and genotypic analyses of genetic skin disease through the Online Mendelian Inheritance in Man (OMIM) database

J. Invest. Dermatol.

2009

, vol.

129

(pg.

2628

2636

)

Huang da

Sherman

Lempicki

. ,

Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists

Nucleic Acids Res.

2009

, vol.

(pg.

)

Goldberg

Fatjo-Vilas

Munoz

, et al. ,

Increased familiarity of intellectual deficits in early-onset schizophrenia spectrum disorders

World J. Biol. Psychiatry

2011

, vol.

(pg.

493

500

)

Dickson

Laurens

Cullen

Hodgins

. ,

Meta-analyses of cognitive and motor function in youth aged 16 years and younger who subsequently develop schizophrenia

Psychol. Med.

2011

(pg.

)

Cohen

Hunter

. ,

GeneRIF quality assurance as summary revision

Pac. Symp. Biocomput.

2007

(pg.

269

280

)