Abstract

Identifying protein targets for a bioactive compound is critical in drug discovery. Molecular similarity is a main approach to fish drug targets, and is based upon an axiom that similar compounds may have the same targets. The molecular structural similarity of a compound and the ligand of a known target can be gauged in topological (2D), steric (3D) or static (pharmacophoric) metric. The topologic metric is fast, but unable to represent steric and static profile of a bioactive compound. Steric and static metrics reflect the shape properties of a compound if its structure were experimentally obtained, and could be unreliable if they were based upon the putative conformation data. In this paper, we report a pharmaceutical target seeker (PTS), which searches protein targets for a bioactive compound based upon the static and steric shape comparison by comparing a compound structure against the experimental ligand structure. Especially, the crystal structures of active compounds were taken into similarity calculation and the predicted targets can be filtered according to multi activity thresholds. PTS has a pharmaceutical target database that contains approximately 250 000 ligands annotated with about 2300 protein targets. A visualization tool is provided for a user to examine the result.

Database URL: http://www.rcdd.org.cn/PTS

Introduction

For decades, the paradigm of drug discovery and development has been one-drug-for-one-target (1). Recent advances in systems biology (2) and chemical biology demonstrate that existing drugs can interact with multiple targets (3, 4). However, multi-target interactions are either unknown or insufficiently understood in most cases. There are increasing needs to predict drug targets for an agent due to growing number of bioactive compounds identified from phenotypic assays (5–7). The prediction has to be validated by experiments, such as structure biological approaches or proteomics. The in silico approaches can significantly reduce the costs and improve the performance of the experimental approaches for drug target fishing.

A drug target prediction method can be categorized into structure-based or ligand-based method. INDOCK (8) and TarFisDock (9) are typical structure-based target fishing tools using molecular docking algorithms, which rely on the target structure availability and the structure diversity of the binding pocket. However, a ligand-based target fishing approach uses the ligand-compound similarity based on topological structures (fingerprints) (10, 11), molecular shapes, pharmacophores (12) or compound activity profiles (13). The ligand-based target fishing approaches are being adopted due to the increasing availability of bioassay data (14–16). SEA (17) and SuperPred (18) are typical ligand-based approaches that use ligand databases and compound topological (2D) similarity measurements. Other methods, such as Chemmapper (19), Superimpose (20) and wwLigCSRre (21) use 3D structure similarity metric to predict protein targets. 2D and 3D similarity measurements are complimentary, and 3D similarity measurements seem capable of picking novel chemotypes (22) if the template structures were experimentally obtained.

In this work, we have implemented a pharmaceutical target seeker (PTS), which uses the experimental 3D structures of ligands with known targets to calculate the similarity of the ligand and a compound. For those ligands for which experimental structure data are not available, their energy-minimized conformations are generated for the 3D similarity calculations. The 3D similarity search engine is Weighted Gaussian Algorithm (WEGA) (23), which can take steric and pharmacophoric profile into account. The user can rule out impossible targets by setting activity thresholds in order to expedite the target fishing process. PTS contains approximately 250 000 ligands annotated with 2300 protein targets.

Materials and methods

Data preparation

The data of bioactive compounds and their targets were collected from public databases.

Target data were derived from therapeutic target database (TTD version 2015) (24) and reference (25). Through UniProt ID, ligand data and their relations with targets were extracted from UniProt (26), ChEMBL20 (27) and BindingDB (28, 29), PDBbind (version 2014) (30–32) and RCSB PDB databases.

The data were pre-processed with the following steps:

  1. removing obsolete UniProt IDs from TTD target data;

  2. removing counter ion moieties from bioactive ligands;

  3. removing compounds from ChEMBL20 data if their activity (IC50/Ki/Kd) values are greater than 50 μM;

  4. removing small compounds (heavy atoms < 6) and large compounds (MW > 1000 Da).

This resulted in 266 866 ligands associated with 2298 protein targets, 537 095 bioactivity data points, 4391 crystal structures and 16 590 related articles in the PTS built-in database (Table 1). Among the targets, 14% of them have drugs in the market, 41% of them have drug candidates under clinic trails, 40% of them have ligands under the investigations and 5% of them have compounds that were discontinued for pharmaceutical studies.
Table 1.

Statistics data of PTS

DataNumberSourceNumberAvailability(extracted)(original)Target2298TTD (2015)2875bidd.nus.edu.sg/group/cjttd/Compound266 866ChEMBL201 463 270www.ebi.ac.uk/chembl/PDB4391PDBbind (2014)10 656www.pdbbind-cn.org/Activity record537 095ChEMBL2013 520 737www.ebi.ac.uk/chembl/Reference16 590ChEMBL2059 610www.ebi.ac.uk/chembl/
Table 1.

Statistics data of PTS

DataNumberSourceNumberAvailability(extracted)(original)Target2298TTD (2015)2875bidd.nus.edu.sg/group/cjttd/Compound266 866ChEMBL201 463 270www.ebi.ac.uk/chembl/PDB4391PDBbind (2014)10 656www.pdbbind-cn.org/Activity record537 095ChEMBL2013 520 737www.ebi.ac.uk/chembl/Reference16 590ChEMBL2059 610www.ebi.ac.uk/chembl/

Similarity algorithm

The target fishing process is based upon an axiom that similar compounds may have the same targets. An in-house algorithm, WEGA, is used to compute the steric and static similarity of a ligand-compound pair. WEGA is based on the first order Gaussian approximation, which simplifies the shape density functions of the molecules by avoiding expensive higher order terms calculation.

WEGA offers three similarity calculation methods: 1) shape matching, which is only the molecular volumetric overlay, 2) feature matching, which is the pharmacophore mapping of molecule pair, 3) combination matching, which integrates the advantage of the above two aspects and is the most precise approach. The detailed method of WEAG is described in reference (23).

Webserver implementation

PTS uses a browser and server framework. Client interface was implemented in HTML and Javascript. The back-end is implemented in Golang language and MySQL database system. The molecule editor and chemical structure viewers are supported with Marvin JS and ChemDoodle web component. All tools have been summarized in Table 2.

Table 2.

The tools used for PTS implementation

ToolUseAvailabilityMarvin JSChemical structure inputmarvinjs-demo.chemaxon.com/latest/ChemDoodle WebStructure draw and 3D displayweb.chemdoodle.com/Open BabelChemical file format conversionopenbabel.org/wiki/Main_PageDiscovery Studio3D conformation generationaccelrys.com/products/collaborative-science/biovia-discovery-studio/MySQLStorage databasewww.mysql.com/JQueryForeground and background interactionjquery.com/GolangWeb server languagegolang.org/
Table 2.

The tools used for PTS implementation

ToolUseAvailabilityMarvin JSChemical structure inputmarvinjs-demo.chemaxon.com/latest/ChemDoodle WebStructure draw and 3D displayweb.chemdoodle.com/Open BabelChemical file format conversionopenbabel.org/wiki/Main_PageDiscovery Studio3D conformation generationaccelrys.com/products/collaborative-science/biovia-discovery-studio/MySQLStorage databasewww.mysql.com/JQueryForeground and background interactionjquery.com/GolangWeb server languagegolang.org/

Results

Workflow

PTS provides an intuitive interface to predict small molecule protein targets. A user can input a query molecule by uploading a file (mol/SDF format) or drawing a chemical structure with its built-in chemical structure editor (Figure 1B). PTS will generate the possible 3D conformations for the query and, employ WEGA to compute the 3D similarities of the molecular conformations and the ligand structures in PTS ligand database (Figure 1C). A typical task of PTS takes about 30–60 min, depending on the flexibility of the query compound and calculation method assigned by users. Each user’s query is automatically assigned with a Job-ID that allows the user to receive and inspect the target prediction result (Figure 1D). The result page lists the predicted targets with their common names linked to UniProt (26) database if they are available. Targets are ranked according to their score with respect to the query molecule. Thus, the most possible target is placed at the top of the list. Sometimes, multiple targets may be inferred for the query compound based on a single similar compound. If so, the order of the target presented in the table has no specific meaning regarding to prediction significance. Besides, predicted targets can be tailored by the activity threshold of a template ligand (such as 10, 20 or 50 µM) (Figure 1D).

PTS working protocol. (A) Main menu, (B) chemical structure editor, (C) ligand structure from PTS builtin ligand database, (D) predicted target list, (E) target profile and (F) query molecules docked in the binding pocket of a predicted target.
Figure 1.

PTS working protocol. (A) Main menu, (B) chemical structure editor, (C) ligand structure from PTS builtin ligand database, (D) predicted target list, (E) target profile and (F) query molecules docked in the binding pocket of a predicted target.

By clicking on the check button, a user can inspect the predicted template ligands for a query molecule (Figure 1D). In the result page, target data fields include UniProt ID, CHEMBL ID, TTD ID, type, organism, gene and biological functions (Figure 1E). The template ligands are ranked based upon their similarity values to the query molecule. The resulting targets can be tailored by setting the 3D similarity values to the query molecule (default threshold is 0.6, and ligands show low similarity with the query if below this threshold). For the template ligands with experimental structure data, the query molecular structure is superimposed with the ligands in the binding pocket of the predicted target (Figure 1F), and downloadable.

Case study 1: seeking targets for Afatinib

Afatinib is an irreversible kinase inhibitor targeting epidermal growth factor receptor (EGFR) and inhibiting tyrosine kinase auto-phosphorylation (33) to stop tumor cells growth. Afatinib is available for the first-line treatment of patient with metastatic non-small cell lung cancer. The targets predicted by PTS are listed in Table 3.

Table 3.

The predicted targets for Afatinib

RankUniProt IDTarget nameOrganismScore1P00533EGFRHomo sapiens (Human)0.742P25440Bromodomain-containing protein 2Homo sapiens (Human)0.723Q15059Bromodomain-containing protein 3Homo sapiens (Human)0.724O60885Bromodomain-containing protein 4Homo sapiens (Human)0.725P349695-hydroxytryptamine 7 receptorHomo sapiens (Human)0.726Q07820Induced myeloid leukemia cell differentiation protein Mcl-1Homo sapiens (Human)0.727P09917mRNA of human 5-lipoxygenaseHomo sapiens (Human)0.728P17948Vascular endothelial growth factor receptor 1Homo sapiens (Human)0.729P0825372 kDa type IV collagenaseHomo sapiens (Human)0.7110P24557Thromboxane-A synthasenil0.71
Table 3.

The predicted targets for Afatinib

RankUniProt IDTarget nameOrganismScore1P00533EGFRHomo sapiens (Human)0.742P25440Bromodomain-containing protein 2Homo sapiens (Human)0.723Q15059Bromodomain-containing protein 3Homo sapiens (Human)0.724O60885Bromodomain-containing protein 4Homo sapiens (Human)0.725P349695-hydroxytryptamine 7 receptorHomo sapiens (Human)0.726Q07820Induced myeloid leukemia cell differentiation protein Mcl-1Homo sapiens (Human)0.727P09917mRNA of human 5-lipoxygenaseHomo sapiens (Human)0.728P17948Vascular endothelial growth factor receptor 1Homo sapiens (Human)0.729P0825372 kDa type IV collagenaseHomo sapiens (Human)0.7110P24557Thromboxane-A synthasenil0.71

Experimental data indicate that Afatinib is an EGFR inhibitor (IC50 = 1 nM) (34). EGFR (UniProt ID: P00533) is ranked at the top of the predicted target list by PTS (Table 3). The predicted Afatinib binding poses are aligned with the native EGFR ligands as shown in Figure 2. PTS also predicted other potential targets, however, there are no evidences showing that Afatinib is strongly binding with them. The data for the alignments of Afatinib and the native ligands of these targets can be found in Supplementary Figure S1.

Afatinib (yellow) aligned with known EGFR inhibitor CHEMBL484108 (A, red) and CHEMBL482489 (B, red).
Figure 2.

Afatinib (yellow) aligned with known EGFR inhibitor CHEMBL484108 (A, red) and CHEMBL482489 (B, red).

Case study 2: seeking targets for Tamoxifen

Estrogen receptors (ERs) are well-known targets for Tamoxifen (3537). The targets predicted by PTS for tamoxifen are listed in Table 2. ERα (UniProt ID: P03372) and ERβ (UniProt ID: Q92731) are ranked as the top-1 and top-6. The predicted Tamoxifen binding poses aligned with the native ERα and ERβ ligands are depicted in Figure 3.

Tamoxifen (yellow) aligned with ERα (A, red, PDB Code: 1XQC) and ERβ (B, red, PDB Code: 2QTU) selective ligands in binding pocket.
Figure 3.

Tamoxifen (yellow) aligned with ERα (A, red, PDB Code: 1XQC) and ERβ (B, red, PDB Code: 2QTU) selective ligands in binding pocket.

In Table 4, PTS places top-10 targets. After examing the other predicted targets besides ERs, we find that CYP2D6 (38), Prostaglandin synthase (40), 3-β-hydroxysteroid-δ(8),δ(7)-isomerase (39) and Collagenase 3 (41, 42) has been experimentally reported as off-targets. In fact, 38 proteins have been experimentally validated to interact with tamoxifen or 4 H-tamoxifen (Supplementary Table S1). In the top-100 PTS predicted targets for Tamoxifen, 15% proteins are experimentally confirmed off-targets, which can be found in Supplementary Table S2.

Table 4.

Experimentally proved Tamoxifen targets and off-targets predicted by PTS

RankUniProt IDTarget nameOrganismScore1P03372ER alphaHomo sapiens (Human)0.822P040353-hydroxy-3-methylglutaryl-coenzyme A reductaseHomo sapiens (Human)0.823P08684mRNA of CYP3A4Homo sapiens (Human)0.824P23458JAK1Homo sapiens (Human)0.815P41145Kappa-type opioid receptorHomo sapiens (Human)0.816Q92731ER betaHomo sapiens (Human)0.817O14965Aurora kinase AHomo sapiens (Human)0.808Q96GD4Serine/threonine protein kinase 12Homo sapiens (Human)0.809P29597TYK2Homo sapiens (Human)0.8010P52333Tyrosine-protein kinase JAK3Homo sapiens (Human)0.8011P10635Cytochrome P450 2D6Homo sapiens (Human)0.80
Table 4.

Experimentally proved Tamoxifen targets and off-targets predicted by PTS

RankUniProt IDTarget nameOrganismScore1P03372ER alphaHomo sapiens (Human)0.822P040353-hydroxy-3-methylglutaryl-coenzyme A reductaseHomo sapiens (Human)0.823P08684mRNA of CYP3A4Homo sapiens (Human)0.824P23458JAK1Homo sapiens (Human)0.815P41145Kappa-type opioid receptorHomo sapiens (Human)0.816Q92731ER betaHomo sapiens (Human)0.817O14965Aurora kinase AHomo sapiens (Human)0.808Q96GD4Serine/threonine protein kinase 12Homo sapiens (Human)0.809P29597TYK2Homo sapiens (Human)0.8010P52333Tyrosine-protein kinase JAK3Homo sapiens (Human)0.8011P10635Cytochrome P450 2D6Homo sapiens (Human)0.80

Case study 3: validating a target for Chlorprothixene

Chlorprothixene is an old antipsychotic drug. It antagonizes dopaminergic D1 (UniProt ID: P21728) and D2 (UniProt ID: P14416) receptors in the brain to exert its antipsychotic effect (43). Chlorprothixene also antagonizes histamine H1 receptor (44). But, there is no direct evidence to show Chlorprothixene interacts with H1. PTS predicts H1 receptor is the target of Chlorprothixene, the results are listed in Supplementary Table S3 (Figure 4). Sridhar R. Vasudevan proved experimentally that Chlorprothixene binds with H1 receptor and selectively inhibit histamine-induced calcium release with IC50 of 1 nM.

Chlorprothixene (red) aligned with H1 receptor antagonist promazine (A, yellow) and CHEMBL363581 (B, yellow).
Figure 4.

Chlorprothixene (red) aligned with H1 receptor antagonist promazine (A, yellow) and CHEMBL363581 (B, yellow).

Case study 4: predicting potential side-effects for Fluoxetine

hERG (the human Ether-à-go-go-Related Gene, UniProt ID: Q12809) is known as a potassium (K+) ion channel mediating the repolarizing IKr current in the cardiac action potential. A drug that potentially interacts with hERG can result in lethal side-effect (45). Fluoxetine is a selective serotonin reuptake inhibitor for treating depressive disorder. The potential targets PTS predicted for Fluoxetine are listed in Table 5.

Table 5.

The predicted target profile for fluoxetine

RankUniProt IDTarget nameOrganismScore1P35354Prostaglandin G/H synthase 2Homo sapiens (Human)0.862P23219Prostaglandin G/H synthase 1Homo sapiens (Human)0.863Q01959Sodium-dependent dopamine transporterHomo sapiens (Human)0.854P31645Sodium-dependent serotonin transporterHomo sapiens (Human)0.855P23975Sodium-dependent noradrenaline transporterHomo sapiens (Human)0.856P14416D(2) dopamine receptorHomo sapiens (Human)0.847P35462D(3) dopamine receptorHomo sapiens (Human)0.848P11229Muscarinic receptorHomo sapiens (Human)0.839P35367Histamine H1 receptorHomo sapiens (Human)0.8310Q12809Potassium channel H-ERGHomo sapiens (Human)0.83
Table 5.

The predicted target profile for fluoxetine

RankUniProt IDTarget nameOrganismScore1P35354Prostaglandin G/H synthase 2Homo sapiens (Human)0.862P23219Prostaglandin G/H synthase 1Homo sapiens (Human)0.863Q01959Sodium-dependent dopamine transporterHomo sapiens (Human)0.854P31645Sodium-dependent serotonin transporterHomo sapiens (Human)0.855P23975Sodium-dependent noradrenaline transporterHomo sapiens (Human)0.856P14416D(2) dopamine receptorHomo sapiens (Human)0.847P35462D(3) dopamine receptorHomo sapiens (Human)0.848P11229Muscarinic receptorHomo sapiens (Human)0.839P35367Histamine H1 receptorHomo sapiens (Human)0.8310Q12809Potassium channel H-ERGHomo sapiens (Human)0.83

Sodium-dependent serotonin transporter (UniProt ID: P31645) is the primary target of Fluoxetine, which is ranked at fourth in the list. By inspect the potential target list, we find the top-10 target is hERG. Further literature studies reveal that Fluoxetine is experimentally proved as hERG inhibitor (IC50 = 3.1 μM) (46).

Discussion

PTS predicts targets for a compound through superimposing the compound structure onto the 3D ligand structures of putative targets. This approach considers pharmacophore shape similarity that 2D approaches cannot do. Other 3D approaches use molecular docking techniques, while PTS does not employ ligand-receptor docking techniques and can still produce results where the receptor structure data are not available. Both PTS and SwissTargetPrediction are web-based target fishing tools. The four testing cases were tested on both tools, which produced similar results. Additionally, small scale of structurally diverse drugs that targeting four classes of biological systems (GPCR, Ion channel, Nuclear receptor and kinase) were extracted to test the success rate and applicability of PTS. Averagely, at least one known targets of each drug is found among the top 20 predicted targets for 70% of the ligands (Supplementary Material). The advantage of PTS is able to superimpose the query molecule into the binding pockets of the putative targets when the experimental structure data are available. However, there are limits for PTS, too. The activity cliff issue, i.e. a subtle change in the chemical structure cause the great loss of bioactivity, may be a common concern for all ligand-based methods, including the 3D approaches.

To date, PTS has received >500 queries from 11 countries in the world since 10 August 2016. It can be used for target identification, drug repurposing, toxic risk estimation and molecular interaction simulation pre-processing tool.

Funding

National Science Foundation of China (81473138, 81573310); Guangdong Frontier & Key Technology Innovation Program (2015B010109004); Guangdong Natural Science Foundation (2016A030310228); the Fundamental Research Funds for the Central Universities (2013HGCH0015); Tianhe-2 based biomedical health data application support platform (U1611261); the Fundamental Research Funds for the Central Universities (17LGJC23). Funding for open access charge: National Science Foundation of China (81473138), funded by the National Natural Science Foundation of China.

Supplementary data

Supplementary data are available at Database Online.

Conflict of interest. None declared.

References

1

Medina-Franco

J.L.
, Giulianotti
M.A.
, Welmaker
G.S.
(2013) Shifting from the single to the multitarget paradigm in drug discovery.
Drug Discov. Today
, 18, 495–501.

2

Boran

A.D.
, Iyengar
R.
(2010) Systems approaches to polypharmacology and drug discovery.
Curr. Opin. Drug Discov. Dev
., 13, 297–309.

3

Paolini

G.V.
, Shapland
R.H.
, van Hoorn
W.P.
(2006) Global mapping of pharmacological space.
Nat. Biotechnol
., 24, 805–815.

4

Hopkins

A.L.
(2007) Network pharmacology.
Nat. Biotechnol
., 25, 1110–1111.

5

Schenone

M.
, Dancik
V.
, Wagner
B.K.
, (2013) Target identification and mechanism of action in chemical biology and drug discovery.
Nat. Chem. Biol
., 9, 232–240.

6

Ziegler

S.
, Pries
V.
, Hedberg
C.
(2013) Target identification for small bioactive molecules: finding the needle in the haystack.
Angew. Chem. Int. Ed. Engl
., 52, 2744–2792.

7

Liu

Z.
, Ding
P.
, Yan
X.
(2016) ASDB: a resource for probing protein functions with small molecules.
Bioinformatics
, 32, 1752–1754.

8

Chen

Y.Z.
, Zhi
D.G.
(2001) Ligand-protein inverse docking and its potential use in the computer search of protein targets of a small molecule.
Proteins
, 43, 217–226.

9

Li

H.
, Gao
Z.
, Kang
L.
(2006) TarFisDock: a web server for identifying drug targets with docking approach.
Nucleic Acids Res
., 34, W219–W224.

10

Gunther

S.
, Kuhn
M.
, Dunkel
M.
(2008) SuperTarget and Matador: resources for exploring drug-target relationships.
Nucleic Acids Res
., 36, D919–D922.

11

Yao

Z.J.
, Dong
J.
, Che
Y.J.
(2016) TargetNet: a web service for predicting potential drug-target interaction profiling via multi-target SAR models.
J. Comput. Aided Mol. Des
., 30, 413–424.

12

Liu

X.
, Ouyang
S.
, Yu
B.
(2010) PharmMapper server: a web server for potential drug target identification using pharmacophore mapping approach.
Nucleic Acids Res
., 38, W609–W614.

13

Lounkine

E.
, Keiser
M.J.
, Whitebread
S.
(2012) Large-scale prediction and testing of drug activity on side-effect targets.
Nature
, 486, 361–367.

14

Gaulton

A.
, Bellis
L.J.
, Bento
A.P.
(2012) ChEMBL: a large-scale bioactivity database for drug discovery.
Nucleic Acids Res
., 40, D1100–D1107.

15

Liu

X.
, Xu
Y.
, Li
S.
(2014) In Silico target fishing: addressing a ‘Big Data’ problem by ligand-based similarity rankings with data fusion.
J. Cheminformatics
, 6, 33.

16

Liu

Z.
, Zheng
M.
, Yan
X.
(2014) ChemStable: a web server for rule-embedded naive Bayesian learning approach to predict compound stability.
J. Comput. Aided Mol. Des
., 28, 941–950.

17

Keiser

M.J.
, Roth
B.L.
, Armbruster
B.N.
(2007) Relating protein pharmacology by ligand chemistry.
Nat. Biotechnol
., 25, 197–206.

18

Dunkel

M.
, Gunther
S.
, Ahmed
J.
(2008) SuperPred: drug classification and target prediction.
Nucleic Acids Res
., 36, W55–W59.

19

Gong

J.
, Cai
C.
, Liu
X.
(2013) ChemMapper: a versatile web server for exploring pharmacology and chemical structure association based on molecular 3D similarity method.
Bioinformatics
, 29, 1827–1829.

20

Bauer

R.A.
, Bourne
P.E.
, Formella
A.
(2008) Superimpose: a 3D structural superposition server.
Nucleic Acids Res
., 36, W47–W54.

21

Sperandio

O.
, Petitjean
M.
, Tuffery
P.
(2009) wwLigCSRre: a 3D ligand-based server for hit identification and optimization.
Nucleic Acids Res
., 37, W504–W509.

22

Thimm

M.
, Goede
A.
, Hougardy
S.
(2004) Comparison of 2D similarity and 3D superposition. Application to searching a conformational drug database.
J. Chem. Inf. Comput. Sci
., 44, 1816–1822.

23

Yan

X.
, Li
J.
, Liu
Z.
(2013) Enhancing molecular shape comparison by weighted Gaussian functions.
J. Chem. Inf. Model
, 53, 1967–1978.

24

Yang

H.
, Qin
C.
, Li
Y.H.
(2016) Therapeutic target database update 2016: enriched resource for bench to clinical drug target and targeted pathway information.
Nucleic Acids Res
., 44, D1069–D1074.

25

Overington

J.P.
, Al-Lazikani
B.
, Hopkins
A.L.
(2006) How many drug targets are there?
Nat. Rev. Drug Discov
., 5, 993–996.

26

Pundir

S.
, Magrane
M.
, Martin
M.J.
(2015) Searching and navigating UniProt databases.
Curr. Protoc. Bioinformatics
, 50, 1 27 21–1 27 10.

27

Papadatos

G.
, Gaulton
A.
, Hersey
A.
(2015) Activity, assay and target data curation and quality in the ChEMBL database.
J. Comput. Aided Mol. Des
., 29, 885–896.

28

Gilson

M.K.
, Liu
T.Q.
, Baitaluk
M.
(2016) BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology.
Nucleic Acids Res
., 44, D1045–D1053.

29

Nicola

G.
, Berthold
M.R.
, Hedrick
M.P.
(2015) Connecting proteins with drug-like compounds: open source drug discovery workflows with BindingDB and KNIME.
Database-Oxford
, 2015, 1–22.

30

Wang

R.
, Fang
X.
, Lu
Y.
(2004) The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures.
J. Med. Chem
., 47, 2977–2980.

31

Wang

R.
, Fang
X.
, Lu
Y.
(2005) The PDBbind database: methodologies and updates.
J. Med. Chem
., 48, 4111–4119.

32

Liu

Z.
, Li
Y.
, Han
L.
(2015) PDB-wide collection of binding data: current status of the PDBbind database.
Bioinformatics
, 31, 405–412.

33

Ward

R.A.
, Anderton
M.J.
, Ashton
S.
(2013) Structure- and reactivity-based development of covalent inhibitors of the activating and gatekeeper mutant forms of the epidermal growth factor receptor (EGFR).
J. Med. Chem
., 56, 7025–7048.

34

Ji

X.
, Peng
T.
, Zhang
X.
(2014) Design, synthesis and biological evaluation of novel 6-alkenylamides substituted of 4-anilinothieno[2, 3-d]pyrimidines as irreversible epidermal growth factor receptor inhibitors.
Bioorg. Med. Chem
., 22, 2366–2378.

35

Sasson

S.
(1991) Equilibrium binding analysis of estrogen agonists and antagonists: relation to the activation of the estrogen receptor.
Pathol. Biol. (Paris)
, 39, 59–69.

36

Favoni

R.E.
, de Cupis
A.
(1998) Steroidal and nonsteroidal oestrogen antagonists in breast cancer: basic and clinical appraisal.
Trends Pharmacol. Sci
., 19, 406–415.

37

Medina

P.
, Favre
G.
, Poirot
M.
(2004) Multiple targeting by the antitumor drug tamoxifen: a structure-activity study.
Curr. Med. Chem. Anticancer Agents
, 4, 491–508.

38

Higgins

M.J.
, Stearns
V.
(2010) CYP2D6 polymorphisms and tamoxifen metabolism: clinical relevance.
Curr. Oncol. Rep
., 12, 7–15.

39

Paul

R.
, Silve
S.
, De Nys
N.
(1998) Both the immunosuppressant SR31747 and the antiestrogen tamoxifen bind to an emopamil-insensitive site of mammalian Delta8-Delta7 sterol isomerase.
J. Pharmacol. Exp. Ther
., 285, 1296–1302.

40

Ritchie

G.A.
(1980) The direct inhibition of prostaglandin synthetase of human breast cancer tumor tissue by tamoxifen.
Recent Results Cancer Res
., 71, 96–101.

41

Nilsson

U.W.
, Garvin
S.
, Dabrosin
C.
(2007) MMP-2 and MMP-9 activity is regulated by estradiol and tamoxifen in cultured human breast cancer cells.
Breast Cancer Res. Treat
, 102, 253–261.

42

Keiser

M.J.
, Setola
V.
, Irwin
J.J.
(2009) Predicting new molecular targets for known drugs.
Nature
, 462, 175–181.

43

Froimowitz

M.
, Cody
V.
(1993) Biologically active conformers of phenothiazines and thioxanthenes. Further evidence for a ligand model of dopamine D2 receptor antagonists.
J. Med. Chem
., 36, 2219–2227.

44

Fjalland

B.
, Boeck
V.
(2009) Neuroleptic blockade of the effect of various neurotransmitter substances.
Acta Pharmacol. Toxicol. (Copenh)
, 42, 206–211.

45

Jamieson

C.
, Moir
E.M.
, Rankovic
Z.
(2006) Medicinal chemistry of hERG optimizations: highlights and hang-ups.
J. Med. Chem
., 49, 5029–5046.

46

Thomas

D.
, Gut
B.
, Wendt-Nordahl
G.
(2002) The antidepressant drug fluoxetine is an inhibitor of human ether-a-go-go-related gene (HERG) potassium channels.
J. Pharmacol. Exp. Ther
., 300, 543–548.

Author notes

Peng Ding, Xin Yan contributed equally to this work.

© The Author(s) 2017. Published by Oxford University Press.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data