chapter5 lbvs m8 - vrije universiteit amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further...

30
71 Chapter 5 Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors Sabine Schultes, Albert J. Kooistra, Eric E. J. Haaksma, Rob Leurs, Iwan J.P. de Esch, Chris de Graaf In preparation Abstract Virtual screening (VS) has become an integral part of fragment-based drug discovery (FBDD). In this study we have evaluated the applicability of ligand-based virtual screening (LBVS) methods for identifying small fragment-like biologically active molecules using different similarity descriptors and different consensus scoring approaches. For this purpose we have evaluated the performance of 14 chemical similarity descriptors in retrospective virtual screening studies to discriminate fragment-like ligands of three membrane bound receptors from fragments that are experimentally determined to have no affinity for these proteins (true inactives). We used a consistent fragment affinity data set for two G Protein-Coupled Receptors (GPCRs), the histamine H4 receptor (H4R) and the histamine H1 receptor (H1R), and one Ligand-Gated Ion Channel (LGIC), the serotonin receptor (5HT3AR), to validate our systematic retrospective virtual screening studies. We further tested different consensus scoring methods to combine the results for multiple actives and all possible combinations of different similarity descriptors. Our studies show that LBVS methods are able to discriminate active from inactive fragments and allow the identification of scaffolds that are different from the reference template structure. Best enrichments were observed with quite large reference actives. These actives however rank larger actives higher than smaller ones. This indicates that LBVS for especially small fragments is a challenging task. Consensus scoring of multiple actives (group fusion) or multiple similarity descriptors (similarity fusion) is very effective to increase enrichments. Whereas the enrichments of individual similarity descriptors are quite target dependent, most combinations gave better enrichments than the average enrichment. For prospective virtual screening our results highly recommend to use a combination of both, group fusion and data fusion.

Upload: others

Post on 24-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

71

Chapter 5

Full combinatorial consensus scoring for ligand-based virtual fragment screening

at membrane bound receptors Sabine Schultes, Albert J. Kooistra, Eric E. J. Haaksma, Rob Leurs, Iwan J.P. de Esch,

Chris de Graaf

In preparation

Abstract

Virtual screening (VS) has become an integral part of fragment-based drug discovery (FBDD). In this study we have evaluated the applicability of ligand-based virtual screening (LBVS) methods for identifying small fragment-like biologically active molecules using different similarity descriptors and different consensus scoring approaches. For this purpose we have evaluated the performance of 14 chemical similarity descriptors in retrospective virtual screening studies to discriminate fragment-like ligands of three membrane bound receptors from fragments that are experimentally determined to have no affinity for these proteins (true inactives). We used a consistent fragment affinity data set for two G Protein-Coupled Receptors (GPCRs), the histamine H4 receptor (H4R) and the histamine H1 receptor (H1R), and one Ligand-Gated Ion Channel (LGIC), the serotonin receptor (5HT3AR), to validate our systematic retrospective virtual screening studies. We further tested different consensus scoring methods to combine the results for multiple actives and all possible combinations of different similarity descriptors. Our studies show that LBVS methods are able to discriminate active from inactive fragments and allow the identification of scaffolds that are different from the reference template structure. Best enrichments were observed with quite large reference actives. These actives however rank larger actives higher than smaller ones. This indicates that LBVS for especially small fragments is a challenging task. Consensus scoring of multiple actives (group fusion) or multiple similarity descriptors (similarity fusion) is very effective to increase enrichments. Whereas the enrichments of individual similarity descriptors are quite target dependent, most combinations gave better enrichments than the average enrichment. For prospective virtual screening our results highly recommend to use a combination of both, group fusion and data fusion.

Page 2: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

72

Introduction

Fragment-based drug discovery (FBDD) is a promising approach in the identification of novel, chemically tractable leads for the development of new therapeutic agents.1-4 Key advantage of FBDD is that the respective chemical space described by low molecular weight compounds is much smaller than for drug-like compounds.5-7 In addition, the lower complexity of small fragments is believed to increase the chance of a good match between ligand and protein.8 However, the experimental identification of fragment hits is often more challenging than for drug-like compounds.9 Fragments have in general lower affinities for their targets and therefore need special screening methods.10 To select compounds for experimental testing virtual screening has evolved as an integral part in the drug discovery process.11 There are two general virtual screening methods, structure-based virtual screening (SBVS) and ligand-based virtual screening (LBVS). SBVS is frequently applied when a suitable structural model of the target is available. Ligand-based techniques provide alternative and complementary approaches particularly when the target structure is not available but active ligand molecules have been identified.12 The applicability of virtual screening for the discovery of small fragment-like molecules is still a relatively unexplored research area for both structure-based13-17 and ligand-based15,17 approaches.

LBVS is based on the chemical similarity to one or more known active ligands of a specific protein target.18 Information of the ligand can be encoded in a bit string representation, called molecular fingerprint that can be used for comparison.19 Such fingerprints can be classified into four different classes: binary (presence/ absence) circular fingerprints, circular fingerprints considering counts, path-based and keyed fingerprints and pharmacophore-based fingerprint.20 Other methods compare the overlay of the 3D structures of 2 molecules21,22, or uses molecular graph approaches23,24 or feature trees25,26 to determine the similarity between compounds.27

As LBVS methods are based on the similarity to known actives the challenge is to find novel structures.18 In the context of ligand-based virtual fragment screening the size dependence of similarity measurements bear an additional challenge.19,28 Moreover there is a lack of proper test sets for virtual fragment screening. The low affinities of fragments provide a challenge for detection of fragment actives. Therefore the number of known fragment actives is often very low. The selection of appropriate decoys should also be done with care as decoys should not be distinguishable from actives by very simple physical properties.29 One widely used benchmark dataset is the directory of useful decoys (DUD).30 In this dataset decoys resemble the physical properties of the actives. The DUD dataset is frequently used for retrospective structure and ligand-based virtual

Page 3: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

73

screening approaches.30-33 However the selection of decoys always bears the problem that assumed inactives may actually be actives.34

This study represents a systematic comparative investigation of the challenges and applicability of different methods for ligand-based virtual fragment screening. We used a consistent fragment affinity data set for two the histamine H4 receptor (H4R) and the histamine H1 receptor (H1R) and the serotonin receptor (5HT3AR), to validate our systematic retrospective virtual screening studies. All three proteins are of pharmaceutical relevance. Both, the H1R and H4R, are G Protein-Coupled Receptors (GPCRs) and key players in immunological and allergic responses.35 Antagonists for the H1R receptor, a GPCR, are known as the classical ‘antihistamines’ and are used against allergic, inflammatory responses like rhinitis or itching due to insect stings.36 The more recently identified H4R, is a promising drug target for diseases like asthma, allergic rhinitis, pruritis, itch or rheumatoid arthritis.37-43 The 5HT3AR is a Ligand-Gated Ion Channel and is found on neurones, where they trigger rapid depolarisation due to a transient inward current.44 5-HT3 receptor antagonists are in clinical use to treat nausea and vomiting in cancer patients receiving chemotherapy or radiation therapy.45

An advantage of our study is that we used true inactives that are experimentally confirmed in consistent in-house screens46,47 of a chemically diverse library of fragment-like molecules.35,36 In most other virtual screening evaluations decoy molecules randomly selected from large chemical databases are considered as inactives for a specific protein target, although these molecules have not been experimentally determined to be inactive for this protein.30-33 We tested each similarity descriptor individually for each reference active and applied different consensus scoring approaches. Consensus scoring combines the data from individual virtual screens with the aim to generate a result superior to the individual screens. Combining the results from individual reference actives48-57 was previously defined as group fusion.56 In the same way consensus scoring approaches can be applied for combining the results of the screens using different similarity descriptors33,58-60, also called similarity fusion.56 Most of the similarity fusion approaches that have been reported to date involve the use of single reference structures.33,58-60 In the current study we have to best of our knowledge for the first time systematically evaluated different combinations of group fusion and similarity fusion approaches. Furthermore the abilities of different similarity search methods to identify compounds that do not contain chemical scaffolds present in the respective reference actives were evaluated. Overall, the current study gives insights into some of the strengths and challenges of ligand-based virtual fragment screening that can be useful for future FBDD studies.

Page 4: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

74

Methods:

Datasets:

Experimentally validated active and inactive fragment-like molecules from in-house experimental screening campaigns of 1080 fragments46 at H1R, H4R, and 5-HT3RA were appended with active fragment-like molecules with pKi>7.0 selected from the CHEMBL database61 to achieve an active inactive ration of approximately 1:10 (Table 1). All molecules obey the following fragment rules: heavy atom count (HA) < 22, number of rotatable bonds (Nrot) < 5, logP < 3.46,47 The descriptors were calculated using MOE.62

Table 1. Source and number of the actives and inactives used within this study.

in-house actives CHEMBL actives In-house inactives

H4R 48 49 935

H1R 36 48 943

5HT3AR 70 26 919

Chemical similarity descriptors

The following similarities using the Tanimoto coefficient63 were calculated: MACCS, TGT, TGD, GpiDAPH3, TAT, TAD, piDAPH3, piDAPH4 (implemented in MOE62), FCFP_4, ECFP_4, FCFC_4, ECFC_4 (available in Pipeline Pilot64). Furthermore EDprints65 similarities were determined using an in-house script, while 3D shape-based ROCS alignments were ranked according to ComboScore, which combines the Shape Tanimoto score (3D overlapping of shapes) and the Color Score (common pharmacophore features within the reference query and the test compounds in the screening database).66 For the 3D methods (TAD, TAT, piDAPH3, piDAPH4 and ROCS), the 3D conformations of the reference query and the test compounds (actives and inactives) in the screening database were generated using Corina.67 For the test compounds conformational databases were generated using OMEGA (using standard settings).68 It should be noted that the results of 3D ligand structure-based virtual screening studies can depend on the 3D conformations of the query and test compounds in the screening database.32,69 The 3D generation procedure applied in the current study are in line with previously reported virtual screening studies.17,69,70

Table 2 gives and overview of the properties of the different similarity descriptors used in this study.

Page 5: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

75

Table 2. Similarity descriptors

Substructure descriptors

MACCS71: (2D) Encode presence/absence of 166 predefined structural keys

ECFP_472: Atom type based extended connectivity fingerprint (2D)

Topological fingerprints that encode information on atom centered fragments. The number (4) describes a certain diameter of chemical bonds within information of the surrounding atoms is encoded

FCFP_472: Functional class based extended connectivity fingerprint (2D)

ECFC_4: Atom type based extended connectivity count (2D)

FCFC_4: Functional class based extended connectivity count (2D)

Property descriptor

EDprints: Electron density fingerprint (2D)65 Encodes electron density information (calculated chemical shifts of 1H and 13C and partial charges)

Pharmacophore-based descriptors

TAD: Typed atom distances, spatial 2-point pharmacophore based fingerprint (3D)

Encodes atom types (Donor, Acceptor, Polar, Anion, Cation, Hydrophobe) and graph (2D) or spatial (3D) distances between 2 or 3 atoms.

TGD: Typed graph distances, graph based 2-point pharmacophore based fingerprint (2D)

TAT: Typed atom triangles, spatial 3-point pharmacophore based fingerprint (3D)

TGT: Typed graph triangles, graph based 3-point pharmacophore based fingerprint (2D)

GpiDAPH3: graph based 3-point pharmacophore based fingerprint (2D)

Encodes atom types that are calculated based on three atomic properties: ‘in pi system’, ‘is donor’, ‘is acceptor’ and graph (2D) or spatial (3D) distances between 3 or 4 atoms

piDAPH3: spatial 3-point pharmacophore based fingerprint (3D)

piDAPH4: spatial 4-point pharmacophore based fingerprint (3D)

Shape based descriptors

ROCS66: rapid overlay of chemical structures (3D)

Conformational overlay of shape and pharmacophore features

Scaffold similarity analysis

To determine scaffold similarity between compounds, the freely available program strip-it73 was used for the extraction of predefined scaffolds from organic small molecules. In this study we generated ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold 2 (SCF2). SCF1 are extracted by removing all side chains. SCF1 were further converted to SCF2 by exchanging all heteroatoms to carbon atoms, changing all bond orders to single bonds and finally reducing all linkers between ring systems to one single bond. (Figure 1).74,75

Page 6: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

76

Figure 1. Demonstration of generation of SCF1 and SCF2.73

Analysis retrospective virtual screening studies

Similarity matrices were generated using all actives as reference actives and all actives and inactives as test compounds. Enrichment factors (EF) were calculated based on the 1% fraction of enriched false positives (FP) (excluding self-comparisons).76

For testing the performance of different virtual fragment screening approaches in enriching compounds with scaffolds74,75 different to the respective reference active, similarity scores resulting from comparison of actives with test compounds, sharing the same scaffolds, were excluded. For the calculation of the enrichment factor only the remaining similarity scores for each reference active were taken into account. As we had several actives for each target in hand the enrichment for each reference active was calculated individually. If all these enrichment factors were averaged than the average enrichment factor was obtained.23 The individual similarity scores for each reference active or subsequently for each similarity descriptor were additionally combined using consensus scoring approaches.

Consensus scoring approaches

The following consensus scoring methods were applied:

Mean-value: For each test compound the mean similarity score over all reference actives was calculated.33,48,49,52-55,57

Max-value: For each test compound the maximum similarity score to any of the reference actives was selected.26,48,49,51-55,57,59,77

Mean-rank: The ranks of the test compounds for each reference active were normalized (0-1) and the mean over the normalized ranks for each test compound was calculated.33,49,57-60

Ranked-by-vote: The ranks of the test compounds were normalized (0-1) and the rate of the occurrence in the top 1% (normalized rank > 0.99) for each test compound was calculated.33,58,59

EF = fractionTPfractionFP1%

Page 7: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

77

Table 3 demonstrates how these consensus scoring approaches work based on theoretical examples of virtual screening hit lists.

Table 3. Demonstration how different consensus scoring methods for the case that all scaffolds are included (all) and for the case that scaffolds that are similar to the reference are not included in the hit lists (SCF1 or SCF2, see “scaffold similarity analysis74,75). The similarity values between reference (Ref) and test (Test) compounds are theoretical values. The numbers in brackets are the normalized ranks (best 1, worst 0) of the test compounds compared to the respective reference compound. Mean-value and max-value are calculated by taking the mean or the maximum of the similarity values. Mean-rank is the mean of the normalized ranks and ranked-by-vote is the rate of the occurrence (1 always, 0 never) in the top 1% of the ranks (normalized rank > 0.99).

All Ref 1 Ref 2 Ref 3 Mean-value

Max- value

Mean- rank

Ranked-by-vote

Test 1 0.70 (1.00) 0.80 (1.00) 0.90 (1.00) 0.80 0.90 1.00 1.00 Test 2 0.60 (0.63) 0.50 (0.57) 0.40 (0.38) 0.50 0.60 0.52 0.00 Test 3 0.10 (0.00) 0.20 (0.00) 0.30 (0.00) 0.20 0.30 0.00 0.00 Test 4 0.60 (0.63) 0.80 (1.00) 0.40 (0.38) 0.60 0.80 0.66 0.33 Test 5 0.50 (0.25) 0.30 (0.29) 0.70 (0.75) 0.50 0.70 0.43 0.00

SCF1 or SCF2 Ref 1 Ref 2 Ref 3 Mean-

value Max- value

Mean- rank

Ranked-by-vote

Test 1 0.80 (1.00) 0.90 (1.00) 0.85 0.90 1.00 1.00 Test 2 0.60 (1.00) 0.40 (0.50) 0.50 0.60 0.75 0.50 Test 3 0.10 (0.00) 0.20 (0.00) 0.30 (0.00) 0.20 0.30 0.00 0.00 Test 4 0.60 (1.00) 0.40 (0.50) 0.50 0.60 0.75 0.50 Test 5 0.50 (0.40) 0.30 (0.50) 0.40 0.70 0.45 0.00

All: All similarity scores were included for group fusion. SCF1/SCF2: Only similarity values resulting from comparisons of reference and test compounds with dissimilar SCF1 or SCF2 were used for group fusion.

Results and discussion:

We have retrospectively validated the applicability of different ligand-based similarity descriptors for virtual fragment screening. So far only a few selected methods have been tested for this purpose.15,17 In this study however we tested a large range of similarity descriptors to investigate their applicability for fragment screening. We first investigated the performance of each reference active individually. Subsequently we tested different consensus scoring methods to combine the screening results from the actives (group fusion)48-57 and also to combine the search results from multiple similarity descriptors (similarity fusion)33,58-60. For this task we used fragment-like actives and inactives for H4R, H1R and 5HT3AR, three pharmaceutically relevant proteins78,79 that have been studied with fragment-based drug discovery approaches.14,15,47

Page 8: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

78

Performance of individual reference actives

We first analyzed the performance of the individual reference ligands in retrieving actives that have dissimilar SCF2. Figure 2 shows the enrichment factors at 1% FP in retrospective virtual screening runs against H1R, H4R, and 5HT3AR test sets based on different similarity descriptors and using reference ligands with different numbers of heavy atoms (HAs). For all three tested targets chemical similarity searches against relatively larger fragment-like reference ligands (HA count 16-22) give higher enrichments of active over inactive molecules in the retrospective virtual screening runs. It should be noted however that there is one single small H4R reference ligand (HA count: 13) that gives exceptional high enrichment of 65.7 using ECFC_4 (Figure 2A). The general better performance of the relatively lager actives seems to be independent of the relative amount active molecules compared to inactive molecules. For example for H4R the number of actives with a HA count of 21 or 22 is relatively low and nevertheless high enrichments are observed (Figure 2A). The same is true for actives for 5HT3AR with a HA count of 20, 21 or 22 (Figure 2C). One reason for the general better performance of larger actives might be that small changes in the structure of small compounds are expected to lead to larger dissimilarities than for bigger compounds. Except from that part also the SAR might change more dramatically leading to so-called activity cliffs.80,81 Moreover the Tanimoto similarity measurement is described to be size dependent.19,28 These factors might be reasons for the lower enrichments of actives over inactives in virtual screening if ligands with low number of HAs are used as references. Figure 3 shows how reference ligands that give the highest average enrichment (over all similarity descriptors) rank compounds with different numbers of HAs. Virtual screening runs against these references ligands, that contain 18, 19 and 20 HAs for H4R, H1R and 5HT3AR respectively, generally yield higher rankings for fragments that are relatively larger (HA count 16-22) which is largely independent of the HA distributions of actives and inactives (Figure 3). It should be noted however that especially for H1R and H4R data sets the relative number of small actives and ratio between actives and inactives is lower for molecules with less than 16 HAs (Figure 3A-B), which affects virtual screening ranking and enrichments for these smaller compounds. Overall our retrospective virtual screening analyses however seem to indicate that the discrimination of especially smaller (< 16 HAs) active fragments from inactive fragments is a challenging task in virtual fragment screening.

Page 9: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

79

Fig

ure 2

Figure 3

Page 10: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

80

Figure 2. (see previous page) Enrichment at 1% FP (left axis) obtained for each reference ligand using different similarity descriptors (colored crosses). Test compounds sharing similar SCF275 scaffolds with the respective reference ligand are excluded from the analyses. Density of the number of actives and inactives (right axis) for each HA count are shown with solid lines and dashed lines respectively. In panel A the methimmepip analogue is indicated as a relatively small reference ligand (13 heavy atoms) that gives a relatively high virtual screening enrichment factor

Figure 3. (see previous page) Ranks for the actives (with a dissimilar SCF2 compared to the reference compound) that were ranked to the according reference active compound (shown above the plots) using different similarity descriptors (colored crosses). Ranks were normalized from 0 to1 whereas the first ranked compound has the rank 1 and the last ranked compound has the rank 0.

Average performance of individual chemical similarity search methods

The average enrichment factor of a method for a specific target is calculated by dividing the sum of all enrichment factors obtained with all actives for a specific target by the total number actives.23 Comparison of the average enrichment factor at 1% FP (Figure 4, Table S1) shows that the performance of the different similarity descriptors decreases with decreasing scaffold similarity of the test compounds compared to the reference actives (all>SCF1>SCF2).

Retrospective virtual screening enrichments are generally somewhat higher for H4R, than for H1R, while enrichments for 5HT3AR are relatively low. This is also the case if similar scaffolds (SCF1 and SCF2) are not considered in the analysis (Figure 4 and Table S1). This is demonstrated in Figure 5, which shows the average enrichment curves observed for EDprints for all three targets. In this plot the average early enrichment curve is the highest for H4R followed by H1R and 5HT3AR.

A comparison of the MACCS-similarities of the actives of each dataset revealed that the actives of H4R are more similar to each other than actives of H1R, while the actives of 5HT3AR are relatively dissimilar to each other (Figure 6). The observation that the performance of the similarity descriptors decreases with decreasing self-similarity of the actives is in line with other studies.49

Comparing the average enrichment factor at 1% FP (Figure 4) shows that the performances of most of the similarity descriptors are quite comparable. For all three tested targets (H4R, H1R and 5HT3AR) there is no method that led always to best average enrichments. However among the 5 best performing similarity descriptors are MACCS and TAT (for all 3 tested targets) while GpiDAPH3, piDAPH3 and especially piDAPH4 are among the worst performing similarity descriptors for H4R and H1R. This observation is independent if similarity values for all test compounds are taken into account (Figure 4A, D, G, Table S1) or if similarity values of test compounds that have similar SCF1 or SCF2

Page 11: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

81

Figure 4

Page 12: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

82

Figure 5 Figure 6

Page 13: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

83

Figure 4 (see two previous pages). Enrichment factors at 1% FP of various similarity descriptors using different group fusion methods (blue: mean-value, red: max-value, yellow: rank-sum, green: ranked-by-vote) and the average enrichment (black) for H4R (A-C), H1R (D-F) and 5HT3AR (G-I). Enrichment factors in panel A, D and G were calculated using all similarity values (except self similarity values), enrichment factors in panel B-C, E-F and H-I were calculated leaving out similarity values resulting from comparison of reference and test compounds that have the same SCF1 (B, E and H) or SCF2 (C, F and). I

Figure 5 (see previous page). Average enrichment curves observed for EDprints for H4R, H1R and 5HT3AR including A) all similarity values and only similarity values resulting from comparison of compounds with B) dissimilar SCF1 and C) dissimilar SCF2.

Figure 6 (see previous page). Pair-wise MACCS Tanimoto similarities between actives-actives (solid lines) and inactives-actives (dashed lines).

to the respective reference active are excluded. (Figure 4B-C, E-F, H-I, Table S1). The low virtual screening accuracy of GpiDAPH3, piDAPH3 and piDAPH4 compared to other methods has been observed in previous retrospective virtual screening evaluations.20 However the difference in the performance between different methods is generally relatively small. Especially in the case of the 5HT3AR were the worst performance of all similarity descriptors compared to the other datasets is observed there is hardly any difference in the average enrichment factors of the individual methods (Figure 4G, H and I, Table S1)

Consensus scoring

Combination of similarity values: group fusion

If multiple actives are available consensus scoring approaches to combine similarity values of test compounds can be applied to enhance the effectiveness of the virtual screening methods.48-57 In general if quite an amount of actives are known also more sophisticated machine learning methods like naive Bayesian classifier or a support vector machine can be used to discriminate actives from inactives.82-85 In this study we evaluated the performance of four different consensus scoring approaches to combine similarity values or the ranks of the test compounds obtained for multiple actives (also called group fusion56): mean-value20,33,48,49,52-55,57, max-value20,26,48,49,51-55,59,77, mean-rank20,33,49,57-60 and ranked-by-vote33,58,59. A demonstration how these consensus scoring approaches work is shown in the Methods section in Table 3. Mean-value and max-value are calculated by taking the mean or the maximum of the similarity values. Mean-rank is the mean of the normalized ranks and ranked-by-vote is the rate of the occurrence (1 always, 0 never) in the top 1% of the ranks (normalized rank > 0.99).

Page 14: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

84

Figure 4 shows that certain consensus scoring methods achieve better enrichments than the average enrichment. For H4R both max-value and the ranked-by-vote consensus methods achieve the best results. However for H1R and 5HT3AR ranked-by-vote most often achieves better enrichment results than other consensus scoring approaches. Ranked-by-vote always achieves better enrichment (at 1%FP) than average enrichment. Max-value most of the time achieves better enrichment than the average enrichment, except for piDAPH4 for all targets and additionally for piDAPH3 for H1R in the case similarity values, resulting from comparisons of compounds sharing similar SCF1, are excluded. So far using the max-value was already reported to be successful for group fusion.20,26,48,49,51-55,57,77 However, to the best of our knowledge, studies about the use of ranked-by-vote for group fusion have not been reported so far.

The performance of consensus scoring of the actives is not related to the performance of the average enrichment. For example the GpiDAPH3 method gives the lowest average enrichment factor compared to all other similarity descriptors in retrospective virtual screening runs on the H1R set (Figure 4D, Table S1). However using ranked-by-vote as consensus scoring enrichment of GpiDAPH3 at 1% FP is 9 times higher and makes GpiDAPH3 the second best method after MACCS (Figure 4D, Table S1). But especially this method performs poorly for H1R if the scaffold similarity of the test compounds compared to the respective reference active is reduced (Figure 4E-F, Table S1). For 5HT3AR GpiDAPH3 is always among the best methods using ranked-by-vote as group fusion. This shows that the relative performance of the individual methods to identify new scaffolds (i.e. considering all compounds or, SCF1 or SCF2 scaffold similarity scenarios) is in fact target dependent.

The decrease in performance of the similarity descriptors with decreasing self-similarity is a challenge for the discovery of novel scaffolds. Our study shows however that for each target there are some promising methods for enriching of compounds that have dissimilar SCF2 compared to the respective reference compounds. The best performing methods if SCF2 dissimilar compounds compared to the reference ligands are enriched are for H4R ROCS, FCFC_4 and EDprints and TGT using max-value (Figure 4C, Table S1). For H1R these methods are TAT, MACCS and EDprints using ranked-by-vote (Figure 4F, Table S1) whereas for 5HT3AR these methods are GpiDAPH3 using ranked-by vote and EDprints and TGD using max-value (Figure 4I, Table S1). In other studies substructure descriptors77,86, extended connectivity descriptors77, pharmacophore-based descriptors57,77,86,87 and shape-based descriptors88 have been already reported to be successful in scaffold hopping or in enriching of a diverse set of compounds.89 EDprints, which encodes information about electron densities and charges, have been previously

Page 15: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

85

shown to be more robust toward subtle rearrangements of chemical groups and more suitable for screening against reference molecules with fused ring systems compared to other similarity descriptors.65 In this study we showed for all tested targets that EDprints are successful in enriching of active compounds that have dissimilar SCF2 compared to the respective reference actives. As example Table 4 shows the H4R active that achieved best enrichment of actives (with dissimilar SCF2) at a 1% FP rate using EDprints. The 5 highest ranked active compounds (which have dissimilar SCF2) based on virtual screening against this quinoxaline reference ligand are three aminopyrimidines, one quinazoline and one indolecarboxamide, showing the potential of scaffold hopping using ligand-based modeling approaches, as demonstrated for H4R.90-92

Table 4. Best active hits of an EDprints similarity search for H4R actives using the active that showed best enrichment factor at 1% FP as QUERY compound. As test compounds only compounds that have a dissimilar SCF2 to the QUERY compound were selected.

QUERY

Best HITS

EDprints similarity 0.33 0.21 0.12 0.11 0.09

*EDprints similarities were calculated using the McConnaughey similarity measure.65

Combination of similarity descriptors: similarity fusion

In the same way as the results from different actives can be fused to a combined score (group fusion56), these combined scores of the different similarity descriptors can be again fused (similarity fusion56).20,33,55,58-60 For this task the group fused scores from each similarity descriptor were normalized. Scores that were obtained by single similarity descriptors using a certain consensus scoring approach for group fusion were further combined with a certain consensus scoring approach (similarity fusion). Enrichment factors for all combinations of the 14 investigated similarity descriptors (16392 possible combinations), combined with a certain group and similarity fusion approach (16 possible combinations), were calculated. In this way we tested which combination of similarity descriptors combined with a certain combination of group and similarity fusion method (262272 possible combinations) gives the highest virtual screening enrichment.

ClNCl

N N

H+N

H+N

N

NN

H+N

N

NN

NH2

H+N

N

NN

NH2

Cl

N

N

HN

N+HN

ClHN

O

N

NH+

Page 16: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

86

Table 5 shows the combination of similarity descriptors that led to highest enrichment. The enrichment factor (1% FP) of the best combination of similarity descriptors is better than the enrichment factor of the best individual method.

Table 5. The minimal combination of similarity descriptors that led to best enrichment (marked with X) and the similarity descriptor that led on its own to best enrichment with the same group fusion method (colored red and marked with X if included in the best combination, if not it is marked with O).

Targ

et

Leav

e-in

EF (1

% F

P) b

est

sing

le m

etho

d EF

(1%

FP)

be

st c

ombi

natio

n

Gro

up

fusi

on

Sim

ilarit

y fu

sion

MA

CC

S

ECFP

_4

FCFP

_4

ECFC

_4

FCFC

_4

EDpr

ints

TAD

TGD

TAT

TGT

Gpi

DA

PH3

piD

APH

3

piD

APH

4

RO

CS

H4R

All 86.6 96.9 ranked-by-vote

mean-rank X O X X

SCF1 81.8 95.9 ranked-by-vote

mean-value X X X X X X

SCF2 72.2 91.8 max-value

mean-rank X X X X X X O

H1R

All 57.1 67.9 ranked-by-vote

mean-rank X X X X

SCF1 53.6 65.5 ranked-by-vote

mean-value X X X X X

SCF2 44.0 63.1 ranked-by-vote

mean-value X X X X X

5HT3AR

All 32.3 39.6 ranked-by-vote

mean-rank X X X X

SCF1 25.3 33.3 ranked-by-vote

mean-rank X X X O

SCF2 or

SCF2

18.8 27.1 max-value

mean-rank X X X

18.8 27.1 max-value

mean-value X X O X

All: All similarity scores were included for group fusion. SCF1/SCF2: Only similarity values resulting from comparisons of reference and test compounds with dissimilar SCF1 or SCF2 were used for group fusion.

Interestingly, the best combination does not always include the similarity descriptor that gives the highest enrichment. For H4R (SCF2) for example the best combination (group fusion: max-value, data fusion: mean-rank) gives an enrichment factor of 91.8 (at 1% FP) and includes piDAPH4, a method that individually has one of the lowest virtual screening accuracies (0.8 using max-value for group fusion) for this receptor (Figure 4C, Table S1). It should be noted however that the other five similarity descriptors (MACCS, ECFP_4, TAD, TGD and piDAPH3) in the consensus approach have relatively high virtual screening enrichments individually (between 51.9-60.8 using max-value for group

Page 17: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

87

fusion, see Figure 4C and Table S1). ROCS on the other hand, which is the best similarity descriptor for H4R, is not included in the best consensus approach. Moreover, the best consensus approach that combines the best four similarity descriptors, ROCS, FCFP_4, EDprints and TGT, gives only an enrichment factor of 81.4 (based on max-value for group fusion and mean-rank for similarity fusion). Interestingly, for all tested virtual screening scenarios MACCS is always included in the combination of similarity descriptors that led to the best enrichment (Table 5).

An analysis of the combinations of similarity descriptors that led to the top 1% of best enrichments (at 1% FP) shows that piDAPH3 and MACCS are most frequently included for all tested targets and consensus approaches (Table 6). The success of other similarity descriptors is however very dependent on the target and/or on the tested similarity scenario. The TAT similarity descriptor for example occurs in 78.7% of the cases in combinations that give best enrichments for H1R (SCF2). The same TAT method however is only part of 25.2% of the best combinations for H4R (SCF2). Another example is ECFC_4 that occurs in 79.5% of the top 1% best combinations for H4R when all compounds are included, but only in 4.7% of the top combinations in the SCF2 similarity scenario (Table 6).

Table 6. Occurrence (%) of the different similarity descriptor combinations that give the (top 1%) best enrichments (at a 1% FP rate). Enrichments were calculated for of all possible combination of similarity descriptors (16369) and all 16 (4 times 4) combinations of group and similarity fusion methods.

Targ

et

Leav

e-in

MA

CC

S

ECFP

_4

FCFP

_4

ECFC

_4

FCFC

_4

EDpr

ints

TAD

TGD

TAT

TGT

Gpi

DA

PH3

piD

APH

3

piD

APH

4

RO

CS

H4R All 60.3 66,4 33.5 79.5 32.6 57.5 54.2 23.8 36.8 76.6 21.2 55.0 39.4 77.3

SCF1 87.6 59.3 29.4 44.9 19.8 38.7 61.5 52.4 38.5 57.3 60.0 85.0 47.9 31.7 SCF2 96.6 53.3 49.1 4.7 18.6 58.1 74.0 60.9 25.2 55.3 59.0 85.9 74.8 57.4

H1R All 90.8 62.3 23.3 40.8 38.5 14.3 25.9 32.3 57.1 54.0 65.9 84.7 21.1 50.7

SCF1 92.0 52.1 35.5 56.0 41.2 39.5 66.7 56.6 74.9 65.7 62.4 88.3 26.6 48.0 SCF2 84.3 63.2 51.1 49.7 50.0 58.9 51.0 56.1 78.7 64.8 48.7 86.2 42.3 42.1

5HT3AR All 82.4 52.4 27.6 35.6 47.6 47.6 32.6 44.4 45.1 30.7 72.8 88.9 32.0 19.5

SCF1 71.4 45.8 51.4 49.1 50.1 39.3 40.6 30.7 65.0 20.7 81.2 95.0 46.6 24.5 SCF2 67.4 67.7 58.3 40.9 43.9 34.6 32.9 36.7 45.6 50.5 55.8 90.5 66.4 66.3

All: All similarity scores were included for group fusion. SCF1/SCF2: Only similarity values resulting from comparisons of reference and test compounds with dissimilar SCF1 or SCF2 were used for group fusion.

Page 18: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

88

Table 7 shows the frequency how often a combination of a group and a similarity fusion method led to the best (top 1%) of enrichments (out of all 262272 tested combinations). Ranked-by-vote as group fusion method most often led to top enrichments. Ranked-by-vote always led to top enrichments for H1R and in more than 88% of the cases for 5HT3AR. When test compounds that have similar SCF1 or SCF2 to the respective H4R reference ligands are omitted from the virtual screening hit list, the max-value method more often gives top enrichments for H4R. However, when all compounds are considered, ranked-by-vote also leads more often to top enrichments for H4R (in 99% of the cases).

Table 7. Frequency (%) how often a certain combination of a group and a similarity fusion method led to enrichments within the top 1%. Enrichments (at a 1% FP rate) were calculated for all 16 (4x4) combinations of group and similarity fusion methods for all possible combinations of similarity descriptors (16369).

H4R All SCF1 SCF2

Similarity fusion

Group-fusion

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean-value 0.1 0 0 0 0.2 0 0.0 0 0 0 0 0 max-value 0.6 0 0.3 0 33.0 0 32.4 0 14.1 0 85.9 0 mean-rank 0 0 0 0 0 0 0 0 0 0 0 0 ranked-by-vote 76.5 0.4 22.1 0 29.3 0.4 4.7 0 0 0 0 0

H1R All SCF1 SCF2

Similarity- fusion

Group-fusion

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean-value 0 0 0 0 0 0 0 0 0 0 0 0 max-value 0 0 0 0 0 0 0 0 0 0 0 0 mean-rank 0 0 0 0 0 0 0 0 0 0 0 0 ranked-by-vote 45.8 18.3 35.9 0 97.2 0.1 2.7 0 99.8 0 0.2 0

5HT3AR All SCF1 SCF2

Similarity - fusion

Group-fusion

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean-value 5.5 0.3 0.2 0 9.2 0 0 0 2.6 0 3.1 0 max-value 0.2 0 0 0 0 0 1.1 0 1.8 1.5 1.1 0.4 mean-rank 0 0 0 0 0 0 0 0 0 0 0.6 0 ranked-by-vote 6.8 0.8 86.2 0 55.0 1.1 33.6 0 80.4 1.1 7.4 0

All: All similarity scores were included for group fusion. SCF1/SCF2: Only similarity values resulting from comparisons of reference and test compounds with dissimilar SCF1 or SCF2 were used for group fusion.

Page 19: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

89

In contrast to group fusion, for similarity fusion the mean-value or mean-rank most often led to the best (top 1%) enrichments (Table 7). Mean-value led most often to top enrichments, but in several cases mean-rank was better. In 86% of the cases a combination of max-value group fusion and mean-rank similarity fusion resulted in top enrichments for H4R (SCF2 similarity scenario). For 5HT3AR (including all test compounds) a combination of ranked-by-vote group fusion and mean-rank similarity fusion also resulted in top enrichments in 86% of the cases.

Analysis of the enrichments achieved with similarity fusion methods for all possible combinations of similarity descriptors (for each group fusion methods) revealed that mean-value in more than 83% and mean-rank in more than 92% of the cases in the tested enrichment scenarios achieved enrichments that are higher than the average of the enrichments obtained with the individual similarity descriptors that were included in the similarity fusion model (Table 8). Moreover the enrichments obtained with the mean-value or mean-rank similarity fusion method were also frequently higher than the enrichments of any of the individual similarity descriptors (Table 9).

Page 20: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

90

Table 8. Frequency (%) how often a similarity fusion method led to a higher enrichment than the average enrichment of the individual methods (for all possible combination of similarity descriptors).

H4R All SCF1 SCF2

Similarity fusion

Group-fusion

mean value

max value

mean rank

ranked by

vote mean value

max value

mean rank

ranked by

vote mean value

max value

mean rank

ranked by

vote

mean-value 100.0 37.2 92.7 0.0 99.9 14.9 99.8 1.2 99.9 2.7 99.9 2.1 max-value 100.0 43.9 99.8 0.4 100.0 45.6 100.0 0.0 99.9 44.4 99.9 0.0 mean-rank 100.0 36.3 100.0 15.1 99.9 55.1 99.8 45.6 99.6 31.5 99.6 0.1 ranked-by-vote 99.9 87.3 99.8 0.0 99.9 92.0 100.0 0.0 99.7 38.6 100 0.0

H1R All SCF1 SCF2

Similarity- fusion

Group-fusion

mean value

max value

mean rank

ranked by

vote mean value

max value

mean rank

ranked by

vote mean value

max value

mean rank

ranked by

vote

mean-value 99.8 91.7 99.9 93.3 99.7 35.4 99.9 92.7 99.7 63.7 99.8 91.8 max-value 99.7 21.1 99.8 69.8 99.9 22.7 99.9 91.9 99.8 17.4 99.8 90.3 mean-rank 100.0 60.3 100.0 87.6 99.9 45.5 99.9 81.0 100.0 74.8 99.9 99.4 ranked-by-vote 99.8 96.1 99.9 0.0 99.7 73.9 99.8 52.4 99.9 60.2 99.9 28.9

5HT3AR All SCF1 SCF2

Similarity - fusion

Group-fusion

mean value

max value

mean rank

ranked by

vote mean value

max value

mean rank

ranked by

vote mean value

max value

mean rank

ranked by

vote

mean-value 99.7 20.7 98.3 89.2 99.6 34.9 98.0 87.7 99.1 43.9 96.2 73.0 max-value 83.7 2.8 93.6 27.6 96.3 7.3 98.5 67.5 99.2 24.9 96.9 94.2 mean-rank 97.9 71.5 98.4 52.9 97.9 69.9 98.5 53.6 95.1 88.1 94.9 31.8 ranked-by-vote 97.8 60.4 99.4 46.0 99.6 60.0 99.6 59.9 99.8 30.4 99.7 26.0

All: All similarity scores were included for group fusion. SCF1/SCF2: Only similarity values resulting from comparisons of reference and test compounds with dissimilar SCF1 or SCF2 were used for group fusion.

Page 21: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

91

Table 9. Frequency (%) how often a similarity fusion led to a higher enrichment than any of the individual methods (for all possible combination of similarity descriptors).

H4R All SCF1 SCF2

Similarity fusion

Group-fusion

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean-value 65.1 8.7 22.4 0.0 80.7 4.7 47.8 0.0 80.7 0.4 74.3 0.0 max-value 98.7 21.8 97.6 0.0 89.9 4.2 90.0 0.0 98.7 7.5 97.9 0.0 mean-rank 99.8 2.4 99.8 0.0 96.6 7.1 97.7 1.2 92.0 0.6 94.8 0.1 ranked-by-vote 92.1 44.5 90.5 0.0 96.9 34.6 95.9 0.0 94.0 1.0 99.9 0.0

H1R All SCF1 SCF2

Similarity- fusion

Group-fusion

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean-value 59.6 24.8 59.1 27.1 88.0 2.8 89.5 8.4 91.5 3.8 94.1 14.1 max-value 61.9 5.6 76.9 5.4 99.0 12.5 98.5 59.3 93.2 10.0 93.3 24.3 mean-rank 99.8 30.6 99.8 41.8 99.7 12.7 99.8 34.7 99.8 34.6 99.8 76.3 ranked-by-vote 40.0 27.0 51.8 0.0 77.2 6.1 38.8 1.5 94.8 3.4 45.6 0.4

5HT3AR All SCF1 SCF2

Similarity - fusion

Group-fusion

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean value

max value

mean rank

ranked by

vote

mean-value 89.6 2.0 64.2 3.9 87.5 2.0 61.2 3.8 89.0 5.2 72.0 3.7 max-value 32.6 0.2 40.9 0.0 44.5 2.2 52.8 7.9 18.2 5.1 17.8 1.9 mean-rank 92.8 35.5 93.1 9.8 92.6 40.3 92.4 14.2 90.2 67.4 87.9 7.6 ranked-by-vote 22.5 40.44 25.5 1.9 68.6 8.0 46.3 2.7 56.2 1.6 44.9 0.8

All: All similarity scores were included for group fusion. SCF1/SCF2: Only similarity values resulting from comparisons of reference and test compounds with dissimilar SCF1 or SCF2 were used for group fusion.

These results of our retrospective virtual fragment screening studies recommend the use of mean-value or mean-rank similarity fusion consensus scoring, as the performance of the individual similarity descriptors is rather target-dependent. For the same reason also other comparative virtual screenings suggest the use of similarity fusion methods.20,58 Mean-rank was already reported to be successful for similarity fusion.33,58-60. To best of our knowledge, this study is the first study showing that also the use of the mean-value is successful for similarity fusion.

Page 22: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

92

Conclusions

This study presents a detailed retrospective LBVS study of fragment-like compounds. It has been shown that LBVS is suitable for enriching of fragments and that also dissimilar scaffolds compared to the respective reference actives can be enriched. However our study also pointed out that the enrichment of especially very small fragments can be a challenging task. The use of a dataset with true actives and true inactives enabled us to perform an evaluation based on a consistent data set of solid experimental data. We tested several similarity descriptors with and without consensus scoring approaches. We have shown that consensus scoring methods are very effective to increase enrichment compared to the average enrichment. Ranked-by-vote and also (although less often) the max-value performed best for group fusion, mean-rank and mean-value performed best for similarity fusion. Our retrospective ligand-based virtual fragment screening analyses highly recommend the use of both group fusion and similarity fusion methods if multiple actives and similarity descriptors are available. For future research it will be of high interest to test the combination of group and similarity fusion approaches in a prospective virtual screening.

Supplementary information

Table S1. Enrichment factor at 1%FP for various similarity methods and group fusion methods

H1R All SCF1 SCF2

GF SD

AVG MEV MXV MER RBV AVG MEV MXV MER RBV AVG MEV MXV MER RBV

MACCS 31.6 68.0 85.6 22.7 83.5 27.8 60.8 80.4 15.5 81.8 24.0 38.1 58.4 15.5 55.7 ECFP_4 32.6 82.8 85.6 17.5 81.4 28.6 79.4 72.5 14.4 74.2 22.2 58.8 58.8 12.4 57.8 FCFP_4 30.0 74.6 85.6 23.7 86.6 25.6 43.3 73.2 18.6 70.8 18.7 51.5 70.1 15.5 57.4 ECFC_4 31.0 59.8 78.4 19.6 82.5 27.4 51.5 60.8 9.3 73.2 23.8 49.5 43.3 8.2 58.8 FCFC_4 26.0 54.6 74.2 36.1 70.1 21.7 41.2 40.2 17.5 66.0 18.3 13.4 28.9 10.3 41.2 EDprints 27.8 53.6 82.5 15.5 73.2 23.4 34.0 69.1 8.2 57.8 19.3 22.7 64.9 7.2 53.6 TAD 29.1 46.4 77.3 5.2 81.4 24.9 20.6 68.0 2.1 68.0 22.0 6.2 59.8 0.0 47.4 TGD 35.7 57.7 76.3 2.1 77.3 32.4 47.4 67.0 1.0 73.2 29.5 22.7 60.8 0.0 53.6 TAT 33.1 57.7 70.1 30.9 80.4 29.2 47.4 65.3 12.4 78.0 24.5 12.4 57.7 5.2 59.8 TGT 29.1 47.4 85.6 4.1 83.5 25.2 18.6 83.5 3.1 63.9 21.8 0.0 64.9 0.0 52.6 GpiDAPH3 24.8 59.8 83.2 22.7 77.3 20.5 39.2 76.3 22.7 68.0 17.9 27.8 61.5 22.7 39.5 piDAPH3 23.0 52.6 83.9 28.9 80.4 18.4 33.0 51.9 25.8 53.6 14.7 24.7 51.9 21.6 33.0 piDAPH4 20.9 46.4 2.8 9.3 76.3 16.1 22.7 1.1 9.3 40.3 12.4 8.2 0.8 5.2 27.8 ROCS 27.4 61.9 80.4 17.5 75.3 23.2 44.3 76.3 15.5 64.3 19.9 29.9 72.2 9.3 35.8

Page 23: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

93

Table S1. continued H1R

All SCF1 SCF2 GF

SD AVG MEV MXV MER RBV AVG MEV MXV MER RBV AVG MEV MXV MER RBV

MACCS 14.7 31.0 37.0 7.1 57.1 13.5 22.6 22.6 7.1 53.6 12.9 13.1 14.3 9.5 14.7 ECFP_4 10.7 46.4 29.8 14.3 47.6 9.5 35.7 15.5 10.7 43.0 8.9 26.2 16.7 10.7 10.7 FCFP_4 9.2 34.5 39.3 15.5 42.9 7.9 23.8 21.4 13.1 32.5 7.5 19.0 19.0 10.7 9.2 ECFC_4 10.3 19.0 44.0 4.8 47.6 9.0 14.3 21.4 4.8 38.1 8.4 10.7 11.9 4.8 10.3 FCFC_4 9.9 21.4 46.4 8.3 44.0 8.5 13.1 23.8 8.3 35.7 8.2 10.7 20.2 6.0 9.9 EDprints 12.7 31.0 28.6 19.0 47.6 11.4 29.2 19.0 14.3 37.0 11.1 25.0 11.9 11.9 12.7 TAD 11.1 20.2 39.3 9.5 35.7 9.6 13.1 28.6 9.5 22.6 9.3 13.1 28.6 9.5 11.1 TGD 11.4 16.7 35.7 9.5 38.1 10.0 14.3 22.6 9.5 11.9 9.8 16.7 27.4 9.5 11.4 TAT 13.5 32.1 33.3 14.3 48.8 12.2 32.1 28.6 14.3 45.2 11.8 31.0 27.4 14.3 13.5 TGT 13.0 29.8 40.5 11.9 53.6 11.7 15.5 23.8 9.5 34.5 11.2 15.5 9.5 8.3 13.0 GpiDAPH3 6.3 17.9 41.1 4.8 54.8 4.8 7.1 24.6 4.8 16.7 4.2 6.0 16.7 4.8 6.3 piDAPH3 9.4 16.7 31.9 15.5 42.9 8.1 16.7 23.8 16.7 37.7 7.4 15.5 25.6 15.5 9.4 piDAPH4 7.5 15.5 2.3 15.5 23.8 6.1 15.5 1.2 15.5 22.6 5.6 14.3 1.2 11.9 7.5 ROCS 12.4 28.6 32.1 7.1 45.2 11.3 22.6 21.4 6.0 42.9 10.6 17.9 19.7 4.8 12.4

5HT3AR All SCF1 SCF2

GF SD

AVG MEV MXV MER RBV AVG MEV MXV MER RBV AVG MEV MXV MER RBV

MACCS 6.0 12.5 16.7 4.2 18.8 5.7 8.3 13.5 4.2 17.7 4.7 5.2 6.3 4.2 8.3 ECFP_4 5.7 16.7 6.3 5.2 21.9 5.3 14.6 9.4 5.2 18.5 4.7 10.4 7.3 4.2 15.8 FCFP_4 5.1 12.5 10.4 0.0 16.7 4.8 10.4 9.4 0.0 16.7 4.4 8.3 14.6 0.0 12.7 ECFC_4 5.8 4.2 10.4 3.1 20.8 5.4 4.2 9.4 3.1 20.8 4.8 4.2 8.3 2.1 11.5 FCFC_4 5.4 4.2 7.3 2.1 15.6 4.9 4.2 11.5 2.1 17.7 4.5 2.1 7.3 2.1 11.5 EDprints 5.9 6.3 15.6 2.1 17.7 5.7 5.2 14.6 2.1 14.6 5.1 5.2 18.8 2.1 15.6 TAD 4.4 1.0 13.7 2.1 10.4 4.2 1.0 9.5 2.1 9.4 3.6 1.0 9.6 2.1 5.2 TGD 5.2 5.2 12.4 2.1 31.3 4.9 3.1 14.6 2.1 19.8 4.2 1.0 18.8 0.0 9.4 TAT 6.2 12.5 25.0 6.3 20.8 6.1 7.3 15.6 5.2 21.9 5.4 4.2 14.6 2.1 12.5 TGT 4.8 5.2 10.4 2.1 17.7 4.4 3.1 6.3 2.1 16.7 3.9 2.1 12.5 2.1 8.3 GpiDAPH3 5.6 12.5 7.7 5.2 32.3 5.3 12.5 10.4 5.2 25.3 4.7 7.3 6.3 3.1 22.3 piDAPH3 4.5 4.2 4.5 4.2 18.8 4.3 4.2 2.4 4.2 16.7 3.9 4.2 5.2 4.2 11.5 piDAPH4 4.4 9.4 0.9 3.1 7.3 4.1 8.3 0.7 4.2 4.8 3.8 5.2 0.8 4.2 5.1 ROCS 6.3 11.5 16.7 3.1 17.7 6.1 13.5 9.4 3.1 19.8 5.4 7.3 8.3 4.2 13.5 SD: similarity descriptor, GF: group fusion, AVG: average enrichment, MEV: mean-value, MXV: max-value, MER: mean-rank, RBV: ranked-by-vote, All: All similarity scores were included for group fusion. SCF1/SCF2: Only similarity values resulting from comparisons of reference and test compounds with dissimilar SCF1 or SCF2 were used for group fusion.

Page 24: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

94

Chembl Numbers

H4R:

100873, 108910, 121534, 248131, 248765, 312342, 312343, 312377, 312423, 312463, 312670, 312671, 312672, 312724, 312751, 312752, 312773, 312774, 312801, 312869, 312903, 312980, 352811, 410363, 431541, 431732, 431733, 431734, 431744, 431745, 431746, 431747, 457289, 529229, 529369, 529419, 529662, 529836, 529919, 553926, 568946, 569878, 569880, 569955, 570019, 570020, 570021, 570022, 570023

H1R:

100487, 1084132, 1085672, 1088882, 1088898, 1088899, 1090198, 1092259, 1092600, 1092649, 1125, 11603, 1222552, 12620, 129198, 13795, 1669412, 24000, 24001, 24031, 24316, 24517, 24665, 24835, 25198, 25414, 26064, 26116, 270177, 287052, 287367, 316951, 317171, 319829, 32813, 331188, 360953, 369075, 444353, 47482, 551340, 556506, 559251, 654, 715, 83731, 90, 99529

5HT3AR:

108799, 1110, 122903, 1276421, 131053, 136090, 13864, 14070, 1643880, 266591, 269158, 277120, 292066, 320963, 335877, 34291, 39, 40260, 404372, 41272, 42538, 44100, 46, 53929, 565547, 68281

References

1. Murray, C. W.; Rees, D. C. The rise of fragment-based drug discovery. Nature chemistry, 2009, 1, 187-192.

2. Warr, W. A. Fragment-based drug discovery. Journal of computer-aided molecular design, 2009.

3. de Kloe, G. E.; Bailey, D.; Leurs, R.; de Esch, I. J. Transforming fragments into candidates: small becomes big in medicinal chemistry. Drug discovery today, 2009, 14, 630-646.

4. Congreve, M.; Chessari, G.; Tisi, D.; Woodhead, A. J. Recent developments in fragment-based drug discovery. Journal of medicinal chemistry, 2008, 51, 3661-3680.

5. Fink, T.; Reymond, J. L. Virtual exploration of the chemical universe up to 11 atoms of C, N, O, F: assembly of 26.4 million structures (110.9 million stereoisomers) and analysis for new ring systems, stereochemistry, physicochemical properties, compound classes, and drug discovery. Journal of chemical information and modeling, 2007, 47, 342-353.

6. Loving, K.; Alberts, I.; Sherman, W. Computational approaches for fragment-based and de novo design. Current Topics in Medicinal Chemistry, 2010, 10, 14-32.

7. Crisman, T. J.; Bender, A.; Milik, M.; Jenkins, J. L.; Scheiber, J.; Sukuru, S. C. K.; Fejzo, J.; Hommel, U.; Davies, J. W.; Glick, M. ‚Virtual fragment linking': an approach to identify potent binders from low affinity fragment hits. Journal of medicinal chemistry, 2008, 51, 2481-2491.

8. Hann, M. M.; Leach, A. R.; Harper, G. Molecular complexity and its impact on the probability of finding leads for drug discovery. Journal of chemical information and computer sciences, 2001, 41, 856-864.

Page 25: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

95

9. Murray, C. W.; Verdonk, M. L.; Rees, D. C. Experiences in fragment-based drug discovery. Trends in pharmacological sciences, 2012, 33, 224-232.

10. Erlanson, D. A.; McDowell, R. S.; O'Brien, T. Fragment-based drug discovery. Journal of medicinal chemistry, 2004, 47, 3463-3482.

11. Matter, H.; Sotriffer, C. in Virtual Screening 319-358 (Wiley-VCH Verlag GmbH & Co. KGaA, 2011).

12. Koeppen, H.; Kriegl, J.; Lessel, U.; Tautermann, C. S.; Wellenzohn, B. in Virtual Screening 61-85 (Wiley-VCH Verlag GmbH & Co. KGaA, 2011).

13. Chen, Y.; Shoichet, B. K. Molecular docking and ligand specificity in fragment-based inhibitor discovery. Nature chemical biology, 2009, 5, 358-364.

14. de Graaf, C.; Kooistra, A. J.; Vischer, H. F.; Katritch, V.; Kuijer, M.; Shiroishi, M.; Iwata, S.; Shimamura, T.; Stevens, R. C.; de Esch, I. J.; Leurs, R. Crystal structure-based virtual screening for fragment-like ligands of the human histamine H(1) receptor. Journal of medicinal chemistry, 2011, 54, 8195-8206.

15. Istyastono, E. P. Computational studies of the histamine H4 recetor-lignad interactions. 2012. 16. Marcou, G.; Rognan, D. Optimizing fragment and scaffold docking by use of molecular

interaction fingerprints. Journal of chemical information and modeling, 2007, 47, 195-207. 17. Sirci, F.; Istyastono, E. P.; Vischer, H. F.; Kooistra, A. J.; Nijmeijer, S.; Kuijer, M.; Wijtmans, M.;

Mannhold, R.; Leurs, R.; de Esch, I.; de Graaf, C. Virtual fragment screening: discovery of histamine H3 receptor ligands using ligand-based and protein-based molecular fingerprints. Journal of chemical information and modeling, 2012, DOI: 10.1021/ci3004094.

18. Eckert, H.; Bajorath, J. Molecular similarity analysis in virtual screening: foundations, limitations and novel approaches. Drug discovery today, 2007, 12, 225-233.

19. Flower, D. R. On the Properties of Bit String-Based Measures of Chemical Similarity. Journal of chemical information and computer sciences, 1998, 38, 379-386.

20. Bender, A.; Jenkins, J. L.; Scheiber, J.; Sukuru, S. C.; Glick, M.; Davies, J. W. How similar are similarity searching methods? A principal component analysis of molecular descriptor space. Journal of chemical information and modeling, 2009, 49, 108-119.

21. Grant, J. A.; Gallardo, M. A.; Pickup, B. T. A fast method of molecular shape comparison: A simple application of a Gaussian description of molecular shape. J Comput Chem, 1996, 17, 1653-1666.

22. Hawkins, P. C. D.; Skillman, A. G.; Nicholls, A. Comparison of Shape-Matching and Docking as Virtual Screening Tools. Journal of medicinal chemistry, 2006, 50, 74-82.

23. Gillet, V. J.; Willett, P.; Bradshaw, J. Similarity searching using reduced graphs. Journal of chemical information and computer sciences, 2003, 43, 338-345.

24. Harper, G.; Bravi, G. S.; Pickett, S. D.; Hussain, J.; Green, D. V. The reduced graph descriptor in virtual screening and data-driven clustering of high-throughput screening data. Journal of chemical information and computer sciences, 2004, 44, 2145-2156.

25. Lessel, U.; Wellenzohn, B.; Lilienthal, M.; Claussen, H. Searching Fragment Spaces with Feature Trees. Journal of chemical information and modeling, 2009, 49, 270-279.

26. Kiss, R.; Sandor, M.; Gere, A.; Schmidt, E.; Balogh, G. T.; Kiss, B.; Molnar, L.; Lemmen, C.; Keseru, G. M. Discovery of novel histamine H4 and serotonin transporter ligands using the topological feature tree descriptor. Journal of chemical information and modeling, 2012, 52, 233-242.

27. Bajorath, J. Methods for Ligand-Based Virtual Screening. Frontiers in Medicinal Chemistry, Volume (4), 2009, 1.

28. Holliday, J. D.; Salim, N.; Whittle, M.; Willett, P. Analysis and display of the size dependence of chemical similarity coefficients. Journal of chemical information and computer sciences, 2003, 43, 819-828.

29. Nicholls, A. What do we know and when do we know it? Journal of computer-aided molecular design, 2008, 22, 239-255.

Page 26: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

96

30. Huang, N.; Shoichet, B. K.; Irwin, J. J. Benchmarking sets for molecular docking. Journal of medicinal chemistry, 2006, 49, 6789-6801.

31. Hu, G.; Kuang, G.; Xiao, W.; Li, W.; Liu, G.; Tang, Y. Performance evaluation of 2D fingerprint and 3D shape similarity methods in virtual screening. Journal of chemical information and modeling, 2012, 52, 1103-1113.

32. von Korff, M.; Freyss, J.; Sander, T. Comparison of ligand- and structure-based virtual screening on the DUD data set. Journal of chemical information and modeling, 2009, 49, 209-231.

33. Svensson, F.; Karlen, A.; Skold, C. Virtual screening data fusion using both structure- and ligand-based methods. Journal of chemical information and modeling, 2012, 52, 225-232.

34. Scior, T.; Bender, A.; Tresadern, G.; Medina-Franco, J. L.; Martinez-Mayorga, K.; Langer, T.; Cuanalo-Contreras, K.; Agrafiotis, D. K. Recognizing Pitfalls in Virtual Screening: A Critical Review. Journal of chemical information and modeling, 2012.

35. Thurmond, R. L.; Gelfand, E. W.; Dunford, P. J. The role of histamine H1 and H4 receptors in allergic inflammation: the search for new antihistamines. Nature reviews. Drug discovery, 2008, 7, 41-53.

36. Simons, F. E. R. Advances in H1-Antihistamines. New England Journal of Medicine, 2004, 351, 2203-2217.

37. Coruzzi, G.; Adami, M.; Guaita, E.; de Esch, I. J.; Leurs, R. Antiinflammatory and antinociceptive effects of the selective histamine H4-receptor antagonists JNJ7777120 and VUF6002 in a rat model of carrageenan-induced acute inflammation. Eur J Pharmacol, 2007, 563, 240-244.

38. Cowden, J. M.; Riley, J. P.; Ma, J. Y.; Thurmond, R. L.; Dunford, P. J. Histamine H4 receptor antagonism diminishes existing airway inflammation and dysfunction via modulation of Th2 cytokines. Respir Res, 2010, 11, 86.

39. Dunford, P. J.; Williams, K. N.; Desai, P. J.; Karlsson, L.; McQueen, D.; Thurmond, R. L. Histamine H4 receptor antagonists are superior to traditional antihistamines in the attenuation of experimental pruritus. J Allergy Clin Immunol, 2007, 119, 176-183.

40. Hsieh, G. C.; Chandran, P.; Salyers, A. K.; Pai, M.; Zhu, C. Z.; Wensink, E. J.; Witte, D. G.; Miller, T. R.; Mikusa, J. P.; Baker, S. J.; Wetter, J. M.; Marsh, K. C.; Hancock, A. A.; Cowart, M. D.; Esbenshade, T. A.; Brioni, J. D.; Honore, P. H4 receptor antagonism exhibits anti-nociceptive effects in inflammatory and neuropathic pain models in rats. Pharmacol Biochem Behav, 2010, 95, 41-50.

41. Lim, H. D.; Smits, R. A.; Leurs, R.; De Esch, I. J. The emerging role of the histamine H4 receptor in anti-inflammatory therapy. Curr Top Med Chem, 2006, 6, 1365-1373.

42. Rossbach, K.; Wendorff, S.; Sander, K.; Stark, H.; Gutzmer, R.; Werfel, T.; Kietzmann, M.; Baumer, W. Histamine H4 receptor antagonism reduces hapten-induced scratching behaviour but not inflammation. Exp Dermatol, 2009, 18, 57-63.

43. Thurmond, R. L.; Desai, P. J.; Dunford, P. J.; Fung-Leung, W. P.; Hofstra, C. L.; Jiang, W.; Nguyen, S.; Riley, J. P.; Sun, S.; Williams, K. N.; Edwards, J. P.; Karlsson, L. A potent and selective histamine H4 receptor antagonist with anti-inflammatory properties. J Pharmacol Exp Ther, 2004, 309, 404-413.

44. Derkach, V.; Surprenant, A.; North, R. A. 5-HT3 receptors are membrane ion channels. Nature, 1989, 339, 706-709.

45. Aapro, M. S. 5-HT3 receptor antagonists. An overview of their present status and future potential in cancer therapy-induced emesis. Drugs, 1991, 42, 551-568.

46. de Graaf, C.; Vischer, H. F.; de Kloe, G. E.; Kooistra, A. J.; Nijmeijer, S.; Kuijer, A.; Chow, K. Y.; England, P.; Leurs, R.; de Esch, I. J. P. Small and colourful tesserae make beautiful mosaics: fragment-based chemogenomics. Drug Discov. Today, 2012, accepted.

47. Verheij, M. H.; de Graaf, C.; de Kloe, G. E.; Nijmeijer, S.; Vischer, H. F.; Smits, R. A.; Zuiderveld, O. P.; Hulscher, S.; Silvestri, L.; Thompson, A. J.; van Muijlwijk-Koezen, J. E.;

Page 27: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

97

Lummis, S. C.; Leurs, R.; de Esch, I. J. Fragment library screening reveals remarkable similarities between the G protein-coupled receptor histamine H(4) and the ion channel serotonin 5-HT(3)A. Bioorganic & medicinal chemistry letters, 2011, 21, 5460-5464.

48. Chen, B.; Mueller  , C.; Willett, P. Combination Rules for Group Fusion in Similarity-Based Virtual Screening. Molecular Informatics, 2010, 29, 533-541.

49. Hert, J.; Willett, P.; Wilton, D. J.; Acklin, P.; Azzaoui, K.; Jacoby, E.; Schuffenhauer, A. Comparison of fingerprint-based methods for virtual screening using multiple bioactive reference structures. Journal of chemical information and computer sciences, 2004, 44, 1177-1185.

50. Hert, J.; Willett, P.; Wilton, D. J.; Acklin, P.; Azzaoui, K.; Jacoby, E.; Schuffenhauer, A. Enhancing the effectiveness of similarity-based virtual screening using nearest-neighbor information. Journal of medicinal chemistry, 2005, 48, 7049-7054.

51. Hert, J.; Willett, P.; Wilton, D. J.; Acklin, P.; Azzaoui, K.; Jacoby, E.; Schuffenhauer, A. New methods for ligand-based virtual screening: use of data fusion and machine learning to enhance the effectiveness of similarity searching. Journal of chemical information and modeling, 2006, 46, 462-470.

52. Schuffenhauer, A.; Floersheim, P.; Acklin, P.; Jacoby, E. Similarity metrics for ligands reflecting the similarity of the target proteins. Journal of chemical information and computer sciences, 2003, 43, 391-405.

53. Tovar, A.; Eckert, H.; Bajorath, J. Comparison of 2D fingerprint methods for multiple-template similarity searching on compound activity classes of increasing structural diversity. ChemMedChem, 2007, 2, 208-217.

54. Whittle, M.; Gillet, V. J.; Willett, P.; Alex, A.; Loesel, J. Enhancing the effectiveness of virtual screening by fusing nearest neighbor lists: a comparison of similarity coefficients. Journal of chemical information and computer sciences, 2004, 44, 1840-1848.

55. Whittle, M.; Gillet, V. J.; Willett, P.; Loesel, J. Analysis of data fusion methods in virtual screening: similarity and group fusion. Journal of chemical information and modeling, 2006, 46, 2206-2219.

56. Willett, P. Similarity-based virtual screening using 2D fingerprints. Drug discovery today, 2006, 11, 1046-1053.

57. Zhang, Q.; Muegge, I. Scaffold hopping through virtual screening using 2D and 3D similarity descriptors: ranking, voting, and consensus scoring. Journal of medicinal chemistry, 2006, 49, 1536-1548.

58. Baber, J. C.; Shirley, W. A.; Gao, Y.; Feher, M. The use of consensus scoring in ligand-based virtual screening. Journal of chemical information and modeling, 2006, 46, 277-288.

59. Wiggers, H. J.; Rocha, J. R.; Cheleski, J.; Montanari, C. A. Integration of Ligand- and Target-Based Virtual Screening for the Discovery of Cruzain Inhibitors. Molecular Informatics, 2011, 30, 565-578.

60. Tan, L.; Geppert, H.; Sisay, M. T.; Gütschow, M.; Bajorath, J. Integrating Structure- and Ligand-Based Virtual Screening: Comparison of Individual, Parallel, and Fused Molecular Docking and Similarity Search Calculations on Multiple Targets. ChemMedChem, 2008, 3, 1566-1571.

61. Gaulton, A.; Bellis, L. J.; Bento, A. P.; Chambers, J.; Davies, M.; Hersey, A.; Light, Y.; McGlinchey, S.; Michalovich, D.; Al-Lazikani, B.; Overington, J. P. ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Research, 2012, 40, D1100-D1107.

62. MOE v. 2010.10 ( Chemical Computing Group Inc, Montreal). 63. Todeschini, R.; Consonni, V.; Xiang, H.; Holliday, J.; Buscema, M.; Willett, P. Similarity

coefficients for binary chemoinformatics data: overview and extended comparison using simulated and real datasets. Journal of chemical information and modeling, 2012.

64. Pipeline Pilot v. 8.0 (Accelrys Software Inc., San Diego).

Page 28: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Chapter 5

98

65. Kooistra, A. J.; Binsl, T. W.; van Beek, J. H.; de Graaf, C.; Heringa, J. Electron density fingerprints (EDprints): virtual screening using assembled information of electron density. Journal of chemical information and modeling, 2010, 50, 1772-1780.

66. ROCS v. 2.3.2 ( OpenEye Scientific Software, Inc., Santa Fe, NM, USA). 67. Corina v. 3.46 (Molecular Networks GmbH, Erlangen, Germany). 68. OMEGA v. 2.3.2 ( OpenEye Scientific Software, Inc., Santa Fe, NM, USA). 69. Kirchmair, J.; Ristic, S.; Eder, K.; Markt, P.; Wolber, G.; Laggner, C.; Langer, T. Fast and

efficient in silico 3D screening: toward maximum computational efficiency of pharmacophore-based and shape-based approaches. Journal of chemical information and modeling, 2007, 47, 2182-2196.

70. de Graaf, C.; Rognan, D. Selective structure-based virtual screening for full and partial agonists of the beta2 adrenergic receptor. Journal of medicinal chemistry, 2008, 51, 4978-4985.

71. Durant, J. L.; Leland, B. A.; Henry, D. R.; Nourse, J. G. Reoptimization of MDL keys for use in drug discovery. Journal of chemical information and computer sciences, 2002, 42, 1273-1280.

72. Rogers, D.; Hahn, M. Extended-connectivity fingerprints. Journal of chemical information and modeling, 2010, 50, 742-754.

73. Strip-it v. 1.0.1 (Silicos-it Schilde, Belgium). 74. Bemis, G. W.; Murcko, M. A. The Properties of Known Drugs. 1. Molecular Frameworks.

Journal of medicinal chemistry, 1996, 39, 2887-2893. 75. Xu, Y. J.; Johnson, M. Using molecular equivalence numbers to visually explore structural

features that distinguish chemical libraries. Journal of chemical information and computer sciences, 2002, 42, 912-926.

76. Jain, A. N.; Nicholls, A. Recommendations for evaluation of computational methods. Journal of computer-aided molecular design, 2008, 22, 133-139.

77. Vogt, M.; Stumpfe, D.; Geppert, H.; Bajorath, J. Scaffold hopping using two-dimensional fingerprints: true potential, black magic, or a hopeless endeavor? Guidelines for virtual screening. Journal of medicinal chemistry, 2010, 53, 5707-5715.

78. Leurs, R.; Vischer, H. F.; Wijtmans, M.; de Esch, I. J. En route to new blockbuster anti-histamines: surveying the offspring of the expanding histamine receptor family. Trends in pharmacological sciences, 2011, 32, 250-257.

79. Thompson, A. J.; Lummis, S. C. The 5-HT3 receptor as a therapeutic target. Expert opinion on therapeutic targets, 2007, 11, 527-540.

80. Bajorath, J.; Peltason, L.; Wawer, M.; Guha, R.; Lajiness, M. S.; Van Drie, J. H. Navigating structure-activity landscapes. Drug discovery today, 2009, 14, 698-705.

81. Maggiora, G. M. On outliers and activity cliffs--why QSAR often disappoints. Journal of chemical information and modeling, 2006, 46, 1535.

82. Bender, A. in Chemoinformatics and Computational Chemical Biology Vol. 672 Methods in Molecular Biology (ed Jürgen Bajorath) Ch. 7, 175-196 (Humana Press, 2011).

83. Chen, B.; Harrison, R. F.; Papadatos, G.; Willett, P.; Wood, D. J.; Lewell, X. Q.; Greenidge, P.; Stiefl, N. Evaluation of machine-learning methods for ligand-based virtual screening. Journal of computer-aided molecular design, 2007, 21, 53-62.

84. Geppert, H.; Horvath, T.; Gartner, T.; Wrobel, S.; Bajorath, J. Support-vector-machine-based ranking significantly improves the effectiveness of similarity searching using 2D fingerprints and multiple reference compounds. Journal of chemical information and modeling, 2008, 48, 742-746.

85. Hammann, F.; Gutmann, H.; Baumann, U.; Helma, C.; Drewe, J. Classification of cytochrome p(450) activities using machine learning methods. Molecular pharmaceutics, 2009, 6, 1920-1926.

86. Renner, S.; Schneider, G. Scaffold-hopping potential of ligand-based similarity concepts. ChemMedChem, 2006, 1, 181-185.

Page 29: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

Full combinatorial consensus scoring for ligand-based virtual fragment screening at membrane bound receptors

99

87. Hessler, G.; Baringhaus, K.-H. The scaffold hopping potential of pharmacophores. Drug Discovery Today: Technologies, 2010, 7, e263-e269.

88. Martin, Y. C.; Muchmore, S. Beyond QSAR: lead hopping to different structures. QSAR & Combinatorial Science, 2009, 28, 797-801.

89. Schneider, G.; Schneider, P.; Renner, S. Scaffold-hopping: how far can you jump? QSAR & Combinatorial Science, 2006, 25, 1162-1171.

90. Smits, R. A.; Lim, H. D.; Hanzer, A.; Zuiderveld, O. P.; Guaita, E.; Adami, M.; Coruzzi, G.; Leurs, R.; de Esch, I. J. Fragment based design of new H4 receptor-ligands with anti-inflammatory properties in vivo. Journal of medicinal chemistry, 2008, 51, 2457-2467.

91. Smits, R. A.; de Esch, I. J.; Zuiderveld, O. P.; Broeker, J.; Sansuk, K.; Guaita, E.; Coruzzi, G.; Adami, M.; Haaksma, E.; Leurs, R. Discovery of quinazolines as histamine H4 receptor inverse agonists using a scaffold hopping approach. Journal of medicinal chemistry, 2008, 51, 7855-7865.

92. Smits, R. A.; Lim, H. D.; van der Meer, T.; Kuhne, S.; Bessembinder, K.; Zuiderveld, O. P.; Wijtmans, M.; de Esch, I. J.; Leurs, R. Ligand based design of novel histamine H(4) receptor antagonists; fragment optimization and analysis of binding kinetics. Bioorganic & medicinal chemistry letters, 2012, 22, 461-467.

Page 30: Chapter5 LBVS M8 - Vrije Universiteit Amsterdam 5.pdf · ‘ring_with_linkers_1’, herein further called scaffold 1 (SCF1), and ‘murcko_2’ scaffolds, herein further called scaffold

100