chapter 6 6.pdf · 127 structure-based prediction of gpcr-ligand function - chapter 6 6 6 g...
TRANSCRIPT
-
VU Research Portal
In silico Medicinal Chemistry
Kooistra, A.J.
2015
document versionPublisher's PDF, also known as Version of record
Link to publication in VU Research Portal
citation for published version (APA)Kooistra, A. J. (2015). In silico Medicinal Chemistry: Investigating GPCRs: key regulators of signal transductionand cell function.
General rightsCopyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright ownersand it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.
• Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal ?
Take down policyIf you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediatelyand investigate your claim.
E-mail address:[email protected]
Download date: 22. Jun. 2021
https://research.vu.nl/en/publications/355d81ed-8d02-4443-b34f-36ef824d1812
-
125
Structure-based prediction of GPCR-ligand function
Chapter 6
-
126
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
-
127
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
G protein-coupled receptors (GPCRs) are versatile membrane-bound switch nodes of cell function and constitute arguably one of the most important classes of proteins that is encoded in the human genome.373, 379 Next to stimulating (agonism), blocking (an-tagonism) or reducing constitutive activity (inverse agonism), GPCRs ligands are able to stim-ulate cell-signaling pathways that are completely independent of G-proteins.363, 380, 384 This new concept of GPCR biased-signaling is an intriguing phenomenon in which different ligands can promote or block multiple conformational states of the same receptor that are each linked to a different functional outcome.78, 380, 384 The first human GPCR crystal structures were only solved in the past seven years93, 176 and have further increased our understanding of the molecular details of how GPCRs work.73, 117, 176 These spectacular advances in GPCR structure determination have resulted in 109 crystal structures410 at this moment, covering 20 class A93, 176, 2 class B81, 82, 1 class C107 and 1 class F84 GPCRs. Despite these advances less than 2% of all human GPCRs have been crystallized, and most of these structures represent inactive-state conformations, with the exception of the adenosine A2A receptor, the neurotensin receptor type 1411, and the β-adrenoceptors 1 (β1R) and 2 (β2R).176 The 31 different β1R/β2R structures (while finalizing this chapter a 32nd, covalently bound, β-adrener-gic structure was released394) cover multiple receptor activation states in combination with 19 ligands (Table 6.1) with different functional effects (Table 6.1).102, 104, 105, 187, 193, 206, 302, 303, 386-388, 390-393 This unique GPCR structure-function data set offers a unique opportunity to assess the possibil-ities and limitations of the structure-based computational prediction of GPCR ligand function in virtual screening (VS) studies. GPCR crystal structures157, 158, 161, 163, 164, 173 and homology models115, 132 have been successfully used to discover new GPCR ligands. While in most VS studies new antagonists were found by docking into inverse agonist/antagonist bound GPCR crystal structures157, 158, 161, 163, 164, 173 or GPCR models based on the inactive bovine rhodopsin (bRho) structure91 and refined using known antagonists115, 132, several reports show that also agonists can be found by structure-based VS in inactive GPCR models.122, 141, 144, 146, 152, 156 In some of these studies,141, 146, 152 the initial bRho-based inactive state homology model was refined using true agonists, but in other cases122, 144 the inactive model was even refined by antagonists. The other way around, agonist-biased models have also been suc-cessfully applied to discover new antagonists.156 Previous docking-based VS studies149 against β2R crystal structures nevertheless indicated that the functional activity of in silico hits is the same as the functional activity of the co-crystallized ligand bound to the structure that is used for the docking simulations.149 While docking against the first carazolol (antagonist/inverse agonist) bound β2R crystal structure enabled the discovery of new β-adrenergic antagonists/inverse ago-nists164, 178, docking studies against the BI167107 (full agonist) bound active-state crystal structure facilitated the identification of partial/full agonists.395 Moreover, even before the determination of partial/full agonist bound β2R crystal structures104, 303, 388 several studies showed that the in-verse-agonist-bound β2R crystal structure could be used for the selective identification of ago-nists by introducing (small) conformational changes in the β2R structure and using customized scoring approaches to select docking poses.162, 276, 404, 412 In independent modeling studies162, 276, 404, 412 the position and/or orientation of the side chains of S2045.43 and S2075.46 (through rotation of
Co-authored by: Rob leurs, Iwan J.P. de Esch, and Chris de Graaf
Submitted for publication
-
128
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
TM5 and/or adjustment of the rotameric states of S2045.43 and S2075.46) was customized based on ligand SAR and receptor mutation studies,248, 263, 389, 408, 409, 413 thereby enabling the preferen-tial scoring of agonists over antagonists. The selective identification of partial/full agonists was significantly increased by employing an interaction fingerprint (IFP)45 scoring method that de-termines ligand-binding-mode similarity instead of classical (energy-based) scoring functions.162 These studies illustrate how (small) differences in protein conformation can influence struc-ture-based virtual screening results, as has also been demonstrated by recent comparative VS studies against GPCR crystal structures and homology models.157, 159, 166, 173 Systematic consider-ation of alternative protein conformations/flexibility in docking simulations can, for example, be taken into account either by incorporating flexibility for (selected) binding-site residues during the docking simulations414, 415, by creating binding-site ensembles143, by using multiple crystal structures416, or refining protein models with different ligands.115, 154, 417
GPCR crystal structure-based identification of new ligands with a different functional effect than the co-crystallized ligand has been defined as one of the challenges in the current era of GPCR structural and chemical biology.117 The growing number of β-adrenoceptor crystal structures meanwhile covers multiple activation states and multiple co-crystallized ligands with different efficacy classes.379, 383, 384 This unique GPCR structure-function data set allowed us for the first time to systematically and separately investigate the effect of both the reference ligands and the receptor conformations in combination with classical (energy-based)149, 418 and IFP-scoring45, 419 approaches. In this study we have investigated whether (and to what extent) i) the crystal structures retain their preference for ligands with the same functional effect as the co-crystal-lized ligand, ii) the different binding site conformations of the crystal structures have an impact on the outcome of VS studies, iii) specific IFPs can change or amplify the preference of a crystal structure for ligands with a specific functional effect and iv) the IFP derived from the predicted docking pose of a small agonist (norepinephrine) can be used for the selective retrieval of par-tial/full agonists (p/fAGO) over antagonist/inverse agonists (ANT/iAGO) and decoys.
6.1 Computational Methods
Crystal structures retrieval and preparation
All β-adrenergic crystal structures were downloaded from the PDB (last accession date, 4th of April 2014).61 Subsequently, all structures were superposed using MOE420 based on the resi-dues surrounding the ligands. Based on the initial superposed structures all (47) residues with-in 4.5Å of any ligand were determined as being the pocket residues: V90/M822.53, V94/862.57, G98/902.61, A99/912.62, L101/H932.64, V102/I942.65, G105/K972.68, W107/99, C114/1063.25, W117/1093.28, T118/1103.29, D121/1133.32, V122/1143.33, L123/1153.34, V125/1173.36, T126/1183.37, V172/T1644.56, S173/1654.57, C198/19045.49, C199/19145.50, D200/19245.51, F201/19345.52, V202/F19445.53, T203/19545.54, N204/1965.35, Y207/1995.38, A208/2005.39, I209/2015.40, S211/2035.42, S212/2045.43, I214/V2065.45, S215/2075.46, F216/2085.47, W303/2866.48, F306/2896.51, F307/2906.52, N310/2936.55, N313/H2966.58, V314/2976.59, D322/K3057.32, F325/Y3087.35, V326/I3097.36, A327/L3107.37, N329/3127.39, W330/3137.40, G332/3157.42, and Y333/3157.43.The first and second residue number refer to the Uniprot residue number of β1R and β2R, respec-tively, while the Ballesteros-Weinstein83 and extracellular loop 2 (ECL2)160 residue numbers are reported as superscript. An in-house protocol similar to the KLIFS protocol370 was performed to align all β-adrenergic crystal structures based on the selected 47 pocket residues, protonate the structures and separately extract the ligand, pocket, protein and (if present) waters, ions, or-
-
129
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
ganometallics and cofactors in MOL2 format. All structures were subsequently inspected using MOE and when necessary the protein-hydroxyl groups were adjusted for optimal H-bonding between the co-crystallized ligand and the receptor.
Consideration of alternative ligand ring conformations in 3ZPQ
The ring conformation of the fragment-like arylpiperazine ligand 19 in chain B of the β1R crystal structure (PDB-code 3ZPQ) did not allow for the secondary amine in the piperazine moiety to interact with both N329/3127.39 and D121/1133.32 due to its twist conformation. We therefore re-docked the co-crystallized ligand allowing for all possible ring conformations and selected the highest-ranking pose, which is in the more stable chair conformation421 (named 3ZPQ_dock, see Figure S6.1).
Pocket and ligand analyses
Pocket volumes were calculated using POVME422 (version 1.1.0) with a grid spacing of 0.5Å and a padding of 1.09Å. The initial inclusion volume was generated based on three spheres with a radius of 7.0Å of which the centers were based on the superposed co-crystallized ligands (blue volume in Figure S6.7). RMSD values between the pockets from the crystal structures were calculat-ed using MOE. Distance calculations within the quadruplet (formed by G98/902.61, D121/1133.32, N329/3127.39 and S211/2035.42) and the subsequent Z-score analyses were performed using an in-house script. The residues that form the quadruplet were selected based on their location and function in the binding pocket: G2.61 is opposite of S5.42 (an important interaction point for polar groups, in particular for agonists), N7.39 is opposite of D3.32 (both are key interaction points for the ligand amine group), the N7.39/D3.32 pair is perpendicular to the G2.61/S5.42 pair (Figure S6.3), and all four residues are roughly positioned at the same height in the TM helices with respect to the membrane. The 2D similarity of the co-crystalized ligands was assessed using Extended Connectivity fingerprint (ECFP-4) in the Pipeline Pilot Suite385 and EDprints30 (in-house imple-mentation).
Compound preparation and docking
26 ligands (13 f/pAGO and 13 ANT/iAGO) and 980 decoy compounds were obtained from the supporting information accompanying the publication from de Graaf et al..162 In total 24 addition-al β-adrenergic ligands (12 f/pAGO and 12 ANT/iAGO) were selected based on the classification of Baker.333, 423 Molecular (2D) structures of all ligands are shown in Figure S6.4. All compounds were prepared starting from the SMILES format by first protonating all compounds at physiologi-cal pH using Chemaxon's Calculator (version 5.1.4)364 and were subsequently converted to MOL2 format using CORINA (version 3.49).365, 424 All molecular docking simulations were carried out in fivefold using PLANTS345 (version 1.2) using the (default) ChemPLP scoring function, search speed 2, flipping of free ring corners enabled, and generating 99 poses for every compound with a clustering RMSD of 1.0. The docking experiments were performed in fivefold in order to reduce the effect of the stochastic nature of PLANTS345, as the use of a stochastic algorithm (as used in multiple docking algorithms, e.g. PLANTS345, GOLD425, and QXP426) can result in different docking poses for the same compound when performing multiple docking runs.The binding site was based on the center of carazolol in β2R (PDB-code 2RH1) with a binding-site radius of 15Å, thereby covering all pocket residues (except for G105/K972.68, which is pointing away from the binding site). In total 1030 compounds were docked in 48 β-adrenergic structures yielding almost 24.5 million docking poses in total.
-
130
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
Interaction Fingerprint calculation and scoring
From the individual pockets and ligands, that were extracted from the crystal structures, in-teractions fingerprints were calculated through application of the FingerPrintLib (version 2.2, downloaded from http://bioinfo-pharma.u-strasbg.fr, using OpenEye’s OEChem Toolkit).45, 427 All docking poses were post-processed with IFP by combining the docking poses with the consistent pocket residue selection as indicated earlier. The IFPs from all docking-poses were scored using all 40 reference IFPs resulting in ~979 million IFP-scores representing their interaction profile similarity (as calculated by the Tanimoto similarity coefficient).
Virtual screening assessment
Based on the PLANTS and IFP scores of all docking poses the highest scoring docking poses according to each of the scoring methods were selected. The resulting hit lists were used to rank both the f/pAGO and the ANT/iAGO ligands as well as the decoys and determine the enrichment factor (EFx, where x is the false-positive rate) for each reference IFP (or ChemPLP score) and docking structure combination. The EFs were calculated by determining the true-positive rate (TP-rate, i.e. the percentage of f/pAGO and ANT/iAGO respectively) at a 1% false-positive rate347 (FP-rate, i.e. 1% of the decoys, see Figure 6.3 and 6.5a/b) resulting in the EF1% of 1968 scoring combinations (40 reference IFP scores and 1 PLANTS score combined with 48 structures) for both the ranking of f/pAGO and ANT/iAGO ligands (see Figure 5).
6.2 Subtle ligand-type-dependent structural changes in the receptor binding site
There currently are 31 β-adrenergic structures; 15 human β2R and 16 turkey β1R structures (Table 6.1).102-105, 187, 193, 206, 302, 303, 386-388, 390-393 While finalizing this chapter a 32nd β-adrenergic structure was released394 with a covalently-bound agonist that forms similar H-bond interactions with TM5 as the epinephrine (PDB-code 4LDO)103 and has the same covalent linker as FAUC50 (PDB-code 3PDS).388 Multiple crystallization methods have been employed to obtain stable crystals for structure determination of these receptor-ligand complexes,93 amongst them T4 lysozyme insertion at the intracellular loop (ICL) 3 or at the N-terminus, introduction of thermostabilizing mutations, introduction of covalently binding ligands, and the addition of antibody fragments.428 All crystal structures were obtained in combination with one of the 19 unique ligands (apart from the ligand-free structure,393 see Table 6.1 and 6.2); 9 (partial/full) inverse agonists, 7 (partial/full) agonists, 1 covalently-bound full agonist, and 2 unvalidated fragments (but suggested to be antagonists,392 vide infra). Throughout this manuscript we abbreviate full and partial agonists as f/pAGO, antagonists as ANT and (full/partial) inverse agonists as iAGO (see legend Table 6.1). It should be noted that we did not focus on the differences between β1R and β2R in this manuscript, as the pocket-lining residues are almost identical (only residue 7.35 differs, a phenylalanine in β1R but tyrosine in β2R) we assume that the ligand-binding modes in β1R and β2R are transferable.429 Also, in the set of ligands that we analyzed there is only one highly selective compound (ICI 118,551, which has a ~540 fold higher affinity for β2R than for β1R).423 Moreover, it is suggested that selectivity for one of the receptors333, 423, 430 is based on ligand kinetics431 (i.e. binding to and dissociation from the receptor) and receptor activation429 rather than differences in ligand binding modes.431
-
131
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
Table 6.1. Overview of all β-adrenoceptor crystal structures, the co-crystallized ligands and their function.
PDB Ligand Funct-iona,p Biase PDB Ligand Funct-iona,p Bias
e
3NYA193 OOH
NH2+
Alprenolol
ANT
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Yesj,k
Nof,i3ZPR392
N
N+H2N
Arylpiperazine 20
ANTb●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
-
4AMI105
N
OOH
NH2+
HN
Bucindolol
ANTo
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Yesg,m3P0G303,q
3SN6104,q
4LDE103,q
O
OHNH
O
NH2+ OH
BI-167107
fAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Yesn
2YCW390
2RH1102
4GBR391
2R4R386,d
2R4S386,d
3KJ6206,d
NH
ONH2+ OH
Carazolol
iAGOo●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Noh 2Y02302
OHNH2+
O OHNH
O
Carmoterol
fAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Yesm,n
4AMJ105OH
O
NH
NH2+
O
O
Carvedilol
iAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Yesg,j,m
Nof2Y00302
2Y01302
Dobutamine
pAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Yesf
Nol
2VT4387
2YCX390
2YCY390
4BVN101
NH
ONH2+
N
OH
Cyanopindolol
iAGOo●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
- 4LDO103,q OH
OH
OHNH2+
Epinephrine
fAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Nof,g,l
3NY8193 NH2+
OOH
ICI 118,551
iAGO●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Nof,h 3PDS388,c
FAUC50
fAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
-
2YCZ390
N
INH
ONH2+ OH
Iodocyanopindolol
iAGO●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
- 4LDL103,qOH
OH
OHNH2+
HO
Hydroxybenzyl-isoproterenol
fAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Yesn
3D4S187
O
N NS
NOOH
NH2+
Timolol
iAGO●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Nof 2Y03302 OH
OH
OHNH2+
Isoproterenol
fAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Nof,g,-h,i,j,k,l
OHOH
NH2+
HO
NH2+ OH
OO OH
S O
NH
-
132
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
PDB Ligand Funct-iona,p Biase PDB Ligand Funct-iona,p Bias
e
3NY9193
OO
O
OOH
NH2+
VS hit (Kolb)
iAGO●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
- 2Y04302OH
OH
OHNH2+
Salbutamol
pAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Nol
3ZPQ392 N+H2N
NH
Arylpiperazine 19
ANTb●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
- - OH
OH
OH
+H3N
Norepinephrine
fAGO
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−freep/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
Nog
4GPO393 Ligand-free
a) Function abbreviations: ANT – antagonist, AGO – agonist, i – inverse, p – partial, f – full. b) Not validated.392 c) covalent-ly-bound ligand. d) Ligand and extracellular part of the receptor were not resolved due to weak electron density. e) This column indicates whether a ligand has a signaling preference for β-arrestins over G-proteins or not according to f) Casella et al.430, g) Drake et al.432, h) Kahsai et al.433, i) Kaya et al.434, j) Kim et al.435, k) Liu et al.205, l) Rajagopal et al.436, m) Warne et al.105 and n) Weiss et al.395. o) Although these ligands are considered antagonists/inverse agonists, they can also have a partial agonist effect depending on the activation state of the receptor.333, 437 p) Color coding: Full/partial agonist (red), antagonist/inverse agonist (blue), or unvalidated antagonist (cyan). Ligands with a circle do not show a signaling preference or it is not known for these compounds, ligands with triangular icons have been shown to have a β-arrestin signaling bias. q) The receptor is in its active state.
Superposition of all individual monomers of the crystal structures of β1R and β2R based on their pocket residues (see Materials and Methods) shows the conserved heptahelical fold, but also indicates some structural variation, in particular in TM1 (a 60 degree kink)387, 390, TM5 and TM6 (which differ between active and inactive states104, 303 and the straight and bent inactive-state TM6390). While the structural differences in the intracellular regions have been described in sev-eral reviews73, 176, our systematic analysis will focus on the structural variations in the ligand-bind-ing site within the β-adrenergic GPCR family.Figure 6.2a/b shows that the pocket residues in all (β1R and β2R) crystal structures have largely conserved conformations. The outliers are the FAB-complexes (PDB-codes: 2R4R, 2R4S386 and 3KJ6206) in which the ECL domain, the top of the helices, and the ligand (carazolol) were not solved due to the distorted density in this region.206, 386 When comparing the pocket residues by means of the mutual RMSD values (Table S6.2, Figure 6.2c), they are generally low (average all-atom RMSD 0.9Å). By determining the average RMSD of the binding site residues of each individual structure to all f/pAGO and all ANT/iAGO structures it becomes apparent that all f/pAGO structures and all inactive-state ANT/iAGO structures cluster together (Figure 6.2c). This means that the binding site conformation within the inactive-state f/pAGO structures, but also within the ANT/iAGO structures is conserved. Moreover, the active-state structures form a sep-arate cluster compared to other f/pAGO structures, which can be ascribed to the movement of TM5 and TM6104, 303 (Table 6.1, Table S6.2, Figure S6.2).438 Outliers are the FAB-complexes (due to the distorted electron density), but also the ligand-free structure (4GPO393), which have a high RMSD value compared to the other structures. Interestingly, this apo structure does not cluster with any of the f/pAGO structures or with ANT/iAGO structures.To further quantify the differences between the pockets we measured the distances between the Cα atoms of G2.61, D3.32, N7.39, and S5.42 (Figure 6.1c, Figure S6.3, and Table S6.3). While the mea-surements show contraction of the pocket upon agonist binding302 (mainly through movement of TM5), the highest contraction of the pocket was found for the ligand-free β1R structure.393
-
133
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
RMSD
a
(f/pAGO)
RMSD
a
(ANT/iAGO
)
Distanceb
Volumec
Receptor
2RH1_A 0.8 0.6 0.2 377 β22VT4_A 0.8 0.6 0.2 492 β12VT4_B 0.9 0.7 0.1 442 β12VT4_C 0.9 0.7 0.1 442 β12VT4_D 0.8 0.6 0.2 493 β12YCW_A 0.7 0.5 -0.2 411 β12YCW_B 0.8 0.6 -0.2 409 β12YCX_A 1.0 0.7 0.2 459 β12YCX_B 1.0 0.7 0.1 464 β12YCY_A 0.8 0.6 0.0 437 β12YCY_B 0.8 0.6 -0.1 441 β12YCZ_A 0.9 0.6 0.2 469 β12YCZ_B 0.9 0.6 0.2 472 β13D4S_A 0.8 0.6 0.2 278 β23NY8_A 0.9 0.7 0.8 387 β23NY9_A 0.8 0.6 0.5 299 β23NYA_A 0.9 0.7 0.5 248 β24AMI_A 0.7 0.6 -0.2 411 β14AMI_B 0.6 0.6 -0.2 396 β14AMJ_A 0.7 0.5 0.5 488 β14AMJ_B 0.7 0.5 0.3 424 β14BVN_A 0.7 0.5 0.5 302 β14GBR_A 0.9 0.7 0.8 360 β23ZPQ_A 0.7 0.5 0.0 445 β13ZPQ_B 0.7 0.5 0.0 391 β13ZPR_A 0.7 0.5 0.2 460 β13ZPR_B 0.7 0.5 0.1 414 β12Y00_A 0.6 0.6 -0.6 368 β12Y00_B 0.6 0.6 -0.7 325 β12Y01_A 0.6 0.6 -0.4 435 β12Y01_B 0.6 0.6 -0.6 319 β12Y02_A 0.6 0.7 -0.4 430 β12Y02_B 0.6 0.7 -0.5 389 β12Y03_A 0.7 0.7 -0.1 462 β12Y03_B 0.6 0.6 -0.4 348 β12Y04_A 0.6 0.6 0.0 493 β12Y04_B 0.6 0.5 -0.5 406 β13P0G_A 0.9 1.1 -0.6 270 β23PDS_A 0.9 0.7 -0.6 293 β23SN6_Rd 1.0 1.1 -0.7 388 β24LDE_A 0.9 1.2 -0.8 284 β24LDL_A 0.9 1.2 -1.0 273 β24LDO_A 1.0 1.2 -1.0 236 β24GPO_A 1.0 0.9 -1.5 282 β14GPO_B 1.0 0.9 -1.5 283 β12R4R_A 2.4 2.2 2.6 - β22R4S_A 2.4 2.2 2.7 - β23KJ6_A 2.2 2.0 1.4 - β2
H2.64
S5.42
S5.43
S5.46
D45.51
F45.52
Y5.38
D3.32
K/D7.32
Y/F7.35
H2.64
W3.28
F6.51
N6.55
F45.52
A5.39
V3.33
V3.36 W6.48 Y7.43
I7.36
b
a c
Figure 6.1 A visual (a-b) and quantitative (c) analysis of the pocket residues from all X-ray structures. Pocket residues (carbon atoms are colored blue, salmon, white and gray for ANT/iAGO, f/pAGO, FAB-complexed and ligand-free structures respectively) of all chains of all crystal structures (listed in C by their PDB-code followed by their chain identifier) are shown superimposed with the cartoon representation of β2R (2RH1102) and bucindolol (green carbon atoms, 4AMI105), as seen from the side (a) and from the top (b). An overview (c) of a) Average all-atom RMSD using the pocket residues (Å) compared to all 16 f/pAGO and all 27 ANT/iAGO structures (full overview in Table S6.2), b) the average Z-score of the distances between G2.61, D3.32, N7.39 and S5.42 (full overview in Table S6.3, Figure S6.3) and c) the pocket volume (Å3, Figure S6.4). d) The side chains of the ECL2 residues are not all fully resolved, thereby influencing the pocket volume (Figure S6.7) as well as RMSD values. The red and blue background coloring mark values associated with f/pAGO and ANT/iAGO properties, respectively. Icons: f/pAGO (red), ANT/iAGO (blue), or unvalidated ANT(cyan) with no or unknown signaling preference (circle), or β-arrestin biased ligands (triangle) (see Table 6.1).
Note, however, that these differences are small and that mainly the distance between D3.32 and N7.39 to S5.42 varies between the different structures (on average 0.6Å and 1.0Å, respectively). Analysis of volumes of the individual binding pockets (Figure 6.1c, Figure S6.7) indicates that β1R generally has a bigger pocket volume than β2R (on average 415 Å3 compared to 308 Å3). This can mainly be ascribed to the residue differences at positions 7.32 (D/K in β1R/2 respectively) and 7.35 (F/Y in β1R/2 respectively) resulting in a more confined ligand-binding space in β2R (Figure
-
134
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
6.1, Figure S6.7). Although the RMSD values between all pocket residues are low (Figure 6.1c, Table S6.7), the effect of all subtle changes combined is reflected by the large differences in the pocket volumes. For the β1R and β2R the pocket volumes range from 282 to 493 Å3 and from 236 to 388 Å3, respectively (only comparing structures with fully resolved pocket residues). In line with the results of the distance measurements and the RMSD calculations, the ligand-free structure has the smallest pocket volume of all β1R structures. The active-state adrenaline-bound structure (4LDO103) has the smallest pockets volume of all β2R structures, adapted to this small endogenous agonist.
6.2.1 Identification of ligand-type specific molecular interaction profiles
The binding modes and interaction profiles of the 19 different co-crystallized ligands (Table 6.1 and 6.2) were subsequently investigated using the aforementioned IFP method.45 This technique encodes 7 different types of interactions between the ligand and each of the binding pocket residues. The following interactions with the ligand are encoded: hydrophobic contact, aromatic face-to-edge and face-to-face, H-bond acceptor-donor and donor-acceptor, negative-positive and positive-negative ionic interaction (as shown in Figures 6.2 and 6.5).45 The IFPs for all ligands in all monomers were calculated for the available β-adrenergic crystal structures. The 31 crystal structures contain in total 48 monomers (43 ligand-bound). Of the 14 crystallized ligand-pro-tein complexes containing more than one monomer, only 3 structures had identical interaction patterns within different monomers of the same crystal structure (see Figure 6.2). Moreover, ligands present in multiple co-crystal structures (e.g. cyanopindolol, carazolol, BI-167107) did also not yield identical IFPs. This emphasizes once more that crystal structures are snapshots in time439 of entities that are flexible by nature. The overview of all generated reference IFPs from the ligand-bound crystal structures (Figure 6.2) shows that there are several conserved interactions within the pocket. Apart from dobu-tamine and arylpiperazines 19 and 20, all co-crystallized β1R and β2R ligands contain an etha-nolamine moiety (which is abundant in β-adrenergic ligands368) that can form a tight H-bond interaction network with the sidechains of D3.32 and N7.39 (Figures 6.2, 6.5a/b).367, 389 In chain B of the arylpiperazine 19 structure the ligand does not form H-bonds with N7.39 nor with D3.32. We therefore redocked the co-crystallized ligand into its structure considering alternative ring conformations, resulting in an additional IFP (annotated as 3ZPQ_dock, see Figure 6.2, Figure S6.1, and Materials and Methods).Most β-adrenergic ligands contain an H-bond donor with which they are able to interact with S211/2035.42 (74%, Figure 6.2, Table S6.8).429 More specific for full agonists (carmoterol, iso-proterenol, BI-167107 and FAUC50) is a catechol or a catechol-mimicking moiety with two (or more) H-bond donors that allows for an extra H-bond with another TM5 serine, namely S215/2075.46.440 Partial agonists also have two H-bond donors that can interact with TM5, but their interactions seem to be less optimal (e.g. the shortest observed distance between the H-bond donor from the catechol in dobutamine to the S5.46 acceptor is 3.8Å).429 All ligands also contain at least one aromatic ring, which allows for aromatic stacking with F307/2906.52, F306/2896.51, and F201/19345.52 in 95%, 56% and 54% of all structures respectively. We furthermore observed aromatic interactions with W117/1093.28 for many agonists (60%), ionic interactions with the key ionic anchor D121/1133.32 for all ligands and double H-bond interactions with N329/3127.39 as de-scribed above (see Table S6.8).
-
135
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
Liga
ndAl
pren
olol
3NYA
_Aβ2
89%
25%
Buci
ndol
ol4A
MI_A
β147
%31
%Bu
cind
olol
4AMI
_Bβ1
47%
38%
Cara
zolo
l2R
H1_A
β284
%6%
Cara
zolo
l2Y
CW_A
β189
%13
%Ca
razo
lol
2YCW
_Bβ1
89%
0%Ca
razo
lol
4GBR
_Aβ2
63%
0%Ca
rved
ilol
4AMJ
_Aβ1
42%
13%
Carv
edilo
l4A
MJ_B
β126
%6%
Cyan
opin
dolo
l2V
T4_A
-Dβ1
89%
44%
Cyan
opin
dolo
l2Y
CX_A
β174
%0%
Cyan
opin
dolo
l2Y
CX_B
β174
%13
%Cy
anop
indo
lol
2YCY
_Aβ1
95%
25%
Cyan
opin
dolo
l2Y
CY_B
β184
%13
%Cy
anop
indo
lol
4BVN
_Aβ1
89%
50%
ICI51
18,5
513N
Y8_A
β279
%25
%Io
docy
anop
indo
lol2Y
CZ_A
-Bβ1
95%
31%
Tim
olol
3D4S
_Aβ2
84%
31%
VS5h
it5(K
olb)
3NY9
_Aβ2
74%
6%Ar
ylpi
pera
zine
519
3ZPQ
_Aβ1
11%
0%Ar
ylpi
pera
zine
519
3ZPQ
_Bβ1
21%
0%Ar
ylpi
pera
zine
519
3ZPQ
_doc
kβ1
58%
6%Ar
ylpi
pera
zine
520
3ZPR
_Aβ1
68%
0%Ar
ylpi
pera
zine
520
3ZPR
_Bβ1
42%
0%BI
G167
107
3P0G
_Aβ2
16%
63%
BIG1
6710
73S
N6_R
β258
%56
%BI
G167
107
4LDE
_Aβ2
47%
69%
Carm
oter
ol2Y
02_A
β116
%75
%Ca
rmot
erol
2Y02
_Bβ1
16%
81%
Dobu
tam
ine
2Y00
_Aβ1
0%38
%Do
buta
min
e2Y
00_B
β111
%38
%Do
buta
min
e2Y
01_A
β10%
56%
Dobu
tam
ine
2Y01
_Bβ1
5%50
%Ep
inep
hrin
e4L
DO_A
β20%
50%
FAU
C50
3PDS
_Aβ2
0%44
%Hy
drox
yben
zylIS
O4L
DL_A
β211
%63
%Is
opro
tere
nol
2Y03
_A-B
β126
%75
%Sa
lbut
amol
2Y04
_Aβ1
42%
88%
Salb
utam
ol2Y
04_B
β163
%81
%63
%N
orep
inep
hrin
ea2Y
03_A
β10%
63%
Hydr
opho
bic5
cont
act
Arom
atic
5face
GtoG
face
Arom
atic
5edg
eGto
Gface
Hbon
d5(r
esid
ue5d
onor
)Hb
ond5
(res
idue
5acc
epto
r)Io
nic5
(res
idue
5pos
itive
)Io
nic5
(res
idue
5neg
ativ
e)
W10
73.
285.
466.
48
GL/
HV/
IW
99W
6.51
45.5
145
.50
6.52
45.5
245
.54
5.38
5.39
5.42
5.43
7.40
7.43
6.55
7.35
555555
555Re
sidu
e5C
hain
555555
555555
555555
5552.61
2.64
2.65
7.36
7.39
T
3.32
3.33
3.36
3.37
3.29
D
TM3
ECL2
TM5
FT
YA
SS
DV
VT
CS
Receptor
TM2
ECL1
ANT/iAGO5IFP5≥50.6
f/pAGO5IFP5≥50.6
TM7
V/I
NW
YF/
Y
TM6
WF
FN
Figu
re 6
.2 A
n ov
ervi
ew o
f the
uni
que
inte
ract
ion
finge
rprin
ts o
f all
co-c
ryst
alliz
ed li
gand
s. Th
e co
lors
indi
cate
the
pres
ence
of a
n in
tera
ctio
n (a
s see
n fr
om th
e re
sidue
) ac
cord
ing
to th
e co
lors
des
crib
ed a
t the
bot
tom
of t
he fi
gure
. Ide
ntic
al IF
Ps fo
r mul
tiple
mon
omer
s w
ithin
a P
DB-
entr
y ar
e gr
oupe
d (e
.g. 2
VT4_
chai
nA-D
). Th
e la
st
two
colu
mns
des
crib
e th
e am
ount
of t
imes
(as a
per
cent
age
of th
e to
tal c
ompa
rison
s) a
n IF
P co
mpa
rison
resu
lts in
a sc
ore ≥
0.6
whe
n co
mpa
red
with
the
ANT/
iAG
O
refe
renc
e IF
Ps (a
blu
e ba
ckgr
ound
indi
cate
s a h
igh
perc
enta
ge) a
nd th
e f/
pAG
O re
fere
nce
IFPs
(a re
d ba
ckgr
ound
indi
cate
s a h
igh
perc
enta
ge).
a) Th
e IF
P of
the
high
est
scor
ing
dock
ing
pose
of n
orep
inep
hrin
e in
2Y0
3-ch
ainA
(see
Fig
ure
6.6)
. Nam
es a
nd ic
ons:
f/pA
GO
(red
), AN
T/iA
GO
(blu
e), o
r unv
alid
ated
AN
T392
(cya
n) w
ith n
o or
un
know
n sig
nalin
g pr
efer
ence
(circ
le),
or β
-arr
estin
bia
sed
ligan
ds (t
riang
le) (
see
Tabl
e 6.
1).
-
136
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
All interaction profiles (IFPs) of co-crystallized ligands (8 f/pAGO and 11 ANT/iAGO) in 33 β1R and 15 β2R binding sites were compared and their Tanimoto similarity scores were determined (Table S6.4). Figure 6.2c reports for each individual IFP the percentage of similar IFPs (Tanimoto similarity score ≥ 0.645) derived from (other) ANT/iAGO and f/pAGO co-crystal structures com-plexes. This analysis shows that the IFPs of ANT/iAGO ligands are more similar to each other than to f/pAGO IFPs (75% versus 19% similar pairs respectively). Vice versa, the pairwise similarity between f/pAGO IFPs is higher (62%) than similarity to ANT/iAGO (62% versus 21% respectively, Figure 6.2). Moreover, the arylpiperazine ligands cluster together with the ANT/iAGO IFPs (on average 40% similar pairs compared to 1% similarity with f/pAGO IFPs) suggesting that these compounds are antagonists, as previously proposed.392 These analyses show that the functional effect of a ligand is encoded in the ligand-protein interaction patterns, and suggest that these IFPs can be used to discriminate ANT/iAGO from f/pAGO ligands in structure-based virtual screening studies.
Figure 6.3 Overall enrichment at 1% false positive rate (FP-rate) for the retrieval of 25 partial/full agonists and 25 inverse agonists/antagonists over a set of 980 decoy molecules using IFP scoring (a). Full ROC curves visualizing the retrieval rate (TP-rate) of f/pAGO (red) and ANT/iAGO (blue) in the best ANT/iAGO structure (b) and best f/pAGO structure (c, legend shown in b). The structures are indicated by their PDB code followed by an underscore and the chain identifier (except when there was only one chain or all chains of the structure had similar performance). The 2D structures represent the co-crystallized ligand for selected structures. The ANT/iAGO-axis is scaled from 0 to 60 and the f/pAGO axis from 0 to 100. *1) 3ZPQ_A, 3ZPQ_dock, 2Y00_B. *2) 4AMJ_B, 3ZPQ_B. Icons: fpAGO (red), ANT/iAGO (blue), or unvalidated ANT392 (cyan) with no or unknown signaling preference (circle), or β-arrestin biased ligands (triangle) (see Table 6.1).
6.2.2 Selective retrieval of f/pAGO over ANT/iAGO in structure-based virtual screen-ing
In order to quantify to what extent the differences between the individual structures and the reference IFPs have an influence on the outcome and selectivity of a VS, we docked a set of 1030
-
137
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
compounds comprising 25 f/pAGO, 25 ANT/iAGO and 980 physicochemically similar decoys162 in all β1R and β2R structures. The docking studies were performed using PLANTS345 using the ChemPLP scoring function and the resulting binding modes were post-processed using IFP45 and ranked according to their PLANTS and IFP score. In 85% of all f/pAGO and ANT/iAGO structures IFP scoring yielded equal or higher enrichment factors (EF1%) than ChemPLP-scoring (Figure 6.4, Figure S6.6). Using IFP to rank the docking poses of ligands of the same functional activity class gives a virtual screening enrichment that is on average two times as high as the enrichment obtained by ChemPLP-scoring. Moreover, in 90% of the structures IFP scoring yielded an equal or higher selectivity (higher retrieval of f/pAGO than ANT/iAGO over decoys for f/pAGO structures and vice versa) than when using ChemPLP scoring (Figure 6.4, Figure S6.6). The resulting EF1% values of f/pAGO and ANT/iAGO based on IFP similarity scoring (using the co-crystallized ligand reference IFP) are shown in Figure 6.4a. For 36 of the 43 protein-ligand complexes, the EF1% of ligands with the same functional effect as the co-crystallized ligand was higher than the enrichment of ligands with a different functional effect. Virtual screening runs against five (putative) ANT/iAGO bound receptor struc-tures, namely the arylpiperazine-19-bound β1R complexes (3ZPQ), the bucindolol-β2R complexes (4AMI), and one of the carvedilol-β2R complexes (4AMJ), resulted in a higher enrichment for p/fAGO ligands than for ANT/iAGO ligands. For the arylpiperazine-20-bound β1R structures392 (PDB-code 3ZPR) no enrichment was obtained for any of the known ligands (EF1% = 0). For 11 unique crystal structures (of the 14 structures with multiple monomers) a significant dif-ference in the EF1% is observed (ΔEF1% > 5) between the individual monomers, demonstrating the impact of subtle differences in the reference IFP and/or binding site structure in different monomers of the same crystal structure. For example, the EF1% for f/pAGO over decoys for chain A of structure 2Y02 is 84 compared to 52 for chain B, while the reference IFPs of both chains is identical. This can potentially be ascribed to the larger pocket volume of chain A (490 versus 389 Å3 for chain B, which is mainly due to the rotation of F3257.35), therefore the pocket of chain A is better able to accommodate the larger ligands. In contrast, the small pocket of the ligand-free β1R structure393 (4GPO) is not able to effectively accommodate ligands as D1213.32, F3076.52, and N3297.39 are moved inwards (Figure S6.6), thereby preventing the prediction of correct binding modes using rigid-protein docking simulations. Similarly, the small pocket of the active-state, adrenaline-bound β2R structure (4LDO103) is not able to accommodate the relatively bigger TM5/6 binding moieties of most ANT/iAGO and is therefore more selective for f/pAGO.Figure 6.3 shows that overall the ANT/iAGO ligand-protein complexes are more selective (av-erage EF1% of 27 and 4 for ANT/iAGO and f/pAGO respectively) than the f/pAGO complexes (average EF1% of 13 and 46 for ANT/iAGO and f/pAGO respectively). However, virtual screening against f/pAGO bound structures in many cases gives a high enrichment for both ANT/iAGO and (an even higher enrichment for) p/fAGO ligands. For example, virtual screening against chain A of the salbutamol-bound β1R structure (PDB-code 2Y04), results in a remarkably high enrich-ment factor for both f/pAGO and ANT/iAGO (EF1% of 72 and 32 respectively). To evaluate the individual contribution of the reference IFPs and the protein structure we re-scored all docked compounds in all structures with all reference IFPs (i.e. not only with the IFP that was derived from the corresponding receptor-ligand complex). This furthermore allowed us to rescore the docked compounds with IFP in the structures without a ligand (PDB-codes: 2R4R, 2R4S, 3KJ6, and 4GPO). The resulting scoring-matrices containing the EF1% of f/pAGO and ANT/iAGO are shown in Figure 6.4a and 6.4b respectively.
-
138
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
03
58
1013
1518
2023
2528
3033
3538
4043
4548
5053
5558
6063
6568
7073
7578
8083
8588
9093
9598
100
EF 1
% (f
/pA
GO
)
PLANTS
2RH1_A
2VT4_A-D
2YCW_A
2YCW_B
2YCX_A
2YCX_B
2YCY_A
2YCY_B
2YCZ_A-B
3D4S_A
3NY8_A
3NY9_A
3NYA_A
4AMI_A
4AMI_B
4AMJ_A
4AMJ_B
4BVN_A
4GBR_A
3ZPQ_A
3ZPQ_B
3ZPQ_dock
3ZPR_A
3ZPR_B
2Y00_A
2Y00_B
2Y01_A
2Y01_B
2Y02_A
2Y02_B
2Y03_A-B
2Y04_A
2Y04_B
3P0G_A
3PDS_A
3SN6_R
4LDE_A
4LDL_A
4LDO_A
Norepiphrine
2RH1
_Aβ2
80
80
00
88
08
40
012
2016
00
120
816
360
016
2012
1232
1656
4044
2028
4044
4452
482V
T4_A
β18
00
40
04
40
00
120
08
200
00
012
1628
00
128
812
2424
4452
4824
80
1216
5256
2VT4
_Bβ1
80
04
00
00
00
00
40
2424
88
40
164
240
016
2828
2424
3636
4036
2836
440
4432
442 V
T4_C
β112
00
40
00
00
04
00
424
2412
120
08
428
00
1216
1624
3632
4444
3632
248
4836
5244
2VT4
_Dβ1
80
00
04
88
00
012
00
1628
04
80
812
160
00
00
812
1648
4852
124
44
1656
602Y
CW_A
β120
00
00
44
80
84
120
424
124
420
04
1220
00
1216
1216
3216
6464
6036
1624
4436
3272
2YCW
_Bβ1
160
80
00
412
08
412
012
208
88
200
824
320
012
1620
1232
3272
5660
2416
816
4044
482Y
CX_A
β120
012
00
08
120
160
44
012
160
020
04
836
00
88
128
160
4444
368
164
1616
4048
2YCX
_Bβ1
280
80
00
88
08
48
04
812
44
160
412
440
012
1616
2020
844
4036
1612
828
2832
442 Y
CY_A
β116
00
00
00
00
00
00
012
2012
120
04
1216
00
424
164
2020
5244
4040
164
2828
4444
2YCY
_Bβ1
80
00
00
00
00
00
00
2816
00
00
84
40
08
812
1220
2044
4036
3616
828
3640
402Y
CZ_A
β18
00
00
00
00
00
00
016
160
80
08
412
00
80
412
128
4036
4020
40
1632
2044
2YCZ
_Bβ1
120
00
00
00
00
00
00
48
00
00
04
160
012
128
48
848
3636
2416
432
2020
443D
4S_A
β216
00
00
00
00
00
00
412
44
48
04
820
00
812
1216
4036
4040
4032
2412
4844
4036
3NY8
_Aβ2
820
00
124
00
00
00
00
2432
00
80
2020
320
012
128
436
1236
2432
168
848
5244
403 N
Y9_A
β212
08
04
00
80
80
00
420
160
08
08
820
00
812
88
3212
4036
3220
424
4848
3244
3NYA
_Aβ2
360
00
00
00
04
00
00
2816
44
80
44
240
012
1616
2420
2440
4036
248
1644
4440
404A
MI_A
β116
812
44
48
160
208
80
824
284
816
08
1220
00
1616
1620
3636
3644
3620
248
3240
2836
4AMI
_Bβ1
244
160
08
420
012
48
08
2828
04
80
1216
324
012
1216
2452
3656
5248
3228
832
2848
484A
MJ_A
β112
00
00
00
40
00
44
020
168
48
08
2436
00
2016
2028
4028
5632
4428
168
2428
5240
4AMJ
_Bβ1
120
00
00
44
00
40
00
44
128
40
1624
160
016
2416
1636
2464
4452
2820
420
3256
644 B
VN_A
β18
00
00
04
40
04
00
028
124
48
016
1216
00
1612
1216
3620
4840
4020
288
3628
5244
4GBR
_Aβ2
80
00
00
00
80
00
00
2028
44
40
88
160
04
44
04
1240
1624
1220
012
3224
403Z
PQ_A
β18
00
00
08
80
00
120
40
164
48
016
828
00
1620
1620
3220
5244
4024
288
2424
4840
3ZPQ
_Bβ1
124
00
00
88
04
08
04
124
04
40
168
160
016
1616
1644
3244
4836
1632
824
3256
443Z
PR_A
β112
00
04
08
40
00
120
816
244
40
08
1616
00
816
1616
412
5232
3612
160
020
4440
3ZPR
_Bβ1
120
80
00
128
08
84
04
84
40
00
128
80
020
1616
1212
840
4032
1620
08
2440
362 Y
00_A
β132
820
44
812
288
128
48
1212
2016
1612
012
824
00
1616
1616
7252
6464
6440
4412
6836
6460
2Y00
_Bβ1
288
248
44
820
412
412
04
1620
84
120
812
200
016
1616
2468
5652
5248
3648
2072
3260
562Y
01_A
β112
824
120
128
288
44
120
820
2012
416
48
1616
00
1216
1224
4852
5256
6024
3212
6424
4448
2Y01
_Bβ1
1212
164
04
012
44
812
00
2016
1212
124
1212
204
016
1212
2472
4436
5232
2840
2468
2040
362Y
02_A
β120
1216
44
48
2812
128
08
2012
1612
824
48
420
00
1620
1628
8456
6860
4840
4444
8060
6472
2Y02
_Bβ1
1620
1612
412
424
2412
124
412
2020
1624
164
48
284
012
2012
2080
5264
6868
4044
5284
5248
722 Y
03_A
β112
420
00
04
164
88
44
812
812
1216
08
412
00
1212
2020
6864
6468
6444
4016
8456
5268
2Y03
_Bβ1
3212
2412
48
412
124
200
012
168
1616
164
84
40
024
1620
2476
5256
4844
3640
2872
4068
562Y
04_A
β112
412
00
820
1612
120
40
416
1620
1220
04
1216
00
1620
1624
8052
6472
7236
4420
4440
8080
2Y04
_Bβ1
124
124
08
816
816
80
00
3216
124
204
816
440
016
1612
1636
3260
5644
2828
3240
3248
443P
0G_A
β244
012
00
48
164
1612
80
128
84
828
00
08
00
016
120
4440
4440
4036
3244
6440
4448
3PDS
_Aβ2
280
44
08
416
120
48
00
124
128
120
44
40
012
84
1248
3640
4840
2832
3660
4056
523S
N6_R
β248
1224
48
48
4012
248
124
832
284
2824
04
020
00
1220
1612
6032
6036
3644
4464
6432
6460
4 LDE
_Aβ2
164
40
04
416
40
124
40
2416
08
160
812
40
012
128
1676
4836
3636
4036
4872
3652
524L
DL_A
β216
40
80
84
120
012
40
012
324
08
08
48
00
08
88
6032
5240
4820
3648
5232
4452
4LDO
_Aβ2
284
40
04
48
40
84
00
128
88
40
44
40
012
208
2036
3636
5232
2428
2880
2448
404G
PO_A
β116
00
00
40
04
40
00
00
04
40
00
416
00
2012
2424
2832
820
820
368
324
1628
4GPO
_Bβ1
200
80
00
00
00
40
00
00
40
00
44
120
024
2028
2420
2024
2416
424
828
020
162 R
4R_A
β212
2032
4428
2416
3232
3228
128
1216
164
432
020
2044
164
40
48
2020
2020
2012
436
1212
2824
2R4S
_Aβ2
1212
3636
1624
2436
2436
208
04
2020
2020
168
2416
4432
80
120
1216
1616
164
128
4412
1620
123K
J6_A
β28
1620
2024
160
2012
2016
168
1216
164
48
416
2420
2416
44
84
48
1212
124
80
48
1220
IFP
Structure
Figure 6.4a Enrichment factors at a 1% false positive rate using all reference IFPs (columns) on all β-adren-ergic monomers for the retrieval of for f/pAGO over physicochemically similar decoys. The white to red and white to blue gradients as background color mark a low to high enrichment for f/pAGO (A) and ANT/iAGO respectively. Icons: fpAGO (red), ANT/iAGO (blue), or unvalidated ANT392 (cyan) with no or unknown signaling preference (circle), or β-arrestin biased ligands (triangle) (see Table 6.1).
-
139
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
PLANTS
2RH1_A
2VT4_A-D
2YCW_A
2YCW_B
2YCX_A
2YCX_B
2YCY_A
2YCY_B
2YCZ_A-B
3D4S_A
3NY8_A
3NY9_A
3NYA_A
4AMI_A
4AMI_B
4AMJ_A
4AMJ_B
4BVN_A
4GBR_A
3ZPQ_A
3ZPQ_B
3ZPQ_dock
3ZPR_A
3ZPR_B
2Y00_A
2Y00_B
2Y01_A
2Y01_B
2Y02_A
2Y02_B
2Y03_A-B
2Y04_A
2Y04_B
3P0G_A
3PDS_A
3SN6_R
4LDE_A
4LDL_A
4LDO_A
Norepiphrine
2RH1
_Aβ2
812
4840
836
3652
4448
3624
1244
128
48
520
164
204
84
84
420
824
2824
88
4424
816
02V
T4_A
β112
1248
368
3220
4436
4836
3628
3212
208
1640
04
020
40
48
44
2016
2036
288
1232
204
2812
2VT4
_Bβ1
124
4824
836
3248
4848
4040
1644
2028
88
440
120
160
44
44
412
1224
3636
128
2832
88
82V
T4_C
β112
2044
3216
2824
4440
4436
3232
3612
168
844
08
012
00
88
412
168
1636
404
420
3212
208
2VT4
_Dβ1
816
3628
1628
2432
4436
4028
1632
128
812
444
40
168
04
44
424
1220
3628
1612
2820
824
202Y
CW_A
β116
2428
3620
4024
4440
3628
3220
3616
1212
1652
412
012
04
412
44
2012
2432
2416
1224
2012
012
2YCW
_Bβ1
1216
4432
2040
2832
4444
4040
2040
1212
88
444
120
120
44
412
416
824
2424
1212
2820
1212
82 Y
CX_A
β116
1244
2420
2832
4444
4444
448
248
88
440
48
016
04
44
44
204
2424
244
416
248
2016
2YCX
_Bβ1
1212
4012
1616
2032
4440
3636
1220
412
84
364
40
160
04
48
816
816
2420
164
2416
1212
02Y
CY_A
β18
1252
4024
2828
4852
5636
4016
4016
44
452
44
012
00
412
44
128
432
2016
832
2812
128
2YCY
_Bβ1
436
4036
3224
3240
4056
2820
040
1212
88
440
80
200
44
44
416
816
3624
1612
2020
168
122Y
CZ_A
β18
1640
2812
2836
4036
3636
408
408
128
844
44
028
00
44
44
84
1632
368
412
1612
1212
2YCZ
_Bβ1
84
3620
824
2840
4436
2816
1240
816
44
360
40
120
04
40
412
420
3632
84
1624
820
163D
4S_A
β220
1260
2416
2832
4448
5236
3232
488
124
448
48
04
00
48
44
2012
424
244
844
284
164
3NY8
_Aβ2
836
3632
3232
2036
3236
3224
1232
812
48
324
84
200
08
44
412
412
424
44
3228
88
203N
Y9_A
β216
1644
2020
3632
3228
4428
2020
284
84
444
012
416
04
44
48
88
2024
204
440
244
168
3NYA
_Aβ2
2012
3612
816
2040
2836
3232
2028
08
88
320
80
40
04
44
44
420
2020
40
2832
128
204A
MI_A
β124
2432
2824
2816
3236
3224
2016
3212
1212
428
48
416
416
412
84
84
1224
164
828
128
812
4AMI
_Bβ1
1632
2428
2820
2428
2832
3232
1620
1216
84
284
124
2012
124
84
416
88
2020
48
2420
84
124A
MJ_A
β124
1648
3220
2420
3636
4432
2420
368
2012
840
44
012
00
88
84
44
2020
408
1220
164
208
4AMJ
_Bβ1
2824
4428
1632
2036
4444
3636
1644
88
84
524
204
120
84
48
48
1236
4844
1212
1612
1212
84B
VN_A
β112
432
2412
1628
3624
3236
2420
248
168
436
08
48
00
44
44
124
3232
324
1236
324
824
4GBR
_Aβ2
816
3616
1236
2028
2036
3212
832
816
84
2016
120
120
04
44
08
016
2024
80
128
80
43Z
PQ_A
β112
2028
3228
4020
3236
2840
1616
244
1212
424
84
48
04
412
1216
1216
2028
3612
012
248
84
3ZPQ
_Bβ1
1628
3632
2432
2036
3236
4824
2020
1616
88
204
84
40
04
124
1220
824
1220
88
2020
816
243Z
PR_A
β18
1632
3220
4020
2824
3228
2420
2812
124
432
88
44
00
48
44
80
820
244
84
44
44
3ZPR
_Bβ1
812
3628
1236
1620
2832
4024
812
412
84
164
124
80
04
44
48
012
1616
40
04
88
02Y
00_A
β112
2816
2824
1612
2420
1616
84
1612
1612
1212
08
412
04
812
48
168
824
2012
420
208
128
2Y00
_Bβ1
1612
2424
1212
424
2820
168
1220
120
124
204
40
00
44
44
420
1212
2812
812
2416
84
42Y
01_A
β18
1616
2412
1212
2424
2820
1216
248
412
824
44
04
416
48
44
128
1216
2012
424
168
84
2Y01
_Bβ1
1212
168
44
04
812
84
128
1620
48
40
00
40
44
84
412
88
2012
84
88
44
42Y
02_A
β18
3216
2820
164
1224
124
812
812
1212
420
164
04
04
48
48
2412
1620
1612
1228
328
48
2Y02
_Bβ1
1224
2828
1624
820
2420
2016
820
88
1212
124
00
84
84
44
412
412
2020
44
2020
44
82Y
03_A
β112
2024
2428
168
2428
2020
88
164
412
1224
80
04
44
44
44
1616
2024
248
812
2812
48
2Y03
_Bβ1
1228
2832
1616
816
2020
2012
1216
44
88
244
48
80
04
84
424
128
1216
84
2020
820
122Y
04_A
β18
1224
288
3220
2836
2424
3212
2020
164
428
84
04
44
48
44
128
2032
368
824
208
1216
2Y04
_Bβ1
1216
2424
1224
1624
3220
2420
1220
88
124
284
00
88
124
48
44
1216
3232
124
2012
80
43P
0G_A
β212
48
04
00
40
124
44
120
04
412
00
124
00
04
44
124
44
44
416
168
84
3PDS
_Aβ2
80
120
00
1212
1212
1616
812
84
44
160
00
00
04
40
412
816
1624
1212
2820
88
123S
N6_R
β212
820
40
124
1224
2012
1616
48
012
1620
08
08
44
44
44
44
416
88
412
84
48
4LDE
_Aβ2
124
2420
88
1228
2412
2428
1212
44
88
240
80
80
00
44
416
40
2016
84
2820
412
44L
DL_A
β212
824
208
124
2420
2012
84
88
120
1228
04
88
48
08
04
164
44
84
416
124
04
4LDO
_Aβ2
120
164
08
812
812
812
128
88
48
40
44
80
04
44
44
84
124
84
1616
412
04G
PO_A
β10
00
00
04
40
00
00
04
04
40
00
00
00
00
00
00
00
04
04
40
00
4GPO
_Bβ1
40
40
00
44
44
00
04
00
00
00
00
00
04
04
40
04
44
04
00
00
02R
4R_A
β28
2024
2824
2012
2420
2420
128
816
164
412
00
016
816
00
00
1616
168
124
44
416
04
2R4S
_Aβ2
1212
2832
2824
1228
3228
2812
1616
2424
44
200
124
812
164
84
432
3228
84
44
284
324
03K
J6_A
β212
424
2420
44
2412
2424
208
2416
164
416
08
208
04
88
1212
88
1624
244
1212
1212
44
03
58
1013
1518
2023
2528
3033
3538
4043
4548
5053
5558
6063
6568
7073
7578
8083
8588
9093
9598
100
EF 1
% (A
NT/
iAG
O)
IFP
Structure
Figure 6.4b Enrichment factors at a 1% false positive rate using all reference IFPs (columns) on all β-adren-ergic monomers for the retrieval of for ANT/iAGO over physicochemically similar decoys. The white to red and white to blue gradients as background color mark a low to high enrichment for f/pAGO (A) and ANT/iAGO respectively. Icons: fpAGO (red), ANT/iAGO (blue), or unvalidated ANT392 (cyan) with no or unknown signaling preference (circle), or β-arrestin biased ligands (triangle) (see Table 6.1).
-
140
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
In Figure 6.4a we see that also ANT/iAGO structures can be used to efficiently retrieve f/pAGO ligands when using the right reference IFP. This is surprising, as this contrasts the results of the previous VS studies in which the rotamers of the TM5 serines and/or whole TM5 had to be rotated for the efficient retrieval of f/pAGO ligands.162, 276, 404, 412 Interestingly, the isoproterenol IFP (PDB-code 2Y03) allows efficient retrieval of f/pAGO ligands in all ANT/iAGO structures with an average EF1% of 47. Ranking docking poses in f/pAGO structures with an ANT/iAGO reference IFP on the other hand does not yield high enrichments of ANT/iAGO ligands (with an average ANT/iAGO EF1% of 13). The highest average EF1% (21) was obtained by the cyanopindolol IFP (2YCY_B) in the f/pAGO structures. An unanticipated finding was that when pairing specific f/pAGO IFPs with ANT/iAGO structures also high EF1% values were obtained for the selective retrieval of ANT/iAGO. When using the IFPs of salbutamol (pAGO, PDB-code 2Y04) an average ANT/iAGO EF1% of 27 was obtained (Figure 6.4a/b). In short, these statistics emphasize that not only the choice of structure but also the choice of reference IFP has a high impact on the resulting enrichment factors and, more inter-estingly, on the ligand selectivity of the results.
6.2.3 Selective structure-based VS for ligands with the desired functional effect
Based on the highest EF1%s (Figure 6.3) we selected one f/pAGO and one ANT/iAGO ligand-pro-tein complex for a more detailed investigation. From all ANT/iAGO-bound complexes the cy-anopindolol-bound (iAGO) β1R structure 2VT4 (both chains A and B) obtained the highest ANT/iAGO EF1% value (48) and a f/pAGO EF1% value of 0. As both chains gave similar virtual screening enrichments (Figure 6.4b), we randomly selected chain A of 2VT4 (Figure 6.5b) for further in-vestigation. From all f/pAGO complexes chain A of the carmoterol (fAGO) bound structure β1R structure 2Y02 (Figure 6.5a) obtained the highest f/pAGO EF1% value (84) and an ANT/iAGO EF1% value of 24. In figure 6.3b and 6.3c the individual ROC-plots are displayed for the retrieval of ANT/iAGO and f/pAGO over decoys by IFP-similarity scoring of docking poses generated in the 2VT4 and 2Y02 complexes using the reference IFPs of the corresponding co-crystallized ligands. Figures 6.3b/c show the high early and overall enrichment for ligands with the same functional effect of the co-crystallized ligand and much lower early and overall enrichment for ligands with a different functional effect.By rescoring all the docking poses of the compounds that were docked in all 48 structures with the reference IFPs from the two selected structures the impact of the reference IFP selection was investigated (Figure 6.5d/e). For the selected f/pAGO IFP (2Y02_A), 93% of all ligand-pro-tein complexes (excluding the 2Y02 structures) yielded an equal or higher f/pAGO EF1% than when using their own IFP, in 64% of all cases a lower ANT/iAGO EF1% and for almost all ligand-pro-tein complexes a higher f/pAGO EF1% than ANT/iAGO EF1% (Figure 6.5d, 89% of all cases). When using the selected ANT/iAGO IFP (2VT4_A) instead of the IFP of the co-crystallized ligand to rescore all docked compounds in the ligand-protein complexes a higher ANT/iAGO EF1% (80% of all cases), a lower f/pAGO EF1% (51% of all cases) and a higher ANT/iAGO EF1% than f/pAGO EF1% (82% of all cases) was obtained (Figure 6.5e). In two cases the use of the selected ANT/iAGO IFP resulted in an even higher ANT/iAGO EF1% than with its own structure, namely in combination with the structure of cyanopindolol-bound β1R (EF1% 52, 2YCY_A) and timolol-bound β1R (EF1% 60, 3D4S). This is an increase of 4 and 24 in ANT/iAGO EF1% for the 2YCY_A and 3D4S structures respectively compared to the enrichment obtained with their native IFP. This was not observed for the selected f/pAGO IFP as the combination with its own structure (2Y02_A) resulted in
-
141
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
the highest f/pAGO EF1% (84). The results for one of the Fab-complexes (2R4S) were somewhat surprising as the f/pAGO EF1% in combination with the selected ANT/iAGO IFP were higher than with the selected f/pAGO IFP (36 and 16 respectively). Combining this with the fact that this structure is missing 23 of the binding pocket residues (vide supra), it was even more surprising that these enrichment factors can be obtained when docking in a structure with only half of the binding pocket. This does, however, match the success of previously published VS studies using homology models without ECL2160 and low-resolution homology models.122 Overall it is clear that the selection of the reference IFP is essential and can influence the enrichment as well as selec-tivity of the outcome of a VS study regardless of which structure was used.In order to separately quantify the impact of the structure on the enrichment values we studied the effect of rescoring all docking poses in the selected two structures with all 39 reference IFPs (Figure 6.5f/g). The results of rescoring the docking poses in the 2 selected structures with all reference IFPs show similar trends as rescoring with the reference IFP of the selected complexes (Figure 5d/e). For the selected f/pAGO structure (2Y02_A, Figure 6.5f) we see a shift in the results towards the f/pAGO and the opposite for when using the selected structure (2VT4_A, Figure 6.5g). The observed effect is, however, less pronounced. When we compare the obtained enrichment factors they are on average 38% lower for the selected structures than when using their IFP to rescore the docking poses. Moreover, with the right IFP the selectivity of a crystal structure the can be shifted towards ligands with another functional effect than the co-crystallized ligand (Figure 6.5d/e), the oppo-site (shifting the selectivity by using the right structure) is less frequent and less prominently observed. The selection of the crystal structure does, however, clearly influence the extent to which the IFP to able to shift this selectivity and also the final enrichment values. Combined these results indicate that the correct IFP is more crucial for obtaining selectivity as well as high enrichments than the structure, but they also indicate that obtaining the best results are obtained by finding the right combination of IFP and structure.
-
142
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
0 10 20 30 40 50 60
020
4060
8010
0
EF1% ANT/iAGO
EF1%
f/pA
GO
●●
●●●●
●
●●●
●
●
●
●●
●
●
● ●
●
●
●●
●
●●●
p/fAGOANT/iAGOANT (Not validated)
0 10 20 30 40 50 60
020
4060
8010
0
EF1% ANT/iAGO
EF1%
f/pA
GO
●
●●●●●
●●
●
●●●● ●●
●
● ●● ●●
●
●●
●●
● ●●
●
●●
●
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
0 10 20 30 40 50 60
020
4060
8010
0
EF1% ANT/iAGO
EF1%
f/pA
GO
●
●●
●
●
●●
●●● ●
●●
●●
●●
●
●
●
●
●
●
●●
●
●
●
●
● ●●
●
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
0 10 20 30 40 50 60
020
4060
8010
0
EF1% ANT/iAGO
EF1%
f/pA
GO
● ●●
● ●● ●
● ●●
●
● ● ●●
●●
●
●●
●
●●
●
●
●●●
p/fAGOANT/iAGOANT (Not validated)
e d
c
N6.55%
D3.32%
N7.39%
S5.46%
S5.42%S5.43%
N6.55%
D3.32%
N7.39%
S5.46%
S5.42%S5.43%
Agonist: 2Y02 (chain A) Antagonist: 2VT4 (chain A) a
2VT4 IFP in all structures 2Y02_A IFP in all structures
All IFPs in 2VT4_A structure All IFPs in 2Y02_A structure
4GPO_A
4GPO_A
4LDE
2YCZ
Ligand StructureCarmoterol 2Y02_chainAProcaterol 2Y02_chainA
Cyanopindolol 2VT4_chainAAlprenolol 2VT4_chainA
N7.39D3.32 S5.42 S5.43 S5.46 N6.55
b
g f
Figure 6.5 Analysis of VS runs using the IFPs (d-e) and structures (f-g) of the most f/p AGO selective (2Y02_A) and ANT/iAGO selective (2VT4_A) protein-ligand complexes (see Figure 3A) show that both the reference IFP and protein structure determine enrichment and functional selectivity of the VS study. The
-
143
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
binding modes of the selected f/pAGO (a, 2Y02_A) and selected ANT/iAGO (b, 2VT4_A) protein-ligand complex, a docked ligand with the same efficacy class (green carbon atoms) and their corresponding IFPs (c) for the displayed residues. EF1% results for agonists and antagonist versus the decoys when using the reference IFP of the selected f/pAGO (d) and selected ANT/iAGO (e) for scoring docked compounds in all 48 protein structures. EF1% results for f/pAGO and ANT/iAGO versus the decoys when using the 38 unique reference IFPs of all ligand-protein complexes for rescoring the docked compounds in the 2Y02_A (f) and 2VT4_A (g) structure. Icons: fpAGO (red), ANT/iAGO (blue), or unvalidated ANT392 (cyan) with no or un-known signaling preference (circle), or β-arrestin biased ligands (triangle) (see Table 6.1).
6.2.4 Selective structure-based virtual screening for partial/full agonists with a com-putationally predicted norepinephrine interaction fingerprint
From the f/pAGO and ANT/iAGO retrieval rates in all structures (Figure 6.4a/b), we can see that in particular the IFP of epinephrine (PDB-codes 4LDO) is able to selectively retrieve f/pAGO ligands in all structures (with an average f/pAGO and ANT/iAGO EF1% of 43.6 and 9.4 respectively). This small endogenous agonist makes all key interactions with D1133.32, N3127.39 and S2035.42 in β2R (Figure 6.2). Based on this observation we hypothesized that the IFP of an even smaller agonist, that could also make all these interactions, should also be highly effective for the retrieval of f/pAGO over ANT/iAGO in all structures. As the crystal structure of norepinephrine bound to β1R or β2R has not yet been resolved we docked norepinephrine and used the docking pose (Figure 6.6b) with the highest IFP-score (compared to the IFP of isoproterenol) to obtain the reference IFP for norepinephrine in β1R (Figure 6.2).Indeed, when using this computationally predicted IFP to rescore all docked compounds in all structures we were able to retrieve f/pAGO ligands with a high efficiency (the average f/pAGO and ANT/iAGO EF1% were 46 and 8 respectively, Figure 6.4a and 6.6). On average the obtained f/pAGO EF1% was almost twice as high compared to the 2Y02_A IFP (the IFP from the highest-scoring f/pAGO complex). It should be noted that this is mostly due to higher f/pAGO EF1% in the ANT/iAGO structures. At the same time a 20% lower ANT/iAGO EF1% was obtained compared to the 2Y02_A IFP. Moreover, the ANT/iAGO structure that obtained the highest f/pAGO EF1% with this IFP was a cyanopindolol-bound β1R structure (2YCW_A, Figure 6.6a), which coincides with the slight contraction of the pocket and the low f/pAGO RMSD as previously de-termined (Figure 6.1c). Interestingly, also in combination with the selected ANT/iAGO structure (cyanopindolol-bound 2VT4_A) the norepinephrine IFP was able to obtain a high and selective retrieval rate for f/pAGO (EF1% of 56 and 12 for f/pAGO and ANT/iAGO respectively, Figure 6.6c). The norepinephrine IFP combined with the 2Y02_A structure resulted in a slightly reduced f/pAGO EF1%, but at the same time reduced the ANT/iAGO EF1% thereby making the results much more selective (Figure 6.6d).By using the IFP from a small compound that makes all key interactions these key interactions are highly emphasized, yielding more selective results. Whenever a docking pose of a compound misses one or more of the key interactions present in the reference IFP this immediately results in a relatively low IFP score.
-
144
Chapter 6 - Structure-based prediction of GPCR-ligand function
6 6
0 10 20 30 40 50 60
020
4060
8010
0
EF1% ANT/iAGO
EF1%
f/pA
GO
●
●
●●
●
●
● ●● ●
●● ●
●●
● ●●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●●●
p/fAGOANT/iAGOANT (Not validated)Fab−complexLigand−free
0.1
0.5
1.0
5.0
10.0
50.0
100.0
020
4060
80100
2Y02_A
f/pAGOANT/iAGORandom
0.1
0.5
1.0
5.0
10.0
50.0
100.0
020
4060
80100
2VT4_A
f/pAGOANT/iAGORandom
b!
c!
1.0
1.2
1.4
1.6
1.8
2.0
020
4060
8010
0
FP−rate (%)
TP−r
ate
(%)
1.0
1.2
1.4
1.6
1.8
2.0
020
4060
8010
0
FP−rate (%)
TP−r
ate
(%)
a!
d!
1.0
1.2
1.4
1.6
1.8
2.0
020
4060
8010
0
FP−rate (%)
TP−r
ate
(%)
1.0
1.2
1.4
1.6
1.8
2.0
020
4060
8010
0
FP−rate (%)
TP−r
ate
(%)
N6.55%
D3.32%
N7.39%
S5.46%
S5.42%
S5.43%
2Y03_A
4LDE 4LDL
4LDO 4GPO_A
3KJ6_A
Figure 6.6 EF1% results for agonists and antagonists (a) when using the reference IFP of docked norepi-nephrine (Figure 6.2) in 2Y03_A (b). The individual ROC-plots for the retrieval of agonists (red curve) and antagonists (blue curve) in the selected ANT/iAGO structure 2VT4_A (c) and f/pAGO structure 2Y02_A (d). Icons: fpAGO (red), ANT/iAGO (blue), or unvalidated ANT392 (cyan) with no or unknown signaling preference (circle), or β-arrestin biased ligands (triangle) (see Table 6.1).
6.3 Biased agonists
G-protein-coupled receptors can signal via G-proteins as well as β-arrestins.441 So-called biased ligands can induce a preference of the receptor for one pathway over the other.407 An in-depth literature search learned that for 7 of the 19 co-crystallized ligands there is a record of biased signaling (Table 6.1). Amongst them are bucindolol and carvedilol, two β-blockers that promote signaling via the β-arrestin pathway.105 One should note that the signaling bias of compounds is not always consistently observed and reported (e.g. for cyanopindolol205, 430, 434, 435, carvedilol105, 430, 432, 435, dobutamine430, 436) as the effect seems to be dependent on assay type and condi-tions436, receptor-activation state437, and receptor subtype430 (β1 versus β2). For another 7 of the 19 co-crystallized compounds we consistently found one or more publications reporting that ligand signaling is not biased (Table 6.1). It should be noted that the 7 biased ligands are all, with the exception of alprenolol, relatively large compounds (Figure 6.1, Table 6.1) that stretch from the major pocket (between TM3, 4, 5, and 6) into the minor pocket389 (between TM1, 2, 3, and 7).To identify potential key interactions for β-arrestin biased ligand, we grouped all IFPs (Figure 6.2) of both the reported 7 biased and the reported 7 unbiased ligands. Subsequently, we calculated the difference in abundance for each interaction with each pocket residue (Table S6.8). This highlighted four interactions that had a high abundance (≥50%) and were unique for the biased ligands. Three of them are hydrophobic contacts, namely with L101/H932.64, D200/19245.51, and V326/I3097.36 and the fourth is an aromatic stacking interaction (face-to-face or face-to-edge) with W117/1093.28. Interestingly, this last interaction was present for all biased ligands, except for alprenolol. This could indicate that the signaling bias for these ligands is (partially) due to the aromatic stacking with this tryptophan residue. Moreover, isoproterenol is a full agonist with no signaling bias (it is even used as reference ligand442), but when a hydroxybenzyl moiety is at-tached the ligand (hydroxybenzylisoproterenol, 4LDL) makes an aromatic interaction with W3.28
-
145
Structure-based prediction of GPCR-ligand function - Chapter 6
6 6
and induces a β-arrestin signaling bias.395 This is also in line with mutation studies of the homol-ogous W3.28 in the muscarinic M2 receptor (also a member of the aminergic-GPCR family72) that changed the signaling preference of the receptor when stimulated by specific ligands.443 Also the contacts with L/H2.64, D45.51, a