mark thompson planaria software seattle, wa
DESCRIPTION
Molecular Docking With ArgusLab. Mark Thompson Planaria Software Seattle, WA. http://www.planaria-software.com. Molecular Docking A complicated search problem. - PowerPoint PPT PresentationTRANSCRIPT
Mark ThompsonPlanaria Software
Seattle, WA
http://www.planaria-software.com
Molecular Docking With ArgusLab
Find the optimal ligand/protein configurations and accurately (at least consistently) predict their binding free energy without using formal statistical mechanics approaches.
Ligand is flexible.
Protein binding site is flexible (side chains and protein backbone).
Do this in under ~5 seconds on a commodity processor (ideally under 1 second or faster).
Molecular DockingMolecular DockingA complicated search problemA complicated search problem
We have developed two docking engines and an empirical scoring function in ArgusLab 4.0.
•ShapeDock: shape-based method, approximates exhaustive search.
•GADock: Lamarckian genetic algorithm similar to AutoDock.
•AScore: scoring function based on XScore of Wang and coworkers.
Typical ShapeDock times for ligands with 10-15 torsions are < 30 seconds on a 2.4 GHz Pentium laptop computer.
Our docking code is implemented for both interactive docking and screening
of ligand databases.
AbstractAbstract
Molecular modeling application runs on Windows platforms.
ArgusLab 4.0ArgusLab 4.0
1. Ligand is described as a torsion tree Nodes are groups of bonded atoms that do not have rotatable bonds; connections between nodes are torsions. Topology of tree is crucial to efficient docking. A balanced tree with a large central node is best. 2. Construct two grids that overlay the binding siteGrid points marked as inside or outside the free volume of binding site. Fine grid used to determine if atoms of a pose fragment are inside or outside the binding site. Coarse grid is used to establish the search points inside the binding site. 3. At each “search point of interest”Ligand’s root node is placed on a search point and a set of diverse and energetically favorable rotations is created. Translations near the search point are allowed to remove bumps with the target.
ShapeDock ShapeDock Approximates an exhaustive searchApproximates an exhaustive search
((similarities to Fred, Dock, and Glide)similarities to Fred, Dock, and Glide)
4. For each rotation, construct the torsions in breadth-first order Use pre-defined torsion values based on bond order of central bond. Use fine grid to test newly added atoms for bumps with protein and intra-ligand contacts to accept or reject pose fragment. Allow small torsion adjustments to minimize bumps. (rings are treated as rigid)
5. Score pose candidates Poses candidates are those that survive the torsion search. They are ranked to maintain set of N-lowest energy poses (N typically 50-150). Clustering poses as they are found maintains diversity in the final set. 6. Optimize the final set of poses
•Coarse minimization of all poses.•Re-cluster and rank using more aggressive cluster cutoff.•Minimize 25 lowest energy poses more aggressively.•Stochastic search of 25 lowest poses to find nearby minima.
ShapeDockShapeDock
Translation Rigid-body rotation Torsions
Population of individuals Fitness of each is docking
score Each generation:
Select breeding individuals Mutation Crossover Local minimization Elitism Check for convergence
Genetic Degrees of Freedom Search procedure
GADockGADockLamarckian Genetic AlgorithmLamarckian Genetic Algorithm
CompareCompare GADock & ShapeDock GADock & ShapeDock
Robust & General Slow, hard to define
convergence Not reproducible
(Stochastic) Can get caught in a local
minima
Some ligand/binding site types still cause problems
Fast! Reproducible Formally explores all
minima
GADock ShapeDock
LigandHydrogens addedHybridization and AScore atom types assigned
Target
Crystal waters remainHybridization and AScore atom types assigned
Miscellaneous
Atom charges not required All steps are done automatically inside ArgusLab (no user intervention required). However, manual modifications to above may be done if desired. Running the docking calculation: Select the ligand and binding site. Accept default parameters (grid size and resolution) or modify them. Run the docking.
Sample Preparation and RunSample Preparation and Run
2.4 GHz Pentium(R) 4 Dell Inspiron laptop
ShapeDock: Typical TimingsShapeDock: Typical Timings
Target Ligand Torsions Time (sec)
1HPV VX478 14 21 1HVR XK263 8 16 4DFR Methotrexate 9 3 1IEP Gleevec 7 11
1CBX Benzylsuccinate 5 5 1STP Biotin 5 3 3PTB Benzamidine 0 1
AScore AScore an empirical scoring functionan empirical scoring function
AScore is based on terms taken from the HPScore piece of XScore [1]
[1] “Further development and validation of empirical scoring functions for structure-based binding affinity prediction” Wang, R, Lai, L, and Wang, S. J. Comp. Aided Mol. Design 16, 11-26, 2002
Gbind = Gvdw + Ghydrophobic + GH-bond + GH-bond (chg) + Gdeformation + G0
Gvdw = CVDW VDW
Ghydrophobic = Chydrophobic HP
GH-bond = CH-bond HB
GH-bond (chg-chg & chg-neutral) = CH-bond(chg) HB
Gdeformation = Crotor RT
G0 = Cregression
AScoreAScore
Sum is over hydrophobic ligand-protein atom pairs f(dij) = 1.0 d < dij,0 + 0.5Å
= 2/3 (d0 + 2 – d) dij,0 + 0.5 Å < d <= dij,0 + 2.0 Å
= 0 d > dij,0 + 2.0 Å
HP = protein
jij
ligand
i
df
VDW =
ligand
i
ligand
ij ij
ij
ij
ijprotein
j ij
ij
ij
ijligand
i r
d
r
d
r
d
r
d4
0,
8
0,
4
0,
8
0, 22
dij,0 is sum of vdW radii of atoms i,j
intra-ligand VDW excludes 1-2, 1-3 bonded pairs.
AScoreAScore
RT =
RTi = 0 atom i not involved in any torsion.
= 0.5 atom i involved in 1 torsion. = 1.0 atom i involved in 2 torsions. = 0.5 atom i involved in > 2 torsions.
ligand
iiRT
HB = protein
j
ligand
i
HBij
HBij = f(rij) f(1,ij) f(2,ij)
rij distance between donor/acceptor atoms
1,ij angle between donor root-donor-acceptor
2,ij angle between donor-acceptor-acceptor root
Each term varies from 1.0 to 0.0 depending on how close to ideal value. Maximum number of H-bonds per donor/acceptor atom imposed.
AScore extends XScore to allow it to be used as the docking objective function.
Separate H-bond term involving charged donor and/or acceptor groups.
Max. number of H-bonds per donor/acceptor imposed by uniformly scaling total found to the maximum number allowed for any given ligand atom.
Ligand has hydrogens added.
Hydrogens included in the VDW term.
Crystal waters retained (but hydrogens not added). H-bonds with crystal waters treated as having ideal H-bond geometry but with a scaling factor fit to experiment.
H-bonds with target metals treated as ideal geometry, but with scaling factor fit to experiment.
SH treated as H-bond donor/acceptor, >S treated as H-bond acceptor.
Intra-ligand VDW energy included.
AScoreAScoreDifferences with XScoreDifferences with XScore
•Begin with the published XScore parameters.[1]
•Begin with Wang’s data set of 100 protein-ligand structures.[2]
•Remove incorrect structures to get a final training set of 84 structures:
39 hydrophilic, 20 hydrophobic, 25 mixed
•Modify H-bond parameters & other new parameters to improve correlation of score of x-ray
pose and experiment binding free.
[1] “Further development and validation of empirical scoring functions for structure-based binding affinity prediction” Wang, R, Lai, L, and Wang, S. J. Comp. Aided Mol. Design 16, 11-26, 2002
[2] “Comparative Evaluation of 11 Scoring Functions for Molecular Docking” Renxiao Wang, Yipin Lu, and Shaomeng Wang. J. Med. Chem. 2003, 46, 2287-2303
Structure Type
Correlation Gbind with Gexperiment
RMSD Binding Affinity
(kcal/mol)
Hydrophilic 0.53 2.3
Hydrophobic 0.84 2.0
Mixed 0.70 2.1
All Structures 0.70 2.2
Parameterization & ValidationParameterization & Validation(in progress)(in progress)
Dock the training set using the ShapeDock engine.
Structure Type
Correlation Gbind with Gexperiment
RMSD Binding Affinity
(kcal/mol)
Ave. RMSD(Å)
Hydrophilic 0.43 2.4 1.4
Hydrophobic 0.80 2.2 1.9
Mixed 0.61 2.4 1.7
All Structures
0.64 2.3 1.6
Parameterization & ValidationParameterization & Validation
[1] “The Effect of Small Changes in Protein Structure on Predicted Binding Modes of Known Inhibitors of Influenza Virus Neruaminidase: PMF-Scoring in Dock4” Ingo Muegge, Med. Chem. Res. 9, 1999, 490-500.
Trial Study: Trial Study: Influenza Virus NeuraminidaseInfluenza Virus Neuraminidase [1]
• Glycoprotein enzyme cleaves sialic acid residues from maturing virus particles. • Eleven conserved residues make up the binding site.• Dominated by H-bonding & charge-charge group interactions (e.g. carboxyl : guanidino)
DANA GANA
100,000 x increase in binding affinity
-10.2 kcal/mol -11.8 kcal/mol ~ 3x enhancement
Neuraminidase DockingsNeuraminidase DockingsShapeDockShapeDock
9 of the 10 structures reproduced the experimental binding mode.
Correlation of predicted and measured binding affinities
R2 = 0.70Ave. RMSD = 1.55 Angstroms
-12
-11
-10
-9
-10 -9 -8 -7 -6 -5 -4 -3 -2
log IC50
AS
co
re S
co
re (kcal
/mo
l)
• ShapeDock and GADock engines (IDockEngine interface, DockEngineFactory, etc).
• AScore scoring function with modifiable parameter set (IScore interface).
• Easy to make the ligand and binding site groups with one mouse click.
• Dock ligand as flexible, rigid, or using only selected torsions.
• Score current pose, optimize current pose, and full docking.
• Scoring function pre-evaluated on a scoring grid(s).
• Database docking supports SDF file as ligand database (IDataSource).
• Efficient reuse of scoring and docking grids allows user to interactively
modify ligand or choose new ligand and quickly dock new structures.
• Results summarized in external file and in a tree-view. User can click on
poses to view details.
Docking in ArgusLab 4.0Docking in ArgusLab 4.0
• 3D interactive molecule builder & viewer
• Computational experiments•QM: Extended Huckel, Semi-empirical (MNDO, AM1, PM3), ZINDO, and ab initio (via interface to Gaussian 98/03). •MM: Universal Force Field (UFF), CVFF, AMBER, custom force fields for research. Polarizable molecular mechanics, Rappe & Goddard’s charge equilibration scheme for UFF.•Geometry optimizations, electronic excited states, MD simulations, free-energy perturbation, and potential of mean force.•QM/MM and QM/MMpol.•Molecular Docking.
• Properties & misc. dipole moments, atom-charges, transition properties, surface properties, animate normal modes, view dock poses, ribbons, solvent-accessible surfaces, SCRF solvent effects,
explicit solvent, periodic boundary conditions, Ewald sums, etc.
• Manage/organize results: treeview tool for editing structures and viewing results. Results and structures can be saved in ArgusLab XML file.
ArgusLab CapabilitiesArgusLab Capabilities
• Multi-document interface, multi-threaded.
• Written in C++ (some old legacy C-code is wrapped in C++)
• Uses OpenGL for graphics, Win32 API for windowing system.
• Garbage collection for graphics objects, events, etc.
• Custom hash-tables & containers in addition to use of STL.
• Custom Model-View-Controller (MVC) transport layer.
• 3D editor built on a command processor model (support undo/redo).
Installed User Base
• ~20,000 downloads/licenses.
• Popular in university teaching programs and with students. (free )
• Used in several industrial settings.
Arguslab ArchitectureArguslab Architecture
Score the PDBbind DatabaseScore the PDBbind Database
Score the 786 structures from the PDBbind database[1] (14 incorrect structures were removed from the original 800 in database)
[1] “The PDBbind Database: Collection of Binding Affinities for Protein-Ligand Complexes with Known Three-Dimensional Structures” Renxiao Wang, Xueliang Fang, Yipin Lu, and Showmeng Wang. J. Med. Chem. 2004, 47, 2977-2980
PDBbind Database
Correlation Gbind with Gexperiment
RMSD Binding Affinity
(kcal/mol)
786 Structures 0.47 2.9