Professional Documents
Culture Documents
Pertemuan 7 Kimkom Teori Desain Obat Berbasis Ligan
Pertemuan 7 Kimkom Teori Desain Obat Berbasis Ligan
Virtual Screening
Val Gillet
University of Sheffield
Bio
• BA Natural Sciences (Chemistry) – Cambridge
University
• MSc Information Science – Sheffield University
• PhD – Chemoinformatics – Sheffield University
• Postdoc – De Novo Design - Leeds University
• Currently Professor of Chemoinformatics –
Sheffield
• Areas of expertise:
− Similarity searching, structure-activity
relationships, pharmacophore mapping, de
novo design
− Machine learning methods, evolutionary
algorithms, graph theory, molecular
representations
The Drug Discovery Challenge
3D Structure of Target
Unknown Known
Ligand-Based Structure-Based
Methods Methods
Actives known Actives and inactives known
3D Structure of Target
Unknown Known
Ligand-Based Structure-Based
Methods Methods
Actives known Actives and inactives known
O O
O O O
HO OH O O
O OH
• Similarity coefficient
− A quantitative measure of similarity between two sets of
molecular descriptors
• Each bit in the bit string (binary vector) represents one molecular
fragment. Typical length is ~1000 bits
• The bit string for a molecule records the presence (“1”) or absence
(“0”) of each fragment in the molecule
• Originally developed for speeding up substructure search
− for a query substructure to be present in a database molecule each bit set to “1”
in the query must also be set to “1” in the database structure
• Similarity is based on determining the number of bits that are
common to two structures
Example fragments
C
N N
C C C
C C C
C
C C C
C C C C
C
d. Ring Composition e. Ring Fusion f. Atom Pair
N rs C rd C rs C rs C rs XX3 XX3 XX3 XX2 XX2 N 0;3 - 2 - C 0;3
N N
C C
C C C
C O O O O
C
SIM RD
R D C
Daylight fingerprints; HO O
OH
Tanimoto similarities Morphine
N N
N
O O
O O O
OH O O
O
Cyclooxygenase inhibitors
Bohm, Flohr & Stahl, Scaffold hopping. Drug Discovery Today: Technologies, 2004, 1, 217-224
Pharmacophore Vectors:
Similog
• Similog keys 0100
• Atom typing scheme based on four O
6
properties: hydrogen-bond donor,
hydrogen-bond acceptor, bulkiness
and electropositivity
• Atom triplets of strings encoding 0010
absence and presence of 6
properties, plus distance encoding O
form a DABE key
4 O
• Vector contains a count for each of H
the 8031 possible DABE keys 1100
0010-4-1100-6-0100-6-
Schuffenauer et al. Similarity metrics for ligands reflecting the similarity of target proteins
Journal of Chemical Information and Computer Sciences, 2003, 43, 391-405
Reduced Graphs
• Alignment independent
− Fingerprint approaches
• Alignment-based
− Field-based and surface-based methods
• No consensus as to the most effective method
3D fingerprints
3D Structure of Target
Unknown Known
Ligand-Based Structure-Based
Methods Methods
Actives known Actives and inactives known
Glossary of terms used in Medicinal Chemistry (IUPAC Recommendations 1998) Pure & Appl.
Chem. 1998, 70(5), 1129-1143 http://dx.doi.org/10.1351/pac199870051129).
Example: Rimonabant
hydrophobic
feature hydrogen bond acceptor
(HBA) feature + projected
point
aromatic ring
feature + projected
point hydrophobic
feature
Rimonabant
b N O O
O
S
N
O N
O
O O N
O N
N O
O O
N N N O
O O O P O O
N O N
N
O N O P O
O N O
O N
N O P O
O
O O O
N O O
O O
Database searching
• Conformational search
− On-the-fly
− Ensemble of conformers
• Database search should be “compatible” with parameters used to
generate the pharmacophore
− The same pharmacophore feature definitions should be used to
describe the database structures as were used to generate the
pharmacophore
− The database should be generated using the same protocol as used to
generate the pharmacophore
− What tolerance should be used to allow a match?
• If two pharmacophore features are separated by 5Å what distance
range is acceptable: 4.5-5.5Å; 4-6Å?
• Should all tolerances be the same?
• What effect does this have on recall and precision?
− Can exclusion/inclusion volumes be used?
Pharmacophore-based VS: workflow
Select actives +
Select inactives/decoys Select
actives for validation compounds
for screening
Generate
Generate ‘compatible’
conformers validation Filter (availability,
database Generate/select properties,
‘compatible’ novelty, visually
compound inspect
Generate database mappings,…)
(Modify) Validation 2:
pharmacophore Search validation
database – enrichment, Virtual screening
models
specificity, sensitivity?
Perform
search/mapping(s)
Validation 1:
Map actives back on
pharmacophore Prioritise/select
pharmacophore
model(s)
Example - Cannabinoid CB1 receptor
antagonists
• No CB1 crystal structure, only very limited
success with homology models Rimonabant
• Aim was to assay 420 compounds selected using
a pharmacophore model
− 8 CB1 selective antagonists/inverse agonists were
selected from the literature including rimonabant
− A maximum of 250 unique conformations were
generated for each molecule (with Macromodel using
the MMFF94s force field)
Cannabinoid
− Pharmacophores were generated with Catalyst. Receptor 1 (CB1)
− The model that yielded the most reasonable mapping antagonist
pharmacophore
for Rimonabant was selected for the database search
− The database contained about 500k compounds (max.
of 150 conf. per molecule, generated with Catalyst)
Rimonabant
Cannabinoid Receptor 1
(CB1) antagonist
pharmacophore
H. Wang et al. J. Med. Chem. 2008, 51, 2439-2446
(http://dx.doi.org/10.1021/jm701519h)
(Commercial) software
Examples (by no means comprehensive):
Software Source Recent published use cases
Catalyst (Discovery Accelrys http://dx.doi.org/10.1007/s00894-011-1105-5
Studio) http://dx.doi.org/10.1016/j.bmcl.2010.12.131
GASP Tripos http://dx.doi.org/10.1016/j.jmgm.2010.02.004
GALAHAD Tripos http://dx.doi.org/10.1016/j.bmc.2011.09.016
http://dx.doi.org/10.1016/j.ejmech.2010.09.012
Ligandscout Inte:ligand http://dx.doi.org/10.1016/j.eplepsyres.2011.08.0
16
MOE Chemical http://dx.doi.org/10.1007/s10822-011-9442-0
Computing http://dx.doi.org/10.1016/j.ejmech.2010.07.020
Group
Phase Schrödinger http://10.1111/j.1747-0285.2011.01130.x
http://cs-
test.ias.ac.in/cs/Volumes/100/12/1847.pdf
Some references for pharmacophores
• A. R. Leach, V. J. Gillet, R. A. Lewis, R. Taylor Three-Dimensional Pharmacophore
Methods in Drug Discovery J. Med. Chem. 2010, 53, 539-558
(http://dx.doi.org/10.1021/jm900817u)
• F. Caporuscio, A. Tafi Pharmacophore Modelling: A Forty Year Old Approach and its
Modern Synergies Curr. Med. Chem. 2011, 18, 2543-2553
3D Structure of Target
Unknown Known
Ligand-Based Structure-Based
Methods Methods
Actives known Actives and inactives known
3D Structure of Target
Unknown Known
Ligand-Based Structure-Based
Methods Methods
Actives known Actives and inactives known
two points NH
O O
S
N O
H
NH
DOCK
O
O
S
• Ligand atoms are matched to N
H
sphere centres so that O
NH
N
DOCK
• Many different mappings (poses) are possible
• Each pose is scored based on goodness of fit
• Highest scoring pose is presented to the user
Exploring conformational space
of ligands
• Ensemble of conformations
− A series of conformations is generated before docking
− Each conformer is docked in turn as a rigid body
− FLOG (variant on DOCK)
− Glide, FRED: often use filters and approximations to identify
conformations of interest
• Ligand torsions
O O N
O
1 H 1 2
3 H
N H
Ligand 2
2
N H
1 Protein
O
1
O 4
N O
2 H
7 H
5 H H 4 3
N N
H
6
Hydrogens
Acceptors
Flexible Docking: FlexX
• Incremental construction: flexible ligand; rigid protein
− The conformation of the ligand is constructed step-wise within
the active site
− The ligand is broken down into fragments
− Base fragments of ligand are docked first
− A systematic conformational search of the ligand is carried out as
each new fragment is added in all possible ways
− The protein binding site is used to prune the search tree
N
N
N N
O O
OH OH
Fragment-based docking: FlexX
Interaction model:
Interaction centre of first group lies
approximately on interaction surface
of second group.
B. Kramer et al.
“Ligand Docking and Screening
with FlexX”, Med. Chem. Res.
1999, 9, 463-478
• GOLD Score
− Protein-ligand hydrogen bond energy S(hb_ext)
− Protein-ligand van der Waals (vdw) energy S(vdw_ext)
− Ligand internal energy S(int)
Scoring Functions: II
• Empirical
− Böhm J. Comput. Aided Mol. Design 8 (1994) 243-256
RMSD N
N
Root Mean Square Deviation
Evaluating a docking program
The GOLD result (dark) superimposed on the Xray structure
(light)
1CIN: Wrong
Fatty acid binding protein
GOLD: Validation
• GOLD validation
− 305 complexes found in PDB (CCDC/Astex dataset)
− ligand extracted from complex
− ligand minimised
− docked back to protein
− GOLD prediction compared with original crystal structure
HO O
Enol Ketone
• Conformations
− Need to ensure sufficient sampling of conformational space has
been carried out
− Can we be sure the bioactive conformation has been generated?
− May want to apply filtering techniques to prune unlikely
candidates prior to carrying out the docking
Current Status of Docking: 1
• Most docking programs take account of conformational flexibility of
the ligand but very flexible ligands are still difficult
• Some protein-ligand interactions occur via a water molecules
− Can switch waters on and off in the binding site but usually based on
positions seen in the x-ray structure
• Some docking programs allow protein side chain flexibility
− Full protein flexibility cannot yet be handled except by molecular
dynamics with is extremely computationally demanding
• Scoring functions
− Reasonably good at finding the correct pose for a given protein-ligand
complex
− Less good at ranking different ligands against the same protein (virtual
screening)
• Variety of different post-processing procedures are available to help
reorder the output
Current Status of Docking: 2
• Despite its limitations docking is very widely used and there are
many success stories
− see Kolb et al. Curr. Opin. Biotech., 2009, 20, 429, and
Waszkowycz et al., WIREs Comp Mol. Sci., 2011, 1, 229)
• Performance varies from target to target, and scoring function to
scoring function
− See for example, Plewczynski et al, “Can we trust docking
results? Evaluation of seven commonly used programs on
PDBbind database”, J. Comp. Chem., 2011, 32, 742.
• Care needs to be taken when preparing both the protein and the
ligands
• The more information you have (and use!), the better your chances
− Targeted library, docking constraints, filtering poses, seeding
with known actives, comparing with known crystal poses
Conclusions
• Wide range of virtual screening techniques have been
developed
• The performance of different methods varies on
different datasets
• Increased complexity in descriptors and method does
not necessarily lead to greater success
• Combining different approaches can lead to improved
results
• Computational filters should be applied to remove
undesirable compounds from further consideration
Some more references
• Ripphausen et al. (2010) Quo vadis, virtual screening? A comprehensive
review of prospective applications. Journal of Medicinal Chemistry, 53,
8461-8467.
• Scior et al. (2012) Recognizing pitfalls in virtual screening: a critical review.
Journal of Chemical Information and Modeling, 52, 867-881
• Sotriffer (Ed) Virtual Screening. Principles, Challenges and Practical
Guidelines. Wiley-VCH, 2011.
• Varnek A, Baskin I. Machine Learning Methods for Property Prediction in
Chemoinformatics: Quo Vadis? Journal of Chemical Information and
Modeling 2012, 52, 1413−1437
• Hartenfeller, M.; Schneider, G. Enabling future drug discovery by de novo
design. Wiley Interdisciplinary Reviews-Computational Molecular Science
2011, 1, 742-759.