Professional Documents
Culture Documents
Docking MD
Docking MD
Docking MD
“Docking”
and
“Molecular Dynamics simulations”
TOC
Amino acids
23.02.2012 4
Wikimedia
Wikimedia
23.02.2012 5
Wikimedia
peptide: > 1 AA
oligopeptide: < 10 (other sources state 30)
polypeptide: > 10 AAs
protein: > 50 AAs
macropeptide: > 100 AAs
monopeptide: 1 AA
dipeptide: 2 AA
tripeptide: 3 AA
tetrapeptide: 4 AA
pentapeptide: 5 AA
hexapeptide: 6 AA
heptapentide: 7 AA
octapeptide: 8 AA
nonapeptide: 9 AA
decapeptide: 10 AA
undecapeptide: 11 AAs
...
icosapeptide: 20 AAs
tricontapeptide: 30 AAs
tetracontapeptide: 40 AAs
… however the exact definitions differ (and you do not need to learn them for
the examination of this lecture!)
Structure levels
Wikimedia
23.02.2012 9
And what
about the
size of
proteins
and AAs?
[Janeway]
~20x20x20 nm ~13x6x5 nm
2 more definitions:
/ / / / .
Let us try with this one …
- („induced fit“)
23.02.2012 15
23.02.2012 Bernhard
B h d Knapp
K 20
Ad 1) Search Algorithms used (for spatial space)
Systematic docking
- Brute Force
- Fragmentation
- Database
Heuristic docking
- Monte Carlo
- Genetic algorithms
- Tabu search
Simulations Docking
- Molecular Dynamics
- Gradient (Energy) Methods
“flexible ligand”
- only the ligand is considered als flexible, the receptor remains rigid
“full flexibility”
- computational very expensive
„The pose score is often a rough measure of the fit of a ligand into the
active site. The rank score is generally more complex and might attempt to
estimate binding energies.“
23.02.2012 25
23.02.2012
223
3.02.2012 Bernhard Knapp 28
… and what about the correctness and reliability?
Currently correct results are more or less restricted to the area where the
tools have been calibrated
e.g. for pMHC the area under the ROC is between 0.5 and 0.75 using
different substitution and scoring tools [Knapp, 2008]
But
"We have long known that there is nothing in biology which is
fundamentally inconsistent or incommensurable with mathematics,
chemistry, and physics. Biology long ago rejected vitalism. The only
information needed for life is provided by an organism's chemical
constituents. It is unlikely in the extreme that living systems cannot be
understood in terms of chemistry and physics.“ [Wan, 2008]
Example Autodock
“flexible ligand”
- only the ligand is considered als flexible, the receptor remains rigid
“full flexibility”
- computational very expensive
Simulated Annealing
Global min
Different solutions
23.02.2012 Bernhard Knapp 36
Autodock: sampling of spatial space (2/4)
simulated annealing (german “abkühlen”) procedure:
Hydrogen bounds,
weighted by angle t
Electrostatic forces
Torsion angles
Solvation effects
Autodock2007: unbound?
Extended
Compact
Bound
“’Is there really a case where a drug that’s on the market was designed by a
computer?’ When asked this, I invoke the professorial mantra (’All questions
are good questions.’), while sensing that the desired answer is ’no’. Then,
the inquisitor could go back to the lab with the reassurance that his or her
choice to avoid learning about computational chemistry remains wise.”
23.02.2012 48
Take home messages for the first part
bond length
ே್
1
ܸ = ܭ ܾ െ ܾ ଶ ܾ = ݎ = ݎ െ ݎ
2
ୀଵ
bond angle
ݎ ή ݎ
ߠ = ܽݏܿ ܿݎ
ݎ ή ݎ
torision
1
ܸఝ = ܭ1 + ܿ ݏ ߮
2 ఝ
ݎ × ݎ ή ݎ × ݎ
߮ = arc cos
ݎ × ݎ ή ݎ × ݎ
too far
away
[Shaw et al.]
too
close
23.02.2012 54
What does the „bond angle“ term really mean?
perfect
too big
[Shaw et al.]
too
small
23.02.2012 55
perfect
tilted
[Shaw et al.]
23.02.2012 56
Non bonded interactions
ே ே
1
ܸே = ܸ + ܸா
2
Coulomb
ݍ ݍ
ܸா =
ݎ
Lennard-Jones
ଵଶ
ߪ ߪ
ܸ = 4߳ െ
ݎ ݎ
[Shaw et al.]
23.02.2012 58
What does the „Lennard-Jones“ term really mean?
perfect
too far
away
[Shaw et al.]
too
close
23.02.2012 59
… and calculate the forces for each time step while adjusting the postions
Iterate …
and iterate …
and iterate …
and iterate …
and iterate …
[from wikipedia]
„The equations are solved simultaneously in small time steps. The system is
followed for some time, taking care that the temperature and pressure remain at
the required values, and the coordinates are written to an output file at regular
intervals. The coordinates as a function of time represent a trajectory of the
system.“ [Gromacs Manual]
Calculate forces
Move atoms
Increment time
Stop criterion
reached?
0 ns 20 ns
Example for
MD simulation
Limitations of MD simulations (2 of 2)
(on the basis of Gromacs)
Long range interactions are cutoff: only one image of each particle in
the periodic boundary conditions is considered => cutoff can not
exceed half the box size
Boundary conditions are unnatural: a lot of particles have vacuum as
neighbor to avoid that periodic boundary conditions are used.
=> Sometimes the system is influencing itself
Computational costs and runtime (3 months for 20 ns!)
Cumulative errors in numerical integration and limitation in floating
point representation
… a huge set of individual configurations over time. But what does this
agglomeration of single structures tell us?
RMSD
¦ r
N
1 2
RMSD i
X
riY
N i 1
Where N is the number of atoms, i is the current atom, rX is
the target structure and rY is the reference structure.
The RMSD over time (in this case rY is the first frame)
All frames:
Frame
with
highest
RMSD:
E.g.
RMSF
Next idea: fluctuation of a particular amino acid over time. Calculate the
“root mean square fluctuation”:
M
¦ r (t ri
1
)~
2
RMSFi i k
M k 1
How much of a certain area is exposed to the solvent (e.g. a amino acid
or a region)? Calculate the solvent accessible surface area
solvent
protein
(possible)
target
SASA
23.02.2012 87
Sousa SF, Fernades P, Ramos MJ. Protein-Ligand Docking Current Status and Future Challanges, Proteins
2006; 65:15-26.
A semiempirical free energy force field with charge-based desolvation. Huey,R., Morris,G.M., Olson,A.J., and
Goodsell,D.S. (2007). J Comput Chem. 28, 1145-1152.
Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. Morris,
G. M., Goodsell, D. S., Halliday, R. S., Huey, R., Hart, W. E., Belew, R. K., and Olson, A. J. J.Computational
Chemistry 19, 1639-1662. 1998.
Kitchen DB, Decornez H, Furr JR, Bajorath J. Docking and scoring in virtual screening for drug discovery:
methods and applications, Nat. Rev. Drug Discov. 2004; 3:935-949.
Dodson GG, Lane DP, Verma CS (2008) Molecular simulations of protein dynamics: new windows on
mechanisms in biology. EMBO Rep 9: 144-150.
Karplus M, Kuriyan J (2005) Molecular dynamics and protein function. Proc Natl Acad Sci U S A 102: 6679-
6685.
Hess B, Kutzner C, vanderSpoel D, Lindahl E. GROMACS 4: Algorithms for Highly Efficient, Load-Balanced,
and Scalable Molecular Simulation. J Chem Theory Comput 2008.