Professional Documents
Culture Documents
Molecular Modelling in Structure Determination: Structure Calculation From NMR Data
Molecular Modelling in Structure Determination: Structure Calculation From NMR Data
NMR experiment
chemical shifts
Have molecule and some data on conformation...
short distances
Objectives:
internal angles
find conformation(s) satisfying experimental data
maintain likely (local) conformation orientation angles
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Compare conformation to data Distance information from NOE
7 8
N OEij rij
6 calibration factor Ccal unknown (not measurable)
estimate from reference distances
! N
# 16
derive a penalty function to measure difference between
"
r (x) = ra6 (x)
calculated and measured data: Edata
a=1
e.g., harmonic potential for distances (... not a good choice)
16
r0 (Ccal N OE)
N!
N OE
#2
ri (x) ri0
"
EN OE
N : number of possible assignments of peak i=1
J = A + Bcos() + Ccos(2)
= 60
3-bond coupling depends on torsion angle
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Parametrisation of from structures Residual dipolar couplings
13 14
Proteins:
direction of bond vectors (e.g., N-
H) can be determined
The parameters A B C need to be parametrized with known relative to coordinate system
(X-ray) structures attached to molecule
A B C tensor parameters
Typical value for 3JHNH
6.51 -1.76 1.60
6.41 -1.46 1.90 ! "
i j 3
6.98 -1.38 1.72 D res
3 Dax (3 cos () 1) + Drh sin () cos(2)
2 2
rij 2
Michael Nilges. Structure calculation from NMR data.
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Hybrid Energy Hybrid energy function
19 20
{
combine data and physical model of molecule force field
into one function (target function, hybrid energy function)
(Levitt)
force field
Ehybrid = Ephys + wdata Edata
guess wdata
2
N
!noe (ri (X) Li ) if r(X) < Li
Edata 0 if Li r(X) Ui
2
(ri (X) Ui ) if r(X) > Ui
i
potential form
sources of error weight in hybrid energy function
measurement
spin diffusion, internal dynamics
Ehybrid = Ephys + wdata Edata
loose upper and lower bounds
FBWH potential
flat bottom harmonic walls
no force between L and U
How to determine the weights Hybrid energy summary
25 26
{
Data potential needs to include parameter to treat
(unknown) noise
potential shape
force field
weights
The weight in the hybrid energy needs to be set by
{
(empirical) means
cross-validation
data experience
Bayesian analysis
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Energy minimisation
High energy
Simulated annealing barriers
Molecular dynamics to fold protein
Torsion angle dynamics
Standard
(Monte Carlo) minimisation
Distance geometry only "downhill"
Genetic algorithm
...
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Distance Geometry Minimisation by molecular dynamics
31 32
r
d 2 ri c
2 = rE
dt mi ri hybrid
"MD-simulated annealing"
Energy scaling
35 36
equivalence:
r mass/ energy/ temperature
d 2 ri c scaling
= rE
dt 2 mi ri hybrid
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Structure calculation with MD Structure calculation with MD
37 38
Difficult search
problem: many now:
degrees of freedom
20 s per structure on PC
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Probability and energy
43 44
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Example:
Ehybrid = Ephys (X) + wdata Edata (D, X)
Gaussian distribution of error for r,
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
1 2
Edata (r r(X))
2 2
The weight is related to the error in the data 1 1 2
LN(x0 , x, ) exp[ (log[x0 ] log[x]) ]
2 2 x0 2 2
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Lognormal distribution ...Life is LogNormal
49 50
Gaussian distribution
To calculate the joined probability from single probabilities, The hybrid energy function is negative logarithm of joint
multiply: probability
Minimum energy corresponds to maximum probability
Probability of a structure
Relative weight should depend on data quality
Posterior Probability likelihood story is incomplete (what about wdata?)
P (X|D, I) P (X|I)P (D|X, , I)
...
Probability of a structure likelihood
prior distribution P (X|D, I) P (X|I)P (D|X, , I)
prior distribution
Bayesian analysis:
kB T
wdata =
2RM S
BPTI, 0.62
Update iteratively during structure calculation
weight overall data quality IL4, 1.14
only possible for least squares-type potential IL8, 1.44
Minimizing hybrid energy corresponds to maximizing the 1. Introduction: relating data to structure
probability of a structure, given data and force field
2. Hybrid energy and treatment of errors
...if one knows the data quality, scale factors, ...
3. Minimisation of hybrid energy
Relative weights
4. Relation to probability theory
usually set by empirically (trial and error, experience, cross validation)
5. Problems with the minimisation approach: an alternative
Bayesian determination of weight possible
Relationship of error distribution and restraint potentials 6. Automated structure calculation
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Michael Nilges. Structure calculation from NMR data. Michael Nilges. Structure calculation from NMR data.
Structure by deduction: Inference instead of deduction
61 62
SH3 (Campbell):
150 NOEs from perdeuterated domain
sparse data set; standard structure calculation does not produce unique
fold
77