Professional Documents
Culture Documents
Lecture 6 Merged
Lecture 6 Merged
Lecture 6 Merged
• Problem statement
• Computational strategies
H06W8a: Medical image analysis •
•
Theory of mutual information
Implementation
Class 6: Image registration • Validation
• Applications
T? T!
• Combining information from multiple images acquired using different scanners or at • If T is known, the second image can be resampled such that it has the same size and
different time points requires their geometric relationship to be known, i.e. the dimensions as the first image (cfr ‘basic concepts’).
transformation T that maps 3D points in one image onto the anatomically corresponding
point in the other. • After resampling, voxels at identical positions in both images correspond to
anatomically identical points.
• The mapping T compensates for differences in patient positioning or scan plane selection.
• Information extracted from one image (e.g. object contours) can then be simply
• Unless prospective measures were taken prior to image acquisition, T is in general unknown projected onto the other image.
and needs to be recovered retrospectively from the image content itself.
• Images of the same subject: same modality (e.g. MR/MR), different modality (e.g. CT/MR,
PET/MR), rigid (no deformations) or non-rigid (in case of deformations)
– Follow-up (different imaging sessions)
– Motion compensation (same imaging session)
– Compounding / stitching of partially overlapping images (larger FOV)
• With traditional radiological examination of images printed as 2D slices on radiological film
(‘hard copy examination’), this problem is subjectively solved by the radiologist by looking – Fusing complementary information
for corresponding anatomical landmarks in corresponding images and by mentally • Images of different subjects: non-rigid registration to compensate for shape differences
constructing a 3D anatomical interpretation. – Atlas construction (‘mean shape’ templates)
• This is however rather tedious and not very accurate, as it is necessarily limited to – Atlas-based segmentation
identifying corresponding slices, without providing a true 3D registration solution. – Group analysis (e.g. patients vs controls)
• For computer-aided interpretation of the images displayed on a computer screen (‘soft
copy examination’) and for image analysis, a more formal 3D registration solution is
• Rigid / affine registration : more or less solved for many applications
required, using an algorithm that finds the optimal T according to a suitable criterion
• Non-rigid registration : different algorithms available, very much still research…
Application: brain image registration Application: multi-spectral analysis
MR/PET
Neuro-imaging using different MRI:
MR/CT alignment compensates for inter-scan head motion
T1 PD T2 FLAIR DTI
MR/MR
Time 1 Time 2
localisation detection
in CT in PET
Radiotherapy planning for treatment of prostate cancer: Radiotherapy planning for treatment of prostate cancer:
MR used for target delineation, CT for dose calculation MR used for target delineation, CT for dose calculation
MR
CT
MR
T?
• GEOMETRIC FEATURES:
– External marker based registration
uses specifically designed external markers (= fiducials)
– Point-based registration
aligns corresponding anatomical landmark points
– Surface-based registration
aligns corresponding object surfaces
è These all require some form of segmentation to identify corresponding
objects in the images
• INTENSITY FEATURES:
– Voxel-based registration
maximizes intensity similarity (unimodal, multimodal)
è These can be applied without the need for segmentation Application: stereotactic surgery planning using CT and MR. A frame with known geometry is fixed to
the patient’s skull and visualized in the images.
• Accuracy:
– Can be highly accurate (< 0.5 mm)
– Caveats:
• geometric distortion in MRI
localizer • marker segmentation & centroid localisation:
T Optimization problem:
pi qi qi T(pi) find T for which C(T) is minimal, i.e. for which the sum of squared distances between
corresponding points in both sets is smallest
èClosed form solution if T is rigid body (‘Procrustes’)
q2 T(p2)
d i2 =| qi - T ( pi ) |2 d2
q1 T(p3)
N
1 T(p1)
C=
N
å w .d
i =1
i i
2 d1 q3
d3
There can be errors in the indication of the points used for registration (e.g. manual
mistakes due to image ambiguities)
èFiducial localization error (FLE)
Hence, some point pairs may be less reliable landmarks than others. This can be
incorporated in the registration criterion C(T) by the weights wi.
èe.g. wi = 1/Var(FLEi) (rotation matrix)
èClosed form solution if T is rigid body (‘Procrustes’) (translation)
• Indentification of corresponding landmarks usually requires manual intervention, • Extract corresponding object surfaces from the images to be registered
although some strategies for automated extraction of anatomical landmarks or
• Find the transformation T that minimizes the distance between both
geometrically characteristic points have been developed for some applications.
surfaces, using a suitable distance measure
• Errors in the accurate 3-D localization of corresponding points in both images
propagate directly into the registration result. Such errors can be reduced by • In order to evaluate the distance between both surfaces, point
increasing the number of landmark points. correspondences need typically to be established (e.g. ‘closest points’)
Example: automatically extracted Example: automatically extracted
è Actual implementation depends on the surface representation
anatomical landmarks (M. Betke et al.) ‘corner’ points (K. Rohr et al.) • Should be able to deal with:
– partial overlap
– outliers
– local optima
dj yj yj
dj
xj xj
1) For each point xj on the transformed surface S1, find the closest point yj on S2
1) Construct a map D that gives the distance in each point to surface S2
2) Find the transformation that minimizes the distances dj= |T(xj)-yj|, excluding
è S2 is represented by in an implicit way as the zero-level set of D
outliers:
2) Find the transformation such that the mean value of D evaluated along the
transformed surface S1 is minimal
è No need to establish point correspondences explicitly
3) Update yj and iterate
è But: spatial quantisation introduces additional local minima
Surface-based registration Voxel-based registration
Automated extraction of corresponding surfaces ?
• Find the transformation T that maximizes intensity similarity between the two
images, assuming that similarity is maximal when the images are correctly aligned
• Using original intensity values or derived features (e.g. blurring, image gradients)
• Using all voxels or a subset thereof (depending on the application)
• Much less preprocessing needed than for point or surface-based registration
è much more suited for automation
• Maximize correlation:
Although the images are from the same modality, there may be significant
local intensity differences due to changes over time (e.g. lesion evolution) or
non-rigid deformations (not compensated for by affine registration),
which affect the robustness of the SSD and CC criterion…
å h(i , i ) = h (i )
i2
1 2 1 1
(sum over all rows yields histogram of I1)
å h(i , i ) = h (i )
i1
1 2 2 2
(sum over all columns yields histogram of I2)
å p(i , i ) = p (i )
i2
1 2 1 1
(sum over all rows yields marginal distribution I1)
a
b
å p(i , i ) = p (i )
1 2 2 2
(sum over all columns yields marginal distribution of I2)
I2 i1
I1 Unimodal: I1 Multimodal:
p(a,b) p(a,b)
è intensities a and b of corresponding è relationship between a and b is strongly
voxels p and q of registered images I1 and data dependent
I2 are likely to be similar
Only voxels in the region of overlap of both
a a images are considered è p(a,b) depends
è p(a,b) is clustered around diagonal
on T through varying correspondence (p,q)
b I2 b I2 and through varying region of overlap
(256 x 256 bins) (256 x 256 bins)
Histogram dispersion Histogram dispersion
Registered (correct T) Not registered (incorrect T)
The joint histogram changes with the registration transformation T
I1= MR
I2 = CT
I1 vs I1
Multimodal: dispersion is
data dependent
I2 I2 b I2 b I2
Interpretation Example
p(i1,i2)
I1 I2 p2(i2)
I1 = 0 I1 =1
HA(a), HB(a) : marginal entropy of A and B, respectively
I2 = 1 5/36 22/36 3/4
HAB(a) : joint entropy of A and B
IAB(a) : mutual information of A and B
I2 = 0 4/36 5/36 1/4
Example Example
p(i1,i2) p(i1,i2)
I1 I2 p2(i2) I1 I2 p2(i2)
I1 = 0 I1 =1 I1 = 0 I1 =1
1.2 1.1
s = 50 k = 0.001
1 1
s = 100 k = 0.002
original original + noise original original x inhomogeneity
s = 500 0.9 k = 0.004
0.8
0.8
0.6
0.7
0.4
0.6
0.2 0.5
translation translation
0 0.4
-10 -5 0 5 10 -10 -5 0 5 10
D log I = - k ||p - pc ||2
x translation (mm) x translation (mm)
k = 1e-3, 2e-3, 4e-3
Geometric distortion Limiting assumptions
0.9
0.8
0.7
translation 0.6
0.5
D x = k || p - pc ||2 -10 -5 0 5 10
k = 1e-4, 5e-4, 7.5e-4 x translation (mm)
• Nature of relationship between image intensities is assumed to be • Joint probability density can be estimated reliably …
spatially stationary in their region of overlap • This may be problematic if
• If not: additional joint histogram dispersion, not relevant for registration – the images have low resolution
• Example: severe intensity inhomogeneity in MRI (use of surface coils) – the region of overlap at registration is small
è should be rectified first prior to registration • If not: interpolation needed è interpolation artifacts ?
• Note: MI(I1(x),I2(T(x))) >< MI(I2(y),I1(T-1(y)))
èSampling of voxels in I1 or I2 respectively
èInterpolation in I2 or I1 respectively
èDifferent behavior, especially when T is a non-rigid transformation
(sampled values from I1/I2 are more or less fixed if the region of overlap is
more or less stationary, while interpolated values from I2/I1 may vary
significantly)
p
a = I1(p)
q
I2(q) ?
• Start with few samples initially to speed up the criterion evaluation (O(N) time)
• Add more samples as the registration proceeds to improve accuracy
• Alternatively: select (small) random set of samples at each iteration
è stochastic optimization procedure
Transformed voxel positions q = T(p) of voxels p in the floating image do not coincide
with voxels in the other image è need for interpolation in the reference image
• If the range of image A is [0,N1-1] and the range of B [0,N2-1], the joint
Nearest neighbour Linear Cubic, B-spline,... histogram H would have size N1 x N2
(order 0) (order 1) (higher order)
e.g. 1024x1024 (or more) for medical images.
q1 q2 q1 q2
• If the histogram is large, it will only be sparsely filled, such that small
w3 w4
q q changes in T affect many bins in H.
• Idea: distribute the contribution of each sample to the joint histogram over • Instead of computing an intensity value for q, distribute the contribution of
multiple bins to make this vary more smoothly with changes in the this sample to the joint histogram over multiple histogram bins
transformation parameters
• Density estimation using a Gaussian kernel function: q1 q2 Joint histogram
1
H (a, b) = å Gs (a - ai , b - bi ) w3 w4 b3 +w3
N i q
• The histogram H and hence the mutual information MI are a function of • Initialise translation to align image centers
the parameters of T,
• Initialise rotation/scaling to align corresponding image axes relative to the
e.g. for rigid body registration: MI = MI(tx,ty,tz,fx,fy,fz)
patient (right/left, anterior/posterior, inferior/superior)
• Optimization problem: find optimal values for the transformation • Exploit spatial information in the DICOM header (for images acquired in the
parameters such that MI is maximal same session on the same modality)
èiterative search
• Exploit prior knowledge, e.g. standardized imaging protocols
èstarting from some initial values X
Z Y
X
• Problem: avoid local optima …
X Z
Y Z Y
X
Y Z
0.6 TRI PV
0.5 0.964
0.876
The presence of local optima deteriorates registration robustness The presence of local optima deteriorates registration robustness
more with NN than with TRI or PV interpolation
-4
x 10
2
= global optimum
= local optimum Same registration
0.882 experiment, using 1
different interpolation
methods and starting from
different initial parameter NN
values TRI
0.881
PV
-0.5 -0.25 0 0.25 0.5 a = parameter vector
a* = optimal value for a 0
0 0.1 0.2 0.3 mm
particular interpolation | a - a* | degrees
type
Optimization strategies Gradient of mutual information
q1 PV interpolation q2
• Non-gradient methods
w3 w4
– heuristic search, Powell, simplex dw3 q dw4
• Gradient methods dq
w2 w1
– steepest descent dw2 dw1
– conjugate gradient, Quasi-Newton q4 q3
– Levenberg-Marquardt
• Multi-resolution, especially for non-rigid registration
èCourse global registration first, finer local registration next
èImproves convergence to correct optimum
For affine registration:
1 gradient evaluation ~ 12 criterion evaluations
Non-gradient Gradient CJG, SMP, LVM > POW, STD, QSN Important speedup
with same precision by multi-resolution approach
Steepest Conjugate Quasi- Levenberg-
Powell Simplex descent gradient Newton Marquardt Ne
350
250
300
200
250
150 200
150
100
100
50 50
search trajectory in 2-D parameter subspace of rigid-body registration
POW SMP STD CJG QSN LVM CJG, various multiresolution strategies
Reslice Moving window Linked cursor Retrospective Registration Evaluation Project (RREP)
(J.M.Fitzpatrick et al., 1996)
RREP study 1: CT to MR
5
- 7 patient datasets :
4
CT : 512 x 512, 28-34 slices
Error (mm)
0.65 x 0.65 x 4 mm 3
MR: axial PD, T1, T2 and rectified images
256 x 256, 20-26 slices 2
1.25 x 1.25 x 4 mm
1
- compared with stereotactic reference
- evaluated at 8 points near brain surface 0
Pd Pdr T1 T1r T2 T2r
Accuracy 12
RREP 1: PET / MR
- 7 patient datasets :
8
PET : 128 x 128, 15 slices
Error (mm)
2.59 x 2.59 x 8 mm 6
MR: axial PD, T1, T2 and rectified images
256 x 256, 20-26 slices 4
1.25 x 1.25 x 4 mm
2
- compared with stereotactic reference
- evaluated at 8 points near brain surface 0
Pd Pdr T1 T1r T2 T2r
Accuracy RREP 2 : CT / MR
4
RREP study 2: CT to MR
3.5
- 9 patient datasets : 3
2.5
Error (mm)
0.5
- compared with stereotactic reference
0
- evaluated at 8 points near brain surface MP PD T1 T2
Quality of gold standard ? Impact of geometric distortion ?
Reference Reference
Wrong Correct
Reference Reference
MI MI
Correct Wrong
MI MI
Compare Compare
(2D)
(3D)
Registration: find (A,t,c) such that I1(x) and I2(x’(x)) are most similar
Warping: given N point pairs (pi,qi), find (A,t,c) such that x’(pi) = qi, "i
èClosed form solution
èExtrapolates discrete point correspondences to whole spatial domain x
Regularization of non-rigid registration Regularization of non-rigid registration
(2) Smoothness imposed explicitly through penalty functions: (3) Physics-based regularization:
Registration criterion Regularization penalty
(similarity measure) (local smoothness constraint)
Elastic: µÑ2u + (l + µ )Ñ(Ñu ) + F (x, u ) = 0
Cost function: E(u) = -Esimilarity (u,I1,I2,) + g1.Epenalty1 + g2.Epenalty2 (u) + …
Viscous fluid: µÑ2v + (l + µ )Ñ(Ñv ) + F (x, u ) = 0
u: deformation field
v: deformation velocity field
F: external force field (similarity measure)
R = registration domain l,µ: material parameters
VR = volume of R
g = geometric transformation
J = Jacobian of g èequation of motion of an elastic / viscous material under the influence of a force F
J = determinant of J èF designed such that the deformation maximizes the registration criterion
èsolved numerically on the discrete voxel grid using finite difference approximations
è heuristic, introduces additional parameters, need for tuning, … èinterpolation needed to estimate the transformation at between-voxel locations
• Computational approaches:
– Mass-spring models
• Spring stiffness varies with tissue type
– Finite-element models
• Elasticity modulus varies spatially
– Voxel-based approaches
• Penalty term is function of local tissue type
• Tissue modeling
– Requires segmentation: indication of tissue type in each voxel ?
– Appropriate tissue-specific deformation models ?
Same patient,
different time point
u = Si wi ui
T1 PD T2 FLAIR DTI
Time 1 Time 2
localisation detection
in CT in PET
MR
CT
MR
Radiotherapy planning CT Radiotherapy follow-up CT Planning matched to follow-up
Spline-based representation
Local volume preserving constraint
depending on tissue type
Application: motion compensation Application: subtraction CTA
Pre-contrast CT Post-contrast CT
Affine Non-rigid
Application: assessment of alveolar bone cleft Application: assessment of alveolar bone cleft
graft from pre and post operative CT graft from pre and post operative CT
9 year old
child with Post-
innate bone operative
defect in assessment
lower jaw of graft
quality
10 days before intervention… …10 months after intervention Delineate cleft on pre-op image… …to identify graft on post-op image
image
matched
comparison
geometrical
model
ESP (anthropomorphic
spine phantom)
Histogram dispersion Model-guided measurements
Image contours
Accuracy validation
(by segmentation)
(by comparison)
Image intensity Image intensity Image intensity
Inter-subject registration:
automated brain segmentation Application: inter-subject matching
Patient MR scan
gray white
matter matter
image-
information
Tissue
segmentation
csf
other
a priori
knowledge
registration
Tissue atlas Intensity model
Inter-subject registration: Inter-subject registration:
atlas construction atlas-based segmentation
rigid registration Original Rigid Non-rigid
non-rigid registration
External markers: – not retrospective • Maximization of mutual information highly successful for affine image
e.g. stereotactic surgery registration
• MMI is very general, robust and accurate
Point based:
• MMI requires no segmentation or user intervention è completely
anatomical or – interactive
geometrical landmarks – correspondence? automated
• Highly suited for routine clinical use (already commercially available)
Surface based: – segmentation? • Non-rigid matching using MI is still an active area of research ...
objects – correspondence?
Voxel based:
intensity differences – unimodal
intensity correlation – linear relation
histogram dispersion ! mutual information
Next class