3D Sensing and Reconstruction

Readings: Ch 12: 12.5-6, Ch 13: 13.1-3, 13.9.4

• Perspective Geometry

• Camera Model

• Stereo Triangulation

• 3D Reconstruction by Space Carving


3D Shape from X
means getting 3D coordinates from different methods

Mainly research:
• shading
• silhouette
• texture

Used in practice:
• stereo
• light striping
• motion
Perspective Imaging Model: 1D

[Figure: a 3D object point B at camera coordinates (xc, zc) projects
through the camera lens O (the center of projection) to real image
point E on the real image plane, a focal distance f behind the lens,
and to a point xf on the front image plane, which we use.]

By similar triangles:

    xi / f = xc / zc
Perspective in 2D (Simplified)

[Figure: a 3D object point P = (xc, yc, zc) = (xw, yw, zw) projects along
a ray through the camera center F to image point P´ = (xi, yi, f) on the
image plane at focal distance f along the optical axis Zc.]

Here camera coordinates equal world coordinates:

    xi / f = xc / zc    so    xi = (f/zc) * xc
    yi / f = yc / zc    so    yi = (f/zc) * yc
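
To make the model concrete, here is a minimal numpy sketch of this pinhole projection; the function name and example values are illustrative, not from the slides.

```python
import numpy as np

def project_point(p_cam, f):
    """Pinhole projection of a camera-frame point (xc, yc, zc) to (xi, yi).

    Uses the similar-triangle relations xi = (f/zc)*xc, yi = (f/zc)*yc.
    """
    xc, yc, zc = p_cam
    if zc <= 0:
        raise ValueError("point must lie in front of the camera (zc > 0)")
    return (f / zc) * xc, (f / zc) * yc

# Example: a point 2 m in front of a camera with f = 0.05 m
print(project_point(np.array([0.4, 0.1, 2.0]), f=0.05))  # (0.01, 0.0025)
```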
3D from Stereo

[Figure: a 3D point projects to corresponding points in the left image
and the right image.]

disparity: the difference in image location of the same 3D point when
projected under perspective to two different cameras.

    d = xleft - xright
Depth Perception from Stereo
Simple Model: Parallel Optic Axes

[Figure: cameras L and R with parallel optic axes and focal length f,
separated by baseline b along the x-axis, view 3D point P = (x, z);
P projects to xl in the left image plane and xr in the right image
plane. The y-axis is perpendicular to the page.]

By similar triangles:

    z / f = x / xl        z / f = (x - b) / xr        z / f = y / yl = y / yr
Resultant Depth Calculation

For stereo cameras with parallel optical axes, focal length f,
baseline b, corresponding image points (xl, yl) and (xr, yr),
and disparity d:

    z = f*b / (xl - xr) = f*b / d

    x = xl*z / f   or   b + xr*z / f

    y = yl*z / f   or   yr*z / f

This method of determining depth from disparity is called triangulation.
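
A minimal sketch of this triangulation for the parallel-axis model; function and variable names are illustrative.

```python
def triangulate_parallel(xl, yl, xr, f, b):
    """3D point from a stereo match under the parallel-optic-axes model.

    xl, yl : left-image coordinates; xr : right-image x coordinate
    f : focal length; b : baseline
    """
    d = xl - xr                 # disparity
    if d == 0:
        raise ValueError("zero disparity: point at infinity")
    z = f * b / d               # z = f*b / (xl - xr)
    x = xl * z / f              # equivalently b + xr*z/f
    y = yl * z / f              # equivalently yr*z/f
    return x, y, z

# Example: f = 500 pixels, b = 0.1 m, disparity 20 pixels -> z = 2.5 m
print(triangulate_parallel(xl=120, yl=40, xr=100, f=500, b=0.1))
```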
Finding Correspondences

• If the correspondence is correct, triangulation works VERY well.

• But correspondence finding is not perfectly solved.
  (What methods have we studied?)

• For some very specific applications, it can be solved for those
  specific kinds of images, e.g. the windshield of a car.
3 Main Matching Methods

1. Cross correlation using small windows. (dense; sketched below)

2. Symbolic feature matching, usually using segments/corners. (sparse)

3. Use the newer interest operators, e.g. SIFT. (sparse)
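
As a concrete illustration of method 1, here is a minimal sketch of window matching by normalized cross correlation along one scanline; it assumes a rectified image pair (so epipolar lines are image rows), and all names are illustrative.

```python
import numpy as np

def ncc(a, b):
    """Normalized cross correlation of two equal-size windows."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    return (a * b).sum() / denom if denom > 0 else 0.0

def match_along_row(left, right, r, c, half=5, max_disp=64):
    """Best right-image column for left pixel (r, c); returns (col, disparity).

    Assumes rectified images, so the match lies on the same row, and that
    (r, c) is at least `half` pixels from the image border.
    """
    win_l = left[r - half:r + half + 1, c - half:c + half + 1]
    best_c, best_score = c, -1.0
    for d in range(max_disp + 1):        # search leftward in the right image
        cr = c - d
        if cr - half < 0:
            break
        win_r = right[r - half:r + half + 1, cr - half:cr + half + 1]
        score = ncc(win_l, win_r)
        if score > best_score:
            best_c, best_score = cr, score
    return best_c, c - best_c
```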


Epipolar Geometry Constraint:
1. Normal Pair of Images

[Figure: cameras C1 and C2, separated by baseline b, view 3D point P;
P projects to P1 and P2, and the epipolar plane through P, C1, and C2
cuts each image plane in an epipolar line.]

The epipolar plane cuts through the image plane(s), forming 2 epipolar
lines. The match for P1 (or P2) in the other image must lie on the same
epipolar line.
Epipolar Geometry:
General Case

[Figure: cameras C1 and C2 in general position view point P, which
projects to P1 = (x1, y1) and P2 = (x2, y2); e1 and e2 are the
epipoles.]
Constraints

1. Epipolar Constraint: matching points lie on corresponding
   epipolar lines.

2. Ordering Constraint: points usually appear in the same order
   along corresponding epipolar lines.

[Figure: points P and Q project in the same order onto the epipolar
lines of cameras C1 and C2.]
Structured Light

3D data can also be derived using

• a single camera

• a light source that can produce stripe(s) on the 3D object

[Figure: a light source projects a light stripe onto the 3D object,
which is viewed by a camera.]
Structured Light: 3D Computation

3D data can also be derived using

• a single camera

• a light source that can produce stripe(s) on the 3D object

[Figure: the camera at the origin (0,0,0) with focal length f images
the lit 3D point (x, y, z) at image location (x´, y´, f); the light
source sits at distance b along the x axis and projects its stripe at
angle θ.]

                  b
    [x y z] = ------------- [x´ y´ f]
              f cot θ - x´
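
A minimal numeric sketch of this stripe-based computation; it takes the light-plane angle θ in radians, and all names and values are illustrative.

```python
import math

def stripe_point(xp, yp, f, b, theta):
    """3D point (x, y, z) from one structured-light observation.

    xp, yp : image coordinates (x´, y´); f : focal length
    b : camera-to-projector baseline; theta : light-plane angle (radians)
    Implements [x y z] = b / (f*cot(theta) - x´) * [x´ y´ f].
    """
    denom = f / math.tan(theta) - xp        # f cot(theta) - x´
    if abs(denom) < 1e-12:
        raise ValueError("degenerate geometry: ray parallel to light plane")
    s = b / denom
    return s * xp, s * yp, s * f

# Example: f = 500, b = 0.2, stripe plane at 60 degrees
print(stripe_point(xp=100, yp=50, f=500, b=0.2, theta=math.radians(60)))
```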
Depth from Multiple Light Stripes

[Figure: objects covered by multiple projected light stripes.]
What are these objects?
Our (former) System:
4-Camera Light-Striping Stereo

[Figure: four cameras and a projector positioned above a rotation
table holding the 3D object.]
Camera Model: Recall there are 5
Different Frames of Reference

• Object

• World

• Camera

• Real Image

• Pixel Image

[Figure: a pyramid object with its object frame A; the world frame W
(xw, yw, zw); the camera frame C (xc, yc, zc); the real image frame
(xf, yf); and the pixel image frame (xp, yp, zp).]
The Camera Model

How do we get an image point IP from a world point P?

    | s IPr |   | c11 c12 c13 c14 |   | Px |
    | s IPc | = | c21 c22 c23 c24 | * | Py |
    | s     |   | c31 c32 c33  1  |   | Pz |
                                      | 1  |

      image       camera matrix C      world
      point                            point
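
A minimal sketch of applying a camera matrix: multiply the homogeneous world point by C, then divide by the scale s to recover the image row and column. The matrix entries below are made up for illustration.

```python
import numpy as np

def apply_camera_matrix(C, P_world):
    """Project a 3D world point through a 3x4 camera matrix C.

    Returns the image point (IPr, IPc) after dividing out the scale s.
    """
    P_h = np.append(P_world, 1.0)            # homogeneous [Px, Py, Pz, 1]
    s_IPr, s_IPc, s = C @ P_h
    return s_IPr / s, s_IPc / s

# Illustrative camera matrix (c34 normalized to 1, as on the slide)
C = np.array([[250.0, 0.0, 120.0, 40.0],
              [0.0, 250.0, 80.0, 20.0],
              [0.0, 0.0, 0.5, 1.0]])
print(apply_camera_matrix(C, np.array([1.0, 2.0, 4.0])))
```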
What’s in C?

The camera model handles the rigid body transformation from world
coordinates to camera coordinates plus the perspective transformation
to image coordinates.

1. CP = T R WP        Why is there not a scale factor here?

2. FP = Π(f) CP

    | s FPx |   | 1  0   0   0 |   | CPx |
    | s FPy | = | 0  1   0   0 | * | CPy |
    | s FPz |   | 0  0   1   0 |   | CPz |
    | s     |   | 0  0  1/f  0 |   |  1  |

      image       perspective          3D point in
      point       transformation       camera coordinates
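
A small numeric check of the perspective matrix: the fourth row makes s = CPz / f, so dividing through gives FPx = f*CPx / CPz, matching the 1D model earlier. Values below are illustrative.

```python
import numpy as np

def perspective_matrix(f):
    """The perspective transformation matrix from the slide."""
    return np.array([[1.0, 0.0, 0.0, 0.0],
                     [0.0, 1.0, 0.0, 0.0],
                     [0.0, 0.0, 1.0, 0.0],
                     [0.0, 0.0, 1.0 / f, 0.0]])

f = 0.05
CP = np.array([0.4, 0.1, 2.0, 1.0])      # homogeneous camera-frame point
s_FP = perspective_matrix(f) @ CP        # [s*FPx, s*FPy, s*FPz, s]
FP = s_FP[:3] / s_FP[3]                  # divide out s = CPz / f
print(FP)                                # [0.01  0.0025  0.05]; FPz equals f
```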
Camera Calibration

• In order to work in 3D, we need to know the parameters of the
  particular camera setup.

• Solving for the camera parameters is called calibration.

• intrinsic parameters are of the camera device

• extrinsic parameters are where the camera sits in the world

[Figure: camera frame C (xc, yc, zc) posed relative to world frame
W (xw, yw, zw).]
Intrinsic Parameters

[Figure: image plane at distance f from camera center C; the optical
axis meets the image at the principal point (u0, v0).]

• principal point (u0, v0)

• scale factors (dx, dy)

• aspect ratio distortion factor

• focal length f

• lens distortion factor (models radial lens distortion)
Extrinsic Parameters

• translation parameters
  t = [tx ty tz]

• rotation matrix

        | r11 r12 r13 0 |
    R = | r21 r22 r23 0 |        Are there really
        | r31 r32 r33 0 |        nine parameters?
        |  0   0   0  1 |
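
No: the orthonormality constraints (R Rᵀ = I, det R = 1) leave only three independent parameters among the nine entries. A sketch building R from three rotation angles, using an illustrative Euler-angle convention the slides do not specify:

```python
import numpy as np

def rotation_from_euler(ax, ay, az):
    """Rotation matrix from rotations about the x, y, and z axes.

    Nine entries but only three independent parameters: rows and
    columns must be orthonormal.
    """
    cx, sx = np.cos(ax), np.sin(ax)
    cy, sy = np.cos(ay), np.sin(ay)
    cz, sz = np.cos(az), np.sin(az)
    Rx = np.array([[1, 0, 0], [0, cx, -sx], [0, sx, cx]])
    Ry = np.array([[cy, 0, sy], [0, 1, 0], [-sy, 0, cy]])
    Rz = np.array([[cz, -sz, 0], [sz, cz, 0], [0, 0, 1]])
    return Rz @ Ry @ Rx

R = rotation_from_euler(0.1, 0.2, 0.3)
print(np.allclose(R @ R.T, np.eye(3)))   # True: the orthonormality constraint
```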
Calibration Object

The idea is to snap images at different depths and get a lot of
2D-3D point correspondences.
The Tsai Procedure

• The Tsai procedure was developed by Roger Tsai at IBM Research
  and is the most widely used.

• Several images are taken of the calibration object, yielding point
  correspondences at different distances.

• Tsai’s algorithm requires n > 5 correspondences

    {((xi, yi, zi), (ui, vi)) | i = 1,…,n}

  between (real) image points and 3D points.

• Lots of details in Chapter 13.
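
Tsai’s full procedure (including its radial-distortion model) is more than a few lines, but the flavor of calibrating from 2D-3D correspondences can be sketched with the simpler linear least-squares fit of a camera matrix; this stand-in is not Tsai’s algorithm, and all names are illustrative.

```python
import numpy as np

def calibrate_linear(points_3d, points_2d):
    """Least-squares fit of the 11 unknowns of a 3x4 camera matrix
    (c34 fixed to 1) from n >= 6 correspondences (x, y, z) <-> (u, v).

    NOTE: this is the simple linear method, not Tsai's procedure,
    which also solves for radial lens distortion.
    """
    A, b = [], []
    for (x, y, z), (u, v) in zip(points_3d, points_2d):
        # u = (c11 x + c12 y + c13 z + c14) / (c31 x + c32 y + c33 z + 1)
        A.append([x, y, z, 1, 0, 0, 0, 0, -u * x, -u * y, -u * z])
        b.append(u)
        A.append([0, 0, 0, 0, x, y, z, 1, -v * x, -v * y, -v * z])
        b.append(v)
    c, *_ = np.linalg.lstsq(np.array(A), np.array(b), rcond=None)
    return np.append(c, 1.0).reshape(3, 4)   # rows c1*, c2*, c3*
```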


We use the camera parameters of each camera for general stereo.

[Figure: cameras with matrices B and C view 3D point P, which projects
to P1 = (r1, c1) in image 1 and P2 = (r2, c2) in image 2; e1 and e2
are the epipoles.]
For a correspondence (r1,c1) in image 1 to (r2,c2) in image 2:

1. Both cameras were calibrated, so both camera matrices B and C are
   then known. From the two camera equations we get 4 linear equations
   in 3 unknowns:

   r1 - b14 = (b11 - b31*r1)x + (b12 - b32*r1)y + (b13 - b33*r1)z
   c1 - b24 = (b21 - b31*c1)x + (b22 - b32*c1)y + (b23 - b33*c1)z

   r2 - c14 = (c11 - c31*r2)x + (c12 - c32*r2)y + (c13 - c33*r2)z
   c2 - c24 = (c21 - c31*c2)x + (c22 - c32*c2)y + (c23 - c33*c2)z

A direct solution using only 3 of the 4 equations won’t give
reliable results.
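
A minimal sketch of solving these four equations in the three unknowns (x, y, z) by least squares, assuming 3x4 camera matrices B and C normalized so b34 = c34 = 1; names are illustrative.

```python
import numpy as np

def stereo_triangulate(B, C, r1, c1, r2, c2):
    """Solve the 4 linear equations in (x, y, z) by least squares.

    Each measured coordinate u from matrix row m (with third row m3)
    gives (m[:3] - u*m3[:3]) . [x y z] = u - m[3].
    """
    A, d = [], []
    for M, (r, c) in ((B, (r1, c1)), (C, (r2, c2))):
        for row, coord in ((0, r), (1, c)):
            A.append(M[row, :3] - coord * M[2, :3])
            d.append(coord - M[row, 3])
    p, *_ = np.linalg.lstsq(np.array(A), np.array(d), rcond=None)
    return p   # estimated (x, y, z)
```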


Solve by computing the closest approach of the two skew rays.

[Figure: ray 1 from point P1 with direction u1 and ray 2 from point Q1
with direction u2; V is the shortest segment connecting them, and P is
its midpoint.]

    V = (P1 + a1*u1) – (Q1 + a2*u2)

Solve for the shortest such V:

    [(P1 + a1*u1) – (Q1 + a2*u2)] · u1 = 0
    [(P1 + a1*u1) – (Q1 + a2*u2)] · u2 = 0

If the rays intersected perfectly in 3D, the intersection would be P.
Instead, we solve for the shortest line segment connecting the two
rays and let P be its midpoint.
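
A minimal sketch of this closest-approach computation: it expands the two perpendicularity conditions into a 2x2 linear system for a1 and a2 and returns the midpoint P; names are illustrative.

```python
import numpy as np

def midpoint_of_skew_rays(P1, u1, Q1, u2):
    """Midpoint of the shortest segment between rays P1 + a1*u1 and Q1 + a2*u2.

    Solves the two conditions that the connecting segment V be
    perpendicular to both ray directions u1 and u2.
    """
    w0 = P1 - Q1
    a, b, c = u1 @ u1, u1 @ u2, u2 @ u2
    d, e = u1 @ w0, u2 @ w0
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("rays are parallel")
    a1 = (b * e - c * d) / denom
    a2 = (a * e - b * d) / denom
    p1 = P1 + a1 * u1            # closest point on ray 1
    p2 = Q1 + a2 * u2            # closest point on ray 2
    return (p1 + p2) / 2.0       # P = midpoint of the shortest segment
```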
