Professional Documents
Culture Documents
High Dimensional Spaces
High Dimensional Spaces
BITS Pilani
Pilani Campus
High dimensional space
• Our geometric intuition often fails in higher dimensions.
• Many properties of simple objects, such as higher dimensional
analogs of cubes and spheres, are very counterintuitive.
Example
• Most of the volume of a high dimensional objects is near the surface.
• The volume of a sphere approaches 0 as the dimension increases to
high values.
• Vectors in high dimensions become nearly orthogonal.
3
Spheres and cubes in 2-d
• Consider a square with side length 1. 1/2
• At each corner of the square place a circle
of radius 1/2, so that the circles cover the 1
edges of the square.
• Then consider the circle centered at the
center of the square that is just large
enough to touch the circles at the corners of
the square.
4
Spheres and cubes in 3-d
5
Spheres and cubes in 3-d
• To understand what happens in higher
dimensions, we need to compute the
radius of the inner sphere in terms of the
dimension.
• The radius of the inner sphere is equal to 1
1/2(Length of diagonal of cube) – (Radius of
sphere at the corners).
• The length of the diagonal is .
• Thus, the radius of the inner sphere is
6
Spheres and cubes in 3-d
• In dimension, 2 and 3, the inner sphere is
inside the cube.
• In the radius of the inner sphere is equal to .
It touches the cube.
• In the radius of the inner sphere is
7
Volume in high dimensions
• The area of a circle is .
• Intersect the sphere with a plane at some
height above the center of the sphere.
• Summing up cross-sectional area gives
volume.
8
Volume of a unit sphere
10
Concentration of measure (approach to
expected value)
11
Concentration of measure
• Every point on a d-sphere
(circle is 1-sphere) must
satisfy the equation,
•.
• As increases the number of
terms in the sum increases,
and each coordinate gets a
smaller share.
14
Homework
• Find the distribution of the points (x,y) randomly selected from the
circumference of a circle (i.e. 1-sphere).
Hint: Calculate distributions of x=cos(), y=sin().
15
Gaussians in high dimension
√
𝑘
¿∨𝑥∨¿ 2= ∑𝑥 𝑖
2
𝑖=1
• s distributed according to
the chi distribution.
• Mode of chi distribution is
• In high dimensions, k is
represented as d, i.e. the
number of dimensions.
• Most of the samples gets
concentrated around a
sphere.
17
Chi distribution with ‘k’ degrees of freedom
Clustering in high dimensions
• Consider X1∼(μ1,I), X2∼(μ2,I), X3∼(μ3,I),
μ1=[0,0,0,⋯]T, μ2=[5,0,0,⋯]T, μ3=[10,0,0,⋯]T.
18
Clustering in high dimensions
19
Distance matrix of points
20
Better clustering in low dimensions
21
Spherical Gaussian
• The –dimensional spherical Gaussian (pdf has spherical symmetry) with 0 mean
and variance in each coordinate has density function
24