Professional Documents
Culture Documents
Som 5
Som 5
Online
K-means
clustering
Scale-based Self-organizing
clustering Map (SOM)
Online clustering
N 2
E (mi ( j ) x j )
j 1
mi mi E
mi mi E
mi ( j ) ( x j mi ( j ) )
Scale-based Clustering
• Clustering is done at a “scale”
• An answer to the question of “how many
clusters”
• Best clusters tend to live over the longest
range of scales
Algorithm
• Start with a large number of clusters
• Initialize by selecting from data set
• Initialize “sigma” to a small value
• Update all centroids
• Eliminate duplicate centroids whenever there is a
merger
• Increase sigma by a constant factor
• If there are more than 1 unique centroid continue
update of centroids
• Stop only when a single unique centroid remains
Data set
Clustering result:
Evolution of the centroids
THE SELF-ORGANIZING MAP
SOM topology
2D SOM
(rectangular)
1D SOM
3D SOM
Toroid SOM
(hexagonal)
Topographic maps in the brain
Winner and its Neighborhood
Neighborhood
Neighborhood functions (L(r*, r))
For r N
wr L (r , r*)( x p wr )
Neighborhood, N
ordering settling
settling
Simple examples (2D data; 1D SOM)
2D data; 1D SOM
Peano’s curve
Or
Space-filling
curve
(Kohonen 1990)
3D data; 2D SOM
3D data; 2D SOM
Initial random weights
1
0.8
0.6
0.4
0.2
W(2,j)
-0.2
-0.4
-0.6
-0.8
-1
-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1
W(1,j)
Network after 100 iterations
1
0.8
0.6
0.4
0.2
W(2,j)
-0.2
-0.4
-0.6
-0.8
-1
-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1
W(1,j)
Network after 1000 iterations
1
0.8
0.6
0.4
0.2
W(2,j)
-0.2
-0.4
-0.6
-0.8
-1
-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1
W(1,j)
Network after 10,000 iterations
1
0.8
0.6
0.4
0.2
W(2,j)
-0.2
-0.4
-0.6
-0.8
-1
-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1
W(1,j)
Another example
Phonetic Keyboard
Mapping World Poverty