Professional Documents
Culture Documents
Stat 390 Presentation 2
Stat 390 Presentation 2
Information Criterion
Approach
Silhouettes Method
Jump Method
Gap Statistic
k-Means Clustering Example
The (dis)similarity of observations are analyzed by their
Euclidean distance, as done so in k-means clustering
A linkage criterion then specifies the dissimilarity of sets as a
function of the pairwise distances of observations in the sets
Agglomerative Hierarchical
Clustering
Suppose we begin with the same eleven different instances we would like to
cluster. Here, we take a “top-down" approach:
1. All observations start in one cluster.
2. Splits are performed recursively as one moves down the hierarchy based
on similarities between the observations.
3. At each step of iteration, the most heterogeneous cluster is divided into
two until all observations are in their own cluster
Works Cited
James, Gareth, et al. An Introduction to Statistical Learning:
with Applications in R. Springer, 2017.