Department of Information Technology: Question Bank TE IT AY-22-23 Sem VI Module 04: Clustering & Outlier Analysis
1. Choose any two techniques for finding the distance between clusters in
hierarchical clustering and explain them with an example. (CO4, BTL03)
2. Apply any hierarchical clustering technique for finding the distance between clusters
to the following data points. (CO4, BTL03)
NO X Y
1 2 10
2 2 5
3 8 4
4 5 8
5 7 5
6 6 4
7 1 2
8 4 9
3. Demonstrate what an outlier is and describe methods that can be used for outlier
analysis. (CO4, BTL03)
8. What is clustering? Illustrate the working of the K-means algorithm for the
following data with K = 2. Data: {2,4,10,12,3,20,11,25}. (CO4, BTL03)
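A hand-worked answer to the question above can be cross-checked with a minimal sketch like the one below. It is only an illustration under stated assumptions: the question does not fix the starting centroids, so the first two values (2 and 4) are assumed here, and the helper name kmeans_1d is hypothetical.

```python
def kmeans_1d(points, centroids, max_iter=100):
    """Plain K-means on one-dimensional data (Lloyd-style iterations)."""
    centroids = list(centroids)
    clusters = []
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid
        # (ties go to the lower-indexed centroid).
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: each centroid becomes the mean of its cluster;
        # an empty cluster keeps its previous centroid.
        new_centroids = [
            sum(c) / len(c) if c else centroids[i] for i, c in enumerate(clusters)
        ]
        if new_centroids == centroids:  # no change, so the algorithm has converged
            break
        centroids = new_centroids
    return centroids, clusters


data = [2, 4, 10, 12, 3, 20, 11, 25]
# ASSUMPTION: initial centroids are the first two data values.
centroids, clusters = kmeans_1d(data, [2, 4])
print(centroids, clusters)
```

With these assumed starting centroids the iterations settle on the clusters {2, 3, 4} and {10, 11, 12, 20, 25}, with means 3 and 15.6. A different initial choice may follow a different path, which is one of the weaknesses of K-means worth discussing.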
9. Use K-means to cluster the following data set into 3 clusters. Discuss the strengths and
weaknesses of K-means clustering. (CO4, BTL03)
Protein 20 21 15 22 20 25 26 20 18 20
Fat     9  9  7  17 8  12 14 9  9  9
10. Coordinates of objects are given below. Apply K-medoids (PAM). Number of
clusters = 2. (CO4, BTL03)
1 1 4
2 5 1
3 5 2
4 5 4
5 10 4
6 25 4
7 25 6
8 25 7
9 25 8
10 29 7
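The following is a simplified sketch of the swap idea behind PAM for the ten objects above, meant only as a way to check a hand solution. Several things are assumptions, not part of the question: Manhattan distance is used because no metric is specified, objects 1 and 2 are taken as the initial medoids, the full PAM build phase is skipped, and the function names are hypothetical.

```python
from itertools import product

# Objects from the question: id -> (x, y)
objects = {
    1: (1, 4), 2: (5, 1), 3: (5, 2), 4: (5, 4), 5: (10, 4),
    6: (25, 4), 7: (25, 6), 8: (25, 7), 9: (25, 8), 10: (29, 7),
}


def manhattan(p, q):
    # ASSUMPTION: Manhattan distance; the question does not name a metric.
    return abs(p[0] - q[0]) + abs(p[1] - q[1])


def total_cost(medoids):
    """Sum of distances from every object to its nearest medoid."""
    return sum(
        min(manhattan(objects[o], objects[m]) for m in medoids) for o in objects
    )


def pam_swap(initial_medoids):
    """Accept any medoid/non-medoid swap that lowers the total cost."""
    medoids = list(initial_medoids)
    best = total_cost(medoids)
    improved = True
    while improved:
        improved = False
        for i, candidate in product(range(len(medoids)), objects):
            if candidate in medoids:
                continue
            trial = medoids[:i] + [candidate] + medoids[i + 1:]
            cost = total_cost(trial)
            if cost < best:  # keep the swap only if it strictly improves
                medoids, best, improved = trial, cost, True
    return sorted(medoids), best


def assign(medoids):
    """Group object ids by their nearest medoid."""
    clusters = {m: [] for m in medoids}
    for o, xy in objects.items():
        nearest = min(medoids, key=lambda m: manhattan(xy, objects[m]))
        clusters[nearest].append(o)
    return clusters


medoids, cost = pam_swap([1, 2])  # ASSUMPTION: arbitrary initial medoids
clusters = assign(medoids)
print(medoids, cost, clusters)
```

Under these assumptions the search ends with objects 4 and 8 as medoids, a total cost of 23, and the clusters {1, 2, 3, 4, 5} and {6, 7, 8, 9, 10}. With Euclidean distance the selected medoids can differ, so the metric used should be stated in the answer.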
11. Assume that the dataset D is given by the table below. Apply the single-link, complete-link
and average-link techniques to find clusters in D. Use the Euclidean distance
measure. (CO4, BTL03)
X Y
P1 0.4 0.53
P2 0.22 0.38
P3 0.35 0.32
P4 0.26 0.19
P5 0.08 0.41
P6 0.45 0.30
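The three linkage rules for dataset D can be compared with a small agglomerative sketch such as the one below. It is a naive illustration, not an optimized or prescribed implementation; the function names and the tuple-based cluster representation are assumptions made for this example.

```python
from itertools import combinations
from math import dist  # Euclidean distance, Python 3.8+

# Dataset D from the question
points = {
    "P1": (0.40, 0.53), "P2": (0.22, 0.38), "P3": (0.35, 0.32),
    "P4": (0.26, 0.19), "P5": (0.08, 0.41), "P6": (0.45, 0.30),
}


def linkage_distance(a, b, method):
    """Distance between clusters a and b (tuples of point names)."""
    d = [dist(points[p], points[q]) for p in a for q in b]
    if method == "single":    # closest pair of points
        return min(d)
    if method == "complete":  # farthest pair of points
        return max(d)
    if method == "average":   # mean over all cross-cluster pairs
        return sum(d) / len(d)
    raise ValueError(f"unknown method: {method}")


def agglomerate(method):
    """Merge the two closest clusters repeatedly; return the merge history."""
    clusters = [(name,) for name in points]
    merges = []
    while len(clusters) > 1:
        a, b = min(
            combinations(clusters, 2),
            key=lambda pair: linkage_distance(pair[0], pair[1], method),
        )
        merges.append((a, b, round(linkage_distance(a, b, method), 4)))
        clusters = [c for c in clusters if c not in (a, b)] + [a + b]
    return merges


for method in ("single", "complete", "average"):
    print(method)
    for a, b, height in agglomerate(method):
        print("  merge", a, "+", b, "at", height)
```

Each printed line is one level of the dendrogram. Note that with the coordinates as given, d(P2, P3) and d(P2, P5) are both about 0.143, so under single link the order of the second and third merges depends on how ties are broken; the merge heights are the same either way.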
12. Illustrate the working of the K-means algorithm for the following data with K = 2. Data:
{2,14,10,12,3,20,21,37}. (CO4, BTL03)
13. Explain K-means clustering and solve the following with k = 3, Data: {2, 3, 6,
8, 9, 12, 15, 18, 22}.
Points to remember:
2. Clustering can also help marketers discover distinct groups in their customer base
and characterize those groups based on their purchasing patterns.
(Subject In-Charge)