
Xavier Institute of Engineering

Department of Information Technology


Question Bank TE IT AY-22-23 Sem VI

Module 04: Clustering & Outlier Analysis

1. Choose any two techniques for finding the distance between clusters in
hierarchical clustering and explain them with an example. (CO4, BTL03)
2. Apply any hierarchical clustering technique to cluster the following points, showing
the distances between clusters. (CO4, BTL03)

NO X Y
1 2 10
2 2 5
3 8 4
4 5 8
5 7 5
6 6 4
7 1 2
8 4 9

3. Demonstrate what an outlier is and describe methods that can be used for outlier
analysis. (CO4, BTL03)

4. Demonstrate the working of the DBSCAN algorithm using appropriate diagrams. (CO4, BTL03)

5. Demonstrate the working of the BIRCH algorithm using appropriate diagrams. (CO4, BTL03)

6. Illustrate the working of the K-means algorithm using an appropriate example. (CO4, BTL03)

7. Illustrate the working of the K-medoids algorithm using an appropriate example. (CO4, BTL03)

8. What is clustering? Illustrate the working of the K-means algorithm for the
following data with K = 2: {2, 4, 10, 12, 3, 20, 11, 25}. (CO4, BTL03)

9. Use K-means to cluster the following data set into 3 clusters. Discuss the strengths and
weaknesses of K-means clustering. (CO4, BTL03)

Protein 20 21 15 22 20 25 26 20 18 20
Fat 9 9 7 17 8 12 14 9 9 9

10. Coordinates of objects are given below. Apply K-medoids (PAM) with the number of
clusters = 2. (CO4, BTL03)

Number X Y
1 1 4
2 5 1
3 5 2
4 5 4
5 10 4
6 25 4
7 25 6
8 25 7
9 25 8
10 29 7

11. Assume that the dataset D is given by the table below. Follow single link, complete
link and average link technique to find clusters in D. Use Euclidean distance
measure. (CO4, BTL03)

Point X Y
P1 0.40 0.53
P2 0.22 0.38
P3 0.35 0.32
P4 0.26 0.19
P5 0.08 0.41
P6 0.45 0.30

12. Illustrate the working of K-means algorithm for the following data K=02, Data:
{2,14,10,12,3,20,21,37}. (CO4, BTL03)
13. Explain K-means clustering and solve the following with k = 3, Data: {2, 3, 6,
8, 9, 12, 15, 18, 22}.

14. What are outliers? Explain outlier analysis. (CO4, BTL04)

15. Explain the types of outliers. (CO4, BTL04)

16. Describe different outlier detection methods. (CO4, BTL01)
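
As a hedged illustration of the kind of computation Question 8 asks for, the following is a minimal sketch of one-dimensional K-means in Python. The function name `kmeans_1d` is hypothetical, and the choice of the first two data values (2 and 4) as initial centroids is an assumption, since the question does not fix the starting centroids; other starting points may converge differently.

```python
def kmeans_1d(data, initial_centroids, max_iter=100):
    """Plain K-means on 1-D data: assign each point to the nearest
    centroid, recompute centroids as cluster means, repeat until stable."""
    centroids = list(initial_centroids)
    clusters = [[] for _ in centroids]
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid
        # (ties go to the lower-indexed centroid).
        clusters = [[] for _ in centroids]
        for x in data:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(x - centroids[i]))
            clusters[nearest].append(x)
        # Update step: a centroid becomes the mean of its cluster;
        # an empty cluster keeps its previous centroid.
        new_centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break  # converged: assignments can no longer change
        centroids = new_centroids
    return centroids, clusters


# Data from Question 8; the initial centroids are an assumption.
data = [2, 4, 10, 12, 3, 20, 11, 25]
centroids, clusters = kmeans_1d(data, initial_centroids=[2, 4])
print(centroids)
print(clusters)
```

With these assumed starting centroids, the sketch settles on the clusters {2, 3, 4} and {10, 11, 12, 20, 25}, with centroids 3 and 15.6.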

Points to remember:

1. Clustering is the process of grouping a set of abstract objects into classes of
similar objects. A cluster of data objects can be treated as one group.

2. Clustering can also help marketers discover distinct groups in their customer base
and characterize those groups based on purchasing patterns.

3. Clustering also helps in classifying documents on the web for information
discovery.

4. Clustering Algorithm k-Means: A Centroid-Based Technique.

5. Clustering Algorithm k-Medoids: A Representative Object-Based Technique.

6. Other clustering methods: hierarchical clustering, DBSCAN, BIRCH.


7. An outlier is an observation that lies an abnormal distance from other values in a
random sample from a population. There exist data objects that do not comply with the
general behavior or model of the data. Such data objects, which are grossly different
from or inconsistent with the remaining set of data, are called outliers.

8. Applications: intrusion detection, fraud detection.

9. Outlier detection techniques: density-based techniques, feature bagging, support
vector machines.
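
To make the definition of an outlier in point 7 concrete, here is a minimal sketch that flags values lying far from the mean using a z-score rule. This is a simple statistical method, not one of the techniques listed in point 9; the function name `zscore_outliers` is hypothetical and the threshold of 2 standard deviations is an assumed cutoff. The data are taken from Question 12.

```python
import statistics


def zscore_outliers(values, threshold=2.0):
    """Return the values whose distance from the mean exceeds
    `threshold` population standard deviations."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return []  # all values identical, so nothing stands out
    return [v for v in values if abs(v - mean) / std > threshold]


# Data from Question 12; the threshold of 2.0 is an assumption.
data = [2, 14, 10, 12, 3, 20, 21, 37]
outliers = zscore_outliers(data)
print(outliers)
```

Under these assumptions only the value 37 is flagged; a stricter threshold such as 3 would flag nothing in this small sample.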

QB Prepared by: Ms. Jaya Jeswani

(Subject In-Charge)
