
EXPERIMENT NO.

AIM:-

Implementation of a hierarchical clustering method.

THEORY:-

Introduction to Hierarchical Clustering:-

Hierarchical clustering is another unsupervised learning algorithm that is used to group together unlabeled data points having similar characteristics. Hierarchical clustering algorithms fall into the following two categories.
Agglomerative hierarchical algorithms − In agglomerative hierarchical algorithms, each data point is initially treated as a single cluster, and pairs of clusters are then successively merged (a bottom-up approach). The hierarchy of the clusters is represented as a dendrogram or tree structure.
Divisive hierarchical algorithms − On the other hand, in divisive hierarchical algorithms, all the data points are treated as one big cluster, and clustering involves dividing (a top-down approach) that one big cluster into various smaller clusters.

Steps to Perform Agglomerative Hierarchical Clustering

We are going to explain the most widely used and important variant of hierarchical clustering, i.e. agglomerative. The steps to perform it are as follows (a minimal code sketch is given after the list):
• Step 1 − Treat each data point as a single cluster. Hence, we start with, say, K clusters; the number of data points is also K at the start.
• Step 2 − Now form a bigger cluster by joining the two closest data points. This results in a total of K-1 clusters.
• Step 3 − To form more clusters, join the two closest clusters. This results in a total of K-2 clusters.
• Step 4 − To form one big cluster, repeat the above steps until only one cluster remains, i.e. there are no more clusters left to join.
• Step 5 − At last, after making one single big cluster, the dendrogram is cut to divide it into multiple clusters, depending upon the problem.
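As a minimal sketch of the loop described in these steps (assuming, for illustration only, that the distance between two clusters is the Euclidean distance between their centroids; other linkage rules are possible):

import math

def centroid(cluster):
    # Component-wise mean of the points in a cluster.
    return [sum(coords) / len(cluster) for coords in zip(*cluster)]

def agglomerative(points):
    # Step 1: every point starts as its own cluster, so we begin with K clusters.
    clusters = [[p] for p in points]
    merges = []  # record of (cluster A, cluster B, distance) merges, in order
    # Steps 2-4: repeatedly merge the two closest clusters until one remains.
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = math.dist(centroid(clusters[i]), centroid(clusters[j]))
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    # The merge record is what a dendrogram (Step 5) visualizes.
    return merges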
Let’s visualize how hierarchical clustering works with an example.

Suppose we have data on the marks scored by four students in Math and Science, and we need to create clusters of students to draw insights.

Now that we have the data, the first step is to see how far each data point is from every other.
For this, we construct a distance matrix. The distance between each pair of points can be found using various metrics, e.g. Euclidean distance, Manhattan distance, etc. We’ll use Euclidean distance for this example:
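The original marks table is not reproduced in this copy, so the values below are assumed sample marks for S1–S4; the sketch only illustrates how such a distance matrix can be computed:

import math

# Assumed (Math, Science) marks for four students; not the original data.
marks = {"S1": (85, 90), "S2": (83, 89), "S3": (60, 55), "S4": (30, 40)}

# Pairwise Euclidean distance matrix, printed one row per student.
for a in marks:
    print(a, [round(math.dist(marks[a], marks[b]), 2) for b in marks])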
We now form a cluster of S1 and S2 because they are the closest to each other. Now a question arises: what does our data look like now?

We take the average of the marks obtained by S1 and S2, and the resulting values represent the marks for this cluster. Instead of the average, we can consider the maximum or minimum values of the data points in the cluster.

Again, find the closest points and create another cluster, as the short sketch below illustrates.
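A one-step sketch of the merge, continuing with the assumed marks above:

# S1 and S2 (assumed marks) are the closest pair, so they are merged and
# represented by the average of their marks; min or max could be used instead.
s1, s2 = (85, 90), (83, 89)
merged = tuple((a + b) / 2 for a, b in zip(s1, s2))
print(merged)  # (84.0, 89.5) now stands in for the cluster {S1, S2}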


If we repeat the steps above and keep on clustering until we are left with just one cluster containing all the points, we get a result which looks something like this:

The figure we get is what we call a dendrogram. A dendrogram is a tree-like diagram that illustrates the arrangement of the clusters produced by the corresponding analysis. The samples on the x-axis are arranged so that points in close proximity appear next to each other.
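As a sketch, the dendrogram can be drawn with SciPy, again using the assumed marks from above; linkage() builds the merge tree and dendrogram() plots it:

import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import dendrogram, linkage

data = [(85, 90), (83, 89), (60, 55), (30, 40)]  # assumed S1..S4 marks
Z = linkage(data, method="average")  # merge tree, average linkage
dendrogram(Z, labels=["S1", "S2", "S3", "S4"])
plt.xlabel("Students")
plt.ylabel("Euclidean distance")
plt.show()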

PROGRAM:-
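The original program is not reproduced in this copy; below is a minimal sketch using scikit-learn's AgglomerativeClustering, run on the assumed student marks from the theory section.

import numpy as np
from sklearn.cluster import AgglomerativeClustering

# Assumed (Math, Science) marks for students S1..S4; not the original data.
X = np.array([[85, 90], [83, 89], [60, 55], [30, 40]])

# Cut the hierarchy at two clusters, using average linkage on Euclidean distances.
model = AgglomerativeClustering(n_clusters=2, linkage="average")
labels = model.fit_predict(X)

for student, label in zip(["S1", "S2", "S3", "S4"], labels):
    print(student, "-> cluster", label)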
OUTPUT:-
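With the assumed marks in the sketch above, the two-cluster cut groups S1 with S2 and S3 with S4 (the numeric cluster labels themselves may differ between runs or library versions).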
CONCLUSION:-

Thus, we learnt about and implemented the hierarchical clustering algorithm.
