Clustering: Prof. Ankur Sinha

This document discusses clustering, an unsupervised machine learning technique used to group unlabeled data points into clusters based on similarity. It provides examples of clustering applications in marketing, urban planning, and more. Different similarity measures for comparing data points are introduced, such as Euclidean distance. An example clusters 10 customers defined by age and service usage attributes. Hierarchical and k-means clustering algorithms are overviewed, with k-means explained as iteratively assigning points to centroids and updating centroids.

Clustering

Prof. Ankur Sinha

Indian Institute of Management Ahmedabad
Gujarat India
Clustering
• Grouping a set of data objects into different
groups based on similarity
• An example of unsupervised learning
• Data objects can be vectors representing
different attributes for an object, for example,
customer, location, product, etc.
Examples
• Used in a variety of areas
– Marketing
– Urban planning
– Customer segmentation
– Product segmentation
– Seismology
Similarity Measure
• Suppose two objects i and j are represented by
vectors x_i and x_j
– How do you measure the similarity between the two
objects? Common choices are distances, where a smaller
distance means greater similarity:
• Euclidean distance
• Manhattan distance
• Mahalanobis distance
– The similarity measure can be chosen based on the application
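The three distances named above can be sketched in a few lines of Python. This is a minimal illustration rather than code from the slides: the function names are made up for this example, the Mahalanobis version is restricted to 2-D vectors so the matrix inverse can be written by hand, and the covariance matrix `cov` is a hypothetical value chosen only for demonstration.

```python
import math

def euclidean(x, y):
    """Straight-line distance: square root of the summed squared differences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def manhattan(x, y):
    """City-block distance: sum of the absolute differences."""
    return sum(abs(a - b) for a, b in zip(x, y))

def mahalanobis_2d(x, y, cov):
    """Mahalanobis distance for 2-D vectors, given a 2x2 covariance matrix.

    Computes sqrt(d^T * inverse(cov) * d) with d = x - y.
    """
    (a, b), (c, d) = cov
    det = a * d - b * c
    inv = ((d / det, -b / det), (-c / det, a / det))
    dx, dy = x[0] - y[0], x[1] - y[1]
    q = (dx * (inv[0][0] * dx + inv[0][1] * dy)
         + dy * (inv[1][0] * dx + inv[1][1] * dy))
    return math.sqrt(q)

# Two example points; the covariance matrix below is an assumed, illustrative value.
x_i, x_j = (3, 4), (6, 2)
cov = ((4.0, 0.0), (0.0, 1.0))
print(euclidean(x_i, x_j), manhattan(x_i, x_j), mahalanobis_2d(x_i, x_j, cov))
```

With an identity covariance matrix the Mahalanobis distance reduces to the Euclidean distance, which is one way to see it as a Euclidean distance rescaled by the spread of the data.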
Similarity Measure
• Consider 10 customers with two attributes
– Attribute 1: Recent usage of services
– Attribute 2: Customer age
• Objective: Cluster the data into two groups and design two marketing
campaigns, one for each customer segment
[Figure: scatter plot of the 10 customers. x-axis: Usage of Service (× 10 minutes), 0 to 10; y-axis: Customer Age (× 10 years), 0 to 10]
Similarity Measure
• Consider 10 customers with two attributes
– Attribute 1: Usage of services
– Attribute 2: Customer age

[Figure: scatter plot of the 10 customers, both axes 0 to 10, with the points grouped into two clusters]
• Cluster 1: (3,4), (2,6), (4,5), (4,7), (3,8)
• Cluster 2: (6,2), (7,2), (7,4), (8,4), (8,5)
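As a quick check on this grouping, the sketch below computes the mean point (centroid) of each cluster and confirms that every customer is closer to its own cluster's centroid than to the other one. It takes Cluster 1 as (3,4), (2,6), (4,5), (4,7), (3,8) and Cluster 2 as the remaining five points, which is how the slide appears to group them; the helper names are made up for this example.

```python
cluster_1 = [(3, 4), (2, 6), (4, 5), (4, 7), (3, 8)]
cluster_2 = [(6, 2), (7, 2), (7, 4), (8, 4), (8, 5)]

def centroid(points):
    """Mean of the points, coordinate by coordinate."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)

def sq_dist(p, q):
    """Squared Euclidean distance (enough for comparing which centroid is nearer)."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

c1 = centroid(cluster_1)
c2 = centroid(cluster_2)

# True if every point is nearer to the centroid of the cluster it was placed in.
consistent = (all(sq_dist(p, c1) < sq_dist(p, c2) for p in cluster_1)
              and all(sq_dist(p, c2) < sq_dist(p, c1) for p in cluster_2))
print(c1, c2, consistent)
```

The centroids work out to roughly (3.2, 6.0) and (7.2, 3.4), so one segment has lower usage values and higher age values than the other.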
Clustering approaches
• Hierarchical clustering
– Agglomerative
– Divisive
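[Figure: dendrogram over objects a, b, c, d, e. Agglomerative clustering (AGNES) runs from Step 0 to Step 4, merging bottom-up into ab, de, cde and finally abcde. Divisive clustering (DIANA) runs the same steps in reverse, from Step 0 (one cluster, abcde) to Step 4 (single objects).]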
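The agglomerative direction can be sketched as a loop that keeps merging the two closest clusters until one remains. The slides do not say how the distance between two clusters is measured, so this example assumes single linkage (distance between the closest members). The 1-D positions of a to e are hypothetical, chosen only so that the merges come out as ab, de, cde, abcde like the labels in the figure.

```python
from itertools import combinations

# Hypothetical 1-D positions (not from the slides), picked to reproduce the figure's merges.
points = {"a": 0.0, "b": 1.0, "c": 5.0, "d": 7.0, "e": 8.5}

def single_link(c1, c2):
    """Single linkage (an assumption): distance between the closest pair of members."""
    return min(abs(points[p] - points[q]) for p in c1 for q in c2)

def agnes(labels):
    """Merge the two closest clusters repeatedly; return the merged groups in order."""
    clusters = [(label,) for label in labels]  # Step 0: every object is its own cluster
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2),
                     key=lambda pair: single_link(*pair))
        clusters.remove(c1)
        clusters.remove(c2)
        merged = tuple(sorted(c1 + c2))
        clusters.append(merged)
        merges.append("".join(merged))
    return merges

merges = agnes(points)
print(merges)
```

A divisive method would produce a hierarchy in the opposite direction, starting from the single cluster abcde and splitting it step by step.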
Clustering approaches
• K-means Clustering
– Select k initial centroids randomly
– Assign each object to its nearest centroid based on the
similarity measure
– Compute each new centroid as the mean of the objects in its cluster
– Repeat the above two steps until the assignments no longer
change
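The steps above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not a reference implementation: it uses squared Euclidean distance as the similarity measure, the `init`, `seed` and `max_iter` parameters are conveniences added for this example, and an empty cluster simply keeps its previous centroid. The data are the ten customer points shown on the earlier slide.

```python
import random

def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(coords) / len(points) for coords in zip(*points))

def kmeans(points, k, init=None, seed=0, max_iter=100):
    # Step 1: pick k initial centroids (randomly from the data unless init is given).
    centroids = list(init) if init is not None else random.Random(seed).sample(points, k)
    assignment = None
    for _ in range(max_iter):
        # Step 2: assign every point to its nearest centroid.
        new_assignment = [min(range(k), key=lambda j: sq_dist(p, centroids[j]))
                          for p in points]
        # Step 4: stop once the assignment no longer changes.
        if new_assignment == assignment:
            break
        assignment = new_assignment
        # Step 3: move each centroid to the mean of the points assigned to it.
        for j in range(k):
            members = [p for p, a in zip(points, assignment) if a == j]
            if members:  # an empty cluster keeps its old centroid (a simplification)
                centroids[j] = mean(members)
    return centroids, assignment

customers = [(3, 4), (2, 6), (4, 5), (4, 7), (3, 8),
             (6, 2), (7, 2), (7, 4), (8, 4), (8, 5)]
centroids, labels = kmeans(customers, k=2)
print(centroids, labels)
```

Because the starting centroids are random, the cluster numbering, and on less well-separated data the clusters themselves, can differ from run to run. Passing `init` makes a run reproducible.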
K-Means Clustering
[Figure: six panels, in order: Start with centroids randomly placed; Assign points to the centroids; Update centroids; Assign points to the new centroids; Update centroids; Assign points to the new centroids]
• Continue until there is no change in the structure of the clusters
