Welcome to Scribd!

Kmeans Algorithm

Uploaded by

0% found this document useful (0 votes)

107 views9 pages

K-means clustering aims to partition data into k clusters, with each observation belonging to the cluster with the nearest mean. The standard k-means algorithm iteratively assigns observations to clusters based on means and recalculates the means based on the observations assigned. However, k-means does not guarantee an optimal solution due to its dependence on initial mean locations, so it is common to run it multiple times.

Original Description:

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

107 views9 pages

Kmeans Algorithm

Uploaded by

misscoma

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 9

Search inside document

K-Means Clustering

1
K-Means Clustering

The k-means clustering aims to partition n data into k sets.

Objective Function n=11

k 2

arg min ∑ ∑ Χ j − mi
S i =1 Χ j ∈S i

where mi is the mean of points in Si

Observation set : ( X 1 , X 2 ,..., X n ) n=11, k=3(S1,S2,S3)

partition the n observations into k sets S.

k < n, S={S1, S2, … , Sk}

http://en.wikipedia.org/wiki/K-means_clustering, 30 March 2010

2
K-Means Clustering

Basic idea
• proposed by Hugo Steinhaus in 1956
Standard Algorithm
• proposed by Stuart Lloyd in 1957
• for a pulse-code modulation technique
The term “K-means”
• proposed by James MacQueen in 1967

http://en.wikipedia.org/wiki/K-means_clustering, 30 March 2010

3
Standard Algorithm

Standard Algorithm (k-means algorithm, Lloyd’s algorithm)

Assignment:

initial set of k means: m1(1),…,m

mk(1)
(selected by a random or heuristic method)

Update:

calculate the new means

-> centroid of the objects in the cluster

repeat until stable

-> no objects move group

http://en.wikipedia.org/wiki/K-means_clustering, 30 March 2010

4
Standard Algorithm

1. Select k points which are initial centroids of groups.

2. Assign each object to a group of the closest centroid.

3. When all objects have been assigned, update k centroids.

4. Repeat step 2 and 3 until the centroids no longer move or

the objects no longer move to other groups.
Examples

Initial positions groups by initial 1st step

positions

2nd step 3rd step final step

K-means interactive demo, http://home.dei.polimi.it/matteucc/Clustering/tutorial_html/AppletKM.html, 30 March 2010

6
Examples-
Examples-Matlab
100 100

90 90
n=20, k=3
80 80

70 70
Operation flow
60 60 1. Select initial centroid
50 50
(random)
40 40 2. Calculate Euclidian
30 30 distance
20 20 3. Assign group (find
10 10 minimum distance)
0
0 10 20 30 40 50 60 70 80 90
0
0 10 20 30 40 50 60 70 80 90 4. Calculate position of
new centroid
Initial positions & 1st step
grouping 5. Calculate stop
100 100
condition
90 90

80 80

70 70

60 60

50 50

40 40

30 30

20 20

10 10 Matlab Statistics Toolbox

0
0 10 20 30 40 50 60 70 80 90
0
0 10 20 30 40 50 60 70 80 90 : IDX = KMEANS(X, K)
2nd step final step
7
Summary

K-Means clustering
• is a fast and simple algorithm
• to solve clustering problem
But the algorithm
• does not necessarily find optimal configuration
• due to initialization problem
• by random or heuristic selection
And so k-means algorithm
• can be run multiple times
• to reduce above effect.

8
References

Joaquin Perez Ortega, Ma. Del Rocio Boone Rojas, and Maria J.
Somodevilla Garica, “Research issues on K-means Algorithm:
An Experimental Trial Using Matlab”, Proceedings of the 2nd
Workshop on Semantic Web and New Technologies (SemWeb09),
Puebla, Mexico, March 23-24, 2009.

CTEC5723 HighAssurance Coursework IbrahimSegunAina
Document26 pages
CTEC5723 HighAssurance Coursework IbrahimSegunAina
ibrahim aina
No ratings yet
Aleks PDF
Document3 pages
Aleks PDF
Sokmean Meng
No ratings yet
Daa Part 2C
Document10 pages
Daa Part 2C
hasimot979
No ratings yet
Face Detection: EE368: Digital Image Processing
Document15 pages
Face Detection: EE368: Digital Image Processing
kishore 5
No ratings yet
Matlab 2D and 3D PLOTS
Document45 pages
Matlab 2D and 3D PLOTS
Mukt Shah
No ratings yet
Regresi Linear Sederhana
Document10 pages
Regresi Linear Sederhana
ACHMAD REZA FAHCRUROJI 2020
No ratings yet
HW 5
Document3 pages
HW 5
Zachary Puckett
No ratings yet
Unit3 and Unit4 Problem Set
Document20 pages
Unit3 and Unit4 Problem Set
8858imaddy
No ratings yet
EX#1
Document3 pages
EX#1
suresh kumar saini
No ratings yet
Quiz 2 Solutions: Introduction To Algorithms
Document13 pages
Quiz 2 Solutions: Introduction To Algorithms
PorkerriaCcdlv
No ratings yet
CBSE Class 10 Maths Worksheet Statistics 1
Document2 pages
CBSE Class 10 Maths Worksheet Statistics 1
Joban Sandhu
No ratings yet
Laboratory #4
Document3 pages
Laboratory #4
CHARLOTTE PINEDA
No ratings yet
Binary Search Trees
Document10 pages
Binary Search Trees
Ani
No ratings yet
Example 1: DFT of Sine Waveform: (One Cycle, Two Cycles and Seven Cycles)
Document15 pages
Example 1: DFT of Sine Waveform: (One Cycle, Two Cycles and Seven Cycles)
narasimhan kumaravelu
No ratings yet
Skripsi Thessa CHP III-biblio
Document21 pages
Skripsi Thessa CHP III-biblio
Andreas Lalogiroth
No ratings yet
LAB: Recursive Solution of Difference Equation
Document2 pages
LAB: Recursive Solution of Difference Equation
MA Khan
No ratings yet
An Introduction To The Analysis of Extreme Values Using R and Extremes
Document82 pages
An Introduction To The Analysis of Extreme Values Using R and Extremes
abhinavatripathi
No ratings yet
Part 2
Document7 pages
Part 2
Roy Tufail
No ratings yet
11 Gul - Trench & Shaft 3
Document1 page
11 Gul - Trench & Shaft 3
ThaungMyint
No ratings yet
Monte Carlo Simulations Using Matlab: Vincent Leclercq, Application Engineer Email: Vincent - Leclercq@
Document29 pages
Monte Carlo Simulations Using Matlab: Vincent Leclercq, Application Engineer Email: Vincent - Leclercq@
Shan Deva
No ratings yet
CCP303
Document17 pages
CCP303
api-3849444
No ratings yet
3rd Sem Business Statistics Oct 2022
Document4 pages
3rd Sem Business Statistics Oct 2022
Chandan G
No ratings yet
Unit III Statistics
Document62 pages
Unit III Statistics
pj
No ratings yet
Sturm Liou Ville 3
Document6 pages
Sturm Liou Ville 3
imran5705074
No ratings yet
CBSE Class 10 Maths Worksheet - Statistics
Document2 pages
CBSE Class 10 Maths Worksheet - Statistics
Nilesh Vishwakarma
75% (4)
NeurIPS 2020 Neural Networks Fail To Learn Periodic Functions and How To Fix It Paper
Document12 pages
NeurIPS 2020 Neural Networks Fail To Learn Periodic Functions and How To Fix It Paper
supcontact47
No ratings yet
Hasil Pengukuran Suatu Percobaan Hasil Pengukuran Suatu Percobaan
Document4 pages
Hasil Pengukuran Suatu Percobaan Hasil Pengukuran Suatu Percobaan
nurulnajmis
No ratings yet
Midterm
Document5 pages
Midterm
Dipen Patel
No ratings yet
ANSWER KEYS Statistics
Document16 pages
ANSWER KEYS Statistics
John Cedrick Maglinao
No ratings yet
A Particle Swarm Optimization (PSO) Primer
Document16 pages
A Particle Swarm Optimization (PSO) Primer
sathiskumarBE
No ratings yet
Mean and Variance of The Sampling Distribution of The Sample Mean
Document3 pages
Mean and Variance of The Sampling Distribution of The Sample Mean
Cv
No ratings yet
Degeneracy & Optimisation Techniques of Transportation Problem
Document14 pages
Degeneracy & Optimisation Techniques of Transportation Problem
Gaming Rockstar
No ratings yet
Business Statistics Assign Men Ti
Document6 pages
Business Statistics Assign Men Ti
Yograj Rajput
100% (1)
04-Parameter Optimisation
Document13 pages
04-Parameter Optimisation
hameeee
No ratings yet
Example 1: DFT of Sine Waveform: Lecture Topic: Understanding DFT and FFT
Document15 pages
Example 1: DFT of Sine Waveform: Lecture Topic: Understanding DFT and FFT
ani
No ratings yet
Ashish Maths 10 B Statistics
Document14 pages
Ashish Maths 10 B Statistics
Akash Kumar
No ratings yet
Spiking Neural Network For On-Line Cognitive Activity EEG
Document8 pages
Spiking Neural Network For On-Line Cognitive Activity EEG
Phuong an
No ratings yet
Anfis Vignette
Document21 pages
Anfis Vignette
Wang Zhe
No ratings yet
Mid-1 - CSE P&S E2
Document1 page
Mid-1 - CSE P&S E2
Mohan Rao
No ratings yet
Mid-1 - CSE P&S E2
Document1 page
Mid-1 - CSE P&S E2
Mohan Rao
No ratings yet
A1INSE6220 Winter17sol PDF
Document5 pages
A1INSE6220 Winter17sol PDF
picala
No ratings yet
BMAT202L - CAT I - Model QP
Document1 page
BMAT202L - CAT I - Model QP
Pavneet Kaur
No ratings yet
Linear Regression: Best Fit Line
Document4 pages
Linear Regression: Best Fit Line
PAWAN TIWARI
No ratings yet
6 of 6 - Assignment - Practice
Document3 pages
6 of 6 - Assignment - Practice
tian jin
No ratings yet
Linear Programming For GC
Document14 pages
Linear Programming For GC
Marriel Palle Tahil
No ratings yet
Megc Technical Workshop: October 16, 2018
Document33 pages
Megc Technical Workshop: October 16, 2018
PratyushAgarwal
No ratings yet
Slides - B. Stat - I, Lecture 6 - Chap 3, Session 2, Median, Mode
Document22 pages
Slides - B. Stat - I, Lecture 6 - Chap 3, Session 2, Median, Mode
Kim Namjoonne
No ratings yet
Speedometer (1) : Your Own Text Goes Here Your Own Text Goes Here
Document8 pages
Speedometer (1) : Your Own Text Goes Here Your Own Text Goes Here
Shesharam Chouhan
No ratings yet
2A Data Description (A)
Document16 pages
2A Data Description (A)
SEVITHARNE A/P HARI SHANKER
No ratings yet
Psu Umd Daworkshop17 Hunt
Document17 pages
Psu Umd Daworkshop17 Hunt
Phuong an
No ratings yet
Applied Statistics...
Document11 pages
Applied Statistics...
I am Riju
No ratings yet
Chapter 14 - Statistics
Document34 pages
Chapter 14 - Statistics
wanetanishq
No ratings yet
Antiderivatives (6.1 & 6.4)
Document2 pages
Antiderivatives (6.1 & 6.4)
teachopensource
No ratings yet
Speed Dashboard
Document8 pages
Speed Dashboard
John Nose
No ratings yet
2023 Statistics Fin 6
Document21 pages
2023 Statistics Fin 6
T
No ratings yet
17 The Effect of Core
Document9 pages
17 The Effect of Core
Umair Nazir
No ratings yet
Introductory Statistics - Assignment 1
Document2 pages
Introductory Statistics - Assignment 1
chansadavid52
No ratings yet
Seminar on Micro-Local Analysis. (AM-93), Volume 93
From Everand
Seminar on Micro-Local Analysis. (AM-93), Volume 93
Victor Guillemin
No ratings yet
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
From Everand
Matrices with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
Rating: 3 out of 5 stars
3/5 (4)
Differentiation (Calculus) Mathematics Question Bank
From Everand
Differentiation (Calculus) Mathematics Question Bank
Mohmmad Khaja Shareef
Rating: 4 out of 5 stars
4/5 (1)
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
Rating: 2.5 out of 5 stars
2.5/5 (2)
Local 3D Shape Descriptor
Document17 pages
Local 3D Shape Descriptor
misscoma
No ratings yet
Least Squares & Pseudo Inverse
Document12 pages
Least Squares & Pseudo Inverse
misscoma
No ratings yet
Gda
Document17 pages
Gda
misscoma
No ratings yet
DHSCH 1
Document31 pages
DHSCH 1
misscoma
No ratings yet
Lec7 Full
Document35 pages
Lec7 Full
Hà Vân
No ratings yet
Beginning and Intermediate Algebra 5th Edition Elayn Martin Gay Test Bank
Document33 pages
Beginning and Intermediate Algebra 5th Edition Elayn Martin Gay Test Bank
dorothydoij03k
100% (36)
Residual Attention Network For Image Classification
Document9 pages
Residual Attention Network For Image Classification
Mistlemagic
No ratings yet
Two-Phase Method and Dual Simplex Method
Document8 pages
Two-Phase Method and Dual Simplex Method
Syed Waheeb Akhter Zaidi
100% (1)
Final Exam-2020
Document3 pages
Final Exam-2020
혁준
No ratings yet
Judul 3
Document41 pages
Judul 3
deyapertala062004
No ratings yet
Newton Raphson Method - Formula, Solved Examples
Document10 pages
Newton Raphson Method - Formula, Solved Examples
Rosemary Jibril
No ratings yet
Constrained Optimization-Lecture 11
Document2 pages
Constrained Optimization-Lecture 11
maimoona
No ratings yet
Fast Learning in Networks of Locally-Tuned Processing Units
Document14 pages
Fast Learning in Networks of Locally-Tuned Processing Units
dheerajkuma
No ratings yet
11 - Numerical Differentiation and Integration-Integration of Equations - (Romberg Integration)
Document8 pages
11 - Numerical Differentiation and Integration-Integration of Equations - (Romberg Integration)
Mohannad Qudah
No ratings yet
Legendre Polynomials PDF
Document19 pages
Legendre Polynomials PDF
Bappy K M B
No ratings yet
Week 07b
Document11 pages
Week 07b
ngokfong yu
No ratings yet
Factoring Polynomials GCF PDF
Document2 pages
Factoring Polynomials GCF PDF
Namsang
No ratings yet
MATH459 Project1 Solution
Document9 pages
MATH459 Project1 Solution
Gabriel Gonzalez
No ratings yet
Lecture 2: Roots of Equation: Dr. Nor Alafiza Yunus
Document62 pages
Lecture 2: Roots of Equation: Dr. Nor Alafiza Yunus
Haziq Khairi
No ratings yet
Fortran Code For Numerical Integration: Part-4
Document6 pages
Fortran Code For Numerical Integration: Part-4
N. T. Dadlani
No ratings yet
Presentation VIKOR
Document11 pages
Presentation VIKOR
rongphar9alon
No ratings yet
Assignment 2
Document2 pages
Assignment 2
mech mech1
No ratings yet
Widrow-Hoff Learning Rule
Document9 pages
Widrow-Hoff Learning Rule
حيدر الجوهر
No ratings yet
Efficient Epileptic Seizure Prediction Based On Deep Learning
Document10 pages
Efficient Epileptic Seizure Prediction Based On Deep Learning
Joan Sebastian Betancourt Arias
No ratings yet
Module 1: Introduction To Numerical Analysis Questions
Document2 pages
Module 1: Introduction To Numerical Analysis Questions
NinoMay Suazo Roble
No ratings yet
Design and Analysis of Algorithm
Document47 pages
Design and Analysis of Algorithm
Abhijit Bodhe
100% (1)
ANDAR Assignment 3 Open Methods MT 311 2021-2022 - 025258
Document15 pages
ANDAR Assignment 3 Open Methods MT 311 2021-2022 - 025258
Bai Fauziyah Marohomsalic
No ratings yet
Macro Lesson Plan
Document7 pages
Macro Lesson Plan
18JUMS04 BHARATHA SUBATHRA V
No ratings yet
1polynomialsand Rational Expressions
Document52 pages
1polynomialsand Rational Expressions
Niño Bhoy Flores
No ratings yet
Machine Learning DSE Course Handout
Document7 pages
Machine Learning DSE Course Handout
bhavana2264
No ratings yet
4985.practical Optimization. Algorithms and Engineering Applications by Andreas Antoniou
Document2 pages
4985.practical Optimization. Algorithms and Engineering Applications by Andreas Antoniou
Ruby Chan
No ratings yet
MIT2 29S15 Lecture10
Document26 pages
MIT2 29S15 Lecture10
Ihab Omar
No ratings yet