21BPS1595 IoT Lab-7

Tanya

21BPS1595

K Means Clustering in KNIME: Unsupervised Machine Learning

Aim:
To apply the K Means clustering algorithm in KNIME for unsupervised machine learning.

Theory:
1. K Means clustering is an unsupervised machine learning technique that partitions data into K
non-overlapping clusters, where K is chosen by the user before the algorithm runs.
2. Starting from K initial centroids, the algorithm alternates two steps until the assignments stop
changing: it assigns each data point to the nearest centroid, then recalculates each centroid as the
mean of the points assigned to it.
3. It aims to minimize the within-cluster variance, measured as the sum of squared distances from each
point to its cluster centroid. For a fixed dataset, lowering this value makes the points within a cluster
more similar and also increases the separation between clusters.
4. K Means is sensitive to the initial placement of centroids and may converge to a local minimum, so it
is common to run it several times with different initializations or to use a more careful initialization
scheme such as k-means++.
5. Despite its simplicity, K Means is widely used in various fields such as customer segmentation,
image compression, and anomaly detection due to its efficiency and interpretability.
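
The theory points above can be illustrated with a minimal sketch in plain Python. It is not what KNIME runs internally; it only shows the assignment and update steps, the within-cluster sum of squares (WCSS), and the idea of restarting from several random initializations. The function names, the sample points, and the n_init value are all assumptions made for this illustration.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def k_means(points, k, max_iter=100, seed=0):
    """Run k-means once from one random start; return (centroids, labels, wcss)."""
    rng = random.Random(seed)
    # Initialise centroids by picking k distinct data points at random.
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so this run has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    # Within-cluster sum of squares: the quantity k-means tries to minimise.
    wcss = sum(squared_distance(p, centroids[lab]) for p, lab in zip(points, labels))
    return centroids, labels, wcss


def best_of_restarts(points, k, n_init=10):
    """Repeat k-means with different seeds and keep the run with the lowest WCSS."""
    runs = [k_means(points, k, seed=s) for s in range(n_init)]
    return min(runs, key=lambda run: run[2])


# Hypothetical toy data: two small, well-separated groups of 2-D points.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (8.0, 8.0), (8.2, 7.9), (7.9, 8.3)]
centroids, labels, wcss = best_of_restarts(points, k=2)
print(labels, round(wcss, 4))
```

Keeping the run with the lowest WCSS is one simple way to reduce the effect of a poor starting position; it does not guarantee the global minimum.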

Procedure:
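1. Load the dataset into KNIME (for example with a CSV Reader node).
2. Pre-process the data: handle missing values and, because K Means relies on distances, normalize
numeric columns that are on different scales (for example with the Missing Value and Normalizer nodes).
3. Configure the k-Means node in KNIME and specify the number of clusters (K).
4. Execute the k-Means node to perform clustering.
5. Visualize the clusters using appropriate visualization nodes in KNIME (for example Color Manager
followed by Scatter Plot).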
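
The pre-processing in step 2 can be sketched in plain Python as follows. This is only an illustration of two common choices, filling missing values with the column mean and min-max scaling to [0, 1]; KNIME offers several other strategies, and the report does not state which were used. The function names and the sample rows (an age column and an income column) are hypothetical, and the sketch assumes every column has at least one observed value.

```python
def impute_column_means(rows):
    """Replace None entries with the mean of the observed values in that column."""
    columns = list(zip(*rows))
    means = []
    for col in columns:
        observed = [v for v in col if v is not None]
        means.append(sum(observed) / len(observed))
    return [
        [means[j] if v is None else v for j, v in enumerate(row)]
        for row in rows
    ]


def min_max_normalize(rows):
    """Rescale every column to the range [0, 1]."""
    columns = list(zip(*rows))
    lows = [min(col) for col in columns]
    highs = [max(col) for col in columns]
    return [
        [
            (v - lo) / (hi - lo) if hi > lo else 0.0  # a constant column maps to 0
            for v, lo, hi in zip(row, lows, highs)
        ]
        for row in rows
    ]


# Hypothetical rows: [age, income], with one missing income value.
raw = [
    [25, 40000.0],
    [35, None],
    [45, 80000.0],
]
clean = min_max_normalize(impute_column_means(raw))
print(clean)
```

Without the scaling step, the income column would dominate the Euclidean distance simply because its values are thousands of times larger than the ages.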

Flow: [figure not reproduced]
Properties: [figure not reproduced]
Output: [figure not reproduced]

Inference:
K Means clustering in KNIME partitions the data into distinct clusters based on the similarity between data points.

Result:
The K Means algorithm successfully clustered the dataset into K distinct groups, revealing underlying patterns in
the data.
