KNIME & WEKA Software Presentation

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 16

KNIME Software

Prepared by:
Amr Mohammad Mounir Elboraay Essa
Ahmad Yasser Ibrahim El Enany
Ahmad Mohammad Jamal El-Din
Mohammad Atef Mahmoud

Data Mining
VS

Contents
Introduction
Features
Functions & tools
Flat file appearance

Introduction
KNIME:
Stands for (KoNstanz
Information MinEr).
It is a user friendly, coherent
open source data analytics,
reporting and integration
platform.
KNIME is developed by Prof.
Michael Berthold's group at the
University of Konstanz in

It uses CSV format ( Comma


Separated Values).
It is written in java
programming language.
It is open source software.
It started January 2004.

Features

Modular Data Pipeline Environment


Large collection of Data Mining
techniques
Data and Model Visualizations
Interactive Views on Data and Models
Java Code Base as Open Source Project
Integration with: R Library, Weka, etc.
Based on the Eclipse Plug-in technology

Functions & tools


Classification.
Association rules.
Clustering .
Neural networks.
Nave Bayes.
K nearest neighbour.
Decision trees.

Flat file in .CSV format(HeartDisease):


Age, gender, chest_pain_type,
cholesterol,
exercise_induced_angina,class
63,male,typ_angina,233,no,not_p
resent
67,male,asympt,286,yes,present
67,male,asympt,229,yes,present
38,female,non_anginal,?,no,not_p
resent

WEKA
It stands for (Waikato Environment
for Knowledge Analysis).
It is a set of software for machine
learning and data mining.
It was developed at the University
of Waikato, New Zealand.
It uses ARFF format ( Attribute
Relation File Format).

It is written in java
programming language.
It is open source software.
It started 1993.

features

49 data preprocessing tools


76 classification/regression
algorithms
8 clustering algorithms
10 feature selection algorithms
3 algorithms for finding
association rules

3 graphical user interfaces


The Explorer (exploratory
data analysis)
The Experimenter
(experimental environment)
The Knowledge Flow (new
process model inspired
interface).

Functions and tools


Classification.
Association rules.
Clustering .
Neural networks.
Nave Bayes.
K nearest neighbour.
Decision trees.

Flat file in .ARFF


format(Heart-Disease):
@relation heart-disease
@attribute age numeric
@attribute gender { female,
male}
@attribute
chest_pain_type{ typ_angina,
asympt, non_anginal,
atyp_angina}
@attribute cholesterol numeric

@data
63,male,typ_angina,233,no,not
_present
67,male,asympt,286,yes,prese
nt
67,male,asympt,229,yes,prese
nt
38,female,non_anginal,?,no,no
t_present

Thank
You For
Listenin
g

You might also like