Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

Pillai College Of Engineering

Name: Akanksha Gaonkar Class: TEIT-A


Roll No: ITA611 DOP:
DOS: Sign:
Marks:
_________________________________________________________________

Experiment 2
Title: Data Pre-Processing

Aim: To solve exercises on pre-processing and implement data pre-processing on the dataset
WEATHER.arff ( specify the individual dataset chosen) using WEKA.

Theory:

Data pre-processing is a data mining technique which is used to transform the raw data in a
useful and efficient format.
Data pre-processing, a component of data preparation, describes any type of processing
performed on raw data to prepare it for another data processing procedure. It has traditionally
been an important preliminary step for the data mining process. More recently, data pre
processing techniques have been adapted for training machine learning models and AI
models and for running inferences against them.
Data pre-processing transforms the data into a format that is more easily and effectively
processed in data mining, machine learning and other data science tasks. The techniques are
generally used at the earliest stages of the machine learning and AI development pipeline to
ensure accurate results.
Data Pre-Processing using WEKA:

Perform the following Pre -processing on the chosen dataset


1. Find the missing values and remove
2. Replace the missing values of nominal attribute to NULL
3. Rename a Numeric Attribute using filter
4. Rename the nominal values of the nominal attribute
5. Discretize numeric attribute to nominal using filter
6. Perform feature selection by using filter
Conclusion: Thus, data pre-processing has been successfully applied on the
WEATHER.arff dataset using WEKA.

You might also like