DMiningKuliah 2A DPreparation
[Figure: the knowledge discovery process. Databases -> Data Cleaning and Data Integration -> Data Warehouse -> Selection and Transformation -> Task-relevant Data -> Data Mining -> Pattern Evaluation]
January 21, 2024 Data Mining: Data Preprocessing 1
Data Preprocessing
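[Figure: regression line y = x + 1 plotted against X (Salary); for the point X1, the values Y1 and Y1' are marked on the y-axis.]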
y = a + bx
where a is the point of intersection with the y-axis and b is the
slope of the line.
$$b = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}$$

$$a = \bar{y} - b\bar{x}$$

where $x_i$ and $y_i$ are the individual values of the descriptor variable ($x$) and the response variable ($y$), $\bar{x}$ is the mean of the descriptor variable, and $\bar{y}$ is the mean of the response variable.
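The slope and intercept formulas can be checked with a few lines of Python. This is a minimal sketch, not the lecture's own code: the function name `fit_line` and the four data points are made up for illustration, and the points are chosen to lie exactly on y = x + 1 so the result is easy to verify by hand.

```python
from statistics import mean


def fit_line(xs, ys):
    """Return (a, b) for the least-squares line y = a + b*x."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    x_bar, y_bar = mean(xs), mean(ys)
    # Denominator of b: sum of squared deviations of x from its mean.
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; the slope is undefined")
    # Numerator of b: sum of products of the x and y deviations.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = y_bar - b * x_bar
    return a, b


# Illustrative data (hypothetical), lying exactly on y = x + 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 3.0, 4.0, 5.0]
a, b = fit_line(xs, ys)
print(f"a = {a}, b = {b}")
```

With real, noisy data the points will not sit on the line, and a and b are then the values that minimize the sum of squared vertical distances to it.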
Simple Linear Regression (Example)
Formula:
Monthly Sales = 23.2064 + 0.00259 * Income
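Once fitted, the formula is used by substituting an income value. The sketch below only plugs a number into the coefficients shown on the slide; the function name `predict_monthly_sales` and the income of 10,000 are hypothetical, and the slide does not state the units of either variable.

```python
# Coefficients copied from the fitted formula on the slide.
INTERCEPT = 23.2064
SLOPE = 0.00259


def predict_monthly_sales(income):
    """Predict monthly sales from income using the fitted line."""
    return INTERCEPT + SLOPE * income


# 10_000 is a made-up income value used only to show the arithmetic.
example_income = 10_000
predicted = predict_monthly_sales(example_income)
print(round(predicted, 4))
```

Here the prediction is 23.2064 + 0.00259 * 10,000 = 23.2064 + 25.9 = 49.1064.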
min-max normalization
$$v' = \frac{v - \mathrm{min}_A}{\mathrm{max}_A - \mathrm{min}_A}\,(\mathrm{new\_max}_A - \mathrm{new\_min}_A) + \mathrm{new\_min}_A$$
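A minimal Python sketch of min-max normalization follows. The function name `min_max_normalize`, the default target range [0, 1], and the three income values are assumptions made for illustration; they do not come from the slides.

```python
def min_max_normalize(v, min_a, max_a, new_min=0.0, new_max=1.0):
    """Map v from the range [min_a, max_a] onto [new_min, new_max]."""
    if max_a == min_a:
        raise ValueError("max_a equals min_a; the attribute has no range to scale")
    return (v - min_a) / (max_a - min_a) * (new_max - new_min) + new_min


# Hypothetical values of an attribute A (for example, incomes).
incomes = [12_000, 73_600, 98_000]
lo, hi = min(incomes), max(incomes)
scaled = [min_max_normalize(v, lo, hi) for v in incomes]
print([round(s, 3) for s in scaled])
```

The smallest value maps to new_min and the largest to new_max; 73,600 maps to (73,600 - 12,000) / (98,000 - 12,000), which is about 0.716.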
z-score normalization
$$v' = \frac{v - \mathrm{mean}_A}{\mathrm{stand\_dev}_A}$$
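The z-score formula can be sketched the same way. The slides do not say whether the standard deviation is the population or the sample version; this sketch assumes the population standard deviation (`statistics.pstdev`), and the function name and data values are made up for illustration.

```python
from statistics import mean, pstdev


def z_score_normalize(values):
    """Return the z-score of each value relative to the whole list."""
    mu = mean(values)
    # Assumption: population standard deviation. Use statistics.stdev
    # instead if the sample standard deviation is wanted.
    sigma = pstdev(values)
    if sigma == 0:
        raise ValueError("standard deviation is zero; z-scores are undefined")
    return [(v - mu) / sigma for v in values]


# Hypothetical values of an attribute A with mean 5 and standard deviation 2.
values = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
z = z_score_normalize(values)
print(z)
```

After normalization the values have mean 0 and standard deviation 1, so a result such as 2.0 reads directly as "two standard deviations above the mean".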