Professional Documents
Culture Documents
Statistica Pentru Evaluarea Performantelor
Statistica Pentru Evaluarea Performantelor
2
is the variance
is the population mean
n is the number of data items
x
i
is the i
th
data item
When calculating the variance for a sampling of data, a better result is obtained by
dividing the sum of squares by n 1 rather than by n.The proof of this is beyond the
scope of this book; however, it suffices to say that when the division is by n 1, the
sample variance is an unbiased estimator of the population variance. An unbiased esti-
mator has the characteristic that the average value of the estimator taken over all pos-
sible samples is equal to the parameter being estimated.
D.2 The Normal Distribution
The equation for the normal, or bell-shaped, curve is shown in Equation D.3.
where
f(x) is the height of the curve corresponding to values of x
e is the base of natural logarithms approximated by 2.718282
f x
x
( ) / (
( ) ( ) /
) =
1
2
2
2 (D.3 )e
2
D.2
2
1
2
1
=
=
n
x
i
i
n
( ) ( )
(D.1) =
=
1
1
n
x
i
i
n
CD-72 Appendix D
Two models, M
1
and M
2,
built with the same training data
Error rate E
1
and variance v
1
for model M
1
on test set A
Error rate E
2
and variance v
2
for model M
2
on test set B
2. Compute
3. Conclude
For model M
1
:
E
1
= 0.20
v
1
= .2 (1 .2) = .16
P
E E
v n v n
=
+
1 2
1 1 2 2
( / / )
( ) D.4
D.3
For model M
2:
E
2
= 0.30
v
2
= .3 (1 .3) = .21
[( ) ( )] (D.5)
where
is the joint variance
is the classifier error on the instance for learner model M
is the classifier error on the instance for learner model M
is the overall classifier error rate for model M minus the classifier error rate
for model M
is the total number of test set instances
1i
th
1
2i
th
2
1
2
P
0.20 0.30
=
(0.16/100 + 0.21/100)
CD-74 Appendix D
+ + +
e
i
i
variance( )
1
( )
1
D.7)
where
is the absolute error for the th instance
is the number of instances
1
2
mae
n
e mae
i
n
i
n
i
i
e
=
=
(
P
E E
V n
=
1 2
12
/
(D.6)
D.4
1
1
1
2
P
mae mae
v n
=
1 2
12
/
(D.9)
P
v n v n
mae mae
=
+
1 2
1 1 2 2
( / / )
(D.8)
CD-76 Appendix D
1 2
2 ( / )
(D.11)
D.5