Professional Documents
Culture Documents
Subject: Statistics
Subject: Statistics
2 / 21
Discriminant Analysis
3 / 21
Classification
4 / 21
The distinction
5 / 21
Two groups
6 / 21
Misclassification
However, the split in Ω may be such that there are individuals who
actually come from G2 but are in R1 and hence classified in G1 ,
and vice versa. These are known as misclassifications.
7 / 21
Conditional probability of misclassification
8 / 21
Unconditional probability of misclassification
9 / 21
Classification Rule
10 / 21
Cost of misclassification
11 / 21
The Rule
Result 1
The subsets R1 and R2 that minimizes the ECM are as follows :
f1 (x) p2 C(1|2)
R1 : ≥ (6)
f2 (x) p1 C(2|1)
f1 (x) p2 C(1|2)
and R2 : < (7)
f2 (x) p1 C(2|1)
12 / 21
Proof
so that
Z Z
ECM = p1 C(2|1) f1 (x)dx + p2 C(1|2) f2 (x)dx
R2 R1
Z Z
= p1 C(2|1)[1 − f1 (x)dx] + p2 C(1|2) f2 (x)dx
R1 R1
Z
= p1 C(2|1) + [p2 C(1|2)f2 (x) − p1 C(2|1)f1 (x)]dx (8)
R1
13 / 21
Proof (contd.)
14 / 21
Corollaries
Corollary 1
If the misclassification costs are equal i.e. C(1|2) = C(2|1),
f1 (x) p2 f1 (x) p2
R1 : ≥ and R2 : < (9)
f2 (x) p1 f2 (x) p1
15 / 21
Corollaries (contd.)
Corollary 2
If the prior probabilities are equal i.e. p1 = p2 ,
Corollary 3
If both the misclassification costs and the prior probabilities are
equal i.e. C(1|2) = C(2|1) and p1 = p2 ,
f1 (x) f1 (x)
R1 : ≥1 and R2 : <1 (11)
f2 (x) f2 (x)
16 / 21
The TPM
Classification Rule
f1 (x) p2 f1 (x) p2
R1 : ≥ and R2 : < (13)
f2 (x) p1 f2 (x) p1
By Bayes’ rule
p1 f1 (x)
P (G1 |x) =
p1 f1 (x) + p2 f2 (x)
p2 f2 (x)
and P (G2 |x) =
p1 f1 (x) + p2 f2 (x)
18 / 21
Alternative Rule
19 / 21
Summary
20 / 21
Thank You
21 / 21