Professional Documents
Culture Documents
Chap 18 B
Chap 18 B
Splitting Criterion
ID3 Algorithm
Cross Validation
Tenfold cross-validation
Ten iterations
Pull a different tenth of the dataset out
each time to act as a test set
Train on the remaining training set
Measure performance on the test set
Overfitting
Preventing overfitting by
cross validation
Information of message =
Randomness of dataset =
pP log p P p N log p N
0 log 2 0 1log 2 1 0
2
2 4
4
log 2 log 2 0.9183
6
6 6
6
1log 2 1 0 log 2 0 0
2
4
6
Weighted average (0) (0) (0.9183) 0.4591
12
12
12
1
1 1
1
log 2 log 2 1
2
2 2
2
2
2 2
2
log 2 log 2 1
4
4 4
4
Weighted average 1
Patrons split
Type split
Randomness = 0.4591
Randomness = 1
Learning Logical
Descriptions
Hypothesis
x WillWait ( x)
Patrons ( x, Some)
Patrons ( x, Full ) Hungry ( x) Type ( x, French )
Patrons ( x, Full ) Hungry ( x) Type ( x, Burger )
Patrons ( x, Full ) Hungry ( x) Type ( x, Thai ) FriSat ( x)
Learning Logical
Descriptions
Current-best-hypothesis
search
H 1 : x WillWait ( x) Alternate( x)
H 2 : x WillWait ( x)
Alternate( x) Patrons ( x, Some)
Current-best-hypothesis
search
H 2 : x WillWait ( x) Alternate( x) Patrons ( x, Some)
H 3 : x WillWait ( x) Patrons ( x, Some)
H 4 : x WillWait ( x)
Patrons ( x, Some)
( Patrons ( x, Full ) FriSat ( x))
Neural Networks