Professional Documents
Culture Documents
Chapter 5: Mind Map: Mathematical Functions
Chapter 5: Mind Map: Mathematical Functions
Chapter 5: Mind Map: Mathematical Functions
Generalization
Sectioning to get "pure" data Chapter 5: Mind Map Not fit with other data: over-fit
Over-fitting in Tree Induction
For previously unseen data
Comparing predicted values w/hidden true values Increases when you allow more flexibility
Generalizaiton Performance
Why is it bad?
estimated performance
estimates all data
Must mis-trust data on a training set Cross-validation:
More sophisticated
Churn Data-set Model will pick up harmful correlations
Tree induction
Stop growing the tree
Avoidance
Grow until it is too large hen prune it back