Professional Documents
Culture Documents
Course Enrollment On Blackboard: Announced On E-Com
Course Enrollment On Blackboard: Announced On E-Com
Announced on e-com
Extract Split data into Select relevant Build predictive Evaluate model on
Sensor Preprocessing Transform features
Features train/test/validate features model/ensemble validation set Built model
rejected
15
Some Key considerations
• Performance
• Error rate (Prob. of misclassification) on independent test samples
• Speed
• Cost
• Robustness
• Reject option
• Return on investment
Let’s consider one specific
example
Fish Classification: Salmon vs Sea Bass
Separates
decision regions
From Duda & Hart
Complex Decision Boundary: Issue of
Generalization
? Is wrongly
classified with
complex decision
boundary
Validation set
• the best hypothesis on the sample may not be the best overall.
• generalization is not memorization.
• complex rules (very complex separation surfaces) can be poor
predictors.
Handling Overfitting