
Advice for applying machine learning
Deciding what to try next
Machine Learning
Debugging a learning algorithm:
Suppose you have implemented regularized linear regression to predict housing prices.

However, when you test your hypothesis on a new set of houses, you find that it makes unacceptably large errors in its predictions. What should you try next?
- Get more training examples
- Try smaller sets of features
- Try getting additional features
- Try adding polynomial features
- Try decreasing the regularization parameter $\lambda$
- Try increasing the regularization parameter $\lambda$
Machine learning diagnostic:
Diagnostic: a test that you can run to gain insight into what is/isn't working with a learning algorithm, and gain guidance as to how best to improve its performance.

Diagnostics can take time to implement, but doing so can be a very good use of your time.
Advice for applying machine learning
Evaluating a hypothesis
Machine Learning
Evaluating your hypothesis
[Figure: hypothesis fit to price vs. size of house; it fits the training set but fails to generalize to new examples not in the training set.]
Features:
- size of house
- no. of bedrooms
- no. of floors
- age of house
- average income in neighborhood
- kitchen size
Evaluating your hypothesis
Dataset:

Size    Price
2104    400
1600    330
2400    369
1416    232
3000    540
1985    300
1534    315
1427    199
1380    212
1494    243

A typical split: use ~70% of the data as the training set and the remaining ~30% as the test set.
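As a minimal sketch (not part of the original slides), the split could be done in Python/NumPy; the 70/30 proportions follow the note above, while the fixed random seed is an illustrative assumption:

```python
import numpy as np

# Dataset from the slide: house sizes and prices.
sizes  = np.array([2104, 1600, 2400, 1416, 3000, 1985, 1534, 1427, 1380, 1494], dtype=float)
prices = np.array([ 400,  330,  369,  232,  540,  300,  315,  199,  212,  243], dtype=float)

rng = np.random.default_rng(0)        # fixed seed so the split is reproducible
perm = rng.permutation(len(sizes))    # shuffle before splitting

m_train = int(0.7 * len(sizes))       # ~70% training, rest test
train_idx, test_idx = perm[:m_train], perm[m_train:]

x_train, y_train = sizes[train_idx], prices[train_idx]
x_test,  y_test  = sizes[test_idx],  prices[test_idx]
```

Shuffling before splitting matters if the data happen to be ordered (e.g. by size or by date).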
Training/testing procedure for linear regression
- Learn parameter $\theta$ from training data (minimizing training error $J_{train}(\theta)$)
- Compute test set error:
  $J_{test}(\theta) = \frac{1}{2m_{test}} \sum_{i=1}^{m_{test}} \left( h_\theta(x_{test}^{(i)}) - y_{test}^{(i)} \right)^2$
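A hedged sketch of that test error in Python/NumPy (assuming $h_\theta(x) = \theta_0 + \theta_1 x$, represented by prepending a bias column of ones; `theta` is whatever parameter vector was fit on the training set):

```python
import numpy as np

def add_bias(x):
    """Stack a column of ones so that h_theta(x) = X @ theta includes the intercept theta_0."""
    return np.column_stack([np.ones_like(x), x])

def j_test(theta, x_test, y_test):
    """Test set error: (1 / (2 * m_test)) * sum over test examples of (h_theta(x) - y)^2."""
    m_test = len(y_test)
    residuals = add_bias(x_test) @ theta - y_test
    return residuals @ residuals / (2 * m_test)
```

For example, `j_test(theta, x_test, y_test)` with the arrays from the split sketch above.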
Training/testing procedure for logistic regression
- Learn parameter $\theta$ from training data
- Compute test set error:
  $J_{test}(\theta) = -\frac{1}{m_{test}} \sum_{i=1}^{m_{test}} \left[ y_{test}^{(i)} \log h_\theta(x_{test}^{(i)}) + \left(1 - y_{test}^{(i)}\right) \log\left(1 - h_\theta(x_{test}^{(i)})\right) \right]$
- Misclassification error (0/1 misclassification error):
  $err(h_\theta(x), y) = \begin{cases} 1 & \text{if } h_\theta(x) \ge 0.5,\ y = 0 \ \text{ or } \ h_\theta(x) < 0.5,\ y = 1 \\ 0 & \text{otherwise} \end{cases}$
  $\text{Test error} = \frac{1}{m_{test}} \sum_{i=1}^{m_{test}} err\!\left(h_\theta(x_{test}^{(i)}), y_{test}^{(i)}\right)$
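A corresponding sketch in Python/NumPy (here `X_test` is assumed to already include the bias column and `y_test` to contain 0/1 labels; the small epsilon guarding log(0) is my addition):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logistic_test_error(theta, X_test, y_test):
    """Cross-entropy test error: -(1/m_test) * sum[y*log(h) + (1-y)*log(1-h)]."""
    h = sigmoid(X_test @ theta)
    eps = 1e-12                                   # avoid log(0)
    return -np.mean(y_test * np.log(h + eps) + (1 - y_test) * np.log(1 - h + eps))

def misclassification_error(theta, X_test, y_test):
    """0/1 error: fraction of test examples where thresholding h at 0.5 disagrees with y."""
    predictions = (sigmoid(X_test @ theta) >= 0.5).astype(float)
    return float(np.mean(predictions != y_test))
```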
Advice for applying machine learning
Model selection and training/validation/test sets
Machine Learning
Overfitting example
[Figure: a high-order polynomial hypothesis fit to price vs. size.]
Once parameters $\theta_0, \theta_1, \ldots$ were fit to some set of data (the training set), the error of the parameters as measured on that same data (the training error $J_{train}(\theta)$) is likely to be lower than the actual generalization error.
Model selection
1. $h_\theta(x) = \theta_0 + \theta_1 x$ (degree $d = 1$)
2. $h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2$ ($d = 2$)
3. $h_\theta(x) = \theta_0 + \theta_1 x + \cdots + \theta_3 x^3$ ($d = 3$)
   ...
10. $h_\theta(x) = \theta_0 + \theta_1 x + \cdots + \theta_{10} x^{10}$ ($d = 10$)
Fit each model to get parameters $\theta^{(1)}, \ldots, \theta^{(10)}$, and choose the degree whose fitted hypothesis has the lowest test set error.
How well does the chosen model generalize? Report its test set error $J_{test}(\theta^{(d)})$.
Problem: $J_{test}(\theta^{(d)})$ is likely to be an optimistic estimate of the generalization error, because our extra parameter ($d$ = degree of polynomial) was itself fit to the test set.
Evaluating your hypothesis
Dataset:

Size    Price
2104    400
1600    330
2400    369
1416    232
3000    540
1985    300
1534    315
1427    199
1380    212
1494    243

To do model selection properly, split the data three ways: ~60% training set, ~20% cross validation (CV) set, ~20% test set.
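A sketch of that three-way split (60/20/20 as noted above; the seed is again only illustrative):

```python
import numpy as np

sizes  = np.array([2104, 1600, 2400, 1416, 3000, 1985, 1534, 1427, 1380, 1494], dtype=float)
prices = np.array([ 400,  330,  369,  232,  540,  300,  315,  199,  212,  243], dtype=float)

rng = np.random.default_rng(0)
perm = rng.permutation(len(sizes))

m = len(sizes)
m_train, m_cv = int(0.6 * m), int(0.2 * m)        # ~60% / ~20% / ~20%

train_idx = perm[:m_train]
cv_idx    = perm[m_train:m_train + m_cv]
test_idx  = perm[m_train + m_cv:]

x_train, y_train = sizes[train_idx], prices[train_idx]
x_cv,    y_cv    = sizes[cv_idx],    prices[cv_idx]
x_test,  y_test  = sizes[test_idx],  prices[test_idx]
```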
Train/validation/test error
Training error:
  $J_{train}(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$
Cross validation error:
  $J_{cv}(\theta) = \frac{1}{2m_{cv}} \sum_{i=1}^{m_{cv}} \left( h_\theta(x_{cv}^{(i)}) - y_{cv}^{(i)} \right)^2$
Test error:
  $J_{test}(\theta) = \frac{1}{2m_{test}} \sum_{i=1}^{m_{test}} \left( h_\theta(x_{test}^{(i)}) - y_{test}^{(i)} \right)^2$
Model selection
1. $h_\theta(x) = \theta_0 + \theta_1 x$ → fit on the training set to get $\theta^{(1)}$, compute $J_{cv}(\theta^{(1)})$
2. $h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2$ → $\theta^{(2)}$, $J_{cv}(\theta^{(2)})$
3. $d = 3$ → $\theta^{(3)}$, $J_{cv}(\theta^{(3)})$
   ...
10. $d = 10$ → $\theta^{(10)}$, $J_{cv}(\theta^{(10)})$

Pick the degree $d$ with the lowest cross validation error $J_{cv}(\theta^{(d)})$, then estimate the generalization error using the test set: $J_{test}(\theta^{(d)})$. (A sketch of this loop follows below.)
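A hedged sketch of that selection loop in Python/NumPy. The polynomial feature map and least-squares fit are stand-ins for however the hypothesis is actually trained (the lecture uses gradient descent); in practice you would also normalize features before raising them to high powers:

```python
import numpy as np

def poly_features(x, d):
    """Map a scalar feature x to [1, x, x^2, ..., x^d]."""
    return np.column_stack([x**p for p in range(d + 1)])

def fit_least_squares(X, y):
    """Fit theta by ordinary least squares (stand-in for gradient descent)."""
    return np.linalg.lstsq(X, y, rcond=None)[0]

def squared_error(theta, X, y):
    r = X @ theta - y
    return r @ r / (2 * len(y))

def select_degree(x_train, y_train, x_cv, y_cv, max_degree=10):
    """Fit d = 1..max_degree on the training set; pick d with the lowest CV error."""
    cv_errors = []
    for d in range(1, max_degree + 1):
        theta = fit_least_squares(poly_features(x_train, d), y_train)
        cv_errors.append(squared_error(theta, poly_features(x_cv, d), y_cv))
    best_d = int(np.argmin(cv_errors)) + 1
    return best_d, cv_errors
```

After picking `best_d`, report `squared_error(theta_best, poly_features(x_test, best_d), y_test)` as the generalization estimate, where `theta_best` is the parameter vector fit for that degree.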
Advice for applying machine learning
Diagnosing bias vs. variance
Machine Learning
Bias/variance
[Figure: three hypotheses fit to price vs. size of house.]
  High bias (underfit)  |  "Just right"  |  High variance (overfit)
Bias/variance
Training error: $J_{train}(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$
Cross validation error: $J_{cv}(\theta) = \frac{1}{2m_{cv}} \sum_{i=1}^{m_{cv}} \left( h_\theta(x_{cv}^{(i)}) - y_{cv}^{(i)} \right)^2$
[Figure: error vs. degree of polynomial $d$ — the training error keeps decreasing as $d$ grows, while the cross validation error is high for small $d$, reaches a minimum at an intermediate $d$, and rises again for large $d$.]
Diagnosing bias vs. variance
Suppose your learning algorithm is performing less well than you were hoping ($J_{cv}(\theta)$ or $J_{test}(\theta)$ is high). Is it a bias problem or a variance problem?
- Bias (underfit): $J_{train}(\theta)$ is high, and $J_{cv}(\theta) \approx J_{train}(\theta)$.
- Variance (overfit): $J_{train}(\theta)$ is low, but $J_{cv}(\theta) \gg J_{train}(\theta)$.
[Figure: error vs. degree of polynomial $d$, with the high-bias regime on the left (both errors high and close together) and the high-variance regime on the right (training error low, cross validation error much higher). A rough decision helper is sketched below.]
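As a rough, hedged helper (the thresholds here are illustrative assumptions, not from the slides), the criteria above could be encoded like this:

```python
def diagnose(j_train, j_cv, target_error, gap_factor=2.0):
    """Crude bias/variance diagnosis.

    high bias (underfit): J_train itself is already well above the error you are targeting.
    high variance (overfit): J_train is acceptably low, but J_cv is much larger than J_train.
    """
    if j_train > target_error:
        return "high bias (underfit)"
    if j_cv > gap_factor * j_train:
        return "high variance (overfit)"
    return "errors look acceptable"

# Example: training error far above the target => bias problem.
print(diagnose(j_train=0.50, j_cv=0.55, target_error=0.20))   # high bias (underfit)
print(diagnose(j_train=0.05, j_cv=0.40, target_error=0.20))   # high variance (overfit)
```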
Advice for applying machine learning
Regularization and bias/variance
Machine Learning
Linear regression with regularization
Model: $h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2 + \theta_3 x^3 + \theta_4 x^4$
       $J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2 + \frac{\lambda}{2m} \sum_{j=1}^{n} \theta_j^2$
[Figure: three fits of price vs. size.]
  Large $\lambda$: high bias (underfit)  |  Intermediate $\lambda$: "just right"  |  Small $\lambda$: high variance (overfit)
Choosing the regularization parameter λ
$J_{train}(\theta)$, $J_{cv}(\theta)$, and $J_{test}(\theta)$ are each defined as the plain (unregularized) average squared error on the training, cross validation, and test sets, respectively, even though the regularized cost $J(\theta)$ is what gets minimized during training.
Choosing the regularization parameter λ
Model: $h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2 + \theta_3 x^3 + \theta_4 x^4$
1. Try $\lambda = 0$
2. Try $\lambda = 0.01$
3. Try $\lambda = 0.02$
4. Try $\lambda = 0.04$
5. Try $\lambda = 0.08$
   ...
12. Try $\lambda = 10$
For each candidate $\lambda$, minimize the regularized cost to get $\theta^{(i)}$ and compute the cross validation error $J_{cv}(\theta^{(i)})$.
Pick (say) $\theta^{(5)}$, the one with the lowest cross validation error. Report the test error $J_{test}(\theta^{(5)})$. (A sketch of this search follows below.)
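A hedged sketch of that search in Python/NumPy. The closed-form ridge-style solve is a stand-in for minimizing the regularized cost (the intercept $\theta_0$ is left unpenalized); the roughly doubling grid of λ values ending near 10 is an assumption matching the list above:

```python
import numpy as np

def fit_regularized(X, y, lam):
    """Solve (X'X + lam*L) theta = X'y, where L is the identity with its [0,0] entry
    zeroed so that the intercept theta_0 is not regularized."""
    n = X.shape[1]
    L = np.eye(n)
    L[0, 0] = 0.0
    return np.linalg.solve(X.T @ X + lam * L, X.T @ y)

def choose_lambda(X_train, y_train, X_cv, y_cv,
                  lambdas=(0, 0.01, 0.02, 0.04, 0.08, 0.16, 0.32, 0.64, 1.28, 2.56, 5.12, 10.24)):
    """Fit theta for each candidate lambda; return the lambda whose *unregularized*
    cross validation error J_cv is lowest, along with all the CV errors."""
    def j(theta, X, y):
        r = X @ theta - y
        return r @ r / (2 * len(y))
    cv_errors = [j(fit_regularized(X_train, y_train, lam), X_cv, y_cv) for lam in lambdas]
    best = int(np.argmin(cv_errors))
    return lambdas[best], cv_errors
```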
Bias/variance as a function of the regularization parameter λ
[Figure: $J_{train}(\theta)$ and $J_{cv}(\theta)$ plotted against $\lambda$ — for small $\lambda$ the training error is low but the cross validation error is high (overfitting / high variance); for large $\lambda$ both errors are high (underfitting / high bias); the cross validation error is lowest at some intermediate $\lambda$.]
Advice for applying machine learning
Learning curves
Machine Learning
Learning curves
[Figure: $J_{train}(\theta)$ and $J_{cv}(\theta)$ plotted against $m$ (training set size) — as $m$ grows, the training error rises (it is harder to fit every example perfectly) while the cross validation error falls, and the two curves approach each other.]
High bias
[Figure: learning curves for a high-bias model — both $J_{train}(\theta)$ and $J_{cv}(\theta)$ quickly flatten out at a high error as $m$ (training set size) grows, with only a small gap between them; the fitted price-vs-size hypothesis is close to a straight line and underfits.]
If a learning algorithm is suffering from high bias, getting more training data will not (by itself) help much.
High variance (and small λ)
[Figure: learning curves for a high-variance model — $J_{train}(\theta)$ stays low while $J_{cv}(\theta)$ remains much higher as $m$ (training set size) grows, leaving a large gap that slowly narrows as more data is added; the fitted price-vs-size hypothesis is a high-order curve that overfits.]
If a learning algorithm is suffering from high variance, getting more training data is likely to help.
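A hedged sketch of how these curves can be computed (the `fit` and `cost` arguments are placeholders for whatever training routine and error function you are using, e.g. the least-squares helpers sketched earlier; averaging over several random subsets of each size gives smoother curves):

```python
def learning_curves(X_train, y_train, X_cv, y_cv, fit, cost):
    """For m = 1..m_train, fit on the first m training examples and record
    the training error on those m examples and the error on the full CV set."""
    train_errors, cv_errors = [], []
    for m in range(1, len(y_train) + 1):
        theta = fit(X_train[:m], y_train[:m])
        train_errors.append(cost(theta, X_train[:m], y_train[:m]))
        cv_errors.append(cost(theta, X_cv, y_cv))
    return train_errors, cv_errors
```

Plotting `train_errors` and `cv_errors` against $m$ and looking at the gap between them is what distinguishes the high-bias picture from the high-variance one above.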
Advice for applying machine learning
Deciding what to try next (revisited)
Machine Learning
Debugging a learning algorithm:
Suppose you have implemented regularized linear regression to predict housing prices. However, when you test your hypothesis on a new set of houses, you find that it makes unacceptably large errors in its predictions. What should you try next?

- Get more training examples → fixes high variance
- Try smaller sets of features → fixes high variance
- Try getting additional features → fixes high bias
- Try adding polynomial features → fixes high bias
- Try decreasing $\lambda$ → fixes high bias
- Try increasing $\lambda$ → fixes high variance
Neural networks and overfitting
"Small" neural network (fewer parameters; more prone to underfitting): computationally cheaper.
"Large" neural network (more parameters; more prone to overfitting): computationally more expensive. Use regularization ($\lambda$) to address overfitting.
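A hedged illustration of that trade-off using scikit-learn's MLPClassifier, which is not part of these slides; its `alpha` parameter plays the role of the L2 regularization strength $\lambda$, and the layer sizes and alpha values below are arbitrary examples:

```python
from sklearn.neural_network import MLPClassifier

# "Small" network: one small hidden layer -- computationally cheaper,
# but more prone to underfitting (high bias).
small_net = MLPClassifier(hidden_layer_sizes=(5,), alpha=1e-4, max_iter=2000)

# "Large" network: more units and layers -- more prone to overfitting (high variance),
# so lean on stronger L2 regularization (alpha, analogous to lambda),
# ideally chosen by comparing J_cv across candidate values.
large_net = MLPClassifier(hidden_layer_sizes=(100, 100), alpha=1.0, max_iter=2000)
```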
