Professional Documents
Culture Documents
ML Models Concepts - I
ML Models Concepts - I
ML Models Concepts - I
What is Learning?
2. K-fold
a. Dataset is divided into
K subsets without
repetition.
b. Each time uses one of
the subsets as the test
set, and the remaining
K-1 subsets as the
training set repeats K
times to learn K-1
classification models
Cross-validation methods…
3. Independent test
a. The independent dataset test is one of the essential tests,
employed to assess the generalization performance of model in
the field of ML.
b. The model is trained to deploy a benchmark dataset, while an
independent dataset is utilized for its testing.
c. The data, consumed in the training phase are not used for the
testing phase, which means that the data in the benchmark
dataset and independent dataset are dissimilar.
d. The main reasons to exploit an independent dataset test are, to
evaluate whether a trained model over-fits over the training
dataset, and to adequately assess the generalization capabilities
of a trained model.
Cross-validation methods…
Cross-validation implementation
Overfitting in ML Models….
Overfitting refers to a model
behavior that models the training
data too well.
Overfitting happens when a model
learns the details and noise in the
training dataset to the extent that
it negatively impacts the
performance of the model.
Chapter Chapter 01
Pattern Recognition and
Machine Learning
by
Machine Learning
by
Tom Mitchell
Christopher M. Bishop