Professional Documents
Culture Documents
Binary Classification Machine Learning Models
Binary Classification Machine Learning Models
Binary Classification Machine Learning Models
B. The classification task’s logical context actually determines which possible outcome(s) are
relevant.
Key concepts to measuring classification performance are: (Géron, 2017) (Starmer, 2020)
Precision – the model’s ability to predict (positive) only the actual (positive) cases – refers to “how
precise are the model’s true positive (i.e., correct positive) predictions relative to all positive
predictions it made?” False-positives (FP) penalize performance.
1
For discussion purposes, ‘classification’ and ‘prediction’ are synonyms.
Figure 2 shows each measure’s mathematical formulation, while Figure 3 visualizes performance
concepts’ relations to outcome possibilities.
Precision/Recall Trade-Off
Precision and recall (aka sensitivity) can contend: i.e., improving one can degrade the other.
Significantly, figure 2 shows that precision and recall formulas:
A Unique Example
Consider Mars robotic mineral sampling at candidate collection locales:
▪ TPs are important because quality samples analysis is a prime exploration objective
▪ Avoiding FPs (higher precision) is important – collected, but shouldn’t have – depletes the
battery, reducing per-excursion quality yield; likewise, drill bit wear reduces sampling lifetime.
▪ Avoiding FNs (higher recall) is important – didn’t collect, but should have – directly risks failing
the exploration objective.
Finally, the model developer identifies a desirable trade-off point on the ROC curve, then selects the
point’s corresponding threshold cutoff value to include in the BC decision function.
Glen, S. (2019). ROC Curve Explained in One Picture. Retrieved from Data Science Central:
https://www.datasciencecentral.com/roc-curve-explained-in-one-picture/
Irizarry, R. (2019). Introduction to Data Science: Data Analysis and Prediction Algorithms with R.
Chapman & Hall.
Siegler, R. (2017, November 28). Fractions: Where It All Goes Wrong. Scientific American.
Starmer, J. (Director). (2020). Machine Learning Fundamentals: Sensitivity and Specificity [Motion
Picture].