Professional Documents
Culture Documents
Auto Loan Model Development and Validation
Auto Loan Model Development and Validation
- Vaibhav Pandey
Recap of Past Exploratory Deriving New
WoE Binning
Presentation Data Analysis Features
Training and Test Variable Selection Multicollinearity
Data Split using IV Check
Present
Objective
Logistic Regression Final Model
Model Validation
Analysis Development
• Whole Dataset contains 5837 records of past default behaviour of
Auto Loan borrowers.
Variable
Inflation Factor
Generally, value of VIF less than 5 is considered acceptable.
From the four sets of highly correlated variables we keep the ones with highest IV
and drop rest.
We recompute the VIFs of remaining variables and now find the values to be acceptable.
Variable
Inflation Factor
We run the Logistic Regression on the remaining variables and analyse the results.
Our Model, gives low PD values to Non Defaulters, while relatively high PD values to Defaulters.
Cumulatively, bad borrowers are given high PDs and bunched on the right.
We have considered KS Cutoff Value of 0.22 as PD cutoff. PD above 0.22 considered as default.
Model
Performance Metric Value
Confusion
Predicted
Non
Predicted
0.76
Metrics ROC_AUC Matrix
Defaults
Defaults