Download as xlsx, pdf, or txt
Download as xlsx, pdf, or txt
You are on page 1of 8

*Cluster code dropped as its VIF was h

Data Used
Model VIF Cluster
reference Library Algorithm used Outliers Smote Scaling treated code*
LR_model1 Sklearn Logistic Regression Treated No No Yes(3 variables dropped) No

LR_model2 Sklearn Logistic Regression Treated No No Yes(3 variables dropped) No

LR_model3 Sklearn Logistic Regression Treated No No Yes(3 variables dropped) No

LR_model4 Sklearn Logistic Regression Treated No No Yes(3 variables dropped) No

LR_model5 Sklearn Logistic Regression Treated No No Yes(3 variables dropped) No

LR_model6 Sklearn Logistic Regression Treated No No Yes(3 variables dropped) No

LR_model7 Sklearn Logistic Regression Treated No No Yes(3 variables dropped) No


LR_model8 Sklearn Logistic Regression Treated Yes No Yes(3 variables dropped) No
LR_model9 Sklearn Logistic Regression Treated No Yes Yes(3 variables dropped) No

LR_model10 Statsmodel Logistic Regression Treated No No Yes(3 variables dropped) No


LDA_model1 Sklearn LDA Treated No No Yes(3) No

LDA_model1 Sklearn LDA Treated No No Yes(3) No


QDA_model1 Sklearn QDA Treated No No Yes(3) No
SVM_model1 Sklearn SVM Treated No Yes Yes(3) No

SVM_model2 Sklearn SVM Treated No Yes Yes(3) No

SVM_model3 Sklearn SVM Not treated No Yes Yes(3) No


ANN_model1 Sklearn ANN Treated No Yes Yes(3) No

ANN_model2 Sklearn ANN Treated No Yes Yes(3) No


ANN_model3
alpha = 0.2 Sklearn ANN Treated No Yes Yes(3) No
ANN_model3
alpha = 0.1 Sklearn ANN Treated No Yes Yes(3) No
ANN_model3
alpha = 0.05 Sklearn ANN Treated No Yes Yes(3) No
ANN_model3
alpha = 0.04 Sklearn ANN Treated No Yes Yes(3) No
ANN_model3
alpha = 0.03 Sklearn ANN Treated No Yes Yes(3) No

KNN_model1 Sklearn KNN Treated No Yes Yes(3) No


KNN_model2 Sklearn KNN Treated No Yes Yes(3) No

RF_model1 Sklearn RandomForest Treated No No Yes(3) No

RF_model2 Sklearn RandomForest No No No Yes(3) No

RF_model3 Sklearn RandomForest Treated No No Yes(3) No


ADA_model1 Sklearn Adaboost Treated No No Yes(3) No

ADA_model2 Sklearn Adaboost Treated No No Yes(3) No


GB_model1 Sklearn Gradient Boost Treated No No Yes(3) No

GB_model2 Sklearn Gradient Boost Treated No No Yes(3) No


dropped as its VIF was high
Train data Test data
Hyper
parameters Accuracy Precision Recall F1 AUC Accuracy Precision Recall
Default base model 0.89 0.77 0.51 0.61 0.88 0.89 0.78 0.5
Default base model, RFE
variables=8 AUC = 0.77 AUC = 0.77
Default base model, RFE
variables =12 AUC = 0.78 AUC = 0.78
Default base model, RFE
vars=16 0.89 0.78 0.49 0.6 0.88 0.89 0.8 0.47
Default base model, RFE
vars=18 0.89 0.77 0.49 0.6 0.88 0.89 0.79 0.48
Default base model, RFE
vars=19 0.89 0.77 0.49 0.6 0.88 0.89 0.8 0.48
Gridsearch CV, best
model for f1 score 0.89 0.77 0.5 0.61 0.88 0.89 0.78 0.49
Default base model 0.81 0.8 0.82 0.81 0.89 0.79 0.44 0.79
Default base model 0.89 0.77 0.5 0.61 0.88 0.89 0.78 0.49
Iterated 4 times to
remove 4 variables 0.89 0.78 0.51 0.61 0.88 0.89 0.79 0.48
Default base model 0.89 0.77 0.47 0.58 0.88 0.88 0.77 0.45
Gridsearch CV, best
model for f1 score 0.89 0.77 0.47 0.59 0.88 0.88 0.77 0.45
Default base model 0.84 0.51 0.73 0.6 0.86 0.83 0.49 0.68
Default base model 0.94 0.93 0.71 0.8 0.97 0.93 0.9 0.64
Gridsearch CV, best
model for f1 score 0.99 0.93 1 0.96 1 0.97 0.87 0.93
Gridsearch CV, best
model for f1 score 0.98 0.89 0.99 0.94 1 0.95 0.83 0.92
Default base model 1 1 1 1 1 0.98 0.94 0.91
Gridsearch CV, best
model for f1 score 0.99 0.98 0.97 0.97 1 0.97 0.92 0.89
Default base model with
alpha = 0.2 0.97 0.95 0.89 0.92 0.99 0.95 0.92 0.8
Default base model with
alpha = 0.1 0.99 0.98 0.95 0.97 1 0.97 0.94 0.85
Default base model
with alpha = 0.05 0.99 0.98 0.98 0.98 1 0.97 0.94 0.9
Default base model with
alpha = 0.04 1 0.99 0.98 0.99 1 0.97 0.94 0.9
Default base model with
alpha = 0.03 1 0.99 0.98 0.99 1 0.97 0.95 0.86

Default base model 0.98 0.95 0.93 0.94 1 0.96 0.88 0.85
GridSearchCV, best
model for f1 score 1 1 1 1 1 0.98 0.93 0.93

Default base model 1 1 1 1 1 0.97 0.97 0.86


Default base model
with outliers 1 1 1 1 1 0.97 0.98 0.86
GridSearchCV, best
model for f1 score 0.98 0.98 0.91 0.94 1 0.95 0.9 0.78
Default base model 0.9 0.74 0.61 0.67 0.91 0.9 0.75 0.6
GridSearchCV, best
model for f1 score 0.9 0.75 0.6 0.67 0.92 0.9 0.76 0.6
Default base model 0.91 0.82 0.61 0.69 0.93 0.91 0.82 0.57
GridSearchCV, best
model for f1 score 1 1 1 1 1 0.99 0.99 0.94
Test data

F1 AUC
0.61 0.87 3 VIF variables dropped (20 predictors)

AUC = 0.77

AUC = 0.78

0.59 0.87 Using RFE, 16 predictors

0.6 0.87 Using RFE, 18 predictors

0.6 0.87

0.6 0.87 GridSearchCV


0.56 0.87 Smoted train data Bad performance on smote
0.6 0.87 Scaled data, default model with all predictors

0.6 0.87
0.57 0.86

0.57 0.86
0.57 0.84
0.75 0.94

0.9 0.98

0.87 0.97 This one is with data not treated for outliers
0.93 0.99

0.91 0.99

0.85 0.97 Too much penalty, reduce penalty

0.9 0.98 Same code is reused to change and record

0.92 0.99 Chosen model as the diff between train and test is less

0.92 0.99

0.91 0.99 Overfit - Recall

0.87 0.98
0.93 0.99

0.91 0.99

0.92 0.99

0.84 0.98
0.67 0.9

0.67 0.91
0.67 0.91

0.96 1

You might also like