Professional Documents
Culture Documents
Modeling Course Sec 02
Modeling Course Sec 02
Modeling Course Sec 02
using SAS
Jul-06
Exclusions :
– Accounts bankrupt or charged off ;
– Months on Book<12 ;
– Current Delinquency > 2 due
– Current delinquency <= 1 due
Performance Definition :
– Bad (Bad / Response =1) : 90+ DPD or worse in the next 6 months.
– Good (Bad / Response =0) : Never 90+ or worse in next 6 months
Validation : In time validation / Out of time validation
Hence all these details are finalized while fixing the project design and objective.
Observation and performance window are decided based on the objective and
History Performance window
the underlying Product
Observation Point or Scoring Point :
– The time period at which the model is being built.
Performance Window :
– The time window which is used to gauge the performance of account to
assign a good/ bad value.
History Window :
– History window contains the past performance of the account
– Used for Behavioral scores where on-us history is significant
– On –us history : Performance of the account on our books
– off Us- Generic performance of the customer provided by Bureau across all
products]
4/18/2014 7:33 AM Created by – Gopal Prasad Malakar 10
Performance Definition
3.00%
2.50%
What should
2.00% be the ideal
performance
1.50%
window?
1.00%
0.75% 0.75%
0.50%
0.50% 0.29% 0.25%
0.13% 0.15% 0.13% 0.11% 0.14%
0.03% 0.05%
0.00%
03 06 09 12 15 18 21 24 27 30 33 36
3.00%
Cumulative write off rate
2.50%
2.00%
1.50%
1.00%
0.50%
0.00%
03 06 09 12 15 18 21 24 27 30 33 36
Q4-2010 0.03% 0.08% 0.21% 0.50% 1.00% 1.75% 2.50% 2.75% 2.90% 3.03% 3.14% 3.28%
Q2-2011 0.05% 0.09% 0.28% 0.60% 1.15% 1.80% 2.75% 2.90% 3% 3.15%
Q4-2011 0.04% 0.08% 0.32% 0.62% 1.20% 1.85% 2.60% 2.75%
Q4-2012 0.10% 0.20% 0.45% 0.90%
Jul-11
• Why
Apr-11
Rolling
window /
Jan-11 Variable
window?
0 1 2 3 4 5 6 7 8 9 10 11 12
Variable Window
Q1-11
4/18/2014
0 7:331 AM 2 3 4 5 6
Created by 7– Gopal 8Prasad Malakar
9 10 11 12 16
Observation and Performance window (contd.)
Rule of thumb –
if the variance in performance window ( say 3 months when performance
window is 24 to 27 months) is quite smaller is comparison of minimum
performance window, variable performance window is ok
Where as if the variance in performance window (say 3 months when
performance window is 6 to 9 months) is quite big is comparison of minimum
performance window, one should go for rolling performance window
Checkpoints to be kept in mid while deciding the objective and Project Design :
Data Sufficiency involves ensuring that the data has the required attributes to
make the prediction as stated by objective
For ex : 1) To build a model to predict fraud, the given data doesn’t have any
key for identifying fraud accounts or those identifiers are erased than there are no
accounts which we can identify as ‘bad’ and build a model to predict the same.
2) Alternatively if we are building a response model specifically for internet
channel for a “airline card”. Then data should have a identifier for ‘channel of
acquisition’ to identify the right data base on which to build the model
Be careful with not selecting effect variable as cause variable
Be careful to select variable, which are available at the time scoring