Professional Documents
Culture Documents
Risks 09 00208 v2
Risks 09 00208 v2
Article
Development of an Impairment Point in Time Probability of
Default Model for Revolving Retail Credit Products:
South African Case Study
Douw Gerbrand Breed 1 , Niel van Jaarsveld 2 , Carsten Gerken 3 , Tanja Verster 1
and Helgard Raubenheimer 1, *
1 Centre for Business Mathematics and Informatics, North-West University, Private Bag X6001,
Potchefstroom 2520, South Africa; gerbrand.breed@gmail.com (D.G.B.); tanja.verster@nwu.ac.za (T.V.)
2 Independent Researcher, 20 Timbavati, St. Christopher Rd., St. Andrews, Bedfordview 2007, South Africa;
Niel.vj@hotmail.com
3 Independent Researcher, Hartmannsweilerstr. 55, 65933 Frankfurt am Main, Germany;
carsten.gerken@gmx.net
* Correspondence: helgard.raubenheimer@nwu.ac.za
Abstract: A new methodology to derive IFRS 9 PiT PDs is proposed. The methodology first derives a
PiT term structure with accompanying segmented term structures. Secondly, the calibration of credit
scores using the Lorenz curve approach is used to create account-specific PD term structures. The
PiT term structures are derived by using empirical information based on the most recent default
information and account risk characteristics prior to default. Different PiT PD term structures are
Citation: Breed, Douw Gerbrand, developed to capture the structurally different default risk patterns for different pools of accounts
Niel van Jaarsveld, Carsten Gerken, using segmentation. To quantify what a materially different term structure constitutes, three tests are
Tanja Verster, and Helgard proposed. Account specific PiT PDs are derived through the Lorenz curve calibration using the latest
Raubenheimer. 2021. Development of default experience and credit scores. The proposed methodology is illustrated on an actual dataset,
an Impairment Point in Time using a revolving retail credit portfolio from a South African bank. The main advantages of the
Probability of Default Model for proposed methodology include the use of well-understood methods (e.g., Lorenz curve calibration,
Revolving Retail Credit Products:
scorecards, term structure modelling) in the banking industry. Further, the inclusion of re-default
South African Case Study. Risks 9:
events in the proposed IFRS 9 PD methodology will simplify the development of the accompanying
208. https://doi.org/10.3390/
IFRS 9 LGD model due to the reduced complexity for the modelling of cure cases. Moreover, attrition
risks9110208
effects are naturally included in the PD term structures and no longer require a separate model.
Academic Editor: Jiří Witzany
Lastly, the PD term structure is based on months since observation, and therefore the arrears cycle
could be investigated as a possible segmentation.
Received: 16 August 2021
Accepted: 8 November 2021 Keywords: probability of default; point in time; IFRS 9; expected credit loss
Published: 15 November 2021
IFRS 9 proposes a “three stage model” when estimating ECL, based on changes in
credit quality since initial recognition (see Aptivaa (2016) and Beerbaum (2015) for further
discussion on the three stage methodology). A financial instrument is assigned to Stage 1 if
its credit risk (measured in terms of its probability of default as required by Section 5.5.9 of
(IFRS Foundation 2014)) has not increased significantly since initial recognition. A 12 month
ECL is recognised for such an instrument. On the other hand, a financial instrument is
assigned to Stage 2 if its credit risk has increased significantly since initial recognition. A
lifetime ECL is recognised for instruments in Stage 2. Finally, Stage 3 comprises all credit
impaired (defaulted) financial instruments for which a lifetime ECL is recognised.
The quantification of ECL is often broken down into its three components as similar
metrics are also required for other risk management purposes such as the quantification of
regulatory or economic capital requirements: the probability of default (PD), loss given
default (LGD) and exposure at default (EAD). A simplified expression for calculating
expected credit loss is ECL = PD × LGD × EAD. When using a PD term structure
(marginal PD’s), the expression changes as follows to calculate the ECL at account level:
ECLi = ∑tT=1 pi,tm × l × e , where pm is the marginal PD, l is the LGD when an account
i,t i,t i,t i,t
defaults at time t, and ei,t the EAD at time t, for account i (Schutte et al. 2020).
IFRS 9, being a principle based accounting standard, does not prescribe specific
methodologies for the 12 month or lifetime ECL estimation, and there is typically no single
methodology suitable to all portfolios (Global Public Policy Committee (GPPC) (2016)).
Literature is scarce on IFRS 9 specific PD methodologies. This paper aims to develop a new
methodology to derive the PiT (point in time) PD component (i.e., the marginal PD, pi,t m as
above) for the ECL calculations of financial instruments in Stages 1 and 2 (Stage 3 assets
are subject to a constant 100% PD estimate). In a PiT approach, borrowers are regraded
immediately as the economic cycle changes instead of a cycle-neutral, through-the-cycle
(TTC) grading (Taylor 2003).
The proposed methodology to estimate the marginal PD consists of two parts, the
term structure and the Lorenz calibration. The term structure part is a segment specific,
point in time based PD term structure that uses default information from the most recent
months. The Lorenz calibration is the calibration of credit scores to 12 month (Stage 1)
and lifetime (Stage 2) PDs to create account specific PD term structures. The proposed
methodology is illustrated in a case study for a revolving retail credit portfolio from a
South African bank.
2. Literature Overview
Several PD modelling approaches exist that may be considered as possible method-
ologies to derive the PD component for IFRS 9 purposes, although very few IFRS 9 PD
methodologies exist in published academic literature. In spite of the expansion of research
in respect to IFRS 9 in the past few years, it is still in its infancy in developing countries
(Dib and Feghali 2021).The focus of the literature overview will be on retail portfolios and
not wholesale portfolios, because some methodologies for wholesale portfolios are not
applicable for retail portfolios (see e.g., (Gubareva 2021)).
Some of these possible approaches will now be discussed in the literature overview.
How risk drivers are incorporated in each of these approaches will be mentioned, since
it is crucial to derive sufficiently granular PD estimates to assess significant increases in
credit risk as required by IFRS 9. Some advantages and disadvantages of each approach
will be discussed with specific consideration of the requirements of IFRS 9 (McPhail and
McPhail 2014). T, urlea (2021) highlights various rating methods and systems applicable
under the IFRS 9 framework with some advantages and disadvantages.
Logistic regression (T, urlea 2021) can be used in a scorecard to predict PDs (Thomas 2009).
Typically risk drivers will be included as independent variables (Anderson 2007). The
advantages of logistic regression include: it is simple to use, well known in the industry
(Siddiqi 2006), it produces account-level estimates, and it can regress multiple variables
without the need for segmentation (Siddiqi 2017). The key disadvantage from an IFRS 9
Risks 2021, 9, 208 3 of 22
perspective is that logistic regression is not designed for varying time horizons and, if used,
may result in unnecessary complexity.
Cumulative default curves can be estimated from empirical default and closure data
which is sometimes referred to as segmented empirical term structures (Schutte et al. 2020).
The PD term structures will often be segmented for different risk drivers. This method
is generally intuitive and well understood (Yang 2017) and directly includes re-defaults
and attrition effects. The one disadvantage is that the resulting estimates’ quality depends
significantly on how the term structures are segmented and granular segmentations are
often not possible due to data limitations such as a sufficient number of observations and
defaults for stable estimates per segment.
Markov chains (Aalen and Johansen 1978) can also be used to derive PD term struc-
tures by multiplication of empirically derived migration matrices that describe the tran-
sition between risk states (Cziraky and Zink 2017). To account for different risk drivers,
one can either use segmented migration matrices or incorporate these as specific risk states
into the migration matrix (e.g., delinquency). The advantages include that one can produce
PD estimates for any time horizon. They are easy to design and implement and provide
forecasts of future portfolio risk profiles (e.g., what % of today’s portfolio is expected to
be delinquent in 12 months) that can be used for budgeting/stress testing. However, the
disadvantages are that the standard time-invariant Markov chain assumption typically
does not result in a good fit for actual multi-period default behaviour. Additional time-
specific (e.g., two months after observation) migration matrices are, in these cases, required
to achieve acceptable fits, making the model very complex with limited benefit compared
to direct estimation of PD term structures. Furthermore, minor deviations in monthly
migrations may lead to significant over- or underestimation for multi-year PDs.
Run-off triangles (Braun 2004) use the most recent default and closure information to
predict the term structure for the performing portfolio. These run-off triangles are often
segmented for different risk drivers. The most significant advantage of this approach is that
it is easy to design and implement, easy to automate, and generates PiT estimates that are
often predictive for the near future. The disadvantages are that the derivation of account
specific PD term structures typically requires techniques like the simple linear scaling of
segment level PD term structures which may not result in a good fit. Structural changes in
the portfolio in the recent past might distort PD term structures (but can be corrected).
The chain ladder method (England and Verrall 2002) is a popular method that in-
surance companies use to estimate their required claim reserves and can also be used
to determine PDs. The chain ladder method utilises run-off triangles and has similar
advantages and disadvantages as the run-off triangles provided above.
Hazard models (T, urlea 2021) can be used to assess the riskiness of the obligor by
computing a score that indicates whether the obligor defaults within the specified horizon.
However, the models can be quite complex, and the model does not determine when the
obligor defaults will occur (Crook and Bellotti 2013). More generally, survival analysis can
also be used (Chimezda and Marimo 2017).
Panel models (T, urlea 2021) are also an alternative and has the advantage of capturing
effects over time (Crook and Bellotti 2010). However, these panels models can become
quite complex.
Autoregressive models (Glen 2015) can also be used where future default rates are
modelled as a function of previous default rates. Additional risk drivers can be included in
the modelling process as separate covariates in the regression. One of the most significant
advantages of autoregressive models is that they can incorporate macroeconomic forecasts
in the regression. However, disadvantages include that very granular segmentation is often
not possible, which leads to top-down approaches being required. Autoregressive models
are sometimes sensitive to recent changes (e.g., driven by credit policy changes).
Techniques within the data mining environment can also be applied, such as neural
networks (T, urlea 2021). However, there is no predetermined mechanism to determine the
Risks 2021, 9, 208 4 of 22
optimum network as the connections are seen as black boxes that are difficult to interpret
(Wielenga et al. 1999).
The herein presented PiT PD modelling approach uses calibration as a key model
component to derive account-level PD estimates. Calibration refers to the mapping of
credit score to a probability of default (Van Gestel and Baesens 2009). Several calibration
methods have been suggested in the literature (Medema et al. 2009; Glößner 2003). In
Glößner (2003), a method to estimate PDs using Lorenz curve construction based on credit
scores is described, which will be leveraged off in this paper.
3. Methodology
In the proposed methodology, we assume that the ECL for a period (0, T ], for an
account with a credit score s can be estimated as follows:
T
ECL T (s) = ∑ pemt (s) × eet (s) × elt (s), (1)
t =1
at the observation month Mk , and then defaulted in the period (t − 1, t] months after the
observation month and nk,t the number of performing accounts that survived in the period
(0, t − 1], where t = {0, 1, . . . , Tk } is the number of months since the observation month k,
i.e., nk,0 will be the number of performing accounts as at the observation month.
Table 1 is an illustrative example of the defaults table. The number of performing
accounts at the start of the observation month 201501 (i.e., M1 = 201501) was 500 accounts
(i.e., n1,0 = 500). Of these 500 accounts 10 accounts defaulted in 201502, during the period
(0, 1] (i.e., d1,1 = 10), and 5 accounts defaulted in 201503, during the period (1, 2] (i.e.,
d1,2 = 5). Note that no forecasting is required.
From the defaults table, the empirical PD’s, per observation month Mk can be calcu-
lated as follow:
dk,t
pmk,t = n , and (2)
k,0
t
pC
k,t = ∑ dk,i /nk,0 . (3)
i =0
After having derived the empirical marginal PD estimates pm k,t for the different ob-
servation months Mk and outcome horizons t = {1, 2, . . . , Tk }, it needs to be decided
which of these estimates should contribute to the final PD term structure and how these
contributions should be weighted. As mentioned earlier, the model’s objective is to yield
PiT PD estimates such that only defaults from the most recent R outcome months are used
in the final PD estimates. R is referred to as the ‘reference period’ and is a key parameter
for estimating the PD term structure. A shorter reference period will make the resulting
estimates more PiT but could result in unwanted volatility (and vice versa for longer
reference periods). If one uses a very long reference period over an entire economic cycle,
the resulting PD term structures can be considered to provide through-the-cycle (TTC) esti-
mates. It may sometimes also be required not to use the most recent data, e.g., if models are
refreshed in August but should only account for information until June. Hence, a reference
month M ∈ { M1 , M2 , . . . , MK } is used in order to denote the last observation month from
the set of observations months that is used in the derivation of pm k,t . Therefore only defaults
from the outcome months { M − ( R − 2), . . . , M, M + 1} are used for modelling purposes.
We estimated the marginal PD, pem t , from the defaults table, as the weighted average
of marginal PDs across the most recent R observation months with available outcome
horizons of at least t months. A weighting by the number of observations is performed to
smooth the impact of outlier months for segments with small and volatile population sizes.
Risks 2021, 9, 208 6 of 22
Thus given R and a reference month M (note that the term structure is generally
calculated taking the reference month to be the most recent observation month, i.e., M =
MK ) we estimate the marginal PD for time horizon t as
M−(t−1)
∑i= M−(t−1)−( R−1) di,t
pem
t ( R, M ) = M −(t−1)
. (4)
∑i= M−(t−1)−( R−1) ni,0
For example, if a reference period of 3 months is chosen (R = 3), and the reference month
is 201507 (M = 201507), then Table 2 illustrates the resulting marginal PDs.
To summarise, the approach will create PD term structures based on the most recent
default information and initial performing accounts as at observation month. It does not
require the explicit modelling of future cures or closures, as no survival analysis is required.
Below we discuss how the PD term structure is segmented.
L( x ) Fd ◦ Fa−1 ( x ). (6)
An example of a Lorenz curve (solid line) is shown in Figure 1. The x-axis of the Lorenz
curve indicates the cumulative proportion of all accounts, and the y-axis the cumulative
proportion of defaulted accounts. Suppose the defaulted accounts are distributed evenly
across scores amongst the portfolio. In that case, the y fraction is more or less equal to the x
fraction in which case the resulting Lorenz curve would be the 45-degree diagonal (dashed
line), indicating that the credit score s has no distinguishing power. The other extreme
would be a credit score s separating the good accounts from the bad ones perfectly (dotted
Risks 2021, 9, x FOR PEER REVIEW 8 of 22
Risks 2021, 9, 208 8 of 22
The other extreme would be a credit score 𝑠 separating the good accounts from the bad
line). In this case, the Lorenz curve runs linearly to y = 1, until the x value equal to the
ones perfectly (dotted line). In this case, the Lorenz curve runs linearly to 𝑦 = 1, until the
percentage of defaulted accounts is reached and stays there.
𝑥 value equal to the percentage of defaulted accounts is reached and stays there.
Figure1.1.Lorenz
Figure Lorenzcurve
curveexample.
example.
The
TheLorenz
Lorenzcurve
curve cancanbe beused
usedto toestimate
estimate thethe cumulative
cumulative PD,
PD, over horizon t,𝑡,for
over aa horizon for
accounts
accountswith withaagiven
givencredit
creditscore 𝑠 (Glößner
scores (Glößner 2003). The
2003). Theestimated
estimatedPDPDof of
accounts
accountswith a
with
given
a givencredit score
credit score 𝑠 isfraction
s is the of defaulted
the fraction accounts
of defaulted with a credit
accounts with ascore s ofscore
credit 𝑠 of all
all accounts
c ( s ) = f ( s ) / f ( s ), where f ( s ) the density of defaulted accounts
with a credit
accounts score
with s, i.e., pscore
a credit t 𝑠, i.e.,
d 𝑝 a(𝑠) = 𝑓 (𝑠)/𝑓d (𝑠), where 𝑓 (𝑠) the density of de-
and f a accounts and 𝑓 (𝑠) the density credit
faulted( s ) the density of all accounts for score s. Glößner
of all accounts (2003)
for credit 𝑠. Glößner
scorefurther shows the
(2003)
cumulative
further shows PD,the
over a horizon PD,
cumulative also: a horizon 𝑡, is also:
t, is over
F 0𝐹(s)(𝑠)
pct (𝑝s)(𝑠)
=L (𝑠)× p×=
=0 (ℒFa (𝐹s)) 𝑝̅ =d p, 𝑝̅ , (7)
(7)
(s)(𝑠)
Fa0 𝐹
where 𝑝̅ is the average PD over the whole portfolio.
where p is the average PD over the whole portfolio.
Specificinvariancy
Specific invariancyproperties
properties(Glößner
(Glößner2003) 2003)ofof the
the Lorenz
Lorenz curve
curve make
make this
this method
method a
a stable way to estimate PDs. Considerably less information is needed
stable way to estimate PDs. Considerably less information is needed to fit a numerically to fit a numerically
constructedLorenz
constructed Lorenzcurve
curvethan
thanto tofit
fitthe
thetwotwonumerical
numericaldistributions
distributionsititis
is constructed
constructed from.
from.
When applying the Lorenz curve, we make the following
When applying the Lorenz curve, we make the following two assumptions: two assumptions:
•• The Thevalidity
validityof ofthe
thecalculated
calculatedPD PDrelies
reliesononthe
theaccount’s
account’sscoring
scoringresults,
results, i.e.,
i.e., distin-
distin-
guishing power of the credit score
guishing power of the credit score used; and used; and
•• The Thedensity functions f 𝑓(s(𝑠)
densityfunctions
d ) andandf a (𝑓s)(𝑠)
areare assumed
assumed to be
to be lognormal.
lognormal.
Theensure
The ensurelognormality
lognormality assumption
assumption the the
credit credit
scorescore is transformed
is transformed (see Glößner
(see Glößner 2003)
2003) different
using using different transformations.
transformations. Assume Assume 𝑇(𝑠) refers
T (s) refers to a transformation
to a transformation applied
applied to theto
the credit
credit scorescore
s with 𝑠 awith
scorea score
range range
[s0 < [𝑠 s1 < < .𝑠. . <<⋯s J< 𝑠 ] where
] where 𝑠 sand
s0 and 𝑠 are the mini-
J are the minimum
mum and the maximum observed credit scores. Glößner
and the maximum observed credit scores. Glößner (2003) proves the Lorenz (2003) proves the Lorenzcurve’s
curve’s
transformation
transformation invariance and proposes three possible transformations:
and proposes three possible transformations: the quadratic, the quadratic, ex-
ponential and
exponential andlogarithm.
logarithm.
Byassuming
By assumingthat thatthe
the
densities f d (𝑓s)(𝑠)
densities andandf a (s𝑓) are
(𝑠) lognormal,
are lognormal, the cumulative
the cumulative PD
PD over
aover a horizon
horizon t can be𝑡 can be estimated
estimated from
from the datatheas:data as:
F̂d0 (s)
p̂ct (s) = p̂, (8)
F̂a0 (s)
Risks 2021, 9, 208 9 of 22
where F̂d (s) = LNorm( T (s); µd , σd ) and F̂a (s) = LNorm( T (s); µ a , σa ) are lognormal CDFs
of the transformed credit scores for defaulted and all accounts, respectively, with parame-
ters µd , σd , µ a , and σa . p̂ is the average PD estimated from the data.
The parameters may be estimated from the same data as in Table 1. For example, to
construct a three month cumulative PD from the data in Table 1, over an observation period
of three months, Table 3 gives an example of the data required to construct the Lorenz curve.
Furthermore, consider a scoring range between 1 and 3 with 1 unit between each score,
where 1 is the worst score and 3 is the best
score. The number of performing accounts and
cumulative defaulted accounts, dck,t s j = ∑it=0 dk,i s j , are determined for each score in
Observation Month (Mk ) Credit Score (sj ) Performing Accounts (nk,0 (sj )) Number of Defaults (dck,3 (sj ))
1 50 10
201501 2 250 5
3 200 4
1 70 12
201,502 2 280 7
3 200 3
1 90 14
201,503 2 300 8
3 210 3
D J
, where D = ∑ j=1 ∑k∈ M0 dck,t s j
The average PD of the portfolio p is estimated as p̂ = N
J
and N = ∑ j=1 ∑k∈ M0 nk,0 s j . M0 is any subset of Mk , the observation period (i.e., sam-
ple window) and t the horizon (i.e., the performance window) over which the Lorenz
curve is calibrated.
The parameters µd , σd , µ a and σa for the calibrated Lorenz curve
− 1
L̂(s) = F̂d F̂a (s) , can be estimated by minimising the sum of squares of differences to
the empirical Lorenz curve. The point wise-defined empirical Lorenz curve is
c
!J
∑k∈ M0 nk,0 s j ∑k∈ M0 dk,t s j
, . (9)
N D
j =1
where p̂cT (s) the cumulative Lorenz curve calibrated T 0 month PD for an account with
credit score s and pecT 0 ( R, M ) the cumulative T 0 month term structure PD. The resulting
marginal PD for accounts with a credit score s in the period (t − 1, t] is then defined as
0
pem em
t s, T = SFT 0 ( s ) × p t ( R, M ). (12)
This approach ensures that the average predicted lifetime PD remains aligned to actual
default behaviour in the respective segment.
4.
4. Case
Case Study
Study
This
This section illustrates the
section illustrates the proposed
proposedIFRS
IFRS99PiT
PiTPDPDmethodology
methodologydescribed
describedinin Section3
Section
3onona revolving
a revolving retail portfolio from one of the major banks in South Africa. Although
retail portfolio from one of the major banks in South Africa. Although little
little information
information will be will be provided
provided due todue to the confidential
the data’s data’s confidential
nature, nature, the purpose
the purpose of
of the case
the
studycase
is study
to show is how
to showour how our proposed
proposed methodologymethodology can be implemented.
can be implemented. For con-
For confidentiality
fidentiality purposes,
purposes, results wereresults
alteredwere
(PDaltered
values (PD values by
multiplied multiplied
a random byvalue).
a random value).4.1,
In Section In
Section 4.1, thePiT
the empirical empirical
PD termPiT PD term
structure structure
will will be constructed
be constructed and furtherand further segmented
segmented by relevant
by
riskrelevant
drivers.risk drivers.4.2,
In Section In Section 4.2, curve
the Lorenz the Lorenz curve is
calibration calibration
performed. is performed.
4.1. Empirical
4.1. Empirical PiT
PiT PD
PD Term
Term Structures
Structures
The empirical
The empirical PiT
PiT PD
PD term
term structures
structures will
will bebe constructed
constructed using
using aa dataset
dataset that
that contains
contains
monthly account level default status from September 2005 to April 2020.
monthly account level default status from September 2005 to April 2020. The set of obser- The set of
observation months is therefore { 200509, . . . , 202004 }
vation months is therefore {200509, … ,202004} with 𝐾 = 176. The reference period 𝑅 is
with K = 176. The reference period
R is taken c (24, 202004)) and marginal
p(24,202004)
taken as 24asmonths.
24 months.
ForFor
thethe portfolio,
portfolio, thethe cumulative
cumulative (𝑝(e t ) and marginal
pm
(e
(𝑝 (24, 202004)) PD
t (24,202004)) PDterm
termstructures
structuresasasofof
April
April 2020 areare
2020 displayed
displayed in Figure 2 panel
in Figure (a) and
2 panel (a)
(b) resepectively.
and (b) resepectively.
Figure
Figure 2. 2. Cumulative
Cumulative and
and Marginal
Marginal PDPD term
term structure.
structure.
The
The
The first
first
first test
test is is aa visual
visual
a visual comparison
comparison
comparison ofofofthe
the the segmented
segmented
segmented PDPDPD term
term
term structures
structures
structures vsvs vsun-
thethe the
un-
unsegmented
segmented PD PDtermterm structure
structure indicated
indicated in in Figure
Figure 3. 3.
The The
four four segmented
segmented
segmented PD term structure indicated in Figure 3. The four segmented PD term struc- PD PD
term term
struc-
structures
tures do cross
not cross and show clear risk differentiation.
tures dodo not
not cross and and show
show clear
clear risk
risk differentiation.
differentiation.
Figure
Figure
Figure 3.3.3. Segmented
Segmented
Segmented cumulative
cumulative
cumulative PD
PD
PD term
term
term structures.
structures.
structures.
In
InInthethe
the second
second
second test,
test,
test, Figure
Figure
Figure 4,4,4,
we we
we compare
compare
compare the the
the ratio
ratio
ratio of
ofof cumulative
cumulative
cumulative segmented
segmented
segmented PDs
PDs
PDs over
over
over
different
different horizons
horizons to
to the
the 12
12 month
month cumulative
cumulative segmented
segmented
different horizons to the 12 month cumulative segmented PD, using 𝑅𝑎𝑡𝑖𝑜 (24, 𝑀) for PD,
PD, using
using 𝑅𝑎𝑡𝑖𝑜
Ratio T ((24,
24, M𝑀)
) for
for
T𝑇 =
= 24,
24, 36,
36, 4848 months
months and
and 𝑀 M ∈ {201504,
∈ { 201504,… ,202004}.
. . . , 202004
𝑇 = 24, 36, 48 months and 𝑀 ∈ {201504, … ,202004}. The graph confirms that if no seg-}
The . graph
The confirms
graph that
confirms if
that noif seg-
no
segmentation
mentation
mentation is is is applied,
applied,
applied, then then
then this
this this
would would
would result
result
result inin a in a significant
a significant
significant over-
over-
over- and
and
and understatement
understatement
understatement ofof
lifetime PDs. Note that the unintuitive pattern (high spike) for Segment 3 (see Figure 4 44
of lifetime
lifetime PDs.
PDs. Note
Note that
that thethe unintuitive
unintuitive pattern
pattern (high
(high spike)
spike) for
for Segment 3 (see
(see Figure
Figure
panel
panel
panel (c))
(c)) inin
(c)) in earlier
earlier
earlier periods
periods
periods isisisdue
duedue toto
to low
lowlow marginal
marginal
marginal PD’s
PD’s
PD’s forfor
forsmall
small
small horizons,
horizons,
horizons, asas
as shown
shown
shown inin
in
Figure
Figure 33 (almost
(almost flat
flat cumulative
cumulative
Figure 3 (almost flat cumulative PD). PD).
PD).
(a)(a) Segment
Segment 11 (b)(b) Segment
Segment 22
Figure 4. Cont.
Risks 2021, 9, 208 13 of 22
Risks 2021, 9, x FOR PEER REVIEW 13 of 22
Risks 2021, 9, x FOR PEER REVIEW 13 of 22
Figure
Figure4.4.
4.The
Theratios
ratiosofof
ofcumulative
cumulativesegmented
segmentedPDs
PDsover
overdifferent
differenthorizons.
horizons.
Figure The ratios cumulative segmented PDs over different horizons.
Figure
Figure555compares
comparesthe thecumulative
cumulativesegmented
segmentedPDs PDstoto
tocumulative
cumulativeunsegmented
unsegmentedPDs PDs
Figure compares the cumulative segmented PDs cumulative unsegmented PDs
over
overtime
timefor
fordifferent horizons
different horizons in
in the
thethird
third test.
test.This
Thistest
testconfirms
confirms that
thatthe
thedifference
difference
over time for different horizons in the third test. This test confirms that the difference
between
betweenthethesegmented
segmentedPD PDterm termstructures
structuresandandthe
theunsegmented
unsegmentedone oneremains
remainsrelatively
relatively
between the segmented PD term structures and the unsegmented one remains relatively
consistent
consistentover
over different
different horizons
horizons 𝑇T ==12,
12,24,
24,36, 48.48.Again,
36, Again, the PDs
the were
PDs wereanalysed
analysed over
over
consistent over different horizons 𝑇 = 12, 24, 36, 48. Again, the PDs were analysed over
time with
time with 𝑀M∈∈{201504,
{ 201504,…. ,202004}.
. . , 202004 } .
time with 𝑀 ∈ {201504, … ,202004}.
Figure 5. Cont.
Risks 2021, 9, 208 14 of 22
Risks 2021, 9, x FOR PEER REVIEW 14 of 22
Figure
Figure5.
5.Cumulative
Cumulativesegmented
segmentedPDs
PDsto
tocumulative
cumulative unsegmented
unsegmented PDs
PDs over
over different
different horizons.
horizons.
For
For each
each of
of the
the four
four proposed
proposed segments
segments above,
above, the
the tests
tests above
above were
were performed
performed andand
confirmed
confirmed that these
theseterm
termstructures
structuresare
arematerially
materially different,
different, andand therefore,
therefore, we will
we will con-
continue
tinue with these
with these four segments.
four segments.
4.2.Lorenz
4.2. LorenzCurve
CurveCalibration
Calibration
Only three
Only three months
months ofof credit
credit scores
scores were
were available
available toto calibrate
calibrate the
the Lorenz
Lorenz curve
curve with
with
𝑀M0=={201712
{201712, ,201803
201803, ,201804}
201804} with
with aa 12 and 22 month
month performance
performancewindow
windowfor forStage
Stage1
1and
andStage
Stage2,2,respectively.
respectively.The
Thechoice of of
choice a 22 month
a 22 month performance
performance window
window for for
thethe
Stage 2 PD
Stage 2
was based on the bank-specific data availability, economic cycle and credit
PD was based on the bank-specific data availability, economic cycle and credit score avail-score availability.
ability.Four different calibrations were performed, one for each base PD term structure
segment,
Four to derive account
different levelwere
calibrations 12 month calibrated
performed, one PDs and base
for each another
PD four
termcalibrations for
structure seg-
the lifetime calibrated PDs. The behavioural credit score was used for
ment, to derive account level 12 month calibrated PDs and another four calibrations for these Lorenz curve
calibrations
the as described
lifetime calibrated in The
PDs. Section 3.2.
behavioural credit score was used for these Lorenz curve
Table 5 shows the selected transformation,
calibrations as described in Section 3.2. SSD and Gini for the 12 month Lorenz curve
calibrations
Table 5 shows the selected transformation, SSDcurves
for the four segments. The fitted Lorenz and Giniare presented
for the 12in FigureLorenz
month 6, and
the calibrated PD per credit score are presented in Figure 7.
curve calibrations for the four segments. The fitted Lorenz curves are presented in Figure
6, and the calibrated PD per credit score are presented in Figure 7.
Table 5. 12 month Lorenz curve calibration statistics.
Table 5. 12 month Lorenz curve calibration statistics.
Segment Transformation SSD Gini
Segment
Segment 1 Transformation
Quadratic SSD
0.57% Gini
56.60%
Segment
Segment 1
2 Quadratic
Quadratic 0.57%
0.46% 56.60%
61.30%
Segment 2
Segment 3
Quadratic
Exponential
0.46%
2.44%
61.30%
46.50%
Segment 3 Exponential 2.44% 46.50%
Segment 4 Quadratic 0.83% 55.90%
Segment 4 Quadratic 0.83% 55.90%
Risks 2021, 9, 208 15 of 22
Risks 2021, 9, x FOR PEER REVIEW 15 of 22
Note
Note that
that aa Gini
Gini ranges
ranges from
from 00 to
to 1, and the
1, and the higher
higher the
the Gini,
Gini, the
the better
better the
the scorecard
scorecard
differentiates
differentiates between
between the defaulters and
the defaulters and non-defaulters.
non-defaulters. The calibration function
The calibration function yields
yields
aa monotonic
monotonicPD PDcurve,
curve,asasshown
shownininFigure
Figure7. This clearly
7. This indicates
clearly that that
indicates the resulting cali-
the resulting
brated PDsPDs
calibrated are higher for lower
are higher credit
for lower scores
credit and and
scores lower for higher
lower credit
for higher scores.
credit All seg-
scores. All
ments used the quadratic transformation except for Segment
segments used the quadratic transformation except for Segment 3 where 3 where the exponential
transformation
transformation waswas used as the quadratic function did not yield yield anan acceptable
acceptable fit.
fit. The
resulting SSD
SSD for
forSegment
Segment3 3waswashigher
higher than
than forfor other
other Segments
Segments andandhadhad a lower
a lower Gini.Gini.
The
The calibration
calibration function
function showedshowed
slight slight non-monotonic
non-monotonic behaviour
behaviour for thescore
for the worst worst score
buckets,
buckets, which
which carry carryany
hardly hardly any observations
observations as Segmentas Segment
3 only 3consist
only consist of customers
of customers with with
very
very consistent
consistent paymentpayment behaviour
behaviour and low
and thus thusdefault
low default
risk. risk.
Table 6 shows the selected transformation, SSD and Gini for the 22 month Lorenz curve
calibrations for the four segments. The fitted Lorenz curves are presented in Figure 8, and
the calibrated PD per credit score are presented in Figure 9.
Risks 2021, 9, 208 16 of 22
Risks 2021, 9, x FOR PEER REVIEW 16 of 22
Figure 7. 12
Figure 7. 12 month
month calibrated
calibrated PD
PD by
by score.
score.
TableTable 6 shows
6. 22 month thecurve
Lorenz selected transformation,
calibration statistics. SSD and Gini for the 22 month Lorenz
curve calibrations for the four segments. The fitted Lorenz curves are presented in Figure
Segment
8, and the calibrated PD perTransformation
credit score are presentedSSD
in Figure 9. Gini
Segment 1 Quadratic 0.35% 54.50%
Table 6. 22 month Lorenz curve calibration statistics.
Segment 2 Quadratic 0.47% 58.30%
Segment3
Segment Transformation
Exponential SSD
2.59% Gini
39.40%
Segment
Segment 41 Quadratic
Quadratic 0.35%
0.62% 54.50%
51.50%
Segment 2 Quadratic 0.47% 58.30%
Segment 3 Exponential 2.59% 39.40%
Segment 4 Quadratic 0.62% 51.50%
Risks 2021, 9, 208 17 of 22
Risks 2021, 9, x FOR PEER REVIEW 17 of 22
Figure
Figure 8.
8. 22
22 month
month Lorenz
Lorenz curves.
curves.
Again,
Again, all segments used the quadratic transformation except for Segment 3 where
the exponential transformation
transformation was used. TheThe resulting
resulting SSD
SSD for Segment 3 was higher
than other Segments and had a lower Gini. The calibration
calibration function
function showed
showed slight
slight non-
non-
monotonic
monotonic behaviour
behaviour for
for the
the worst
worst score buckets,
buckets, which
which carry
carry hardly
hardly any observations.
observations.
All four lifetime PD calibration functions yielded monotonic PD curves, except for buckets
with very few observations.
4.3. Account-Specific
4.3. Account-Specific Term
Term Structures
Structures
We will
We will illustrate
illustrate the
the account
account specific
specific term
term structures.
structures. We We will
will present
present thethe scaling
scaling
factors for accounts of which 25% of the accounts had a credit score of 783 or less and for
factors for accounts of which 25% of the accounts had a credit score of 783 or less and for
accounts of
accounts of which
which75% 75%ofofthe
theaccounts
accountshad
hada credit score
a credit scoreof of
895895
or less. Table
or less. 7 provides
Table the
7 provides
associated
the calibrated
associated Lorenz
calibrated cumulative
Lorenz PD as
cumulative PDwell as the
as well as scaling factors
the scaling for Segment
factors 1.
for Segment
1.
Risks 2021, 9, 208 18 of 22
Risks 2021, 9, x FOR PEER REVIEW 18 of 22
0.8 10
Population (Thousands)
0.7
8
0.6
Calibrated PD
0.5 6
0.4
0.3 4
0.2
2
0.1
0 0
0 200 400 600 800
Credit Score
Table 7. Illustration
Table of scaling
7. Illustration factorsfactors
of scaling for Segment 1.
for Segment 1.
0 Performance Win- Lorenz Calibrated
c PD Scaling Factor
Performance Window (T ) Credit Score (s) Credit Score
Lorenz(𝒔)Calibrated PD (p^𝒄 t (s)) Scaling Factor (SFT0 (s))
dow (𝑻 ) (𝒑𝒕 (𝒔)) (𝑺𝑭𝑻 (𝒔))
25th Percentile (783)
25th Percentile (783) 14.07% 14.07% 1.294
1.294
12 month 12 month
75th Percentile (895)
75th Percentile (895) 1.89% 1.89% 0.173
0.173
25th Percentile (783) 27.03% 27.03%
25th Percentile (783) 2.486
2.486
22 month 22 month
75th
75th Percentile (895) Percentile (895) 6.19% 6.19% 0.569
0.569
In Figure 10, we present the adjusted term structures for Stage 1 and Stage 2, for ac-
In Figure 10, we present the adjusted term structures for Stage 1 and Stage 2, for
counts in Segment 1, with associated credits scores as above. As mentioned before, for
accounts in Segment 1, with associated credits scores as above. As mentioned before, for
Stage 2, a performance window of 22 months is used. Note that the resulting cumulative
Stage 2, a performance window of 22 months is used. Note that the resulting cumulative
lifetime PDs may be unrealistically high or low as seen in Figure 10b, if scaling factors are
lifetime PDs may be unrealistically high or low as seen in Figure 10b, if scaling factors
applied to marginal PDs for horizons 𝑡 > 𝑇 = 22.0 Therefore scaling factors are only ap-
are applied to marginal PDs for horizons t > T = 22. Therefore scaling factors are
plied to derive marginal PDs for 𝑡 ≤ 𝑇 = 22 (see 25th percentile (Cap) and 75th percen-
only applied to derive marginal PDs for t ≤ T 0 = 22 (see 25th percentile (Cap) and 75th
tile (Cap) in Figure 10b).
percentile (Cap) in Figure 10b).
Risks 2021, 9, 208 19 of 22
Risks 2021,
Risks 2021, 9,
9, xx FOR
FOR PEER
PEER REVIEW
REVIEW 19 of
19 of 22
22
(a) Stage
(a) Stage 11 (b) Stage
(b) Stage 22
Figure 10.
Figure 10. Adjusted
Adjusted cumulative
Adjusted cumulative PD term
cumulative PD term structure
structure for
for Segment
Segment 1.
1.
Figure 10
Figure 10 was
10 wasused
was usedtoto
used toillustrate
illustratethe
illustrate the
the adjusted
adjusted
adjustedtermterm
term structures
structures
structures for
for for
Stage Stage 11 and
1 and
Stage and
Stage Stage
2 and22
Stage
and the
the capping
and capping effect
effecteffect
the capping for
for Segment Segment
for Segment 1.
1. Figure Figure
11 below
1. Figure 11 below
compares
11 below compares the
the calibrated
compares calibrated
the calibrated and
and uncalibrateduncal-
and uncal-
ibrated
PD for all
ibrated PDfour
PD for segments
for all four
all four segments
segments
for Stagefor for Stagethat
1. Stage
Note 1. Note
1. Note that the
the scale
that the scale
for scale for the
the cumulative
for the cumulative
cumulative PD is
PD is different
PD is
different
for the for
four the
graphs fourto graphs
enhance to enhance
visibility. visibility.
In panel In
(a), panel
for (a),
Segment
different for the four graphs to enhance visibility. In panel (a), for Segment 1, the 25th for 1, Segment
the 25th 1, the 25th
percentile
percentile
percentile calibrated
calibrated calibrated
PD (dottedPDPDline)(dotted line) the
represents
(dotted line) represents
calibrated
represents thePD
the calibrated PD for
for an account
calibrated PD for with
an account
an account with
a creditwithscoreaa
credit
of 783, score
and of
the 783,
75th and the
percentile 75th percentile
calibrated PD calibrated
(dashed PD
line) (dashed
represents
credit score of 783, and the 75th percentile calibrated PD (dashed line) represents the cal- line)the represents
calibrated the
PD cal-
for
ibrated
an accountPD for
with ana account
credit with
score ofa credit
895. score
Similarly,of 895.
Segments Similarly,
2 to
ibrated PD for an account with a credit score of 895. Similarly, Segments 2 to 4 are illus-4Segments
are 2
illustratedto 4 are
in illus-
panels
trated in
(b)–(d).
trated inThe
panels (b)–(d).credit
respective
panels (b)–(d). The respective
The respective credit
scores (830credit
and 917 scores (830 and
for Segment
scores (830 and2,917
917 for
615for and Segment 2, 615
620 for Segment
Segment 2, 615 and
and3,
and
620 864
for and
Segment903 for
3, Segment
and 864 and 4) are
903 calculated
for Segment such
4) arethat 25%
calculated of
620 for Segment 3, and 864 and 903 for Segment 4) are calculated such that 25% of accounts accounts
such that had
25% credit
of scores
accounts
below
had
had and scores
credit
credit 75% ofbelow
scores accounts
below andhad
and 75%credit
75% of scores below
of accounts
accounts the respective
had credit
had credit scores below
scores belowvalues. This comparative
the respective
the respective values.
values.
analysis
This demonstrates
comparative the
analysis advantage
demonstrates of the
themethodology
advantage ofby
This comparative analysis demonstrates the advantage of the methodology by providing theproviding
methodology more granular
by providing PD
term
more structures
granular per
PD credit
term score.
structures per
more granular PD term structures per credit score. credit score.
(a) Segment
(a) Segment 11 (b) Segment
(b) Segment 22
Figure
Figure 11.
11. Comparison
Comparison of
of calibrated
calibrated and
and uncalibrated
uncalibrated PD
PD for
for Stage
Stage 1.
1.
Author Contributions: All authors contributed equally to this work. All authors have read and
agreed to the published version of the manuscript.
Funding: This work is based on research supported in part by the Department of Science and
Innovation (DSI) of South Africa. The grant holder acknowledges that opinions, findings and
conclusions or recommendations expressed in any publication generated by DSI-supported research
are those of the authors and that the DSI accepts no liability whatsoever in this regard.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
References
Aalen, Odd O., and Søren Johansen. 1978. An Empirical Transition Matrix for Non-Homogeneous Markov Chains Based on Censored
Observations. Scandinavian Journal of Statistics 5: 141–56.
Anderson, Raymond. 2007. The Credit Scoring Toolkit: Theory and Practice for Retail Credit Risk Management and Decision Automation.
Oxford: Oxford University Press.
Aptivaa. 2016. Building Blocks of Impairment Modeling (Issue 02). Available online: http://www.aptivaa.com/blog/wp-content/
uploads/2016/04/Blog_02-Building-Blocks-of-Impairment-Modeling.pdf (accessed on 5 June 2021).
Baesens, Bart, Daniel Rösch, and Harald Scheule. 2016. Credit Risk Analytics. Hoboken: SAS Institute, Wiley.
Beerbaum, Dirk. 2015. Significant Increase in Credit Risk According to IFRS 9: Implications for Financial Institutions. International
Journal of Economics and Management Sciences 4: 1–3. [CrossRef]
Braun, Christian. 2004. The prediction error of the chain ladder method applied to correlated run-off triangles. Astin Bulletin 34:
399–423. [CrossRef]
Breed, Douw G., Tanja Verster, Willem D. Schutte, and Naeem Siddiqi. 2019. Developing an impairment loss given default model using
weighted logistic reegression: Ilustrated on a secured retail bank portfolio. Risks 7: 123. [CrossRef]
Chimezda, Charles, and Mercy Marimo. 2017. Survival analysis of bank loans in the presence of long-term survivors. South African
Statistical Journal 51: 199–216.
Crook, Jonathan, and Tony Bellotti. 2010. Time varying and dynamic models for default risk in consumer loans. Journal of the Royal
Statistical Society: Series A (Statistics in Society) 173: 283–305. [CrossRef]
Crook, Jonathan, and Tony Bellotti. 2013. Forecasting and stress testing credit card default using dynamic models. International Journal
of Forecasting 29: 563–74.
Cziraky, Dario, and Ulrich Zink. 2017. Multi-State Markov Modelling of IFRS9 Default Probability. Oracle. Available online: https:
//www.oracle.com/us/industries/financial-services/multi-state-markov-model-wp-3432818.pdf (accessed on 19 November 2020).
Dib, Darine, and Khalil Feghali. 2021. Preliminary Impact of IFRS 9 Implementation on The Lebanese Banking Sector. Journal of
Accounting and Management Information Systems 20: 369–401. [CrossRef]
England, Peter D., and Richard J. Verrall. 2002. Stochastic claims reserving in general insurance. British Actuarial Journal 8: 443–18.
[CrossRef]
Glen, Stephanie. 2015. Autoregressive Model: Definition & the AR Process. Available online: https://www.statisticshowto.
datasciencecentral.com/autoregressive-model/ (accessed on 27 August 2019).
Global Public Policy Committee (GPPC). 2016. The Implementation of IFRS 9 Impairment Requirements by Banks: Considera-
tions for Those Charged with Governance of Systemically Important Banks. Global Public Policy Committee. Available on-
line: http://www.ey.com/Publication/vwLUAssets/Implementation_of_IFRS_9_impairment_requirements_by_systemically_
important_banks/$File/BCM-FIImpair-GPPC-June2016%20int.pdf (accessed on 10 April 2020).
Risks 2021, 9, 208 22 of 22
Glößner, Peter. 2003. Calculating Basel II Risk Parameters for a Portfolio of Retail Loans. Master’s Thesis, Kellogg College, Oxford
University, Oxford, UK.
Gubareva, Mariya. 2021. How to estimate expected credit losses–ECL–for provisioning under IFRS 9. Journal of Risk Finance 22: 169–90.
[CrossRef]
IFRS Foundation. 2014. IRFS9 Financial Instruments: Project Summary. Available online: http://www.ifrs.org/Current-Projects/IASB-
Projects/Financial-Instruments-A-Replacement-of-IAS-39-Financial-Instruments-Recognitio/Documents/IFRS-9-Project-
Summary-July-2014.pdf (accessed on 31 January 2016).
McPhail, Joseph, and Lihong McPhail. 2014. Forecasting lifetime credit losses: Modelling considerations for complying with the new
FASB and IASB current expected loss models. Journal of Risk Management in Financial Institutions 7: 375–88. Available online:
http://bit.ly/2qzrWAF (accessed on 8 September 2020).
Medema, Lydian, Rudd H. Koning, and Robert Lensink. 2009. A practical approach to validating a PD model. Journal of Banking &
Finance 33: 701–8.
SAS Institute. 2017. Development of Credit Scoring Applications Using SAS Enterprise Miner (SAS Course Notes: LWCSEM42). Cary: SAS
Institute. ISBN 978-1-63526-092-2.
Schutte, Willem D., Tanja Verster, Derek Doody, Helgard Raubenheimer, and Peet J. Coetzee. 2020. A proposed benchmark model
using a modularised approach to calculate IFRS9 expected credit loss. Cogent Economics & Finance 8: 1735681. [CrossRef]
Siddiqi, Naeem. 2006. Credit Risk Scorecards: Developing and Implementing Intelligent Credit Scoring. Hoboken: John Wiley & Sons.
Siddiqi, Naeem. 2017. Intelligent Credit Scoring: Building and Implementing Better Credit Risk Scorecards. Hoboken: John Wiley & Sons.
Taylor, Jeremy. 2003. Risk-grading philosophy: Through the cycle versus point in time. The Risk Management Association Journal 32–39.
Available online: https://cms.rmau.org/uploadedFiles/Credit_Risk/Library/RMA_Journal/Risk_Ratings/Risk-Grading%20
Philosophy_%20Through%20the%20Cycle%20versus%20Point%20in%20Time.pdf (accessed on 5 September 2020).
Tevet, Dan. 2013. Exploring model lift: Is your model worth implementing? Actuarial Review 40: 10–13.
Thomas, Lyn C. 2009. Consumer Credit Models: Pricing, Profit and Portfolios. Oxford: Oxford University Press.
T, urlea, Ioan-Codrut, . 2021. Development of Rating Models under IFRS 9. CECCAR Business Review 7: 64–72. [CrossRef]
Van Gestel, Tony, and Bart Baesens. 2009. Credit Risk Management: Basic Concepts. Oxford: Oxford University Press.
Wielenga, Doug, Bob Lucas, and Jim Georges. 1999. Enterprise Miner: Applying Data Mining. Cary: SAS Institute Inc.
Yang, Bill H. 2017. Point-in-Time PD Term Structure Models for Multi-Period Scenario Loss Projection: Methodologies and Implementations for
IFRS 9 ECL and CCAR Stress Testing. Paper No. 76271. Toronto: Munich Personal RePEc Archive (MPRA).