Credit Risk Scorecard Monitoring
Credit Risk Scorecard Monitoring
Shailendra S
Introduction
Nowadays, Retail Banks are more focused on finding or discriminating the right
clients and the wrong ones (Defaulters). From a Credit Risk perspective a Good
Client will be a customer/applicant who has least chances to do default (a row risk
client) i.e. the applicant has low chances to perform default in his obligations. This
detection process of identifying or separating a Good & bad applicant/client is where
Credit Risk Scorecard comes into play. It is an automated application which help
banks to consistently assess each client in a shorter period of time ie. To detect his
chances of delinquency (Probability of Default). With integration of loan approval
I.T applications it help banks to speed up the loan application process, reducing the
man hours resulting in increasing in the productivity with complete transparency.
Problem
Financial institutions have started paying lot of attention in tracking the performance
of existing Scorecard Models. This tracking of PD models helps them in
understanding the population shift in their data and knowing the change in
delinquency pattern of users which will help them in improving their bad definition
and validating clients.
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 1/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
Investigation
Data Gathering: In order to define the factors for Scorecard Tracking Project, we
must first gather the data with all the required fields in a specific database
format. Parameters generally include the “Good” “Bad” definitions, establishing the
Development and Current windows, defining the data exclusions. The following are
the general Data Fields that are generally collected from applications from past two
to five years depending on the requirement:
● Accept/Reject Indicator
● Product code
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 2/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
● Emi Frequency
Shailendra S
● Emi Date
Information Gathering :
In order to perform this tracking analysis we need to gather data for accounts opened
during a specific time period and then monitor their performance for another specific
period of time and see whether there is any change in any delinquency pattern in
these two data. We need to create two data sets from the user database ie.
Development data, Recent Data. Based on these two data sets we will be performing
tracking analysis and will see whether current Bad definition still holds good for
Recent Data. Monitoring the scorecard is divided into Front-End and Back-End
analysis :
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 3/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
A good place to start this comparison is by checking how two populations are
distributed across the risk bands created through the scorecard. The following is a
representation for the of the shift in population among Development and Current
data. Here Current Apps %’ is the population distribution for the Current data and
‘Development Apps%’ is the population distribution for the Development data in
each score bands. This change in score distribution is measured by the Population
Stability Index, which for this analysis is 0.02, indicating little or no change in the
score distributions.
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 4/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
Shailendra S
The last column in the above table is what we care for. Index for each score band
which is calculated by taking the difference between Recent and development
applications (in %) and multiplying it with the log of ratio between current and
development.
The final value for the PSI i.e. 0.02 which is the sum of all the values of the last
column. Now the question is how to interpret this value? The rule of thumb for the
PSI is displayed below:
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 5/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
Shailendra S
In this case we have observed a slight movement from the lower and higher Score
bands, toward the middle score bands, however, the shift is immaterial. As we are
getting a desired PSI value according to the benchmark so we won't need a further
Character Analysis in this case. So, the Population Stability Index is one of the
metrics to keep a check on changing conditions – however, the idea is clear that one
has to capture robust metrics to keep a close look on the ever changing economic
winds to prevent a crash landing.
Characteristic Analysis:
Characteristic Analysis measures the change between development and the recent
population at a characteristic level. The change in distributions are added for each
characteristic to estimate the impact on the total score. It answers which variable is
causing a shift in population distribution. It compares the distribution of an
independent variable in the current data set to a development data set. It detects shifts
in the distributions of input variables that are submitted for scoring over time.
It helps us to determine which variable has the most influence in causing the model
score shift.In above case as the change at an overall score level was insignificant, the
change at a characteristic level will follow the same trend. If a scorecard constitutes 9
characteristics and a change in any of these 9 characteristic of more than 5 points is
deemed significant. In below example we will see how to perform a Characteristic
Analysis for one variable (Salary ) we can use the same methodology on the
remaining characteristics to detect the influencing variable
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 6/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
Shailendra S
Finally the summation of this score is used to detect whether there is any shift in this
variable or not. Salary has a score difference of 0.54, indicating little or no change.
Generally a characteristic with more than score difference of 5 is considered deemed
significant.
In the end a Final Decision report is prepared by analyzing accept rate for each score
band. This report is generated for both Development & Recent populations and the
trend in acceptance rate is compared using this final report. The Final Decision report
analyses accept rate by score band. The expected trend is accept rate increasing as
score increases, thereby minimizing the impact of riskier accounts on the portfolio
performance.
Vintage Analysis: Vintage Analysis in credit Risk models helps you to understand
the Maturity of a portfolio and to establish the independent variable. The independent
variable in credit risk modelling usually depends of the maturity and the default
point. For example, one of the standards in Basel II is to modeling the probability
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 7/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
that a client hit the 90 day past due during the next 12 months. The examination of
Shailendra S
loans by the period in which they were originated is knows a Vintage Analysis. In
addition of monitoring your portfolio’s performance we are also analyzing how loans
of different age are performing i.e how they performs over their life span. It tells
when the delinquency rate gets constant over a tenure so that we can set a trademark
or we can fix a period throughout which we gonna monitor each applicant. This
vintage curve is monitored for both the Development and Current data across all the
30,60,90 DPD and the change in curve is monitored across both the data. It helps in
knowing whether the current Bad definition holds for the current data or not. It
checks for the shift in delinquency pattern of applicants over the loan tenure.
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 8/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
Shailendra S
For example in this table we are comparing delinquency buckets of present month
with respect to previous month. In Row 1 11% of the applicants who were in current
(zero delinquency) bucket went to 0-29 DPD. In Row 2 78% applicants went from 0-
29 DPD to Current bracket (roll backward), 4% went to 30-59 DPD (roll forward) ie.
went into higher delinquency bucket. As we can see in the last column of the table
there is a “Backward roll rate” column so what does it infer??
In Row 3 we can see that 54% of the applicants who were in 30-59 DPD last month
rolled backward to previous DPD bucked ie. improvement in their delinquency.
While in Row 5 only 7% of the applicants went to previous buckets. So our
definition of bad ie. 90 DPD makes more sense. Conversely 0-29 or 30+ DPD will be
less significant as 78% and 54% of the customers respectively have the chances to
roll back to previous buckets thus confirming out Bad Definition of 90+ DPD as
correct.
We should be Comparing this analysis for both the Development and Recent
Population data and see if there’s any significant change in BAD definition or not ie.
the point of no return has changed or not .
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 9/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
Shailendra S
● The backward roll rate for 60-89 DPD for both the development (50%) and
recent (48%) populations are largely in line, however, there has been a deterioration
in the backward roll rate for the 90+ DPD from development (7%) to the recent (5%)
population (If the backward roll rate for the 90+ DPD segment improved
significantly for the recent population, the bad definition could potentially
be inappropriate)
● As the backward roll-rate has slightly deteriorated for the earlier delinquency
ranges and more for the 90+ DPD range, the chosen bad definition of 90+ DPD over
a 12 month window holds for both the development and recent populations
KS Statistics
This technique is mostly used to validate a PD model ie. its ability to discriminate
between Good and Bad customers giving us the predictive power of the existing
model. It is a point estimate and tells the score-band where the difference between
cumulative good and cumulative bad customers is maximum.
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 10/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
Steps :
2. Find the difference between Cumulative % Good and Cumulative % Good for
each decile.
Ideally KS value should be in first of the 3 Deciles and score lying between 40-70%.
Ginni Coefficient
The GINI co-efficient for measuring credit models also has values between 0 and 1.
A higher value means that a particular credit model can better discriminate among
good and risky borrowers. A value of 1 means that the model predicts perfectly, and
with certainty, which borrowers will repay and which borrowers will default. A value
of 0 means that the model is completely random, or in other words, it is the statistical
equivalent of a coin toss, resulting in a 50/50 probability of repayment or default for
each applicant.
Ginni is nothing but the ratio of the area under the curve (A) but above the line of
perfect randomness to the entire area above the line of perfect randomness (A+B). It
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 11/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
compares the Lorentz curve the cumulative distribution with the line of perfect
Shailendra S
randomness. See the graph below.
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 12/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
Shailendra S
From the above table and plotted graph we can make following observation:
● The Gini Coefficient value has slightly reduced from 56% at time of
development, to 51% for the recent population. A Gini of 51% is classified as very
strong for an application scorecard and hence the predictive power of the scorecard
will add value to the decision making process.
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 13/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
● The KS has reduced marginally from 43% to 41% which is still a good score for
Shailendra S
an application scorecard.
Conclusion
In this step-by-step guide to application scorecards tracking we have covered all the
steps in tracking application scorecards using PSI, Characteristic Analysis, showing
vintage analysis, rollrates, scorecard statistics etc. The following are the conclusions
that we have drawn from our above problem case:
● As the Population Stability Index does not indicated any significant change in
score distributions, so similarly, there are no characteristics indicating a shift in their
distributions. This is an optional step it is needed only when we have a PSI>.25
showing change in score distribution.
● Roll Rate analysis (Backward) showed us that our current default definition of
90+DPD still holds good for the recent population.
● A marginal decrease in the Gini coefficient was observed, from 56% at
development to 51% for the current period. A Gini of 51% is still considered strong
for application scorecards as any value below 30 represents a weak model. The KS
has reduced marginally from 43% to 41%.
However we need to continue with the scorecard monitoring to ensure stability of the
scorecard performance, specifically on population shift and certain characteristics for
closer monitoring going forward.
173 ·
26 Comments
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 14/16
2/18/22, 12:31 PM Credit Risk Scorecard Monitoring
Like Comment Share
Shailendra S
Gaurav Amrutkar
5mo
Gaurav
Great article
Amrutkar
Like
Reply
Anwesha D.
7mo
Anwesha
Amazing Post !!
D. Like
Reply
11mo
Pawan
Great
KumarLike
Reply
Agrawal
Deepanker A.
2y
Deepanker
Good explanation Shalendra!!
A.
Like
Reply
Raymond Anderson
2y
Raymond
The naming convention used here for PSI and CSI is standard, but confusing. They should
Anderson
be called the Score PSI and Characteristic PSI, as both are calculated on population for dif‐
ferent features. Regarding treatment where one of the categories is new, it is probably best
to drop that bin from the calculation and highlight that characteristic as an anomaly.
Alternatively, replace the zero with a very small value, or group it with the category that is
most like it.
Like
Reply
3 Likes
See more comments
© 2022 About
Shailendra
Privacy Policy S Cookie Policy
Language
https://www.linkedin.com/pulse/credit-risk-scorecard-monitoring-tracking-shailendra 16/16