Credit Behavioral Model
Credit Behavioral Model
Overview
>
>
>
Borrower behaviour
Observation Period
Outcome?
Observation
Point
Outcome
Point
Calendar Time
3000
2000
1000
0
0
Balance
10
15
20
25
30
Months account open
Credit limit
Repayment
35
Debits
40
Example 6.2
Here is another case study of a credit card account. Notice this time, card
usage has some structure, but eventually the credit card holder defaults at
32 months.
3
1500
1000
1
500
0
0
0
Balance
10
Credit limit
15
20
Months account open
Repayment
Debits
25
35
30
Months
2000
>
Traditionally, behavioural models have been built using the same kind of
static models as are used in application scoring.
For example, typically, logistic regression is used.
The outcome of such a model is a behavioural score.
Since values of predictor variables change over the performance period,
aggregate values are used:
o eg mean, maximum or last values of variables over time.
The number of potential predictor variables can get quite large for
behavioural models, so automated variable selection may be required to
reduce their number.
9
>
Dynamic Models
>
Not only do dynamic models have the advantage that they can take account
of changes in credit use over time, they can be used as the basis of profit
estimation. This is covered in the next chapter.
10
>
11
12
>
>
Right-censored
Open
Closed
Open
Default
Calendar time
Start of
observation period
Right-censored
Uncensored and failure
Observation
date
14
Hazard Function
>
Pt T t t | T t
h(t ) lim
t 0
t
15
Example 6.3
Hazard rates for Default on a Store Card.
0.016
0.014
0.012
H
a 0.01
z 0.008
a
0.006
r
d 0.004
0.002
0
11
13
15
Time (months)
17
19
21
23
Probability of Default
>
17
f (t )
Pt T t t | T t
Pt T t t
h(t ) lim
PT t
lim
t 0
t
t
S (t )
t 0
where
Since
, where
since
,
]
. Therefore,
(
18
>
19
ht , xt , h0 t exp xt
20
Notice that the predictor variables are indexed by time. This means they
can change over time.
o They are called time varying covariates (TVCs).
o It is the availability of TVCs that enable us to include dynamic
behavioural data.
This model is a proportional hazards (PH) model since the hazard ratio
between two observations is constant over time when TVCs are not
included:
Hazard ratio
))
o However, this principle is no longer true when TVCs are included and
then PH is a misnomer!
21
>
It turns out that using just the second component is sufficient to get
estimates of .
This is called partial likelihood estimation.
The practical effect is that partial likelihood estimates have higher
standard errors than using MLE.
The probability that an observation fails at some time t, amongst all other
observations is therefore given by
ht , x i t ,
h
t
,
x
t
,
jR ( t )
exp x i (t )
exp x j (t )
jR ( t )
where R(t) is called the risk set and includes all observations that are
uncensored and have not failed by time t.
>
This gives the partial likelihood function for the Cox PH model:
n
l p ()
i 1
exp xi (t(i ) )
exp x j (t(i ) )
jR ( t( i ) )
ci
24
>
is needed.
25
>
There are different ways to estimate , but one approach that has been
suggested is to estimate the cumulative baseline hazard
))
Then
26
for all
))
27
>
28
Example 6.4
A behavioural model with default as failure event for a credit card data set
(
).
Coefficient estimates for a model with fixed application variables (AV) and
time varying monthly behavioural variables (BV).
Indicator variables are denoted by a plus sign (+).
Statistical significance levels are denoted by asterisks:
** is less than 0.001 and
* is less than 0.01 level.
Covariate
Selected AVs:
Estimate
-0.00250**
-0.146**
-0.0610**
-0.00129
29
Covariate
Employment + :
Self-employed
Homemaker
Retired
Student
Unemployed
Part time
Other
Excluded category: Employed
Age + : 18 to 24
25 to 29
30 to 33
34 to 37
38 to 41
48 to 55
56 and over
Excluded category: 42 to 47
Generic credit score
30
Estimate
+0.303**
+0.072
+0.111
-0.035
+0.231
-0.365**
-0.037
+0.074
-0.058
+0.010
+0.100**
+0.046
-0.108**
-0.243**
-0.00322**
Covariate
Estimate
Behavioural variables, lag 12 months
Payment status + :
Fully paid
Greater than minimum paid
Minimum paid
Less than minimum paid
Unknown
Excluded category: No payment
Current balance (log)
(log squared)
is zero +
is negative +
Credit limit (log)
Payment amount (log)
is zero +
is unknown +
Number of months past due
Past due amount (log)
is zero +
31
-0.390**
-0.090**
+0.149**
+0.714**
-0.148*
-1.58**
+0.517**
-1.05**
-0.802**
-1.22**
-0.154**
-0.133
-0.452**
+0.134*
+0.0795
-0.623**
Covariate
Estimate
Number of transactions
+0.00663**
Transaction sales amount (log)
-0.350**
is zero +
-0.567**
APR on purchases
-0.00487
is zero +
-0.482**
Behavioural data is missing +
-3.73**
32
150000
50000
140000
45000
130000
40000
120000
35000
AV & BV
lag 3
AV & BV
lag 6
AV & BV
lag 9
AV & BV
lag 12
AV only
Model
Model fit: - log likelihood ratio
33
Deviance .
-Log-likelihood
160000
Exercise 6.1
Interpret the association of each of these behavioural variables with the
default hazard rate in the model given in Example 17.1:
Payment status
Current balance
Credit limit
Number of months past due
Number of transactions
Transaction sales amount
34
Exercise 6.2
a) Let
be the hazard function at time . Show that the survival
probability is given by
(
).
b) A hazard function for default is given by
{
for some
and
. Suppose we want to ensure probability of
default at time is less than a given value
and is fixed. Then, what is
the inequality constraint on ?
35
-0.004
+0.20
0.002
0.121
1 (yes) or 0 (no)
+1.20
0.001
36
>
37
>
Markov transition models are especially useful for modelling revolving credit
with highly variable credit usage.
For instance, for tracking credit card use.
38
>
Some definitions:
Let
{
for all
and
We denote
and call this the transition
probability.
o The transition probability represents the probability of moving
from one state to another state .
The transition matrix
.
is defined as a
39
40
Therefore,
be the distribution of
Then, since
so that
41
]
.
Example 6.5
Consider a two state stationary Markov chain for behavioural score change
(state 1=high score, 2=low score) with transition matrix
(
>
.
43
where
}.
such that
and substitute
} { }
to get
[(
} { }
44
} { }
)]
Therefore,
where
and set to
But, the choice of is arbitrary so for consistency the result must hold
generally for all .
In particular, the MLE is
Notice that this result easily generalizes to the case when we have multiple
sequences of realizations (eg more than one borrower), so long as we
assume independence between each sequence.
45
Example 6.6
Consider three states (1=high score, 2=low score, 3=default) for a
stationary process. Transition probabilities are given as:
from a high score to a low score is 0.05;
from a low score to a high score is 0.1;
from a low score to default is 0.02.
It is impossible to move from high score to default. Also, it is impossible to
move out of default.
1.
2.
Solution
(
46
47
>
48
Example 6.7
Suppose we want to include credit usage, in terms of monthly spend in a
model for behavioural score (Low or High).
First discretize credit usage into levels:
eg three levels: monthly spend < 200,
Monthly spend
< 200
200 and < 1000
1000
< 200
200 and < 1000
1000
49
State
1
2
3
4
5
6
1000,
Example 6.8
Research suggests two broad categories of credit card usage: the movers
and stayers.
Movers are those whose credit card usage is erratic; having periods of
heavy credit card usage then quiet periods.
Stayers, by contrast, tend to be steady, and stay in the same state
over long periods.
We could build a static behavioural model to broadly categorize borrowers
into one of the two categories.
Then separate Markov transition models could be built separately for the
two segments.
50
Exercise 6.3
Three credit card account states are defined as
1 = Good; account being paid off fully;
2 = Minimum repayments in a month;
3 = Bad; minimum repayment is not made.
A mover account profile is then given as a sequence of states:
1,1,1,1,2,2,3,1,2,1,1,1,2,2,2,3.
A stayer account profile is then given as a sequence of states:
1,1,1,1,1,1,2,2,2,2,3,3,2,2,2,1.
1.Use maximum likelihood estimation to compute probability transition
matrices for both accounts for a first-order Markov transition model.
2.If each account is in state 2 in time , what is the probability that it will
move to state 3 in time
or
?
51
Roll-rate model
>
A roll-rate model is a type of Markov transition model but the focus is on the
number of accounts or value of loans that rolls over from one level of
delinquency to another over several months.
Consider states where 0 corresponds to no delinquency, states >0
correspond to increasing levels of delinquency and corresponds to
loan default with write-off.
Let be a vector of initial number of accounts or value of loans.
Let be a
transition matrix.
Then the vector of values in each state at month t is given by
52
Example 6.9
Let
Let
Let
.
, in GB.
(
).
) be January 2013.
Computation
State
0
50000
52500
53600
54033
54121
54020
1
10000
5250
3438
2684
2336
2157
53
2
5000
5750
4737
3637
2805
2244
3
1000
2500
4225
5646
6737
7579
Review of Chapter 6
>
54