Finalyse - A Practical Approach To Predicting The IFRS9 Macroeconomic Forward-Looking PD
Finalyse - A Practical Approach To Predicting The IFRS9 Macroeconomic Forward-Looking PD
A fresh take on risk and valuation
SUBSCRIBE
BACK
ARTICLE
A PRACTICAL APPROACH TO PREDICTING THE IFRS9
MACROECONOMIC FORWARD-LOOKING PD
23 OCT, 2022
Written by Long Le Hai, Consultant
Forward-looking information is a distinctive feature of an IFRS9 ECL model. Institutions are expected to update the macroeconomic
forecasts to reflect the current and future state of the economy. Therefore, it is critical to incorporate macroeconomic factors to
capture credit losses.
The estimated probability of default (PD) should include forward-looking information, whilst estimation of Lifetime PD should
include macroeconomic factors. Many macroeconomic factors can be used as inputs for forecasting Forward-Looking Information
PD for different economic scenarios.
Deflators
Employment
Labour Markets
Unemployment Rate
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 1/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
ECB rate
Interest Rate
10-year government bond
yields
Budget Deficit
Government
Output
Public Debt
Exchange Rates
Foreign Sector
Current Account
Incorporating economically stressed states of the economy impacts IFRS 9 PD the most. Macroeconomic PD is also known as
Point-in-Time (PIT) PD. PIT PD model has macroeconomic factors as independent variables, and the dependent variable is the 12-
month Observed Default Rate (ODR).
The 12-month ODR will be predicted based on multiple macroeconomic factors by solving a multivariate time-series problem. This
article outlines different methods to predict 12-month ODR; it will cover data collection, data transformation, feature engineering
and variable selection. The characteristics, functionalities as well as strength and weaknesses of each algorithm will be discussed.
Finally, this article will conclude with some closing remarks to determine the final model and the fit of these algorithms to predict
the 12-month ODR of Point-in-Time PD.
There are 2 main approaches: traditional multivariate time series and supervised machine learning. It is beneficial to apply data
transformation and feature engineering to raw time-series MF data. With the transformation, the dataset is tabular and labelled to
map inputs (MF) with associated outputs (ODR) so that machine learning algorithms can solve multivariate time-series problems.
From this point onward, 12-month ODR will be called PIT PD for simplicity.
The features should be normalized or standardized to improve the interpretability and functionality of learning algorithms. This step
applies a common scale to all the features and prevents distorting the differences in the ranges.
Lagging is the simplest and most popular way of transforming existing variables into new ones. It is possible to create various lag
features for a variable. Depending on expert knowledge, it can be useful to include lag features from last 6 months, 1 years, 2 years,
etc.
Other feature engineering techniques include Rolling Mean, Difference, and Quotient. These techniques will inevitably produce
missing values in the first few periods of the dataset. If the number of observations is sufficient, the first few rows that have missing
values can be removed to have a complete dataset.
Rolling Mean: the mean of a certain number of previous periods in a time series.
Difference: the difference between 2 periods.
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 2/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
Creating a combination of two or more input variables, such as ratios, e.g. HPI to CPI ratio, which could be useful for
modelling PIT PD for Mortgage portfolios;
Creating a combination of two or more already transformed variables, e.g. a ratio of current GDP to the long-period rolling
mean GDP, which can illustrate how a current state of economy compares to the TTC PD.
Example of Feature Engineering of EU Inflation Rate with Lag2, Difference2, Quotient3, RollingMean2 using Dplyr package in R:
Feature engineering allows one variable to be used in different contexts, which can be more appropriate in the modelling process. It
creates a larger set of potentially useful variables. For instance, Rolling_Mean(2) of the Inflation Rate captures the short-term trend.
However, the final model cannot include all features because of model overfitting and multicollinearity; the variable Selection step
is necessary.
Variable Selection aims to select the ‘best’ subset of predictors to include in the model. Both qualitative (expert knowledge) and
quantitative (statistical methods) processes can be applied to Variable Selection. Statistical variable selection is a critical step of
supervised learning regression. Multiple methods can be implemented in this process; for example, Ridge Regression, which can be
used for OLS, and VarImp (Variable Importance), which can be used for XGBoost.
Ridge regression is a modification of OLS - in this algorithm, the sum of squared estimate errors (SSE) is modified by adding a
penalty term called L2-norm. This penalty term is a sum of squared regression coefficients multiplied by a constant lambda.
The consequence of introducing this error term is that only the most significant variables are assigned a significantly non-zero
coefficient to minimise the overall error term. As a result, only these variables can be selected as input to the OLS algorithm.
Two important matters need to be handled before running Ridge Regression. Firstly, all the variables need to be standardised
to allow a direct comparison of the magnitude of their regression coefficients. Secondly, lambda hyperparameter needs to be
carefully selected; too large values will cause too many regression coefficients to shrink towards zero and too low lambda
values will reduce the penalty: when λ=0, the penalty term has no effect and Ridge Regression becomes OLS. Generalised
Cross Validation can be used to find an optimal lambda value.
VarImp is a function to measure the relative importance of predictors in tree-based machine learning models by averaging
Gini impurity across all decision trees. VarImp looks at the total decrease in node impurities from splitting the variables. A
variable is more important than others when it has a higher Mean Gini Decrease.
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 3/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
Term Memory). It is important to have a fruitful set of standardised or normalised features to improve algorithms' performance.
XGBoost and LSTM are black-box machine learning models. For regulatory requirements, the institution should note that there is a
trade-off between performance and interpretability when complex models are applied.
1.1 ARIMAX.
ARIMAX is an extension of ARIMA for the multivariate time-series problem. In other words, ARIMAX is ARIMA with the inclusion of
Exogenous Covariates. X term is added to ARIMA stands for “exogenous”. ARIMAX adds some extra outside variables to help
measure the endogenous variable. Auto.arima() function R will automatically select the optimal set of (p,d,q) that minimizes AIC or
BIC.
ARIMAX allows other covariates to be included in the model. It makes predictions of PIT PD using multiple covariates of
macroeconomic factors. A set of macroeconomic covariates acts as predictors to forecast PIT PD. It assumes that changes in
macroeconomic factors will impact the PIT PD.
1.2 VAR.
VAR stands for vector autoregression and is an extension of standard autoregressive models. It linearly combines all the predictors
values, and their lagged values with PIT PD. VAR makes prediction for all variables collectively. It can predict all variables of a time-
series in a single model. VAR treats macroeconomic factors and PIT PD the same and assumes that macroeconomic factors and PIT
PD influence each other. Thus, evolution of future PIT PD values can be obtained solely by using historical values of both PIT PD
and macroeconomic values in the calibration. The calibrated model can then produce future paths for both macroeconomic
factors and PIT PD without the need to use forecasted macroeconomic data.
Unlike ARIMAX, VAR predicts future values for macroeconomic factors and predicts PIT PD as a linear combination of forecasted
macroeconomic values.
2.1 OLS
OLS is a common technique to estimate the relationship between independent and dependent variables using a straight line. The
line minimizes sum of squared residuals between the actual and predicted values.
With Macroeconomic factors as the independent variables, PIT PD can be predicted using linear relationship. The best fit line of OLS
from estimating linear relationship between Macroeconomic factors and PIT PD will be used to forecast forward looking PIT PD
with a new set of macroeconomic inputs.
2.2 XGBoost
XGBoost stands for “Extreme Gradient Boosting”. It is an ensemble tree-based model and an extension of boosting technique.
Boosting builds multiple models; each model learns to fix the prediction errors of the prior model in the chain. XGBoost combines
both Boosting and gradient descent algorithms. XGBoost operates iteratively and converts previous weak learners to become a final
stronger leaner. It allows the optimization of loss functions using gradient descent. XGBoost can estimate both linear and non-linear
relationships between PIT PD and Macroeconomic factors.
LSTM is a Recurrent Neural Network based architecture. Characteristically, it learns to predict the future from sequences of past
data of variable lengths. It is useful as LSTM can memorise and learn the long term dependence of sequence data of
macroeconomic factors.
LSTM is a deep learning algorithm, and as such, it requires an initial definition of model hyperparameters such as batch size, epochs,
activation function, optimiser, etc. This initial choice can influence the final model outcomes. As it is a black-box model, it is hard to
assess how each choice influenced the outcomes. Similarly, it can be hard to explain why a given set of input values produced a
particular resulting value.
The table below compares the pros and cons of all algorithms in the article:
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 4/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
ARIMAX Easy to understand and interpret Data is too complex and there is no optimal solution
Limited variables with fewer hyperparameter for p and q.
Hard to put in production because of exponential
time complexity
OLS Simple to implement and easy to interpret the Assumption of linear relationship.
output coefficients. assumption of data independence.
Sensitive to outliers
Assumption of linear separability
XGBoost Efficient, high performance and accuracy Black box model, hard to interpret
Parameters can be tuned Overfitting
Time-consuming for tuning parameters
LSTM Powerful, high performance and accuracy Black box model, hard to interpret
Overfitting.
Computation considers historical information
Conclusion
Flowchart below summarizes end-to-end process of predicting Macroeconomic Forward-Looking PD of IFRS9:
This article introduced a practical and end-to-end approach to model Point-in-Time PD in a manner that includes Forward-Looking
Information for IFRS9 ECL calculation. Both traditional time series and supervised learning regression are applicable to achieve this
task. Input dataset usually needs to be transformed, and new features should be created to apply learning algorithms and to achieve
better performance. The variable selection process, including both quantitative and qualitative methods, should be applied in model
development to have a fruitful set of predictors. The final set of selected predictors serves as the inputs to algorithms such as
ARIMAX, VAR, OLS, XGBoost or LSTM, which were presented in this article. Knowing the characteristics and mechanisms of each of
these algorithms can be helpful in choosing appropriate algorithms for a specific problem. Walk-Forward Validation and regression
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 5/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
evaluation metrics (RMSE, MSE, MAE) are available to assess the performance of candidate models. Different techniques may
produce results of varying accuracy depending on a specific dataset. The ‘best’ model should be selected by accessing different
evaluation metrics and other aspects, e.g. related to model sensitivity.
TABLE OF CONTENTS
RELATED ARTICLES
11 JUN, 2020
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 6/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
MODEL VALIDATION
How confident are you that your risk models show what you need to see? Would your regulator agree?
Model Validation: are you sure your internal risk parameter estimates are adequate, robust and reliable?
Control your model risk with the right tool to develop and validate models, bridge the talent gap and reduce implementation risks
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 7/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 8/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 9/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 10/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 11/12
11/10/24, 4:42 PM Finalyse: A practical approach to predicting the IFRS9 Macroeconomic Forward-Looking PD
ABOUT
MANAGED SERVICES
CAREER
REGULATORY BRIEF
CONTACT
Finalyse Group
"The Artist" - Avenue des Arts, 9, B-1210 Brussels
Tel +32 2 537 43 73
[email protected]
©FINALYSE 2006 - 2024
https://www.finalyse.com/blog/a-practical-approach-to-predicting-the-ifrs9-macroeconomic-forward-looking-pd 12/12