Merged Document
Tanya Sandoval
Abstract
This report introduces the topic of cointegration and its application to trading strategies. By
modelling asset prices that share an economic link, it is sometimes possible to arrive at
a stationary spread whose properties can be used to reduce exposure to systematic risk. The
essential elements of cointegration, such as stationarity and mean-reversion, are discussed, as well
as some of the statistical tests available to detect this relationship. Simulated and real data
examples are provided, the latter focusing on a detected cointegration between Brent Crude and
Gasoil futures. Lastly, examples of simple trading strategies applying cointegration are discussed,
along with aspects that must be considered when trading under real market conditions.
Contents
1 Introduction
2 Datasets
  2.1 Simulated Data
  2.2 Real Data
3 Stationarity and Mean-Reversion
  3.1 Mean-Reversion
  3.2 Tests
4 Cointegration
  4.1 Tests
    4.1.1 Engle-Granger Two-Step
  4.2 Error Correction Model (ECM)
  4.3 Quality of mean-reversion
  4.4 Real Data Example
    4.4.1 Stationarity
    4.4.2 Cointegration
    4.4.3 ECM
    4.4.4 Quality of mean-reversion
  4.5 Granger Causality
5 Trading Strategies
  5.1 Regime changes
  5.2 Backtesting
  5.3 Naive Beta-Hedging Strategy
  5.4 Pairs Trading Strategy
    5.4.1 Bounds from OU process
    5.4.2 Optimal bounds
6 Conclusion
References
1 Introduction
Historically, cointegration as a concept arose from statistical evidence that many US macroeconomic
time series (such as GDP, wages and employment) did not follow conventional econometric theory
but rather were described by unit root processes, also known as “integrated of order 1”, I(1). Before
the 1980s many economists ran linear regressions on non-stationary time series, which Granger
and Newbold showed to be a dangerous approach that could lead to spurious regression results. For
integrated I(1) processes, they also showed that de-trending does not eliminate
the problem, and that the superior alternative is to check for cointegration; this line of work later
contributed to Granger, jointly with Engle, being awarded the Nobel prize.
This report summarises some first learnings of this concept and first steps at applying it to
trading strategies. In particular, the focus is on studying two assets from the commodities space
to see whether properties similar to those found in equities can be detected.
• Section 2 provides details of the datasets used throughout the report
• Section 3 introduces stationarity and mean-reversion in time series, which are key elements
of cointegration
• Section 4 goes into detail about cointegration and how to test for it, as well as assessing its
quality
• Section 5 is then dedicated to its application to trading strategies and assessing their performance
in terms of profit and loss (P&L)
• Appendix A summarises some of the mathematical methods involved, such as Multivariate
Regression, Autoregressive models (AR(p)), the Dickey-Fuller test, optimal lag order and stability
conditions
• Appendix B starts to examine cointegration in other energy commodities, such as Italian and
Dutch gas
All the relevant scripts to arrive at the results can be found in the project repository
“finalProject/TS ” in the attached USB drive. In particular, the ipython notebook
Coint Brent Gasoil v2.ipynb demonstrates how to run the code, which is omitted in this report for
brevity. The project repository is also available online on Github https://github.com/tsando/
CQF/tree/master/finalProject.
2 Datasets
2.1 Simulated Data
Stochastic processes are used to simulate asset prices. Monte Carlo (MC) techniques are used where
relevant, and random variables are drawn from the normal distribution N (µ = 0, σ = 1).
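As a minimal illustration, a random walk with drift can be simulated as follows (the drift value and seed are illustrative assumptions, not taken from the project's scripts):

```python
import numpy as np

# Illustrative sketch: random walk with drift, Y_t = alpha + Y_{t-1} + eps_t,
# with eps_t ~ N(0, 1). The drift and seed are assumptions for demonstration.
np.random.seed(42)
n_steps, alpha = 252, 0.05
eps = np.random.normal(0.0, 1.0, size=n_steps)
Y = np.zeros(n_steps)
for t in range(1, n_steps):
    Y[t] = alpha + Y[t - 1] + eps[t]
```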
2.2 Real Data
• The two series were then joined to produce a single dataset consisting of daily settlement
prices for Brent and Gasoil
• The dataset spans 1.5 ‘trading years’ (one trading year being ∼ 252 days). The period selected was
Jan-2014 to Dec-2014 for the in-sample testing and Jan-2015 to Jun-2015 for the out-of-sample
testing. This was because several sources recommend using one year of historic data
to estimate the cointegration parameters and trading the estimates for a 6-month period, given
that the parameters might change over time
• Dates with missing values after joining the two series were removed from the dataset
• Since Gasoil is traded in metric tons and Brent in barrels, the Gasoil series was divided by
7.45, which is the ICE conversion factor [5] (a sketch of these preparation steps is given below)
The figure below shows the resulting dataset (spanning both in-sample and out-of-sample periods),
where the two series indeed seem to be closely related, having very similar trends.
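A hypothetical sketch of the data preparation described in the list above (file and column names are assumptions, not the project's actual identifiers):

```python
import pandas as pd

# Hypothetical sketch of the data preparation; file and column names are
# assumptions, not the project's actual identifiers.
brent = pd.read_csv("brent.csv", index_col="Date", parse_dates=True)
gasoil = pd.read_csv("gasoil.csv", index_col="Date", parse_dates=True)

df = brent.join(gasoil, how="inner", lsuffix="_brent", rsuffix="_gasoil")
df = df.dropna()                     # drop dates with missing values
df["Settle_gasoil"] /= 7.45          # ICE tonnes-to-barrels conversion factor [5]
```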
3 Stationarity and Mean-Reversion
3.1 Mean-Reversion
A stationary series is mean-reverting if over time it drifts towards its long-term mean (the historical
equilibrium level). A popular model in this category is the Ornstein–Uhlenbeck (OU) process:

dYt = θ(µ − Yt)dt + σdWt

where θ is the speed of reversion, µ is the equilibrium level, σ the volatility and Wt is a Wiener
Process (Brownian Motion). In a discrete setting this states that the further away the process is
from the mean, the greater the ‘pull back’ towards it. This is in contrast to the random walk above,
which has ‘no memory’ of where it has been at each particular instant in time.
The figure below shows three OU processes with the same mean µ = 10 but different mean-reversion
speeds. Indeed, the one with the highest θ can be seen to revert to the mean first. Their
differences dYt are plotted as well, and these appear to become stationary significantly faster than
the process itself, almost insensitive to the speed θ. Therefore, if we are able to transform a time
series so that it is stationary and mean-reverting, we can design trading strategies using these properties
which are more independent of market effects. In a later section we shall see how the OU parameters
can be used to design entry/exit thresholds and also to assess the quality of mean-reversion.
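A minimal sketch of how such OU paths can be simulated with an Euler–Maruyama discretisation (the parameter values mirror the θ examples above but are otherwise illustrative):

```python
import numpy as np

# Euler-Maruyama discretisation of dY = theta*(mu - Y)dt + sigma*dW.
def simulate_ou(theta, mu=10.0, sigma=1.0, y0=0.0, n_steps=1000, dt=1.0, seed=0):
    rng = np.random.default_rng(seed)
    y = np.empty(n_steps)
    y[0] = y0
    for t in range(1, n_steps):
        dw = rng.normal(0.0, np.sqrt(dt))
        y[t] = y[t - 1] + theta * (mu - y[t - 1]) * dt + sigma * dw
    return y

# The three mean-reversion speeds used in the figure above.
paths = {th: simulate_ou(th) for th in (0.003, 0.01, 0.1)}
```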
3.2 Tests
We require a more robust method than visual inspection to confirm whether a series is stationary. Several
statistical tests exist, such as the Augmented Dickey-Fuller (ADF) test, the Phillips–Perron test,
the Hurst exponent, Kalman filters, etc. Here we only implement the ADF test; the mathematical
details can be found in Appendix A.1.1. The test regression used is

∆Yt = α + φYt−1 + φ1∆Yt−1 + · · · + φp∆Yt−p + εt

where a time-trend term has not been included due to the nature of financial time series. The
coefficients are estimated using the familiar linear regression (see Appendix A), whereas the optimal
lag order p is discussed below. The python script can be found in analysis.py. All the results were
validated against the popular python equivalents from the statsmodels library [9].
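For example, a sketch of running the statsmodels version of the test (here `series` stands for any of the simulated processes above):

```python
from statsmodels.tsa.stattools import adfuller

# ADF test with one lag and a constant (drift) term.
result = adfuller(series, maxlag=1, regression="c", autolag=None)
t_stat, p_value, crit = result[0], result[1], result[4]
print(f"ADF t-stat = {t_stat:.4f}, p-value = {p_value:.4g}")
print("5% critical value:", crit["5%"])
```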
Optimal Lag Selection Choice of lag order can be a difficult problem. Standard approaches
use an information criterion, such as the Akaike Information Criterion (AIC). However, different
methods can lead to different results. Also, keeping more lags can lead to model overfitting. In
practice, the choice of optimal lag is also evident from the Partial Autocorrelation Function [27]
(PACF), since the significant lags show above the confidence limits. Typically for the ADF test
it is enough to take p=1; however, in the interest of exploring this aspect, here we look at the results
using both AIC and PACF.
• AIC: Iterating over different lag orders, the one yielding the lowest AIC value is taken as the
optimal lag. For the simulated random-walk-with-drift and OU processes above the results
are summarised in the table below.
• PACF: Given that the optimal lag order from AIC comes out quite high in some cases, we
instead use the empirical results from the PACF plot below, where it can be seen that only the
first lag is well above the 95% confidence band (the first ‘spike’ represents p=0 ). Given this,
we therefore assume it is ‘safe’ to take p=1 to carry out the ADF test (see the sketch below).
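A sketch of inspecting the significant lags numerically with statsmodels (`series` as before):

```python
import numpy as np
from statsmodels.tsa.stattools import pacf

# PACF values outside the ~1.96/sqrt(n) band are significant at 95%.
values = pacf(series, nlags=20)
band = 1.96 / np.sqrt(len(series))
significant = [lag for lag, v in enumerate(values) if lag > 0 and abs(v) > band]
print("Significant lags:", significant)
```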
ADF Using p=1 as the optimal lag, we run the ADF test and compare the corresponding t-statistic
to the critical values (taken from statsmodels, based on MacKinnon (2010) [12]). The results are
summarised in the table below, where we confirm what we expected: the null hypothesis of non-stationarity
is rejected for all processes except Yt, which by definition is non-stationary given its drift.
Process θ ADF t-stat 5% Crit. Val. p-value Stationary Stable
Yt 0 0.1093 -2.8644 0.9667 No Yes
Yt1 0.003 -6.0020 -2.8644 1.65E-07 Yes Yes
Yt2 0.01 -8.7597 -2.8644 2.69E-14 Yes Yes
Yt3 0.1 -10.1133 -2.8644 9.87E-18 Yes Yes
dYt 0 -31.8476 -2.8644 0.0 Yes No
dYt1 0.003 -29.4587 -2.8644 0.0 Yes Yes
dYt2 0.01 -29.3639 -2.8644 0.0 Yes Yes
dYt3 0.1 -31.3506 -2.8644 0.0 Yes No
Stability Check To further ensure the reliability of the results, a stability check can be done on the
estimated coefficients by verifying that the associated eigenvalues lie within the unit circle (see Appendix A.1.3).
The results of the self-implementation are displayed in the table above. All cases were found stable,
except dYt and dYt3. This demonstrates that stationarity does not imply stability. The unstable
nature of dYt may be due to the added drift term, dYt = α + εt. A close inspection of the problematic
root of dYt3 shows it lies right on the boundary of the unit circle.
• Autocorrelation plot: This shows the autocorrelation function (ACF) at varying time lags.
For perfectly stationary series or iid random variables, the autocorrelations should be near
zero for all time-lag separations. The horizontal lines displayed in the plot correspond to the
95% and 99% confidence levels. Indeed, in the example below none of the dYt processes show
significant autocorrelation. Also, the OU process with the highest mean-reversion speed, Yt3,
rapidly loses autocorrelation to its lags, looking stationary at the higher-order lags.
• Lag plot: This is a scatter plot between the series Yt and one of its lags Yt−p. As in the
autocorrelation plot, a stationary series would not exhibit any relationship. Below an example
is shown for two of the OU processes and their differences. This confirms the results from the
autocorrelation plot: the fastest mean-reverting Yt3 has little relation to its 10th lag, unlike
Yt1, which has a lower speed and a clear relationship. As expected, the stationary differences
show little relationship to the lags, even for Yt−1.
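Both diagnostics are available as pandas plotting helpers [16, 17]; a sketch, with `y` a pandas Series of the process under inspection:

```python
import matplotlib.pyplot as plt
from pandas.plotting import autocorrelation_plot, lag_plot

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
autocorrelation_plot(y, ax=ax1)   # ACF with 95%/99% confidence bands
lag_plot(y, lag=10, ax=ax2)       # scatter of Y_t against Y_{t-10}
plt.show()
```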
4 Cointegration
Having covered the key concepts, we now proceed to deep-dive into the topic of cointegration. Two or
more time series Yt = (y1t, . . . , ynt)′ are said to be cointegrated if a linear combination exists which
makes the collection ‘integrated of order zero’ I(0), i.e. stationary:

β′Yt = β1y1t + β2y2t + · · · + βnynt ∼ I(0)

This is known as the long-run equilibrium model and is expressed in normalised form as:

y1t = β2y2t + · · · + βnynt + et

where et is referred to as the cointegrating residual or spread and β = (1, −β2, . . . , −βn)′ is the
cointegrating vector. In particular, et ∼ I(0), i.e. the spread is ‘integrated of order 0’ - another way
to say stationary. The concept can be extended to higher orders of integration, although these are
rarer.
In real data, cointegration usually exists when there is a deep economic link between the assets,
meaning they cannot drift too far apart because economic forces will act to restore the long-run
equilibrium. The figure below shows a simulated pair of cointegrated assets y1t and y2t, where y2t
is defined as an I(1) process. If y1t is supposed to have a strong link to y2t, the price of y1t should
vary similarly. This is simulated by shifting y2t up by a constant and adding some noise (residual) drawn
from a normal distribution, so y1t is defined as the dependent variable and y2t as the independent variable:

y2t = y2,t−1 + εt,    y1t = y2t + c + νt
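A minimal sketch of this construction (the shift c and the noise scale are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
n, c = 500, 5.0
y2 = np.cumsum(rng.normal(0.0, 1.0, size=n))   # I(1) random walk
y1 = y2 + c + rng.normal(0.0, 1.0, size=n)     # shifted copy plus noise
spread = y1 - y2                               # stationary around c by construction
```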
4.1 Tests
Again, we need a robust method to confirm cointegration. The three main statistical tests are:
• Engle–Granger two-step method: This can only be used to test a single cointegrating
relationship. The steps are:
(i) Estimate the cointegrating residual êt = β̂′Yt, e.g. using linear regression
(ii) Test êt for stationarity, e.g. using the ADF test, where the hypotheses to be tested are
H0: êt ∼ I(1) (no cointegration) against H1: êt ∼ I(0) (cointegration)
• Johansen test: Based on maximum likelihood techniques, this allows for more than one
cointegrating relationship, but it is subject to asymptotic conditions when the sample size is
too small
• Phillips–Ouliaris test: Uses a modified version of the Dickey-Fuller distribution to test the
cointegrating spread for stationarity. This is a better choice when dealing with small samples
4.1.1 Engle-Granger Two-Step
We apply the procedure to the simulated pair above: the cointegrating vector is first estimated by
OLS, and we then test êt for stationarity using ADF. Since the mean of êt is zero, the ADF can be
implemented without a constant or trend [1]. Also note that the critical values used are taken, as in
statsmodels, from MacKinnon (2010) [11], whereas other sources suggest using the Phillips-Ouliaris
tabulation. The figure below shows the fitted OLS result:
The estimated êt and its PACF are shown below. The PACF shows the spread is memoryless,
i.e. no lag order appears significant, as expected from a random process. Conservatively, the ADF
t-statistic can be computed with one lag. This is −12.051, which is below the 1% critical value
(−2.5748), confirming the stationarity of the spread, as expected.
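A sketch of the two-step procedure on the simulated pair, using statsmodels for both steps:

```python
import statsmodels.api as sm
from statsmodels.tsa.stattools import adfuller

# Step (i): estimate the cointegrating relationship by OLS.
X = sm.add_constant(y2)
ols = sm.OLS(y1, X).fit()
e_hat = ols.resid                    # cointegrating residual (spread)

# Step (ii): ADF on the residual; no constant/trend since its mean is ~0.
adf = adfuller(e_hat, maxlag=1, regression="n", autolag=None)
print(f"ADF t-stat = {adf[0]:.4f}, 1% crit. value = {adf[4]['1%']:.4f}")
```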
4.2 Error Correction Model (ECM)
Cointegration implies the existence of an Error Correction Model (ECM), which provides an ad-
justment to the long-run equilibrium from the short-run dynamics. This is particularly useful when
modelling non-stationary series, like market prices, which can lead to spurious regression results.
Suppose the cointegrated pair is represented by Yt = (yt, xt)′. One can arrive at the ECM result
as follows:
• Consider a dynamic regression model to allow for a wide variety of dynamic patterns in the
data. This is done by including lags for both xt and yt:

yt = αyt−1 + β0 + β1xt + β2xt−1 + εt (11)

• Knowing that the above equation should be consistent with the long-run equilibrium model
yt = b0 + b1xt + et, it can be re-written as:

∆yt = β1∆xt − (1 − α)et−1 + εt (12)

where et−1 is the lagged cointegrating spread from the equilibrium model. The parameter
−(1 − α) can be interpreted as a speed of correction towards the equilibrium level (see next
section)
• Since all the variables in the ECM are I(0), OLS can be used to estimate the parameters
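A sketch of this estimation, reusing the spread ê from the Engle-Granger step (variable names as in the earlier sketches):

```python
import numpy as np
import statsmodels.api as sm

# Regress dy_t on dx_t and the lagged spread e_{t-1}; all terms are I(0).
dy = np.diff(y1)
dx = np.diff(y2)
e_lag = e_hat[:-1]                                 # e_{t-1}

X = sm.add_constant(np.column_stack([dx, e_lag]))
ecm = sm.OLS(dy, X).fit()
beta1, speed = ecm.params[1], ecm.params[2]        # speed ~ -(1 - alpha)
```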
4.4 Real Data Example
We now move on to apply the concepts to real data, which will highlight some of the challenges
that can be encountered.
• We use the in-sample dataset for Brent crude and Gasoil (Jan-2014 to Dec-2014) to estimate
the cointegration parameters. The out-of-sample dataset is later used to backtest trading
strategies
• For now we assume Brent is the independent variable ‘xt’ and Gasoil the dependent variable
‘yt’, and later check whether this is accurate via Granger's causality test
4.4.1 Stationarity
First we check the individual price series for a unit root. We apply the ADF test using one lag only, as
recommended by their PACF plots below. A drift term (constant) is also included in the test. The
results are summarised in the table below. These support the assumption that Brent and Gasoil
are I(1) processes whilst their differences are I(0).
4.4.2 Cointegration
We then proceed to test whether the pair is cointegrated, using the Engle-Granger two-step proce-
dure described in Section 4.1.1.
The long-run equilibrium model is shown in the figure below. The OLS estimate was:

Ygasoil = 0.9699 Xbrent + 16.3229 + êt
The estimated cointegrating spread êt, its distribution and PACF are shown in Figure A. The
mean of êt was found to be zero for all practical purposes (< 10−13).
• We also want êt to be normally distributed. The histogram shows the spread distribution
fitted to a normal distribution with µ ∼ 0 and σ = 1.66. Two tests (Lilliefors and Anderson-Darling
[13, 18]) were carried out to assess the goodness of fit and neither rejected the null
hypothesis that êt is normally distributed
• Next, we note that the PACF plot shows a peculiarity of this spread - it seems to have an
AR(3) memory, given that the first three lags appear significant. This memory order is not
desirable when modelling with the OU process, which is an AR(1) process. It could have an
economic explanation, e.g. it could represent cyclicality or storage effects (the asset cannot
drop dramatically because there are storage/delivery carry costs borne by the sellers). Or it
could come from mere fluctuations in the data for this particular dataset, which only spans 1
trading year.
• Next, êt was tested for stationarity using the CADF test. Since the mean is ∼ 0, the test was
implemented without a constant or trend. Also, as suggested by the PACF plot, the test was
run with 3 lags. The results below show stationarity could only be confirmed at the 95% C.L.
and not at the 99% C.L.
Series   ADF t-stat   5% Crit. Val.   1% Crit. Val.   p-value   Stationary (95% C.L.)   Stable
et       -2.2335      -1.9421         -2.5746         0.0245    Yes                     Yes
4.4.3 ECM
The corresponding ECM adjustment was determined to be:

∆yt = −0.0861 + 0.6455∆xt − 0.1623et−1 + εt   (with constant fit, R² = 0.407)
∆yt = 0.6596∆xt − 0.1633et−1 + εt   (no constant fit, R² = 0.422)   (19)

This represents the short-run adjustment to the price, which, as we can see, can be significant.
Figure A
4.4.4 Quality of mean-reversion
Fitting the spread to an OU process gives the following equilibrium mean and standard deviation:

µe ± σeq = −0.1255 ± 1.6541   (AR(1) fit)
µe ± σeq = −0.0529 ± 0.8820   (AR(3) fit)   (20)

The AR(3) fit yields a lower θ standard error than AR(1), although not by much¹, and yet the θ
values are quite different. This perhaps indicates the OU fit is not suitable regardless of the lag
order.
¹ To calculate the standard error, the simplified formula for error propagation in [22] was used.
4.5 Granger Causality
To check the assumed direction of causality, Granger causality F-tests were run in both directions
for 1 to 3 lags. The p-values are summarised below; the null hypothesis that Brent does not cause
Gasoil is strongly rejected, whereas the reverse is not, supporting the earlier choice of Brent as the
independent variable:

H0                           1 lag     2 lags    3 lags
Brent doesn't cause Gasoil   0.0005    0.0001    <10^-4
Gasoil doesn't cause Brent   0.2315    0.2986    0.4797
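A sketch of running these tests with statsmodels [15] (the differenced series names are assumptions; the second column is tested for Granger-causing the first, and the series must be stationary, hence the differences):

```python
import numpy as np
from statsmodels.tsa.stattools import grangercausalitytests

# H0: Brent (second column) does not Granger-cause Gasoil (first column).
data = np.column_stack([dy_gasoil, dx_brent])
results = grangercausalitytests(data, maxlag=3)
```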
5 Trading Strategies
Below we describe a few simple strategies which exploit the cointegration properties discussed so
far. The first step is to construct a stationary portfolio of interrelated assets. For
the demonstrations below, however, we assume this portfolio is limited to the two assets from before:
Brent and Gasoil.
5.2 Backtesting
Backtesting is the process of testing a trading strategy or model on historical data to gauge its
effectiveness. When we ‘backtest a model’, the results achieved are highly dependent on the tested
period, and this can cause the strategy to fail in the future due to regime changes or model overfitting.
To alleviate this, the data is usually split into in-sample and out-of-sample sets (a ‘rule of thumb’
is to use an 80-20 split). The model/strategy is then fitted to the in-sample data only and
then tested on the out-of-sample data, which was ‘unseen’. This process provides a better way to
assess the true performance of a strategy.
5.3 Naive Beta-Hedging Strategy
Consider the regression ∆Y = α + β1∆X1 + · · · + βn∆Xn + εt. We can interpret the betas as the
exposure of asset Y to the other assets and α as the market-neutral excess return:

α = E[∆Y − β1∆X1 − β2∆X2 − · · · − βn∆Xn] (22)

since E[εt] = 0. For the case of Brent and Gasoil only this would be:

α = E[∆Ygasoil − β∆Xbrent]

i.e. we could take a short position in Brent equal to β∆Xbrent to try to eliminate the risk in our
Gasoil position. On average, we would expect our returns to equal α. For the in-sample data,
running the above regression actually yields negative returns, although less severe than just going
long on Gasoil:

E[∆Y^is_gasoil − 0.6282∆X^is_brent] = α = −0.0915   (R² = 0.354)
E[∆Y^is_gasoil] = α = −0.2148   (no fit)   (25)
Re-estimating the regression out-of-sample shows that the estimated beta is not constant as we walk
forward in time. As such, the short position we took out in Brent does not perfectly hedge our
portfolio. Another possibility is that additional factors or assets should be included in the model.
Surprisingly, our performance out-of-sample seems better, but that was mere ‘luck’ from positive
fluctuations in the data, and it is still worse than just going long on Gasoil over this period:

E[∆Y^os_gasoil − 0.6282∆X^os_brent] = E[αt] = 0.02231   (in-sample fit)
E[∆Y^os_gasoil] = E[αt] = 0.05806   (no fit)   (26)
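A sketch of this naive hedge check (the in-sample/out-of-sample series names are assumptions for the daily price changes):

```python
import statsmodels.api as sm

# Estimate beta in-sample, then apply it to the out-of-sample changes.
fit = sm.OLS(dY_gasoil_is, sm.add_constant(dX_brent_is)).fit()
alpha, beta = fit.params                       # beta ~ 0.6282 in-sample

hedged_os = dY_gasoil_os - beta * dX_brent_os  # short beta units of Brent
print("mean hedged out-of-sample return:", hedged_os.mean())
```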
5.4 Pairs Trading Strategy
The pairs trade is built on the in-sample long-run equilibrium fit:

Y^is_gasoil = 0.9699 X^is_brent + 16.3229 + ê^is_t (27)

Under the assumption that the relationship holds, we use the same beta and constant to estimate
the out-of-sample spread:

ê^os_t = Y^os_gasoil − 0.9699 X^os_brent − 16.3229 (28)

This is then what we use to backtest the strategy.
5.4.1 Bounds from OU process
The spread is normalised into a z-score using the OU equilibrium parameters:

z = (êt − µe) / σeq (29)

The trading rules are:
• Go “Long” the spread when the z-score is below −1.0, as the expectation is that it will rise
(this means we long Ygasoil and short Xbrent)
• Go “Short” the spread when the z-score is above 1.0, as the expectation is that it will fall
(this means we short Ygasoil and long Xbrent)
• Exit positions when the z-score is near zero, e.g. within [−0.1, 0.1]
The figure below shows the out-of-sample z-score spread with the AR(3) OU entry/exit bounds
marked.
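A minimal sketch of the z-score rule above (position sizing and transaction costs are ignored, and the P&L proxy below trades the spread directly rather than the two legs):

```python
import numpy as np

z = (e_hat_os - mu_e) / sigma_eq
position = np.zeros_like(z)              # +1 = long spread, -1 = short spread
for t in range(1, len(z)):
    if position[t - 1] == 0:
        if z[t] < -1.0:
            position[t] = 1              # long Gasoil, short Brent
        elif z[t] > 1.0:
            position[t] = -1             # short Gasoil, long Brent
    elif abs(z[t]) <= 0.1:
        position[t] = 0                  # exit near the mean
    else:
        position[t] = position[t - 1]    # hold
pnl = np.cumsum(position[:-1] * np.diff(e_hat_os))
```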
The corresponding P&L is shown in Figure B for both the in-sample and out-of-sample periods.
A positive performance is observed throughout, although with prolonged drawdown periods. In
addition, note this does not include transaction costs and other market effects, which could
certainly add up to an overall loss. The P&L using the AR(1) bounds is also shown, and it
clearly performs worse than AR(3). Both, however, are better than having simply gone long in
Gasoil or Brent - this is taken as the market ‘benchmark’. Given the positive results, it would
be interesting to see the P&L from extending to more than 2 assets cointegrated with oil, with a
strategy that trades multiple cointegrating spreads at the same time, or a single spread composed
of more factors. In this case, the Johansen test would be more suitable, although this is not
covered here.
Figure B
6 Conclusion
Through this project I learned for the first time how to apply cointegration to time series data and
to design trading strategies around it that could potentially generate business value. The P&L obtained
from trading a simple Brent-Gasoil spread out-of-sample was surprisingly good, given the
low performance one often finds already at this step. However, I feel the validity of the results for real
trading is debatable due to the following:
• Transaction costs and market effects weren't really considered, and these could have a large
effect given the high number of positions taken (see position plot above)
• The AR(3) memory in the Brent-Gasoil spread wasn't well understood, and it wasn't clear
whether this was a problem with the data or a true economic effect
• A thorough data quality assessment on the Quandl dataset wasn't done, and this dataset had some
adjustments applied (e.g. roll corrections) which would need to be understood
• The strategy wasn't backtested over other out-of-sample periods to see if the ‘good performance’
holds
In addition, given the time limitations it wasn't possible to deep-dive into some interesting aspects
which I would have liked to understand better, such as:
• Designing trading strategies for more than 2 assets in the commodities space. In particular,
it would be good to gain some experience with the Johansen procedure (currently not
implemented in python's statsmodels library)
• Optimisation of P&L metrics, like the Sharpe ratio, and of the trading bounds
• Producing additional plots and metrics to assess the P&L, such as those found in the Pyfolio
package [20]
Regardless, cointegration should not be assumed to always yield meaningful results, especially when
working with real data. Some frequent issues to be aware of are:
• Statistical tests may find cointegration when there is actually none. Moreover, the tests are
quite sensitive to how the regression is formulated, e.g. including a deterministic trend or
drift term could yield a different result, as could a different number of lags. Conversely, tests
can also fail to detect it
• Cointegration can be a temporary effect, and testing needs to be done constantly to confirm
the stationarity of the spread; yet which period to use for testing is somewhat arbitrary
• The ‘long-term equilibrium’ is a relative concept and there is no ‘best’ way to define it
• Using cointegration for forecasting can lead to large errors, especially when forecasting series
which aren't stationary
References
[1] Modeling Financial Time Series with S-PLUS - Chapter 12 - Cointegration by Zivot, E. and
Wang J. (2006)
[3] Learning and Trusting Cointegration in Statistical Arbitrage, Diamond, R., Wilmott Magazine
(2013)
[6] https://www.theice.com/products/34361119/Low-Sulphur-Gasoil-Futures
[7] https://www.theice.com/products/219/Brent-Crude-Futures
[8] https://www.quandl.com/data/SCF/documentation/about
[9] http://statsmodels.sourceforge.net/
[11] http://statsmodels.sourceforge.net/stable/generated/statsmodels.tsa.stattools.adfuller.html#statsmodels.tsa.
[12] http://statsmodels.sourceforge.net/stable/generated/statsmodels.tsa.stattools.adfuller.html#statsmodels.tsa.
[14] http://matthieustigler.github.io/Lectures/Lect2ARMA.pdf
[15] http://statsmodels.sourceforge.net/stable/generated/statsmodels.tsa.stattools.grangercausalitytests.html#sta
[16] http://pandas.pydata.org/pandas-docs/stable/visualization.html#autocorrelation-plot
[17] https://github.com/pydata/pandas/blob/master/pandas/tools/plotting.py
[20] https://github.com/quantopian/pyfolio/blob/master/pyfolio
[27] http://nl.mathworks.com/help/econ/autocorrelation-and-partial-autocorrelation.html
Appendix
A Multivariate Regression
Also known as the ‘general linear model’, it generalises linear regression to multiple input variables
(regressors) and n observations. It is best expressed in matrix form as:

Y = Xβ + ε (30)

The OLS coefficient estimates and residuals are:

β̂ = (X′X)⁻¹X′Y (31)
ε̂ = Y − Xβ̂ (32)

The residual variance is estimated as σ̂² = scale · ε̂′ε̂, where the scale equals 1/n using the MLE estimator or 1/(n − kp) using the OLS estimator for a
model with k variables and p lags.
The covariance matrix for the coefficients is:

Cov(β̂) = σ̂²(X′X)⁻¹

Two families of time series models built on this regression machinery are used in this report:
• Autoregression Models - these can be used to forecast and to test or model stationary time series
• Error Correction Models - these are used to model series which aren't stationary or that have
stochastic trends, like prices
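A sketch of equations (31)-(32) in numpy:

```python
import numpy as np

# OLS via the normal equations; X is the (n x k) design matrix (first
# column of ones for the intercept), Y the (n,) observation vector.
def ols(X, Y):
    beta_hat = np.linalg.solve(X.T @ X, X.T @ Y)    # (X'X)^-1 X'Y
    resid = Y - X @ beta_hat
    sigma2 = resid @ resid / (len(Y) - X.shape[1])  # OLS scale 1/(n - k)
    cov_beta = sigma2 * np.linalg.inv(X.T @ X)
    return beta_hat, resid, cov_beta
```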
A.1 Autoregression Models - AR(p)
Also referred to as AR(p), where p is the lag order, this is simply a linear regression of a time series
on its lagged (past) values:

Yt = c + φ1Yt−1 + · · · + φpYt−p + εt (36)

where c is a constant (also known as the drift term), φi are the parameters of the model and εt is the
error term. Whether to exclude the constant c or not depends on the nature of what we are trying
to model. Computationally, the model can be fitted in one go by using the OLS method described
above with a special matrix formulation. Example code for this can be found in analysis.py in the
project repository.
An AR(p) system can be re-written in terms of differences, which is how it is commonly expressed
in some cases, for example in the ADF test. See more details in references [24] and [23].
A.1.1 Dickey-Fuller Test
Consider the AR(1) model:

Yt = βYt−1 + εt (37)

If β = 1 the series is said to have a ‘unit root’ and hence is non-stationary. The equation can be
re-written as:

∆Yt = (β − 1)Yt−1 + εt = φYt−1 + εt (38)
where φ = β − 1. Hence, testing for a unit root is equivalent to testing φ = 0. The value of the test
statistic φ̂/std.err(φ̂) is then compared to the relevant critical values of the Dickey-Fuller distribution.
If found lower, then the null hypothesis φ = 0 is rejected and the series can be considered stationary.
There are three main versions of the test, depending on whether drift and/or time-dependent terms
are included:

∆Yt = φYt−1 + εt   (no drift, no trend)
∆Yt = α + φYt−1 + εt   (drift)
∆Yt = α + γt + φYt−1 + εt   (drift and time trend)
Each version of the test has its own critical values, which depend on the size of the sample.
Which version to use is not straightforward and the wrong choice can lead to a wrong result. In
general, financial time series exclude the time-trend. There is an extension of the test, referred to as
the Augmented Dickey–Fuller (ADF) test, which removes autocorrelation effects by including lagged
difference terms φp∆Yt−p. The optimal lag order can then be determined from an information
criterion (see below).
As this belongs to the family of general linear models, the parameters can be estimated
using the multivariate regression described above. Additional details on this topic can be found in
references [25], [26].
A common rule of thumb is to try up to a maximum lag order of 12 · (n/100)^{1/4}, where n is the number of observations. There
are different definitions of the AIC in use - we use the same as in statsmodels [10], which defines it
differently for the AR(p) model and the ADF test:

AIC = log|Σ̂| + 2(1 + k)/n   (AR(p) model) (39)
AIC = −2 log(L) + 2k   (ADF test) (40)

where k is the number of estimated parameters. Other information criteria can be used; see for
example reference [21].
B Cointegration between Italian and Dutch Gas
A similar study was started using the price series of Italian (PSV) and Dutch (TTF) gas futures;
however, due to time limitations this wasn't finished. In any case, one could already see from
the spread plot that the quality wasn't as good as that of Brent-Gasoil (subject to a data quality
assessment). Some of the related plots are shown below. Note that the spread memory noticed in
Brent-Gasoil seems more severe in this case.
Credit Valuation Adjustment for an Interest Rate Swap
Tanya Sandoval
July 24, 2016
Abstract
In this report we demonstrate some of the techniques currently used to calculate Credit Valuation
Adjustments for fixed income instruments. We take the hypothetical scenario of an Interest Rate Swap
entered into by two counterparties. Through the simulation of forward rates using the HJM model calibrated on recent
data, and of default probabilities using Credit Default Swap spreads, we arrive at the fair price of the risk
taken by counterparty A in entering the swap with counterparty B.
Contents
1 Introduction
2 Default Probabilities
  2.1 CDS bootstrapping
  2.2 Forward LIBORs
    2.2.1 PCA
    2.2.2 HJM
3 Discount Factors
4 Credit Valuation Adjustment
5 Conclusion
References
1 Introduction
We wish to model the scenario of an Interest Rate Swap (IRS) entered into by two counterparties ‘A’ and ‘B’.
The IRS is assumed to be written on a 6M LIBOR L6M variable rate, with notional N = 1. To calculate
the Credit Valuation Adjustment the main inputs are:
• the default probabilities of counterparty B (Section 2)
• the simulated forward LIBOR rates, from which the swap exposure is derived (Section 2.2)
• the discount factors (Section 3)
Below we discuss each of these aspects separately. All the relevant scripts and spreadsheets to arrive
at the results can be found in the project repository “finalProject/CVA” in the attached USB drive. In
particular, the ipython notebook CVA.ipynb demonstrates how to run the code, which is omitted in this
report for brevity. The project repository is also available online on Github https://github.com/tsando/
CQF/tree/master/finalProject.
2 Default Probabilities
The default probabilities can be implied from Credit Default Swap (CDS) spreads using the bootstrapping
technique.
• The counterparty B in this case was chosen to be an airline company - AirFrance (“AIRF”)
• The CDS spreads were taken from Reuters on 27-Jun-2016 and are assumed to apply also for the period
31-May-2013 to 31-May-2016, which was used to calibrate the HJM model (see below)
• The Recovery Rate is assumed to be 40%
• Linear interpolation is used to approximate the CDS spreads at the half increments for which there
was no market data available
• For consistency and simplicity, the discount factors used were the same as those implied from the HJM
model after taking their average for each tenor (see methodology below)
• The full calculation can be found in the project repository file “CVA/my CDS Bootstrapping v2.xlsx ”
The table below summarises the results for each of the tenors [0.0, 0.5, . . . , 5.0], with ‘DF’ as the discount
factor, ‘Lambda’ as the hazard rate λi, ‘PD’ as the default probability P D(Ti, Ti−1) = P (Ti−1) − P (Ti) and
‘P’ as the survival probability. Note that by definition P D is over a period, whereas P is cumulative.
Tenor   CDS       DF         Lambda    PD        P
0.0 NaN 1.000000 nan% nan% 100.0000%
0.5 114.400 0.995835 1.8976% 0.9443% 99.0557%
1.0 133.770 0.990963 2.2197% 1.2510% 97.8047%
1.5 167.180 0.985697 2.7798% 1.8887% 95.9161%
2.0 200.590 0.980105 3.3452% 2.3875% 93.5285%
2.5 233.965 0.974101 3.9174% 2.8579% 90.6707%
3.0 267.340 0.967832 4.4994% 3.2974% 87.3732%
3.5 296.545 0.961232 5.0170% 3.4776% 83.8957%
4.0 325.750 0.954239 5.5471% 3.7947% 80.1009%
4.5 353.200 0.946899 6.0576% 3.9605% 76.1405%
5.0 380.650 0.939187 6.5842% 4.1914% 71.9490%
The figures below show the term structure of the estimated default probability and hazard rates. As
these aren't flat, this needs to be accounted for in the CVA.
The plot below shows how the cumulative distribution for the survival probability P decreases with time,
and vice versa for the cumulative P D since by definition this is equal to 1 − P .
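A simplified sketch of the bootstrap recursion under piecewise-constant hazard rates (the usual closed-form recursion; an illustration, not the exact spreadsheet logic in my CDS Bootstrapping v2.xlsx):

```python
import numpy as np

# Spreads in basis points, semi-annual tenors (dt = 0.5), recovery R = 40%.
def bootstrap_survival(spreads_bps, dfs, dt=0.5, R=0.4):
    L = 1.0 - R
    P = [1.0]                                   # survival at T_0 = 0
    for n in range(1, len(spreads_bps)):
        S = spreads_bps[n] / 1e4                # spread for maturity T_n
        if n == 1:
            P.append(L / (L + dt * S))
        else:
            # premium leg = protection leg up to T_n; solve for P(T_n)
            acc = sum(dfs[i] * (L * P[i - 1] - (L + dt * S) * P[i])
                      for i in range(1, n))
            P.append(acc / (dfs[n] * (L + dt * S)) + P[n - 1] * L / (L + dt * S))
    return np.array(P)
```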
2.2 Forward LIBORs
We then proceed to simulate the forward rates with the HJM model. To give a more realistic picture of the
current CVA value, the HJM model was first calibrated to recent data. The steps taken are described in
detail below.
2.2.1 PCA
Dataset The HJM model requires a set of volatility functions which are estimated using Principal Com-
ponent Analysis (PCA) on historical forward rates.
• To calibrate these functions to recent data, the forward rates were taken from the Bank of England
(BoE) Bank Liability Curve (BLC) for the last 3 years (31-May-2013 to 31-May-2016). The data was
taken from:
– ukblc16 mdaily.xlsx
– ukblc05 mdaily.xlsx
• Although we only need the short end of the curve for the CVA calculation, we take the full curve to
calibrate the HJM model, i.e. up to the 25Y tenor. The dataset is hence constructed by taking the BLC
forward curve short-end data (“1. fwds,short end” tab) for the tenors [0.5Y, 1.0Y, 1.5Y, ..., 5Y ] as it
offers a better approximation to the short end. For the remaining tenors [5.5Y, ..., 25.0Y ] we use the
full approximation (“2. fwd curve” tab)
• The forward rate for the tenor 0.08Y in “1. fwds,short end” is used as a proxy for the spot rate
tenor 0.0Y , i.e. r(t) = f (t; t)
• Dates with missing data values were removed
• The resulting dataset is found in the project repository under CVA/PCA/my ukblc 310513 310516.xlsx
The figure below shows the forward curve for four sample dates spanning different years in the dataset.
This shows how the curve can move for each of the tenors, for example, at the long-end of the curve the
rates have decreased substantially in 2016 compared to 3 years ago. At the short-end of the curve we see
the rates tend to increase with tenor as expected.
The next figure shows the historical evolution of the rates in the dataset for 3 example tenors. As
mentioned, the 0.08Y tenor is a proxy for the spot rate. This shows the spot rate has remained quite
constant throughout. The rate increases for the shorter tenors but then decreases for the longer tenors, e.g.
25Y vs 10Y below. This could be interpreted as a lack of liquidity at the longer tenors.
Principal Components To obtain the volatility functions for the HJM model, the day-on-day changes
(differences) for each tenor are calculated. This produces a set of independent random variables that can be
used to get the principal components (PC), taken as the 3 largest eigenvalues and corresponding eigenvectors
that explain 97.5% of the observed variance. The calculation details can be found in the project repository
under “PCA/my HJM Model - PCA.XLSM ”. The results are summarised below:
The figure below shows the resulting eigenvectors of the principal components. In general, the largest
component (PC1) is attributed to parallel shifts in the curve, the 2nd largest (PC2) to steepening/flattening
(skewness) and 3rd largest to bending about specific maturity points (convexity).
We now proceed to use these eigenvectors to obtain the volatility functions for the HJM model.
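A sketch of this PCA step in numpy (`rates` is assumed to be a days × tenors array of the historical forward rates):

```python
import numpy as np

# Eigen-decompose the covariance of day-on-day forward-rate changes and
# keep the 3 largest components.
diffs = np.diff(rates, axis=0)
cov = np.cov(diffs, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)        # ascending eigenvalues
idx = np.argsort(eigvals)[::-1][:3]           # 3 largest
lambdas, pcs = eigvals[idx], eigvecs[:, idx]
explained = lambdas.sum() / eigvals.sum()     # ~97.5% in the report
```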
2.2.2 HJM
Volatility functions The volatility functions for the HJM model are defined as:
Vol_i = √λi · e^(i),   ∀ i = 1, 2, 3 (2)

where λi is the eigenvalue and e^(i) the eigenvector. This is equivalent to a one-standard-deviation move in the
e^(i) direction.
The volatility functions then have to be fitted, as we need analytical functions to carry out the integration
that gives the drift in the HJM model. In this case, 3rd, 5th and 6th degree polynomials were fitted to guarantee
a goodness of fit of over 97%. These are shown in the figures below. In particular, for Vol1 fitting a constant
(red line) is not really suitable, which justifies the need for a polynomial fit (although this could also lead to
overfitting issues).
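A sketch of one such fit with numpy (the tenor grid and the degree are assumptions for illustration):

```python
import numpy as np

# Fit an analytical polynomial to the first volatility vector.
tau = 0.5 * np.arange(1, len(vol1) + 1)     # tenor grid (assumed)
coeffs = np.polyfit(tau, vol1, deg=3)
vol1_fn = np.poly1d(coeffs)                 # callable analytical function
```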
Drift The drift function µ(t) is obtained by integrating over the principal components and assuming that
volatility is a function of time.
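For reference, the standard no-arbitrage HJM drift implied by the fitted volatility functions takes the form (a sketch of the relation, with Vol_i(τ) the fitted functions above):

```latex
\mu(t,\tau) \;=\; \sum_{i=1}^{3} \mathrm{Vol}_i(\tau) \int_0^{\tau} \mathrm{Vol}_i(s)\, ds
```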
Monte Carlo The calibrated volatility and drift functions above are then entered into the HJM model
Monte Carlo simulation script my HJM model.py, which evolves the whole forward curve according to the
stochastic differential equation (SDE):
df̄ = µ(t)dt + Σ_{i=1}^{3} Vol_i φi √dt + (dF/dτ)dt (3)
where φi is a random number drawn from the standard normal distribution and the last term is the
Musiela correction. For brevity the details of the model are omitted here but additional details of the MC
simulation are outlined below:
• The forward curve was initialised using the last observed forward curve (last row in the BLC data)
• Time step taken was dt = 0.01
• Number of simulations used was I = 1000
• The random variables were taken from python's numpy function np.random.standard_normal, which
draws numbers from a standard normal distribution N (µ = 0, σ = 1)
• Due to time constraints, the antithetic variance reduction technique was not implemented in the script,
so the simulation error is larger than it would otherwise be
To obtain an expectation of the LIBOR rate in the future, L(t; Ti, Ti+1), the forward rate f is selected from
the corresponding tenor column τ = Ti+1 − Ti of the HJM output, at the correct simulated time t. This
is then converted to a LIBOR rate using L = (1/τ)(e^{fτ} − 1), where τi = 0.5 ∀ i in this case.
For the IRS in question (written on 6M LIBOR for 5Y), the relevant simulated rates we need are
L(t; Ti, Ti+1) for Ti = 0, 0.5, . . . , 4.5. The figure below shows a sample of 100 simulations for these LIBOR rates.
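A sketch of one simulated path of equation (3) (array names such as `f0`, `mu`, `vols` and `tau` are assumed placeholders for the calibrated inputs):

```python
import numpy as np

# One Euler step per dt, with the Musiela correction dF/dtau;
# vols has shape (3, n_tenors).
dt, n_steps = 0.01, 500
f = f0.copy()
for _ in range(n_steps):
    phi = np.random.standard_normal(3)
    dF_dtau = np.gradient(f, tau)           # Musiela term
    f = f + mu * dt + (vols.T @ phi) * np.sqrt(dt) + dF_dtau * dt
```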
3 Discount Factors
The DFs are implied from the HJM forward LIBORs for each simulation via the formula:

DF(0, Ti+1) = ∏_i 1 / (1 + τi L(t; Ti, Ti+1)) (4)
which is equivalent to ‘integrating under the curve’. This then gives 1000 simulations of the DF for each
tenor. For simplicity, an expectation across all simulations is taken to get a single value for DF (0, Ti+1 ).
This is then used to obtain the forward-starting discount factors as a single number DF(Ti, Ti+1) for the
IRS in question, where:

DF(Ti, Ti+1) = DF(0, Ti+1) / DF(0, Ti) (5)

and by definition DF(Ti+1, Ti+1) = 1.0. The resulting grid of forward-starting discount factors is:
Tenor 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
0.5 1.0 DF(0.5, 1.0) DF(0.5, 1.5) DF(0.5, 2.0) DF(0.5, 2.5) DF(0.5, 3.0) DF(0.5, 3.5) DF(0.5, 4.0) DF(0.5, 4.5) DF(0.5, 5.0)
1.0 - 1.0 DF(1.0, 1.5) DF(1.0, 2.0) DF(1.0, 2.5) DF(1.0, 3.0) DF(1.0, 3.5) DF(1.0, 4.0) DF(1.0, 4.5) DF(1.0, 5.0)
1.5 - - 1.0 DF(1.5, 2.0) DF(1.5, 2.5) DF(1.5, 3.0) DF(1.5, 3.5) DF(1.5, 4.0) DF(1.5, 4.5) DF(1.5, 5.0)
2.0 - - - 1.0 DF(2.0, 2.5) DF(2.0, 3.0) DF(2.0, 3.5) DF(2.0, 4.0) DF(2.0, 4.5) DF(2.0, 5.0)
2.5 - - - - 1.0 DF(2.5, 3.0) DF(2.5, 3.5) DF(2.5, 4.0) DF(2.5, 4.5) DF(2.5, 5.0)
3.0 - - - - - 1.0 DF(3.0, 3.5) DF(3.0, 4.0) DF(3.0, 4.5) DF(3.0, 5.0)
3.5 - - - - - - 1.0 DF(3.5, 4.0) DF(3.5, 4.5) DF(3.5, 5.0)
4.0 - - - - - - - 1.0 DF(4.0, 4.5) DF(4.0, 5.0)
4.5 - - - - - - - - 1.0 DF(4.5, 5.0)
5.0 - - - - - - - - - 1.0
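A sketch of equations (4)-(5) applied to the simulation output (`libors` is assumed to have shape n_sims × n_tenors):

```python
import numpy as np

tau = 0.5
df0 = np.cumprod(1.0 / (1.0 + tau * libors), axis=1)  # DF(0, T_{i+1}), eq. (4)
df0_mean = df0.mean(axis=0)                           # expectation shortcut
df_grid = df0_mean[None, :] / df0_mean[:, None]       # DF(T_i, T_j), eq. (5)
```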
4 Credit Valuation Adjustment
4.1 Mark-to-Market
The mark-to-market (MTM) value of the swap V (Ti ), i.e. the evolution of swap value over time, is obtained
via:
V(Ti = 0)   = N τ Σ_{i=1}^{11} D(0, Ti)(Li − K)
V(Ti = 0.5) = N τ Σ_{i=2}^{11} D(0.5, Ti)(Li − K)
V(Ti = 1.0) = N τ Σ_{i=3}^{11} D(1.0, Ti)(Li − K)
...
V(Ti = 5.0) = N τ Σ_{i=11}^{11} D(5.0, Ti)(Li − K)
where N is the notional, K the fixed agreed rate and Li = L(t; Ti−1 , Ti ) the LIBOR effective for the
period Ti . Here we assume a “par swap” (ATM) by choosing K = L(t; 0, 0.5) ≈ 0.0063845 to have zero
initial cashflows upon entering the swap.
4.2 Exposure
Finally, the exposure for each tenor Ei is calculated from the positive part of the MTM simulations as:
Ei = max(Vi , 0) (6)
The figure below shows some sample simulations of the exposure profile.
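A hypothetical sketch of this step (`discount` and `libors` are assumed placeholders for the simulation outputs, not the report's actual identifiers):

```python
import numpy as np

# V[s, j]: MTM of simulation s at valuation date T_j (tau = 0.5, N = 1);
# discount(j, i) is a hypothetical helper returning DF(T_j, T_i).
n_sims, n_dates = libors.shape[0], 11
V = np.zeros((n_sims, n_dates))
for j in range(n_dates):                     # T_j = 0, 0.5, ..., 5.0
    for i in range(j + 1, 12):               # remaining payment dates
        V[:, j] += 0.5 * discount(j, i) * (libors[:, i - 1] - K)
E = np.maximum(V, 0.0)                       # exposure, eq. (6)
```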
4.3 Expected and Potential Future Exposure
The figure below shows the period expected exposure EEi, as calculated from the mean and median, as
well as the Potential Future Exposure (PFE), taken to be the 97.5th percentile of the Ei distribution. From
this we see that, using the median, the maximum EE and PFE are attained at the beginning of the swap in
this case (equal to ∼5.33% and ∼14.89% of the notional respectively). Clearly the PFE is always bigger
than the EE - in a way this tells us it is 97.5% probable that our exposure will not exceed ∼14.89%.
4.4 CVA
Lastly, the CVA is approximated by a linear interpolation across the tenors¹:

CVA ≈ Σ_i (1 − R) E((Ti−1 + Ti)/2) DF((Ti−1 + Ti)/2) PD((Ti−1 + Ti)/2) (7)

The figure below shows each of the components in this equation, where we see their term structure isn't
flat, except for the loss factor (1 − R).
¹ Note that the discount factors are more appropriately extrapolated using a log-linear extrapolation instead of the purely linear
one used here. Due to time constraints this wasn't implemented.
The figure below shows the final CVA result for each tenor range, in percentage terms over the notional
value. We see a ‘hump’ shape, telling us the maximum is found in the 2.5 − 3.0Y tenor period. Based on
this, the total CVA over the lifetime of the swap amounts to 0.473% of the notional. So, for example,
if the notional were $1m, the CVA would amount to ∼$4,730, which although small is not negligible
when it comes to pricing the true value of the swap.
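A sketch of the final aggregation of eq. (7) (the midpoint-interpolated arrays are assumed precomputed inputs):

```python
# ee_mid and df_mid: exposure and DF interpolated at period midpoints;
# pd_period: bootstrapped PD(T_{i-1}, T_i) per period.
R = 0.40
cva_contrib = (1.0 - R) * ee_mid * df_mid * pd_period
cva_total = cva_contrib.sum()    # ~0.473% of notional in the report
```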
5 Conclusion
For this hypothetical IRS scenario the CVA adjustment came out quite small. This was a
bit ‘surprising’, given AirFrance's relatively high probability of default compared to other companies' CDS spreads.
Whether the CVA value arrived at is accurate is debatable, since through the exercise the following
issues were noted:
• Typically a ‘hump’ structure should already have been seen in the exposure profile, but this wasn't
the case here; rather, the maximum exposure was attained at the very start. This is atypical for
long-term contracts such as a 5Y IRS, because the forward rates at these tenors are relatively
high. One potential cause could be the HJM recalibration to recent data, given that rates have been
historically low. Another possibility could be that the forward rates are too low compared to the
estimated discount factors. On the other hand, the CVA plot did show a hump, but this is likely due
to the default probability term structure. Overall this would require further investigation
• Due to time constraints, a ‘shortcut’ was taken when calculating the discount factors for the swap - an
expectation across all simulations was taken to get a single value for each period. The effect on the
valuation from this would need to be understood
• The above is in addition to all the pros and cons of the HJM model and the associated MC errors. In
particular, the antithetic variance reduction technique wasn't implemented here, yielding a higher simulation
error
• The effect from accruals wasn't estimated either, although this is expected to be small
References
[1] Tanya Sandoval, Github repository https://github.com/tsando/CQF/tree/master/finalProject
[2] Richard Diamond, CQF Lectures - Heath Jarrow & Morton Model, 2016
[3] Richard Diamond, CQF Lectures - Final Project Workshop Part I, 2016
[4] Riaz Ahmad, Stochastic Interest Rate Modeling, 2016