Maher PDF
Maher PDF
by M. J. MAHER*
Abstract Previous authors have rejected the Poisson model for association football scores in
favour of the Negative Binomial. This paper, however, investigates the Poisson
model further. Parameters representing the teams’ inherent attacking and defensive strengths are
incorporated and the most appropriate model is found from a hierarchy of models. Observed and
expected frequencies of scores are compared and goodness-of-fit tests show that although there
are some small systematic differences, an independent Poisson model gives a reasonably accurate
description of football scores. Improvements can be achieved by the use of a bivariate Poisson
model with a correlation between scores of 0.2.
1 Introduction
MORONEY (1951) demonstrated that the number ofgoals scored by a team in a football
match was not well fitted by a Poisson distribution but that ifa “modified Poisson” (the
Negative Binomial, in fact) was used, the fit was much better. REEP, POLLARD and
BENJAMIN (1971) confirmed this, using data from the English Football League First
Division for four seasons, and then proceded to apply the Negative Binomial distribu-
tion to other ball games. The implication of this result is that the same Negative Bino-
mial distribution applied to the number of goals scored by a team, regardless of the
quality of that team or the quality of the opposition. In fact in an earlier paper, REEPand
BENJAMIN (1968) remarked that “chance does dominate the game”. HILL(1974) was un-
convinced by this and showed that football experts were able, before the season started,
to predict with some success the final league table positions. Therefore, certainly over a
whole season, skill rather than chance dominates the game. This would probably be
agreed by most people who watch the game of football; that whilst in a single match,
chance plays a considerable role (missed scoring opportunities, dubious offside deci-
sions and shots hitting the crossbar can obviously drastically affect the result), over
several matches luck plays much less ofa part. Teams are not identical; each one has its
own inherent quality, and, surely then we should expect that when a good team is play-
ing a weak team, the good team will have a high probability of winning and scoring
several goals. By using data from the whole or just a part of the season, these inherent
qualities of the teams in a league can be inferred by, for example, maximum likelihood
estimation (as in THOMPSON (1975)) or by linear model methodology (as in HARVILLE
(1977) and LEEFLANG and VANPRAAG(1971)).
2 The Model
There are good reasons for thinking that the number of goals scored by a team in a
match is likely to be a Poisson variable: possession is an important aspect of football,
may be imposed and so only 86 independent parameters need to be specified. Since the
X and
- are assumed to be independent of each other (representing separate “games” at
the two ends of the pitch), the estimation of the _a a n d 4 will be entirely from the gand
the estimation of the y and _S by means of the y alone.
For the home teams’ scores, therefore, the log likelihood function is:
i j+i
Therefore,
a log L
-=aai c (-A+;)
j+i
xii
110
An iterative technique, such as NEWTON-RAPHSON, enables these MLEs to be deter-
mined. One simpler scheme which works well is to be use the 6’s to estimate thej’s and
then to use the g’s to estimate the 63, and so on alternately. Good initial estimates can
be gained by regarding the denominator terms in the expressions above as summations
over all teams: that is,
Ci= 1 xu/,& and j j = 1 xu/&, where &=I
1 xu.
j+ i i+. j i j=ei
The question arises of whether all these parameters are necessary for an adequate des-
cription of the scores. Intuitively it seems that there must be real differences between
teams, but are these differences more apparent in the attacks or defences, and is it really
necessary to have separate parameters for the quality of a team’s attack at home and
111
away? Consideration of such questions leads to a possible hierarchy of models which
could be tested. At the bottom is model 0 in which aj= a,Ji =J,yi = yand 6,= 6Vi; that
is, all teams are identical in all respects. At the top is model 4, previously described, in
which all four types of parameter are allowed to take different values for the different
teams. The hierarchy is shown in Table 2. In this the notation is designed to show
whether a set of parameters (such at theJ) are free to take different values for the dif-
ferent teams (shown asg,) or whether the same value applies to all teams (shown asJ).
Table 2. Hierarchy of models, with changes in the value of twice the maxirnised log likelihood
shown for Division I 1971-1972
~ ~ ~~
Model 4
Models 3C, 3D
Model 2
Models lA, 1B
Model 0
In model 0 there are four parameters but in order to have a unique set ofparameter esti-
mates, the constraints a =J and y = 6 are imposed (or, equivalently, a =J,y = kJ and
6 = ka), giving just two independent parameters. Details of the constraints imposed in
the other models are as follows:
It can be seen, therefore, that moving up one level in the hierarchy of models leads to
the introduction of (n - 1) further parameters. Under the null hypothesis that these
extra parameters are unnecessary, 2 logel will be asymptotically Xj- I distribution by
the usual likelihood ratio test, where logelisthe increase in the log likelihood in moving
from one model to the other.
4 Goodness-of-fit-tests
For a match between team iand team jthe MLEs from model 2 may be used to estimate
,the means of Xu and Yti.Since X, and Yuare assumed to be Poisson and inde-
f l u and A,
pendent, the probabilities that Xu= xand Y, =ymay be easily calculated. By repeating
this for all pairs of i andj, the expected score distributions may be found and compared
with the observed score distributions. For Division 1 in the season 1971-1972, for
example, these observed and expected frequencies are shown in Table 4.
Table 4. Observed and expected frequencies of home and away scores for Division 1 1971-1972
home away
no. of goals obs. exp. obs. exp.
0 117 111.2 184 189.3
1 127 144.6 157 159.5
2 115 106.1 88 15.9
3 66 58.0 30 26.9
234 31 42.1 3 10.5
X 2 = 4.90 x2 = 7.79
J*i i*j
ai= Vi, j
(1 + i2)Zjj jJ= 1i?i
(1 + i2)
and
-
k2=-
z:L
rj*i
Yu
c c xu
ij*i
and that
which means that the sum of the means of the fitted Poisson distributions is equal to the
observed numbers of goals scored. The estimation of the parameters gives rise, there-
fore, to one linear constraint on the expected frequencies in each of the two X 2 good-
Table 5. Values of the X 2 statistic for home and away teams’ scores for the independent Poisson
model
x 2 values
season division homes aways
1971-1972 4.90 7.79
5.71 1.08
10.05* 8.96*
4.62 1.07
1972-1973 6.08 13.4 1**
3.44 9.77*
4.94 4.31
0.78 3.22
1973-1974 1 7.91* 1.33
2 1.97 1.12
3 0.89 5.28
4 3.61 1.92
5% level 7.81
critical values
1% level 11.3
* indicates a significant value at the 5% level
** indicates a significant value at the 1% level
The case where the model would be rejected are shown by an asterisk. For the home
teams’ scores there are two such cases and for the away teams’ scores there are three.
Overall, then, the Poisson model may be regarded as acceptable, although with some
slight doubt. If the observed and expected frequencies are compared for each of the
twelve data sets, some small but systematic differences can be seen. The overall
observed and expected proportions are:
HOME SCORES
no. of goals 0 1 2 3 24
observed 0.217 0.321 0.254 0.130 0.078
expected 0.230 0.318 0.238 0.128 0.086
AWAY SCORES
no. of goals 0 1 2 3 24
observed 0.388 0.371 0.177 0.051 0.014
expected 0.406 0.352 0.166 0.056 0.020
The model underestimates the number of occasions on which one and two goals are
scored, and overestimates the number oftimes that 0 or 4 goals are scored. This effect
Table 6. Observed and estimated frequencies for Z, the difference in the teams’ scores, for
Division 1 in 1971-1972, for (i) the independent Poisson model and (ii) the bivariate
Poisson with e = 0.2
Z (-3 -2 -1 0 +I +2 +3 +4 )5
observed 8 26 72 129 105 69 31 16 6
estimated (e=O) 14.4 30.3 69.8 113.0 104.9 68.7 35.8 15.8 9.3
estimated (e=0.2) 9.9 25.3 68.0 126.2 111.7 67.7 32.6 13.4 7.1
In this it can be seen that the number of drawn matches (Z= 0) is a little underestimat-
ed. This is a systematic feature noted in all twelve data sets. The X 2 goodness-of-fit
statistics are shown in Table 7; four of the twelve are significant at the 5% level, whilst
several others approach this. Only one of the twelve has a value of the X 2 statistic which
is less than the expected value of 7. (The number of degrees of freedom is reduced to 7
because of the linear constraint on the expected frequencies resulting from the estima-
tion of the a’s andJ’s.) This suggests that there may be some correlation between the X,,
and Yu.A bivariate Poisson model was tried; in this the marginal distributions are still
Poisson with means f l u ( = aiJj)and ,lo(= k2ajJi)but there is a correlation of e be-
tween the scores. One way of thinking of such a bivariate Poisson distribution is that
Xu = Uti+ W, and Yti= Vo+ Wuwhere U,, Vu and W” are independent Poisson with
Table 7. Values of the X 2 goodness-of-lit statistic for Z, th8 difference between the teams’ scores,
for (i) the independent model and (ii) the bivariate Poisson model with e = 0 . 2
~ ~
x2 values
season division independent model bivariate model
197 1-1972 1 9.67 1.86
2 16.42* 6.50
3 10.87 3.94
4 12.99 5.75
1972-1973 1 15.51* 4.77
2 13.70 11.98
3 4.79 2.50
4 15.30* 8.27
1973-1974 1 16.47* 9.08
2 9.76 12.29
3 13.53 8.00
4 10.36 5.40
5% level 14.1 12.6
critical values
1 % level 18.5 16.8
The results of fitting this bivariate Poisson model are shown in Tables 6 and 7, where it
can be seen that the introduction of the extra parameter Q has led to a considerable
improvement in the fit. The X 2 statistics in Table 7 are not only non-significant but are
fairly representative values from a Xi distribution. (It has been assumed that the fitting
of the extra parameter e will be roughly equivalent to the imposition of another linear
constraint on the expected frequencies, although in fact the same value of e has been
applied to all the twelve data sets). A bivariate Poisson model with correlation ofabout
0.2, therefore, would seem to give a very adequate fit to the differences in scores.
6 Summary
Previous work on the distribution of scores in football matches has rejected the Poisson
model in favour of the Negative Binomial. This work, however, has not allowed for the
different qualities of the teams in a league. The first model investigated here assumes
that the home team’s and aways team’s scores in any one match are independent Pois-
Acknowledgement
The author would like to thank an anonymous referee for his very helpful comments on
an earlier version of this paper.
References
HARVILLE,
D. (1977), The use of linear-model methodology to rate high school or college football
teams, J. Amer. Statist. Ass. 72, No. 3S8, pp. 278-289.
HILL,I. D. (1974), Association football and statistical inference, Appl. Statist. 23, No. 2, pp. 203-
208.
LEEFLANG,
P. S. H. and B. M. S. V A N PRAAG(1971), A procedure to estimate relative powers in
binary contacts and an application to Dutch football league results, Statist. Neerlandica 25, No. 1,
pp. 63-84.
MORONEY, M . (1951), Factsfrom figures, London, Pelican.
REEP,C. and B. BENJAMIN (1968), Skill and chance in association football, J. R. Statist. SOC.A, 131,
pp. 581-585.
REEP,C . , R. POLLARD and B. BENJAMIN (1971), Skill and chance in ball games, J. R. Statist. SOC.A,
134, pp. 623-629.
THOMPSON, M. (1975), On any given Sunday: fair competitor orderings with maximum likelihood
methods, J. Amer. Statist. Ass. 70, No. 351, pp. 536-541.