On The Block Maxima Method in Extreme Value Theory - PWM Estimators
On The Block Maxima Method in Extreme Value Theory - PWM Estimators
In the case of the POT method, exact conditions under which the statis-
tical method is justified can be described by a second-order term [see, e.g.,
Drees (1998) and de Haan and Ferreira (2006), Section 2.3]. In the case of
block maxima, usually it is taken for granted that the maxima follow very
well an extreme value distribution. In this paper, we take this misspecifi-
cation into account by quantifying it in terms of a second-order expansion;
cf. Condition 2.1 below. Since Gγ is not the exact distribution for those
observations, a bias may appear.
The POT method picks up all “relevant” high observations. The BM
method on the one hand misses some of these high observations and, on the
other hand, might retain some lower observations. Hence the POT seems to
make better use of the available information.
There are practical reasons for using the BM method:
• The only available information may be block maxima, for example, yearly
maxima with long historical records or long range simulated data sets
Kharin et al. (2007).
• The BM method may be preferable when the observations are not ex-
actly independent and identically distributed (i.i.d.). For example, there
may be a seasonal periodicity in case of yearly maxima or, there may be
short range dependence that plays a role within blocks but not between
blocks; cf. for example, Katz, Parlange and Naveau (2002) and Madsen,
Rasmussen and Rosbjerg (1997) for further discussion.
• The BM method may be easier to apply since the block periods ap-
pear naturally in many situations [Naveau et al. (2009), van den Brink,
Können and Opsteegh (2005), de Valk (1993)]. On the other hand, the
POT method allows for greater flexibility in many cases since it might be
difficult to change the block size in practice.
When working with BM, there are two sets of estimators that are widely
used: the maximum likelihood (ML) estimators [e.g., Prescott and Walden
(1980)] and the probability weighted moment (PWM) estimators Hosking,
Wallis and Wood (1985). Recently, Dombry (2013) has proved consistency
of the former. The present paper concentrates on the latter. Our work has
given rise to the paper Bücher and Segers (2014) on the multivariate case.
The PWM estimators under the Gγ model are very popular, for exam-
ple, in applications to hydrologic and climatologic extremes, because of their
computational simplicity, good performance for small sample sizes and ro-
bustness even for location and scale parameters [Diebolt et al. (2008), Katz,
Parlange and Naveau (2002), Caires (2009), Hosking (1990)].
The relative merits of POT and BM have been discussed in several pa-
pers, all based on simulated data: Cunnane (1973) states that for γ = 0
and ML estimators, the POT estimate for a high quantile is better only if
the number of exceedances is larger than 1.65 times the number of blocks;
ON THE BLOCK MAXIMA METHOD 3
Wang (1991) writes that POT is as efficient as BM model for high quan-
tiles, based on PWM estimators; Madsen, Pearson and Rosbjerg (1997) and
Madsen, Rasmussen and Rosbjerg (1997) write that POT is preferable for
γ > 0, whereas for γ < 0, BM is more efficient, again with the number of ex-
ceedances larger than the number of blocks; Martins and Stedinger (2001)
state that the gains (when using historical data) with the BM model are in
the range of the gains with the POT model, based on ML estimators; Caires
(2009) in a vast simulation study writes that with POT samples having an
average of two or more observations per block, the estimates are more accu-
rate than the corresponding BM estimates, and with more than 200 years
of data the accuracies of the two approaches are similar and rather good,
based on several estimators including the PWM and ML estimators.
From all these studies, some even with mixed views, the following two
features seem dominant. First, POT is more efficient than BM in many
circumstances, though needing, on average, a number of exceedances larger
than the number of blocks. Secondly, POT and BM often have comparable
performances, for example, for large sample sizes.
Our theoretical comparison shows that BM is rather efficient. The asymp-
totic variances of both extreme value index and quantile estimators are
always lower for BM than for POT. Moreover, the approximate minimal
mean square error is also lower for BM under usual circumstances. The op-
timal number of exceedances is generally higher than the optimal number
of blocks.
The paper is organized as follows. In Section 2, we state exact conditions
to justify the BM method, along with the asymptotic normality result for
the PWM estimators including high quantile estimators. In Section 3, we
provide a theoretical comparison between the two methods, BM and POT.
The analysis is based on a uniform expansion of the relevant quantile pro-
cess given in Section 2.1. This expansion also provides a basis for analysing
alternative estimators besides the PWM estimator. Proofs are postponed to
Section 4.
Throughout the paper, we assume that the observations are i.i.d. In future
work, we shall extend the results to the non-i.i.d. case and to the maximum
likelihood estimator.
2. The estimators and their properties. Let X̃1 , X̃2 , . . . be i.i.d. ran-
dom variables with distribution function F . Define for m = 1, 2, . . . and
i = 1, 2, . . . , k the block maxima
(1) Xi = max X̃j .
(i−1)m<j≤im
for all x > 0 [see, e.g., de Haan and Ferreira (2006), Theorem B.3.1]. Note
that the function |A| is regularly varying with index ρ ≤ 0.
k
1 X (i − 1) · · · (i − r)
(3) βr = Xi,k , r = 1, 2, 3, . . . , k > r,
k (k − 1) · · · (k − r)
i=1
Theorem 2.2. Assume the conditions of Theorem 2.1 with γ < 1/2.
Then
√ (r + 1)βr − bm
k − Dr (γ)
am
Z 1
d
→ (r + 1) sr−1 (− log s)−1−γ E(s) ds + λIr (γ, ρ) =: Qr ,
0
as n → ∞, jointly for r = 0, 1, 2, 3, . . . , where →d means convergence in dis-
tribution, E is Brownian bridge,
(r + 1)ξ Γ(1 − ξ) − 1
Dr (ξ) = , ξ<1
ξ
6 A. FERREIRA AND L. DE HAAN
R∞
Note that Γ′ (1 − γ) = 0 u−γ e−u (log u) du and Γ′′ (1) = 1.97811.
√
Remark 2.1. The condition kA(m) → λ ∈ R means that the growth
of kn , the number of blocks, is restricted with respect to the growth of
mn , the size of a block, as n → ∞. In particular this condition implies that
(log k)/m → 0, as n → ∞.
√
âk,m d 1 log 2 ′
k −1 → (Q1 − Q0 ) + ∆ + Γ (1) ,
am log 2 2
√ b̂k,m − bm
k →d Q0 − Γ′′ (1)∆ + Γ′ (1)Λ.
am
Fig. 1. Asymptotic variances of γ PWM estimators with dashed line for POT.
Fig. 3. Contour plot for the ratio of asymptotic minimal mean square error of γ PWM
estimators.
R∞
or, writing r for n/k, inf r ((r/n)σi2 + Bi2 r s(u) du). Setting the derivative
equal to zero and using properties of regularly varying functions one finds
(i)
for the optimal choice of r, r0 ∼ (1/s)← (n)(Bi2 /σi2 )1/(1−2ρ) and, in terms
of k,
2 1/(1−2ρ)
(i) n σi
k0 ∼ .
(1/s) (n) Bi2
←
(i)
Note that the optimal k0 is different but of the same order for both
(i)
methods. Next, inserting k0 in (6), after some manipulation we get the
following asymptotic expression for MINMSE,
1 − 2ρ (1/s)← (n) 2 1/(1−2ρ) 2 −2ρ/(1−2ρ)
(Bi ) (σi ) .
−2ρ n
It follows that MINMSE(BM)/MINMSE(POT) is, approximately,
Fig. 5. Asymptotic variances of quantile PWM estimators with dashed line for POT.
Lemma 4.1. 1. As k → ∞,
(log k)Z1,k →P 1.
Fig. 8. Contour plot for the ratio of asymptotic minimal mean square error of quantile
PWM estimators.
14 A. FERREIRA AND L. DE HAAN
Fig. 9. Contour plot for the ratio of the optimal values of k in quantile case.
2. [Csörgő and Horváth (1993), page 381] Let 0 < ν < 1/2. With {Ek }k≥1 ,
an appropriate sequence of Brownian bridges,
s(− log s) √
Ek (s)
sup k((− log s)Z⌈ks⌉,k −1)− = oP (1),
(s(1 − s))ν
1/(k+1)≤s≤k/(k+1) s(− log s)
as k → ∞ (⌈u⌉ represents the smallest integer larger or equal to u).
3. Similarly, with 0 < ν < 1/2 for an appropriate sequence {Ek }k≥1 of Brow-
nian bridges and ξ ∈ R,
sup (s(1 − s))−ν
1/(k+1)≤s≤k/(k+1)
Zξ
√ ⌈ks⌉,k − 1 (− log s)−ξ − 1
1+ξ
× ks(− log s)
− − Ek (s) = oP (1),
ξ ξ
as k → ∞.
Lemma 4.2. Under Condition 2.1, there are functions A0 (t) ∼ A(t) and
a0 (t) = a(t)(1 + o(A(t))), as t → ∞, such that for all ε, δ > 0 there exists
t0 = t0 (ε, δ) such that for t, tx > t0 ,
(V (tx) − V (t))/a0 (t) − (xγ − 1)/γ
− Hγ,ρ (x)
A0 (t)
ON THE BLOCK MAXIMA METHOD 15
(8)
≤ ε max(xγ+ρ+δ , xγ+ρ−δ ).
Moreover,
a0 (tx)/a0 (t) − xγ
γ ρ
− x (x − 1)/ρ
A0 (t)
(9)
≤ ε max(xγ+ρ+δ , xγ+ρ−δ )
and
A0 (tx) ρ
ρ+δ ρ−δ
A0 (t) − x ≤ ε max(x , x ).
Note that
1 xγ+ρ − 1 xγ − 1
− , ρ 6= 0 6= γ,
ρ γ+ρ γ
xγ − 1
1 γ
x log x − , ρ = 0 6= γ,
Hγ,ρ (x) = γ γ
1 xρ − 1
− log x , ρ 6= 0 = γ,
ρ ρ
1
(log x)2 ,
ρ = 0 = γ.
2
According to (9) of Lemma 4.2, for each ε, δ > 0 there exists t0 such that
the factor I.2 is bounded (above and below) by
(− log s)−ρ − 1
−ρ+δ −ρ−δ
1 + A0 (m) ± ε max((− log s) , (− log s) )
ρ
provided m ≥ t0 and s ≥ e−m/t0 . According to (8) of Lemma 4.2, for factor
I.1 we have the bounds
γ
−γ ((− log s)Z⌈ks⌉,k ) − 1
m
(− log s) + A0 (− log s)−γ
γ − log s
× {Hγ,ρ ((− log s)Z⌈ks⌉,k ) ± ε max(((− log s)Z⌈ks⌉,k )γ+ρ+δ ,
is bounded by
(s(1 − s))ν
1 γ+ρ 1 Ek (s) ε
(− log s) √ ±√
ρ k s(− log s)1+γ+ρ k s(− log s)1+γ+ρ
ε (s(1 − s))ν
γ 1 Ek (s)
− (− log s) √ ∓√
k s(− log s)1+γ k s(− log s)1+γ
2ε (s(1 − s))ν
=± √ ,
ρ k s(− log s)
and similarly for cases other than ρ 6= 0 6= γ. The remaining part of I.1b,
namely ±ε max(((− log s)Z⌈ks⌉,k )γ+ρ+δ , ((− log s)Z⌈ks⌉,k )γ+ρ−δ ), is similar.
Part II, by the inequalities of Lemma 4.2, is bounded by
1 −γ−ρ+δ −γ−ρ−δ
A0 (m) Hγ,ρ ± ε max((− log s) , (− log s) )
− log s
√ 1
hence it contributes kA0 (m)Hγ,ρ ( − log s ) to the result.
Collecting all the terms, one finds the result.
Then
√ (r + 1)βr − bm (r + 1)γ Γ(1 − γ) − 1
k −
am γ
√ (r + 1) 0 X⌈ks⌉,k Jk(r) (s) ds − bm
R1
= k
am
Z 1
(− log s)−γ − 1 r
− (r + 1) s ds
0 γ
√ Z 1
X⌈ks⌉,k − bm (− log s)−γ − 1 (r)
= k(r + 1) − Jk (s) ds
0 am γ
18 A. FERREIRA AND L. DE HAAN
√ Z 1
(− log s)−γ − 1 r (r)
− k(r + 1) (s − Jk (s)) ds
0 γ
√ 1/(k+1) X
⌈ks⌉,k − bm (− log s)−γ − 1 (r)
Z
= k(r + 1) − Jk (s) ds
0 am γ
√ Z k/(k+1)
X⌈ks⌉,k − bm (− log s)−γ − 1
(r)
+ k(r + 1) − Jk (s) ds
1/(k+1) a m γ
√ 1 X⌈ks⌉,k − bm (− log s)−γ − 1 (r)
Z
+ k(r + 1) − Jk (s) ds
k/(k+1) am γ
√ Z 1
(− log s)−γ − 1 r (r)
− k(r + 1) (s − Jk (s)) ds
0 γ
= I.1 + I.2 + I.3 + I.4.
(r) √
For I.4: since (sr − Jk (s)) = O(1/k) uniformly in s, I.4 = O(1/ k).
For I.1, note that
Z 1/(k+1) √
X⌈ks⌉,k − bm r
(10) k s ds = oP (1).
0 am
This follows since, the left-hand side of (10) equals, in distribution,
√
k V (mZ1,k ) − V (m)
(k + 1) r+1 am
which, by Lemmas 4.1.1, 4.2 and the fact that m/ log k → ∞, is bounded
(below and above) by
√ γ
Z1,k − 1
k γ+ρ+δ γ+ρ−δ
+ A(m)Hγ,ρ (Z1,k ) ± εA(m) max(Z1,k , Z1,k ) .
(k + 1)r+1 γ
ξ
√
This is easily seen to converge to zero in probability, since Z1,k / k=
√ √
{(log k)Z1,k }ξ log−ξ k/ k →P 0 for all real ξ and kA(m) → λ. Hence, I.1 =
oP (1).
Next, we show that
Z 1 √ X⌈ks⌉,k − bm (r)
(11) k Jk (s) ds = oP (1).
k/(k+1) am
(r)
The left-hand side equals, in distribution, since Jk (s) ≡ 1 for s ∈ (k(k +
1)−1 , 1),
√ V (mZk,k ) − V (m)
k
1− k .
k+1 am
ON THE BLOCK MAXIMA METHOD 19
Proof of Theorem 2.4. The proof follows the line of the comparable
result for the POT method [see, e.g., de Haan and Ferreira (2006), Chap-
ter 4.3]. Let cn = 1/(mpn ). Then
√
k(x̂k,m − xn )
am qγ (cn )
√ γ̂
cnk,m − 1
k 1
= b̂k,m + âk,m −V
am qγ (cn ) γ̂k,m − log(1 − pn )
√ √ γ̂k,m
cn − 1 cγn − 1
k b̂k,m − bm âk,m k
= + −
qγ (cn ) am am qγ (cn ) γ̂ γ
√
V (m/(−m log(1 − pn ))) − V (m) cγn − 1
k
− −
qγ (cn ) am γ
γ
cn − 1 √ âk,m
+ k −1 .
γqγ (cn ) am
Similarly, as on pages 135–137 of de Haan and Ferreira (2006), this converges
in distribution to
γ−
∆ + (γ− )2 Ξ − γ− Λ − λ .
γ− + ρ
1 − Γ(1 − γ)
kb,2 = Cγ kγ,2 Cb + kγ,2 ;
γ
kx,0 = Cγ kγ,0 + (γ− )2 kb,0 − γ− ka,0 ,
kx,1 = Cγ kγ,1 + (γ− )2 kb,1 − γ− ka,1 , kx,2 = Cγ kγ,2 + (γ− )2 kb,2 − γ− ka,2
and, for γ = 0,
Cγ = 2(log(3/2)) −1 , kγ,0 = (log 2)−1 − (log 3)−1 ,
kγ,1 = −(log 2)−1 , kγ,2 = (log 3)−1 ;
Ca = 2−1 log 2 + Γ′ (1), Cb = −Γ′′ (1)
and the rest follow similarly by continuity. Then the asymptotic variances,
covariances and biases follow by combining (12) with (13)–(16) in the obvi-
ous way.
REFERENCES
Bücher, A. and Segers, J. (2014). Extreme value copula estimation based on block
maxima of a multivariate stationary time series. Extremes 17 495–528. MR3252823
Cai, J.-J., de Haan, L. and Zhou, C. (2013). Bias correction in extreme value statistics
with index around zero. Extremes 16 173–201. MR3057195
Caires, S. (2009). A comparative simulation study of the annual maxima and the peaks-
over-threshold methods. SBW-Belastingen: Subproject “Statistics”. Deltares Report
1200264-002.
Csörgő, M. and Horváth, L. (1993). Weighted Approximations in Probability and Statis-
tics. Wiley, Chichester. MR1215046
Cunnane, C. (1973). A particular comparison of annual maxima and partial duration
series methods of flood frequency prediction. J. Hydrol. 18 257–271.
de Haan, L. and Ferreira, A. (2006). Extreme Value Theory: An Introduction. Springer,
New York. MR2234156
de Valk, C. (1993). Estimation of marginals from measurements and hindcast data.
WL—Delft Hydraulics Report H1700.
Diebolt, J., Guillou, A., Naveau, P. and Ribereau, P. (2008). Improving probability-
weighted moment methods for the generalized extreme value distribution. REVSTAT
6 35–50. MR2386298
Dombry, C. (2013). Maximum likelihood estimators for the extreme value index based
on the block maxima method. Available at arXiv:1301.5611.
Drees, H. (1998). On smooth statistical tail functionals. Scand. J. Stat. 25 187–210.
MR1614276
Drees, H., de Haan, L. and Li, D. (2003). On large deviation for extremes. Statist.
Probab. Lett. 64 51–62. MR1995809
Gumbel, E. J. (1958). Statistics of Extremes. Columbia Univ. Press, New York.
MR0096342
24 A. FERREIRA AND L. DE HAAN