Two-Parameter Rayleigh Distribution: Different Methods of Estimation
Two-Parameter Rayleigh Distribution: Different Methods of Estimation
Methods of Estimation
∗
Department of Statistics, St. Anthony’s College, Shillong, Meghalaya, India. Email:
sanku [email protected]
†
Department of Mathematics, Jones Hall, The College of William & Mary, Williamsburg, Virginia, USA.
Email: [email protected]
‡
Department of Mathematics and Statistics, Indian Institute of Technology, Kanpur, India. Email:
[email protected]
1
2
Abstract
Key Words and Phrases: Maximum likelihood estimators; method of moment estimators;
L-moment estimators; least squares estimators; weighted least squares estimators; percentile
1 Introduction
Lord Rayleigh (1880) introduced the Rayleigh distribution in connection with a problem in
the field of acoustics. Since then, extensive work has taken place related to this distribution
in different areas of science and technology. It has some nice relations with some of the well
known distributions like Weibull, chi-square or extreme value distributions. An important
characteristic of the Rayleigh distribution is that its hazard function is an increasing function
of time. It means that when the failure times are distributed according to the Rayleigh
law, an intense aging of the equipment/ item takes place. Estimations, predictions and
inferential issues for one parameter Rayleigh distribution have been extensively studied by
several authors. Interested readers may have a look at the book by Johnson, Kotz and
Balakrishnan (1994) for an excellent exposure of the Rayleigh distribution, and see also
Abd-Elfattah, Hassan and Ziedean (2006), Dey and Das (2007), Dey (2009) for some recent
references.
In this paper we consider two-parameter Rayleigh distribution; one scale and one location
parameter, and it has the following probability density function (PDF)
2
f (x; λ, µ) = 2λ(x − µ)e−λ(x−µ) ; x > µ, λ > 0. (1)
Here λ and µ are the scale and location parameters respectively. Interestingly, although ex-
tensive work has been done on one-parameter Rayleigh distribution, not much attention has
been paid to two-parameter Rayleigh distribution, although Johnson, Kotz and Balakrish-
nan (1994) briefly mentioned about the two-parameter Rayleigh distribution in their book.
Recently, Khan, Provost and Singh (2010) considered two-parameter Rayleigh distribution
The main aim of this paper is to consider different estimation procedures of the two-
parameter Rayleigh distribution both from the Bayesian and frequentist points of view. We
first consider the most natural frequentist’s estimators namely the maximum likelihood esti-
mators (MLEs). The MLEs of the two-parameter Rayleigh distribution cannot be obtained
in explicit forms. They can be obtained by solving a one dimensional optimization process.
We have provided a very simple iterative technique which can be used to compute the MLEs
of the unknown parameters. The two-parameter Rayleigh distribution does not satisfy the
standard regularity conditions of Cramer. In fact, some elements of the expected Fisher in-
formation matrix are not finite. Therefore, the standard asymptotic normality results of the
maximum likelihood estimators do not hold here. We provide the asymptotic distribution
Although, the MLEs do not have explicit forms, the method of moment estimators
(MMEs) can be obtained in explicit forms. We provide the MMEs and study their asymp-
totic properties. We also consider other estimators, namely least squares estimators, weighted
least squares estimators, percentile estimators and L-moment estimators. We further con-
sider the Bayes estimators of the unknown parameters under the assumptions of independent
gamma and uniform priors on the scale and shape parameters respectively. It is observed
that the Bayes estimators under the squared error loss function cannot be obtained in closed
forms. We have suggested use of importance sampling procedure to compute simulation con-
sistent Bayes estimate and the associated credible interval. We compare the performances
of the different methods using extensive computer simulations. Finally, we analyze a data
Rest of the paper is organized as follows. In Section 2, we provide the MLEs of the
unknown parameters and discuss their asymptotic properties. The MMEs and their asymp-
totic properties are presented in Section 3. Percentile estimators and least squares estimators
5
are provided in Section 4 and Section 5 respectively. L-moment estimators are provided in
Section 6. Bayes estimators are presented in Section 7. In Section 8 we present the Monte
Carlo simulation results. The analysis of a real data set is provided in Section 9, and finally
conclusions appear in Section 10.
Let x1 , · · · , xn be a random sample of size n from (1), then the log-likelihood function l(µ, λ)
without the additive constant for µ < x(1) = min{x1 , · · · , xn } can be written as
n
X n
X
l(µ, λ) = C + n ln λ + ln(xi − µ) − λ (xi − µ)2 . (3)
i=1 i=1
b
From (4), we obtain the MLE of λ as a function of µ, say λ(µ), as
b n
λ(µ) = Pn 2
. (6)
i=1 (xi − µ)
b
Substituting λ(µ) in (3), we obtain the profile log-likelihood function of µ without the addi-
tive constant as
Xn Xn Xn
2 (xi − µ)
g(µ) = −n ln( (xi − µ) ) + ln(xi − µ) = ln Pn 2
. (7)
i=1 i=1 i=1 i=1 (x i − µ)
It can be shown that the maximum of (7) can be obtained as a fixed point solution of the
following equation
h(µ) = µ, (8)
6
where
n
X n
X n
X
2
h(µ) = 2 (xi − µ) × (xi − µ) × (xi − µ)−1 . (9)
i=1 i=1 i=1
Once µ bM LE = λ(b
bM LE is obtained, the MLE of λ, λ b µM LE ) can be easily obtained. Observe
that very simple iterative technique namely h(µ(j) ) = µ(j+1) , where µ(j) is the j − th iterate,
Since they are not in explicit forms, it is expected that the exact distributions of the MLEs
will not be possible to obtain. We therefore, mainly rely on the asymptotic properties of the
MLEs. The two-parameter Rayleigh distribution does not satisfy the standard regularity
be obtained in explicit form, from Corollary of Theorem 3 of Smith (1985), it follows that
V (b
µ − µ) can be well approximated by the inverse of the observed information, and in this
case it is
1
V (b
µM LE − µ) ≈ 1
Pn . (11)
2µ + n i=1 (xi − µ)−2
Therefore, 100(1 − α)% approximate confidence intervals of λ and µ can be obtained as
bM LE 1/2
bM LE ± zα/2 λ√
λ and µ
bM LE ± zα/2 Pn
1
, (12)
1
n 2b
µM LE + n i=1 (xi −µ
bM LE )−2
respectively, where zα/2 is the α/2-th percentile point of the standard normal distribution.
7
bM M E = 1 1 − Γ2 (3/2)
λ and µ b−1/2 Γ(3/2).
bM M E = x̄ − λ (13)
MME
s2
n n
1X 1 X
Here x̄ = xi and s2 = (xi − x̄)2 are the sample mean and sample variance
n i=1 n − 1 i=1
respectively. The exact distributions of λbM M E and µ bM M E are not possible to obtain. The
bM M E and µ
asymptotic distribution of λ bM M E can be obtained. For that we need the following
notations. Suppose the random variable X has a Rayleigh distribution with parameters λ
Then
Now we will provide the asymptotic properties of the MMEs. Since the first two moments
exist, it is immediate that the MMEs are consistent estimators to the corresponding param-
eters. Using the δ-method we can easily obtain the asymptotic distribution of the MMEs.
√ √
b d
n(b
µM M E − µ), n(λM M E − λ) −→ N2 [0, Σ], (17)
d
where ‘−→’ means convergence in distribution, and Σ is a 2 × 2 matrix and it has the
following form;
Σ = C−1 AC−1 ,
and if we denote
a a12 c c12
A = 11 , C = 11
a21 a22 c21 c22
8
Γ(3/2)
then for c1 = p and c2 = 1 − Γ2 (3/2)
2
1 − Γ (3/2)
4 L-Moments Estimators
In this section we provide the L-moments estimators, which can be obtained as the linear
Hosking (1990), and it is observed that the L-moments estimators are more robust than the
usual moment estimators. The L-moment estimators are also obtained the same way as the
ordinary moment estimators, i.e. by equating the sample L-moments with the population
L-moments.
In this case the L-moments estimators can be obtained by equating the first two sample
L-moments with the corresponding population L-moments. The first two sample L-moments
are
n n
1X 2 X
l1 = x(i) , and l2 = (i − 1)x(i) − l1 , (18)
n i=1 n(n − 1) i=1
and the first two population L-moments are
1
λ1 = E(X1,1 ) = E(X) = Γ(3/2) + µ (19)
λ1/2 √
1 Γ(3/2) 2−1
λ2 = (E(X2:2 ) − E(X2:1 )) = × √ . (20)
2 λ1/2 2
Here Xm,n denotes the m-th order statistic of a sample of size n. Therefore, the L-moment
9
Kao (1958, 1959) had suggested the estimators based on percentiles, when the distribution
function is in closed form. Kao (1958) had proposed and then successfully implemented, see
Kao (1959), the percentile based estimators for Weibull distribution. Later on, it has been
used quite successfully for other distributions, where the distribution functions are in com-
pact form. Gupta and Kundu (2001) and Kundu and Raqab (2005) used the percentile based
estimators and compared with the other estimators for generalized exponential distribution
The main advantage of the percentile based estimators is that in many situations, they
can be obtained in explicit forms. The percentile based estimators are mainly obtained by
minimizing the Euclidean distance between the sample percentile and population percentile
points. Since the cumulative distribution function of the two-parameter Rayleigh distribution
denotes some estimate of F (x(i) ; µ, λ). Although, several estimates of p’s are available in the
i
literature, see for example Mann, Schafer and Singpurwalla (1974), pi = seems to an
n+1
i
acceptable choice. We have also used pi = in this paper. The minimization of (23)
n+1
10
can be performed explicitly, using the linear regression technique and the percentile based
The least squares estimators and weighted least squares estimators (LSEs) were proposed by
Swain, Venkataraman and Wilson (1988) to estimate the parameters of a Beta distribution.
The weighted least squares estimators (WLSEs) of the unknown parameters can be ob-
tained by minimizing
n
X 2
j
wj F (x(j) ) −
j=1
n+1
with respect to µ and λ, where wj denotes the weight function at the j-th point. The weights
are taken to be
1 (n + 1)2 (n + 2)
wj = = .
V (F (X(j) ) j(n − j + 1)
11
Therefore, in this case, the WLSEs of µ and λ, say, µ̂W LSE and λ̂W LSE , respectively, can be
obtained by minimizing
n
X 2
(n + 1)2 (n + 2) 2 j
1 − exp(−λ(xj − µ) ) −
j=1
j(n − j + 1) n+1
7 Bayes Estimators
So far we have discussed the estimation of the unknown parameters using different frequen-
tists method. In this section we consider the Bayesian inference of the unknown parameters
of the two-parameter Rayleigh distribution when both the parameters are unknown. We
mainly discuss the Bayes estimates and the associated credible intervals. Although, we have
developed the estimates based on the squared error loss functions, other loss functions can
be easily incorporated.
It is well known that when the both the parameters are unknown, the conjugate priors
do not exist. Moreover, since all the elements of the expected Fisher information matrix are
not finite, Jeffreys’ priors also do not exist. We consider the following fairly general priors
on λ and µ.
Note that for known µ, λ has a conjugate gamma prior, and therefore, we consider the
Now based on the observed sample, the joint posterior density function of λ and µ
becomes;
Pn 2 Q
λn+a−1 e−λ(b+ i=1 (xi −µ) ) ni=1 (xi − µ)
π(µ, λ|Data) = R ∞ R x(1) n+a−1 −λ(b+Pn (x −µ)2 ) Qn ; (27)
0 −∞
λ e i=1 i
i=1 (xi − µ)dµdλ
0 < λ < ∞, −∞ < µ < x(1) .
Therefore, if we want to compute the Bayes estimate of any function of λ and µ, say θ(λ, µ),
Clearly, for general θ(λ, µ) it cannot be obtained in explicit form. Numerical procedure,
or some approximation like Lindley’s approximations may be used to compute (28). But
unfortunately, by this method although the Bayes estimate can be obtained, the associated
Due to this reason, we propose to use the importance sampling method to compute the
Bayes estimate and also to compute the associated credible interval. It is known that by
proper importance sampling method, simulation consistent Bayes estimate and the associ-
ated credible interval can be constructed (Kundu and Pradhan (2009)). To implement the
importance sampling procedure we re-write (28) as follows;
where
Pn
(b + − µ)2 )n+a n+a−1 −λ(b+Pni=1 (xi −µ)2 )
i=1 (xi
f1 (λ|µ, Data) = λ e ; λ > 0, (30)
Γ(n + a)
Pn
k 2( i=1 xi − µ)
f2 (µ|Data) = Pn Pn 2 n+a ; −∞ < µ < x(1) (31)
(µ2 − 2µ i=1 xi + b + i=1 xi )
n n
!n+a−1
X X
k = (n + a − 1) x2(1) − 2x(1) xi + b + x2i
i=1 i=1
13
and Qn . P
i=1 (xi − µ) 2( ni=1 xi − µ) for µ < x(1)
h(µ) = (32)
0 for µ ≥ x(1) .
Note that, f1 (λ|µ, Data) is a gamma density function with the shape and scale parameters
P
as n + a and b + ni=1 (xi − µ)2 respectively. f2 (µ|Data) is a proper density function and it
Now we propose to use the following procedure to compute the Bayes estimate of g(λ, µ)
Now we would like to construct the highest posterior density credible interval (HPD)
squared error loss function can be obtained from the generated sample {(µ1 , λ1 ), · · · , (µN , λN )}
14
as follows. Let
h(µi )
w i = PN ,
i=1 h(µ i )
and θ1 = θ(λ1 , µ1 ), · · · , θN = θ(λN , µN ). Rearrange, (θ1 , w1 ), · · · , (θN , wN ) as follows
{(θ(1) , w[1] ), · · · , (θ(N ) , w[N ] )}, where θ(1) < · · · < θ(N ) . Note that, w[i] ’s are not ordered,
they are associated with θ(i) ’s. Then a simulation consistent Bayes estimate of θp can be
Now using the above procedure, a 100(1-α)% credible interval can be obtained as
N1−α
X
(θbδ , θbδ+1−α ), for δ = w[1] , w[1] + w[2] , · · · , w[i] . (36)
i=1
Therefore, a 100(1-α)% HPD credible interval of θ becomes, ((θbδ∗ , θbδ∗ +1−α ), where δ ∗ is such
8 Numerical Simulations
In this section we present some experimental results which have been performed to see the
effectiveness of the different methods for different sample sizes. We mainly compare different
methods in terms of the biases and mean squared errors. When we compare the different
frequentists methods with the Bayes estimate we have taken non-informative prior namely
(very large). We present the average estimates of λ and µ based on 10,000 replications based
on different methods and the associated mean squared errors are presented in parentheses
below. The results are presented in Tables 1 and 2.
15
Some of the points are quite cleat from Tables 1 and 2. In all the cases it is observed
that as sample size increases, the average biases and the mean squared errors decrease. It
Now comparing the performances of the different estimators, it is observed that among
all the estimators presented here, the L-moment estimators have the smallest biases for both
µ and λ. The L-moment estimators perform better than the moment estimators in all the
cases considered in terms of biases and mean squared errors. In case of µ the L-moment
estimators behave better than percentile estimators and least squares estimators in terms
of biases and mean squared errors. In case of λ the mean squared errors of the L-moment
estimators are almost same with the percentile and least squares estimators.
The biases of the Bayes estimators are slightly larger than the L-moment estimators
although they have smaller mean squared errors. The performances of the MLEs are also
quite satisfactory, although it has slightly higher biases and mean squared errors compared
to the Bayes estimators. Comparing all these we propose to use the MLEs or Bayes estima-
tors for all practical purposes in estimating the parameters of the two-parameter Rayleigh
distribution.
9 Data Analysis
In this section we present analysis for illustrative purposes using strength data set originally
reported by Badar and Priest (1982). It describes the strength measured in GPA for single
carbon fibers and impregnated 1000-carbon fiber tows. The data are presented in Table 3.
Preliminary data analysis indicates that the data are positively skewed. A device called
scaled TTT (total time on test) transform and its empirical version are relevant to study the
16
shape of the hazard function of a distribution function. For a family with survival function
S(y) and distribution function F (y) with positive supports, the scaled TTT transform is
Z F −1 (u)
defined for 0 < u < 1, as φF (u) = HF (u)/HF (1), where HF (u) =
−1 −1 −1
S(y)dy. The
0
empirical version of the scaled TTT transform for j = 1, . . . , n, is given by
j
!, n !
X X
φn (j/n) = Hn−1 (j/n)/Hn−1 (1) = x(i) + (n − j)x(j) x(i) ,
i=1 i=1
see Aarset (1987). Aarset (1987) showed that the scaled TTT transform is concave (convex)
if the hazard function is increasing (decreasing). To check the shape of the empirical hazard
function, we have plotted the scaled TTT transform in Figure 1. Since it is a concave func-
tion, it indicates that the hazard function of the distribution will be an increasing function.
First we would like to compute the maximum likelihood estimators of the unknown
parameters. The profile log-likelihood g(µ) of µ as provided in (7) is provided in Figure 2.
17
Clearly, it is an unimodal function, and the MLE of µ is obtained as 0.560, which maximizes
g(µ). The MLE of λ becomes 0.648. The 95% confidence intervals of λ and µ based on MLEs
The Kolmogorov-Smirnov distance between the fitted and empirical cumulative distribution
functions is 0.128 and the associated p values is 0.207. Therefore, based on the MLEs we
cannot reject the null hypothesis that the data are coming from a two-parameter Rayleigh
0.562 0.564 0.729 0.802 0.950 1.053 1.111 1.115 1.194 1.208
1.216 1.247 1.256 1.271 1.277 1.305 1.313 1.348 1.390 1.429
1.474 1.490 1.503 1.520 1.522 1.524 1.551 1.551 1.609 1.632
1.632 1.676 1.684 1.685 1.728 1.740 1.761 1.764 1.785 1.804
1.816 1.824 1.836 1.879 1.883 1.892 1.898 1.934 1.947 1.976
2.020 2.023 2.050 2.059 2.068 2.071 2.098 2.130 2.204 2.262
2.317 2.334 2.340 2.346 2.378 2.483 2.683 2.835 2.835
18
0.9
0.8
0.6
0.5
0.4
0.3
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
u
−325
−326
µ Profile log−likelihood of
−327
−328
−329
−330
−331
0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6
µ
distribution.
as 0.543 and 0.629 respectively. The 95% symmetric credible intervals of λ and µ are (0.445,
0.798) and (0.530, 0.562) respectively. The Kolmogorov-Smirnov distance between the fitted
and empirical cumulative distribution functions is 0.130 and the associated p values is 0.191.
Therefore, Bayes estimates also provide satisfactory goodness of fit of the two-parameter
10 Conclusions
In this paper we have considered different estimation procedures for estimating the unknown
considered the maximum likelihood estimators, the method of moment estimators, the L-
moment estimators, percentile estimators, least squares estimators and the Bayes estimators.
It is not possible to compare different methods theoretically, and we have used some simu-
lations to compare different estimators. We have compared different estimators mainly with
respect to biases and mean squared errors. It is observed that the Bayes estimators with
non-informative priors work very well in terms of biases and mean squared errors, although
practical purposes.
20
Acknowledgments
The authors would like to thank the editors and the reviewers who helped to substantially
improve the paper. The authors are thankful to Professor R.G. Surles for providing the
strength data.
References
likelihood estimators under different censored sampling schemes for Rayleigh distribution.
[2] Aarset, M.V. (1987). How to identify a bathtub shaped hazard rate?. IEEE Transactions
[3] Badar, M.G. & Priest, A.M. (1982). Statistical aspects of fiber and bundle strength in
hybrids composite. Progress in Science and Engineering Composites (T. Hayakashi, K.
[4] Dey, S. (2009). Comparison of Bayes Estimators of the parameter and reliability function
for Rayleigh distribution under different loss functions. Malaysian Journal of Mathemat-
[5] Dey, S. & Das, M.K. (2007). A Note on Prediction Interval for a Rayleigh Distribution:
[6] Gupta, R.D. & Kundu, D. (2001). Generalized exponential distributions: different meth-
ods of estimation. Journal of Statistical Computation and Simulation, 69, 315 - 338.
21
[7] Hosking, J.R.M. (1990). L-moment: analysis and estimation of distributions using linear
combinations of order statistics. Journal of Royal Statistical Society, Ser. B, 52, 105 -
124.
[8] Johnson, N.L., Kotz, S. & Balakrishnan, N. (1994). Continuous Univariate Distribution
[9] Kao, J.H.K. (1958). Computer methods for estimating Weibull parameters in reliability
studies. Trans. IRE Reliability Quality Control, 13, 15 - 22.
[10] Kao, J.H.K. (1959). A graphical estimation of mixed Weibull parameters in life testing
[11] Khan, H.M.R., Provost, S.B. & Singh, A. (2010). Predictive inference from a two-
[12] Kundu, D. & Raqab, M.Z. (2005). Generalized Rayleigh distribution: different methods
[13] Rayleigh, J.W.S. (1880). On the resultant of a large number of vibrations of the some
pitch and of arbitrary phase. Philosophical Magazine, 5-th Series, 10, 73 - 78.
[14] Smith, R.L. (1995). Maximum likelihood estimation in a class of non-regular cases.
[15] Swain, J.J., Venkataraman, S. & Wilson, J.R. (1988). Least squares estimation of dis-