On Probability Limits For Phase II Control Chart
On Probability Limits For Phase II Control Chart
On probability limits
for phase II S control chart
Summary - The Shewhart S charts are based on 3 sigma limits. The control limit
constant 3 is due to the normal approximation to the underlying distribution namely
the chi distribution, which is rather skewed. In this paper, we critically review the 3-
sigma control limits used for Phase II S charts; we examine the ARL properties. Based
on our study, we recommend the adoption of either of the equally well performing
ARL-Unbiased design or the Two-Point design for Phase II S charts in order to detect
both quality improvement and degradation.
1. Introduction
which will help to search for special causes that may result in a smaller process
spread. Chen et al. (2001) presented a new single EWMA-type chart based
on the transformations of sample mean and sample variance in place of two
separate EWMA charts for the mean and variance.
In the S chart, the control statistic plotted is the subgroup standard devia-
tion, i.e. at time i we plot
,
-
- 1 n
Si = . (xi j − x̄i )2 , for i ≥ 1 ,
n − 1 j=1
where {xi1 , xi2 , . . . , xin } denotes a subgroup of size n and its mean is x̄i . We
assume that all observations within and between subgroups are independent and
are from a normal population. The distribution of Si is closely related to a chi
(or chi-square) distribution, which is a right skewed distribution, particularly
for small subgroup sizes. The following lower and upper control limits recom-
mended in textbooks (see for example Montgomery 2005, Devore and Farnum
2005) are based on the 3 sigma limits, derived using a normal approximation
to the chi distribution.
LC L = c4 σ0 − 3σ0 1 − c42 , (1)
U C L = c4 σ0 + 3σ0 1 − c42 , (2)
not specifically address the issue of Phase II control, whereas the current work
does.
We denote throughout this paper that the process standard deviation is
σ = ρσ0 , where ρ > 1, 0 < ρ < 1 and ρ = 1 indicate whether the process
standard deviation has increased, decreased and is in control, respectively. The
average run length ARL(ρ) is therefore a function of ρ and other parameters
brought in by the lower and upper control limits. Suppose that LC L and
U C L have been specified, say LC L = aσ0 and U C L = bσ0 . It is shown in
Appendix A that
1
ARL(ρ) = # $, (3)
1 − F(sb2 /ρ 2 ) − F(sa 2 /ρ 2 )
x
where s = (n − 1)/2 and F(x) = 0 (1/ (s)) u s−1 e−u du.
We assert that the following are the reasons against using the 3-sigma
limits given as equations (1) and (2):
1) The lower control limit LC L must be positive to detect quality improve-
ment. This forces the subgroup size n to be greater than or equal to 6.
2) The S chart with the 3-sigma limits does not achieve the advertised in-
control ARL such as 370.
3) The S chart with the 3-sigma limits is not sensitive to decreases in process
standard deviation. This means that the important objective of detecting
quality improvement will not be immediately realised.
4) The problems listed in 2) and 3) becomes less severe only when n is as
large as 20. However, in practice n is usually small, with n = 5 and
n = 10 being the reasonable choices.
Table 1: Some ARL values of 3-sigma S charts with different subgroup size n. The in-control
ARL equals ARL(1.0).
n=6
n=7
n=8
8
n=9
n=10
n=15
n=20
ln(ARL(rho))
6
4
2
0
3. Probability limits
The study given in the last section convinced us that 3-sigma limits should
not be used for Phase II S charts. It is desirable to obtain the control limits
meeting the criteria (C1) and (C2) mentioned in the Introduction section. The
On probability limits for phase II S control chart 309
criterion (C1) is equivalent to setting that the false alarm rate is equal to (or
less than) a preassigned small fraction α (e.g. α = 0.0027). To be precise,
(C1) requires that the sum of left and right tail probability is equal to α when
process standard deviation σ = σ0 . Denote the left and right tail probability
by αl and αr , respectively, then (C1) is translated to
αl + αr = α . (4)
Given the tail probabilities, we obtain the control limits as the probability limits
as:
/
χα2l , n−1 c1
LC L = σ0 ≡ σ0 ,
n−1 n−1
/
χ1−α
2
r , n−1
c2
U C L = σ0 ≡ σ0 ,
n−1 n−1
where χα2l , n−1 and χ1−α
2
r , n−1
, denoted by c1 and c2 , respectively, are the αl and
1 − αr quantiles of the chi-square distribution with n − 1 degrees of freedom.
Note that once αl is fixed, then αr is determined from (4) and then the above
probability limits LC L and U C L can be obtained. Now, the important question
is: How do we determine αl (or c1 )?
3.1. Half-Half split design
One simple solution to the above problem is the half-half split, i.e. let
αl = αr = α/2, which is the conventionally so-called probability limits design
(see Ryan 2000). However, the underlying chi (or chi-square) distribution is
skewed and this design of the S chart ignores criterion (C2); hence this is not
the best design.
3.2. ARL-Unbiased Design
Criterion (C1) is indeed a restricted design, but criterion (C2) suggests that
we should seek an “optimal” αl such that the resulted S chart has uniformly
small out-of-control ARL values. Unfortunately, such an optimal αl does not
exist as pointed out by the following Theorem.
Theorem. Suppose we have two choices for αl in Equation (4), say αl1 and αl2 .
Let ARL(ρ; αli ) denote the ARL function when we choose the left tail probability
to be αli , where i = 1, 2. If αl1 < αl2 , then
and
ARL(ρ; αl1 ) < ARL(ρ; αl2 ), for ρ > 1 .
A proof of the above theorem is given in Appendix C. This theorem shows
that with the restriction of (4) it is impossible to find a left tail probability αl
such that the resulted S chart has uniformly smaller out-of-control ARL than
any other S chart. In fact, choosing a small αl makes the chart sensitive to
increases in process standard deviation, while choosing a large αl makes the
chart sensitive to decreases in process standard deviation. Therefore, we need
to replace criterion (C2) by a weaker criterion. In line with Zhang et al. (2004,
2005), we use the following one.
(C2 ) The best choice of αl is determined by minimizing
ARL(1 − ε1 ; αl ) + ARL(1 + ε2 ; αl ) ,
4. Comparisons
TPC 0.1
TPC 0.2
TPC 0.3
TPC 0.5
300
ARL(rho)
200
100
0
are visibly different from the curve for Half-Half Split Design—the discrepancy
becomes less obvious when n = 10 than when n = 5. From our observations
we conclude that the ARL-Unbiased and TPC designs bring in more balance
between the increase and decrease in process standard deviation than the Half-
Half Split Design does. Hence we recommend the ARL-Unbiased design or
TPC design in practice when n is small (e.g. n = 4 or 5).
400
TPC 0.3
TPC 0.5
ARL(rho)
200
100
0
Consider the following finer points for choosing between the ARL-Unbiased
design and TPC design.
• They are both for correcting the inherent bias in ARL resulting from the
skew of the distribution of the control statistic for small subgroup sizes.
When subgroup size is large, say n ≥ 30, the skew becomes less severe
and we can simply use the S chart from the Half-Half Split Design. If a
decrease in process standard deviation is not of interest, an upper-sided S
chart (i.e. c1 = 0 and c2 = χ1−α,
2
n−1 ) is suitable.
• ARL-Unbiased design and TPC design are two natural but different methods
to assign control limits for S charts. If ε1 and ε2 are properly chosen,
On probability limits for phase II S control chart 313
TPC design usually results in a quite ARL-Unbiased S chart (for this see
Figure 2 and Figure 3), although strictly speaking it is still ARL-biased.
• TPC design has the disadvantage that practitioners have to choose proper
values of ε1 and ε2 . However, TPC design is useful for achieving balanced
and small ARL values at specified values of ρ such as ρ = 0.5 and ρ = 1.5.
Table 2: Various designs for n = 5, 10 and α = 0.0027. The ε1 = ε2 = ε is for TPC designs.
ε n=5 n = 10
αl c1 c2 αl c1 c2
5. Concluding remarks
The reason why 3-sigma control limits should not be used for Phase II
S charts stems out from the skew nature of the distribution of S for small
subgroup sizes. Control limits must be obtained as probability limits. If the
subgroup size n is small (say not larger than 10), we recommend the use
of ARL-Unbiased design or TPC design that are discussed in Sections 3.2
and 3.3. R programs for ARL-Unbiased design and TPC design are available
in Appendix D.
The suggested probability limit designs require that the true process stan-
dard deviation σ0 is estimated correctly during the Phase I study. For a dis-
cussion of the size of observations to make a good estimate of σ0 in Phase I
study, we refer readers to Chen (1998) and Zhang et al. (2005).
First, let us recall our notation. The process standard deviation is assumed
to
x be σ = ρσ0 ; s = (n − 1)/2, where n is the subgroup size; and F(x) =
0 (1/ (s)) u s−1 e−u du.
The run length of an S chart follows a geometric distribution and its mean
is equal to the reciprocal of the tail probability. We need to obtain the tail
314 LINGYUN ZHANG – K. GOVINDARAJU
we have
(n − 1)Si2
Pr(aσ0 < Si < bσ0 ) = Pr (n − 1)a 2 < < (n − 1)b2
σ0
2
(n − 1)Si2 2
= Pr (n − 1)a 2 < ρ < (n − 1)b 2
σ2
(n − 1)a 2
(n − 1)Si2 (n − 1)b2
= Pr < <
2ρ 2 2σ 2 2ρ 2
= F(sb /ρ ) − F(sa /ρ ) .
2 2 2 2
Thus,
1
ARL(ρ) = # $.
1 − F(sb2 /ρ 2 ) − F(sa 2 /ρ 2 )
The first equation in (5) is the requirement of criterion (C1). The following
is an explanation for the second equation in (5).
In Section 1 we defined that F(x) = 0x (1/ (s)) u s−1 e−u du. By (3), the
ARL function is
1
ARL(ρ) = #
c2
c1
$ ,
1− F 2ρ 2
−F 2ρ 2
which has the same monotonicity as the function g(ρ) = F( 2ρc22 )− F( 2ρc12 ) does,
and which attains its maximum at where g(ρ) does. The derivative of g(ρ) is
c2 c2 c1 c1
g (ρ) = F − 3 − F − 3
2ρ 2 ρ 2ρ 2 ρ
c1 c2
1 s − 2ρ 2 s − 2ρ 2
= s−1 2s+1 ce − c2 e .
2 ρ (s) 1
On probability limits for phase II S control chart 315
Let F −1 (·) denote the inverse function of F(·) that is defined in Section
1. Because c1 = χα2l , n−1 and c2 = χ1−α
2
r , n−1
, we have
c1 = 2F −1 (αl ) ,
c2 = 2F −1 (1 − αr ) = 2F −1 (1 − α + αl ) .
We define
F −1 (1 − α + αl ) F −1 (αl )
p(αl ) = F −F , for 0 < αl < α .
ρ2 ρ2
Note that 1− p(αl ) is the tail probability, and its reciprocal is equal to the ARL.
To prove the theorem, it is equivalent to show that: p(αl ) is monotonically
increasing as αl increases, if ρ > 1; and p(αl ) is monotonically decreasing as
αl increases, if 0 < ρ < 1. Again, we resort to differentiation. Denote the
derivative of F(·) by f (·), and then the derivative of p(αl ) is
F −1 (1 − α + αl ) 1 1
p (αl ) = f
ρ2 ρ f F (1 − α + αl )
2 −1
F −1 (αl ) 1 1
− f
ρ2 ρ f F (αl )
2 −1
F −1 (1−α+αl ) F −1 (αl )
f ρ2
f F −1 (αl ) − f ρ2
f F −1 (1 − α + αl )
1
= .
ρ2 f F −1 (αl ) f F −1 (1 − α + αl )
316 LINGYUN ZHANG – K. GOVINDARAJU
Appendix D: R Programs
obj.func<-function(k)
{y<- pgamma(q=my.s*log(k)/(k-1), shape=my.s) -
pgamma(q=my.s*k*log(k)/(k-1), shape=my.s) - (1 - my.alpha)
y
}
while(1)
{midp<- (p1 + p2)/2
On probability limits for phase II S control chart 317
result<- obj.func(k=midp)
if(abs(result) <= 1e-8) break
else
{ if (result > 1e-8) p1<-midp
else p2<-midp
}
}
my.s<-(n-1)/2
my.obj.fun<-function(x)
{ my.c1<- x
my.c2<- 2*qgamma(p=1-my.alpha+pgamma(my.c1/2,shape=my.s),shape=my.s)
my.re<- 1.0/( 1 - ( pgamma(my.c2/2.0/(1-my.eps1)**2, shape=my.s) -
pgamma(my.c1/2.0/(1-my.eps1)**2,shape=my.s))) +
1.0/( 1 - ( pgamma(my.c2/2.0/(1+my.eps2)**2, shape=my.s) -
pgamma(my.c1/2.0/(1+my.eps2)**2, shape=my.s)))
my.re
}
Acknowledgments
We thank the two anonymous referees for their critical comments and useful suggestions.
REFERENCES
Acosta-Mejia, C. A. (1998) Monitoring Reduction in Variability with the Range, IIE Transactions,
30, 515–523.
Champ, C. W. and Lowry, C. A. (1994) Adjusting the S-Chart for Detecting Both Increases and
Decreases in the Standard Deviation, Proceedings of the Decision Sciences Institute Annual
Conference, Honolulu, Hawaii, November 20-22, Vol. 3, 2112–2114.
Chen, G. (1998) The Run Length Distributions of the R, s and s 2 Control Charts when σ is Estimated,
The Canadian Journal of Statistics, 26, 311–322.
Chen, G., Cheng, S. W. and Xie, H. (2001) Monitoring Process Mean and Variability With One
New EWMA Chart, Journal of Quality Technology, 33, 223–233.
Devore, J. and Farnum, N. (2005) Applied Statistics for Engineers and Scientists, 2nd ed., Curt
Hinrichs.
Montgomery, D. C. (2005) Introduction to Statistical Quality Control, 5th ed., John Wiley & Sons,
Inc.
Pignatiello J. J. Jr., Acosta-Mejia C. A. and Rao, B. V. (1995) The Performance of Control
Charts for Monitoring Process Dispersion, Proceedings of the 4th Industrial Engineering
Research Conference, 320–328.
R development core team (2006) R: A language and environment for statistical computing. R Foun-
dation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL http://www.R-
project.org.
Ryan, T. P. (2000) Statistical Methods for Quality Improvement, 2nd ed., Wiley, New York, NY.
Woodall, W. H. and Montgomery D. C. (1999) Research Issues and Ideas in Satistical Process
Control, Journal of Quality Technology, 31, 376–387.
Zhang, L., Govindaraju, K., Bebbington, M. and Lai, C. D. (2004) On the Statistical Design of
Geometric Control Charts, Quality Technology and Quantitative Management, 1 (2), 233–
243.
Zhang, L., Bebbington, M. S., Lai, C. D. and Govindaraju, K. (2005) On Statistical Design of
the S 2 Control Chart, Communications in Statistics–Theory and Methods, 34, 229–244.