0% found this document useful (0 votes)
15 views12 pages

04 - Mean, SD & Peer - David Plaut QC Article - Without Notes

Mean sd peer

Uploaded by

AnamariaCiobica
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views12 pages

04 - Mean, SD & Peer - David Plaut QC Article - Without Notes

Mean sd peer

Uploaded by

AnamariaCiobica
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

Mean & Peer Group Data


A Discussion of Three Common Questions
in Quality Control
David Plaut

Three related questions are frequently asked concerning QC data:


1. How many data points are needed to establish a mean for my new lot of control?
2. Once I have a mean how do I establish an SD?
3. In an inter-laboratory report, how many laboratories means are sufficient to feel secure about
using that mean for peer comparison?

No one would argue that the simple answer to these questions is “the more the better.” While that is
indeed a simple answer it does not really answer our question - we are looking for a useful and practical
number.

My first serious study of the first question began about 30 years ago when I was working with a short-
dated whole blood-based blood gas control. Since the customary answer of 20 to 30 points (*) made it
expensive and impractical (since the product lasted only 60 days, one-third to one-half of the life of the
material would be depleted before the mean was established and ready to use for quality monitoring).

In order to answer the question from a practical standpoint point, the group I was working with in Miami
examined the data from several lots of three levels of blood gas control using a form similar to that
shown in Figure 1. The data in the Cumulative Mean column indicates that by and large the mean for
these data becomes stable after some 8 data points. It was this study that was published in American
Association for Respiratory Therapy as an abstract for the national meeting in 1978 (1).

Cumulative Cumulative Cumulative


Run ID Result Mean Run ID Result Mean Run ID Result Mean
1 12.1 12.1 11 12.2 12.0 21 11.0 12.0
2 11.8 11.9 12 11.6 12.0 22 11.4 12.0
3 13.2 12.4 13 12.0 12.0 23 12.0 12.0
4 11.6 12.2 14 11.7 12.0 24 11.4 12.0
5 12.6 12.3 15 11.7 12.0 25 12.1 12.0
6 12.0 12.2 16 11.9 12.0 26 11.4 12.0
7 12.0 12.2 17 12.2 12.0 27 12.3 12.0
8 11.4 12.1 18 12.9 12.0 28 13.6 12.0
9 11.8 12.0 19 12.6 12.1 29 11.4 12.0
10 11.9 12.0 20 12.9 12.1 30 11.2 12.0

Mean (8) 12.1


Mean (30) 12.0
Figure 1. Calculation of Laboratory Mean.

(*) I had occasion recently to go back and look at the original Levey and Jennings paper from 1954 and
found that while they referenced earlier work on charting procedures and statistics, they did not give any
reason for why they chose to run their home-made pooled serum control in duplicate over 20 days!

Rev. 3/20/2005 1
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

This material is adapted and enlarged from articles that appeared in Advance in March and April, 2005.
The author thanks Lynn Nace and the publishers of Advance for allowing me to use this material.

There are two aspects of this study that the group felt were important enough to explore in more detail.
First we knew that blood gas instruments, using no true reagents or reactions in the usual sense and
calibrated as they were with primary standards, were more precise than most other instruments in the
laboratory. Secondly it seemed to us from the data on the three levels of control for a number of
parameters including oxygen, carbon dioxide and pH that there seemed little, if any, correlation between
the SD and the CV and the number of replicates to establish the mean.

To pursue these aspects of the question how Example 1 Example 2 Example 3


Lab
many points to establish a mean we went in ID Mean Cum Mean Mean Cum Mean Mean Cum Mean
two different directions. On one hand we 1 35.4 33.8 36.2
began looking at data from chemistry tests, 2 34.6 33.3 32.0
including the routine chemistries such as 3 35.7 33.1 32.4
Glucose, BUN, AST etc. and immunoassays 4 34.8 35.1 33.7 33.5 33.6 33.6
such as TSH as well as protein and LDH 5 36.2 35.3 36.2 34.0 41.8 35.2
Isoenzyme electrophoresis. Again but some- 6 35.9 35.4 34.9 34.2 34.4 35.1
what to our surprise, we found that as a rule 7 35.4 35.4 37.6 34.7 33.9 34.9
eight replicates (often fewer) were all that 8 35.4 35.4 37.4 35.0 34.8 34.9
9 34.9 35.4 32.9 34.8 36.6 35.1
were needed to obtain a mean. These
10 34.9 35.3 35.8 34.9 32.2 34.8
studies were extended from work in Miami 11 35.3 35.3 37.1 35.1 30.0 34.4
to include hospital data from Phoenixville, PA 12 35.1 35.3 32.1 34.8 35.3 34.4
and Washington, DC and were presented at 13 35.5 35.3 35.1 34.8 36.3 34.6
a later AACC meeting in 1997. Represent- 14 34.6 35.3 33.7 34.8 35.2 34.6
ative data appear in Figure 2. 15 34.7 35.2 36.0 34.9 33.7 34.6
16 35.9 35.3 36.0 34.9 34.8 34.6
The data again suggested that eight 17 34.5 35.2 34.8 34.9 32.3 34.4
replicates were sufficient to establish a mean 18 35.6 35.2 34.1 34.9 32.7 34.3
and, interestingly to us, the eight is 19 35.2 35.2 33.0 34.8 36.7 34.5
20 36.6 35.3 35.2 34.8 30.1 34.3
sufficient (or as we liked to say “Eight is
21 35.2 35.3 35.1 34.8 35.3 34.3
enough”) seemed to be true regardless of
22 34.9 35.3 36.8 34.9 34.8 34.3
the mean) and regardless, insofar as 23 35.3 35.3 37.0 35.0 32.5 34.2
laboratory data were concerned, of the CV 24 35.0 35.3 36.0 35.0 34.2 34.2
(see Fig 2). When the t-test was applied to 25 34.4 35.2 34.9 35.0 33.6 34.2
the means at 8 and 30 replicates in these 26 35.1 35.2 36.1 35.1 32.8 34.2
examples no statistical difference was found. 27 33.6 35.2 33.8 35.0 40.9 34.4
28 35.8 35.2 33.6 35.0 33.8 34.4
29 34.0 35.2 35.1 35.0 40.0 34.6
30 35.1 35.2 32.0 34.9 32.4 34.5

Mean 35.16 34.88 34.52


SD 0.63 1.57 2.76
CV 1.80 4.50 7.99
Range Min 35.13 33.48 33.57
Range Max 35.43 35.07 35.21
Figure 2

Rev. 3/20/2005 2
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

Our second approach to this question


was to use the Minitab software program y
which could generate considerable • 68.3% of values are within
quantities of data quite similar to +/- 1SD of the mean
laboratory data – data that fit a Gaussian • 95.5% of values are within
(bell-shaped) curve (see Fig 3). Using +/- 2SD of the mean

Minitab we could ask for as many data • 99.7% of values are within
+/- 3SD of the mean
points that were normally distributed
(bell-shaped) as we wished, with 0 x
whatever mean and SD we wanted and Figure 3
look at the cumulative means for the
data sets. These data sets were generated hundreds of times with various means and SDs - all returned
the same conclusion “Eight is enough.”

Appendix A is an expanded set of data that lead to the same conclusion. In the table I have added the
% difference of the given mean from the mean after 30 replicates. Also in the table is the SDI which is
defined as

meann - mean 30
SDI = -----------------------
SD30

In general if the SDI is less than 1.0 there is little difference between the means. Another way to
evaluate the SDI in the table is to compare it to the SD of the data set for the mean when n = 30. If SDI
is less than SD for n = 30, there is little chance the means are different. Perhaps the most significant test
though is the student t-test. This value was calculated where the largest difference in the mean when n
was less than 30 and the mean when n = 30. In each case the t value was not statistically significant.

As you may wish to explore these ideas on your own, we offer two simple ways that you can do this:

First take your own quality control data for the tests you wish to study and set up an Excel sheet as you
see in Figures 1 or Table 1. The formulas are given in Appendix B. Enter the raw data in the data point
column and the formulas as shown in Appendix B. Should you conclude that eight (or 7 or 9) points are
sufficient, save the data in case you want to demonstrate it to others (e.g. laboratory inspector). It is
quite easy to generate quite a few means from n = 8 using a set of 30 points. Again this is outlined in
the appendix.

The second approach is to use a nifty tool available in Microsoft Excel – the random number generator.
In essence this algorithm will provide you with as many, say 30, data points with a given mean and SD.
Keep in mind since these are random (Gaussian) numbers the mean and SD returned will differ ever so
slightly from the one you type in. But the exercise is certainly worth the few minutes it takes. Again we
suggest you save the results.

Although the data certainly indicate that 8 replicates will suffice to provide a good estimate of the mean
for a new lot of control, there is an additional question to be discussed relating to this: Over what period
do I collect these eight data points? It would not be wise to obtain all 8 data points within a single run.
The best answer it over 8 days, although 3 or 4 days (or runs) will usually suffice. During this period
when the new lot of control is being phased in, good practice indicates that the ‘old’ control is being run
concurrently. By running the old and new controls in parallel any bad runs should be detected by the old
control and the data on the new control can be discarded. Another way that an outlier from the set of 8
can be detected is to use the following procedure.

Rev. 3/20/2005 3
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

Detecting and Rejecting Outliers When Establishing a New Control Mean


1. Collect the 8 data points from the new control
2. Calculate the mean and SD for these data points
3. Any single point that is two SDs from the mean may
be rejected. Should more than one value fall outside 2 SD
there is a severe problem somewhere (reagents, etc)
NOTE: This is NOT intended as a daily QC monitoring system.
It is used only when establishing the new mean for a control.

The two approaches to the answer to the question how many points does it take to establish the mean
for a new lot of control, are pragmatic and based on calculations with data from laboratory QC programs
and similar computer generated data. Another approach is based on the widely accepted method to
compare two means – the t-test. I have mentioned this test earlier when it was used to determine
whether the means of eight replicates were different from the mean when 30 points were used. It can
also be used to help determine n. This application is discussed in Statistical Methods in Analytical
Chemistry by Meier and Zund3 titled in their section “Number of Determinations.” As it is somewhat more
mathematical, I have added it as Appendix D.

Moving on to the second question 2: Once I have a mean how to I establish an SD?

When the mean for the new lot of control has been established using say 8 replicates, the next task is to
establish a measure of the imprecision as measured by the SD for the new lot of control in order to set
QC limits. Again the traditional number is 20 – 30 replicates of the control. I have not found any real
statistical discussion of the source of these values for n. In the discussion above we developed the idea
that a mean for the new lot of the control can be set with as few as 8 data points. Few of us would
suggest using 8 replicates of any measurement in the clinical laboratory to establish a measurement of
the inherent variation (random error, SD) in an analytical system. The list of sources of random error in
such measurements would include calibration, new lots of reagents and calibrators, maintenance to the
instrument, line voltage fluctuations (that would create both temperature fluctuations and lamp
fluctuations), and variations between operators would be on the list. There are certainly more. Given
this, one has to ask whether even as many as 30 points are sufficient to include all the sources of
random error. In other words is the SD after 30 points really enough. Any one who has not said “My SD
is too tight” may be using an SD that is indeed too tight simply because it was based on too few data
points!

To answer the question of how


SD SD 9
best to establish an SD, I have
Analyte Mean 1 month months SD1/SDn F-ratio
used data from multiple analytes
TSH 0.6 0.03 0.04 0.70 2.05 ns
to illustrate simply that over time
ALP 378 4.60 8.70 0.53 3.58 * SDs can (but now always) vary
Sodium 135 1.00 1.60 0.63 2.56 ns substantially. This would suggest
Urine Creat 235 2.30 6.80 0.34 8.74 * that more than 30 points would
Potassium 6.0 0.04 0.09 0.44 5.07 * often give a better estimate of the
ns = not statistically significant at 5% true total variation (true SD and
* = statistically significant at 5% CV) in the system.

Figure 4

Rev. 3/20/2005 4
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

A few examples of this approach are shown in Figure 4. A larger set is


Before After given in Appendix C using data from chemistry, immunochemistry and
29.4 32.8 coagulation. In the case of Alkaline Phospatase (ALP), Urine Creatinine
27.4 31.6 and Potassium as well as in a number of the cases in Table 2, it is clear
30.5 32.2 that in some cases the SD over a period longer than 30 days is
32.6 31.8 significantly different from the SD from 30 days. Depending on which
32.4 28.9 SD is used, there can be a significant effect on the interpretation of the
33.5 30.7 QC data (e.g. “My SD is too tight!” or “Why am I getting all these
25.6 34.2 rejects?”) Another way to look at this question is to use the Excel
29.5 32.4 program to generate two sets of data say with 20 – 30 data points with
32.2 33.7 slightly different means as might occur when recalibrating or changing
27.8 31.0 reagents.
28.6 29.5
26.6 35.2
This experiment would reproduce the sort of data in Figure 5. Calculate
26.3 32.9
the mean and SD for the first set of data (call it “Before) then combine
28.0 31.5
that with the second set of data and look to see how many of the
28.5 33.0
25.8 28.9
second set would have been rejected using the 2 SD range based on
28.9 34.9 the first set.
29.2 28.2
30.3 31.2 Figure 6 is an example of this with means that differ only by about 1
29.3 38.9 SD. Note that the “After” set has four “outliers” and they are all high.
(The Gaussian curve tells us that from 60 data points we should have 3
MEAN 29.1 32.2 outliers and they should be both above and below the 2 SD limits – our
SD 2.3 2.5 data have nearly 3 times what is statistically likely!) This experiment
CV 7.5 7.8 suggests the reasons the SD (and mean) and why we may want to look
Upper Limit 33.7 for another solution.

Figure 5

45
Given that more than 30 replicates may be
better for establishing and SD (and CV), I
40 suggest the following approach:
35
1. Find the mean for the new lot of control
from the 8 replicates.
30 2. Find the cumulative CV from an
interlaboratory survey (such as the data
25
from the entire previous year on the
20
previous lot of control).
0 5 10 15 20 25 30 35 40 45 3. Using the formula SD (new) = CV
(cumulative) * Mean (new from n = 8).
Figure 6

This new CV contains more of the sources of variation than that from 8 or 20 or 30 replicates and should
be a better overall statistic with which to set the QC limits. The CV is used as it is a measure of relative
precision and thus accounts for minor variation from lot-to-lot of control.

Rev. 3/20/2005 5
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

Turning now to the third question: How many laboratory means are needed for a group comparison?

From the above discussion it should be apparent that the number of lab means needed to find a useful
mean for interlaboratory comparisons is probably fewer than expected. In looking for an answer to this
question we need first to keep in mind not only what the data I’ve discussed above have shown us, but
one of the prime tenets of statistics – the central limit theorem. This theorem simply says that the
distribution of means is narrower than the distribution of the raw data making up the means. See Figure
7. In looking at these data you will note that the range of the group of means 85 – 101) is narrower than
the range of the 5 laboratory means (81 – 102). This concept is plotted in Figure 8. It can also be shown
that the t-value for any of these means is not different from the group mean.

Lab Number of You also see that the distribution


Mean SD 98% Range of the five data sets from the five
No. Data Points
1 30 92.3 2.1 94 - 102 laboratories follows the bell or
2 30 98.0 1.9 88 - 96 Gaussian curve. It is generally
assumed that data on control
3 30 86.8 1.81 89 – 96
material run on a stable instru-
4 30 93.3 2.49 88 - 98
ment using a single lot of reagent
5 30 92.8 2.89 81 - 93 and calibrator without any major
Group - 92.64 4.00 85 - 101 maintenance are distributed in this
manner.
Figure 7
Figure 7 illustrates this for five
means and their ranges from the
Group Range CK data where each of the labor-
atories submitted 30 data points.

I used the same two approaches to


m4 m2 examine the question of how many
m3 m1 m5
laboratory means are needed for a
good estimate of the true mean
(remember, in statistics without
85 90 95 100 105 110 the entire data set we can only
‘estimate’ any statistic such as the
Figure 8 mean, the SD or the range).

Using a data set from a group of laboratories reporting CK on a control (Figure 9), I have first presented
all the unsorted data and in the next column I have calculated the cumulative mean as we did for the
data when establishing the mean for the control in a lab. From the data in the cumulative mean column
you note that at an n of 5 the mean is about 2% different from the mean after 60 some points. This
difference is not statistically significant, nor is it beyond what might be seen from the within run precision
of the data for any of the laboratories nor what might occur after changing reagents or recalibration.

Rev. 3/20/2005 6
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

Cumulative Cumulative Cumulative


Lab ID Result Lab ID Result Lab ID Result
Mean Mean Mean
1 99 99 11 98 100 21 99 100
2 96 98 12 95 100 22 99 100
3 101 99 13 94 100 23 104 100
4 104 100 14 97 100 24 100 100
5 104 101 15 98 100 25 99 100
6 105 101 16 94 100 26 98 100
7 93 100 17 98 100 27 106 100
8 99 100 18 99 100 28 103 100
9 103 101 19 100 100 29 107 100
10 97 100 20 99 100 30 98 100
Figure 9 (Note: data have been rounded to a whole number)

The next table (Figure 10) uses data from an interlaboratory survey where the imprecision was somewhat
higher as well as with the data from the Excel tool box. Figure 11 depicts the individual means of
participating laboratories, the group cumulative mean and percentage difference between cumulative
mean of the participants and the mean after 30 laboratories were included. Again the t-test did not
detect a significant difference between the cumulative mean after five laboratories were included and the
entire set of 30.

Lab Lab Cum % Lab Lab Cum % Lab Lab Cum %


ID Mean Mean Diff ID Mean Mean Diff ID Mean Mean Diff
1 22.7 22.7 8.2 11 28.9 25.1 1.5 21 25.2 24.6 0.5
2 21.8 22.3 10.1 12 19.6 24.7 0.4 22 28.3 24.8 0.1
3 21.5 22.0 11.1 13 25.2 24.7 0.2 23 28.8 25.0 0.8
4 22.5 22.1 10.6 14 22.6 24.6 0.8 24 27.0 25.1 1.2
5 27.3 23.2 6.4 15 26.9 24.7 0.2 25 24.8 25.1 1.1
6 24.8 23.4 5.3 16 26.9 24.9 0.3 26 27.0 25.1 1.4
7 29.9 24.4 1.6 17 24.7 24.8 0.3 27 22.8 25.0 1.1
8 29.4 25.0 0.9 18 23.2 24.8 0.1 28 22.4 24.9 0.7
9 21.1 24.6 0.8 19 21.2 24.6 0.8 29 25.3 25.0 0.7
10 26.5 24.8 0.1 20 25.4 24.6 0.7 30 19.4 24.8 0.0
Figure 10

Rev. 3/20/2005 7
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

35

30

25
Mean Value

20 Lab Mean
Cum Mean
15 % Diff

10

0
1 3 5 7 9 11 13 15 17 19 21 23 25 27 29
Lab ID

Figure 11 (plot of the individual means, the cumulative mean and percentage difference
from the mean after 30 points)

It would seem from our discussion then that using the Excel tool box to generate data which represent
different laboratory’s mean (as opposed to a single laboratory’s data on a control material) would serve
our needs to answer the question of how many lab’s means (data points) are necessary to find the group
mean. Again I suggest we look at Table I for the answer to this question. These data as we have said
earlier indicate that an n of 8 (or even 5) is sufficient to establish a mean to use in interpreting for an
interlaboratory survey.

Note: It is worthwhile to keep in mind when assessing interlaboratory data from a QC program that one
of the main goals is to pass surveys by keeping the QC data within CLIA limits. Thus only the group
mean and the CLIA limits as reported in some Quality Assurance Programs are necessary to begin setting
QC limits. The group SD is not a necessary component.

From these studies I have concluded the following:


1) That to find a working mean for a new lot of control 5 – 8 points are all that are needed.
2) The working mean can be updated as more data are accumulated. Keeping in mind of course that
one’s own mean will change with recalibration and new lots of reagents. Making any mean a
‘working mean.’
3) Five laboratory means from a group are all that are needed for a comparison of one laboratory to
another in an interlaboratory comparison.
4) To establish an SD for a new lot of control, using cumulative data from the previous lot of control and
the formula
SD (new) = CV (cumulative) * Mean (new from n = 8)

provides as good as and often a better estimation of the new SD than 20 or 30 data points.

Rev. 3/20/2005 8
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

Appendix A.

ID Data Mean Abs % diff SDI Data Mean Abs % diff SDI Data Mean Abs % diff SDI
n = 30 n = 30 n = 30
1 30.08 29.70 28.12
2 29.79 28.72 27.34
3 31.45 30.24 27.09
4 29.51 31.28 27.93
5 30.69 30.23 1.26 0.32 31.20 30.23 1.26 0.32 31.95 28.49 4.44 -0.54
8 29.26 30.06 0.68 0.17 29.77 30.06 0.68 0.17 33.71 30.00 0.66 0.08
9 29.75 30.17 1.07 0.27 31.10 30.17 1.07 0.27 26.73 29.64 0.57 -0.07
10 29.84 30.05 0.65 0.16 28.91 30.05 0.65 0.16 31.22 29.80 0.04 0.00
15 29.68 29.63 0.74 -0.19 29.23 29.63 0.74 -0.19 31.62 29.77 0.14 -0.02
20 31.04 29.56 0.99 -0.25 29.63 29.56 0.99 -0.25 30.36 29.68 0.45 -0.05
25 30.15 29.66 0.64 -0.16 29.81 29.66 0.64 -0.16 29.83 30.04 0.78 0.09
30 29.04 29.85 0.00 0.00 29.35 29.85 0.00 0.00 25.35 29.81 0.00 0.00

mean 29.99 29.85 29.81


SD 0.71 1.19 2.45
CV 2.37 3.98 8.23
min 28.84 27.82 25.35
max 31.88 32.38 34.08

1 59.84 90.50 126.93


2 68.09 89.49 125.19
3 59.16 83.36 119.63
4 57.70 89.78 121.47
5 58.49 60.65 0.38 0.09 90.72 88.77 1.32 -0.31 125.65 123.77 3.24 0.82
8 57.73 59.99 0.73 -0.16 90.85 90.23 0.29 0.07 116.79 122.03 1.79 0.45
9 61.43 60.15 0.46 -0.10 91.06 90.32 0.40 0.09 123.61 122.20 1.94 0.49
10 63.42 60.47 0.08 0.02 91.26 90.41 0.50 0.12 114.49 121.43 1.29 0.33
15 59.65 60.55 0.20 0.05 92.47 90.53 0.63 0.15 120.47 120.99 0.93 0.23
20 60.78 60.67 0.41 0.09 89.11 89.97 0.00 0.00 117.43 119.94 0.05 0.01
25 61.77 60.72 0.49 0.11 86.53 90.20 0.27 0.06 111.09 120.07 0.15 0.04
30 59.46 60.42 0.00 0.00 96.03 89.96 0.00 0.00 118.37 119.88 0.00 0.00

mean 60.42 89.96 119.88


SD 2.67 3.83 4.75
CV 4.42 4.26 3.96
min 55.49 81.19 111.09
max 68.09 96.88 128.41

1 1.29 1.24 1.21


2 1.27 1.27 1.41
3 1.15 1.16 1.29
4 1.21 1.01 1.37
5 1.30 1.24 3.70 0.87 1.04 1.14 2.73 -0.34 1.00 1.26 4.64 0.42
8 1.26 1.23 2.35 0.55 1.23 1.17 0.61 -0.08 1.35 1.22 1.42 0.13
9 1.18 1.22 1.90 0.45 1.13 1.16 0.94 -0.12 1.00 1.19 0.59 -0.05
10 1.12 1.21 1.09 0.26 1.18 1.16 0.81 -0.10 0.99 1.17 2.32 -0.21
15 1.22 1.20 0.18 0.04 1.29 1.19 1.57 0.19 1.34 1.18 1.65 -0.15
20 1.16 1.20 0.09 -0.02 1.18 1.18 0.87 0.11 1.08 1.18 1.30 -0.12
25 1.22 1.20 0.27 -0.06 1.10 1.18 0.50 0.06 1.27 1.19 0.64 -0.06
30 1.23 1.20 0.00 0.00 1.12 1.17 0.00 0.00 1.32 1.20 0.00 0.00

mean 1.20 1.17 1.20


SD 0.05 0.09 0.13
CV 4.26 8.07 11.16
min 1.12 0.99 0.98
max 1.30 1.35 1.41

Rev. 3/20/2005 9
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

Appendix B.
1. Generation of random numbers using Microsoft Excel.
a. Open Microsoft Excel
b. From the Menu Bar select Tools.
c. From the Tools menu select Data Analysis. Note: The Data Analysis package in Excel is not
automatically installed when the Excel program is first loaded. When selecting the Data Analysis
pack you maybe asked for the Microsoft Office CD to complete the installation. Follow the on-
screen instructions to complete this task.
d. In Data Analysis select Random Number
Generator from the Menu that appears.
e. In Random Number Generator there are several cells
where data will be entered (see image below):
1. Under Number of Variables - enter 1.
2. Under Number of Random Numbers - select any
number you want (e.g. 30)
3. Under Distribution - select Normal
4. Under Mean and SD - enter the numbers of the
example mean/SD (e.g. 100 and 3)
5. Under Output put the column and row in which
you want the first Data Point (random number to
appear).

Random Number Generation in Microsoft Excel

This can be repeated numerous times to build a population to expand on the ideas included in this
paper.

2. Formulas you will want to include.


In the first column to the right of your column of random numbers you will want the cumulative
mean. The formula for that beginning with your 4th row and assuming you began you data set in
Column B Row 1
Is = Average (C$1.C4). This will place the average of the first 4 columns in Cell B4. Then you can
copy the formula down column C until you reach row 30. (If you are curious about how the SD
changes down that column use the formula = Stdev (C$1.C4) in Column D. Again copy that down
the page.
The Minimum of the column is found using the formula =Min(B1.B30). The Maximum of the column
is found using the formula =Max(B1.B30).

You may wish to add the absolute % difference (from the mean of n =30) in the column to the right
of your cumulative SD (or cumulative mean if you did not add the column for SD) To do this use the
formula, beginning in cell E4, =Abs(100*D4-C$30)/C30. Again copy the formula down the column to
point number 30. Another interesting statistic is the SDI which measures the difference between the
mean of say n = 8 and n = 30. You may have seen the SDI on your interlaboratory report. There, too, it
measures the difference between two means -ñ in that case between your laboratory’s mean and the
group. Use the formula

=(C4-C$30)/STDEV(C1.C30).

Rev. 3/20/2005 10
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

Appendix C.

The following illustration shows how to best establish an SD using cumulative data over several months.

Coag Conc SD for 1 SD > 1 F Chem Conc SD SD F


month month
Fib - 1 210 5.1 5.9 1.3 Alb - 1 4 0.05 0.07 2.0
Fib - 2 100 5.1 6.5 1.6 Alb - 2 2.5 0.035 0.057 2.7
APTT - 1 29 0.7 1.5 4.6
APTT - 2 60 1 5 25.0 ALP - 1 82 5.4 6 1.2
PT - 1 12.8 0.16 0.3 3.5 ALP - 2 320 11 16 2.1
PT - 2 40 0.85 0.95 1.2
ALT -1 39 1.3 2 2.4
ALT -2 85 1.4 4.7 11.3

immunoass TBILI -1 0.8 0.01 0.06 36.0


DIG -1 0.5 0.02 0.05 6.3 TBILI -2 4.4 0.04 0.16 16.0
DIG -2 1.6 0.06 0.09 2.3
2.5 0.05 0.015 0.1 Ca - 1 8.2 0.14 0.16 1.3
Ca - 2 12.2 0.22 0.2 0.8
Hcg -1 6 0.6 0.9 2.3
hCG -3 467 12 23 3.7 CHL - 1 97 1.1 1.23 1.3
CHL - 2 88 1.02 1.06 1.1
LI - 1 0.44 0.04 0.09 5.1
LI -2 1.3 0.05 0.14 7.8 CHO -1 283 4.2 4.7 1.3
LI -3 1.9 0.06 0.1 2.8 CHO -2 132 2.4 2.8 1.4

PSA -1 0.4 0.06 0.05 0.7 CK -1 117 3.3 5.4 2.7


PSA -3 18 1 1.3 1.7 CK -2 359 10.5 19.1 3.3

T4 -1 0.4 0.04 0.08 4.0 CO2 -1 35 2 2 1.0


T4 -2 3.2 0.3 0.2 0.4 CO2 -2 16 1 1 1.0

THEO -1 5 0.16 0.24 2.3 GGT -1 45 1.3 1.5 1.3


THEO -2 15 0.5 0.6 1.4 GGT -2 198 3.3 9.3 7.9
THEO -3 31 1.3 1.6 1.5
LD -1 118 3.1 4.1 1.7
LD -2 356 7.3 10.6 2.1

LIP -1 148 8 14.1 3.1


LIP-2 335 11.9 16.1 1.8

NA -1 148 0.9 1.4 2.4


NA -2 129 0.7 1.9 7.4

Rev. 3/20/2005 11
Mean & Peer Group Data A Discussion of Three Common Questions in Quality Control

Appendix D.
Beginning with the t-test statistic
t = (mean –µ)/SD of the mean

replacing our data in the equation we have


t = (mean 8 - mean30)/ SD of the mean

then substituting √n/ SD for SD of the mean we have


t = (mean 8 - mean30)* √ n / SD8

If we use the usual value of 2.0 for t and replace (mean 8 - mean30)/ SD8 with SDI and solve for SDI, we
find that an SDI of 0.70 or greater indicates that the means were different. In referring to Figure 2 and
table I we note that only in rare cases does this occur – suggesting again that 8 replicates is sufficient.

Reference:
1. Herring K, Matthews H, Pulwer E. Paper presented at the American Association for Respiratory
Therapy 1978.
2. Surkin, K. and Hershberger, D. Clinical Chemistry 43 ( 6):, S140-141, 1997.
3. P. Meier and R. Zünd, Statistical Methods in Analytical Chemistry, Wiley Interscience, NY, 2000.

David is at [email protected]

Rev. 3/20/2005 12

You might also like