Bootstrap Resampling Methods: Something For Nothing?: Gary L. Grunkemeier,, and Yingxing Wu
Bootstrap Resampling Methods: Something For Nothing?: Gary L. Grunkemeier,, and Yingxing Wu
Simulation
Typically, we collect a data sample and compute a statistic of interest, say the mean value of the individual data
points. We realize there is variability associated with this
estimate, such that if we, or others, repeated our experiment or data collection maneuvers, the estimate would
be different. We need a measure of the variability of this
statistic, say the standard deviation (SD) or a CI. The
conventional method is to assume we know the distribution of the statistic of interest, often the familiar normal
(bell-shaped) distribution. Even if the underlying popuAddress reprint requests to Dr Grunkemeier, Providence St. Vincent
Hospital and Medical Center, 9205 SW Barnes Rd, Suite 33, Portland,
OR 97225; e-mail: [email protected].
Resampling
This exercise confirms that fact that simulations can
recreate critical values for any known distribution. But
there are other situations in which we do not know or do
not want to assume the distribution. In such cases we can
use a simulation technique, similar to that demonstrated
in Figure 1, called bootstrap resampling. Instead of generating observations from a known theoretical distribution as before, we generate observations from the distribution of the sample itselfthe empirical distribution.
Each simulation results in a new sample, typically of the
same size as the original, by randomly selecting (with
replacement) individuals from the original sample. With
replacement means that at each step in the selection
process, every individual from the original sample is
again eligible to get selected, whether or not he has
already been selected. Thus, in each bootstrap sample,
some of the original individuals may not be represented
and others may be represented more than once.
Ann Thorac Surg 2004;77:1142 4 0003-4975/04/$30.00
doi:10.1016/j.athoracsur.2004.01.005
1143
0
1
2
3
4
5
6
7
8
9
10
1114
NA
Total
Percent
Patients
34
75
88
73
91
110
94
70
51
47
31
21
128
913
Observed
Deaths
0
0
1
1
4
11
19
19
8
12
6
7
17
105
11.50%
NRMI
Mortality
(%)
0.17
0.67
1.70
2.88
5.32
9.43
14.12
18.86
19.93
24.30
26.22
33.42
14.42
Expected
Deaths
0.06
0.51
1.49
2.10
4.85
10.37
13.27
13.20
10.16
11.42
8.13
7.02
18.45
101.03
11.04%
NA not available;
NRMI National Registry of Myocardial Infarction;
STEM Myocardial infarction with ST elevation;
TIMI
Thrombolysis in myocardial infarction.
Conventional Method
The usual way to compute a CI uses a mathematical
expression derived from assumptions about the underlying statistical distribution. For example, assume the
statistic has a normal distribution, compute the SD of the
statistic, and then use the fact that 95% of the values of a
normal distribution are within 1.96 SD of the mean. For
the O/E ratio in our example, this method [5] gives a 95%
CI of 0.86 to 1.22. This method has two shortcomings: the
lower limit can become negative, and the CI is symmetric
about the point estimate. But an O/E ratio cannot be
negative, and its distribution is not symmetric. The smallest it can be is 0 (only if no deaths were observed),
although it can range to an arbitrarily high value (when
many more deaths than expected are observed). Thus, an
alternative with better theoretical properties is based on
a normal approximation to the logarithm of the O/E ratio
[6], which produces an asymmetric, always-positive interval: 0.88 to 1.23 for the present example.
Bootstrap Method
To produce a bootstrap CI, the number of samples (B) to
be generated from the original data set is specified, and
for each sample the statistic of interest is computed. The
range of values of the statistic is determined by the
1144
Comment
The birth of probability theory is usually taken to be 1654,
when a French gambler sought the help of mathematicians to determine the probabilities of a dice game [8]. He
wanted to learn, using equations derived from the properties of the statistical distribution, what the long-term
results would be of his random throwing of the dice. It is
ironic that we have now come full circle, using random
simulation methods to derive properties of the statistical
distribution. In reference to their close association with
gambling, techniques using this simulation approach are
called Monte Carlo methods. Two widely referenced
books that provide a thorough treatment of bootstrapping methods are suggested for further reading [9, 10].
Comparison
In our example, the bootstrap intervals, after only 1,000
resamplings, produced CIs similar to the normal approximation method. The advantage is that no distributional
assumptions are needed, particularly important in complex statistics in which such assumptions may be difficult.
Interestingly, of the two conventional intervals, the more
usual method (indicated by dashed horizontal lines in Fig
2), not the one involving a logarithmic transformation
and possessing theoretical advantages, corresponded
better to the bootstrap intervals.
Patient data were supplied by the following hospitals: Providence Anchorage Medical Center (Anchorage, AK), Providence
Everett Medical Center (Everett, WA), Providence Portland
Medical Center (Portland, OR), Providence St. Vincent Medical
Center (Portland, OR), Providence Milwaukie Hospital (Milwaukie, OR), Providence Newberg Hospital (Newberg, OR),
Providence Seaside Hospital (Seaside, OR), Providence Medford
Medical Center (Medford, OR), and Little Company of Mary
Hospital (Torrance, CA). Data management for NRMI is provided by STATPROBE, Inc (Ann Arbor, MI), who kindly provided the stratified data shown in the table.
References
1. Brunelli A, Monteverde M, Borri A, Salati M, Marasco RD,
Fianchini A. Predictors of prolonged air leak after pulmonary lobectomy. Ann Thorac Surg 2004;77:120510.
2. Efron B. Bootstrap methods: another look at the jackknife.
Ann Statist 1979;7:126.
3. Morrow DA, Antman EM, Charlesworth A, et al. TIMI risk
score for ST-elevation myocardial infarction. A convenient,
bedside, clinical score for risk assessment at presentation: an
intravenous nPA for treatment of infarcting myocardium
early II trial substudy. Circulation 2000;102:20317.
4. Morrow DA, Antman EM, Parsons L, et al. Application of the
TIMI risk score for ST-elevation MI in the National Registry
of Myocardial Infarction 3. JAMA 2001;286:1356 9.
5. Shwartz M, Ash AS, Iezzoni LI. Comparing outcomes across
providers. In: Iezzoni LI, ed. Risk adjustment for measuring
healthcare outcomes. Chicago: Health Administration Press,
1997:471516.
6. Hosmer DW, Lemeshow S. Confidence interval estimates of
an index of quality performance based on logistic regression
models. Stat Med 1995;14:216172.
7. Carpenter J, Bithell J, Swift MB. Bootstrap confidence intervals: when, which, what? A practical guide for medical
statisticians. Stat Med 2000;19:114164.
8. Weaver W. Lady luck, the theory of probability. New York:
Anchor Books, Doubleday and Co, 1963.
9. Efron B, Tibshirani R. An introduction to the bootstrap. New
York: Chapman and Hall, 1993.
10. Davison AC, Hinkley DV. Bootstrap methods and their
application. New York: Cambridge University Press, 1997.