Report Geochemical
Report Geochemical
Denver, Colorado
1999
U.S. DEPARTMENT OF THE INTERIOR
BRUCE BABBITT, Secretary
The use of firm, trade, and brand names in this report is for identification purposes only and does
not constitute endorsement by the U.S. Geological Survey.
For additional information write to: Copies of this report can be purchased
from:
CONTENTS III
5.6 Additional Trend Considerations........................................................................................................................ 35
5.7 Outlier Detection................................................................................................................................................ 35
5.8 Cross Validation for Model Verification............................................................................................................. 36
5.8.1 Calibration Statistics ............................................................................................................................. 36
5.8.2 Variogram-ParameterAdjustments....................................................................................................... 37
6.0 Practical Aspects of Geostatistics in Hazardous-, Toxic-, and Radioactive-Waste-Site Investigations....................... 39
6.1 Ground-Water-Level Examples.......................................................................................................................... 40
6.2 Bedrock-Elevation Examples ............................................................................................................................. 43
6.3 Ground-Water-Quality Examples....................................................................................................................... 54
7.0 Review of Kriging Applications................................................................................................................................... 60
7.1 Applicability of Kriging..................................................................................................................................... 60
7.2 Important Elements of Kriging Applications..................................................................................................... 63
7.3 Errors in Measured Data..................................................................................................................................... 64
8.0 Other Spatial Prediction Techniques............................................................................................................................ 64
8.1 Global Measure of Central Tendency (Simple Averaging)................................................................................ 65
8.2 Simple Moving Average..................................................................................................................................... 65
8.3 Inverse-Distance Squared Weighted Average..................................................................................................... 65
8.4 Triangulation ...................................................................................................................................................... 66
8.5 Splines ................................................................................................................................................................ 67
8.6 Trend-Surface Analysis...................................................................................................................................... 67
8.7 Simulation........................................................ 67
9.0 Summary .........................................................................................................................^ 69
10.0 References......................................................._^^ 70
FIGURES
1-3. Diagrams showing:
1. Covariance function properties A, hypothetical study area; B, stationary covariance functions;
and C, isotropic covariance function.................................................................................................................. 8
2. Variogram and features...................................................................................................................................... 13
3. Theoretical variograms indicating A, exponential; B, spherical; C, Gaussian; and D, linear models................ 14
4. Map showing measured water levels from Saratoga data.......................................................................................... 26
5-13. Graphs showing:
5. Squared differences of values for all possible pairs of points for Saratoga data............................................... 27
6. Initial sample variogram points for Saratoga data............................................................................................. 28
7. Sample variogram points for ordinary least-squares trend residuals for Saratoga data..................................... 30
8. Sample variogram points for ordinary least-squares trend residuals for Saratoga data binned to
6.5 kilometers..................................................................................................................................................... 31
9. Initial directional sample variogram points for raw Saratoga data A, north-south and B, east-west.............. 33
10. Sample variogram points and theoretical spherical fit for iterated Saratoga residuals...................................... 38
11. Sample variogram points and theoretical Gaussian fit for iterated Saratoga residuals...................................... 39
12. Cross-validation probability plot for Saratoga data........................................................................................... 40
13. Scatterplot of measured versus kriging estimates from cross validation of Saratoga data................................ 41
14. Maps showing location of measured data for ground-water-level examples A, original data; B, original
data without dropped sites; and C, original data with added sites ............................................................................. 42
15. Graphs showing variogram and variogram cross-validation plots for residuals in water-level examples
A, theoretical variogram; B, cross-validation scatterplot; and C, cross-validation probability plot........................... 44
16. Maps showing kriging results for ground-water-level examples A, kriging estimates for original data;
B, kriging standard deviations for original data; C, ratio (original data to original with dropped sites)
of kriging standard deviations; and D, kriging standard deviations for original data with added sites..................... 46
17. Maps showing location of measured data for bedrock-elevation examples A, original data and
B, restricted data......................................................................................................................................................... 49
18. Graphs showing variogram and variogram cross-validation plots for bedrock-elevation examples
A, theoretical variogram; B, cross-validation scatterplot; and C, cross-validation probability plot........................... 50
IV CONTENTS
19. Maps showing kriging results for bedrock-elevation examples A, kriging estimates; 5, kriging standard
deviations; C, block kriging results; and D, block kriging standard deviations ........................................................ 52
20. Maps showing location of measured data for ground-water-quality examples ......................................................... 54
21. Graphs showing directional variograms and variogram cross-validation plots for ground-water-
quality examples A, theoretical major-direction variogram; B, theoretical minor-direction variogram;
C, cross-validation scatterplot; and D, cross-validation probability plot................................................................... 56
22. Maps showing kriging results for ground-water-quality examples A, kriging estimates back-transformed;
B, kriging estimates in log space; C, kriging standard deviations in log space; and D, 95-percent confidence
level for kriging estimates back-transformed............................................................................................................. 58
23. Graphs showing directional variogram plots for indicator kriging ground-water-quality example
A, theoretical major-direction variogram and B, theoretical minor-direction variogram........................................... 61
24. Map showing indicator kriging results for ground-water-quality example................................................................ 62
25. Diagram showing Voronoi polygons.......................................................................................................................... 66
TABLES
1. Geostatistical software characteristics.................................................................................................................... 24
2. Univariate statistics for example data sets.............................................................................................................. 26
3. Variogram characteristics and cross-validation statistics........................................................................................ 45
4. Univariate statistics for gridded kriging estimates in example applications........................................................... 48
Multiply By To obtain
Length
kilometer 0.6214 mile
meter 1.094 yard
CONTENTS
NOTATION
N(.) Number of squared differences in variogram bin. 6K (x) Kriging standard deviation at location x.
2
P Probability. GK &) Kriging variance at location x.
2
Sn Sample variance of n measurements. \i(x) Spatial population mean of Z at location x.
NOTATION VII
Overview and Technical and Practical Aspects
for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
ByC.R. Bossong, M.R. Karlinger, B.M. Troutman, and A.V. Vecchia
Abstract 1
estimates, at unmeasured or unsampled locations; report also includes a brief literature and software
additional definition of kriging is provided throughout review, a presentation of kriging applications, a
this report. Kriging is well suited for making point discussion of the review of kriging applications, and a
and block estimates; however, much of the advantage discussion of more advanced geostatistical techniques,
of using geostatistical techniques, such as kriging, such as conditional simulation.
is not just in the point and block estimates but in The scope of this report is limited to discussions
the information provided concerning the uncertainty and examples of two-dimensional point and block
associated with the estimates. The uncertainty infor- estimations using a geostatistical method known
mation is usually quantified by a kriging variance that as kriging. The technical aspects of geostatistics are
is associated with a kriging estimate. The uncertainty presented through discussion of the assumptions
also is sometimes referred to as the kriging standard about, and the mechanics of, several types of kriging,
deviation, which is simply the square root of the including ordinary kriging, which is applicable when
kriging variance. the mean for the variable of interest is constant over
the region of interest, and universal kriging, which is
Original geostatistical work involved making
applicable when the mean for the variable of interest
estimates for the areal extent and concentrations of
changes gradually over the region. A specialized form
economic mineral deposits in relation to mining.
of kriging known as indicator kriging and the use of
Today (1998), geostatistical techniques continue
information concerning uncertainty associated with
to have a function in mining. However, a well-
kriging estimates also are discussed. The fundamental
developed method that is capable of interpolating a
concepts of geostatistical kriging theory are discussed,
given set of measured values at discrete locations into but various references are provided for more detailed
estimates for new locations or developing an indi- information.
vidual estimate for an area including many locations,
or both, has attracted users from many disciplines, and
there is a trend toward incorporating geostatistics as 1.2 Organization
standard curriculum for most geoscience educational
programs. The use of geostatistical techniques as part This report is divided into eight sections
of HTRW-site investigations is becoming common described here.
because of the almost routine need for data interpola- Section 1.0 presents introductory material and
tion as part of these investigations. includes an overview of the use of geostatistics
Once investigators have established that the at HTRW-sites.
data are adequate as to quality and quantity, geostatis- Section 2.0 presents an overview of some important
tics can be a powerful analytical tool that results in technical aspects of geostatistics with a minimum
quantitative characterization of areas of special of theory and equations.
interest within the study area or the entire study area.
These characterizations could be used to determine Section 3.0 discusses the assumptions and theory
spatial variation; for example, where concentrations behind kriging, including equations and concepts
of contaminants in soils are relatively high or low, that are useful for obtaining a better under-
are less than or greater than a specified concentration, standing of the technical aspects, or mathematics,
or even have a high or low probability of exceeding of kriging interpolation. Many of the concepts
developed in section 3.0 are discussed in general
a certain concentration.
terms in section 2.0, so those readers desiring
only an overview of kriging concepts may wish to
read only section 2.0 and bypass section 3.0.
1.1 Purpose and Scope
Section 4.0 reviews texts that contain much more
The purpose of this report is to address the detailed information regarding kriging theory
use of geostatistics in HTRW-site investigations by than material included in section 3.0. Section 4.0
presenting an overview of geostatistical methods and also provides a brief generic discussion of kriging
discussing their technical and practical aspects. The software.
Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
Section 5.0 discusses detailed step-by-step vario- or interdisciplinary branches, or both; the geologist,
gram construction and demonstrates some pitfalls for example, will occasionally benefit from knowl-
and solutions to this crucial process. Section 5.0 edge of geophysics. Interdisciplinary input also can
also discusses techniques that investigators may be very helpful, especially in geostatistics, where
use to evaluate their variograms. earth-science disciplines rely on assistance from
Section 6.0 discusses practical aspects of geostatis- statisticians.
tics by presenting several examples of kriging
applications using data from the HTRW field. The 1.4 Acknowledgments
examples illustrate a few of the many different
ways kriging can be used in HTRW-site investiga- Several individuals have provided valuable
tions and are not presented with the same level of technical review of this report. Dave Becker,
detail used in section 5.0. USAGE, HTRW Center for Expertise, not only
Section 7.0 provides additional detail on some provided a consistent and thorough technical review,
crucial aspects of kriging applications and but also coordinated additional technical reviews.
includes considerations that may be helpful to Additional technical review from the USAGE were
determine if kriging is feasible for the intended provided by Terry Walker and Tom Georgian, both
use. from the HTRW Center for Expertise; Brad Call,
Earl Edris, Dave Kachek, Doug Mullendore, and
Section 8.0 briefly discusses other methods for Kerry Walker, all from USAGE field offices; and
spatial modeling and also includes discussion of Tommiann McDaniel from USAGE headquarters.
advanced stochastic methods, such as simulation. Additional technical reviews from outside the USAGE
were provided by Evan Englund, U.S. Environmental
Protection Agency National Exposure Research
1.3 Overview of the Use of
Laboratory; Ed Gilroy, U.S. Geological Survey
Geostatistics in Hazardous-, (USGS); Mohan Srivastava, Froidevaux, Srivastiva,
Toxic-, and Radioactive- and Schofield; and Wayne Woldt, University of
Waste-Site Investigations Nebraska.
Investigations of HTRW sites involve complex
administrative, scientific, and engineering functions
and are truly interdisciplinary. For instance, adminis- 2.0 OVERVIEW OF SOME TECHNICAL
trative functions that are associated with fiscal, ASPECTS OF GEOSTATISTICS
managerial, or regulatory input can guide or constrain This section provides an overview of some of
scientific or engineering work. Similarly, scientific the procedures and concepts discussed in detail in this
or engineering findings may define the scope of the report. Some of the technical ideas and terminology
administrative effort. are introduced in very general terms to familiarize the
Scientists and engineers involved in HTRW- reader with geostatistics.
site investigations have found that they have an
implicit need for many disciplines to fulfill the
objectives of each particular investigation. Frequently, 2.1 General Considerations in
an HTRW-site investigation will benefit from special- Spatial Prediction
ized information available from earth-science
disciplines such as geology, hydrogeology, and The principal consideration in this report is
geochemistry, among others. Some HTRW-site inves- spatial prediction or modeling values of a spatial
tigations are large enough to use several individuals process; in particular, to make best use of measure-
from each of these disciplines, as well as many others, ments of a variable (such as pollutant concentration) at
for the duration of multi-year investigations. Most sampled locations so as to make inferences (or predic-
disciplines associated with HTRW-site investigations tions) about that variable at unsampled locations or for
will benefit from knowledge or input from specialized the region as a whole.
Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
techniques that are usually applied nonstochastically 2.2.2 Directional Variogram and Anisotropy
are discussed briefly. Most of these techniques are
Spatial correlation often depends not only on
designed to incorporate spatial continuity, but the
the distance between points, but also on the direction
way it is incorporated may be subjective. Use of
along which the points plot. For example, measure-
kriging provides an objective means of incorporating
ments at pairs of points that are 100 meters apart
partial correlation and makes the background assump-
and are oriented north-south may have a different
tions explicit.
correlation than measurements at points that are the
same distance apart, but that are oriented east-west.
Correlations dependent on direction indicate anisot-
2.2 Important Geostatistical Concepts ropy, and when anisotropy is present, a directional
variogram needs to be used for the geostatistical
This section presents some of the key
analysis.
concepts in geostatistics that are discussed in
detail in section 3.0. The concepts are presented
in about the same order as they are discussed in 2.2.3 Kriging and Kriging Variance
section 3.0. Kriging yields optimal spatial estimates at
points where no measurements exist using measure-
2.2.1 Variograms ments at points where there are data. As discussed
in section 2.1, placing an analysis in a stochastic
A central concept in geostatistics is the use of framework enables precision in defining optimality.
spatial correlation to improve spatial predictions or In kriging, a restriction that the predicted value at
interpolations. The variogram is the principal tool used any point is a linear combination of the measured
to characterize the degree of spatial correlation present values is imposed first; that is, the kriging estimate
in the data and is fundamental to kriging. The correla- is a linear predictor. Given this restriction, the values
tion between measurements at two points is usually of the coefficients in this linear function are chosen
assumed to depend on the separation between the two to ensure the predictor to be optimal.
points. This dependence can be examined by squaring The first optimality criterion imposed is that the
the difference between the measured values at each estimate be unbiased, or that on average, the difference
pair of locations and then categorizing the squared between the predicted value and the actual value is
differences according to the separating distance zero. The second optimality criterion is that the vari-
between the paired locations. For small separations, ance of the predictions be minimized. The variance in
or lags, the squared differences are usually small and the predictions is a statistical error measure defined to
increase as the lag increases. A plot of the squared be the average squared difference between predicted
differences per sample pair as a function of lag is and actual values. Because the kriging estimate mini-
referred to as the sample variogram. mizes this variance, the estimate, or prediction, is
The general behavior of the points in the known as the best (minimum variance) unbiased linear
sample variogram is affected by the spatial correlation predictor. This minimization is performed algebra-
between sample sites and can provide investigators ically and results in equations known as the kriging
with qualitative information about the spatial process, equations, which are explicit representations of the
but to use this information rigorously as a basis for optimal coefficients (weights) in terms of the vario-
interpolation, a function, that has specific properties, gram. These equations are presented in section 3.0.
needs to be fit to the sample variogram points. The An expression for the kriging variance also is
fitting passes a smooth curve through the scattered discussed in section 3.0. This variance depends on
points. The curve, which can be represented by a the geometry of the data sites, with the variance
mathematical expression or function, is called a at locations near measured points tending to be small.
model. There are several models introduced in A variance then can be associated with any spatial
section 3.0 that have characteristic features that prediction, which gives an indication of the uncer-
are commonly used in geostatistics. The variogram tainty about that predicted value. The fact that kriging
model is used to determine kriging weights for use provides this measure of uncertainty is one of its prin-
in interpolation. cipal advantages over many other techniques.
Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
usual manner; first, a variogram is obtained, and then obtain the best weights for spatial prediction is
the kriging equations yield the optimal linear predictor discussed, and the computation of the average-, or
and the kriging variance for the indicators. mean-, squared prediction error for these predictions
Although the indicator kriging analysis uses also is discussed. In section 3.4, co-kriging, which is
only O's and 1 's, the interpolated estimates are not prediction of one variable based on measurements of
restricted to these two values. In most analyses, the that variable and other variables, is discussed. Finally,
estimates are between 0 and 1, which is interpreted in section 3.5, the application of kriging to determine
to be the probability that the actual value is less than not just optimal spatial predictions, but also probabili-
or equal to the threshold c. Use of this analysis for ties associated with various events, such as extreme
a number of different threshold values can provide events that may be of importance in risk-based anal-
information about the probability distribution of yses, is discussed.
contaminant values at a location, which may be used
to obtain prediction intervals. Such prediction inter-
vals may even be more valuable than having only 3.1 Regionalized Random Variables
the optimal predictor and variance provided by the
usual kriging analysis, particularly if the behavior Suppose the extent of ground-water contamina-
of extremes may be of interest. An advantage of using tion by a particular pollutant in a given study area
indicator kriging to obtain prediction intervals is that is being determined. To simplify the presentation,
there is no need to assume a normal distribution for all data are assumed to be distributed over a two-
the data. dimensional region. In three-dimensional ground-
water flow systems, one could study the depth-
averaged concentration of a pollutant or the concentra-
3.0 TECHNICAL ASPECTS OF tion of the pollutant in a particular horizontal stratum
GEOSTATISTICS of the flow system. Let a vector x = (u, v) denote an
arbitrary spatial location in the study area. Unless
This section provides the technical aspects otherwise stated, u is assumed to be the east-west
or the necessary theoretical background for under- coordinate and v the north-south coordinate (fig. I A).
standing kriging applications. Emphasis is placed Use z(x) to denote a measurement at location x, such
on presentation of the basic ideas; long formulae or as the concentration of a pollutant. The ultimate goal
derivations are minimized. Statistical terms that are is to determine z(x) for all locations in the study
commonly used in geostatistical applications are in area. However, without explicit knowledge of the
bold text and are briefly defined as they are intro- ground-water flow and transport field, this goal
duced; notation used in this report also is listed in cannot be achieved. Therefore, suppose that the
the "Notation" section. More thorough discussions goal is to estimate the values of z(x) with a given
of these fundamental concepts are indicated by refer- error tolerance. For some studies, a small estimation
ences cited in section 4.0. A knowledge of engineering error for some parts of the study area (for instance,
statistics at the level of Devore (1987) and Ross (1987) near a domestic water supply) may need to be
would help in understanding some parts of this obtained, while allowing larger estimation errors in
section. Readers who have limited statistical experi- other parts of the study area. The theory of regional-
ence may wish to briefly scan this section and refer ized random variables is designed to accomplish
back to it after reading the remaining sections. these goals.
In section 3.1, regionalized random variables In the regionalized random-variable theory,
are discussed. Regionalized random variables consti- the true measurement, z(x), is assumed to be the value
tute the random process that is sampled to obtain of a random variable, Z(x). A random variable Z(x)
the observed data that are available for analysis. is associated with a true measurement z(x) to charac-
Basic ideas related to probability distributions, aver- terize the degree of uncertainty in the quantity of
ages, variances, and correlation are introduced. In interest at point x. If there is no measurement obtained
section 3.2, the variogram, which is the fundamental at x, then the values acquired by Z(x) represent poten-
tool used in geostatistics to analyze spatial correlation, tial measurements at x\ that is, Z(x) represents possible
is introduced. In section 3.3, the use of kriging to values that might be expected if a measurement were
Overview and Technical and Practical Aspects for Use of Geostatistics 1n Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
joint probability distribution is defined to be the 3.1.1 Example 3.1.1
probability P[Z(x{) < q, Z(*2) ^ c2] for any constants
An experiment consists of injecting a conserva-
GI and c2. This latter probability means the probability
that both Z(xl) < cl and Z(*2) < c2. If the variables tive tracer at a particular well in a steady-state ground-
Z(x_i) and Z(*2) are statistically independent of one water flow system and measuring the concentration,
another, then the joint probability distribution can be Zi(x), of the tracer in a neighboring well 24 hours
obtained as the product of the individual probability later. The tracer then is allowed to flush from the
distributions, system, and the experiment is repeated a second time
to obtain another concentration measurement, Z2(x),
at the same location. If this process is repeated n times,
P[Z(x l )<c l ,Z(x2 )<c2 ] n concentration measurements Zi(x), Z2(x), ..., Zn(x)
(3-1)
= P[Z(x l )<c l ] P[ZU2 )<c2 ]. would be obtained, all at locations x. The average
concentration at location x is
However, in most applications, Z(x{) and Z(*2) are
not statistically independent, and their joint distribu-
tion cannot be obtained from the individual distribu- Z2 (x) n (*)], (3-3)
tions. When this joint description is applied to more
than two locations, specification of the full spatial
which would change depending on n and on the actual
distribution of Z would need the joint distribution of
values obtained for ZjQc), Z2(x), ..., Zn(x). However,
Z(x{), ..., Z(xn) for any set of n spatial locations and for
any n; however, except in very special cases, working in the limit as n increases, Zn(x) becomes closer and
with the full set of distribution functions of Z(x) is not closer to the true mean, or expected, concentration
feasible and is not done.
To simplify the problem even further, various
parameters of the distributions are used rather than
using the entire distributions. The parameter most Zn (x) > as n increases. (3-4)
commonly used to characterize a distribution is the
mean; because the mean in geostatistical applications This theoretical limit is a constant value, or
depends on the spatial variable x, the mean may be population parameter, as opposed to Zn (x),
called the spatial mean, or the drift. In statistics, which is a random variable, or a property of
the mean is referred to as the expectation (E) of the the particular sample that is obtained.
random variable Z(jt), and the symbol |i is used in this In example 3.1.1, no assumptions were
report to denote this expectation. Thus, needed concerning whether the mean changed with
spatial location because all sampling was done at
*) = E[Z(x)} (3-2) one sampling location, x. In most HTRW-site applica-
tions, the mean probably changes, depending on
the sampling location. In addition, usually only one
is used to denote the mean, or expected value, of
the bracketed term, in this case Z(x). Thinking of the measurement is available at any particular location.
expectation as an average can be helpful. In fact, if Therefore, some assumptions regarding the structure
the distribution of Z(x) assigned equal probability to of (!(*) must be made. For example, to assume that
a finite number of values, then the expectation of Z(x) |i(jt) = (i, is constant for all x sometimes is appropriate,
would indeed be the simple average of these numbers. in which case, Z(x) has a stationary mean. For
However, in geostatistics, Z(x) is usually assumed to example, data that have no underlying trend, such
take on any value in a continuous range of possible as hydraulic conductivity in a homogeneous aquifer,
values rather than being limited to a discrete set of might be assumed to have a constant mean. If the
values. Therefore, calculus needs to be used to define mean is constant, estimating it with the sample
the expectation. The following example illustrates the average of n measurements obtained at different
difference between averages and expectations. spatial locations x_i, x2, ..., x^ is reasonable; therefore,
C(/i) = average
This equation gives a measure of dispersion of the (3-10)
Z/(x) values from their sample mean. The sample h-Ah<hfj <h
variance depends on n and on the particular values
measured for Zjfe), Z2(*), ..., Zn(x). However, in where hy is the distance between *,- and Xj and
the limit as n increases, Sn (x) gets closer and closer the average is from all pairs of points so that h^
to a constant value, which is denoted by a^)- Thus, is between h Ah and h + Ah. The quantity Ah is
G (x) is a population parameter, and Sn (x) is a random called the lag tolerance. There are more effective
variable. ways to estimate C(h) besides using equation 3-10;
The mean and the variance both can be calcu- for example, see Isaaks and Srivastava (1989).
lated from the probability distribution of Z(x). Again, However, because the emphasis in this report is
in geostatistics, the relations among regionalized on the variogram (to be defined below) rather
variables at different locations are of interest. From than on the covariance function, this method of
the joint distribution of Z(x{) and Z(x2\ the spatial estimating the covariance function does not need
covariance function, to be used.
10 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
A covariance function is called stationary
if it does not depend on the origin of the coordinate 2 ) = C(h,a),
system; that is, h = J(u,-u
(3-15)
C(x l b) = C(xv x (3-11) a - atan
12 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
theoretical variogram is monotonically increasing, the spherical variogram (parameters: sill, s > 0;
signifying that the farther two measurements are nugget, 0 < g < s; range, r > 0),
apart, the more their residuals tend to differ, on
average, from one another. Several properties common
to many theoretical variograms are shown in figure 2. h>r
If the variogram either reaches or becomes asymptotic
to a constant value as h increases, that value is called sr (3-24)
the sill (fig. 2). The distance (value of K) after which
the variogram remains at or close to the sill is called 0, h=Q
the range. Measurements whose locations are farther
apart than the range have the same, or even no, degree
the Gaussian variogram (parameters: sill, s > 0;
of association and are assumed to be uncorrelated.
nugget, 0 < g < s; range, r > 0),
Often, a variogram has a discontinuity at the origin,
signifying that even measurements obtained very
close together are not identical. Such variation in the
measurements at small scales is called the nugget
(3-25)
effect. The size of the discontinuity is called the
0, h=0\
nugget. Although the nugget effect is sometimes
confused with the measurement error, there is a
subtle difference between these two concepts that the linear variogram (parameters: nugget, g > 0;
is explained in section 3.3. A simple monotonic func- slope, b > 0),
tion is usually selected to approximate the variogram.
Four such functions that are often used in practice are:
The exponential variogram (parameters: sill, s > 0; (3-26)
nugget, 0 < g < s; range, r > 0), /i=0
2//3
Lag (h) Lag (h)
GAUSSIAN LINEAR
Figure 3. Theoretical variograms indicating A, exponential; B, spherical; C, Gaussian; and D, linear models.
Although the variogram is commonly used in From equation 3-28, high values of y(h) (that is, close
a geostatistical analysis, an intuitive understanding to s) signify low values of p(fc). In fact, p(/z) = 0 when-
of geostatistical techniques may be more easily ever y(/0 = j, indicating that measurements whose
obtained by using the covariance function, or equiva- locations are farther apart than the range are uncorre-
lently, the spatial variance and the correlation function. lated. As h decreases, a nugget in y(/i) is reflected in
When Z(x) has a stationary, isotropic covariance a correlation that is less than 1,
function (eq. 3-9), there is a one-to-one correspon-
dence between the variogram and the covariance
function: - £ as h -> 0. (3-29)
s
14 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
3.3 Kriging Var[Z(*0)-Z(*0)]
(3-33)
Given a regionalized random variable Z(x) = {E(2(xQ)-Z(xQ)] 2 }.
that has a known theoretical variogram, how can the
value of Z(x) be predicted at an arbitrary location, The smaller the prediction variance, the closer Z(*0)
based on measurements taken at other locations? is (on average) to the true value Z(XQ). The geostatis-
To answer that question, suppose that Z is measured tical technique of kriging computes the best linear
at n specified locations: Z(x{), ..., Z(x^). For example, unbiased predictor of Z(*Q), which is the linear
Z could represent hydraulic conductivity and the unbiased predictor (eqs. 3-31 and 3-32) that has
locations might correspond to n preexisting wells in the smallest possible prediction variance (eq. 3-33).
an aquifer. Let a new location be given by XQ = (MO»VO) The best linear unbiased predictor depends
and denote the ith measurement location by xt = (uit v/). on the mean of Z(x). For example, if Z(x) has a
Suppose that, based on prior knowledge of the geology constant mean (eq. 3-30) and a pure nugget vario-
in the study area, there are no prevailing trends gram [y(/0 = s for all h > 0], the best linear unbiased
in hydraulic conductivity, so the mean of Z(x) is predictor of Z(XQ) is the average of the measured
assumed to be constant over the entire study data,
area:
(3-34)
*)= H (constant). (3-30)
=i
Suppose the value of Z(XQ) is to be predicted
Because the variogram is the same for all h > 0 and
by using a linear predictor, 2(*o), which is defined as there is no trend in the data, there is no reason to
a weighted linear combination of the measured data, favor any of the measurements over any of the other
measurements. Therefore, the weights are all the same.
Ordinary kriging, which is discussed in section 3.3.1,
(3-31) deals with the constant-mean model (the assumption in
= i eq. 3-30) in which the variogram is not a pure nugget
variogram. The weights of the best linear unbiased
where predictor reflects the information in the variogram and
results in an improved predictor over the sample mean.
is the weight assigned to
In section 3.3.2, universal kriging, which is the exten-
To determine specific values for the weights, some sion of ordinary kriging to a nonconstant mean, is
criteria need to be specified for Z(*Q) to be a good discussed. Universal kriging is a very powerful tool
predictor of Z(XQ). The first criterion is that Z(XQ) that can be used to combine regression models and
needs to be an unbiased predictor of Z(*Q), which spatial prediction into one unifying theory. Other, more
is expressed as specialized types of kriging that are discussed in this
section are block kriging (section 3.3.3), co-kriging
(section 3.4), and indicator kriging (section 3.5.2).
£[Z(*0)-Z(*0)] = 0. (3-32) There also is a prediction technique in geostatis-
tics known as simple kriging, which uses the best
An unbiased predictor neither consistently over- linear unbiased prediction in the case when the
predicts nor underpredicts Z(XQ) because the statistical mean of Z(x) is fixed and known. Simple kriging
expectation of the prediction errors is zero. The second is not discussed in this report because, in most
criterion for a good predictor is that it have small applications, the mean is not known and has to be
prediction variance as defined by estimated.
where
I"J = (3-35b)
p is a fixed proportion.
/= 1
The quantity p is sometimes referred to as a relative
where
nugget.
py = p(/iy) is the correlation between measurements, The ordinary kriging equations 3-35a and
/ and7, h^ is the distance between locations / and 3-35b are given by
7, and
A, is a coefficient resulting from the con- X
- = Pio (3-39a)
strained optimization.
16 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
Effect of correlations: If Z(XQ) is more highly
7 = P20 (3-39b) correlated with Z(x{) than with Z(^2), then wj is
larger than ^2, indicating that the measurement at
the first location has more predictive information
(3-39c) than the measurement at the second location. Also,
correlation in the data always decreases the kriging
These three equations have three unknowns vvj, W2, variance compared to the variance using uncorrelated
and A,; the solution is data.
Effect of data clumping: If ZQcj) and Z(^2)
are highly correlated, as indicated by pj 2 being close
lPio-P20
(3-40a) to 1, then the two measurements contain much of the
same information. Two situations then can occur:
PlO = P20» where the weights are both equal, or
PlO > P20 [PlO < P2ol' where wj is much larger [or
= 1 !PlQ-P20
(3-40b) smaller] than w2. In either case, the kriging variance
/2 2 2 l-p 12
increases to reflect the same information in the two
measurements. The automatic adjustment of the
and
kriging weights and kriging variance to account for
data clumping is an important property of the kriging
predictor.
(3-41)
3.3.1.2 Example 3.3.1.2
The resulting kriging variance is (Nugget Effect Versus Measurement Error)
(3-43) (3-46a)
7 =
and (3-45) = s l~
i = 1
(3-47)
/2 (w, v) = M,
18 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
the variable is defined has been discussed. Implicit In this model, the individual predicted values are
in this discussion is the assumption that the support obtained from either the ordinary or the universal
of the variable being predicted is defined in the same kriging equations. However, computation of the block
way as the variables that make up the measurements. kriging variance is not as simple as computation of the
However, there may be applications where estimating point kriging variance because the individual kriging
the average value of Z for an estimation block of much estimates are not independent of one another. There
larger area than is represented by an individual sample are simple modifications to the kriging equations,
is necessary. For example, an estimate of the average discussed in sections 3.3.1 and 3.3.2, that can be
concentration of a contaminant in an entire aquifer that used to directly compute the kriging estimate of ZB
is based on point measurements at various locations and its kriging variance (Isaaks and Srivastava, 1989,
might be needed. In other applications, an estimate of chap. 13). The equations are not presented in this
the average concentration of soil contaminant, in daily report. The computer packages described in the
excavation volumes that are much larger than the next section can be used to compute block kriging
volume of an individual sample, may be needed. estimates. In general, kriged values of block averages
Let ZB be the average value of Z(x) for a particular are less variable than kriged values at single locations.
block B, Consequently, the blocked kriging variance tends to
be smaller than the kriging variance at a single
location.
(3-48)
3.4 Co-Kriging
where *o/» i = !,. ,«, denotes m prediction locations Kriging as discussed so far provides a way of
in block B. The object is to predict this average predicting values of a regionalized variable ZQt) at a
rather than the regionalized random variable at a location XQ based on measurements of the same vari-
single location. In many applications, the locations *o/ able at locations *i, *2» » %n- ^n some situations,
might correspond to nodes of a regular grid or finite- however, measurements could be available not only
element nodes in a ground-water model. The results of ZQt), but also of one or more other variables that
of the block kriging are dependent on m and on the can be used to improve predictions of Z(XQ). The vari-
prediction locations. Selecting a large number of loca- able Z(x) is called the primary variable because it is
tions in block B, where each location has approxi- the one to be predicted, and the other variables are
mately the same representative area, probably is the called secondary variables. Co-kriging is a technique
best approach to block kriging (Isaaks and Srivastava, that uses the information contained in secondary vari-
1989, chap. 13). ables to predict a primary variable. For example,
The objective of block kriging is to obtain the suppose that Z(x) is a regionalized variable repre-
best linear unbiased predictor of Z# and an estimate of senting the hexavalent chromium concentration, a
the block kriging variance based on the measurements. relatively difficult determination, and suppose that
The model for Z(x) can be the constant-mean model the hexavalent chromium concentration needs to be
(eq. 3-30) assumed for ordinary kriging or the more predicted at a location XQ based on measurements of
general linear-regression model (eq. 3-43) assumed hexavalent chromium at other locations. However,
for universal kriging. For either, the predicted value there also are measurements of a second, relatively
of ZB coincides with the average of the predicted easily determined contaminant, such as lead, that, for
values of the individual measurements in the block; the purposes of this example, tend to be correlated
that is, with hexavalent chromium concentration, and these
data are to be used as well. Denote the second vari-
able, lead, by a regionalized variable W(x), and assume
that measurements have been made on W at m loca-
(3-49) tions *'i, *'2, ..-,x m. The co-kriging predictor of Z(XQ)
i = 1 then is
20 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
Section 3.5.1 contains methods for using kriging in which Y(x) = \n[Z(x)]. A 95-percent prediction
output to obtain prediction intervals or quantiles when interval for Z(x) then is (exp[T(*o) - l^a^feo)],
the regionalized random variable is either normally espfTfeo) + 1-96 a^feo)]}. As long as the transforma-
distributed or can be transformed to a near-normal tion is a one-to-one function, such as a logarithmic
distribution. Section 3.5.2 discusses indicator kriging, transform, prediction intervals for the original data can
which is a nonparametric technique for obtaining be obtained by simply back-transforming prediction
quantiles when data cannot be adequately transformed intervals for the transformed data.
to a normal distribution. Although prediction intervals and probabili-
ties can be easily obtained using simple back-
3.5.1 Normal Distributions and Transformations transformation, obtaining a predictor of the untrans-
formed data that is unbiased and optimal in some
For prediction at a location XQ, a kriging anal- sense is more difficult. For example, using a loga-
ysis produces the predictor Z(XQ) and the associated rithmic transformation, a kriging analysis using the
kriging variance c^(*o)- If more informative proba- transformed data yields a predictor Y(XQ), which is
bility assessments are to be made, the ideal situation the best linear unbiased predictor of Y(XQ). However,
is when Z(x) is assumed to be a Gaussian, or normal, the back-transformed value Z(XQ) = exp[T(*o)]
process, which means that [Z(*i),..., Z(xn)] has a joint does not possess the same optimality properties as a
normal probability distribution for any set of n loca- predictor of Z(XQ). The technique known as log-normal
tions and any value of n. Then the conditional proba- kriging, and more generally as trans-normal kriging,
bility distribution of Z(XQ), given the n measurements, has been developed to obtain predictors when transfor-
is a normal distribution that has a conditional mean mations are made (Journal and Huijbregts, 1978), but
equal to the kriging predictor Z(XQ) and conditional because of the complexity involved, the technique is
variance equal to the kriging variance G%(XQ). not usually used by practitioners. For example, if a
This normal distribution can be used to obtain a predicted value corresponding to Zfeo) needs to be
prediction interval for Z(^Q) (conditional on the obtained for contour plotting, the kriging predictions
measured data). For example, from a table of the Y(XQ) may be back-transformed and plotted, as long
normal distribution, a value of 1 .96 corresponding as the investigator realizes that such values do not have
to a 0.95 (two-sided) probability can be obtained. the usual kriging optimality properties.
Then the assertion that there is a 95-percent chance
that Z(XQ) is in the 95-percent prediction interval
3.5.2 Indicator Kriging
[Z(XQ) - \9.6aK(xQ), Z(XQ) + \.96aK (xQ)] can be
made. Knowing this interval is much more useful than There may be situations when a transforma-
simply knowing the kriging predictor and variance. tion that makes Z(x) approximately normal cannot
To illustrate quantile estimation, suppose be easily determined. In such situations, indicator
that contaminant concentrations are being studied, kriging can be used to infer the probability distribu-
and a concentration that has only a 1 -percent tion of Z(x). Because no distributional assumptions
chance of being exceeded at location *o needs to are made, this technique is known as a nonparametric
be determined. The appropriate (one-sided) value statistical technique. An example of indicator kriging
from a normal table is 2.33, so the desired estimate is included in section 6.0, and an article by Journel
(1988) is a good reference for additional information
about indicator kriging.
Even if Z(x) is not Gaussian, a transformation,
Y(x) = T[Z(x)], can often be found so that Y(x) is To perform indicator kriging, a special trans-
approximately Gaussian. When a transformation is formation, known as an indicator transformation, is
made, the kriging analysis is performed using the applied to Z(x):
transformed data Y(x), and the inverse transformation
may be applied to obtain prediction intervals for the
original data. For example, the most common transfor- '1, Z(x)<c
I(x, c) = (3-51)
mation is the (natural) logarithmic transformation, 0, Z(x)>c
22 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
written by statisticians that present general treatments 4.3 Software
of spatial processes, but that lack detailed discussions
of kriging, are by Cliff and Ord (1981) and by Ripley The geostatistics software described in this
(1981). section is limited to a few readily available public-
David's (1977) text was the first extensive domain packages that are executable at least on
discussion of the practice of geostatistics and kriging the DOS and sometimes on the UNIX platforms.
in mining applications, and the discussion is presented There are several commercial packages that are
from a practitioner's viewpoint. The text references being marketed, but these packages are not reviewed
many specific mining applications and results for in this report. Each of the software packages described
geostatistics. A broad statistics text by Davis (1973) in this report are listed in table 1, which may serve
with a bent toward geological applications, serves as a reference guide to other software packages.
as a reference for standard statistical procedures One of the earliest interactive kriging software
needed in geological applications of geostatistics. packages was developed by Grundy and Miesch
A book by Bras and Rodriguez-Iturbe (1985) that (1987). Overall, this general statistics package, known
discusses a range of techniques for stochastic as STATPAC, contains a series of programs that can
modeling in hydrology includes a chapter on handle two-dimensional kriging, including universal
applications of kriging. There is a fairly complete kriging. The package has capabilities for univariate
mathematical development of kriging with details statistics, transformations, variogram analysis, and
of an application to predict mean areal precipitation. cross validation (table 1). The graphics in the package
In an article prepared for the U.S. Environmental are limited to simple line-printer plots of the sample
Protection Agency, Journel (1993) discussed geostatis- variogram points and data maps. The menu-driven
tics as it relates to environmental science. Finally, Olea package includes a tutorial using all of the kriging
(1991) presented a useful glossary of geostatistical routines. The package is distributed with not all, but
terms. most source codes, and, therefore, can be modified
by the user if desired. All two-dimensional kriging
routines can be executed from the command line,
4.2 Journals which provides users with the opportunity for batch
The journal Mathematical Geology by the processing.
International Association for Mathematical Geologists The geostatistical environmental-assessment
reports new developments in the theory and applica- software, known as GEO-EAS (Englund and Sparks,
tion of kriging. Although many of the articles present 1991), also is an interactive, menu-driven kriging
new applications of kriging tools, many articles software package for performing two-dimensional
also are dedicated to the derivation of statistical kriging. It has no direct provisions for universal
properties of the variogram, to kriging estimation, kriging (table 1). GEO-EAS does have an advantage
and to cross-validation results. Journals such as over STATPAC through its enhanced graphics capabil-
Water Resources Research, published by the ities, which are useful in the interactive fitting of theo-
American Geophysical Union, and Groundwater, retical variograms to sample variogram points. In
published by the Association of Ground Water addition, in the computation of the sample variogram
Scientists and Engineers, contain articles describing points, GEO-EAS allows for variable bin sizes, the
special applications of kriging techniques in the envi- use of which are further discussed in section 5.0.
ronmental arena. Water Resources Research includes STATPAC and GEO-EAS were originally devel-
many theoretical articles. Other journals that may oped for the personal computer. Since then, versions
contain information addressing geostatistics are the of GEO-EAS have been developed for some types of
Journal of Environmental Engineering, published by work stations. The kriging routines in STATPAC have
the American Society of Civil Engineers; Stochastic not been adapted to workstations.
Hydrology and Hydraulics, published by Springer A third software package, the geostatistical
International; and the North American Council on software library known as GSLIB (Deutsch and
Geostatistics, published by the Colorado School of Journel, 1992), is a suite of programs developed over
Mines. the years at Stanford University, Stanford, California.
[Note: STATPAC, statistical software package developed by Grundy and Miesch (1987); GEO-EAS geostatistical environmental-assessment software
developed by Englund and Sparks (1991); GSLIB geostatistical software library developed by Deutsch and Journel (1992); CMS, ground-water modeling
system developed for the U.S. Department of Defense]
24 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
It is presented as a collection of routines that are the variogram, is a mathematical function or model
machine independent (table 1) and are intended to be that is fitted to sample variogram points obtained
used as a modular concept. The package is distributed from data. Permissible models, which include those
as a suite of FORTRAN source codes that need to be models discussed in section 3.0, belong to a family
compiled. To use GSLIB effectively, a relatively high of smooth curves having particular mathematical
level of familiarity with geostatistics is required. Like properties and are each specified by a set of
the other two software packages, GSLIB handles vari- parameters. Section 5.0 describes a sequence of
ogram analysis and kriging techniques (table 1). Two stages for estimating and investigating sample
of its primary advantages over the other two packages variogram points and a calibration procedure for
are its simulation techniques and its ability to analyze specifying the parameters of the variogram model
three-dimensional data sets. Such techniques are eventually to be fitted to the sample points. Although
useful especially in estimating potential extreme the calibration procedure is largely an objective way
outcomes in a geostatistical analysis. for evaluating theoretical variograms, the process of
The U.S. Department of Defense Groundwater obtaining sample variogram points and finalizing a
Modeling System (GMS) is a fourth software package theoretical variogram remains an art as much as a
that has kriging capabilities. GMS is a windows-based science. An understanding of the material presented
integrated modeling environment for site characteriza- in section 3.0 and professional judgment achieved
tion, ground-water flow and transport modeling, and through experience in geostatistical studies are impor-
visualization of results. The GSLIB software has tant in effectively using the guidelines presented in
been implemented in GMS to facilitate two- and section 5.0.
three-dimensional kriging and interactive variogram An accurate estimate of a variogram from a
modeling. GMS also provides comprehensive visual- kriging perspective is needed because the correlation
ization techniques and other interpolation techniques matrix used to obtain the kriging weights is developed
that can be used as alternatives to kriging. The GMS from the variogram values. Even more directly, the
system was developed for the U.S. Department variogram affects the computation of the kriging vari-
of Defense by the Brigham Young University ance (eqs. 3-36 and 3-47) through the product of the
Engineering Computer Graphics Laboratory. GMS kriging weights and the correlation values. An accu-
may be obtained from the U.S. Army Groundwater rate variogram also can be used outside the strict
Modeling Technical Support Center, Waterways context of kriging. For example, in augmenting a
Experiment Station, Vicksburg, MS 39180. spatial network with new data-collection sites, the
In geostatistical software and literature, there range parameter of the variogram could be used as
can be differences in jargon or notation. These differ- the minimum distance of separation between the new
ences may cause some initial confusion if users or sites and between the new and the existing sites to
readers do not familiarize themselves with the jargon maximize overall additional regional information. In
or notation. For example, some authors may use another nonkriging-specific application, the variogram
the term "semi-variogram" rather than "variogram"; is used in dispersion variance computations in which
others may express random variables as other than Z the variance of areal or block values is estimated
(which has been used in this report); and different soft- from the variance of point-data values (Isaaks and
ware often has different references for directional Srivastava, 1989, p. 480).
angles when discussing anisotropy. The stages of variogram construction are
described based on an example data set of ground-
water elevations measured near Saratoga, Wyoming
5.0 PRACTICAL ASPECTS OF (Lenfest, 1986). The data set is summarized in table 2
VARIOGRAM CONSTRUCTION and the relative locations of the data are shown in
AND INTERPRETATION figure 4.
The sequence of steps in computing sample
Section 3.0 presented the mathematical variogram points depends on the stationarity properties
foundation for geostatistics and the kriging technique. of the regional variable represented by the data. If
One theme that pervades the technique is the impor- the mean of the regional variable is the same for
tance of the theoretical variogram. The theoretical all locations, then the mean is said to be spatially
variogram, or what is often referred to simply as stationary; if the mean changes with location, then
[Note: Base unit for Saratoga, water level A and B and bedrock A and B is meters; base unit for water quality A is log concentration, concentration in
micrograms per liter]
26 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
The approximation to equation 3-19 begins local or regional gradient. In such studies, sample
by computing squared differences, £>y, from the variogram computations need to be made using resid-
data values z(x{), z(x,2),...z(xn) collected at locations uals obtained by subtracting the estimated drift value
*1, x2... XH at each location from the value of the datum at the
location.
The differencing of the data in equation 5-1
(5-1)
is done without considering the relative direction
between the locations; that is, D/; is isotropically
If the spatial mean is stationary, then the squared computed. A plot of D/; versus h^ for all i,j (i >j),
differences of the data are equal to the squared differ- where h^ = \£f- Xj I, produces a cloud of points
ences of the residuals, and sample variogram computa- whose properties govern the behavior of y. The
tions can be continued using the data themselves. If
central tendency of the cloud generally increases
the spatial mean is strongly nonstationary, the plot of
with h. A substantial increase in the central tendency
equation 5-1 versus the distance between associated
points may indicate a trend or drift that needs to be that persists for large h can indicate a nonstationary
removed before further variogram computations can spatial mean. The cloud computed for the Saratoga
be made. Drift needs to be considered in HTRW data, with ground-water levels (z) in meters and
studies such as determining contaminant concentra- lag (h) in kilometers, is shown in figure 5 and does
^
tions areally dispersed from localized sources or show increasing D (meters squared) with increasing
determining ground-water elevations that follow a h, indicating potential nonstationarity.
6.000
Dashed line is ordinary least squares fit indicating slight parabolic shape
5.000
O
CO
(/)
rr 4,000
LJ
LJ
LJ 3,000
LJ
rr
LJ
Q 2,000
Q
LJ
rr
Z>
(/)
<
^ 1.000
10 20 30 40 50 60 70 80 90 100
DISTANCE BETWEEN PAIRS, IN KILOMETERS
Figure 5. Squared differences of values for all possible pairs of points for Saratoga data.
1,500
+ Lag with greater than or equal to 30 pairs
x Lag with less than 30 pairs
Q
LJ
< 1,000
ID
a x
o:
LJ
500
10 20 30 40 50 60 70 80 90 100
LAG, KILOMETERS
28 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
stage is adequate to indicate if nonstationarity needs 5.3 Variogram Refinement
to be addressed before sample variogram refinement
is done. In section 5.1, an initial y was specified by
points computed from equation 5-2. In general, the
larger N(hk) is for any bin k, the more reliable are the
5.2 Nonstationarity points defining y (h^. Also, the larger K is, the greater
the number of sample variogram points shaping y.
An indication of substantial nonstationarity or However, N(h^) and K are competing elements of y.
drift in the spatial mean would be a parabolic shape Journel and Huijbregts (1978) suggested that each
through all lags in a plot of y. This shape occurs bin k could have N(hk) equal to at least 30 pairs.
because differences between data contain differences The American Society for Testing Materials (1996)
in the drift component that increase as h increases. If suggested 20 pairs for each lag interval. For small
equation 3-16 is inserted into equation 3-17, squaring data sets, the number of intervals may have to be
the differences in jo, greatly amplifies the increase with small to guarantee either number of suggested pairs
h. In this case, drift, generally a low-order (less than in all bins.
three) polynomial drift in (M,V), is fitted to the data and The minimum number of data, n, needed
subsequently subtracted from the data to obtain resid- to satisfy the N(hk) requirements for all bins of a
uals. Trend surfaces are not necessarily limited to sample variogram is difficult to determine. Simple
polynomial forms. For example, a numerical model of combinatorial analysis can establish a sample size
ground-water flow may be used to obtain residuals of needed for a given total number of distinct pairs
ground-water head data. obtained from the sample, but the analysis does not
In theory, the polynomial trend indicates a address the spatial considerations needed for proper
slowly varying drift in the spatial mean and, as lagging. For example, for data collected on a uniform
such, one regional trend surface should be fitted to grid and equal-sized bins, fixing an n to just satisfy
all the data. However, often the drift and residuals are the minimum N(hk) for the small lags would yield
obtained locally; that is, using moving neighborhoods insufficient data pairs to meet the minimum Nty^
of locations. Therefore, estimates of these values at for the larger lags. Fixing an n to ensure the minimum
any point are made using a decreased number (usually N(hk) for the large lags would generally have N(hk)
between 8 and 16) of surrounding locations, which is much greater than the minimum for the small lags.
done because, ultimately, the kriging estimates are Therefore, the question of how much data are needed
made using only the data values in the given neighbor- to adequately compute a variogram also needs to
hood. Manipulating the kriging matrices takes less address the relative locations of the data-collection
time when a small number of data are used to make sites.
estimates, and these efficiencies can be substantial The first 10 of the 12 bins for y for the Saratoga
for dealing with large data sets. Little accuracy is lost data contained more than 30 data pairs. Therefore, the
because the nearest neighbors have the most effect in bin width was decreased to have more points define
the kriging weighting scheme. the early part of y. These bin-width adjustments were
A parabolic shape for y in the Saratoga data made to refine y whether it was computed from the
is shown in figure 6 for the sample variogram points data or from the residuals. A plot of y for the residuals
plotted for lags up to about 32 kilometers (the first four for the Saratoga ground-water elevations with the bin
points) and for lags greater than about 56 kilometers. width narrowed to about 6.5 kilometers is shown in
A parabolic shape in the sample variogram points was figure 8.
not surprising because analysis of the data indicates a Spatial data usually are not collected on a
north-south gradient in the ground-water levels. The uniform grid, but occur in a pattern that reflects
simplest polynomial trend, linear in u and v, was fitted problem areas, accessibility, and general spatial
to all the data using ordinary least-squares estimation. coverage. In the Saratoga data set, nonuniform data
Residuals obtained by subtracting this regional trend spacing resulted in the number of data pairs in each
surface from the data were used to reestimate y in bin being highly variable among the bins, although
equation 5-2, and the sample variogram for the resid- there were still greater than 30 data pairs. This vari-
uals is shown in figure 7. ability yields different reliabilities for the points
250
+ Lag with greater than or equal to 30 pairs
x Lag with less than 30 pairs
200
Q
LJ
o:
a
in 150
CO
C£
LJ
100
<
o
50
10 20 30 40 50 60 70 80 90 100
LAG, IN KILOMETERS
Figure 7. Sample variogram points for ordinary least-squares trend residuals for Saratoga data.
30 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
250
+ Log with greater than or equal to 30 pairs
x Lag with less than 30 pairs
200
Q
UJ
150
CO
O:
Ld
h-
Ld
100
<
2
2
<
50
10 20 30 40 50 60 70 80 90 100
LAG, IN KILOMETERS
Figure 8. Sample variogram points for ordinary least-squares trend residuals for Saratoga data binned to 6.5 kilometers.
Second, a proper transformation of data, subtleties in interpreting the kriging results of the
whose probability distribution is highly skewed, transformed data or in back-transforming kriging
often produces a set of values that can be approxi- results into the untransformed (original) units, as
mately normally distributed by mitigating the effect discussed in section 3.5.1. If a satisfactory variogram
of problematic extreme data. A data set having a of the original data cannot be achieved and a transfor-
normal distribution is important in kriging when confi- mation is indicated, the computation of a sample vari-
dence levels of the estimates are desired. The use of ogram needs to begin again with equation 5.2. Even
confidence levels in a kriging analysis is discussed though no transformation was needed for the Saratoga
in section 6.0. data, an example using a logarithmic transformation
Among the more common transformations is and an example using the indicator transformation are
the natural logarithmic (log) transform. For example, presented in section 6.0.
in this transformation, the y is the sample variogram
of logarithms, and subsequent kriged estimates are 5.4.2 Directional Variograms and Anisotropy
logarithms. Another transformation that is often used,
especially in spatial analyses of contaminant levels, is Anisotropy in the data can be investigated by
the indicator transformation described in section 3.5.2. computing sample variograms for specific directions.
Although a transformation might result in a better Locations included in a given direction from an orig-
distribution of sample variogram points, there are inal location are contained in a sector of a circle of
32 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
1.500
+ Lag with greater than or equal to 30 pairs
x Lag with less than 30 pairs
< 1.000
z>
a
c/o
C/O
a:
LJ
500
+ +
10 20 30 40 50 60 70 80 90 100
LAG, IN KILOMETERS
e 1.500
+ Lag with greater than or equal to 30 pairs
\ x Lag with less than 30 pairs
a
LJ
< 1.000
o
C/l
C/l
a:
LJ
500
<
o
, , , rtr , . . . I
10 20 30 40 50 60 70 80 90 100
LAG, IN KILOMETERS
Figure 9. Initial directional sample variogram points for raw Saratoga data A, north-south and
B, east-west.
34 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
5.5.3 Gaussian Variogram 1. An initial variogram is specified and drift coeffi-
cients are computed to obtain residuals. For this
The Gaussian variogram parameters (eq. 3-25)
step, a pure nugget (that is, a constant) variogram
are a nugget value, g, and a sill, s; and this variogram
can be used to compute the initial estimates of
also has a practical range, r. The Gaussian variogram
the drift coefficients. These initial coefficients
is horizontal from the nugget, becomes a concave yield an ordinary least-squares estimate of the
upward function at small lags, inflects to concave drift and a first-iteration sample variogram of
downward, and asymptotically approaches a sill residuals.
(fig. 3). After a nugget and sill are specified based on
the Y points, the variogram value at a lag of one-half 2. A theoretical variogram is fitted to the sample
the estimated practical range is two-thirds of the sill. variogram of the residuals and is used to obtain
Again, this fitted variogram needs to be supported by updated drift coefficients.
the Y points to a reasonable degree. As is described 3. The residuals from the drift that were obtained in
in the example using the Saratoga data, the Gaussian step 2 are used to compute an updated sample
variogram often is used where the analyzed variable variogram.
is spatially very continuous, such as a ground-water 4. The sample variogram computed at the end of
potentiometric surface. step 3 is compared to the sample variogram from
step 2. If the two sample variograms compare
5.5.4 Linear Variogram favorably, then the theoretical variogram from
step 2 is accepted as the variogram of residuals
Parameters for a linear variogram (eq. 3-26) for subsequent kriging computations. If the
are a nugget value, g, and a slope, b. Sample points sample variogram from step 3 differs markedly
that indicate a linear variogram increase linearly from from the sample variogram from step 2, steps 2
the nugget and fail to reach a sill even for large lags through 4 are repeated using the sample vario-
(fig. 3). Using the nugget as the intercept, the slope is gram from the most recent step 3.
computed for the line passing through the Y points. A
Generally, the plot of the points of Y from
pseudosill can be defined as the value of the line at the
a set of residuals initially increases with h, reaches
greatest lag, hmax, between any two locations. This lag
a maximum, and then decreases as shown in figure 7.
becomes the defacto range, r, for a linear variogram. This typical haystack-type behavior, discussed by
Examples of the use of the linear variogram are in David (1977, p. 272-273), is attributed to a bias
hydrogeochemical studies of specific conductance and resulting from the estimation error in the drift and its
in studies of trace elements, such as barium and boron coefficients. This behavior in the variogram of the
(Myers and others, 1980). residuals generally would more readily occur with a
high degree of drift polynomial and need not prohibit
acceptable variogram determination because the initial
5.6 Additional Trend Considerations points of the sample variogram of residuals are still
indicative of the theoretical variogram. For example,
If a drift in the data is indicated as in section 5.2,
the lag associated with the maximum of Y of the resid-
the theoretical variogram of residuals that has been uals can be a good first approximation for the range of
fitted thus far is used to update the drift equation. the theoretical variogram.
Although ordinary least squares often suffices for
computing a polynomial drift equation, drift determi-
nation is a function of Y when the data are spatially 5.7 Outlier Detection
correlated. But Y cannot be estimated until a drift
equation is obtained to yield the residuals. Therefore, Outliers in a data set can have a substantial
obtaining a sample variogram and a subsequent adverse effect on Y- However, divergent data can
theoretical variogram from drift residuals of a speci- be screened for evaluation using a Hawkins statistic
fied drift form is an iterative process (David, 1977, (Hawkins, 1980), which is described in the context of
p. 273-274) using the following steps: kriging by Krige and Magri (1982). A neighborhood
36 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
Using the Saratoga data, a spherical variogram
mm was fitted to the refined sample variogram of the resid-
i= i uals. The estimated nugget was about 1.49 meters
squared, the sill was 133.8 meters squared, and the
(5-5)
range was about 48 kilometers. Because of the diffi-
culty in determining an exact extrapolated value for
the nugget, the value of 1.49 meters squared was
selected based on an estimated measurement error
- related to obtaining water levels at the well depths
i=i in the Saratoga Valley.
After two iterations using drift residuals, as
The expression to be minimized is called the described in section 5.6, a final variogram was chosen
kriging root-mean-squared error and the constraint that had a nugget of 1.49 meters squared, a sill of
is called the reduced root-mean-squared error. The 148.6 meters squared, and a range of 44.8 kilometers
reduced root-mean-squared error needs to be well (fig. 10). These parameters defined the theoretical
within the interval having endpoints variogram used to obtain the cross-validation errors
through universal kriging with an assumed linear
drift. The best combination of statistics that could be
\T obtained after several attempts at refining the model
1+ 2 -
were a root-mean-squared error of 3.45 meters and
a reduced root-mean-squared error of 0.5794. The
and reduced root-mean-squared error is too small, indi-
cating that the kriging variances produced by the
model are relatively large compared to the actual
1- squared errors. This fact, coupled with the rather
large root-mean-squared error, warranted additional
variogram refinements. In section 5.8.2, a Gaussian
(Delhomme, 1978). An additional check on the good- variogram was fitted to the data; the Guassian vario-
ness of the cross-validation results is the unbiasedness gram produces much better cross-validation results
condition where than the results from the spherical variogram.
LJ
O
CO
CO
or
LJ
LJ
<
^
s
<
50 60 90 100
KILOMETERS
Figure 10. Sample variogram points and theoretical spherical fit for iterated Saratoga residuals.
If the reduced root-mean-squared error is too The reduced kriging errors may not approximate
small, as in the Saratoga example, extending the range a standard normal distribution. If so, a transformation
(equivalent to decreasing the slope for a linear vario- of the data may be needed to achieve a more normal
gram) decreases the kriging variance and, thus, distribution, and the variogram estimation procedure
increases the reduced root-mean-squared error. If a would be repeated.
shift in the nugget is needed, a decrease in the nugget Because no convergence could be reached for
decreases the kriging variance. If the reduced root-
parameter values of a spherical variogram for the
mean-squared error is too large, then a contraction of
Saratoga data, a Gaussian theoretical variogram was
the range or a positive shift in the nugget can be made,
fitted to the sample variogram of residuals in figure 8.
based on the priority and the extent of the changes.
Generally, changes in these parameters also have This choice was made because the initial sample
an effect on the mean-squared error. The larger the variogram points seemed to have a slight upward
nugget is as a percentage of the sill, the larger the concavity, but eventually reached a sill. This behavior
mean-squared error is. In general, improvements in can be attributed to correlation rather than to further
one statistic are usually made at the expense of the drift. After an iterated cross validation using the
other statistics. The optimization of the statistics as a Gaussian parameters, a Gaussian variogram that had a
set is, in effect, a trial-and-error procedure that is oper- nugget of 1.49 meters squared, a sill of 185.81 meters
ationally convergent. squared, and a range of 27.52 kilometers (fig. 11)
38 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
yielded a root-mean-squared error of 2.33 meters and 6.0 PRACTICAL ASPECTS OF
a reduced root-mean-squared error of 1.083 meters. GEOSTATISTICS IN HAZARDOUS-,
The mean cross-validation error was 0.0195 meter. TOXIC-, AND RADIOACTIVE-
These values represented an improvement over the WASTE-SITE INVESTIGATIONS
spherical variogram and were deemed acceptable for
the Gaussian variogram. In this section, several example applications
A probability plot of the reduced kriging are described. The applications have been developed
errors using the final Gaussian variogram is shown using hydrologic, geologic, and contaminant data
in figure 12. The plot is reasonably linear between from established and well-studied hazardous-waste
two standard deviations and, thus, approximates a sites. The real nature of the data enables discussion
standard-normal-distribution function. A plot in of some problems that can occur during HTRW-site
figure 13 of the measured data versus their kriged investigations that originate, not only from natural
estimates indicates that the linear drift/Gaussian field conditions, but also from typical problems that
variogram model selected for the Saratoga data are associated with the types of data involved. In
would produce accurate estimates of ground-water addition, the real nature of the example data provides
elevations for interpolation or contour gridding in the an opportunity for comparison between kriging esti-
region. mates and the real data; these comparisons are brief
300
+ Lag with greater than or equal to 30 pairs Gaussian fit
x Lag with less than 30 pairs
Gaussian model fitting parameters
250 Nugget = 1.49 meters squared
Sill = 185.81 meters squared
Range = 27.52 kilometers
200
ID
O
CO
CO
150
X
100
<
O
50
10 20 30 40 50 60 70 80 90 100
LAG, KILOMETERS
Figure 11. Sample variogram points and theoretical Gaussian fit for iterated Saratoga residuals.
2.5
1.5
ct:
o
z
O
CO
O -0-5
CO
LU
P
< -1.5
-2.5
-3.5
-3.5 -2.5 -1.5 -0.5 0.5 1.5 2.5 3.5
REDUCED KRIGING ERRORS
and general. This report does not provide comprehen- The GEO-EAS software has been used
sive analyses of data that are available in other more whenever the example data did not require universal
elaborate studies. kriging; for those examples, STATPAC was used.
The principal intent of the examples is to As indicated in section 4.0, both of these software
provide systematic descriptions for a few of the packages run on the DOS platform (table 1), which
large number of possible applications that may be is probably most convenient to readers. The results
used during HTRW-site investigations. The examples of kriging estimates are portrayed by gray-scale maps
are not intended to provide guidance for comprehen- rather than by contours because of the objective nature
sive analysis of the included data. However, this report of the gray-scale format. North is at the top of all maps
presents some fundamental problems that can occur presented although this orientation may represent
in geostatistical applications and, in some examples, some deviation from the real data.
indicates some possible alternatives.
With each example, a purpose is established 6.1 Ground-Water-Level Examples
and a general environmental setting is described.
Most aspects of variogram construction and calibra- The principal purpose of the ground-water-level
tion are briefly described and are shown in figures examples is to familiarize the reader with a kriging
and listed in tables. A comprehensive treatment of exercise using ground-water levels and to indicate
variogram construction has been presented in simply how kriging standard deviations may be useful
section 5.0. in evaluating monitoring networks. The data are from
40 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
a water-table setting in unconsolidated sediments the form a + bu + cv, observed in the measured
where the local relief for the land surface is about water levels, where #, b, and c are constants
30 meters. The data involved in this example are determined in the iterative process.
considered virtually free of actual measurement error. 3. After drift was removed, residuals were determined
The location of measured water levels is shown to be stationary and universal kriging with a
in figure 144, and the basic univariate statistics for this linear drift was appropriate.
data set are listed in table 2 (water level A); modifica-
tions to the measured data, in the form of removal and 4. A Gaussian model was used to fit the stabilized
addition of measured values, are shown in figures 145 variogram of residuals (fig. ISA), which
and C. The techniques described in section 5.0 were has a nugget of 0.09 meter squared, a
used to guide the following steps for variogram sill of 2.69 meters squared, and a range of
construction: 1,219 meters (table 3).
1. A raw variogram analysis and basic hydrologic Cross validation was performed, and the results are
knowledge of water-level behavior indicated shown in figures 15B and C and listed in table 3.
that universal kriging would be needed for this The cross-validation statistics conform to the criteria
analysis. discussed in section 5.0.
2. To obtain a stable variogram of residuals, an itera- Linear drift is commonly observed in ground-
tive, generalized least-squares operation was water-elevation data where there are no major anthropo-
initially used to remove prominent linear drift of genic activities, such as large ground-water withdrawals.
750
700
CO
o:
LJ
I
Ld
650
Ld
CO
LU
600
O
550
500
500 550 600 650 700 750
MEASURED VALUE, IN METERS
Figure 13. Scatterplot of measured versus kriging estimates from cross validation of Saratoga data.
1,470
1,370
1,470
1,370
_J
1,270 1,270
1,170 1,170
1,070 1,070
970 970
870 870 m m
770 770
ff
670 670
570 570
470 470
370 370
270
270 EB EB EB
170
170
70
m EB EB
70
mm m m m m mmmmmmmmmm
240 390 540 690 840 990 1,140 1,290 1,440 1,590
-60 90 240 390 540 690 840 990 1,140 1,290 1,440 1,590
MAP DISTANCE, IN METERS MAP DISTANCE, IN METERS
EXPLANATION
EXPLANATION
S SJiSl
10 15 17 20
INDEX TO PLOTTED VALUES, IN METERS
INDEX TO PLOTTED VALUES, IN METERS
1,270
1,170
E
1,070 A Gaussian model is usually appropriate
970 for variograms of highly continuous variables,
870
such as ground-water-elevation data, and this model
770
670
is particularly appropriate in this example. The vario-
570 gram (fig. 15A) at small lags beyond the nugget has
470 an upward concavity that cannot be fit with a linear,
370
spherical, or exponential model. The observed shape
270
170
was interpreted as a function of continuous small-scale
variability. The Gaussian model fits the bowl shape of
BUB
240 390 540 690 840 990 1,140 1,290 1,440 1,590
the small lag data and other data well to a lag of about
MAP DISTANCE, IN METERS 610 meters, but it is not flexible enough to closely fit
EXPLANATION
the points much beyond 610 meters, indicating that
kriging estimates should be computed using neighbor-
INDEX TO PLOTTED VALUES, IN METERS
hoods with a search radius less than 610 meters. In
section 5.0, the initial part of the variogram was
Under these circumstances, there is usually a fairly described as having the most effect on subsequent
uniform and general ground- water movement along a kriging estimates.
flow path. This uniform and general nature introduces The established variogram then was used with
a nonstationary element to the data that, in geostatistics, the measured data to produce universal kriging esti-
is referred to as drift. As indicated in section 5.0, the mates for all points in a 26-by-26 grid that had a grid
presence of drift is indicated by a parabolic variogram size of about 61-by-61 meters. A gray-scale map
shape. In this example, the initial variogram in the raw of the kriging water levels is shown in figure 16A and
variogram analysis had a characteristic parabolic shape, basic univariate kriging estimate statistics are listed in
and a linear drift was identified. Once the drift was iden- table 4 (water level A). The kriging results are a good
tified and characterized, universal kriging procedures representation of the results from other more elaborate
were used. studies.
42 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
The kriging standard deviations for the kriging To produce the third map (fig. 14Q, nine
estimates are shown in figure \6B. The magnitude locations were added in the southwest corner where
of kriging standard deviations can provide investiga- the sampling density was relatively low and the kriging
tors with a direct indication of where the uncertainty standard deviation was relatively high. In section 3.3,
associated with kriging estimates is relatively high equation 3-47 indicates that the universal kriging
or low. The areas of the greatest uncertainty for variance depends on the variogram, the type of trend,
the kriged water levels are in the upper right and and the measurement locations; in this respect, the
kriging standard deviation does not depend on the
the lower left corners of figure 16B, where standard
values at the measurement locations. Consequently,
deviations are as high as about 1.4 and 0.8 meters.
values of zero were used for the nine new measurement
These areas are where the density of the measured locations and only the resultant map of kriging standard
data is relatively low. Throughout much of the deviations (fig. 16D) is of interest. The map shows that
remainder (about 70 percent) of figure 16B, the the kriging standard deviations in the lower left corner,
kriging standard deviation is almost constant at which formerly had values of about 0.8, approximately
about 0.35 meter. have been decreased by a factor of about 0.25, which
To use the kriging standard-deviation values indicates that the kriging estimates, based on the
more quantitatively, some assurance is needed that geometry of the network, are more reliable.
the measured data and the reduced kriging errors
are approximately normally distributed and that the
assumption of stationary residuals after drift removal 6.2 Bedrock-Elevation Examples
is correct. If assumptions are valid, then the basic
The following examples are for bedrock eleva-
statistical principles involving confidence intervals can tions. The principal purposes of the examples are to
be applied. In this example, the kriging standard devia- familiarize the reader with a kriging exercise using
tion of 0.35 meter throughout most of the map indi- bedrock elevations and to describe block kriging. The
cates that there is a 95-percent chance that the true data are from an area where bedrock consists of a
value at a location where there is a kriging estimate series of intercalated terrestrial deposits that have been
is within 0.76 meter (twice the kriging standard devia- weathered somewhat and then covered with alluvium.
tion) of the kriging estimate. The opportunity for measurement error in these data
As an example of evaluating network density is inevitable because the determination of just where
and the accuracy of kriging estimates, two new maps bedrock begins is complicated and subjective.
were developed. To compile the first map, a decrease The set of measured locations, set A, is shown
in network density was effected by removing nine in figure 11A, and the basic univariate statistics are
measured locations from the northwest part of the listed in table 2 (bedrock A); modifications to the
area (fig. 14#) where sampling density was high measured data, such as removal of sites, are shown
and kriging standard deviations were low. Kriging in figure \1B. The techniques described in section 5.0
were used to guide the following steps for variogram
estimates were produced for the same grid and
construction:
the basic univariate kriging estimate statistics are
listed in table 4 (water level B). The map shown 1. The raw variogram indicated a stationary spatial
in figure 16C indicates that the ratio of the original mean. The data were assumed to be suitable
kriging standard deviations and the kriging standard for ordinary kriging.
deviations with the nine measured locations removed 2. An isotropic Gaussian model was used to fit the
is always very close to 1.00, which indicates that there variogram, which had a nugget of 0.65 meter
is very little difference between the two sets of kriging squared, a sill of 12.54 meters squared, and a
standard deviations and that water levels are over- range of 914 meters (table 3, bedrock A).
sampled in the area where the nine measured locations 3. Cross validation was performed, and the results
were removed. (table 3, bedrock A) were not acceptable.
o
CO
1.5
0.5
25
20
CO
"< 15
CO
LJ
O
z
o
rr^
10
10 15 20 25
MEASURED VALUE, IN METERS
Figure 15. Variogram and variogram cross-validation plots for residuals in water-level examples
A, theoretical variogram; B, cross-validation scatterplot; and C, cross-validation probability plot.
44 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
3.5
2.5
1.5
or
o
O
or
°-5
CO
o
CO
< -1.5
z>
o
-2.5
-3.5
-3.5 -2.5 -1.5 -0.5 0.5 1.5 2.5 3.5
REDUCED KRIGING ERRORS
Figure 15. Variogram and variogram cross-validation plots for residuals in water-level examples
A, theoretical variogram; B, cross-validation scatterplot; and C, cross-validation probability plot Continued.
[Note: NA, not applicable; base unit for water levels and bedrock is meters; base unit for water quality A is log concentration, concentration in micrograms
per liter; base unit for water quality B is indicator units]
Bedrock A None 0/NA Gaussian 0.65 12.54 914 0.045 2.53 2.146
Bedrock B None 0/NA Gaussian 0.74 8.36 732 -0.010 1.34 1.192
Water quality A Natural log 150/45 Exponential 1.00 3.20 1,295 0.105 1.54 0.938
Water quality A Natural log 240/45 Exponential 1.00 3.20 228 0.105 1.54 0.938
NORTH
CL
-60 90 240 390 540 690 840 990 1,140 1,290 1,440 1,590
MAP DISTANCE, IN METERS
EXPLANATION __________
7 9 12 14 17 19
e
00
970
o
670
NORTH
CL
<
-30
-60 90 240 390 540 690 840 990 1,140 1,290 1,440 1,590
MAP DISTANCE, IN METERS
EXPLANATION
*« *:
i i i:rr:::3Piisiiiiiiiifcii£;^Tirf"^^^^aB^B^^B
i i i i
0.31 0.53 0.74 0.96 1.17 1.40
46 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
00
o:
LJ
o
z
NORTH
CL
-60 240 390 540 690 840 990 1,140 1,290 1,440 1,590
MAP DISTANCE, IN METERS
EXPLANATION
1,570
1,470
1,370
1,270
00
o: 1,170
1,070
970
870
LJ
O 770
z
670
570 NORTH
o
CL 470
370
270
170
70
-30
-60 90 240 390 540 690 840 990 1,140 1,290 1,440 1,590
MAP DISTANCE, IN METERS
EXPLANATION
[Note: Base unit for water level A and B and bedrock B and C is meters; base unit for water quality A is log concentration, concentration in micrograms
per liter]
Standard
Minimum Maximum Mean Median Skewness
Example deviation
Transformation (base (base (base (base (dimension-
identifier (base
units) units) units) units) less)
units)
Water level A Drift 7.42 19.8 14.0 13.6 3.09 0.11
Water level B Drift 7.49 19.8 14.0 13.5 3.09 0.11
Bedrock B None 7.96 19.8 12.6 12.1 2.35 0.82
Bedrock C None 8.14 19.8 12.6 12.1 2.33 0.82
Water quality A Natural log 2.92 7.07 5.17 5.03 0.72 -0.06
The cross-validation exercise produced a the kriging process so that each homogeneous
reduced root-mean-squared error of 2.146 (table 3, domain is addressed independently, becomes more
bedrock A), which indicates that the kriging variance attractive. In more complicated applications where
is underestimated. Further attempts to fit the Gaussian a large number of domains are present, a distributed
model to the sample variogram points produced better approach may be necessary to avoid an undue amount
cross-validation statistics; however, the Gaussian of compromise.
curve began to depart substantially from the sample The restriction of measured data, set B, is shown
variogram points at the low lag sample points. in figure 17B, and the basic univariate statistics are
As a result, the distribution of the residuals was listed in table 2 (bedrock B). The restriction exercise
examined, and the eastern, and especially north- resulted in removing 18 measured locations and in
eastern, parts of the area were determined to the truncation of the northeastern part of the area
contain problematic data that rendered the distribu-
so that the area became polygonal rather than rectan-
tion nonhomogeneous. The nonhomogeneous nature
gular. The techniques described in section 5.0 were
was related to an incised channel present on the
used to guide the following steps for variogram
bedrock surface. Therefore, the measured data
construction:
were restricted to exclude the outlying measurements.
Before the restriction, two alternative techniques for 1. A Gaussian model was used to fit the variogram,
dealing with the outlying measurements were consid- which had a nugget of 0.65 meter squared,
ered and deemed beyond the scope of this effort. a sill of 8.36 meters squared, and a range of
However, a brief discussion of the alternatives is 732 meters. The variogram indicated a station-
appropriate. ary spatial mean.
The first alternative was to fit a contrived 2. Initial cross validation was performed, and the
and nongradual surface to the measured data to remove nugget was changed from 0.65 meter squared to
the outlier effect. A splined surface might be capable
0.74 meter squared to improve cross-validation
of producing the desired result. The decision whether
statistics. The final variogram is shown in
or not to pursue such an alternative becomes some-
figure ISA, and the characteristics are listed in
what philosophical. In a relatively simple example,
table 3.
as in this bedrock example, such an alternative may
be entirely appropriate; however, this alternative 3. Final cross validation was performed, and the
may actually involve two unique and homogeneous results, shown in figures 18Z? and C and listed
domains. Therefore, the second alternative, distributing in table 3 (bedrock B), were acceptable.
48 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
: ' 'r4 ttLl 1 |'lr-1-|'lf-'i 1 'rt-i' 1 r>-t' 1 ''l' 1 ''l 1111 l 1 '''l'' 11 l 1111 l 11 ''l 1111 1 ' ' :
1,570 - LJLjf] LJ LJ LJ rLJ B~
A 1,470
\ 1_1 |
1,370 JljiijJWp ^ p_ Eg rj B n
(f) 1,270 rffii-3^ ^T1 ^
LJ
h-
LJ
1,170
1,070
L a D EI IB n B j
970
L a D H " j
Z
I_J
870
r a EO B H -
0
z
770
^ n D S ^
<c 670
:L i i m BOH Pl H H H :
Ul l"1 iSBeSEH b£i ^ j
570 NORTH
Q : .^ ;
470 - rr-| 91 -;
Q_ - LJ] Bj :
< 370
r '
270 -_ m B ^
170 ~- -;
70 i. ^
-30
60 140 340 540 740 940 1,140 1,340 1,540 1,740 1,940 2,140 2,340 2,540
MAP DISTANCE, IN METERS
EXPLANATION
i i i i i
3 7 11 14 18 22
yiSss '|
INDEX TO PLOTTED VALUES, IN METERS
1,570
B 1,470
1,370
in 1,270
ct : LJR [T| :
LJ 1,170
h- : t_J j I L-iJ :
LJ
^>
1,070 -; n D
n H
-;
970 ~ L _J ^^?
Z ; Q H ;
870 r D dig --.
Ld
O 770
f LI B 0 "
<C 670
: n
gg pg^JiiL«||
n
BB
^:
(f) 570
: ^_ ; NORTH
Q
470 H-j ||
CL "I
^ i
<C 370
^
270
170 - H
70
-30
60 140 340 540 740 940 1,140 1,340 1,540 1,740 1,940 2,140 2,340 2,540
MAP DISTANCE, IN METERS
EXPLANATION
11 14 22
INDEX TO PLOTTED VALUES, IN METERS
Figure 17. Location of measured data for bedrock-elevation examples A, original data and
B, restricted data.
O
LJ
1 "
O
CO
CO
or
CO
o;
< 15 -
CO
LJ
o
2
o
50 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
3.5
2.5
1.5
o
Q
a:
§ 0.5
I
-0.5
o
CO
y x"
< -1.5
-2.5
-3.5
-3.5 -2.5 -1.5 -0.5 0.5 1.5 2.5 3.5
REDUCED KRIGING ERRORS
Figure 18. Variogram and variogram cross-validation plots for bedrock-elevation examples
A, theoretical variogram; B, cross-validation scatterplot; and C, cross-validation probability plot-
Continued.
The large difference between the sill defined table 4 (bedrock B). The kriging results indicate
for the initial data set and the sill for the restricted channel-like features in the bedrock surface and a
data set [12.54 meters squared and 8.36 meters squared prominent bedrock high at the south border of the
(table 3)] supports the hypothesis that the original data area; the results are a good representation of the
set is actually two different domains. The final vario- results from other more elaborate studies.
gram then was used, along with the measured data, For an example of block kriging, an investiga-
to produce ordinary kriging estimates for all points tive goal of establishing block values of bedrock eleva-
in a 52-by-52 grid that had a spacing of about tion for a finite-difference ground-water-model grid
30-by-30 meters, which was truncated along the having about 120-by- 120-meter cells was assumed.
northeastern border because of the restriction operation. The same variogram and search criteria were used
For the kriging procedure, a search radius of about to estimate block values for a 13-by-13 grid that had
about 120-by-120-meter spacing; a 4-by-4 block was
914 meters, with a maximum of 16 and a minimum
specified. Each kriging value shown in figure 19C is
of 8 surrounding locations, was specified. It is not
an estimate of the average value of bedrock elevation
uncommon to specify a search radius that is greater throughout the about 120-by-120-meter block. The
than the variogram range; this practice helps ensure standard deviation for the block estimates is less than
that, in this case, between 8 and 16 points would be the standard deviation for the point estimates (table 4).
obtained to develop the kriging estimate. Gray-scale Gray-scale maps of the kriging estimates and of the
maps of the kriging estimates and kriging standard kriging standard deviations are shown in figures 19C
deviations are shown in figures 19A and B, respectively, and D, and the univariate kriging estimate statistics are
and the univariate kriging estimate statistics are listed in listed in table 4 (bedrock C).
O 770
670
570 NORTH
470
370
270
170
70
-30
-60 140 340 540 740 940 1,140 1,340 1,540 1,740 1,940 2,140 2,340 2,540
MAP DISTANCE, IN METERS
EXPLANATION
lit?
12 15
INDEX TO PLOTTED VALUES, IN METERS
1 ' I I I -
1,570 ^ ;;;::::::;::;::;
D 1,470 -:
,,,,-,. ,. , , , . . . ,..
;;;;;;;;;;;;;fc
,.:,. .,
_
1,370 F , 3 , , i.tfiiTlii.
(T) 1,270 A.«
cr i.ij:::l*i
LU
h- 1,170 : f
!*.
^
b
!v
LU 1,070 -
t f
ifi
z 970 ii.
1 jiis ~
^9 1 ,
^ 870 I «K
LJ 1
O 770 j 1
< 670 I
mmmtmiL
tn 570 1 ^^KXT NORTH
£ - ^^ ^7
^^BfT
KXXT
< 370 |
^^^^^^^ ' ~
270 J E
PV
170 I
[5[TTT'
70 1 rjl^lrlrT
(yHfTj
^n '.''f*' 1*' , , , | , , , , | . . , , 1 , , , , | , , . , 1 . .
-60 140 340 540 740 940 1,140 1,340 1,540 1,740 1,940 2,140 2,340 2,540
MAP DISTANCE, IN METERS
EXPLANATION
Figure 19. Kriging results for bedrock-elevation examples A, kriging estimates; B, kriging standard
deviations; C, block kriging results; and D, block kriging standard deviations.
52 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
I , , , , i , , , |-,-,-,-,-r ,-r-,-,-|-,-,-,-,-[-,-,- ,-i-p-r-,-,-|-
cr
Ld
I
LJ
(J
I
co NORTH
Q
Q_
-60 140 340 540 740 940 1,140 1,340 1,540 1,740 1,940 2,140 2,340 2,540
MAP DISTANCE, IN METERS
EXPLANATION
NORTH
60 140 340 540 740 940 1,140 1,340 1,540 1,740 1,940 2,140 2,340 2,540
MAP DISTANCE, IN METERS
EXPLANATION
M
Figure 19. Kriging results for bedrock-elevation examples A, kriging estimates; B, kriging standard
deviations; C, block kriging results; and D, block kriging standard deviations Continued.
D D
LU
1 -440 Dnn
I
LJ
n
r n
LJ 940
o
n
rn LJ
D
1vi
Q
0- 440
DD NORTH
rn
L_jJ
pi
EftffJ
-60
-30 470 970 1,470 1,970 2,470 2,970 3,470
MAP DISTANCE, IN METERS
EXPLANATION
i I
307 607 907 1,207 1,507 1,807 2,107 2,407 2,707 3,007
54 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
3. No trends were indicated during preliminary explo- back-transformed and log-space concentrations, as
ration, and ordinary kriging was tentatively well as the kriging standard deviations in log space,
selected as the appropriate technique. are shown in figures 22A, B, and C.
Natural log transformations are routinely The back-transformation procedure was a
needed for concentration data that vary over several simple exponentiation of the log-space kriging esti-
orders of magnitude, which is common in areas of mates. Such a back-transformation does not use bias-
contaminant plumes. The data were transformed to log correction factors to deal with moment bias; conse-
space and fit acceptable criteria for normality. After quently, the back-transformed values need to be inter-
transformation to log space, the techniques described preted as median values rather than average, or mean,
in section 5.0 were used to guide the following steps values. A simple back-transformation, however, is
for variogram construction: convenient and was performed, principally, to enhance
visual interpretation of the kriging estimates. Univariate
1. An exponential model was used to fit a directional statistics for the log-space kriging estimates are listed in
variogram at an angle of 150 counterclockwise table 4 (water quality A). The kriging results do have
degrees to the east-west base line. The variogram noticeable smoothing; however, they also indicate a
had a nugget of 1.00 log concentration squared, plume emanating from a location just northwest of the
a sill of 3.20 log concentration squared, and a center of the area and indicate movement and some
range of 1,295 meters [fig. 21A and table 3 (water dispersion to the southeast; the estimates are a very
quality A)]. good representation of the results from other more elab-
2. An exponential model also was fit to a directional orate studies.
variogram at an angle of 240 counterclockwise Additionally, to indicate the effect of the log
degrees to the east-west base line. The variogram transform on probabilities in converting, or back-
had a nugget of 1.00 log concentration squared, transforming, kriging estimates, the kriging estimates
a sill of 3.20 log concentration squared, and a and the kriging standard deviations, in log space,
range of 228 meters [fig. 21B and table 3 (water were used to estimate the one-sided 95th percentile
quality A)]. at each kriging-estimate location according to the
3. Cross validation was performed using the geometric formula:
anisotropy of the two variograms and the results
[figs. 21 C and D, and table 3 (water quality A)]
were acceptable. Co.95 = exp[Z(*0)
The residuals are symmetrically distributed
(fig. 21D). However, the scatterplot (fig. 21Q indi- where
cates that small concentrations were overestimated and
is the kriging estimate at location, £,,
that large concentrations were underestimated. This
in log space; and
discrepancy in the estimates does not indicate an error
in the model, but rather, indicates a consequence of GK feo) is tne corresponding kriging standard
data that have a large nugget compared to the sill; in deviation in log space.
this example, the nugget is approximately 30 percent The resulting map is shown in figure 22D and can
of the sill. The large nugget decreases the predictive be used to indicate areas where the true concentration
capacity of the model and increases the smoothing has only a 5-percent chance of exceeding the value
introduced by kriging. indicated.
The established variogram then was used, To perform indicator kriging, the indicator
along with the measured locations, to produce ordinary transformation, as described in section 3.0, was
kriging estimates for all points in a 40-by-20 grid using applied. An indicator cutoff equal to the median
a grid spacing of about 91 -by-91 meters. For the kriging value of 270 micrograms per liter for the untrans-
procedure, a search radius of about 1,524 meters with formed measured data was selected. The model for
maximum of 16 and a minimum of 8 locations was indicator kriging estimates the probability that the
specified. Gray-scale maps of kriging estimates, in concentration would be less than the indicator cutoff.
o:
<
o
oo
< 1
o
300 600 900 1,200 1,500 1,800 2,100 2,400 2.700 3,000
LAG, IN METERS
I 4
O
O
O
o
3 -
o
CO
2 -
1 -
Figure 21. Directional variograms and variogram cross-validation plots for ground-water-quality
examples A, theoretical major-direction variogram (southeast); B, theoretical minor-direction vario-
gram (northeast); C, cross-validation scatterplot; and D, cross-validation probability plot.
56 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
10
O 8
line of agreement __
o:
£ 7
LJ
O
8
O
O
< 4
^
CO
^ 3
O
z
o
2345678 10
MEASURED VALUE, IN LOG CONCENTRATION
3.5
2.5
1.5
o:
o
a
o:
0.5
£
b_ -0.5
O
CO
UJ
< -1.5
Z)
O
-2.5
-3.5
-3.5 -2.5 -1.5 -0.5 0.5 1.5 2.5 3.5
REDUCED KRIGING ERRORS
Figure 21. Directional variograms and variogram cross-validation plots for ground-water-quality
examples A, theoretical major-direction variogram (southeast); B, theoretical minor-direction vario-
gram (northeast); C, cross-validation scatterplot; and D, cross-validation probability plot Continued.
-60
-30 470 970 1,470 1,970 2,470 2,970
MAP DISTANCE, IN METERS
EXPLANATION
NORTH
-60
-30 470 970 1,470 1,970 2,470 2,970 3,470
MAP DISTANCE, IN METERS
EXPLANATION
Figure 22. Kriging results for ground-water-quality examples A, kriging estimates back-transformed; B, kriging
estimates in log space; C, kriging standard deviations in log space; and D, 95-percent confidence level for kriging
estimates back-transformed.
58 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
NORTH
1.29 1.36 1.43 1.50 1.57 1.64 1.71 1.78 1.85 1.92 2.00
NORTH
-60
-30 470 970 1,470 1,970 2,470 2,970 3,470
MAP DISTANCE, IN METERS
EXPLANATION
155 1,512 2,869 4,226 5,582 6,939 8,296 9,653 11,010 12,367 13,724
Figure 22. Kriging results for ground-water-quality examples A, kriging estimates back-transformed; B, kriging
estimates in log space; C, kriging standard deviations in log space; and D, 95-percent confidence level for kriging
estimates back-transformed Continued.
60 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
^ 0.75
+ Lag with greater than or equal to 30 pairs Spherical fit
0.70
x Lag with less than 30 pairs
0.65 - Spherical model-fitting parameters
Nugget = 0.05 squared indicator unit
0.60 Sill = 0.25 squared indicator unit
Range = 610 meters
0.55
CO
2 0.50
I a45
O 0.40
O
^ 0.35
Z
^- 0.30
0.25
a 0.20
0.15
0.10
0.05
0.00
300 600 900 1,200 1.500 1.800 2.100
LAG, IN METERS
U./J
K
0.25
X +
O :
0.20 :
0.15
\
0.10 i
0.05
0.00 ^
0 300 600 900 1.200 1.500 1.800 2.100
LAG, IN METERS
Figure 23. Directional variogram plots for indicator kriging ground-water-quality example A, theoretical
major-direction variogram and B, theoretical minor-direction variogram.
Initially, many investigators have a tendency to and, consequently, is more difficult to fit compared
focus on the amount of measured data that is available to the other standard variogram models, with 8 to
as an initial consideration; however, the applicability 10 optimally located sample variogram points (enough
of kriging techniques cannot be based simply on the points to define the nugget, two areas of curvature, and
amount of measured data. Unless the investigator the sill). In this ideal case, about 25 measured values
is presented with a reliable variogram, the amount would be needed to fulfill the conservative minimum
and spatial distribution of measured data can be of 30 pairs per lag. In this case, the relatively few
a constraint. If, for instance, there are fewer than measured data points need to be systematically located
25 measured values at optimal locations from the field, so that the optimally located variogram points can
there may not be enough data to confidently estimate be computed. If the measured data were not located
Gaussian variogram parameters; however, a small systematically, as is usually the case, then more
amount of measured data may be suitable for other measured data would be needed.
variogram models. Once sample variogram points meeting the
How much data are needed to apply kriging required number of pairs can be defined, the resultant
techniques is not easy to determine, but information variogram still must have structure. The variogram, for
in this report, especially in section 5.0, and the litera- instance, may simply exhibit noise about a horizontal
ture cited can provide some guidance. Section 5.3 line, a case that has no structure. If measured data are
points out that a good minimum for the number of clustered and the number of lags has been minimized
pairs of locations in each variogram lag is 30, and to meet the required number of pairs of locations, the
the American Society for Testing and Materials variogram may seem horizontal because it is domi-
(1996) has suggested that 20 may work well also. nated by small-scale effects in the clustered data.
Most investigators would probably feel comfortable The investigator then has latitude to adjust the lags
defining a Gaussian form, which has more inflection and attempt to balance the lag spacing and required
62 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
number of pairs per lag interval, as described in The construction of the variogram needs to
section 5.3. However, the variogram also could seem be described and included as part of the kriging
horizontal because the actual sill is reached within a application documentation. The description needs
very small lag. If that lag is smaller than the minimum to address the number of pairs of locations in each
spacing of measured data, obtaining structure in the variogram lag and to demonstrate that the variogram
variogram would not be possible. In such cases, the has structure. A plot of the variogram is helpful to
measured data need to be considered independent, demonstrate the presence or absence of structure.
and kriging techniques, at the lag of the measured The variogram construction discussion also needs
data, would be ineffective, or at least, offer little to establish the presence of or lack of isotropy. If
advantage over other interpolation techniques. anisotropy is present, its nature needs to be estab-
lished, and it needs to be addressed by variogram
adjustments similar to the adjustments presented
7.2 Important Elements of
in section 5.4.2.
Kriging Applications
The variogram cross-validation statistics
Many important elements of kriging applica- described in section 5.8 are useful and, if available,
tions have been discussed in this report. These they can aid in the evaluation of a kriging application;
discussions have been presented as a systematic and authoritative and definitive kriging applications
sequential method designed to provide guidance in should include cross validation. Often, the most
kriging applications. Occasionally, an investigator is useful variograms have cross-validation statistics that
presented with the results of a previous kriging appli- conform to the guidelines discussed in section 5.8.
cation and needs to evaluate the application before Section 5.8.1 indicates that the cross-validation exer-
deciding whether or not to use the results. This section cise needs to balance minimizing the kriging cross-
presents a brief review of some important elements validation errors with efforts to guard against bias.
of kriging applications that can be used in that Also, as discussed in section 5.8.1, if probabilistic
evaluation. For a more detailed discussion of impor- statements are part of the kriging application, there
tant elements of geostatistical applications, the reader needs to be some investigation of the normality of
is referred to the American Society for Testing the reduced kriging error, such as the cross-validation
and Materials (1994) for content of geostatistical probability plots included with the examples in
investigations. section 6.0.
The presence of or lack of stationarity in the Maps of the kriging estimates and standard
spatial mean needs to be demonstrated definitively. deviations need to be presented or discussed.
If the spatial mean is nonstationary, then drift is The maps of kriging estimates need to conform to
indicated and appropriate measures to address the any qualitative information about the information
nonstationarity, which are similar to the measures portrayed on the maps that is available to the investi-
presented in section 5.2, need to be part of the
gator. The maps of kriged standard deviations can
application. In ideal situations, nonstationarity occurs
be used to delineate areas of large uncertainty in the
as a gradual change. HTRW-site investigations may
kriging estimates.
present cases, especially when using with ground-
water-quality data in and around plumes, that have Finally, the variogram and kriging algorithms
abrupt step-like changes at plume boundaries and do are intended for interpolation rather than extrapolation
not appear as regional drift. In these cases, the investi- tools. Once the application extends to areas beyond the
gator needs to be aware that, without knowledge of the geographic extremes of the measured data, or perhaps
plume boundaries, points from within the plume will those extremes plus the range, there needs to be some
be grouped with points from outside the plume in qualification of the area of extrapolation. For instance,
computing the sample variogram. The effect of this in universal kriging, the practitioner would need to
problem is minimized as long as the investigator can have some assurance that the conditions of drift
define lags that allow data points within the plume to defined in the study area continue into the area of
be grouped together. extrapolation.
64 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
In most of the following techniques, the 8.2 Simple Moving Average
predictor of the process at location XQ is a linear
combination of the measurements at locations x,-, Let hi0 be the distance of XQ from */, let h^ be
i=l, 2,..., n. Using Z(XQ) to denote an arbitrary the ordered (from smallest to largest) distances, and fix
predictor [this notation distinguishes the predictors 1 < k < n. Then the weights w, are (Cressie, 1991)
to be discussed in this section from the kriging
predictor, which is denoted by Z(XQJ], the definition
of Z (XQ) is -,
(8-3)
hiO> h [kO]
(8-1)
i= 1 Thus, this predictor is the average of the measure-
ments at the k nearest locations from XQ.
Although this form is the same form that is taken by If k is equal to n, this predictor is identical to the
the kriging predictor, the difference is in the way the simple average, with weights as given in equation 8-2.
coefficients w,- are computed. A choice of k smaller than n assumes that the predictor
needs to incorporate more of the local fluctuation
measured in the data, or, equivalently, that measure-
8.1 Global Measure of Central ments at locations near *o need be more informative
than measurements at other locations in predicting
Tendency (Simple Averaging) z(*o); the smaller k is, the more variable the predictor.
If k = 1, the predictor is an exact interpolator and is
The predictor for the process at any location XQ
constant on the Voronoi polygons (see section 8.4)
is the simple average of the measurements; that is, the induced by the measurement locations.
weights Wj are all equal and are given by (Cressie, There are several variations of this predictor. In
1991) one such variation, a distance r may be fixed (rather
than fixing k) and averages over locations that are
within distance r of XQ may be obtained. Additionally,
w,- = -. (8-2) a moving median may be used rather than a moving
n
average. Sorting and testing distances can slow
computations compared to obtaining the simple
This predictor represents the smoothest possible average, and use of medians rather than means results
predictor surface. In using this predictor, a certain in a predictor that is more resistant to outliers.
degree of spatial homogeneity is assumed. No attempt
is made to incorporate any detectable patterns (or
trends) in the mean or variance of the data as a func- 8.3 Inverse-Distance Squared
tion of location, and the fact that measurements made Weighted Average
at points that are close together may be related is dis-
regarded. Such a predictor has the advantage of being The weights w, are (Journel and Huijbregts,
very simple to compute; it needs no estimation of a 1978)
variogram or other model parameters. The disadvan-
tage is that representing the spatial field by a single
value ignores much of the relevant and interesting h iO
n 2
structure that may be very helpful in improving W
' =
n (8-4)
predictions. As discussed in section 3.3, if applied in
a stochastic setting, this predictor would be optimal
(best linear unbiased) if there is no drift and if resid-
uals are uncorrelated and have a common variance. where again hfQ is the distance of XQ from Xj.
8.4 Triangulation
The weight assigned to a point is proportional to the
To compute this predictor, the region R is parti- area of the triangle opposite the point.
tioned into what are referred to as Voronoi polygons Computation of this predictor is slower than
Vj, V2» > ^n, with Vj being the set of locations closer computation of the predictors in sections 8.1, 8.2, and
to measurement location */ than to any other measure- 8.3. The predictor is an exact interpolator, and the
ment location. If any two polygons, V^- and Vp share a surface produced is continuous but not differentiable
common boundary, *,- and X: are joined with a straight at the edges of the triangulation.
line. The collection of all such lines defines what is
known as the Delauney triangulation. One such
triangle contains the prediction location XQ', the
vertices of this triangle, which are measurement loca-
tions, are labeled x_j, x^, and x^. The spatial prediction
at XQ is the planar interpolant through the coordinates
[*p z(x_j)], [xfr zQcfc)], and [x^, z(x$\. By joining XQ and
x_j, x_fc, and x±, three subtriangles are formed. The
weights w,- are (Cressie, 1991)
W; = -, i = j, k, or 1
. (8-5)
o, otherwise
where
l
A; is the area of the subtriangle opposite
vertex x;.
These definitions are shown in figure 25. In this EXPLANATION
figure, the dashed lines depict the Voronoi polygons Delauney Triangulation
associated with points x_i, x_2, ..., x^, and the solid lines Voronoi Polygons
define the Delauney triangulation. Vertices of the -***** Subtriangles
triangle containing the prediction point XQ are x±, x$,
and X£, and dotted lines represent the subtriangles Figure 25. Diagram showing Voronoi polygons.
66 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
8.5 Splines Under some conditions, a solution to the optimi-
zation problem (eq. 8-7) also may be obtained by a
In spline modeling, the measurements are inter- kriging algorithm, if the smoothing parameter T| is
polated using combinations of certain so-called basis equal to the variance of the measurement error and if
functions. These basis functions are usually piecewise a special form is chosen for the covariance function. In
polynomials of a certain degree that are determined by this situation, spline approximation is a special type of
the user; let this degree be k. The coefficients of these kriging. However, the variogram that needs to be used
polynomials are chosen so that the function values and in the kriging equations to make the kriging predictor
the first k-\ derivatives agree at the locations where equal to the spline predictor is determined by the basis
they join. The larger k is, the smoother the prediction functions selected for the spline. Because the basis
surface is. Spline techniques are often applied in a functions selected are subjective on the part of the
nonstochastic framework; in such, they represent a user, the resulting equivalent variogram may not
way of fitting a surface that has certain smoothness be representative of the true variogram of the data.
properties to measurements at a set of locations with Because kriging uses the data to indicate reasonable
variogram choices, kriging has an important advantage
no explicit consideration of statistical optimality.
over splines. Another advantage of using the kriging
There is, however, a considerable body of work
framework is the interpretation of the smoothing
in which this technique is applied in a stochastic
parameter in terms of measurement errors. Many
setting. Splines may be used, for example, in non-
times, an objective estimate of the magnitude of the
parametric regression estimation problems (Wegman measurement error can be obtained. The connections
and Wright, 1983). between kriging and splines are discussed further by
A typical approach to formulating a spline Wegman and Wright (1983), Watson (1984), and
problem is to pose the problem as an optimization Cressie (1991).
problem. In one special formulation, the first two
derivatives of the prediction surface are assumed to
exist, which imposes a certain degree of smoothness, 8.6 Trend-Surface Analysis
and the spline function is assumed to minimize
Trend-surface analysis is the process of fitting
a function, such as that in equation 3-43 to the data,
using least squares to determine the coefficients that
(8-7) yield the best fit. Computationally, trend-surface
i=l analysis is equivalent to universal kriging, with an
assumption that the Z*(^,)'s in equation 3-16 are
where uncorrelated. Thus, there is no need to estimate a vari-
ogram, and readily available regression packages may
Q is a term that depends on the first two deriva- be used for estimating the coefficients. As in universal
tives of the predictor surface. kriging, polynomial surfaces are the most commonly
The parameter r| is a nonnegative number that needs used. When trend surfaces are applied in a stochastic
to be specified by the user; the value of this parameter setting, the resulting predictor is optimal if deviations
indicates the trade-off between goodness of fit to the from the surface are uncorrelated and have a common
data, measured by the first term, and smoothness, as variance.
measured by Q. If r| is 0, the spline is an exact interpo-
lator and passes through all the data points. If rj > 0,
8.7 Simulation
the spline is not an exact interpolator. (Splines that
are not exact interpolators are referred to as smoothing In this section, a regionalized random variable
splines.) There are a number of numerical procedures Z(x), where x is a location in a two-dimensional study
that may be used for fitting splines, but allowing the region /?, is considered. Kriging is an interpolation
smoothing parameter rj to be greater than 0 renders the algorithm that yields spatial predictions ZQc) that
computational problem more complex. are optimal, as has been discussed in this report. The
68 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations
at a single location, but at all of the grid locations 9.0 SUMMARY
jointly, can be obtained. These realizations then
can be used to calculate probabilities associated The geostatistical technique known as kriging
with any number of spatial locations together. For can be used to determine optimal weighting of
measurements at sampled locations for obtaining
example, the probability that the largest (maximum)
predictions, or kriging estimates, at unsampled loca-
contaminant value over a certain subregion is greater
tions. Kriging also provides information concerning
than a particular concentration might be assessed.
the uncertainty associated with kriging estimates.
(If the word "largest" here were replaced with The uncertainty information available from kriging,
"average," then block kriging could be used to as well as the optimal weighting, distinguishes kriging
obtain the answer.) from other techniques used for spatial modeling.
A central point that needs to be emphasized is The theory of regionalized random variables
that simulation is especially useful when probabilities is the basis for different forms of kriging. Ordinary
associated with complicated, usually nonlinear, func- kriging is used when the spatial mean is considered
tions of the regionalized variables for a region need constant. Universal kriging is an extension of ordinary
to be analyzed. The maximum function mentioned kriging that can be used to address a nonconstant
in the preceding paragraph is one simple example. spatial mean. Block kriging is used to obtain kriging
Another example is the problem of determining place- estimates for a block of area that is larger than the area
ment of ground-water monitoring wells to detect and represented by an individual sample. Indicator kriging
monitor ground-water contamination emanating from implements the kriging equations nonparametrically.
a potential point source. Given an existing set of A fundamental step in kriging applications is
hydraulic-head data, kriging might be applied and development of a variogram. The variogram is usually
flow paths determined from resulting hydraulic-head developed from the results of measurements at many
gradients. Intersection of the flow path from the point locations within the application area. The variogram
source with the regional boundary then might be used describes spatial correlation within the application
to determine monitoring-well placement. Conditional area and provides basic information required to deter-
simulation would be useful to determine the uncer- mine optimal weights for measurements to be used in
tainty associated with well placement or to give an making kriging estimates. Information from the exer-
indication of how many monitoring wells might be cise of variogram development can be used to cross
validate the variogram and the cross-validation statis-
appropriate. In this example, the variable of interest,
tics can, in turn, fine tune variogram development.
well location, is a complicated function of hydraulic
Example applications of kriging illustrate basic
heads so this is a problem for which simulation is
techniques and some constraints that apply to kriging.
well suited. The reader may refer to Easley and others
These applications also illustrate how different types
(1991) for a more detailed discussion of this applica-
of kriging, such as ordinary, universal, block, and indi-
tion of kriging.
cator, can be used.
The complicated functions of interest in ground-
Other spatial modeling techniques include
water studies often involve physically based ground- nonstochastic techniques such as simple averaging,
water flow models. Conditional simulation may be inverse-distance squared weighted averaging, triangu-
used, for example, to generate a suite of hydraulic- lation, splines, and trend-surface analysis. These
conductivity realizations to be used as input to a nonstochastic techniques can be simpler to apply
model that produces as output a set of corresponding than kriging and may be appropriate to use for
hydraulic-head realizations. Weber and others (1991) some problems, especially when it is not necessary
discussed how ground-water modeling might be used to evaluate results with respect to statistical criteria.
with conditional simulation to study the monitoring- Another extension to kriging, simulation, is intended
well-placement problem discussed in the preceding to preserve overall variability and to compensate for
paragraph. the tendency of kriging to smooth results.
9.0 SUMMARY 69
10.0 REFERENCES Journel, A.G., 1988, Non-parametric geostatistics for risk
and additional sampling assessment, in Kieth, L., ed.,
American Society for Testing and Materials, 1994, Standard Principles of environmental sampling: Washington,
guide for content of geostatistical site investigations, D.C., American Chemical Society, p. 45-72.
D 5549-94: Philadelphia, Pa., American Society for Journel, A.G., 1993, Geostatistics for the environmental
Testing and Materials Committee D-18 on Soil and sciences: Las Vegas, Nev., U.S. Environmental
Rock, 5 p. Protection Agency, 135 p.
1996, Standard guide for analysis of spatial varia- Journel, A.G., and Huijbregts, C, 1978, Mining geostatis-
tion in geostatistical site investigations D 5922-96: tics: London, Academic Press, 600 p.
Philadelphia, Pa., American Society for Testing and Krige, D.G., and Magri, E.J., 1982, Studies of the effects
Materials Committee D-18 on Soil and Rock. of outliers and data transformation on variogram esti-
Bras, R.L., and Rodriguez-Iturbe, Ignacio, 1985, Random mates for a base metal and a gold ore body: Mathemat-
functions and hydrology: Reading, Mass., Addison- ical Geology, v. 14, no. 6, p. 557-564.
Wesley Publishing Company, 559 p.
Lenfest, L.W, Jr., 1986, Ground-water levels and use of
Clark, Isobel, 1979, Practical geostatistics: London,
water for irrigation in the Saratoga Valley, south-
Applied Science Publishers, 129 p.
central Wyoming, 1980-81: U.S. Geological Survey
Cliff, A.D., and Ord, J.K., 1981, Spatial processes, models Water-Resources Investigations Report 84-4040,
and applications: London, Pion Limited, 266 p.
23 p.
Cressie, Noel, 1991, Statistics for spatial data: New York,
Myers, D.E., Begovich, C.L., Butz, T.R., and Kane, V.E.,
Wiley, 900 p.
1980, Application of kriging to the hydrogeochemical
David, Michel, 1977, Geostatistical ore reserve estimation
data from the National Uranium Resource Evaluation
in the collection Developments in geomathematics 2:
Program: Oak Ridge, Tenn., Union Carbide Corpora-
Amsterdam, Elsevier, 364 p.
tion, Nuclear Division Oak Ridge Gaseous Diffusion
Davis, J.C., 1973, Statistics and data analysis in geology: Plant, Report GJBX-57-81, 124 p. [Available
New York, Wiley, 646 p.
from U.S. Department of Energy, Grand Junction,
Delhomme, J.P., 1978, Kriging in the hydrosciences:
Colorado.]
Advances in Water Resources, v. 1, no. 5,
Olea, R.A., 1991, Geostatistical glossary and multilingual
p. 251-266.
dictionary, in the collection Wiley series in probability
Deutsch, C.V., and Journel, A.G., 1992, GSLIB, geostatis-
and mathematical statistics: New York, Oxford Univer-
tical software library and user's guide: New York,
sity Press, 177 p.
Oxford University Press, 340 p.
Ripley, B.D., 1981, Spatial statistics: New York, Wiley,
Devore, J.L., 1987, Probability and statistics for engineering
and the sciences: Monterey, Calif., Cole Publishing 252 p.
Company, 672 p. Ross, S.M., 1987, Introduction to probability and statistics
Easley, D.H., Borgman, L.E., and Weber, D., 1991, Moni- for engineers and scientists: New York, Wiley, 492 p.
toring well placement using conditional simulation of U.S. Army Corps of Engineers, 1995, Technical project
hydraulic head: Mathematical Geology, v. 23, no. 8, planning Guidance for hazardous, toxic, and
p. 1059-1080. radioactive waste data quality design: Washington,
Englund, E.J., and Sparks, A.R., 1991, GEO-EAS D.C., U.S. Army Corps of Engineers Engineer
(Geostatistical Environmental Assessment Software) Manual 200-1-2.
users guide: Las Vegas, Nev., Environmental Moni- ___1997, Practical aspects of applying geostatistics
toring Systems Laboratory Report EPA/600/8-91/008. at hazardous, toxic, and radioactive waste sites:
[Available from National Technical Information Washington, D.C., U.S. Army Corps of Engineers,
Service, Springfield, VA 22161 as NTIS Engineer Technical Letter ETL 1110-1-175.
Report PB89-151252.] Watson, G.S., 1984, Smoothing and interpolation by kriging
Grundy, W.D., and Miesch, A.T, 1987, Brief description of and with splines: Mathematical Geology, v. 16, no. 6,
STATPAC and related statistical programs for the IBM p. 601-615.
personal computer A, Documentation: U.S. Geolog- Weber, D., Easley, D.H., and Englund, E.J., 1991, Proba-
ical Survey Open-File Report 87-411-A, 34 p. bility of plume interception using conditional simula-
Hawkins, D.M., 1980, Identification of outliers: London, tion of hydraulic head and inverse modeling:
Chapman and Hall, 188 p. Mathematical Geology, v. 23, no. 2, p. 219-239.
Isaaks, E.H., and Srivastava, R.M., 1989, An introduction to Wegman, E.J., and Wright, I.W, 1983, Splines in statistics:
applied geostatistics: New York, Oxford University American Statistical Association Journal, v. 78,
Press, 561 p. no. 382, p. 351-365.
70 Overview and Technical and Practical Aspects for Use of Geostatistics in Hazardous-, Toxic-,
and Radioactive-Waste-Site Investigations