GIS Interpolation
GIS Interpolation
Interpolation
presented by:
Tim Haithcoat
University of Missouri
Columbia
4
Point Interpolation/Areal Interpolation
Point based:
Given a number of points whose locations and values are
known, determine the value of other points at predetermined
locations 10 9
9.5
8 7
Point interpolation is used for data which can be collected at
point locations (e.g. weather station readings, spot heights, oil well
readings, porosity measurements)
Interpolated grid points are often used as the data input to
computer contouring algorithms
Once the grid of points has been determined, isolines (e.g., contours)
can be threaded between them using a linear interpolation on the
straight line between each pair of grid points
Point to point interpolation is the most frequently performed
type of spatial interpolation done in GIS
5
Point Interpolation/Areal Interpolation
(continued)
Lines to points
E.g. contours to elevation lines
Areal Interpolation
Given a set of data mapped on one set of source zones
determine the values of the data for a different set of target
zones
Example: given population counts for census tracts, estimate
populations for electoral districts
6
Global/Local Interpolators
Global Interpolations determine a single function
which is mapped across the whole region
A change in one input value affects the entire map
Local Interpolators apply an algorithm repeatedly
to a small portion of the total set of points
A change in an input value only affects the result within
the window
Global algorithms tend to produce smoother surfaces
with less abrupt changes
Used when theres an hypothesis about the form of the
surface (ex: a trend)
7
Global/Local Interpolators
(continued)
8
Exact/Approximate Interpolators
9
Exact/Approximate Interpolators
(Continued)
10
Stochastic/Deterministic Interpolators
13
PROXIMAL (1 of 2)
All values are assumed to be equal to the nearest
known point
Is a local interpolators
Computing load is relatively light
Output data structure is Thiessen polygons with
abrupt changes at boundaries
Has ecological applications such
as territories and influence zones
14
PROXIMAL (2 of 2)
15
B-SPLINES (1 of 2)
Uses a piecewise polynomial to provide a series
of patches resulting in a surface that has
continuous first and second derivatives
Ensures continuity in:
Elevation (zero-order continuity)
surface has no cliffs
Slope (first-order continuity)
slopes do not change abruptly, there are no kinks in contours
Curvature (second order continuity)
minimum curvature is achieved
Produces a continuous surface with minimum
curvature
16
B-SPLINES (2 of 2)
Output data structure is points on a raster
Note: maxima & minima do NOT necessarily occur at
the data points
Is a local interpolator
Can be exact or used to smooth surfaces
Computing load is moderate
Best for very smooth surfaces
Poor for surfaces which show marked fluctuations, this
can cause wild oscillations in the spline
Are popular in general surface interpolation
packages, but not common in GISs
Can be approximated by smoothing contours
drawn through a TIN model
17
KRIGING
Developed by George Matheron as the theory
of regionalized variables and D.G. Krige as an
optimal method of interpolation for use in the
mining industry
20
KRIGING
Deriving the Variogram (1 of 2)
The input data for Kriging is usually an irregularly
spaced sample of points
To compute a variogram we need to determine how
variance increases with distance
Begin by dividing the range of distance into a set of
discrete intervals
Example: 10 intervals between distance of 0 and the
maximum distance in the study area
For every pair of points, compute distance and the
squared difference in z values
21
KRIGING
Deriving the Variogram (2 of 2)
Assign each pair to one of the distance ranges, and
accumulate total variance in each range
After every pair has been used (or a sample of pairs
in a large dataset) compute the average variance in
each distance range
Plot this value at the midpoint distance of each
range
22
KRIGING
Computing the Estimates (1 of 2)
23
KRIGING
Computing the Estimates (2 of 2)
24
MANUAL or EYEBALLING (1 of 2)
27
Trend Surface Analysis (1 of 3)
30
Fourier Series
Approximates the surface by overlaying a series
of sine and cosine waves
A global interpolator
Computing load is moderate
Output data structure is the Fourier series which
can be used to estimate grid values
Best for data sets which exhibit
marked periodicity, such as ocean waves
Rarely incorporated in computing packages
31
Moving Average/Distance
Weighted Average (1 of 2)
Estimates are averages of the values at n known
points:
Z = wizi/ wi
Where w is some function of distance, such as:
W = 1/dk w = e-kd
An almost infinite variety of algorithms may be
used, variations include:
The nature of the distance function
Varying the number of points used
The direction from which they are selected
Is the most widely used method
32
Moving Average/Distance
Weighted Average (2 of 2)
Objections to this method arise from the fact
that the range of interpolated values is limited
by the range of the data
No interpolated value will be outside the observed
range of z values X X interpolated
X
X actual
Other problems include:
How many points should be included in the
averaging
What to do about irregularly spaced points
How to deal with edge effects
33
A further examination of
Spatial Interpolation
Looking at areal interpolation techniques and
some applications
Areal interpolation : problem of transferring
data from one set of areas (source reporting
zones) to another (target reporting zones)
This is easy if the target set is an aggregation of the
source set, but more difficult if the boundaries of the
target set are independent of the source set
Also, applications that do not fall easily into
either point or areal interpolation categories
34
Areal Interpolation
Non-Volume Preserving (Point Based)
Example: interpolating population counts from
census tracts to school districts
x x
x x x
x
x x x x
x x x x x
x
36
Non-Volume Preserving
CRITICISMS of PROCEDURE
Choosing the center point is ill defined
Inadequacy of point based interpolation
methods
Most importantly, the total value of each zone
is not conserved
Example: if a source zone is divided into two
target zones, the total population of the target
zones after interpolation need not equal the
population of the source zone
37
Areal Interpolation
Volume-Preserving: Overlay
Procedure
Overlay of target and source zones
Determining the proportion of
each source zone that is assigned
to each target zone
Apportioning the total value of the
attribute for each source zone to
target zones according to the area
proportions
Assumes uniform density of the attribute within
each zone
e.g. uniform population density if the attribute is total
zone population
38
Areal Interpolation
Volume-Preserving: Pycnophylactic (1 of 2)
This technique has two objectives:
Create a smooth surface, no steps
Attribute values should not change suddenly at zone
boundaries
The total value of the attribute within each zone
must be correct
Does not require an assumption of homogeneity
within zones but rapid variation within zones
may affect the quality of interpolation
Output is a contour or continuously shaded map
39
Areal Interpolation
Volume-Preserving: Pycnophylactic (2 of 2)
Procedure
Overlay a dense raster on a choropleth map
Divide each zones total value equally among the
raster cells that overlap the zone
Smooth the values by replacing each cells value with
the average of its neighbors
Sum the values of the cells in each zone
Adjust the values of all cells within each zone
proportionally so that the zones total is the same as
the original total
Example: if the total is 10% low, increase the value of
each cell by 10%
Repeat steps 3,4, and 5 until no more changes occur
40
Areal Interpolation
Volume-Preserving: Boundary Conditions
At the boundary of the reporting zones,
pixels will have neighbors outside the study
area and therefore without values
Some decision must be made about the behavior
of the surface outside the study area
Examples:
Population density equals zero
(a lake or rural area)
population density unknown,
assumed equal to the values of
the outermost pixels of the
study area
41
Special Cases of Spatial Interpolation
Mapping Populated Areas (1 of 4)
Simple Approach:
estimate the populated area using an empirical
relationship like:
A is proportional to p0.84
And draw a circle around the point of radius:
(a/)
Alternative Approach:
Establish a critical population density for defining
an urban area
Spread the population over each urban area so that
population density is highest in the center and
decreases gradually outwards
e.g. use a normal distribution function
44
Special Cases of Spatial Interpolation
Mapping Populated Areas (4 of 4)
47
Special Cases of Spatial Interpolation
Estimating Trade Areas (3 of 3)
OR
Divide the area into sectors, average the distance to
customers within the sectors and draw a distance arc
for each sector
X X
53
Expert Systems for Spatial
Interpolation Algorithms (2 of 2)
54
CONCLUSION: If computer contouring and surface generation
techniques are to be incorporated successfully in GIS, they must be
easy to use and be effective.
easy to use implies that those effective means that these
without a detailed knowledge of the techniques should be
mathematical & statistical informative, highlighting the
characteristics of the procedure essential nature of the data
should be able to choose the correct and/or surface and serving the
technique for displaying a particular purpose of the
data set for a particular purpose researcher/analyst.
Note: Statisticians argue that this is The researchers measure of
NOT an ideal goal as people may use success will be largely
techniques without a proper subjective & visual - does the
understanding of the underlying result look right?
assumptions.
This purpose may vary from an attempt to model all the real intricacies
of the surface to simply trying to highlight the general, spatial trend of the
data in order to aid in the decision-making process.