L9 Digital Image Classification
L9 Digital Image Classification
N.S. Maphosa
Department of Geomatics and Survey
Introduction
Feature space
In one pixel two bands can be regarded as
components of a 2D vector, the feature vector
An example of a feature vector is (13,15) which
tells that 13 DN and 15 DN are stored in
band 1 and band 2 respectively
This vector can be plotted on a 2D.
Such graphs are called scatter plots
Feature space
Chivero Landsat TM band 5:4:3
Chivero Landsat TM band
4 and 3 Scatter
Soil Line
Band 4 and 3 Scatter: Water
Band 4 and 3 Scatter: Woodland
NIR
RED
Feature space
Feature space
Scatter plot gives information about the
distribution of corresponding pixel values in
two bands of an image
Each cluster of a feature vector occupies its own
area in the feature space
Basic assumption of image classification is that one
cluster can be separated from the other
A specific part of a feature space corresponds
to a specific class
The principle of image classification is that the pixel is
assigned a class according to its feature vector by
comparing it to predefined clusters in the feature
space
Doing so for all image pixels results in a classified
image
The principle of image
classification
Classes to be distinguished in an image need to
have different spectral characteristics e.g.,
comparing reflectance curves
Once classes have been defined in the feature space,
each image pixel can be compared to these classes &
assigned to the corresponding class
Definition of clusters is an interactive process and is
carried out during a training process
Comparison of individual pixels with the clusters takes
place using classifier algorithms
Supervised Classification
Supervised classification
algorithms
Trainin es plot
e using
Supervised classification
algorithms
Box classifier or parallelepiped requires only the lower
and upper limit pixel values
It is a quick and cheap method to classify
Problems when parallelepipeds overlap (due
to high covariance)
To compensate, rectangles with stepped
borders might be used
Parallelepiped
Box classifier or
parallelepiped
requires only the
lower and upper
limit pixel values
It is a quick and
cheap method to
classify
DISADVANTAGE
🞑 overlapping
boxes
🞑 poorly
adapted to
cluster shape
BOX CLASSIFICATION
255
Band 2
B
a
n
d
2
5
5
Minimum distance to
means 45
40
30
pixels
Band 4
of training sets in n-
25
20
dimensional space
15
10
5
Band 3
classified according to
the
class mean to which
they
are closest
CHARACTERISTICS
emphasis on the
location of cluster
centre
class
labelling by
considering
minimum distance to
the cluster centres
DISADVANTAGE
avoids the
presence of
variability within a
class
shape and size of
the clusters are not
important
Supervised classification algorithms
Unsupervised Classification
The classes that result from unsupervised
classification are spectral classes as they are
based solely on the natural groupings in
the image values
The identity of the spectral classes will not
be initially known
The analyst must compare the classified
data with some form of reference data
(such as larger scale imagery or maps) to
determine the identity and informational
value of the spectral classes
Results of
Unsupervised
Classification
Problems in image
classification
Spectral classes-different type of grasses
each with a different spectral signature
Same type of grass may have different
spectral characteristics over large areas due
to different soils, climate conditions
Mixel: one pixel assigned to one class,
what if that pixel has two or more
cover types?
Assign to several classes?
Validation: Accuracy Assessment
Accuracy Assessment
Quality assessment is done by comparing the
classified image to true world class
One way to do this is by creating an error
Error Matrix
Overall accuracy/proportion correctly
classified: sum diagonal (correctly
classified)/total number of pixels checked.
This is the Observed
The expected value for each cell is calculated
as (row total/grand total * column
total/grand total)
Error Matrix
From the confusion matrix, it can be seen
that at least two methods can be used to
determine individual category accuracies
The ratio between the number of
correctly classified and the row total
🞑 the user's accuracy - because users are
concerned about what percentage of the classes
has been correctly classified
The ratio between the number of
correctly classified and the column total
🞑 is called the producer's accuracy
Kappa Statistic
k OE
1E
Kappa Interpretation