Automatic Fuzzy Object-Based Analysis of VHSR Images For Urban 2
Automatic Fuzzy Object-Based Analysis of VHSR Images For Urban 2
Imane Sebari
a,
, Dong-Chen He
b
a
Filire de Sciences Gomatiques et Ingnierie Topographique, IAV Hassan II, Rabat, Morocco
b
Centre dapplications et de recherches en tldtection (CARTEL), Universit de Sherbrooke, Sherbrooke, Qubec, Canada
a r t i c l e i n f o
Article history:
Received 15 September 2012
Received in revised form 5 February 2013
Accepted 5 February 2013
Available online 25 March 2013
Keywords:
Automatic object extraction
Object Based Image Analysis (OBIA)
Fuzzy rule base
VHSR satellite images
Urban areas
a b s t r a c t
We present an automatic approach for object extraction from very high spatial resolution (VHSR) satellite
images based on Object-Based Image Analysis (OBIA). The proposed solution requires no input data other
than the studied image. Not input parameters are required. First, an automatic non-parametric coopera-
tive segmentation technique is applied to create object primitives. A fuzzy rule base is developed based
on the human knowledge used for image interpretation. The rules integrate spectral, textural, geometric
and contextual object proprieties. The classes of interest are: tree, lawn, bare soil and water for natural
classes; building, road, parking lot for man made classes. The fuzzy logic is integrated in our approach in
order to manage the complexity of the studied subject, to reason with imprecise knowledge and to give
information on the precision and certainty of the extracted objects. The proposed approach was applied
to extracts of Ikonos images of Sherbrooke city (Canada). An overall total extraction accuracy of 80% was
observed. The correctness rates obtained for building, road and parking lot classes are of 81%, 75% and
60%, respectively.
2013 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS) Published by Elsevier
B.V. All rights reserved.
1. Introduction
Mapping impervious surface from remote sensing imagery is
signicant to a range of issues especially to sustainable develop-
ment of urban areas (Demarchi et al., 2012; Gao et al., 2012;
Longley et al., 2005; Holland et al., 2006; Donnay et al., 2001). Very
high spatial resolution (VHSR) images and advanced image pro-
cessing algorithms both driven the technologic advance in remote
sensing of impervious surfaces (Xu, 2013). One of the emerging
trends in this eld is OBIA, object-based image analysis (Weng,
2012). OBIA is considered as a powerful tool for classication and
analysis of VHSR images compared to the traditional per-pixel
classiers (Blaschke, 2010; Navulur, 2007; Blaschke et al., 2000).
The advantage of OBIA is that it does not use individual pixels
but adjacent pixel groups that can be characterized by spectral,
textural, geometric and contextual information. Taking this infor-
mation into account through the object-based approach allow
obtaining enhanced results (Campbell, 2007).
Object based image analysis has been dened as a new disci-
pline at the rst international conference on Object-Based Image
Analysis:
Object-Based Image Analysis (OBIA) is a sub-discipline of
GIScience devoted to partitioning remote sensing (RS) imagery into
meaningful image-objects, and assessing their characteristics
through spatial, spectral and temporal scale. (Hay and Castilla,
2006).
The OBIA, also called object oriented image analysis, aims to
replicate and/or to surpass the human interpretation of images
automatically or semi-automatically (Hay and Castilla, 2006).
Two main stages can form the OBIA process: (1) creation of image
objects and (2) classication of image objects. Usually, the rst
step is performed through a segmentation technique (Lang and
Blaschke, 2006; Jensen, 2005). This step is a crucial since it pro-
vides the basic units (image objects) on which later process will
be applied. Therefore, the success of OBIA approach is related to
segmentation quality. The second stage, classication, tries to cre-
ate real objects from image objects. The classication method is
chosen with relation to the desired goal, to the studied image, and
also to its ability to integrate ancillary information. Several meth-
ods can be used at the two stages of the OBIA approach. The chosen
algorithms strongly inuence the nal results (Lu and Weng, 2007;
Caloz and Pointet, 2003).
The rst known reference that used the object-based approach
was Kettig and Landgrebe (1976). They proposed a classication
approach of multispectral images by extracting and classifying
homogenous objects. Their approach consists rst in subdividing
the image in spectrally homogenous pixel groups. These groups
are then classied through supervised technique (maximum likeli-
hood). They applied their approach to aerial and satellite images
0924-2716/$ - see front matter 2013 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS) Published by Elsevier B.V. All rights reserved.
http://dx.doi.org/10.1016/j.isprsjprs.2013.02.006
a
2
b
2
_
n
i0
n
i0
1
1 i j
2
h
c
i; j
Geometric
attributes
Elongated shape Elongation index I
elongation
4pA
P
2
A and P are the area and the perimeter of the exterior border
Compact shape Compactness index
Icompactness
4pAHull
P
2
Hull
A
Hull
and P
Hull
are the area and the perimeter of the convex hull polygon
Convex shape Convexity index I
convexity
A
AHull
A is the objects area and A
Hull
is the area of the convex hull polygon
Large size Area A = Area of the object
Contextual
attributes
Neighboring object Adjacency Determines the neighboring objects that share a part of their limits
Direction between two
objects
Azimuth
Az
12
arctg
X2X1
Y2Y1
_ _
(X
1
, Y
1
) coordinates of the centroid of object 1
(X
2
, Y
2
) coordinates of the centroid of object 2
Elevated object Adjacency + Shadow + Azimuth Determines if object is adjacent to shadow zones in the direction of the
sun
Frequencies (f)
f
1
A
2
f
2
Area
(A)
Area Threshold
D
Pi
P
1
P
2
A
1
Fig. 4. Automatic detection of areas threshold.
I. Sebari, D.-C. He / ISPRS Journal of Photogrammetry and Remote Sensing 79 (2013) 171184 175
able and P is the fuzzy subset studied. The subset is dened by a
membership function l
p
that associates to each value x of the attri-
bute a real value l
p
(x) in the interval [0, 1]. This value l
p
(x) trans-
lates the satisfaction degree of the property P by an object. The
higher the membership degree is, the more the property is satis-
ed, and vice versa.
For the determination of the satisfaction degree of the opposite
property, the complement operator is used. For example, if l
c
(x)
is the membership degree for the compact shape property, the
membership degree to the non-compact shape property l
non c
(x)
is equal to:
l
non c
x 1 l
c
x 3
There are many types of membership functions: continuous
(e.g. Gaussian or sigmoid functions) or piecewise linear (e.g. trian-
gular or trapezoidal functions) (Zadeh, 2003). The latest type is a
simple form which contains straight line segments. We have
choose this type of membership functions since it will allows to
represent the knowledge collected during the modeling step. Thus,
the shapes of membership functions are dened with relation to
information related to attributes mathematic formulations and
thresholds values. The shape of the membership function is spe-
cic for each property.
The methodology starts by specifying the intervals for which
the associated attribute values characterize the property, as well
as those for which the property is not validated. Between those
intervals, a transition interval, dened with relation to the attri-
butes threshold, reects a gradual membership and minimizes
the imprecision related the attributes thresholds determination.
Fig. 5 represents the overall shape of the used membership
function.
2.2.2.2. Fuzzy rules on object classes. An object class rule constitutes
a conjunction of propositions on the properties describing this
class. Generally, it follows this form:
IF X
1
is P
1
AND . . . AND X
i
is P
i
AND . . . X
n
is P
n
THEN Y is C
i
4
where X
i
is P
i
is a fuzzy proposition on a property and Y is C
i
the
conclusion translating membership to the class C
i
. The degree of
satisfaction of a rule is obtained by using a fuzzy aggregation oper-
ator which aggregates the membership degrees like a logical AND.
We adopt the following operator:
l
C
i
x
a
i1
l
P
i
x
_ _
1=a
5
where l
C
i
x is the membership degree of object x to class C
i
,
l
P
i
x is the satisfaction degree of object x to property P
i
and a is
the number of fuzzy properties propositions that constitute the
class rule. We have chosen this operator because since the aggrega-
tion of the rules is performed in a conjunctive way, if one of the
rules concludes that the membership degree to the class C
i
is 0,
then no other rule can change this conclusion, and the class C
i
will
be considered impossible. In other terms, if an object does not sat-
isfy an object class property, it will not be affected to this class. Its
satisfaction degree will be null. Also, this operator veries the nor-
malization condition (Eq. (6)); the comparison of the satisfaction
degrees of the various class rules is possible.
sup
x2A
px 1 6
The membership function of each class is represented by a sin-
gleton. The membership degree is a real value in the interval [0, 1].
Fig. 6 presents a simplied example that illustrates the principle of
determination of class membership degree. Class A is described by
two properties elongated shape and non-compact shape. The
fuzzy rule of this class is:
IF object shape is long AND object shape is non-compact THEN
object belongs to class A
The studied object presents values for the attributes associated
to these properties: the elongation index for the elongated shape
propriety and the compactness index for the non-compact shape
propriety. First, the satisfaction degrees of the object to the two
fuzzy proprieties propositions are determined, l
L
(x) and l
NC
(x).
The membership degree of the object to the class A is obtained
by aggregating the two degrees with the fuzzy operator obtained
through the Eq. (3): l
Road
(x) = (l
L
(x). l
NC
(x))
.
The following rules were used:
WATER _RULE = {(RS is water)}
TREE_RULE = {(RS is vegetation) AND (texture is rough)}
LAWN_RULE = {(RS is vegetation) AND (texture is non rough)}
BARE_SOIL_RULE = {(RS is bare soil)}
BUILDING_RULE = {(RS is non-vegetation) AND (RS is non-bare
soil) AND (RS is non-water) AND (elevated object) AND (Shape
is compact) AND (shape is convex)}
ROAD_RULE = {(RS is non-vegetation) AND (RS is non-bare soil)
AND (RS is non-water) AND (Object non-shadow) AND (Object
non-elevated) AND (shape is Elongated) AND (size is large)}
PARKING_LOT _RULE = {(RS is non-vegetation) AND (RS is non-
soil) AND (RS is non-water) AND (Object non-shadow) AND
(Object non-elevated) AND (large size) AND (shape is
compact) AND (shape is convex)}
2.2.2.3. Auto-learning process. Since no knowledge on the real pro-
prieties of objects is used during rules formulation, we integrate an
auto-learning process to automatically retrieve information about
potential objects. The retrieved knowledge is integrated into new
rules in order to extract more objects. This process starts after
the application of the dened fuzzy class rules. It retrieves infor-
mation from potential objects that are dened as objects having re-
sponded to the fuzzy class rules with high membership degrees.
They are considered as representative of their class and conse-
quently the values of their attributes characterize the objects of
this class. The auto-learning process retrieves spectral and geomet-
ric attribute values and integrates them in a library created for this
purpose. The spectral information is used to evaluate the spectral
membership of each object to the class spectral library. The geo-
metric information is integrated in order to evaluate the geometric
conformity with potential objects. Contextual information is used
with the retrieved information to dene new class rules. This pro-
cess is applied for man made classes: building, road and parking lot.
For the road class, the new rule is dened in order to connect the
extracted road network. Thus, the object, in addition to having
membership to the road spectral library, must be adjacent to an al-
ready extracted road object. For the parking lot class, the corre-
0
1
0
Threshold
1
Attribute
A
Fig. 5. General shape of the used membership function.
176 I. Sebari, D.-C. He / ISPRS Journal of Photogrammetry and Remote Sensing 79 (2013) 171184
sponding rule includes membership to the spectral and geometric
parking lots libraries. The new rules for the building class include
membership to the spectral and geometric buildings librarys and
shapes compactness condition. In order to take more advantage
of the collected spectral library, the primitives that do not belong
to the building, road or parking lot classes, and have membership
to the spectral libraries, are extracted as impervious surfaces.
2.2.3. Assignment of objects to classes
After application of the fuzzy rule base, each object has a mem-
bership degree that varies from 0 to 1 to each class.
After applying the rule base, an object has membership degrees
to the all studied classes (varying from 0 to 1). In order to interpret
theses results and to take a decision, we have adopted some no-
tions of the possibility theory. The possibility theory is linked with
the fuzzy sets theory and allows to reason on imprecise or vague
knowledge.
If C is the set of classes, a possibility distribution p is dened as
a function that attributes a possibility coefcient p
x
(C
i
) to each ele-
ment of C. This coefcient represents the possibility that an object
x belongs to a class C
i
. If p
x
(C
i
) = 0 then the object cannot belong to
the class C
i
. If p
x
(C
i
) = 1 then it is possible (but not certain) that the
object belongs to the class C
i
.
The interest of using possibility distribution is the denition of
measures that will estimate the quality of the extraction. We have
used possibility and necessity measures. These measures quantify
the imprecision and the uncertainty of the extraction. A possibility
measure is elaborated by considering, for any part of A, the coef-
cients of the elements of A that compose it (Dubois et al., 2007):
A sup
u2A
p
x
u 7
The value of G(A) corresponds to the element(s) of A having the
greatest possibility degree according to p
x
. It satises the following
max-decomposability characteristic property (Dubois et al., 2007):
PA [ B maxPA; PB 8
The possibility measure of an event estimates to what extent it
is possible. But it is insufcient to inform if an event will be real-
ized. Information about the complement event is useful. The neces-
sity measure can be dened as the impossibility of A
, the
complement of A (not A):
NA 1
A inf
uRA
1 p
x
u 9
Thus, P(A) corresponds to the degree with which it is possible
that A is true, N(A) as the degree with which it is certain that A is
true. The necessity measure is an evaluation of the certainty (Bou-
chon-Meunier and Marsala, 2003). By using these two measures,
the imprecision and the uncertainty of the object extraction pro-
cess is quantied and consequently its quality is estimated.
The following example claries these two measures of possibil-
ity and necessity. If C is the set of classes {Building; Road; Parking
lot}. We suppose that, after applying the fuzzy rule base, an object
x presents the following membership degrees {1/Building; 0.1/
Road; 0.5/prkg_lot}. The object x presents the possibility distribu-
tion illustrated by Fig. 7. The object is assigned to the building class
because it presents the greatest possibility degree. The certainty
degree of this assignment is obtained through the necessity
measure:
NC
Building
1
C
Building
1 max
C
Road
;
C
prkg lot
_ _ _ _
10
NC
Building
1
C
prkg lot
1 0:5 0:5 11
So, the certainty that the object x belong to the building class is
0.5. Thus, the object x is a building with a precision degree of 1 and
a certainty degree of 0.5. It is 50% certain that the object x is 100% a
building.
The conict between two classes regarding an object is dened
by the difference between its membership degrees to two classes.
The smaller is the difference, the higher is the conict. Ideally, a
well-extracted object will present very high precision and cer-
tainty degrees and a low to null conict with the other classes.
The following cases can be found:
- If a unique class C
i
such as p
X
(C
i
) = 1 exists, the class C
i
is the
most certain since N(C
i
) > 0 and N(C
j
) = 0 for all j i.
- If several C
i
are such as p
X
(C
i
) = 1, no decision can be taken
between these classes: the rule base does not have enough
knowledge to discriminate. However, if the classes with possi-
bility degrees equal to 1 correspond to a signicant class subset,
then the certainty of this subset is strictly positive.
- If p
X
(C
i
) = 1 everywhere, then there is a total uncertainty on the
class.
This information on certainty and precision of the object extrac-
tion is given with the object extraction results. Indeed, the pro-
Object x I
Comp
(x)
Road
F
u
z
z
y
a
g
g
r
e
g
a
t
i
o
n
o
p
e
r
a
t
o
r
Elongation
Index
Road
(object x)
NC
class A
L
(x)
NC
(x)
Non-compact shape Elongated shape
Compactness
Index
I
Elong
(x)
Class : Road Property :
Fig. 6. Objects membership degree to a class.
Building Road
Parking lot
1
0
Fig. 7. Possibility distribution.
I. Sebari, D.-C. He / ISPRS Journal of Photogrammetry and Remote Sensing 79 (2013) 171184 177
posed approach allows the organization of the extracted objects in
layers by class. Each object is described by a set of spectral, geo-
metric and contextual attributes and also by precision and cer-
tainty information related to the extraction quality.
2.3. Evaluation of the results
The evaluation of extracted objects was performed by compar-
ison with reference data representing ground truth, according to:
extraction accuracy assessment and geometric quality assessment.
For accuracy assessment, two indices were used rst to appre-
ciate the overall objects extraction with relation to reference data:
completeness and correctness. The completeness index represents
the percentage of the reference objects area that is extracted. The
correctness index is the percentage of the extracted objects area
that is correctly extracted.
Completeness
S
CE
S
R
12
Correctness
S
CE
S
E
13
With S
CE
being the objects area correctly extracted, S
R
the total area
of the reference objects, and S
E
being the total area of the extracted
objects.
We also produced error matrices according to percentage of the
area of objects. Correctness, completeness, Producers accuracy and
Users accuracy were generated. For man made classes, the accu-
racy assessment has also been performed according to the number
of objects for buildings and parking lots classes, and according to
object length for road class.
For geometric quality assessment, two indexes are used to ana-
lyze the geometric quality of extracted objects: area ratio and po-
sition error. The area ratio (R
A
) yields information on the
percentage of the area accurately extracted for each object. It is de-
ned as follows:
R
A
AReal Objet \ Extracted Objet
AReal Objet
14
where A is the area. The position error (E
p
) is related to the mean
distance between extracted objects and their corresponding objects
in a reference data. It is determined according to characteristic
points in the two data. For buildings, the characteristic points cor-
respond to building corners, and for roads, they are chosen along
road axes. The position error (E
p
) is determined by the formula:
E
p
n
k1
x
ext
x
ref
2
y
ext
y
ref
2
_
_ _
n
15
where n is the number of the characteristic points, (x
ext
, y
ext
) corre-
spond to extracted points coordinates and (x
ref
, y
ref
) are coordinates
of the corresponding characteristic point on the reference layer.
3. Application
3.1. Data and study area
The proposed approach was applied to Ikonos image of the city
of Sherbrooke (Canada) acquired on 20 May 2001. We selected
three subsets from Ikonos image presenting different land use (ob-
jects size and density) in order to evaluate the approach on differ-
ent urban environment. In addition, we selected another Ikonos
image acquired on November 2004 over Sherbrooke (Canada) to
evaluate the effectiveness of the approach and its transferability
on the future. We intentionally selected an unchanged zone from
the two dates to examine if the approach was able to extract the
same objects. This way, the eventual observed changes in the re-
sults will be attributed to the approach. The reference data (ground
truth) was generated by photo-interpretation of the studied image.
The Ikonos image contains four spectral bands (blue, green, red
and near-infrared) and a panchromatic band with spatial resolu-
tions of 4 m and 1 m, respectively. In order to mutually benet
from multispectral information and very high spatial resolution,
the four bands have been individually fused with the panchromatic
image. The image fusion method used is the one proposed by He
et al. (2004). This method allows preserving faithfully the spectral
aspect of the low resolution image while integrating the spatial
information of the high resolution image.
3.2. Results and discussion
Fig. 8 shows the results on three extracts. 80% of the total refer-
ence area was extracted (completeness index) (Table 5). The cor-
rectness rate was of 78% (Table 3). The confusion matrices of the
three study sites are given by Tables 46 according to classes of le-
vel II of Table 1. The results show that the correctness rates for the
natural classes are high between 84% and 90%. This is due to their
distinct spectral responses. The spectral rules were the most
important to discriminate the natural classes and to differentiate
them from the man made classes. The main source of errors is re-
lated to the photo-interpretation conducted to create the ground
truth: it was difcult during photo-interpretation to identify some
vegetation areas due to their low contrast. For the man made clas-
ses, the results are encouraging with 80% as mean correctness rate.
For building class, 80% of the total reference area was correctly ex-
tracted. According to the number of extracted building, the results
analysis has revealed completeness rates of 93% for site 1, 77% for
site 2 and 96% for site 3, and a mean of correctness rate of 93% (Ta-
ble 7). The auto-learning process has contributed to reach these
values. It has allowed the extraction of more building (For example,
8 correct buildings were extracted in site 2 through the auto-learn-
ing rules). Some buildings have been omitted by the approach be-
cause they not have the properties dened for the building class.
Even with the auto-learning process, their spectral responses did
not correspond to those recorded in the retrieved library. Another
reason of this omission is that the distinction of these buildings
with relation to their neighborhood was difcult during the stage
of the creation of the primitives (segmentation). They were fused
with their neighborhood and thus did not have the characteristics
of the building class objects. The objects incorrectly extracted as
buildings correspond to articial areas that have similar geometric
and contextual properties. The complete and accurate extraction of
buildings objects faces two major problems: the confusion of
buildings with the neighborhood and the heterogeneity of the
buildings roofs.
Considering the road network class, the correctness rates are
about 75%. The false classied areas from the road network repre-
sent less than 4%. An analysis of the results of the two separate
subclasses road and parking lot is performed (Table 7). For the road
class, 80% of the total length of the roads for the studied sites has
been extracted. Their elongated shape is an interesting property for
their extraction. For a homogenous road, the approach allows its
quasi-complete extraction. Some omissions are due to the hetero-
geneity of roads surface and to projected shadows on roads either
by trees or by buildings.
For parking lot class, the confrontation with the ground truth
has showed that the quality of the extraction depends on the stud-
ied site. For some sites, all lots were extracted while for some oth-
ers, some parking lots have been omitted (site 2). Two reasons can
be identied. First, the denition of the objects of this class in the
rule base uses the property large size object. The approach allowed
178 I. Sebari, D.-C. He / ISPRS Journal of Photogrammetry and Remote Sensing 79 (2013) 171184
the extraction of the objects respecting this characteristic, but in
site 2, parking lots are located in residential areas and are of small
sizes. The second reason is the segmentations results. In some
case, due to presence of cars, the shape of the segmented parking
lot becomes elongated instead of compact and can be classied
as road. Also, two adjacent road and parking lot objects can form
one segment since they are built with the same material and ex-
tracted as one object.
The objects of the building and road classes are extracted with
an average of 80% of their area (Fig. 9). The objects of the parking
lot class have less important rates (average of 60%). This is due to
the complexity and the heterogeneity of these objects, as well as
by the presence of details (for example cars). For the position error,
the buildings present high spreads (higher than 11 m in the case of
site 3). For roads, a maximal value of 6 m was observed. These
spreads are due to: in the case of road objects, the central axe of
road can be offset due to the modied boundaries by shadows of
trees or buildings; for buildings, the corners of extracted objects
Fig. 8. Application of the proposed approach on Sherbrookes Ikonos image extracts (Site 1, Site 2, Site 3) (a) image extracts (b) extracted objects (c) reference map.
Table 3
Extraction quality index.
Indices Site 1 Site 2 Site 3
Completeness (%) 79 83 78
Correctness (%) 81 83 73
I. Sebari, D.-C. He / ISPRS Journal of Photogrammetry and Remote Sensing 79 (2013) 171184 179
can be offset due to incorrect delimitation by the approach and the
confusion between some roofs and fronts.
3.2.1. Shadow extraction
In order to evaluate the shadow detection performance, we
have compared extracted shadow areas to reference shadow areas
(created by photo-interpretation) for the extract 3 (Fig. 10). 92% of
shadow areas were well extracted. Producers accuracy (about 28%)
Table 5
Confusion matrix site 2.
Extracted objects Reference data
Vegetation Bare Soil Water Building Road network Correctness rate Producers accuracy
Vegetation 84 0 3 2 2 81 19
Bare soil 0 0 0 0 0
Water 0 0 90 0 0 99 1
Building 0 0 0 78 0 81 19
Road network 2 0 0 4 71 72 28
Completeness 84 90 78 71
Users accuracy 16 10 21 29
The values in the error matrix are presented in % of objects area.
Table 4
Confusion matrix site 1.
Extracted objects Reference data
Vegetation Bare soil Water Building Road network Correctness rate Producers accuracy
Vegetation 84 0 0 0 1 84 16
Bare soil 0 0 0 0 0
Water 0 0 0 0 0
Building 1 0 0 80 0 84 16
Road network 3 0 0 2 74 78 21
Completeness 84 80 74
Users accuracy 16 20 26
The values in the error matrix are presented in % of objects area.
Table 6
Confusion matrix site 3.
Extracted objects Reference data
Vegetation Bare Soil Water Building Road network Correctness rate Producers accuracy
Vegetation 85 0 0 0 0 71 29
Bare soil 0 0 0 0 0
Water 0 0 0 0 0
Building 0 0 0 77 2 78 22
Road network 0 0 0 1 81 75 25
Completeness 85 0 0 77 81
Users accuracy 15 0 0 23 19
The values in the error matrix are presented in % of objects area.
Table 7
Road and parking lot classes objects extraction quality indices.
Indices Road Parking lots
Site 1 Site 2 Site 3 Site 1 Site 2 Site 3
Completeness (%) 84 96 90 100 56 70
Correctness (%) 90 86 75 100 56 77
% of road length % of number of objects
0
20
40
60
80
100
Site 1 Site 2 Site 3 Site 1 Site 2 Site 3
A
r
e
a
R
a
t
i
o
(
%
)
Building Objects
0
2
4
6
8
10
12
P
o
s
i
t
i
o
n
E
r
r
o
r
(
m
)
Building objects
Fig. 9. Geometric quality assessment.
Extracted shadow Real shadow (by photo-interpretation)
Fig. 10. Comparison between extracted and reference shadow areas.
180 I. Sebari, D.-C. He / ISPRS Journal of Photogrammetry and Remote Sensing 79 (2013) 171184
is due in major part to the self shadows. The Users accuracy is esti-
mated to 8%. It is due to some low albedo materials. The advantage
of using fuzzy logic is that these areas present membership degrees
to shadow and also to the real class. The use of others attributes
(spectral, geometric and contextual attributes) minimizes the er-
rors of urban objects extraction. Even though detecting shadow
areas was useful for the identication of elevated objects (specially
building), some problems were observed: the projected shadow
has prevented the extraction of some parts of roads and parking
lots; the self shadow (the part of the object that is not illuminated,
i.e., the faade of the building) is not always detected, which has
cause in some cases the incorrect extraction of the geometric form
of objects.
3.2.2. Membership degrees
The objects membership degrees to nal classes are generally
high (greater than 0.6). The mean degrees are about 0.8 for the veg-
etation class, 0.8 for the building class, 0.85 for the road class and
0.65 for the parking lot class. The membership conict analysis has
revealed that the objects of natural classes present membership
degrees only to their corresponding class, whereas non-natural ob-
ject classes can present membership degrees to more than one
class. The objects that have a higher membership degree belong
to the appropriate class. The analysis of the precision and certainty
degrees of their extraction has revealed that more than 60% of the
objects extracted have a precision degree higher than 0.6 and a cer-
tainty degree higher than 0.8 (Fig. 11). For the certainty degrees,
they are higher than 0.8 and can reach a value of 1. Typical ob-
jects present high degrees of precision and certainty and low or
null conicts with others classes.
For all the classes, there are objects that present membership
degree equal to 1 which means that the class rule is descriptive
of the studied object. Also, no object has presented membership
degree equal to 1 for all the classes. We can conclude that the rule
base contains enough knowledge to discriminate between the
objects.
3.2.3. Thresholds determination
In order to evaluate the adopted methodology for automatically
determinate attributes thresholds, we have conducted a sensitiv-
ity analysis by introducing variations to the determined thresholds
values. The corresponding rule is then applied and the correctness
rate, Producers and users accuracies are determined. Table 8 pre-
sents the example of results of this analysis in the case of bright-
ness index (S) used to detect shadow areas. Variations DS of 10
were added and subtracted from the thresholds value.
The analysis shows that the use of obtained threshold yields the
best results (correctness rate of 92% with producers accuracy of
28%). Using S + DS, the correctness rate is the same but the pro-
ducers accuracy is greater (40%) while with S DS, even if the
producers accuracy is lower, the Users accuracy is still important
(23%).
The determination of thresholds is very sensitive task. We
think that using threshold in rigid way is not adequate to our case
(for example, object with a area of 700 m
2
is considered large and
another with an area of 701 is not?). So, using fuzzy threshold is in
our opinion more interesting. The proposed approach uses many
thresholds which are automatically derived. But, we should precise
that the automatically determined thresholds are used as approx-
imate values which will be useful to establish the membership
functions. The proposed methodologies for automatic detection
are not perfect. The imprecision related to the determination of
threshold is considered by fuzzy logic. Indeed, the imprecision of
adopted thresholds is considered by the used membership func-
tions: each attribute threshold is considered as the crossover point
of the corresponding membership function. A transition interval,
dened with relation to the attributes threshold, reects a gradual
membership and minimizes the imprecision related the attributes
thresholds determination (Fig. 5).
All the thresholds are important because they are useful to
establish the membership functions. During rule formulation, the
incorrect determination of thresholds may be cause the incorrect
classication of an object if the difference between the correct
and the incorrect values of threshold is important. If not, the object
will have a low membership degree to its real class.
3.2.4. Transferability test
We have also conduct a transferability test in order to verify if,
for the same zone, the approach will extract the same objects with-
out any changes in the rule base. The transferability of the extrac-
tion approach was evaluated on two subsets from two different
Ikonos images over the same zone (Fig. 12). The same rule base
was applied on two subsets. The decision rules are not modied
between the two images and no new rule was used. The thresholds
of the attributes dependant of the image are automatically deter-
mined by the approach. The objects were well extracted on both
images. They have the same assignations to the classes on the
2001 and 2004 extracts. The only differences are observed at object
areas and also at shadow zones.
3.2.5. Some limitations
For very large image, the application of the proposed approach
will give satised results when extracting typical objects. But, it
Site 1 Site 2 Site 3
0.0
0.2
0.4
0.6
0.8
1.0
D
e
g
r
e
e
o
f
p
r
e
c
i
s
i
o
n
Site 1 Site 3 Site 2
0.0
0.2
0.4
0.6
0.8
1.0
D
e
g
r
e
e
o
f
C
e
r
t
a
i
n
t
y
Fig. 11. Statistics on (a) the precision degrees and (b) certainty degrees of extracted objects on the three study sites.
Table 8
Sensitivity analysis of the brightness index thresholds.
Real shadow (reference data)
Correctness rate (%) Users accuracy (%) Producers accuracy (%)
Extracted shadow
S 92 8 28
S DS 77 23 12
S + DS 93 7 40
I. Sebari, D.-C. He / ISPRS Journal of Photogrammetry and Remote Sensing 79 (2013) 171184 181
can present some limitations according to the difculty and the
complexity of the urban environment. The typical objects can
be extracted without problems (except the exact geometric shape).
A typical object can be for example a building that presents a com-
pact form and a cast shadow. Some buildings in dense zone are
contiguous and it will be difcult to extract individual building.
Also, for roads, on some images, they are hided by building and
shadows. Their extraction is not going to be exhaustive and com-
plete. The extraction results will depend on the segmentation re-
sults. The used algorithm used an adaptive threshold when
segmenting each segment. The algorithm presents satisfactory re-
sults. But, in dense environment, the algorithm may confuse some
objects presenting same spectral responses and consequently pro-
vide one segment.
4. Conclusion
This paper presents a solution for automatic extraction of geo-
graphic urban information from VHSR multispectral images. The
proposed approach is object-based image analysis. It does not
use no auxiliary data or parameter to introduce and does not re-
quire any examples for learning process. The extraction is based
on simple, objective, and transferable rules. Formulating the rules
is performed through the translation of the knowledge used by
the photo-interpreters in order to interpret urban objects from
VHSR image. It used different types of properties. The thresholds
of the attributes, which are dependent on the images and used in
the rules, are automatically determined. Integrating the fuzzy logic
and the possibility theories into the rule base has contributed to
manage the complexity of the subject studied. It allowed to reason
with imprecise and uncertain knowledge and to give information
on the quality of the extracted objects. The used fuzzy rule base
has the benet of not requiring any order of application for the
rules. The objects respond to the different rules with satisfaction
degrees ranging from 0 to 1. The order of application has no effect
on the nal results. The nal decision is taken at the end according
to the precision, certainty and conict degrees observed on the dif-
ferent classes. The results obtained translate the pertinence of the
rules. A global extraction rate of 80% was observed. The correctness
rates obtained for the building, road and parking lot classes are of
81%, 75% and 60%, respectively.
The challenge was great given the complexity of the studied
area and the particularities of the used images. The automatic
extraction approach proposed has displayed a transferability
power. One of the limitations of this approach is the conformity
of the geometric shape of the extracted objects to the reality.
Enhancing this aspect would be useful for the future integration
of the objects extracted in spatial databases. It would also be inter-
esting to study the application of the approach on high spectral sa-
tellite images in order to study the effect of introducing additional
bands on the improvement of mapping accuracy.
Acknowledgments
This work was supported by the NSERC (the Natural Sciences
and Engineering Research Council of Canada) and the PCBF (Prog-
amme Canadien de Bourses de la Francophonie) of CIDA (Canadian
International Development Agency).
References
Baatz, M., Schpe, M., 2000. Multiresolution segmentation an optimization
approach for high quality multi-scale image segmentation. In: Strobl, J.,
Fig. 12. Transferability study of the approach (a) Ikonos image subset of 2001, (b) Ikonos image subset of 2004, (c) extracted objects from Ikonos 2001, (d) extracted objects
from Ikonos 2004.
182 I. Sebari, D.-C. He / ISPRS Journal of Photogrammetry and Remote Sensing 79 (2013) 171184
Blaschke, T., Griesebner, G. (Eds.), Angewandte Geographische Informations-
Verarbeitung XII. Wichmann Verlag, Karlsruhe, pp. 1223.
Baltsavias, E.P., 2004. Object extraction and revision by image analysis using
existing geodata and knowledge: current status and steps towards operational
systems. ISPRS Journal of Photogrammetry and Remote Sensing 58 (34), 129
151.
Bauer, T., Steinnocher, K., 2001. Per-parcel land use classication in urban areas
applying a rule-based technique. GIS 6 (1), 2427.
Blaschke, T., Lang, S., Lorup, E., Strobl, J., Zeil, P., 2000. Object-oriented image
processing in an integrated GIS/remote sensing environment and perspectives
for environmental applications. In: Cremers, A., Greve, K. (Eds.), Environmental
Information for Planning, Politics and the Public, vol. 2. Metropolis Verlag,
Marburg, pp. 555570.
Blaschke, T., 2010. Object-based image analysis for remote sensing. ISPRS Journal of
Photogrammetry and Remote Sensing 65 (1), 216.
Benz, U.C., Hofmann, P., Willhauck, G., Lingenfelder, I., Heynen, M., 2004. Multi-
resolution, object-oriented fuzzy analysis of remote sensing data for GIS-ready
information. ISPRS Journal of Photogrammetry and Remote Sensing 58 (34),
239258.
Biehl, L., Landgrebe, D., 2002. MultiSpec: a tool for multispectral hyperspectral
image data analysis. Computers and Geosciences Archive 28 (10), 11531159.
Bouchon-Meunier, B., Marsala, C., 2003. Logique oue, principes, aide la dcision.
Hermes science publications, Paris.
Bouziani, M., Gota, K., He, D.-C., 2010. Automatic change detection of buildings in
urban environment from very high spatial resolution images using existing
geodatabase and prior knowledge. ISPRS Journal of Photogrammetry and
Remote Sensing 65 (1), 143153.
Caloz, R., Pointet, A., 2003. Analyse comparative de la classication contextuelle et
du maximum de vraisemblance: synthse et cas dtude. Tldtection 3 (23
4), 311322.
Campbell, J.B., 2007. Introduction to Remote Sensing. Guildford Press, New York.
Caprioli, M., Tarantino E., 2003. Urban features recognition form VHR satellite data
with an object-oriented approach. International Symposium of Remote Sensing
(ISPRS) Commission IV Joint Workshop Challenges in Geospatial Analysis,
Integration and Visualization II. <http://www.igf.uniosnabrueck.de/
mitarbeiter/schiewe/papers/24.pdf>.
Demarchi, L., Chan, J.C.-W., Ma, J., Canters, F., 2012. Mapping impervious surfaces
from superresolution enhanced CHRIS/Proba imagery using multiple
endmember unmixing. ISPRS Journal of Photogrammetry and Remote Sensing
72, 99112.
Donnay, J.-P., Barnsley, M.J., Longley, P.A., 2001. Remote Sensing and Urban
Analysis. Taylor and Francis, London.
Dubois, D., Esteva, F., Godo, L., Prade, H., 2007. Fuzzy-set based logicsan history-
oriented presentation of their main developments. In: Gabbay, D.M., Woods, J.
(Eds.), Handbook of the History of Logic. Elsevier, pp. 325449.
Gao, F., Brown de Colstoun, E., Ma, R., Weng, Q., Masek, J.G., Chen, J., Pan, Y., Song, C.,
2012. Mapping impervious surface expansion using medium-resolution
satellite image time series: a case study in the Yangtze River Delta, China.
International Journal of Remote Sensing 33 (24), 76097628.
Forestier, G., Puissant, A., Wemmert, C., Ganarski, P., 2012. Knowledge-based
region labeling for remote sensing image interpretation. Computers,
Environment and Urban Systems 36 (5), 470480.
Han, J., Chi, K., Yeon, Y., 2005. Land cover classication of IKONOS multispectral
satellite data: neuro-fuzzy, Neural network and maximum likelihood methods.
In: S