s3
s3
Abstract
Canopy height models (CHMs) derived from lidar data have calculate a raster CHM from the first return lidar points and
been applied to extract forest inventory parameters. How- then extract local maxima from that raster CHM (Hyyppä et al.,
ever, variations in modeled height cause data pits, which 2008; Lim et al., 2003b).
form a challenging problem as they disrupt CHM smooth- The main challenges faced in treetop detection are com-
ness, negatively affecting tree detection and subsequent mission errors (falsely detected trees) and omission errors
biophysical measurements. These pits appear where laser (undetected trees) (Hosoi et al., 2012; Pouliot et al., 2005).
beams penetrate deeply into a tree crown, hitting a lower These errors are mainly attributed to natural variation in tree
branch or the ground before producing the first return. In crown size (Pitkänen et al., 2004) as well as to height irregu-
this study, we develop a new algorithm that generates a larities within individual tree crowns in the input CHM (Sol-
pit-free CHM raster, by using subsets of the lidar points to berg et al., 2006). To address natural variation in crown size,
close pits. The algorithm operates robustly on high-density researchers have developed processing methods that adapt to
lidar data as well as on a thinned lidar dataset. The evalu- the crown (object) size. Pitkänen et al. (2004) developed and
ation involves detecting individual trees using the pit-free tested three different adaptive methods for individual tree
CHM and comparing the findings to those achieved by us- detection based on canopy differences. Wulder et al. (2000)
ing a Gaussian smoothed CHM. The results show that our proposed the use of a local maxima filter with variable win-
pit-free CHMs derived from high- and low-density lidar data dow sizes. However, if the selected window size is smaller
significantly improve the accuracy of tree detection. or larger than the crown size, then the commission or omis-
sion error, respectively, will increase. In order to select the
correct window size, Popescu and Wynne (2004) introduced
Introduction an adaptively varying window technique, based on the idea
The use of airborne Light Detection and Ranging (lidar) has that a moving local maxima filter should be adjustable to an
been increasing in forestry. Lidar is capable of providing appropriate width to account for different crown sizes.
accurate three-dimensional information on forest structure To address irregularities in crown height, a number of
(Lim et al., 2003a), contributing significantly to the improved researchers have suggested pre-processing CHMs to reduce
accuracy of forest inventories (Magnussen et al., 2010; Yu commission and omission errors (Bortolot and Wynne, 2005;
et al., 2011) and subsequent biophysical parameters such as Brandtberg et al., 2003; Chen et al., 2006; Solberg et al., 2006).
biomass (Nelson et al., 1988; Popescu, 2007). Irregularities in canopy surface elevation, also called “data
Typically, a lidar-derived Canopy Height Model (CHM) or a pits,” form a challenging problem due to their disruptive
normalized Digital Surface Model (nDSM) is used for extract- influence on a CHM, reducing accuracy in tree detection and
ing relevant forest inventory information, such as detecting subsequent biophysical measurements (Ben-Arie et al., 2009;
single trees for subsequent height estimation and crown delin- Gaveau and Hill, 2003; Zhao et al., 2009). For example, Sham-
eation (Bortolot and Wynne, 2005; Forzieri et al., 2009). The soddini et al. (2013) indicated that data pits may significantly
CHM represents absolute canopy height above ground, and it affect the estimation of structural forest parameters, especially
is typically calculated by interpolating the first return lidar basal area and stand volume. Since the processing of raw
points and determining their height above a digital terrain lidar point clouds into a meaningful raster is a composition
model. (Hyyppä et al., 2008; Van Leeuwen et al., 2010). Tree of many different procedures, there is no unified agreement
height measurement and crown delineation mainly rely on on the cause of data pits. Axelsson (1999) found that some in-
the identification of local maxima, with each local maximum formation from raw point clouds with similar x-y coordinates
corresponding to the location of an individual treetop and the and different z values is lost when the points are interpolated
surrounding segments forming the tree crown (Véga and Dur- into a raster. Such lost data become significant when multiple
rieu, 2011). Therefore, to be able to extract relevant structural echoes are registered in a forested area. Ben-Arie et al. (2009)
parameters of trees (e.g., tree height) the correct location of and Véga and Durrieu (2011) stated that the problem of data
single trees in the CHM is of fundamental importance (Chen et pits was due to laser scanning processing and/or postprocess-
al., 2006; Persson et al., 2002; Yao et al., 2012). While some ing of lidar point clouds. Data pits may also occur during
researchers have tried to find local maxima directly in the li- classification of lidar point clouds into ground and non-
dar points (Li et al., 2012), most operational users of lidar first ground points when creating a Digital Surface Model (DSM) or
a Digital Terrain Model (DTM), depending on the classification
Anahita Khosravipour, Andrew K. Skidmore, Tiejun Wang,
Yousif A. Hussin are with the Department of Natural Resourc- Photogrammetric Engineering & Remote Sensing
es, Faculty of Geo-Information Science and Earth Observa- Vol. 80, No. 9, September 2014, pp. 863–872.
tion, University of Twente, P.O. Box 217, 7500 AA Enschede, 0099-1112/14/8009–863
The Netherlands ([email protected]). © 2014 American Society for Photogrammetry
Martin Isenburg is with Rapidlasso GmbH, Friedrichshafener and Remote Sensing
Straße 1, 82205 Gilching, Germany. doi: 10.14358/PERS.80.9.863
Lidar Data Table 2. The Airborne Laser Scanning Characteristics (Razak et al., 2011)
The lidar data and aerial photographs (0.15 m pixel size) were
simultaneously acquired during the leaf-on season in July Acquisition (month/year) July–2009
2009 using a helicopter flying about 300 m above ground level Laser scanner Riegl VQ480i
(Table 2). A full-waveform airborne laser scanning system
IMU system iMAR FSAS - record 500Hz
(RIEGL VQ-480), which was developed for surveying moun-
tainous forested areas, was utilized by Helimap (Vallet and GPS system Topcon legacy - record 5Hz
Skaloud, 2004). The system provides high-speed data acquisi- Laser pulse repetition rate 300 kHz
tion using a fast line scanning mechanism and a narrow in-
frared laser beam. Moreover, the system performs on-line full Measurement rate Up to 150 000 s–1
waveform analysis (in hardware) to extract discrete returns Laser wavelength Near infrared
from the waveforms. For our data set, the VQ-480 system used
a laser pulse repetition rate of up to 300 kHz and recorded up Beam divergence 0.3 mrad
to five returns for each pulse. The initial aim of this survey Laser beam footprint 75 mm at 250 m
was to accurately map complex landslides in forested terrain
(Razak et al., 2011). Therefore, to increase the point cloud Field of view 60°
density, the area was covered by seven flights. The mean Scanning method Rotating multi-facet mirror
point density was 160 point/m2 with an average distance
between laser points of 0.08 m.
There is a time difference between the collected data by Methods
lidar acquisition in July 2009 and the field measurements in
September 2011 and 2012. However, our study area consists Preprocessing of Lidar Data
of mature forest and is characterized by relatively low tem- Our original lidar dataset was stored separately in adjacent
peratures during most of the year, which results in a relatively irregular tiles and classified into ground-points and non-
low tree growth rate. Therefore, we assume that the difference ground-points. To avoid edge effects along tile boundaries
in tree height due to natural growth between the time of the (Brandtberg et al., 2003), the lidar points were retiled with a 25
lidar survey and the acquisition of ground data is negligible. meter buffer around each tile. This essentially moves potential
artifacts into the tile buffer. Unlike other methods that create
a raster CHM from the difference between two rasters (i.e., by
Figure 2. The generated TINs and rasters of partial chms for AI (%) = [(n – O + C)/n] × 100
an individual tree.
where n is the number of reference trees in the study area, O
is the omission error, and C is the commission error. Based
Experimenting with this rasterizing process, we found 1.5 m on the DBH distribution (Table 1) the range in DBH of the trees
to be an optimal rasterization threshold for such a low-densi- was divided into three classes: DBH less than 20 cm, DBH from
ty lidar dataset. 20 to 40 cm, and DBH greater than 40 cm. The proportion of
correctly detected trees was calculated as a percentage of the
Smoothing CHM Using Gaussian Filter total number trees in each class.
The degree of smoothness of a CHM is determined by the
standard deviation (Gaussian scale) and the window kernel
Figure 5. Example of detected trees, illustrating correctly identified crowns, omission errors, and commission errors of the pit-free
CHM derived from high-density lidar data.chm
Table 4. Tree Detection Results for the Smoothed CHM and the Pit-free chm; Both Models have a Pixel Size of 0.50 m and were Derived from Low-Density
Lidar Data
Field-measured trees Gaussian Smoothed CHM Pit-free CHM
number Species Correct Omission Commission AI Correct Omission Commission AI
of trees n (%) n (%) n (%) (%) n (%) n (%) n (%) (%)
33 deciduous trees 17 (51.5) 16 (48.5) 10 (30.3) 21.2 20 (60.6) 13 (39.4) 8 (24.2) 33.3
29 Larix decidua 26 (89.7) 3 (10.3) 6 (23.0) 69 26 (89.7) 3 (10.3) 1 (3.4) 86.2
325 Pinus sylvestris 257(79.0) 68 (21.0) 139 (42.7) 36.3 268(82.5) 57 (17.5) 35 (10.7) 71.7
307 Pinus uncinata 184(60.0) 123 (40.0) 81 (26.3) 33.6 205(66.8) 102 (33.2) 4 (1.3) 65.5
694 Total 484(69.8) 210 (30.2) 236 (34.0) 35.7 519(74.8) 175 (25.2) 48 (6.9) 67.7
number of correctly detected trees in the smoothed CHM (511) Table 5. Proportion of Correctly Identified Trees for the Three Stem Diam-
and the pit-free CHM (544): x2 = 4.302, and p = 0.038. eter Classes for Each chm and Point Density.
The results of the tree detection assessment in evaluating All (n = 694)
the pit-free algorithm based on poor quality lidar data can be
seen in Table 4. As expected, the decrease of the point density Portion of DBH range <20 20 to 40 >40
from 160 points/m2 to 7 points/m2 resulted in a very low (cm) (n = 258) (n = 397) (n = 39)
accuracy (i.e., more commission error) when using Gaussian High- smoothed CHM 156 (60.4 %) 322 (81.1 %) 33 (84.6 %)
filters for both deciduous and coniferous trees. The pit-free density
CHM improved the total accuracy of the tree detection in Lidar Pit-free CHM 184 (71.3 %) 326 (82.1 %) 34 (87.1 %)
comparison to the smoothed CHM, resulting in Accuracy Index
Low- smoothed CHM 130 (50.3 %) 322 (81.1 %) 32 (82 %)
values of 67.7 percent and 35.7 percent, respectively. Con- density
sidering all 694 reference trees, 74.8 percent of the trees were Lidar Pit-free CHM 160 (62 %) 327 (82.3 %) 32 (82 %)
detected correctly in the pit-free CHM and 69.8 percent in the
smoothed CHM. The total commission error was 34 percent
for the smoothed CHM, while it was only 6.9 percent for the Discussion
pit-free CHM. The total omission error was 25.2 percent for the One of the challenges in creating a canopy height model
pit-free CHM and 30.2 percent for the smoothed CHM. In addi- through interpolation of first returns is that pits appear when-
tion, the Chi-square test indicated a statistically significant ever a laser beam is able to penetrate a tree crown deeply
difference between the pit-free CHM and the smoothed CHM, before it generates the first return. Based on this observa-
with the pit-free CHM resulting in a higher number of correctly tion we propose a new “pit-free” algorithm, which is able to
detected trees (x2 = 4.403, p = 0.036). remove pits efficiently and shows good potential for improv-
The range of DBH values in the study site (Table 5) allowed ing the detection of forest trees. Our algorithm is novel in that
us to assess how the pit-free CHM compared to the smoothed it removes the data pits directly from the lidar point clouds
CHM for both datasets. Trees with a DBH of more than 20 cm during the CHM creation. Other approaches, including auto-
were successfully detected with both different types of CHM mated methods (e.g., a smoothing filter) and semi-automated
and data. However, a large portion of the missed trees had a methods (i.e., a pit-filling algorithm) do not accurately target
DBH of less than 20 cm, with 71.3 percent and 62 percent of only the pits, but alter all pixels of the CHM raster (Ben-Arie
them correctly detected with a pit-free CHM using high- and et al., 2009; Shamsoddini et al., 2013). Moreover, automated
low-density data, respectively. Only 60.4 percent and 50.3 and semi-automated methods do not consider the natural 3D
percent of trees this size were detected with a smoothed CHM structure of individual tree crowns when removing height
derived from high- and low-density data, respectively. The variations. They consider the crown shape of forest trees as a
Chi-square test showed a significant difference between the circle (in nadir view on the raster CHM) (Zhao et al., 2013).
pit-free CHM and the smoothed CHM regarding the number Our algorithm makes use of the knowledge that a tree
of correctly identified trees with DBH less than 20 cm (high- crown tends to create lidar returns at a wide range of heights
density: p = 0.009; low-density: p = 0.008). However, there and generates a set of partial CHMs to remove the pits. There
was no significant difference between the types of CHM in are two important thresholds in generating the partial CHMs:
detecting trees with a DBH of more than 20 cm (high-density: p (a) the height thresholds, which define the set of partial CHMs,
= 0.0714; low-density: p = 0.646) or more than 40 cm (high- and (b) the rasterization threshold, which includes only those
density: p = 0.745; low-density: p = 1.000). triangles with edge lengths below a particular value. There is