Comparing Statistical Techniques To Clas PDF
Comparing Statistical Techniques To Clas PDF
doi: 10.5721/EuJRS20144706
Received 16/07/2013, accepted 01/02/2014
www.aitjournal.com
Introduction
Forest structure analysis and characterization is part of most forest planning processes for
sustainable forest management and is of special interest from a silvicultural point of view.
Knowledge about current forest structural conditions is crucial to understanding the specific
75
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
Methods
Study area
The province of Trento (6212 Km2) is situated in the North-East of Italy on the Southern
side of the Alps chain (Fig. 1).
The territory is almost entirely mountainous, and 60% of its surface is covered by
forests [De Natale and Pignatti, 2011]. Approximately, 72% of the forest surface is
owned by public institutions and is subject to a forest management plan with broad
objectives, such as maintaining productive function of the forest, management of the
services provided by the forest (protection, tourism and recreation, carbon dioxide
fixation, etc), and improvement and conservation of biodiversity in terms of species,
76
European Journal of Remote Sensing - 2014, 47: 75-94
The area of Trento province, according to its geographical and climatic gradients, can be
divided into three zones: endalpica, mesalpica and esalpica [Odasso, 2002]. The endalpica
zone includes upland areas with higher elevation and landlocked valleys. There are
environments with harsh continental climate, particularly favourable to forest communities
dominated by boreal conifers like Norway spruce (Picea abies) or cembro pine (Pinus
cembra). The mesalpica zone include mountains with relatively lower elevations, generally
found on plateaus and in valleys, typically with East-West orientation and with average
elevation around 1000 m a.s.l.. Cool climate, from sub-continental to sub-oceanic,
characterizes this zone, where forests are dominated by mesophile tree species like silver
fir (Abies alba) and beech (Fagus sylvatica). The esalpica zone is concentrated in a central
strip orientated North-South in the Trento province territory, with elevation below 1000 m
a.s.l., characterized by incursions of species with sub-Mediterranean or steppe character,
dominated by the forests composed of thermophile broad leaved trees (Ostrya carpinifolia,
Carpinus betulus, Fraxinus ornus, Quercus pubescens, Quercus petrae, etc.).
The forest structural classification system used by the Forest Service of Trento Province
The Forest Service of Trento Province is nowadays using a structural classification system
based on the distribution of basal area in specific tree size classes both in inventory and
forest management decision making phases. The classification in forest structural types
77
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
is one of the primary reference (with the species composition) for the forest managers to
identify the strata in the process of forest inventory based on a stratified sampling. During
the decision making process, forest managers use the forest structural classification to
identify the potential evolution of a stand, that is to say the potential transition of a specific
structural type toward another one, and consequently decide which kind of silvicultural
interventions to implement estimating the potential volume to fell in each tree size class.
The recently forest structural classification system adopted by the Forest Service of Trento
Province is based on the percentage of basal area in three classes of trees [Scrinzi et al.,
2011], defined as small trees (17.5 cm to 27.4 cm diameter at breast height, dbh), medium
trees (27.5 to 47.4 cm dbh), and large trees (above 47.5 cm dbh).
When the percentage of basal area per hectare is greater than 15% in all classes, then the
structure is classified as irregular, which could be small trees dominated (I1), medium tree
dominated (I2) or large tree dominated (I3). If the percentage of basal area per hectare is
greater than 15% in two classes, then the stand is classified as regular dominated by small
(R21 and R31), medium (R12 and R32) or large trees (R13 and R23). When the percentage
of basal area per hectare is greater than 15% in only one class, then the stand is classified as
regular widely dominated by small (R11), medium (R22) or large trees (R33). For example,
if in a forest stand the basal area of small trees is 25%, the basal area of medium trees
is 30% and the basal area of large trees is 45% then the stand is classified as irregular
dominated by large trees, and identified by the code I3 (I stands for irregular and 3 is the
number that distinguishes large trees).
Based on these combinations, the structure of forest stands will classified as one of twelve
forest structural types, including three irregular types and nine regular types (Fig. 2).
78
European Journal of Remote Sensing - 2014, 47: 75-94
d2
h = 1.3 + [1]
a + bd + cd 2
The main statistic parameters of the height-diameter function are reported in Table 1.
79
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
By means of the height-diameter function we estimated that the height of small trees ranged
from 8.5 to 15.6 m, the height of medium trees ranged from the 15.6 to 26.5 m and large
trees were greater than 26.5 m.
The 90 plots surveyed were classified into nine forest structure types: I2 (6 plots), I3 (3
plots), R11 (9 plots), R12 (7 plots), R21 (14 plots), R23 (22 plots), R31 (1 plot), R32 (25
plots), and R33 (3 plots).
80
European Journal of Remote Sensing - 2014, 47: 75-94
The raw data were filtered and classified by the vendor using Terrascan software (Terrasolid
Ltd.) which utilizes the Axelsson [1999] filtering algorithm based on Triangulated Irregular
Network densification. Data delivered by the vendor included a DSM and a DTM as a
regular grid with a spatial resolution of 1 m2. We calculated the CHM as the DSM minus
the DTM by means of spatial analyst tools in ArcGis 9.x software (ESRI). Canopy height
values greater than 2 m were assumed to be vegetation hits, this means that values less than
2 m were set to 0.
From the CHM for each of the 90 plots, grid cells, corresponding in shape and size at those
of each field plot, were extracted using ArcGIS. The maptools package in R [Lewin-Koh
and Bivand, 2012] was used to compute the predictor variables from the height probability
distributions (thirteen predictor variables), and from the relative frequency distributions
of vegetation heights (four predictor variables) of each plot: in total, seventeen candidate
predictor variables were generated for modelling (Tab. 3).
H50PCT Height at which 50% of pixels fall below (median of height pixels intersecting plot)
Statistical analyses
Supervised classification approach
Linear discriminant analysis
Linear discriminant analysis (LDA) is a supervised classification approach. In LDA, a
81
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
k
w1 = a1 x1 + a2 x2 + ....... + ak xk = ∑a x
i =1
i i [ 2]
The weights ai are chosen to maximize the separation between groups. A classification rule
is developed in combination with the weighting function in [2]. Using maximum likelihood
theory, classifications are assigned to ranges of values. E.g., if w1 ∈ ( l1 , l2 ] for an observation
where l1 and l2 represent limits for a given class, then the observation is assigned to that class.
If w1 ∉ ( l1 , l2 ] then the observation is assigned to an alternate class (Fig. 3).
LDA was performed in R using the MASS package [Venables and Ripley, 2002], which
implements a Bayesian decision theory approach. For LDA the covariates are assumed to
have a common multivariate normal distribution.
Random forest
RF is a classification technique, based on the use of classification and regression trees,
developed by Breiman [2001].
Classification trees build rules for assigning current observations into classes using numerical
and/or categorical predictor variables by recursive binary partitioning into regions that are
increasingly homogeneous with respect to the class variable. The homogeneous regions are
called nodes [Cutler et al., 2007]. RF fits many classification trees to a data set, and then
combines the predictions from all the trees. The algorithm begins with the selection of many
(e.g., 500) bootstrap samples from the data. Observations in the original data set that do not
occur in a bootstrap samples are called out-of-bag observations. A classification tree is fit to
each bootstrap sample, and at each node, a small number of randomly selected variables are
available for the binary partitioning. The trees are fully grown and each is used to predict the
out-of-bag observations. The predicted class of an observation is calculated by majority vote
of the out-of-bag predictions for that observation, with ties split randomly [Ok et al., 2012].
Breiman [2001] called this procedure random forest because the base constituents of the
ensemble are tree-structured predictors, and because each of these trees is constructed using
an injection of randomness.
82
European Journal of Remote Sensing - 2014, 47: 75-94
The RF classification was carried out using the randomForest package in R [Liaw and
Wiener, 2002]. The RF algorithm in R works as follows:
• grow a specified number of trees ( ntree ) by the bootstrap method from the original
data;
• for each of the bootstrap samples, grow an unpruned classification so that at each
node randomly sample a number of variables ( mtry ) of the predictors as candidates
at each split;
• predict new data by aggregating the predictions of the ntree trees.
The randomForest package optionally produces the measure of the importance of the
predictor variables, and a measure of the internal structure of the data, that give information
about the proximity of different data points to one another.
Parametric regression
The second strategy used to predict basal areas for small medium and large trees used
parametric linear and nonlinear models.
Parametric regression analysis is performed to evaluate the dependence of a response
variable on one or several predictors. The function is specified explicitly and can be linear
and non linear in the parameters. In this study, a linear regression model with the selected
explanatory variables was developed by using ordinary least squares (OLS). There are
extensive examples in which OLS has been successfully applied to predict basal area [e.g.
Lefsky et al., 1999; Hudak et al., 2006].
A non linear regression model with multiple explanatory variables was also developed by
using the non-linear least squares (NLS).
83
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
Performance statistics
Performance statistics were computed both for the classification accuracy of our multiple
approaches, and for the prediction accuracy from modelling basal area with CHM
derivatives.
We used leave-one-out (LOO) cross-validation to evaluate our ability to predict forest
structure classes from CHM data. The results of LOO were categorized in confusion
matrices, and evaluated by the overall, producer’s, and user’s accuracies. The overall model
accuracy represents the percentage of plots correctly classified with respect to the total
number of sample plots. The producer’s accuracy refers to the probability that a certain
forest structural type of a stand on the ground is classified as such. The user’s accuracy
refers to the probability that a stand labelled as a certain forest structural type is really this
type.
We looked at the prediction performances of basal area models using the Root Mean Square
Error (RMSE). According to Kendall and Buckland [1975], “in general, the mean square
error of a set of values is the arithmetic mean of the squares of their differences from
some given value, namely their second moment about that value. When the mean square is
regarded as an estimator of certain parental variance components the sum of the squares
about the observed mean is usually divided by the number of degrees of freedom, not the
number of observation”.
When considering the basal area per hectare of small, medium, large and all trees as the
response variables, the RMSE was calculated as follows:
n
∑ (G − G )
2
i i
RMSE = i =1
[3]
n- p
where Gi is the observed basal area per hectare of sample plot i, Gi is the basal area of
sample plot i estimated from the predicted distribution, n - p is the degrees of freedom.
To facilitate comparisons between the performances of basal area prediction techniques,
84
European Journal of Remote Sensing - 2014, 47: 75-94
we also report a scaled RMSE obtained by dividing the RMSE by the standard deviation of
the training dataset.
For all basal area prediction techniques considered, we also computed the coefficient of
determination, which is the proportion of total variation explained by the model.
Results
The results of the analyses within the supervised classification approach are shown with
confusion matrices in Table 4 (LDA technique) and Table 5 (RF technique).
The overall accuracy of LDA was 41%. The user’s accuracy has not exceeded 57% while
the producer’s accuracy has not exceeded the 67%. The forest structure types that had
85
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
especially good producer’s accuracy were the R11 and R32, while the I3, R12 and the R31
reached low levels of producer’s accuracy.
In the case of RF classification, a stable overall accuracy (37%) was obtained setting the
number of classification trees per response variable equal to 200 and choosing the number
of predictor variables to use in each split equal to 4: using lower or greater than 200 trees
resulted in overall declining accuracy. Also in this case, the user’s and producer’s accuracy
has not respectively exceeded 57% and 68%. The worst producer’s accuracy was in the
classification of plots characterized by irregular structure (I2 and I3) and regular structure
dominated by large trees (R31 and R33), while the better produce’s accuracy was in
classifying the R32 forest structure type.
The results of the analyses within the prediction-based classification approach are shown
with confusion matrices in Table 6 and Table 7 (k-NN with RF and MSN methods
respectively) and Table 8 and Table 9 (multiple linear regression and multiple non linear
regression respectively). Between the techniques used in the prediction-based classification
approach, the k-NN with the RF method provided better results with respect to the multiple
linear regression and non linear regression.
The overall accuracy for k-NN imputation by the RF method was 37% (Tab. 6). The
maximum level of user’s accuracy was 62% while the maximum level of producer’s
accuracy was 56% . The irregular structures (I2 and I3) and the regular stands dominated
by large trees (R31 and R33) had poor producer’s accuracy, while the R11 and R32 forest
type had the higher accuracy (56% and 52% respectively).
The overall accuracy for k-NN imputation by the MSN method was 34%. The maximum
value of producer’s accuracy was 40%, while the user’s accuracy did not pass 46% (R23).
The forest structure types that had especially good producer’s accuracy were the R23 and
R32, while the I3, R31 and the R31 reached low levels of producer’s accuracy (Tab. 7).
86
European Journal of Remote Sensing - 2014, 47: 75-94
The overall accuracy in the classification by the multiple linear regression was 28%. (Tab.
8) The maximum level of producer’s accuracy was 83% while the maximum level of user’s
accuracy was 54%: this prediction-based classification technique shows poor level of
producer’s accuracy in the classification of I1, I3, R11,R21, R31 and R33, and good level in
the classification of I2.
The overall accuracy of the classification based on the results of non linear regression was
26% (Tab. 9). The confusion table shows that the user’s accuracy has not exceeded 57% while
the producer’s accuracy has not passed the 67%. The forest structure types that had especially
good producer’s accuracy were the R11 and R32, while the I3, R12 and the R31 reached low
levels of producer’s accuracy.
87
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
We can note a common trend across all techniques (used both in supervised classification
and prediction-based classification) consisting in high levels of producer’s accuracy of the
forest structural type R32. Supervised classification techniques are poor in classifying the
irregular structure, i.e. multi-stored stands, vice versa the linear and non-linear regression
techniques provided the higher levels of producers’ accuracy in these types of forest
structure.
RF k-NN, MSN k-NN, OLS and NLS had comparable scaled RMSE values in prediction
the basal area per hectare for small, medium and large trees and of all trees (Tab. 10). In
all cases the error was greater in the prediction of basal area per hectare of medium trees
(around 10 m2/ha), and lower in the prediction of basal area per hectare of small trees
(around 6.5 m2/ha).
The technique that provided the best performance in basal area prediction was the k-NN
with the MSN method: the R2 to predict small, medium, large and all trees was 0.71, 0.48,
0.69 and 0.56 respectively.
Table 10 - Root Mean Square Error (RMSE), scaled Root Mean Square Error (scaled RMSE) and
determination coefficient (R2) for the prediction basal areas techniques.
RMSE (m2/ha) Scaled RMSE R2
Small trees 7.03 6.29 6.91 6.48 0.70 0.63 0.69 0.65 0.68 0.71 0.54 0.62
Medium trees 11.06 11.09 9.49 9.49 1.06 1.06 0.91 0.91 0.49 0.48 0.20 0.20
Large trees 8.74 8.37 8.56 8.43 0.67 0.64 0.66 0.65 0.68 0.69 0.58 0.58
All trees 11.16 10.65 8.82 8.86 0.93 0.89 0.73 0.74 0.54 0.56 0.48 0.47
88
European Journal of Remote Sensing - 2014, 47: 75-94
Discussion
To date, it often happens that the Italian forest technicians can exploit the CHM as a product
of ALS surveys made for other non-forestry purposes at low [Clementel et al., 2012] or even
no cost [Corona et al., 2012]. This availability has meant that the technicians, who deal with
forest management planning, familiarize themselves with this product. For these reasons, it
was considered essential to evaluate if this kind of limited lidar dataset could have application
to forest structural prediction, and consequently for mapping forest structural types.
The results of this study suggest that the lidar-derived predictor variables extracted from
CHMs may be used just for preliminary analyses within, for example, forest inventory
processes that require the recognition of forest structural types with the advantage of
reducing time-consuming and expensive fieldworks activities. In fact, the techniques used
both in supervised classification approach and prediction-based classification approach
did not produced satisfactory levels of classification accuracy. In this context, the non
parametric techniques produced more accurate classifications with respect to the parametric
techniques, and in particular the linear discriminant analysis provided the best results.
It is difficult to compare our results in terms of other related works, because in other
studies lidar metrics were derived from the normalized cloud points, anyway it seems to us
important to evaluate our results in relation to those obtained from researches with similar
goals where classification approaches were applied.
Zhang et al. [2011] obtained a higher level of overall accuracy (91.4%) performing the
linear discriminant analysis to classify five forest types in the Strzelecki Ranges (southeast
Victoria, Australia) using eighteen lidar-derived predictor variables from the normalized
returns. We are aware that the level of accuracy obtained in our study is not comparable to
the Zhang et al. [2011] results, but probably the high number of forest structure types we
wanted to classified (eleven vs. five) is a limiting factor of classification accuracy.
Chirici et al. [2013], to classify nine forest fuel types in the Mediterranean province of
Palermo and Catania (Italy) previously observed and identified by photo interpretation,
applied the RF technique using thirty-one ALS-based metrics calculated from the normalized
height returns. The overall accuracy obtained through this technique was 45%, very similar
to that obtained in this study.
Hudak et al. [2008], imputing plot-level basal area of eleven conifer species in Moscow
Mountain and St. Joe Woodlands in north-central Idaho (USA) with MSN and RF imputation
methods, obtained similar scaled RMSD to the present work (if the basal area of all trees
is considered).
Considering that the results we obtained are very similar to the mentioned studies, we can
say that our inference about the potential to use limited lidar data (i.e. CHM) to predict
forest structure is here confirmed: variables extracted from the CHM have the same
potentiality of those extracted from lidar point cloud datasets in predicting forest structure
types. Furthermore, forests of Trento province cover a wide range of structural conditions
which are not typical of this geographic area, are independent of site fertility and of climate
conditions, this means that the findings from this study can be extrapolated and applied to
other forests located outside the province of Trento.
We have to point out that the sample design of this study could have potentially influenced
the statistical scope of inference limiting the goodness of the results: the unbalanced number
of cases per forest structural type, due to the fact that in some cases the real forest structure
89
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
type didn’t match with those selected by the sampling, could have influenced the model
classification accuracy. In fact if the within forest type variability (internal variability) is
relatively high with respect to the between variability (external variability), then probably
the difference among the forest structural types is the result of the internal variability. The
availability of more data, acquired by future surveys carried out during the operations of
forest plans revision, will allow retesting the techniques considered in this work. In addition,
next investigations will consider families of structure that group together the twelve types
of the structural classification system: one option could be to take in consideration the
mono-stored forest stands (R11, R22, and R33), the bi-stored forest stands (R12, R13, R21,
R23, R31, and R32), and the multi-stored forest stands (I1, I2, and I3).
The methodology used to calculate the canopy cover of small, medium and large trees
through a diameter-height function working for all trees of the different forest structural
types, does not represent a limitation of this study. The uncertainty when assigning a tree to
a diameter class based only in its height by the diameter-height function subsists even on
plot level, but this method of cover estimation doesn’t pose a limitation in our analysis. In
fact, if we consider for example the cover of small trees, at a first analysis the cover of this
category of trees seems include the cover of medium and large trees, but each cover stratum
includes only the pixels which height value falls in the range of tree height modelled by the
diameter-height function and in this way we avoid the risk of double or triple count.
Hudak et al. [2008], instead of calling this predictor variable cover, called these variables
STRATUMn, they considered seven strata where for example STRATUM4 is the
percentage of vegetation returns >10 m and <= 20 m in height. The height interval can be
choice arbitrarily or can be define in some objective way, as in our study, anyway the cover
estimation does not represent a problem in the analysis we would carry out. Moreover,
considering that the CHM takes into account the outer canopy surface, it is more likely that
the CHM “blanket” represents the tallest part of the crown. Lidar data coming from ALS
campaigns that records more than first and last return per pulse allow to better see what
happens inside each stratum, for this reason forest technician should be involved in the
process of flight protocol definition.
Conclusion
We can conclude that the canopy structure metrics computed from the CHM and used
in the present study can be moderately useful for basal area imputation and hence in the
classification process.
At any rate, this study confirmed the fact that coupling lidar data and sample inventory data
we can support the forest management from multiple perspectives [Corona, 2010]. The
classification methods here investigated may be used to validate the preliminary expeditious
classification made by the foresters through a visual estimation during the forest inventory
operations.
In spite of LDA provided the best levels of accuracy, we suggest to use this technique with
sagacity when an high number of predictor variables is considered. In fact LDA is too
flexible in situations with hundreds of highly correlated predictor variables, and it is too
rigid in situations where the class boundaries in predictor space are complex and nonlinear
[Hastie et al., 1995]. On the contrary, RF algorithm can handle thousands of input variables
without variable deletion and runs efficiently on large data sets. It does not over fit, and
90
European Journal of Remote Sensing - 2014, 47: 75-94
there is no need for variable pre-selection [Strobl et al., 2007]. Moreover, RF is a flexible
classification algorithm that directly provides measures of variable importance (related to
the relevance of each variable in the classification process), that has the ability to model
complex interactions among predictor variables and include an algorithm to imputing
missing values.
References
Alberti G., Boscutti F., Pirotti F., Bertacco C., De Simon G., Sigura M., Cazorzi F., Bonfanti
P. (2013) - A LiDAR-based approach for a multi-purpose characterization of Alpine
forests: an Italian case study. iForest, 6: 156-168. doi: http://dx.doi.org/10.3832/
ifor0876-006.
Ancel P., Wilhelm M.E., Lacombe E. (1999) - Peuplements forestiers du massif Vosgien;
typologie et sylvicultures. 51 pp. CNPF.
Andersen H.E., Foster J.R., Reutebuch E. (2003) - Estimating forest structure parameters
on Fort Lewis Military Reservation using Airborne Laser Scanner (LIDAR) data.
Proceedings of the Second International Precision Forestry Symposium, Seattle,
Washington, June 15-17, 2003, pp. 45-53.
Axelsson P. (1999) - Processing of laser scanner data - algorithms and applications. Journal
of Photogrammetry and Remote Sensing, 54: 138-147. doi: http://dx.doi.org/10.1016/
S0924-2716(99)00008-8.
Barilotti A., Sepic F., Abramo E., Crosilla F. (2007) - Assessing the 3D structure of the
single crowns in mixed alpine forests. In: Stilla, U., Mayer, H., Rottensteiner, F., Heipke,
C., Hinz, S. (Eds.) PIA07. International Archives of Photogrammetry, Remote Sensing
and Spatial Information Sciences, XXXVI-3/W49A.
Breiman L. (2001) - Random Forests. Machine Learning, 45 (1): 5-32. doi: http://dx.doi.
org/10.1023/A:1010933404324.
Chirici G., Scotti R., Montaghi A., Barbati A., Cartisano R., Lopez G, Marchetti M.,
McRoberts R.E., Olsonn H., Corona P. (2013) - Stochastic gradient boosting classification
trees for forest fuel types mapping through airborne laser scanning and IRS LISS-III
imagery. International Journal of Applied Earth Observation and Geoinformation, 25:
87-97. doi: http://dx.doi.org/10.1016/j.jag.2013.04.006.
Clementel F., Colle G., Farruggia C., Floris A., Scrinzi G., Torresan C.(2012) - Estimating
forest timber volume by means of “low-cost” LiDAR data. Italian Journal of Remote
Sensing, 44 (1): 125-140. doi: http://dx.doi.org/10.5721/ItJRS201244110.
Corona P. (2010) - Integration of forest mapping and inventory to support forest management.
iForest, 3: 59-64. doi: http://dx.doi.org/10.3832/ifor0531-003.
Corona P., Cartisano R., Salvati R., Chirici G., Floris A., Di Martino P., Marchetti M.,
Scrinzi G., Clementel F., Travaglini D., Torresan C. (2012) - Airborne Laser Scanning
to support forest resource management under alpine, temperate and Mediterranean
environments in Italy. European Journal of Remote Sensing, 45: 27-37. doi: http://
dx.doi.org/10.5721/EuJRS20124503.
Crookston N.L., Finley A.O. (2008) - yaImpute: An R Package for kNN Imputation. Journal
of Statistical Software, 23 (10): 1-16.
Curtis R. (1967) - Height-diameter and height-diameter-age equations for second-growth
Douglas-fir. Forest Science, 13: 365-375.
91
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
Cutler R.D., Edwards T.C., Beard K.H., Cutler A., Hess K.H., Gibson J., Lawler J.J. (2007)
- Random forests for classification in ecology. Ecology, 88 (11): 2783-2792. doi: http://
dx.doi.org/10.1890/07-0539.1.
De Natale F., Pignatti G. (2011) - Estensione e composizione dei boschi italiani. In:
Gasparini, P., Tabacchi, G. (a cura di) 2011 - L’Inventario Nazionale delle Foreste e
dei serbatoi forestali di Carbonio INFC 2005. Secondo inventario forestale nazionale
italiano. Metodi e risultati. Ministero delle Politiche Agricole, Alimentari e Forestali,
Corpo Forestale dello Stato; Consiglio per la Ricerca e Sperimentazione in Agricoltura,
Unità di Ricerca per la Pianificazione ed il Monitoraggio e la Pianificazione Forestale.
Edagricole, Milano, pp. 99-111.
Dubayah R.O, Knox R.G, Hofton M.A., Blair J.B., Drake J.B. (2000) - Land surface
characterization using lidar remote sensing. Hill, M., Aspinall, R. (Eds.), Spatial
information for land use management, International Publishers Direct, Singapore.
Ferraz A., Bretar F., Jacquemound S., Gonçalves G., Pereira L., Tomé M., Soares P. (2012)
- 3-D mapping of multy-layered Mediterranea forest using ALS data. Remote Sensing of
Environment, 121: 210-223. doi: http://dx.doi.org/10.1016/j.rse.2012.01.020.
Hastie T., Buja A., Tibshirani R. (1995) - Penalized discriminant analysis. The Annals of
Statistics, 23 (1): 73-102. doi: http://dx.doi.org/10.1214/aos/1176324456.
Heurich M., Thoma F. (2008) - Estimation of forestry stand parameters using laser
scanning data in temperate, structurally rich natural European beech (Fagus sylvatica)
and Norway spruce (Picea abies) forests. Forestry, 81 (5): 645-661. doi: http://dx.doi.
org/10.1093/forestry/cpn038.
Hudak A.T., Crookston N.L., Evans J.S., Falkowski M.J., Smith A.M.S., Gessler P.E.,
Morgan P. (2006) - Regression modeling and mapping of coniferous forest basal area
and tree density from discrete-return lidar and multispectral satellite data. Canadian
Journal of Remote Sensing, 32 (2): 126-138. doi: http://dx.doi.org/10.5589/m06-007.
Hudak A.T., Crookston N.L., Evans J.S., Hall D.E., Falkowski M.J. (2008) - Nearest
neighbor imputation of species-level, plot-scale forest structure attributes from LiDAR
data. Remote Sensing of Environment, 112: 2232-2245. doi: http://dx.doi.org/10.1016/
j.rse.2007.10.009.
Kendall M.G., Buckland W.R. (1975) - A Dictionary of Statistics Terms. Third edition.
Longman Group Limited. 166 pp. ISBN 0 582 46304 1.
Lefsky M.A., Harding D., Cohen W.B., Parker G., Shugart H.H. (1999) - Surface Lidar
Remote Sensing of Basal Area and Biomass in Deciduous Forests of Eastern Maryland,
USA. Remote Sensing of Environment, 67:83-98. doi: http://dx.doi.org/10.1016/S0034-
4257(98)00071-6.
Lewin-Koh N.J., Bivand R. (2012) - Maptools: Tools for reading and handling spatial
objects. R package version 0.8-14.
Liaw A., Wiener M. (2002) - Classification and regression by randomForest. R News, 2
(3): 18-22, ISSN 1609-3631.
Lumley T. (2004) - Leaps: regression subset selection. R package version 2.7. Uses Fortran
code by Alan Miller.
Maltamo M., Malinen J., Kangas A., Härkönen S., Pasanen A.M. (2003) - Most similar
neighbor based stand variable estimation for use in inventory by compartments in
Finland. Forestry, 76 (4): 449-464. doi: http://dx.doi.org/10.1093/forestry/76.4.449.
92
European Journal of Remote Sensing - 2014, 47: 75-94
Maltamo M., Packalén P., Yu X., Eerikäinen K., Hyyppä J., Pitkänen J. (2005) - Identifing
and quintifying structural characteristics of heterogeneous borela forests using
laser scanner data. Forest Ecology and Management, 216: 41-50. doi: http://dx.doi.
org/10.1016/j.foreco.2005.05.034.
Means J.E., Acker S.A., Fitt B.J., Renslow M., Emerson L., Hendrix C.J. (2000) - Predicting
forest stand characteristics with airborne scanning lidar. Photogrammetric Engineering
& Remote Sensing, 66 (11): 1367-1371.
Means J.E., Acker S.A., Harding D.J, Blair J.B., Lefsky M.A., Cohen W.B., Harmon
M.E., McKee W.A. (1999) - Use of large-footprint scanning airborne lidar to estimate
forest stand characteristics in the Western Cascades of Oregon. Remote Sensing of
Environment, 67: 298-308. doi: http://dx.doi.org/10.1016/S0034-4257(98)00091-1.
Muinonen E., Maltamo M., Hyppanen H., Vainikainen V. (2001) - Forest stand
characteristics estimation using a most similar neighbor approach and image spatial
structure information. Remote Sensing of Environment, 78: 223-228. doi: http://dx.doi.
org/10.1016/S0034-4257(01)00220-6.
Næsset E. (1997) - Estimating timber volume of forest stands using airborne laser scanner
data. Remote Sensing of Environment, 61: 246-253. doi: http://dx.doi.org/10.1016/
S0034-4257(97)00041-2.
Næsset E. (2002) - Predicting forest stand characteristics with airborne scanning laser
using a practical two-stage procedure and field data. Remote Sensing of Environment,
80: 88-99. doi: http://dx.doi.org/10.1016/S0034-4257(01)00290-5.
Odasso M. (2002) - I tipi forestali del Trentino. Report del Centro di Ecologia Alpina, 25, 192 pp.
Ok A.O., Akar O., Gungor O. (2012) - Evaluation of random forest method fro agricultural
crop classification. European Journal of Remote Sensing, 45:421-432. doi: http://dx.doi.
org/10.5721/EuJRS20124535.
Reimann C., Filzmoser P., Garrett R., Dutter R. (2008) - Statistical Data Analysis Explained:
Applied Environmental Statistics with R. WILEY, 2008, 362 pp. ISBN 047098581X.
Riaño D., Valladares F., Condésand S., Chuvieco E. (2004) - Estimation of leaf area index
and covered ground from airborne laser scanner (Lidar) in two contrasting forests.
Agricultural and Forest Meteorology, 124: 269-275. doi: http://dx.doi.org/10.1016/
j.agrformet.2004.02.005.
Schwarz G. (1978) - Estimating the dimension of a model. Annals of Statistics, 6: 461-464.
doi: http://dx.doi.org/10.1214/aos/1176344136.
Scrinzi G., Clementel F., Colle G., Farruggia C., Floris A., Torresan C. (2011) - L’inventario
dendromemtrico nella nuova pianificazione forestale aziendale trentina (NPFAT).
Provincia Autonoma di Trento - Servizio Foreste e Fauna, Consiglio per la Ricerca e la
Sperimentazione in Agricoltura, 105 pp. ISBN 978-88-7702-322-3.
Staudhammer C.L., LeMay V.M. (2001) - Introduction and evaluation of possible indices
of stand structural diversity. Canadian Journal of Forest Research, 31: 1105-1115. doi:
http://dx.doi.org/10.1139/cjfr-31-7-1105.
Strobl C., Boulesteix A.L., Augustin T. (2007) - Unbiased split selection for classification
trees based on the Gini index. Computational Statistics & Data Analysis, 52 (1): 483-
501. doi: http://dx.doi.org/10.1016/j.csda.2006.12.030.
Suárez J., Fonweban J., Gardiner B. (2012) - Supporting precision forestry in Great Britain.
Earthzine (http://www.earthzine.org/2012/06/22/supporting-precision-forestry-in-great-
93
Torresan et al. Comparing statistical techniques to classify forest structure using CHM-derived metrics
britain/).
Travaglini D., Bottalico F., Brundu P., Chirici G., Minari E. (2007) - Testing neighborhood-
based indices for monitoring spatial forest stand structure. In: Gianelle,D., Travaglini
D., Mason F., Minari E., Chirici G., Chemini C. (Eds.), 2007. Canopy Analysis and
Dynamics of a Floodplain Forest. Rapporti Scientifici, 3. Centro Nazionale per lo studio
e la conservazione della Biodiversità Forestale - Bosco della Fontana. Cierre Grafica
Editore, Verona, pp. 96.
Venables W.N., Ripley B.D. (2002) - Modern Applied Statistics with S. Fourth Edition.
Springer, New York. doi: http://dx.doi.org/10.1007/978-0-387-21706-2.
Zhang Z., Liu X., Peterson J., Wright W. (2011) - Statistical analysis of airborne LiDAR
data for forest classification in the Strzelecki Ranges, Victoria, Australia. In: MODSIM
2011. Sustaining Our Future: Understanding and Living with Uncertainty, 12-16 Dec
2011, Perth, Australia.
Zimble D.A., Evans D.L., Carlson G.C., Parker R.C., Grado S.C., Gerard P.D. (2003)
- Characterizing vertical forest structure using small-footprint airborne LiDAR.
Remote Sensing of Environment, 87:171-182. doi: http://dx.doi.org/10.1016/S0034-
4257(03)00139-1.
94