Advances in Meteorology - 2016 - Campozano - Comparison of Statistical Downscaling Methods For Monthly Total Precipitation
Advances in Meteorology - 2016 - Campozano - Comparison of Statistical Downscaling Methods For Monthly Total Precipitation
Advances in Meteorology
Volume 2016, Article ID 6526341, 13 pages
http://dx.doi.org/10.1155/2016/6526341
Research Article
Comparison of Statistical Downscaling Methods for
Monthly Total Precipitation: Case Study for the Paute River
Basin in Southern Ecuador
Copyright © 2016 L. Campozano et al. This is an open access article distributed under the Creative Commons Attribution License,
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Downscaling improves considerably the results of General Circulation Models (GCMs). However, little information is available
on the performance of downscaling methods in the Andean mountain region. The paper presents the downscaling of monthly
precipitation estimates of the NCEP/NCAR reanalysis 1 applying the statistical downscaling model (SDSM), artificial neural
networks (ANNs), and the least squares support vector machines (LS-SVM) approach. Downscaled monthly precipitation estimates
after bias and variance correction were compared to the median and variance of the 30-year observations of 5 climate stations in the
Paute River basin in southern Ecuador, one of Ecuador’s main river basins. A preliminary comparison revealed that both artificial
intelligence methods, ANN and LS-SVM, performed equally. Results disclosed that ANN and LS-SVM methods depict, in general,
better skills in comparison to SDSM. However, in some months, SDSM estimates matched the median and variance of the observed
monthly precipitation depths better. Since synoptic variables do not always present local conditions, particularly in the period going
from September to December, it is recommended for future studies to refine estimates of downscaling, for example, by combining
dynamic and statistical methods, or to select sets of synoptic predictors for specific months or seasons.
Code Station Regime Latitude (∘ ) Longitude (∘ ) Elevation (m asl) Annual precipitation (mm)
M141 El Labrado TM −2.732 −79.073 3335 1286
M139 Gualaceo BM −2.882 −78.776 2230 820
M138 Paute TM −2.800 −78.763 2194 750
M045 Palmas UM −2.716 −78.630 2400 1341
M137 Biblián BM −2.709 −78.892 2640 1001
development of multiple, low cost, single-site scenarios of 79∘ 30 0 W 79∘ 0 0 W 78∘ 30 0 W
daily surface weather variables and is considered as a stochas- 2∘ 0 0 S 2∘ 0 0 S
tic weather generator on a daily scale. The limited use of
this technique in the Andean mountain region is perhaps
the consequence of its complex topography, location in the
tropical zone, and the influence of the warm and cold ENSO Ecuador
2∘ 30 0 S 2∘ 30 0 S
phases, El Niño and La Niña, respectively. All these influences
add complexity to the atmospheric processes, making the M141 M137 M045
# #
#
representation by downscaling techniques more difficult. # M138
# M139
Artificial neural networks (ANNs) are another SD technique
commonly used, however, to a limited extent, in the Andes 3∘ 0 0 S 3∘ 0 0 S
region. As stated by [4], the success of this technique is
primarily due to its ability to map highly nonlinear relations
between inputs and outputs of the model. Reference [5]
applied ANN for downscaling monthly precipitation and 79∘ 30 0 W 79∘ 0 0 W 78∘ 30 0 W
temperature in the Paute basin in the Andes of Ecuador. # Stations
The application of ANN based SD was conducted in order Paute basin
to evaluate the performance on seasonality representation
against DD using a regional climate model, Weather Research Figure 1: The study area Paute basin in the Andes of Ecuador.
and Forecasting, WRF, model [6]. With respect to rainfall
representation, they found that although both downscaling
approaches represent qualitatively well seasonality in this
2. Study Area and Data
highly complex terrain, ANN estimates of rainfall were
more accurate than WRF estimates. This fact highlights The Paute River basin, tributary of the Amazon basin and
the applicability of ANN when the understanding of the 6148 km2 in size, was selected for the comparative evaluation
processes involved is not required. Another technique with of the downscaling methods, a basin located between the
recent application for the downscaling of GCMs is SVM, eastern and western cordillera of the Andes in Ecuador. The
support vector machines [7]. In particular the least squares basin is characterized by a high spatial and temporal variation
support vector machines, LS-SVM [8], downscale GCM out- in precipitation that broadly can be classified into three
put even better, mainly due to the reduction of the optimiza- rainfall regimes, respectively, subregions with a uni-, bi-, and
tion problem to the resolution of a linear system reducing three-modal precipitation pattern [10, 11]. Data of 5 rainfall
considerably the computational requirements. Despite SD stations (see Table 1) were used of whom the geographical
methods presenting greater accuracy than DD methods, from distribution is given in Figure 1. The measured monthly
a statistical point of view, they possess two handicaps: bias rainfall for the 30-year period 01-1980 to 12-2009 was used
and low variance. Normally, the quantile mapping (QM) to quantify the differences in the performance of the selected
technique [9] is applied to improve the representation of the downscaling methods. The dataset was split in a first set for
distribution of derived applications. the calibration of the methods, encompassing 75% of the total
This paper presents a comparative evaluation of down- dataset, and the remaining 25% was used for the validation.
scaled GCM estimates of monthly precipitation at the scale The results presented herein belong all to the validation set.
of a large river basin situated in the Andean mountain region Quality control of the data was performed using double mass
in southern Ecuador, applying SDSM and two artificial intel- curves on the time series with gaps not exceeding 20% of the
ligence (AI) techniques: artificial neural networks (ANNs) observations. The infilling of the data was accomplished using
and the least squares support vector machines (LS-SVM) multiple linear regression with stations with higher Pearson
approach. The downscaled results were corrected for bias and correlation [12].
variance inflation applying the QM technique prior to the NCEP/NCAR reanalysis 1 data [13] with 2.5∘ × 2.5∘
comparative analysis. For the evaluation, historic data was resolution was used as input. The selected synoptic predictors
used. for the SDSM and AI models are presented in Table 2.
1306, 2016, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2016/6526341 by Readcube (Labtiva Inc.), Wiley Online Library on [03/02/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Advances in Meteorology 3
ANN/LS-SVM SDSM
# Synoptic predictors
(All stations) El Labrado Gualaceo Paute Palmas Biblián
1 Precipitation ∗ ∗ ∗ ∗ ∗ ∗
2 Pressure (surface) ∗ ∗ ∗ ∗
3 Relative humidity (surface) ∗ ∗ ∗ ∗ ∗
4 Specific humidity 700 hPa ∗
5 Sea level pressure ∗
6 Temperature 2 m ∗ ∗ ∗ ∗ ∗
7 Potential temperature 700 hPa ∗ ∗ ∗ ∗
8 Zonal wind (surface) ∗
9 Omega 500 hPa ∗ ∗ ∗
10 Geopotential height 200 hPa ∗ ∗
11 Geopotential height 500 hPa ∗
12 Geopotential height 850 hPa ∗ ∗
Table 3: Artificial intelligence models. Finally the LS-SVM model for function estimation is
Model Type Transfer function 𝑁
1 Linear & 1 hidden layer 𝑦 (𝑥) = ∑ 𝛼𝑖 𝐾 (𝑥𝑖 , 𝑥) + 𝑏. (7)
2 Sigmoidal & 1 hidden layer 𝑖=1
Ann
3 Linear & 2 hidden layers
A Bayesian framework with three levels of inference was
4 Sigmoidal & 2 hidden layers
developed for the optimization of parameters [8]. The LS-
5 Linear SVMlab tool [17] developed in Matlab applying three Kernel
6 LS-SVM Polynomial tuning options (linear, polynomial, and RBF) was used within
7 Radial basis functions the ensemble of AI methods.
Table 4: Quantile mapping parameters for artificial intelligence and Table 5: Statistical metrics for ANN and LS-SVM ensembles.
SDSM ensembles.
Station Metric ANN LS-SVM
Parameters Pearson correlation 0.49 0.58
Model Stations
𝑎 𝑏 𝑥0 IRF 0.59 0.53
Biblián 0.03280 1.83727 0.87053 El Labrado Mean-bias 4.20 6.16
El Labrado 0.00820 1.98426 −8.57836 Cum bias 31.63 37.79
AI Palmas 0.00334 2.21628 6.04480 RMSE 40.73 37.63
Gualaceo 0.31943 1.40124 16.92140 Pearson correlation 0.70 0.71
Paute 0.02791 1.94766 0.03257 IRF 0.55 0.40
Biblián 0.00986 2.08840 3.21195 Gualaceo Mean-bias −2.29 −0.21
El Labrado 0.39313 1.19070 −45.84738 Cum bias 33.41 41.70
SDSM Palmas 0.00056 2.46774 −13.06023 RMSE 34.82 35.33
Gualaceo 0.00077 2.62716 −1.14051 Pearson correlation 0.55 0.54
Paute 0.04141 1.80403 3.91130 IRF 0.41 0.31
Paute Mean-bias −9.90 −8.34
Cum bias 44.16 48.21
As statistical metrics the following were used:
RMSE 31.61 31.04
(1) Pearson correlation (𝑅): to evaluate the linear corre- Pearson correlation 0.25 0.45
lation. IRF 0.45 0.53
(2) Mean bias (MB): to evaluate the mean (the 50th Palmas Mean-bias −1.85 7.87
percentile) difference between modeled and observed Cum bias 36.99 37.81
distributions. RMSE 52.44 47.51
(3) The root mean squared error (RMSE): to evaluate the Pearson correlation 0.67 0.65
error between modeled and observed time series. IRF 0.57 0.54
(4) The interquartile relative fraction (IRF): to evaluate Biblián Mean-bias −18.39 −7.05
the modeled variability representation relative to the Cum bias 52.34 35.49
observed: RMSE 44.75 40.97
𝑄3𝑚 − 𝑄1𝑚
IRF = , (9)
𝑄3𝑜 − 𝑄1𝑜 into one ensemble considering both models as transfer
function based models. Although previous studies [19, 20]
where IRF is the interquartile relative fraction. A demonstrated the relative superior performance of SVM
value of IRF > 1 represents overestimation of the based downscaling methods over other approaches including
variability, IRF = 1 is a perfect representation of the ANN based methods, we compared for the first time in the
variability, and IRF < 1 is an underestimation of region, to the best of our knowledge, an ANN ensemble
the variability; 𝑄3𝑚 and 𝑄3𝑜 and the 75th modeled and against a LS-SVM ensemble to evaluate their downscaling
observed percentile; 𝑄1𝑚 and 𝑄1𝑜 and the 25th modeled performance. The ensembles were not bias corrected in order
and observed percentile. to evaluate their actual performance. Table 5 presents the
(5) The absolute cumulative bias (ACB): to evaluate the values of 𝑅, RMSE, MB, IRF, and ACB comparing ANN
bias of the 25th, 50th, and 75th percentiles; or and LS-SVM ensembles. In El Labrado and Paute station
similar results of both ensembles are obtained. However,
ACB = 𝑄1𝑚 − 𝑄1𝑜 + 𝑄2𝑚 − 𝑄2𝑜 + 𝑄3𝑚 − 𝑄3𝑜 , (10) both ensembles for Gualaceo station underrepresent the
variability. IRF is 0.55 and 0.40 for ANN and LS-SVM,
where ACB is the absolute cumulative bias. A value respectively. Therefore LS-SVM represents 15% less variability
of ACB = 0 is a perfect representation of the three than ANN ensemble. The MB for ANN is −2.29 mm whereas
percentiles (resp., the 25th, 50th, and 75th percentile) for LS-SVM it is −0.21 mm, which in monthly scale is very
of modeled and observed distributions, while under- low. The ACB metric for ANN is 33.41 mm whereas for LS-
or overestimation indicates a divergence of ACB from SVM it is 41.70 mm, meaning that although the MB is lower
zero to positive values. for LS-SVM, the bias in the 25th and 75th percentiles is higher
than for the ANN ensemble. In Palmas station 𝑅 is 0.25 and
4. Results and Discussion 0.45, respectively, for the ANN and LS-SVM ensembles. IRF
values of 0.45 and 0.53 for ANN and LS-SVM are obtained,
4.1. Evaluation of SDSM and AI Ensembles indicating a greater representation in variability by the latter
approach. However, MB values of −1.85 and 7.87 mean that
4.1.1. Relative Performance of ANN and LS-SVM Models. As LS-SVM presents more bias in the 50th percentile than ANN.
mentioned before, ANN and LS-SVM models were grouped For Biblián station, MB is −18.39 mm and −7.05 mm and ACB
1306, 2016, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2016/6526341 by Readcube (Labtiva Inc.), Wiley Online Library on [03/02/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
6 Advances in Meteorology
0
20
40
60
80
100
120
140
160
180
200
0
20
40
60
80
100
120
140
160
180
200
0
20
40
60
80
100
120
140
160
180
200
January January January
February February February
SDSM
SDSM
SDSM
March March March
Obs._Paute
SDSM_QM
SDSM_QM
SDSM_QM
April April April
Obs._Biblian
Obs._El Labrado
May May May
(e)
(c)
(a)
June June June
July July July
August August August
September September September
October October October
November November November
December December December
0
20
40
60
80
100
120
140
160
180
200
0
20
40
60
80
100
120
140
160
180
200
Figure 4: Continued.
January January January
AI
February February February
SDSM
SDSM
AI_QM
March March March
SDSM_QM
SDSM_QM
April April
Obs._Palmas
April
Obs._Gualaceo
Obs._El Labrado
May May May
(b)
(f)
(d)
June June June
July July July
August August August
September September September
October October October
November November November
December December December
Advances in Meteorology
1306, 2016, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2016/6526341 by Readcube (Labtiva Inc.), Wiley Online Library on [03/02/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
1306, 2016, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2016/6526341 by Readcube (Labtiva Inc.), Wiley Online Library on [03/02/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Advances in Meteorology 9
200 200
180 180
160 160
Precipitation (mm)
140 140
Precipitation (mm)
120 120
100 100
80 80
60 60
40 40
20 20
0 0
October
March
April
June
August
September
November
December
January
February
May
July
October
March
April
June
August
September
November
December
January
February
May
July
Obs._Gualaceo Obs._Paute
AI AI
AI_QM AI_QM
(g) (h)
200 200
180 180
160 160
Precipitation (mm)
140 140
Precipitation (mm)
120 120
100 100
80 80
60 60
40 40
20 20
0 0
October
March
April
June
August
September
November
December
January
February
May
July
October
March
April
June
August
September
November
December
January
February
May
July
Obs._Palmas ́
Obs._Biblian
AI AI
AI_QM AI_QM
(i) (j)
Figure 4: Multiyear monthly mean precipitation for SDSM ensemble and SDSM ensemble bias corrected (a–e) and multiyear monthly mean
precipitation for AI and AI bias corrected (f–j).
AI ensemble represents poorly the distribution of the Palmas Table 7: Pearson correlations between observations and SDSM QM
station (Figure 4(i), UM regime), even after QM application. and AI QM models.
For the Biblián station (Figure 4(j)) the AI QM captures the
Station AI QM SDSM QM
April peak one month earlier but fails to correctly depict the
El Labrado 0.741 0.516
magnitude of the November peak.
Results clearly reveal that the application of QM to Gualaceo 0.946 0.672
the output of both modeling approaches, SDSM and AI, Paute 0.730 0.629
overall improves the representation of seasonality, as well Palmas 0.788 0.593
as the representation of rainy and dry periods. However, Biblián 0.691 0.532
both approaches underestimate the median value of the
precipitation depth in November. This fact could indicate that
the set of synoptic predictors do not include a variable that 4.2.2. Representation of Monthly Variability by Downscaled
is related to an enhancement of precipitation in this period. Results. To compare the representativeness of SDSM QM
Further studies are needed to determine the variables and and AI QM, the correlation of the mymm time series
related phenomena. with the observed values is shown in Table 7. For all
10
0
20
40
60
80
100
120
140
160
180
200
200
200
0
20
40
60
80
100
120
140
160
180
0
20
40
60
80
100
120
140
160
180
January January January
February February February
AI_QM
AI_QM
AI_QM
March March March
Obs._Paute
SDSM_QM
SDSM_QM
SDSM_QM
April April April
Obs._Gualaceo
Obs._El Labrado
May May May
(a)
(e)
(c)
June June June
July July July
August August August
September September September
October October October
November November November
December December December
200
250
200
250
0
50
100
150
0
50
100
150
January January January
Figure 5: Continued.
February February February
AI_QM
AI_QM
AI_QM
March March March
SDSM_QM
SDSM_QM
SDSM_QM
Observations
Observations
Observations
(f)
(b)
(d)
June June June
July July July
August August August
September September September
October October October
November November November
December December December
Advances in Meteorology
1306, 2016, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2016/6526341 by Readcube (Labtiva Inc.), Wiley Online Library on [03/02/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
1306, 2016, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2016/6526341 by Readcube (Labtiva Inc.), Wiley Online Library on [03/02/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
Advances in Meteorology 11
200 350
180
300
160
Precipitation (mm)
140 250
Precipitation (mm)
120
200
100
80 150
60 100
40
20 50
0 0
October
October
March
April
June
August
September
November
December
March
April
June
August
September
November
December
January
February
May
July
February
May
July
January
Obs._Palmas Observations
AI_QM SDSM_QM
SDSM_QM AI_QM
(g) (h)
200 250
180
160 200
Precipitation (mm)
140
Precipitation (mm)
120 150
100
80 100
60
40 50
20
0 0
October
March
April
June
August
September
November
December
February
May
July
January
October
March
April
June
August
September
November
December
February
May
July
January
Obs._Biblián Observations
AI_QM SDSM_QM
SDSM_QM AI_QM
(i) (j)
Figure 5: Comparison of SDSM and AI ensembles bias corrected (a, c, e, g, i) and Box-Whisker plots for SDSM and AI ensembles bias
corrected, from January through December (b, d, f, h, j).
stations AI QM presents greater Pearson correlation coeffi- for SD should be based on several models. The median
cient than SDSM QM. The multiyear median observed and and interquartile range are relatively well captured for the
estimated monthly precipitation depth, using, respectively, Gualaceo station (Figures 5(c) and 5(d)); the variability of
the SDSM QM and AI QM model ensembles are presented the months from January to September is similar for the
in Figures 5(a), 5(c), 5(e), 5(g), and 5(i). For the graphical SDSM-QM and AI QM estimates. October and November
presentation the Box-Whisker plot type was selected, as to variability is different for the two models, but the median
show in addition to the median the variation in estimates (see is well represented. The variability and the median are
Figures 5(b), 5(d), 5(f), 5(h), and 5(j)). better represented by AI QM and slightly overestimated by
For the El Labrado station (Figures 5(a) and 5(b)) the SDSM QM in the period of June to August. Figures 5(e)
observed interquartile range is higher for the period of and 5(f) depict the results for the Paute station, illustrating
January to April, with lower values in August and September. that both models relatively well represent the median and
It is worthwhile noticing that although AI QM captures interquartile ranges in the period of January to September but
seasonality, the intra-annual variability is not captured. Even fail to do so for the period of October to December. This fact
for some months SDSM QM captures the variability better, highlights the need to further explore the relation between
as is the case for March. This fact suggests that an assessment the synoptic conditions and rainfall. Neither the SDSM QM
1306, 2016, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2016/6526341 by Readcube (Labtiva Inc.), Wiley Online Library on [03/02/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
12 Advances in Meteorology