Paper 4
Paper 4
A Case Study
1
Department of Civil Engineering, Tamkang University, New Taipei City 251,
Taiwan; email: [email protected]
2
Department of International Trade, Chihlee Institute of Technology, New Taipei
City 220, Taiwan; email: [email protected]
3
T.Y.Lin Taiwan, Inc., Taipei City 106, Taiwan; email: [email protected]
ABSTRACT
Artificial neural networks (ANN) have been used in many pavement prediction
modeling analyses. However, the convergence characteristics and model selection
guidelines are rarely studied due to the requirement of extensive network training
time. Thus, the techniques and applications of back propagation neural networks
were briefly reviewed. Three ANN models were developed using deflection
databases generated by factorial BISAR runs. A study of the convergence
characteristics indicated that the resulting ANN model using all dominating
dimensionless parameters was proved to have higher accuracy and require less
network training time and data than the other counterpart using purely input
parameters. Increasing the complexity of ANN models does not necessarily
improve the modeling statistics. With the incorporation of subject-related
engineering and statistical knowledge into the modeling process, reasonably good
predictions may be achieved with more convincing generalization and explanation
yet requiring minimal amount of time and effort.
INTRODUCTION
Predictive models have been widely used in various pavement design procedures,
evaluation, rehabilitation, and network management systems. Empirical and
mechanistic-empirical approaches using statistical regression techniques have been
utilized extensively in predicting extremely complicated pavement responses and
performance indicators for more than four decades. Using purely empirical
concepts to develop predictive models is not recommended. Lee (1993) proposed a
systematic statistical and engineering modeling approach which strongly
recommends to incorporate theoretical engineering knowledge, expert experience,
heuristics, and statistical data analysis and regression techniques altogether into the
framework to develop more mechanistic-based predictive models. In additional to
the conventional “parametric” linear and nonlinear regression techniques, several
ingenious iterative regression techniques in the area of “robust” and
“nonparametric” regressions were also incorporated. The proposed approach has
been successfully implemented in the development of many purely mechanistic
(Lee, 1993; Lee & Darter, 1994) and purely empirical predictive models (Lee &
Darter, 1995), as well as the mechanistic -empirical predictive models adopted in
the early analyses of LTPP general pavement studies data (Simpson et al., 1993).
289
simulated data using artificial neural networks (ANN). However, the convergence
characteristics and model selection guidelines are rarely studied due to the
requirement of extensive network training time. As part of continuous research
efforts in pavement design and analysis (Lee & Darter, 1994; Lee et al., 2004), the
techniques and applications of back propagation neural networks were briefly
reviewed. Factorial BISAR runs for different pavement systems are conducted to
generate the deflection databases for the analysis. Artificial neural networks were
utilized to improve the prediction accuracy of simulated pavement deflections (Liu,
2004). This study strives to illustrate the benefits of incorporating the principles of
dimensional analysis, subject-related knowledge, and statistical knowledge into
modeling process.
Back propagation networks (BPN) can be taught from one data space to another
using representative set of data to be learned. The learning process actually refers to
Challenges and Advances in Sustainable Transportation Systems
CHALLENGES AND ADVANCES IN SUSTAINABLE TRANSPORTATION SYSTEMS 291
identified for a three-layer pavement system: E1/E2, E2/E3, h1/h2, and a/h2. In which,
Downloaded from ascelibrary.org by University of California, San Diego on 12/19/15. Copyright ASCE. For personal use only; all rights reserved.
a is the radius of the applied load, (L); h1 and h2 are the thickness of the surface and
base layers, (L); E1, E2, and E3 are the Young’s moduli of the surface layer, base
layer, and subgrade, respectively, (FL-2). A series of factorial BISAR runs was
conducted with the following ranges to cover most practical pavement data:
0.5≦E1/E2≦170, 0.5≦E2/E3≦170, 0.2≦h1/h2≦2.4, and 0.5≦a/h2≦5.0. A BASIC
program written by Dr. Alaeddin Mohseni was used to automatically generate the
input files and summarize the results to avoid untraced human errors. A pavement
response database including the aforementioned dimensionless variables,
deflections at the center of load (D0), horizontal strain (ɛt and vertical strain (ɛv) at
the bottom of the surface layer was obtained. A training database with 3,600 data
points and an independent testing database with 1,728 data points were used in this
study (Liu, 2004).
Development of ANN Models for Flexible Pavements. The training database was
randomly separated into 3,400 data points for actual training and the remaining 200
observations for monitoring the training process. Hyperbolic tangent activation
function was chosen in this case study (Vesta Services, Inc., 2000). The learning
rate was set as 0.01. At the first trial (NET1) as shown in Table 1, no transformation
was made on both explanatory and response variables. In which, the dependent
variable is the maximum deflection (D0), whereas the explanatory variables include
E1/E2, E2/E3, h1/h2, and a/h2. Nevertheless, extreme difficulty was encountered in
obtaining reasonable convergence. Several other attempts were also conducted
using simply the pertinent input parameters (such as a, h1, h2, E1, E2, and E3) as the
explanatory variables, but the results of ANN prediction modeling were even
worse.
Based on the basic assumptions of conventional regression techniques that
the random errors are mutually uncorrelated and normally distributed with zero
mean and constant variance, and additive and independent of the expectation
function, it is desirable to check the normality of the response variable. The Box and
Cox (1964) transformation procedure was adopted to find the approximate power
transformation of the response variable (D0). The S-PLUS Statistical Analysis
Software (Mathsoft, Inc., 1997) was used for this analysis. The maximum
likelihood estimator λ of various power transformations (Weisberg, 1985) was
approximately 0 indicating that a logarithm transformation was appropriate for D0.
The normal Q-Q plot which graphically compares the distribution of log(D0) to the
normal distribution represented by a straight line. This indicates that the logarithm
of D0 is approximate to normally-distributed. In the second trial (NET2),
convergence was obtained though the number of learning cycles and modeling time
were still very high. The root mean squared (RMS) errors were computed
accordingly.
According to general statistical principles or using the alternating
conditional expectations (ACE) algorithm (Mathsoft, Inc., 1997; Breiman &
Friedman, 1985) together with the Box-Cox power transformation technique
proposed by Lee (1993), logarithm transformations of D0, E1/E2, and E2/E3 were
recommended for NET3 model. As shown in Table 1, with more statistical
knowledge incorporated into the ANN modeling process, the resulting ANN model
was proved to have higher accuracy and less network training time than the other
counterpart using purely input parameters. Figure 1(a) and 1(b) depict the network
convergence results for NET2 and NET3 during the training process. The goodness
Downloaded from ascelibrary.org by University of California, San Diego on 12/19/15. Copyright ASCE. For personal use only; all rights reserved.
of the prediction of log(D0) for NET2 and NET3 were also provided in Figure 1(c)
and 1(d) during the testing phase. The goodness of the prediction of D0 for NET2
and NET3 were also provided in Figure 1(e) and 1(f) during the testing phase. With
more statistical knowledge incorporated into the modeling process, the resulting
ANN model was proved to have higher accuracy and less network training time
than the other counterpart using purely input parameters.
CONCLUSIONS
A case study was conducted to illustrate the benefits of incorporating statistical and
subject-related knowledge into pavement prediction modeling process. The
resulting ANN model using all dominating dimensionless parameters was proved to
have higher accuracy and require less network training time and data than the other
counterparts using purely input parameters. Increasing the complexity of ANN
models does not necessarily improve the modeling statistics. The results also
showed that using higher number of neurons and hidden layers sometimes lead to
even worse modeling statistics which was an indication of over training and should
be avoided. Statistical and subject-related knowledge can be used to guide
modeling in most real-world problems and so enable much more convincing
generalization and explanation, in ways which can never be done by ‘black-box’
learning systems (7).
ACKNOWLEDGMENTS
The financial support provided by the National Science Council, Taiwan is
gratefully acknowledged.
0.20
0.032
0.16
0.024
0.12
RMS
RMS
0.016
0.08
0.008
0.04
0.000
0.00
0 40000 80000 120000 160000 200000 0 5400 10800 16200 21600 27000
Iteration Iteration
-3.0
-2.5
-2.5
Log Predicted Deflection(in.)
-2.0
-1.5
-1.5
-1.0
-1.0
-0.5
-0.5
0.0
0.0
0.0 -0.5 -1.0 -1.5 -2.0 -2.5 -3.0 0.0 -0.5 -1.0 -1.5 -2.0 -2.5 -3.0
Log Acturl Deflection(in.) Log Acturl Deflection(in.)
0.6
Predicted Deflection(in.)
Predicted Deflection (in.)
0.6
0.4
0.4
0.2
0.2
0.0
0.0
REFERENCES
Ceylan, H. (2004). “Use of artificial neural networks for the analysis & design of
Downloaded from ascelibrary.org by University of California, San Diego on 12/19/15. Copyright ASCE. For personal use only; all rights reserved.