Application of ARIMAX Model
Application of ARIMAX Model
Vigneswaran(3)
Abstract: In the post-war climes, the government’s two cropping seasons in the district corresponding
mandates focus on reviving one of the paddy with the northeast monsoon, or Maha season, and the
production region Trincomalee district in SriLanka to south-west monsoon, or Yala season. This dry-zone
meet the growing demands of the nation. Such a district show a significant relationship between annual
Paddy production and rainfall in both Maha and Yala
rehabilitation program requires the understanding of
seasons(Table3) (Yoshino, 1984b: 95). In regard to
how the paddy producing industry has fared along the
Yoshino’s Sri Lankan data, he confirmed the generality
historical time-lines. This understanding is essential
of the relationship between rainfall and the planted
for developing the necessary development plans for the area.
Rice sector.
Applications of time series technique ARIMA
When an ARIMA model includes other time series as have been used to model for forecasting agriculture
input variables, the model is referred to as an ARIMAX product. Some of research papers are Applying
model. Pankratz (1991). In this paper, ARIMAX model ARIMA models are Hossian et al. (2006) forecasted
has been applied to forecast annual paddy production three different varieties of pulse prices namely motor,
with includes rainfall time series as input variable for mash and mung in Bangladesh with monthly data
both seasons in this district. The validity of the model from Jan 1998 to Dec 2000; Wankhade et al. (2010)
is verified with various model selection criteria such as Forecasted pigeon pea production in India with annual
Adj R2, minimum of AIC and SBC lowest MAPE data from 1950-1951 to 2007-2008; Mandal (2005)
forecasted sugarcane production in India; Iqbal et al.
values.
(2005) forecasted area and production of wheat in
Keywords: ARIMAX model; Forecasting; AIC; SBC; Pakistan; Masuda and Goldsmith (2009) forecasted
MAPE world Soybean productions; Cooray (2006) forecasted
Sri Lanka’s monthly total production of tea beyond
Sept 1988 using monthly data from January 1988 to
Introduction September 2004. With these exceptions, there is
paucity of studies regarding applications of ARIMA
Rice is the most extensively cultivated crop in
model for forecasting agricultural products
Trincomalee district in SriLanka. Due to the unsettled
situation for last two decades in this district the rice When an ARIMA model includes other time
sector entered the difficult stage of development and series as input variables, the model is referred to as an
faces adjustment problems(IRI). Attention should be ARIMAX model. Pankratz (1991). In this paper,
drawn to the fact that there is room for further ARIMAX model has been applied to forecast annual
improvement by finding ways and means of fully paddy production with includes rainfall time series as
utilizing the general cultivable area in this district. input variable for both seasons in this district.This
paper applies Autoregressive Integrated Moving
In this paper rainfall and annual paddy
Average (ARIMAX) forecasting model, the most
production are time series event processes. There are
[21]
Proceedings of the Third International Symposium,
SEUSL: 6-7 July 2013, Oluvil, Sri Lanka
popular and widely used forecasting models for uni- Data Collection and Arrangement
variate time series data. Although it is applied across
various functional areas, it’s application is very limited The monthly rainfall and the yearly paddy
in agriculture, mainly due to unavailability of required production data from the period of 1970 to 2010
data and also due to the fact that agricultural product collected by the Trincomalee meteorological station
depends typically on monsoonal rain and other factors, and the Statistical Abstract release 1992, 1996, 2000,
which the ARIMA models failed to incorporate 2004, 2008 and 2010 by the Department of Census and
Statistics are used. These data are used to explore
annual climatic trend of paddy production and
Materials and Methods seasonal variation of Rainfall of the district. These data
are used for driving the Auto Regressive Integrated
The existing study applies Box-Jenkins (1970) Moving Average (ARIMAX) models with rainfall as
forecasting model popularly known as ARIMA model. input variables for both seasons. The collected data will
The ARIMA is an extrapolation method, which be divided into two sets, calibration data and validating
requires historical time series data of underlying data, in order to testify the performance of the
variable. The ARIMA approach was first popularized suggested model.
by Box and Jenkins, and ARIMA models are often
referred to as Box-Jenkins models. The general transfer
function model employed by the ARIMA procedure
Result and Discussion
was discussed by Box and Tiao (1975). When an From the Table1 the Correlation between
ARIMA model includes other time series as input production in maha season(Mprd) and rainfall of the
variables, the model is sometimes referred to as an season (MRfl) is 0.48207. The Correlation between
ARIMAX(p,d,q) model. Pankratz (1991) refers to the production in yala season(Yprd) and rainfall of the
season (YRfl) is 0.42209. Therefore this auxiliary
ARIMAX model as dynamic regression.
information of rainfall can be used for this model
building process.
=1−1−…− is the moving-average operator, TotalPrd 1.000 0.926 0.809 0.377 0.160
represented as a polynomial in the back shift operator: Mprd 0.926 1.000 0.612 0.482 0.101
at is the independent disturbance, also called the Yprd 0.809 0.612 1.000 0.091 0.423
MRfl 0.377 0.482 0.091 1.000 0.011
random
YRfl 0.160 0.101 0.423 0.011 1.000
[22]
B.Yogarajah, C.Elankumaran and R.Vigneswaran
Application of ARIMAX Model for Forecasting Paddy
Production in Trincomalee District in Sri Lanka
Model Building and Analysis Figure 3 gives scatter plot ACF, PACF and IACF graph
at autoregressive order p=1. This shows the stationarity
of rainfall in the maha season.
Decode
Rainfall Data Arrange Time Series Data
Analyze Trend and Correlation Analysis for MRFI
Viriance 1.0
1400
Analysis routine
1200 0.5
Variance NOT stable
MRFI
AFC
1000 0.0
800 -0.5
600
Obitain -1.0
Defferencing ACF Stationery Data
PACF 0 5 10 15 20 25 30 0 2 4 6
Observation Lag
1.0 1.0
No
Conclusions SCAN 0.5 0.5
Valdate Test of EACF
PAFC
IAFC
siginificance 0.0 0.0
-0.5 -0.5
[23]
Proceedings of the Third International Symposium,
SEUSL: 6-7 July 2013, Oluvil, Sri Lanka
the same ACF and PACF functions. Obtain ACF and Diagnostic checking:
PACF of residual term up to certain lags of the
estimated ARIMAX model and then check whether 1.0
Residual correlation Diagnostic For YRfI (1)
1.0
PACF
ACF
0.0 0.0
IACF
checking can also be done through Adjusted R2, 0.0
05
minimum of Akaike Information Criteria (AIC) and -0.5
-1.0 1.0
Schwarz Bayesian Criteria (SBC) Table III reports the 0 2 4 6 0 2 4 6
Lag Lag
estimated results.
Figure 6: Residual Correlaon diagnosc
Conclusion
While applying various quantitative and
qualitative models for forecasting, it is essential to
Maha season is ARIMA(1,1,0) understand the productivity is not an exception to it.
In this paper ARIMA model has been applied on few
selected agricultural products in India. As the model
requires large data points, considering the availability
of required annual data, 34 different agricultural
products has been selected. Annual data from 1950
and 1957 onwards to 2010 as the case may be have
been used. All the necessary steps of ARIMA model
[24]
B.Yogarajah, C.Elankumaran and R.Vigneswaran
Application of ARIMAX Model for Forecasting Paddy
Production in Trincomalee District in Sri Lanka
have been applied systematically for forecasting 5 The findings of this work can be applied in
periods ahead from 2011 onwards. Among these items, decision making for paddy cultivation and related
tea provides lowest MAPE value, whereas cardamom water resources management.
provides lowest AIC value. Similarly, highest MAPE is
obtained for papaya and highest AIC value is for
sugarcane. Now the question may arise is since Future work
agricultural productivity depend upon many factors Modeling Paddy production in the district and
such as rainfall, irrigation facility, monsoon, climate, determine whether the annual rainfall of the district
soil, fertilizer etc., forecasted values might be more impacts on Paddy production in the district.
accurate only with ceteris paribus assumption.
However, generally all the factors do not go well every
time and in right direction; therefore reliability of these References
forecasted values might be questionable. In this context
Box, G.E.P., Jenkins, G.M., and Reinsel, G.C. “Time Series
one need to rethink about other forecasting model,
Analysis: Forecasting and Control”. Third Edition,
which could incorporate more information for Englewood Cliffs, NJ: Prentice Hall, pp.197-199 1994.
forecasting the agricultural products. This could be
Box, G.E.P. and Tiao, G.C. “Intervention Analysis with
one of the limitations of the paper.
Applications to Economic and Environmental
underlying factors affecting it. Thus, forecasting Problems”. JASA, pp.70, 70-79 1975.
agricultural B.Yogarajah. “A time-series modeling approach for
understanding the behavior of rainfall patterns in the
In this model fitting process for the monthly Anuradhapura District”. Journal of the SrLanka
rainfall data from 1952 to 2010 in Trincomalee district Association of Geographers.pp.82-92 2010.
the more appropriate model than other models is
Brocklebank, J.C. and Dickey, D.A. “SAS System for
ARIMA(0,0,1)(0, 0,1)12
Forecasting Time Series”. Edition, Cary, North
The seasonal ARIMA model for monthly data Carolina: SAS Institute Inc.1986.
with the following mathematical form: Chatfild, C. “The Analysis of Time Series-an introduction.”
5th Edn., Chapman and Hall, UK. 1996
(1- B)(1- B12)Monthly Rainfall= (1 - 0.903 B)
(1 - 0.999 B**(12))at Demroes, M. “Aspects of aridity and drought in the monsoon
climate of Sri Lanka”. Indian Journal. Hydrology and
Where B is the backshift operator; that is and at is geophysics. 21(1and2): 384-394. 1978.
the independent disturbance, also called the random Manfred Domroes, Edmound Ranatunge “A statistical
error. approach towards a regionalization of daily rainfall in
Sri Lanka” Article first published online:
Key findings indicate that the rainfall patterns in DOI: 10.1002/joc.3370130704. 2006
the study area are non auto-regressive, as such - they
Manobavan, M., “The Response Of Terrestrial Vegetation To
do not depend on the past history of rainfall; but are El Niño Southern Oscillation”. Unpublished PhD
predominantly depending on nonlinear trend and a Thesis. Faculty of Science, Kingston University, Surry,
seasonal pattern of order 12. This indicates that in- United Kingdom. (2003).
order to arrive at a comprehensive forecasting model Meyler, A., Kenny, G., and Quinn, T. “Forecasting Irish
for rainfall in Trincomalee, the need to focus on the Inflation Using ARIMA Models”. Technical Paper
influences of non-endemic and regional-to-global (3/RT/98). Economic Analysis, Research and
climatic phenomena is apparent. Publication Department, Central bank of Ireland.
1998.
However, the model does not account for
Sabita Madhvi Singh, “Statistical Modeling of Climate
variations in precipitation due to cyclonic activity, Parameters”. Asian Journal of Current Engineering and
which is a significant factor of variation in the local Maths1: 2 March - April (2012) 29.
climate. This shortcoming should be addressed in the Thambyahpillay, G. “Climatic changes in Ceylon”. MA.
future research projects of this nature to enable a much Thesis, University of Cambridge.1958.
clearer picture of climatic changes in the district.
[25]