Neural Networks For The Analysis and For PDF
Neural Networks For The Analysis and For PDF
Abstract Allocating advertising expenses and forecasting total sales levels are the key
issues in retailing, especially when many products are covered and significant
cross-effects among products are likely. Various statistical and econometric
methods could be applied for such analyses. We explore how well neural
networks can be used in analyzing the effects of advertising and promotion
on sales in this article. The results reveal that the predictive quality of neural
networks depends on the different frequency of data observed, i.e. daily or
weekly data models, and the specific learning algorithms used. The study also
shows that neural networks are capable of capturing the nonlinear aspects of
complex relationships in non-stationary data. By performing sensitivity analysis,
neural networks can potentially single out important input variables, thereby
making it useful for scenario development and practical use. 1998 John
Wiley & Sons, Ltd.
Keywords: neural networks; sensitivity analysis; sales prediction; advertisement allocaion.
and the growth of organization, sales pro- the macro level or seasonal effects, resulting in
motion, special events, marketing services and fewer training sets.
visual merchandising. There are four sections The graphs of daily sales and weekly sales
in the advertising and promotion department and advertising expenditures are shown in
of the retailer organization under study. The Figures 3 and 4, respectively. In Figure 3, the
sales promotion and special events section graph of the observed daily sales over the
make the core decision while the other two specified period is a series of discrete peaks,
play support roles. The manager’s decision in which represent sales during the weekends
advertising or promotion depends on the rec- (Saturday and Sunday). However, in some per-
ords of past advertising and promotion cam- iods, the graph allows the capture of continu-
paigns, the latest trend, competitors’ activities, ous trends, while giving rise to chaotic regimes
budgets, inventory data, sales figures, profit in others due to seasonal effects. Figure 4 shows
loss statement and other raw data. To establish data sets organized on a weekly basis. As this
budgets, a simple historical method, adjusted pattern represents an aggregation of daily
from previous-year budgets, is used. values, the seasonal effects are now more obvi-
The original form of data is the daily model, ous, although the trend and the cycle cannot
and the weekly model is easily derived from be detected a priori. It is interesting to observe
it. The daily data model takes into account a the dynamics of input variables, such as adver-
very short lag effect of advertising with obser- tising expenditure levels versus sales levels. At
vation of an immediate sales response. Sales times it constitutes a uniform policy, while at
results are taken one day after the associated other times, it is a pulsing policy. The high
advertisement placement. The weekly data peak in July 1989 is due to a massive promotion
model is expected to aggregate this effect at campaign for a special anniversary of the retail
1998 John Wiley & Sons, Ltd. Int. J. Intell. Sys. Acc. Fin. Mgmt. 7, 253–268, (1998)
Figure 4 Sales data of store 1 and advertising expenditure on a weekly basis for the period: April 1989 to
March 1990 (units are in 1000 Singapore dollars)
1998 John Wiley & Sons, Ltd. Int. J. Intell. Sys. Acc. Fin. Mgmt. 7, 253–268, (1998)
ficient values. We used a t-test to test the sig- (where Sx and Sy are the standard deviations
nificance of each individual coefficient. For the of x and y respectively) is used to check on
daily model, the variables with significant or the collinearity of data set x and y. The results
nearly significant t-statistics are: var2 (Weekend have shown that there are seven variables that
Indicator), var4 (Ladies’ Accessories), var8 (Gift are correlated with dependent variables for the
Ideas), var9 (Cosmetics), var12 (Seasonal Fashion weekly model. They are var1 (Man Fashion,
Statements), var13 (Fashion Shows) and var15 0.335), var4 (Toys/Stationery, 0.204), var5
(Newspaper 2). For the weekly model, the vari- (Children’s wear Products, 0.296), var7 (Gift
ables with significant or nearly significant t- Ideas, 0.368), var8 (Cosmetics, −0.216), var10
statistics are: var4 (Toys/Stationery), var7 (Gift (Seasonal Fashion Statements, −0.202) and var13
Ideas), var8 (Cosmetics), var11 (Fashion Shows), (Newspaper 2, 0.267). More results on statistical
var13 (Newspaper 2) and var18 (Cost). These models may be found in Jas̆ic (1993) where the
variables are generally consistent for both mod- impact on seasonal indices is also discussed.
els except for Toys/Stationery and Statements.
Pearson’s correlation coefficient which is BUILDING THE NEURAL NETWORK
defined as MODELS
冘 (x − x) (y − y)/(N − 1)S S
N
Stores 1 2 3 4 5 6
Figure 5 Three months’ daily prediction for store 1 using a 22-10-1 network
Table 3. The best forecasting results of a network (18-9-1) using weekly data for different stores
Stores 1 2 3 4 5 6
between the inputs and outputs. If the points then the network will perform well, as the
in the input space that are close in Euclidean initial clustering in the Kohonen layer will be
distance have widely varying outputs, the net- easier.
work will not perform well. In that case, a For both the daily and weekly data models,
network may require a large number of the counterpropagation network is trained with
Kohonen nodes in order to adequately rep- 100, 500, and 1000 iterations using the following
resent the problem. On the other hand, if the parameters: 1 to 60 nodes in the Kohonen layer,
input data are from a homogeneous source, the learning rate of the Kohonen layer is set to
1998 John Wiley & Sons, Ltd. Int. J. Intell. Sys. Acc. Fin. Mgmt. 7, 253–268, (1998)
Figure 7 Forecasting error for the different counterpropagation daily models (data of store 1)
1998 John Wiley & Sons, Ltd. Int. J. Intell. Sys. Acc. Fin. Mgmt. 7, 253–268, (1998)
All i,k
0
ik
(7)
Table 4 Sensitivity analysis of var13 at values 0.00, 0.25, and 0.50 for a perturbation value of ±0.25
1998 John Wiley & Sons, Ltd. Int. J. Intell. Sys. Acc. Fin. Mgmt. 7, 253–268, (1998)
higher degrees of correlations of input variables ent product categories and different periods
in the weekly model which is captured by of time.
the counterpropagation network. However, this
type of neural network cannot be used for
sensitivity analysis as opposed to the back- CONCLUSION AND FUTURE RESEARCH
propagation network.
Another point which should be noted is that The results from this study indicate that a neu-
the validity of neural network prediction per- ral network is able to capture the nonlinear
formance assumes that the dynamics of input relationships in a causal model even though
variables is similar to that of the recent past, there is no explicit structures of the domain
especially if insufficient data sets are available field. The model developed thus can be used
for training. in assisting in the short-term forecasting of a
variable of interest. In addition, it can be used
to conduct a sensitivity analysis, which is a
Marketing Implications
useful procedure for analyzing possible stra-
By conducting a sensitivity analysis on a suit- tegies with respect to the observed variables.
ably trained neural network model, the follow- The results indicate that backpropagation is
ing findings of this study can be translated into an efficient method for studying the relation-
practical applications in marketing manage- ship between input and output variables. It
ment. First, the results indicate that continuous could be a useful tool for planning and allocat-
and heavy promotion of certain items (e.g. ing advertising expenses. However, the fore-
cosmetics) is either not correlated or may have casting accuracy of the dependent variables
negative effects on the overall sales. Second, depends on different data models and size of
special events (shows, statements) and the data sets. In the case of the daily model, a
intermittent advertising and promotion of some suitable network architecture is not easy to
product categories are positively correlated and identify. For the weekly data, the choice of a
have a significant positive impact on the overall suitable network architecture is easier and can
sales. Third, the results indicate that advertising be determined by looking at the best fore-
in the leading local language (Chinese) news- casting result.
paper has a more significant effect on the total Speed is an advantage when using a counter-
sales than through other channels. On the other propagation network as compared to backpro-
hand, featuring advertisements in the English pagation. However, for the daily data model,
newspaper that has the largest circulation, counterpropagation does not yield better results
which carried the majority of advertisements, than backpropagation. A possible explanation
is found to have no significant effect on the is that some points in the input space that are
overall sales. close in Euclidean distance have widely varying
These results have been verified by the esti- outputs, causing neural networks to perform
mation of different regression methods. How- poorly. In particular, changing one bit in the
ever, it is the intention of this study to gen- input corresponding to a weekend indicator in
eralize the results. It is quite possible that the daily model induces significant change in
different advertising retention exists for differ- the output. This will cause unsatisfactory per-
1998 John Wiley & Sons, Ltd. Int. J. Intell. Sys. Acc. Fin. Mgmt. 7, 253–268, (1998)
1998 John Wiley & Sons, Ltd. Int. J. Intell. Sys. Acc. Fin. Mgmt. 7, 253–268, (1998)