Data Analyis
Data Analyis
GUIDANCE BY
PROF. CHINTAN PATEL
COMPARATIVE STUDY ON
FORECASTING TECHNIQUES
FLOW OF PRESENTATION
• Machine learning (ML) is a type of artificial intelligence (AI) that allows software
applications to become more accurate at predicting outcomes without being explicitly
programmed to do so.
• Machine learning algorithms use historical data as input to predict new output values.
ARTIFICIAL NEURAL NETWORK
• RBF network only consists of an input layer, a single hidden layer, and an output layer.
Input Layer
• The input layer simply feeds the data to the hidden layers.
• No computation is performed.
Hidden layer
• The computations in the hidden layers are based on comparisons with prototype vectors which is a
vector from the training set.
• Each neuron computes the similarity between the input vector and its prototype vector.
Output layer
• The output layer uses a linear activation function for both classification or regression
tasks.
• The resulting prediction can be used for both classification or regression tasks.
DECAY RBF NEURAL NETWORKS (DRNN)
• DRNN is the simplest algorithm. In this the number of output weights ωh is same as the number of
training patterns. Here output weights are equal to the outputs of training patterns.
• DRNN networks have very fast learning processes because output weights are being set to the
output value of the outputs of the corresponding training pattern, but results are seldom satisfactory.
SUPPORT VECTOR REGRESSION
• In simple regression, the idea is to minimize the error rate while in SVR the idea is to fit the error
inside a certain threshold which means, work of SVR is to approximate the best value within a
given margin.
• In SVM (support vector machine) to separate groups of patterns, the number of patterns in the
analysis is significantly reduced. Instead of using all patterns, only patterns closest to the separation
surface are used and these patterns are known as support vectors.
• Various nonlinear functions can be used to create the separation surface. SVR can be relatively fast
when high accuracy is not required.
No of RBF γ C ε Training MAPE(%)
time(hours) (mean absolute
percent error)
• the ISO algorithm is capable of solving the same problem with about ten times a smaller network
with superb generalization abilities.
• Here the problem is that it is very computationally intensive and many trials with different starting
points are needed before an optimal solution is found.
No of RBF Iteration Training time(hours) MAPE(%)
(mean absolute percent
error)
Output
24 hours ahead load
Case 2: 6 input parameters
1. Dry bulb temperature in Fahrenheit
2. Dew point temperature in Fahrenheit
3. Hour of day
4. Previous 24-hr average load
5. 24-hr lagged load
6. 168-hr (previous week) lagged load
Output
24 hours ahead load
DATA SHEET IMAGE
RESULT AND COMPARISION
• We can see that the DRNN algorithm was able to solve problem relatively fast, but the results
had over 20 times larger errors than the results reached or obtained with other algorithm. Hence it
was not used in further studies.
• ELM algorithm produced reasonable results and faster then other algorithms.
• SVR produced the worse results.
• Best results were obtained with the ErrCor algorithm. Therefore, the execution time for the
network obtained with the ErrCor is 25 times shorter than the execution time of the network
obtained with the ELM, and it is 90 times shorter than the network obtained with SVR.
Algorithm Numbers of Training time(Hours) MAPE(%)
RBFs (mean absolute
percent error)