Unit-2 - Stat Modeling & Analysis
Unit-2 - Stat Modeling & Analysis
Dr. K. Subrahmanyam
Professor in CSE
Statistical Modelling and Analysis
• Probability distributions
• Variations
Probability distributions
• Probability is defined as the chance of occurrence
of an event
• Probability distribution can be classified into
discrete and continuous.
• Discrete probability distribution
– Poisson distribution, binomial distribution, tossing 3
coins, etc..
• Continuous Probability distribution
– Exponential distribution, Uniform distribution, Normal
distribution and t-distribution
Variation
• The variation (spread) is a measure of the
extent to which the individual observations
vary with reference to the average of the
given set of observations.
• Different measures of variation are
– Range
– Quartile deviation
– Mean deviation (average deviation)
– Standard deviation
– Coefficient of variation
Variation
Range - The difference between the highest
value (H) and the lowest value (L) in
the given set of observations
R = H-L
Coefficient of range = (H-L)/ (H+L)
Variation
• Quartile deviation is the half of the difference
between 3rd quartile and the 1st quartile.
• Mean deviation is the mean of the absolute
deviations of the individual observations from
either arithmetic mean or median of the given
set of observations.
Variation
• Standard deviation ( root mean square
deviation) is widely used measure of variation.
http://220.227.161.86/21538sm_finalnew_vol2_cp19.pdf
Time series
http://220.227.161.86/21538sm_finalnew_vol2_cp19.pdf
Components of a time series
• Any time series can contain some or all of the
following components:
1. Trend (T)
2. Cyclical (C)
3. Seasonal (S)
4. Irregular (I)
• These components may be combined in different ways. It is
usually assumed that they are multiplied or added, i.e.,
– yt = T x C x S x I
– yt = T + C + S + I
Components of a time series
Trend component
The trend is the long term pattern of a time series. A trend can be positive or negative
depending on whether the time series exhibits an increasing long term pattern or a decreasing
long term pattern. If a time series does not show an increasing or decreasing pattern then the
series is stationary in the mean.
Cyclical component
Any pattern showing an up and down movement around a given trend is idenfied as a
cyclical pattern. The duration of a cycle depends on the type of business or industry
being analyzed.
Seasonal component
Seasonality occurs when the time series exhibits regular fluctuations during the same month
(or months) every year, or during the same quarter every year. For instance, retail sales peak
during the month of December.
Irregular component
This component is unpredictable. Every time series has some unpredictable component that makes it a
random variable. In prediction, the objective is to \model" all the components to the point that the only
component that remains unexplained is the random component.
Methods of fitting a straight line to a time series
http://220.227.161.86/21538sm_finalnew_
vol2_cp19.pdf
Time series Analysis
• 8 Steps in time series Analysis:
1. Input the original time-series data (y)
2. Find moving Averages (MA)
3. Obtain seasonal indices(S)
4. Obtain deseasonalised value (Y) of y by using the formula
Y = y/S
5. Fit a regression model for the depersonalised forecast Y =
a+bX
6. Obtain trend estimates (Y) using Y = a+bX
7. Find the cyclical component using the equation C= Y/Y
8. Stop.
Note : The cyclical component cannot be used for estimation of
forecast, but is used only for Analysis. (Y is Y cap)
Error Analysis
• Types of errors
Error Analysis
• Standard measures of errors
– Absolute Deviation
– Relative % Deviation
– Standard Deviation
Error Analysis
• Propagation of Errors
Spectral Analysis
• Digital Signal Processing (DSP) has revolutionized spectral analysis. Where the
swept spectrum analyzer dominated the market in the past, the Fast Fourier
Transform (FFT) based spectrum analyzer is now gaining acceptance as the
method of choice. This is due in part to the prevalence of high speed, high
dynamic range Analog-to-Digital Converters (ADC) and high speed signal
processing devices such as Field Programmable Gate Arrays (FPGA). Because the
FFT-based spectrum analyzer is readily implemented with a limited set of generic
hardware, it is an attractive technique for Synthetic Instruments (SI), where the
goal is to form multiple measurement functions from a limited set of generic
hardware modules. In this paper some of the fundamental design parameters and
specifications of modern spectrum analyzers such as dynamic range,
instantaneous bandwidth, and image rejection are presented. These parameters
are explored with a focus on maintaining system performance without sacrificing
flexibility.