AI Notes
A data-driven model is a type of model that relies on empirical data to understand, predict, or make
decisions about a system or process, rather than relying on predefined theories or assumptions. These
models are built by analyzing large datasets and identifying patterns, trends, and relationships within the
data.
1. Data as the Foundation: The model's parameters, structures, and behavior are inferred directly
from the data rather than being imposed by human expertise or prior assumptions.
2. Learning from Data: These models use machine learning algorithms or statistical methods to
"learn" the underlying patterns in the data. Examples include regression models, decision trees,
neural networks, and clustering algorithms.
3. Adaptability: Data-driven models can improve as more data becomes available. This makes them
highly flexible and suitable for complex or dynamic environments where human-designed
models may fall short.
4. Types of Models:
o Predictive Models: These predict future outcomes based on historical data (e.g.,
demand forecasting, stock price prediction).
o Prescriptive Models: These recommend actions based on the analysis of data (e.g.,
optimization models for decision-making).
5. Applications: Data-driven models are widely used in fields like business analytics, finance,
healthcare, and engineering. Examples include recommendation systems, fraud detection,
predictive maintenance, and natural language processing systems.
In essence, data-driven models represent a shift from traditional rule-based models to models derived
directly from the data.
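As a minimal illustration of the idea, the sketch below fits a linear regression whose parameters are inferred entirely from observed (input, output) pairs rather than imposed by a predefined theory. The data here is synthetic, and all numbers are assumptions for the example.

```python
# A data-driven model in miniature: the slope and intercept of a linear
# regression are learned from (x, y) pairs, not specified in advance.
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(200, 1))              # 200 observed inputs
y = 3.0 * X[:, 0] + 2.0 + rng.normal(0, 1, 200)    # hidden relationship + noise

model = LinearRegression().fit(X, y)               # parameters inferred from data
print(model.coef_, model.intercept_)               # recovers ~3.0 and ~2.0
```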
AR, MA, ARMA, ARIMA, SARIMA
These terms—AR, MA, ARMA, ARIMA, and SARIMA—refer to statistical models used primarily for time
series analysis. These models aim to predict future values of a variable based on its past values and can
help in forecasting, identifying trends, or understanding seasonality in data. Here's a breakdown of each:
1. AR (AutoRegressive) Model
• Concept: In an AR model, the value of a variable at time t is regressed on its own previous values.
This means the model uses past observations to predict future ones.
• Use Case: AR models are useful when past values have a linear relationship with future values.
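As a hedged sketch of the idea, the snippet below simulates a simple AR(1) series and fits it with statsmodels' AutoReg; the coefficient 0.8, the series length, and the lag order are illustrative assumptions.

```python
# Fit an autoregressive model: regress y_t on its own past values.
import numpy as np
from statsmodels.tsa.ar_model import AutoReg

rng = np.random.default_rng(1)
y = np.zeros(500)
for t in range(1, 500):                 # simulate y_t = 0.8 * y_{t-1} + e_t
    y[t] = 0.8 * y[t - 1] + rng.normal()

res = AutoReg(y, lags=1).fit()          # regress y_t on y_{t-1}
print(res.params)                       # intercept and phi_1 (close to 0.8)
print(res.predict(start=len(y), end=len(y) + 4))   # 5-step-ahead forecast
```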
2. MA (Moving Average) Model
• Concept: In an MA model, the current value of a time series is expressed as a linear combination
of past error terms (random shocks).
• Use Case: Useful when past errors (noise) influence future observations.
3. ARMA (AutoRegressive Moving Average) Model
• Concept: ARMA combines both AR and MA components. It models a time series using both past
values (AR) and past errors (MA).
• Use Case: ARMA is effective when both the past values of the series and past random shocks are
significant in explaining future values.
4. ARIMA (AutoRegressive Integrated Moving Average) Model
• Concept: ARIMA extends ARMA by adding a differencing step to make the time series stationary
(i.e., constant mean and variance over time). This model is suited for non-stationary time series.
• Formula: ARIMA(p, d, q), where:
o p: the order of the AR component (number of lagged values),
o d: the degree of differencing applied to make the series stationary,
o q: the order of the MA component (number of lagged errors).
• Use Case: ARIMA is widely used for forecasting time series that are non-stationary but can be
made stationary through differencing.
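A minimal sketch of fitting an ARIMA model with statsmodels; the toy non-stationary series and the order (1, 1, 1) are illustrative assumptions, and in practice the orders would be chosen from ACF/PACF plots or information criteria.

```python
# Fit ARIMA(1, 1, 1): d=1 means the series is differenced once internally.
import numpy as np
from statsmodels.tsa.arima.model import ARIMA

rng = np.random.default_rng(2)
trend = np.cumsum(rng.normal(0.5, 1.0, 300))   # random walk with drift (non-stationary)

res = ARIMA(trend, order=(1, 1, 1)).fit()
print(res.summary())
print(res.forecast(steps=10))                  # forecast the next 10 points
```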
5. SARIMA (Seasonal ARIMA) Model
• Concept: SARIMA extends ARIMA with seasonal autoregressive, differencing, and moving
average terms to capture repeating patterns.
• Formula: SARIMA(p, d, q)(P, D, Q, s), where:
o P, D, Q: the seasonal counterparts of the non-seasonal orders p, d, q,
o s: the number of periods in each season (e.g., s = 12 for monthly data with
yearly seasonality).
• Use Case: Used for time series with both trend and seasonal patterns, like sales data with yearly
cycles.
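A minimal sketch of a seasonal fit via statsmodels' SARIMAX, assuming monthly data (s = 12) with a yearly cycle; the simulated series and the orders are illustrative assumptions.

```python
# Fit SARIMA(1,1,1)(1,1,1,12) to a series with trend + yearly seasonality.
import numpy as np
from statsmodels.tsa.statespace.sarimax import SARIMAX

rng = np.random.default_rng(3)
t = np.arange(240)                                       # 20 years of monthly data
y = 0.05 * t + 2.0 * np.sin(2 * np.pi * t / 12) + rng.normal(0, 0.5, 240)

res = SARIMAX(y, order=(1, 1, 1), seasonal_order=(1, 1, 1, 12)).fit(disp=False)
print(res.forecast(steps=12))                            # one full seasonal cycle ahead
```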
When to Use Each Model:
• AR: Use when past values of the series are important for predicting future values.
• MA: Use when past errors (random shocks) are important for prediction.
• ARMA: Use when both past values and past errors are important for prediction.
• ARIMA: Use for non-stationary data that needs to be differenced to become stationary.
• SARIMA: Use when both trend and seasonality are present in the data.
These models are widely used in finance, economics, environmental science, and other fields requiring
time series forecasting.
TRAINING AND VALIDATION
1. Training in AI
Training is the process where an AI model learns from data. It involves feeding the model a dataset
(called the training set) and adjusting the model’s parameters based on this data to minimize the error
between the model’s predictions and the actual values.
• Data Feeding: The model is provided with labeled data (input and corresponding output). For
supervised learning, this input-output pair is crucial for learning.
• Learning: The model uses algorithms (e.g., gradient descent) to iteratively adjust its internal
parameters (e.g., weights in a neural network) to improve predictions.
• Error Calculation: The model makes predictions on the training data, and the difference between
predicted and actual values (the loss or error) is computed.
• Optimization: The model’s parameters are adjusted to minimize the loss. Optimizers like
stochastic gradient descent (SGD) or Adam are used for this process.
• Repetition: This process repeats for several epochs (iterations over the dataset) until the model
converges (the error is minimized sufficiently or the performance plateaus).
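The loop below is a minimal sketch of these steps on a one-feature linear model: data feeding, error calculation, and gradient-based optimization repeated over epochs. The learning rate, epoch count, and synthetic data are illustrative assumptions.

```python
# Batch gradient descent minimizing mean squared error on y = 2x - 1 + noise.
import numpy as np

rng = np.random.default_rng(4)
x = rng.uniform(-1, 1, 100)
y = 2.0 * x - 1.0 + rng.normal(0, 0.1, 100)   # labeled training data

w, b, lr = 0.0, 0.0, 0.1
for epoch in range(200):                       # repetition over the dataset
    pred = w * x + b                           # prediction on training data
    err = pred - y                             # error calculation
    w -= lr * (2 * err * x).mean()             # optimization: gradient step on w
    b -= lr * (2 * err).mean()                 # optimization: gradient step on b
print(w, b)                                    # converges near (2.0, -1.0)
```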
2. Validation in AI
Validation is the process of evaluating the model's performance on a separate dataset (called the
validation set) during training. The key idea is to test the model on data it has not seen before to assess
how well it generalizes to unseen data. It helps prevent overfitting, where the model performs well on
training data but poorly on new data.
• Hold-out Validation Set: A portion of the data is set aside and not used for training. The model is
validated on this set during training.
• Evaluation Metrics: After training for a certain number of epochs, the model’s performance is
measured on the validation set using metrics like accuracy, precision, recall, F1-score, or loss.
• Early Stopping: If the model performs well on training data but starts performing poorly on the
validation set (i.e., validation loss increases), training may be stopped early. This prevents
overfitting.
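Below is a runnable sketch of a hold-out split plus early stopping on a tiny numpy model: training continues only while validation loss keeps improving within a patience window. The data, model, and hyperparameters (patience of 5 epochs) are illustrative assumptions.

```python
# Early stopping: halt training when held-out loss stops improving.
import numpy as np

rng = np.random.default_rng(5)
x = rng.uniform(-1, 1, 120)
y = 2.0 * x + rng.normal(0, 0.2, 120)
x_tr, y_tr, x_val, y_val = x[:96], y[:96], x[96:], y[96:]   # 80/20 split

w, lr = 0.0, 0.05
best_val, patience, bad_epochs = float("inf"), 5, 0
for epoch in range(500):
    err = w * x_tr - y_tr
    w -= lr * (2 * err * x_tr).mean()              # gradient step on training set
    val_loss = ((w * x_val - y_val) ** 2).mean()   # evaluate on held-out data
    if val_loss < best_val - 1e-6:
        best_val, bad_epochs = val_loss, 0         # improved: reset the counter
    else:
        bad_epochs += 1
        if bad_epochs >= patience:                 # no improvement for 5 epochs
            break
print(epoch, best_val)
```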
Why Validation Matters:
1. Preventing Overfitting: Validation helps detect overfitting, where the model memorizes the
training data but fails to generalize.
2. Model Selection: Different models (or variations of a model) can be trained and compared based
on their validation performance to choose the best one.
Common Practices:
• Train/Validation Split: A typical dataset might be split into 80% for training and 20% for
validation.
• Cross-Validation: Instead of using a single validation set, k-fold cross-validation involves splitting
the dataset into k subsets, using k-1 for training and 1 for validation in turns, to get more robust
performance estimates.
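A minimal sketch of 5-fold cross-validation with scikit-learn's KFold; the dataset and model are illustrative assumptions. Each fold serves once as the validation set while the other four train the model, and the scores are averaged.

```python
# 5-fold cross-validation: average held-out R^2 across folds.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import KFold

rng = np.random.default_rng(6)
X = rng.uniform(0, 1, size=(100, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(0, 0.1, 100)

scores = []
for train_idx, val_idx in KFold(n_splits=5, shuffle=True, random_state=0).split(X):
    model = LinearRegression().fit(X[train_idx], y[train_idx])
    scores.append(model.score(X[val_idx], y[val_idx]))   # R^2 on held-out fold
print(np.mean(scores))   # more robust estimate than a single split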
Together, training and validation work to ensure the AI model performs well both on the data it has seen
(training set) and the data it hasn’t (validation set), striking a balance between learning patterns and
maintaining generalizability.
AKAIKE INFORMATION CRITERION (AIC)
The Akaike Information Criterion (AIC) is a widely used metric in statistics and machine learning to
evaluate and compare different models. It helps to identify the model that best explains the data while
penalizing overfitting, balancing the trade-off between goodness of fit and model complexity.
1. Goodness of Fit: This refers to how well the model fits the data. Typically, models that have a
lower error or loss (such as residual sum of squares in regression) are considered to have a
better fit.
2. Model Complexity: More complex models (e.g., models with more parameters) can often fit the
data better but may lead to overfitting, where the model performs well on the training data but
poorly on unseen data. AIC penalizes models that are unnecessarily complex.
AIC = 2k − 2 ln(L)
Where:
• k is the number of estimated parameters in the model,
• L is the maximum likelihood of the model (how likely the model is given the data).
Explanation of Terms:
• Maximum Likelihood: L is a measure of how well the model explains the observed data. A
higher likelihood means the model fits the data better.
• Penalty for Complexity: The term 2k penalizes models with more parameters, preventing
overfitting by discouraging overly complex models.
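The formula translates directly to code. The helper below is a hypothetical convenience (most statistics libraries report AIC themselves), and the log-likelihood values are made up for illustration; note how the more complex model wins on fit but loses after the penalty.

```python
# Direct translation of AIC = 2k - 2 ln(L), taking ln(L) as input since
# software typically reports the log-likelihood directly.
def aic(k: int, log_likelihood: float) -> float:
    return 2 * k - 2 * log_likelihood

print(aic(k=3, log_likelihood=-120.5))   # 247.0
print(aic(k=6, log_likelihood=-118.0))   # 248.0: better fit, but penalized more
```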
Interpretation of AIC:
• Lower AIC values indicate a better model. The AIC score can only be interpreted relative to other
models: the model with the lowest AIC is generally considered the best.
• Model Comparison: AIC is primarily used to compare different models fitted to the same
dataset. It doesn't give an absolute measure of model quality, only a relative one.
AIC and Model Selection:
• Overfitting Prevention: By penalizing the number of parameters, AIC helps in selecting simpler
models that generalize better to unseen data, reducing the risk of overfitting.
• Trade-off: AIC tries to find a balance between goodness of fit and model complexity, but it does
not guarantee that the selected model is the most accurate for future predictions.
Applications of AIC:
• Time Series Models: AIC is often used to compare models like AR, MA, ARMA, ARIMA, and
SARIMA to select the best model for forecasting (see the sketch after this list).
• Regression Models: In linear regression, AIC can be used to compare models with different sets
of predictors.
• Machine Learning: AIC can be applied in model selection when dealing with probabilistic
models, though modern machine learning frameworks often rely on cross-validation for more
robust evaluation.
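As referenced in the time series bullet above, here is a hedged sketch of AIC-based order selection: fit several candidate ARIMA orders on the same series and keep the one with the lowest AIC. The series and the candidate grid are assumptions for the example.

```python
# Grid-search ARIMA orders by AIC; lower AIC = preferred model.
import numpy as np
from statsmodels.tsa.arima.model import ARIMA

rng = np.random.default_rng(7)
y = np.cumsum(rng.normal(0, 1, 300))       # toy non-stationary series

best = None
for p in range(3):
    for q in range(3):
        res = ARIMA(y, order=(p, 1, q)).fit()
        if best is None or res.aic < best[1]:
            best = ((p, 1, q), res.aic)    # keep the lowest-AIC candidate
print(best)
```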
Limitations of AIC:
• Sample Size: AIC might perform poorly for small sample sizes. In such cases, the corrected AIC
(AICc) is preferred, as it adjusts for small samples.
• Non-Nested Models: AIC is more effective when comparing nested models (i.e., models that can
be obtained by adding or removing parameters). It may not work as well for comparing
fundamentally different types of models.
Conclusion:
The Akaike Information Criterion is a powerful tool for model selection in both statistics and machine
learning. It balances goodness of fit against model simplicity, helping to avoid overfitting. It is
widely used in time series forecasting, regression, and probabilistic modeling.
MA(2)
In the context of time series analysis, MA(2) stands for a Moving Average model of order 2. This is a
specific instance of the broader Moving Average (MA) model, commonly used in statistics and AI for
time series forecasting.
Breakdown of MA(2):
1. Moving Average (MA) Model: The MA model expresses the value of a time series as a linear
combination of past error terms (random shocks or noise). Unlike the AutoRegressive (AR)
model, which depends on past values of the series itself, the MA model depends on past errors.
2. MA(q): In general, an MA(q) model is defined by the number of lagged error terms included in
the model; q is the order of the model. For example, in MA(2), the current value of the
series depends on the current error and the two most recent past errors.
Formula for MA(2):
Y_t = μ + ε_t + θ_1 ε_{t−1} + θ_2 ε_{t−2}
Where:
• μ is the mean of the series,
• ε_t is the current error term (random shock),
• θ_1 and θ_2 are the coefficients of the lagged error terms,
• ε_{t−1} and ε_{t−2} are the errors at times t−1 and t−2, respectively.
• The current value Y_t depends on the current random shock ε_t and the previous
two shocks (errors) ε_{t−1} and ε_{t−2}.
• The coefficients θ_1 and θ_2 control the influence of the past errors on the
current value.
Example of MA(2):
Suppose we are trying to model a time series of daily temperature fluctuations. An MA(2) model could
account for today's temperature Y_t by considering not only today's weather-related random
factor ε_t but also the random effects from the previous two days, ε_{t−1} and ε_{t−2}.
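To see the model in practice, the sketch below simulates an MA(2) process with known coefficients (θ_1 = 0.6, θ_2 = 0.3, chosen arbitrarily) and recovers them by fitting a pure MA(2), i.e., ARIMA(0, 0, 2), with statsmodels.

```python
# Simulate an MA(2) process and recover theta_1, theta_2 by fitting.
import numpy as np
from statsmodels.tsa.arima_process import ArmaProcess
from statsmodels.tsa.arima.model import ARIMA

# statsmodels lag-polynomial convention: ma = [1, theta_1, theta_2]
process = ArmaProcess(ar=[1.0], ma=[1.0, 0.6, 0.3])
y = process.generate_sample(nsample=1000)

res = ARIMA(y, order=(0, 0, 2)).fit()    # p=0, d=0, q=2 -> pure MA(2)
print(res.params)                        # estimates near theta_1=0.6, theta_2=0.3
```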
Applications of MA(2):
• Time Series Forecasting: MA models like MA(2) are used to predict future values of a series
based on past errors. They can capture short-term dependencies and noise in the data.
• Stationary Time Series: MA models are best suited for time series that are stationary, meaning
they have a constant mean and variance over time.
• Financial Markets: In stock market analysis, moving average models can help in predicting price
movements based on random past fluctuations.
Advantages of MA(2):
• Simple Model: MA models are easier to interpret and apply compared to more complex models.
• Capturing Noise: MA(2) can capture the short-term correlations in the errors that affect the
series over time.
Limitations:
• Lag Dependency: The MA(2) model only considers the two most recent errors. For time series
with longer-term dependencies, higher-order models or different models (e.g., ARMA, ARIMA)
may be more appropriate.
• Stationarity Requirement: Like other moving average models, MA(2) assumes that the time
series is stationary.
Conclusion:
An MA(2) model in time series analysis expresses the current value of a series as a linear combination of
the last two error terms, plus a current error. It is useful for modeling short-term noise or fluctuations in
stationary time series data.
AR(3)
In the context of time series analysis, AR(3) stands for an AutoRegressive model of order 3. It is a
specific case of the more general AutoRegressive (AR) model, which predicts future values of a time
series based on its own past values.
Breakdown of AR(3):
1. AutoRegressive (AR) Model: The AR model assumes that the current value of a time series is a
linear combination of its previous values. This model is often used for forecasting and analyzing
time series data that exhibits autocorrelation (i.e., the values are correlated with past values).
2. AR(p): In general, an AR(p) model of order p predicts the current value of the time series
from its p previous values. In the case of AR(3), the current value depends on the three
most recent past values.
Formula for AR(3):
Y_t = c + φ_1 Y_{t−1} + φ_2 Y_{t−2} + φ_3 Y_{t−3} + ε_t
Where:
• Y_{t−1}, Y_{t−2}, Y_{t−3} are the three most recent past values of the series,
• φ_1, φ_2, φ_3 are the coefficients that quantify the impact of the three lagged values
on Y_t, i.e., the weight given to each past value in predicting the current one,
• ε_t is the error term capturing any random fluctuations not explained by the past values.
Example of AR(3):
Suppose you are modeling daily stock prices, and today's price Y_t depends on the prices from the
last three days, Y_{t−1}, Y_{t−2}, and Y_{t−3}. The AR(3) model would estimate
today's stock price using the influence of the previous three days' prices, weighted by their respective
coefficients φ_1, φ_2, φ_3.
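A hedged sketch of the same idea with synthetic data: simulate an AR(3) process with known coefficients and recover them with statsmodels' AutoReg. The φ values (0.5, 0.2, 0.1) are arbitrary illustrative choices that keep the series stationary.

```python
# Simulate an AR(3) process and recover phi_1, phi_2, phi_3 by fitting.
import numpy as np
from statsmodels.tsa.ar_model import AutoReg

rng = np.random.default_rng(8)
y = np.zeros(800)
for t in range(3, 800):   # true process: phi = (0.5, 0.2, 0.1)
    y[t] = 0.5 * y[t-1] + 0.2 * y[t-2] + 0.1 * y[t-3] + rng.normal()

res = AutoReg(y, lags=3).fit()
print(res.params)                                  # const, phi_1, phi_2, phi_3
print(res.predict(start=len(y), end=len(y) + 2))   # next 3 predicted values
```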
Applications of AR(3):
• Time Series Forecasting: AR(3) can be used to predict future values based on past observations.
It is useful for financial data, sales forecasting, weather prediction, etc.
• Data with Short-Term Memory: If the current value of a series is heavily influenced by a few
previous observations, an AR(3) model can capture these dependencies.
• Stationary Time Series: AR models generally assume that the time series is stationary, meaning
it has a constant mean and variance over time.
Advantages of AR(3):
• Capturing Temporal Dependencies: AR(3) effectively captures the influence of the last three
observations, making it suitable for time series with short-term autocorrelation.
• Simple Interpretation: The model is easy to interpret because it directly shows how previous
values affect the current value.
Limitations:
• Fixed Lag Order: AR(3) only accounts for the three most recent values. If longer-term
dependencies are present in the data, a higher-order model or a different model (like ARMA or
ARIMA) may be needed.
• Stationarity Requirement: The AR(3) model assumes the time series is stationary, so it might not
perform well on non-stationary data without additional transformations like differencing.
Conclusion:
An AR(3) model in time series analysis predicts the current value of a series based on the three most
recent past values. It is effective for short-term forecasting and capturing autocorrelation in stationary
time series data. By using the previous three values, the model can reveal how recent history impacts
current outcomes, making it useful in various applications, from finance to weather prediction.
ARMA(2,1)
ARMA(2,1) is a combined AutoRegressive Moving Average model used for time series forecasting and
analysis. It incorporates both the AutoRegressive (AR) component and the Moving Average (MA)
component, where the numbers refer to the order of each component:
• The AR(2) part represents an AutoRegressive model of order 2, meaning it uses the last two
past values of the time series to predict the current value.
• The MA(1) part represents a Moving Average model of order 1, meaning it uses the most recent
past error term to model the random shocks influencing the current value.
Breakdown of ARMA(2,1):
1. AutoRegressive (AR) Component: In AR(2), the current value of the series is a linear
combination of the previous two values. This part accounts for the dependence on the past
values of the series itself.
2. Moving Average (MA) Component: In MA(1), the model depends on the current error term
(random noise) and the error from the previous time step. This part helps capture shocks or
random noise in the system that impacts future values.
Formula for ARMA(2,1):
Y_t = c + φ_1 Y_{t−1} + φ_2 Y_{t−2} + ε_t + θ_1 ε_{t−1}
Where:
• Y_{t−1}, Y_{t−2} are the values of the time series at times t−1 and t−2
(previous observations),
• φ_1, φ_2 are the autoregressive coefficients that quantify the influence of past
values on the current value,
• θ_1 is the moving average coefficient that represents the influence of the past error on
the current value,
• ε_t and ε_{t−1} are the current and previous error terms (random shocks).
• AR(2) Component: The model predicts the current value Y_t based on the two most recent
past values Y_{t−1} and Y_{t−2}.
• MA(1) Component: The model also incorporates the current random noise ε_t and
the random shock from the previous time step ε_{t−1}, which allows it to account
for random fluctuations in the data.
Example of ARMA(2,1):
Suppose you're analyzing a time series of monthly sales data. An ARMA(2,1) model would predict the
sales for the current month, Y_t, based on the sales from the previous two months, Y_{t−1}
and Y_{t−2}, as well as the random noise in the current and previous months, ε_t and
ε_{t−1}.
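A minimal sketch along the same lines: since ARMA(2,1) is ARIMA with d = 0, it can be fit as ARIMA with order=(2, 0, 1) in statsmodels. The simulated coefficients (φ_1 = 0.6, φ_2 = 0.2, θ_1 = 0.4) are illustrative assumptions.

```python
# Simulate an ARMA(2,1) process and fit it as ARIMA(2, 0, 1).
import numpy as np
from statsmodels.tsa.arima_process import ArmaProcess
from statsmodels.tsa.arima.model import ARIMA

# Lag-polynomial convention: ar = [1, -phi_1, -phi_2], ma = [1, theta_1]
process = ArmaProcess(ar=[1.0, -0.6, -0.2], ma=[1.0, 0.4])
y = process.generate_sample(nsample=1000)

res = ARIMA(y, order=(2, 0, 1)).fit()   # AR order 2, no differencing, MA order 1
print(res.params)                       # estimates near phi=(0.6, 0.2), theta=0.4
print(res.forecast(steps=3))            # predict the next three "months"
```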
Applications of ARMA(2,1):
• Financial Forecasting: ARMA(2,1) models are commonly used in financial time series, such as
stock prices or currency exchange rates, where both the recent past and random fluctuations are
important.
• Demand Forecasting: Retailers can use ARMA(2,1) models to predict future demand based on
past sales and market trends.
• Environmental Data: ARMA models can be applied to time series data like temperature, air
quality, or rainfall, where the past values and recent random variations both play a role in
determining the current outcome.
Advantages of ARMA(2,1):
• Capturing Both Patterns and Noise: The ARMA model combines both autoregressive and
moving average components, allowing it to capture relationships between past values and also
account for the noise in the system.
• Flexibility: ARMA(2,1) offers a good balance of model complexity while capturing important
dynamics in the data.
Limitations:
• Stationarity Requirement: ARMA models assume the time series is stationary (constant mean
and variance over time). If the series is non-stationary, transformations like differencing may be
needed before applying ARMA.
Conclusion:
An ARMA(2,1) model combines both the AR(2) (AutoRegressive of order 2) and MA(1) (Moving Average
of order 1) components to forecast time series data. It uses the last two observations and the last error
term to predict future values, making it useful for capturing both patterns in the data and random noise.