Modelos y Simulación - Clase 4-2016
Modelos y Simulación - Clase 4-2016
Probability
Density
Data?
Statistical analysis
Data
Probability
Density?
Statistical analysis
Statistical analysis
Statistical analysis
Statistical test
Statistical analysis
t29,0.025 2.045
Statistical analysis
Statistical analysis
Statistical analysis
Problem
Rogers and Girolani (2012). A first course in Machine Learning. CRC Press.
?????
published this
book
He introduced and
published this
book
He states that he
remains!
Recall
Three important results:
Linear Equations
Unconstrained Optimization
Probabilities
Linear Systems
Linear Systems
If M is non
singular, this
equation has
only one
solution
Unconstrained Optimization
Theorem
Gradient
A book
intended for
people
interested in
solving
optimization
problems
Probability Theory
Joint Probability
Stop!
No more, for now:
Linear Equations
Unconstrained Optimization
Probabilities
Models
Linear Models
The simplest model we can
assume is the linear:
Winning time
(output)
Unknown
Parameters
Olympics
number
(input)
Loss Function
A good candidate
Vector Formulation
Vector Formulation
Vector Formulation
System of
Linear Equations
Yes, it is
the same equation!
Over-fitting
The 4th order model fits the training
Validation Data
One way to detect
over-fitting is to
use a validation
data set (not used
for training) to test
the predictive
performance of the
model
Cross - Validation
Set 1
1/3
Available
Data
Random
Selection
Set 2
1/3
Set 3
1/3
Cross - Validation
Training
Training
Validate
Training
Validate
Training
Validate
Training
Training
K-fold cross-validation
The data is
splited into K
equally sized
blocks.
Each block takes
its turn as a
validation set
K-fold cross-validation
Averaging over the resulting K loss
Leave-One-Out Cross-Validation
This form of cross-validation is given
Leave-One-Out Cross-Validation
Rogers and Girolani (2012). A first course in
Machine Learning. CRC Press.
Regularised LSM
Regularised LSM
Regularised LSM
Regularised LSM
Regularised LSM
Stochastic Model
What information can we extract
from errors?
What can we expect about the
Stochastic Model
When
simulating a
system, it is
often
convenient to
consider the
random
behavior we
can observe
in real data
Stochastic Model
Stochastic Model
Error is different
each year,
sometimes
positive,
sometimes
negative
There is not
obvious
relationship
between errors
and years
Stochastic Model
Likelihood
Likelihood
Log of Likelihood
Taking log:
C does not
depend on w
Maximum Likelihood
Be careful!
Maximum
Likelihood
criterion may
favour high
order models:
risk of overfitting!
Coefficient of Determination
Coefficient of Determination
Coefficient of Determination
Variability in Parameters
How much could change the optimal
parameters given a different data
set?
Variability in Parameters
Variability in Parameters
Variability in Parameters
Estimation of Parameter
Covariance Matrix
Variability in Predictions
Variability in Predictions
Variability in Predictions
Variability in Predictions
Comments