Pattern Recognition and Machine Learning
Pattern Recognition and Machine Learning
Example
Handwritten Digit Recognition
Over-fitting
Polynomial Coefficients
Regularization
Penalize large coefficient values
Regularization:
Regularization:
Regularization:
vs.
Polynomial Coefficients
Probability Theory
Apples and Oranges
Probability Theory
Marginal Probability
Joint Probability
Conditional Probability
Probability Theory
Sum Rule
Product Rule
Bayes Theorem
Probability Densities
Transformed Densities
Expectations
Conditional Expectation
(discrete)
Approximate Expectation
(discrete and continuous)
Properties of
and
Maximum Likelihood
Determine
Predictive Distribution
Determine
Model Selection
Cross-Validation
Curse of Dimensionality
Curse of Dimensionality
Polynomial curve fitting, M = 3
Gaussian Densities in
higher dimensions
Decision Theory
Inference step
Determine either
or
Decision step
For given x, determine optimal t.
Truth
Decision
Regions
Reject Option
Decision step
For given x, make optimal
prediction, y(x), for t.
Loss function:
Generative vs Discriminative
Generative approach:
Model
Use Bayes theorem
Discriminative approach:
Model
directly
Entropy
Important quantity in
coding theory
statistical physics
machine learning
Entropy
Coding theory: x discrete with 8 possible states; how many
bits to transmit the state of x?
All states equally likely
Entropy
Entropy
In how many ways can N identical objects be allocated M
bins?
Entropy
Differential Entropy
Put bins of width along the real line
) when
Conditional Entropy
Mutual Information