Lecture 04
Sampling
Philipp Hennig
27 April 2021
Faculty of Science
Department of Computer Science
Chair for the Methods of Machine Learning
 #  date    content                        Ex │  #  date    content                        Ex
 1  20.04.  Introduction                    1 │ 14  09.06.  Logistic Regression             8
 2  21.04.  Reasoning under Uncertainty       │ 15  15.06.  Exponential Families
 3  27.04.  Continuous Variables            2 │ 16  16.06.  Graphical Models                9
 4  28.04.  Monte Carlo                       │ 17  22.06.  Factor Graphs
 5  04.05.  Markov Chain Monte Carlo        3 │ 18  23.06.  The Sum-Product Algorithm      10
 6  05.05.  Gaussian Distributions            │ 19  29.06.  Example: Topic Models
 7  11.05.  Parametric Regression           4 │ 20  30.06.  Mixture Models                 11
 8  12.05.  Understanding Deep Learning       │ 21  06.07.  EM
 9  18.05.  Gaussian Processes              5 │ 22  07.07.  Variational Inference          12
10  19.05.  An Example for GP Regression      │ 23  13.07.  Example: Topic Models
11  25.05.  Understanding Kernels           6 │ 24  14.07.  Example: Inferring Topics      13
12  26.05.  Gauss-Markov Models               │ 25  20.07.  Example: Kernel Topic Models
13  08.06.  GP Classification               7 │ 26  21.07.  Revision
A Computational Challenge
Integration is the core computation of probabilistic inference
All of these are expectations ∫ f(x) p(x) dx for a suitable f:

▶ f(x) = x (mean)
▶ f(x) = (x − E_p(x))² (variance)
▶ f(x) = x^p (p-th moment)
▶ f(x) = −log p(x) (entropy)
▶ …
The Toolbox
Framework:
∫ p(x₁, x₂) dx₂ = p(x₁)        p(x₁, x₂) = p(x₁ | x₂) p(x₂)        p(x | y) = p(y | x) p(x) / p(y)
Modelling:
▶ Directed Graphical Models
▶ …

Computation:
▶ Monte Carlo
▶ …
Randomized Methods — Monte Carlo
the idea
[image source: Wikipedia, user fruitpunchline]
A method from a different age
Monte Carlo Methods and the Manhattan Project
[images: Los Alamos National Laboratory / Wikipedia]
The FERMIAC
analog Monte Carlo computer
[images: Wikipedia]
Example
a dumb way to compute π
▶ ratio of quarter-circle to square: π/4
▶ π = 4 ∫ I(x⊺x < 1) u(x) dx
▶ draw x ∼ u(x), check x⊺x < 1, count (a code sketch follows)

> 3.13708
> 3.14276

[figure: uniform samples on the unit square [0, 1]², with the quarter circle x⊺x < 1]
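A minimal sketch of this estimator in Python (numpy assumed; the function name estimate_pi and all variable names are mine):

```python
import numpy as np

def estimate_pi(num_samples, seed=0):
    """Monte Carlo estimate of pi: draw x ~ U([0,1]^2), count x^T x < 1."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(size=(num_samples, 2))   # uniform points in the unit square
    inside = (x ** 2).sum(axis=1) < 1.0      # indicator I(x^T x < 1)
    return 4.0 * inside.mean()               # 4 * fraction inside quarter circle

print(estimate_pi(100_000))                  # ~3.14, varies with the seed
```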
Monte Carlo works on every Integrable Function
is this a good thing?
ϕ := ∫ f(x) p(x) dx = E_p(f)

▶ Let x_s ∼ p, s = 1, …, S, i.i.d. (i.e. p(x_s = x) = p(x) and p(x_s, x_t) = p(x_s) p(x_t) ∀ s ≠ t). The Monte Carlo estimator is

  ϕ̂ := (1/S) ∑_{s=1}^S f(x_s)

▶ Its expectation is

  E(ϕ̂) = (1/S) ∑_{s=1}^S ∫ f(x_s) p(x_s) dx_s = (1/S) ∑_{s=1}^S E(f(x_s)) = ϕ

  i.e. ϕ̂ is an unbiased estimator!
▶ the only requirement for this is that ∫ f(x) p(x) dx exists (i.e. f must be Lebesgue-integrable relative to p). Monte Carlo integration can even work on discontinuous functions.
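A quick numerical check of unbiasedness, as a sketch with my own choice of example (p = N(0, 1), f(x) = x², so E_p(f) = 1 exactly): individual small-S estimates scatter widely, but their average hits the true value.

```python
import numpy as np

rng = np.random.default_rng(1)
f = lambda x: x ** 2                  # for p = N(0,1): E_p(f) = 1 exactly

S, runs = 10, 100_000                 # deliberately tiny S per estimate
phi_hat = f(rng.normal(size=(runs, S))).mean(axis=1)

print(phi_hat.mean())                 # average over runs ~1.0: no bias
print(phi_hat.std())                  # but single estimates scatter widely
```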
Sampling converges slowly
expected square error
E((ϕ̂ − ϕ)²) = E[((1/S) ∑_{s=1}^S (f(x_s) − ϕ))²]
            = (1/S²) ∑_{s=1}^S ∑_{r=1}^S [E(f(x_s) f(x_r)) − ϕ E(f(x_s)) − E(f(x_r)) ϕ + ϕ²]
            = (1/S²) ∑_{s=1}^S [∑_{r≠s} (ϕ² − 2ϕ² + ϕ²) + (E(f²) − ϕ²)]    (the first bracket vanishes; the second is var(f) := E(f²) − ϕ²)
            = (1/S) var(f) = O(S⁻¹)

▶ Thus, the expected error (the square root of the expected square error) drops as O(S^(−1/2))
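The π estimator from before illustrates this O(S^(−1/2)) rate; a sketch (numpy, my variable names):

```python
import numpy as np

rng = np.random.default_rng(2)
for S in [10**k for k in range(2, 7)]:
    x = rng.uniform(size=(S, 2))
    est = 4.0 * ((x ** 2).sum(axis=1) < 1.0).mean()
    print(f"S = {S:>7}: estimate = {est:.5f}, |error| = {abs(est - np.pi):.1e}")
# error shrinks roughly by sqrt(10) ~ 3.2x per factor-10 increase in S
```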
sampling is for rough guesses
recall example computation for π
[figure: left, Monte Carlo estimate ϕ̂ of π against # samples (10⁰ to 10⁵); right, log-log plot of the error against # samples, following the √(var(f)/S) rate]
Reminder: Change of Measure
The transformation law
Let X = (X₁, …, X_d) have a joint density p_X. Let g : ℝ^d → ℝ^d be continuously differentiable and injective, with non-vanishing Jacobian J_g. Then Y = g(X) has density

  p_Y(y) = p_X(g⁻¹(y)) · |J_{g⁻¹}(y)|   if y is in the range of g,
  p_Y(y) = 0                            otherwise.

The Jacobian J_g is the d × d matrix with [J_g(x)]_ij = ∂g_i(x)/∂x_j.
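A sketch verifying the transformation law on a toy case of my own choosing: for X ∼ U[0, 1] and g(x) = x², the law gives p_Y(y) = 1/(2√y) on (0, 1), hence the CDF P(Y ≤ t) = √t.

```python
import numpy as np

rng = np.random.default_rng(3)
y = rng.uniform(size=1_000_000) ** 2      # Y = g(X) = X^2 with X ~ U[0, 1]

# transformation law gives p_Y(y) = 1/(2 sqrt(y)), hence CDF P(Y <= t) = sqrt(t)
for t in [0.1, 0.25, 0.5, 0.9]:
    print(np.mean(y <= t), np.sqrt(t))    # empirical vs. analytic, should agree
```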
Some special cases
sampling from an exponential distribution is analytic
p(x) = (1/λ) e^(−x/λ)        ∫₀ˣ p(x′) dx′ = 1 − e^(−x/λ)

draw u ∼ U[0, 1] and set 1 − u = 1 − e^(−x/λ), i.e. x = −λ log(u) (see the sketch below)

[figure: exponential density with samples, x ∈ [0, 4]]
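As a sketch of this inverse-CDF trick (numpy; λ and the variable names are mine):

```python
import numpy as np

rng = np.random.default_rng(4)
lam = 1.5

u = rng.uniform(size=1_000_000)
x = -lam * np.log(u)                       # inverse-CDF transform of uniforms

print(x.mean(), lam)                       # mean of this exponential is lambda
print(np.mean(x <= lam), 1 - np.exp(-1))   # CDF at lambda: 1 - e^{-1} ~ 0.632
```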
Example: Sampling from a Beta Distribution
uniform variables
Consider u ∼ U[0, 1] (i.e. u ∈ [0, 1], and p(u) = 1). The variable x = u1/α has the Beta density
p_x(x) = p_u(u(x)) · |∂u(x)/∂x| = α · x^(α−1) = B(x; α, 1).
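A quick check of this transform (my own parameter choice α = 3; the CDF of B(x; α, 1) is x^α and its mean is α/(α+1)):

```python
import numpy as np

rng = np.random.default_rng(5)
alpha = 3.0

x = rng.uniform(size=1_000_000) ** (1 / alpha)   # x = u^{1/alpha} ~ Beta(alpha, 1)
print(x.mean(), alpha / (alpha + 1))             # both ~0.75
print(np.mean(x <= 0.5), 0.5 ** alpha)           # CDF of B(x; alpha, 1) is x^alpha
```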
Homework:
Consider two independent variables

  X ∼ G(α, θ)        Y ∼ G(β, θ),

where G(ξ; α, θ) = ξ^(α−1) e^(−ξ/θ) / (Γ(α) θ^α) is the Gamma distribution. Show that the random variable Z = X/(X + Y) is Beta distributed, with the density

  p(Z = z) = B(z; α, β) = Γ(α + β)/(Γ(α) Γ(β)) · z^(α−1) (1 − z)^(β−1).

(A numerical sanity check of this claim appears below.)
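The proof is the homework; a Monte Carlo sanity check of the claim is easy, though (a sketch; numpy's Generator.gamma uses the same shape/scale parametrization as above):

```python
import numpy as np

rng = np.random.default_rng(6)
alpha, beta, theta = 2.0, 5.0, 1.3

x = rng.gamma(shape=alpha, scale=theta, size=1_000_000)
y = rng.gamma(shape=beta, scale=theta, size=1_000_000)
z = x / (x + y)

print(z.mean(), alpha / (alpha + beta))      # Beta(alpha, beta) mean
print(z.var(), alpha * beta / ((alpha + beta) ** 2 * (alpha + beta + 1)))  # Beta variance
```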
▶ samples from a probability distribution can be used to estimate expectations, roughly
▶ ‘random numbers’ don’t really need to be unpredictable, as long as they have as little structure as
possible
▶ uniformly distributed random numbers can be transformed into other distributions. In some cases this can be done efficiently, and it is worth looking for such a transformation
What do we do if we don’t know a good transformation?
Why is sampling hard?
Sampling is harder than global optimization
p(x) = p̃(x) / Z

assuming that it is possible to evaluate the unnormalized density p̃ (but not p) at arbitrary points.

Typical example: compute moments of a posterior

  p(x | D) = p(D | x) p(x) / ∫ p(D, x) dx        as        E_{p(x|D)}(xⁿ) ≈ (1/S) ∑_{s=1}^S x_sⁿ   with x_s ∼ p(x | D)
Rejection Sampling
a simple method [Georges-Louis Leclerc, Comte de Buffon, 1707–1788]
[figure: rejection sampling from a one-dimensional density, x ∈ [−4, 10]]
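The slide itself only shows the picture; as a hedged sketch, here is a generic rejection sampler (my own implementation and naming, assuming we can evaluate the unnormalized target p̃ and an envelope c·q ≥ p̃):

```python
import numpy as np

def rejection_sample(p_tilde, q_sample, q_pdf, c, size, seed=0):
    """Draw `size` samples from p ~ p_tilde by rejection under envelope c*q.

    Requires c * q_pdf(x) >= p_tilde(x) for all x.
    """
    rng = np.random.default_rng(seed)
    out = []
    while len(out) < size:
        x = q_sample(rng)                    # propose x ~ q
        u = rng.uniform(0, c * q_pdf(x))     # uniform height under the envelope
        if u < p_tilde(x):                   # accept if the point falls under p_tilde
            out.append(x)
    return np.array(out)

# example: sample N(0,1) from its unnormalized density, wider Gaussian proposal
p_tilde = lambda x: np.exp(-0.5 * x ** 2)
q_pdf = lambda x: np.exp(-0.5 * (x / 2) ** 2) / (2 * np.sqrt(2 * np.pi))
s = rejection_sample(p_tilde, lambda rng: rng.normal(0, 2), q_pdf, c=6.0, size=10_000)
print(s.mean(), s.std())                     # ~0, ~1
```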
The Problem with Rejection Sampling
the curse of dimensionality [MacKay, §29.3]
Example:
▶ p(x) = N(x; 0, σ_p²), q(x) = N(x; 0, σ_q²) in D dimensions
▶ σ_q > σ_p
▶ the optimal c is given by

  c = (2πσ_q²)^(D/2) / (2πσ_p²)^(D/2) = (σ_q/σ_p)^D = exp(D ln(σ_q/σ_p))

▶ acceptance rate is ratio of volumes: 1/c
▶ rejection rate rises exponentially in D
▶ for σ_q/σ_p = 1.1 and D = 100, 1/c < 10⁻⁴ (checked numerically below)

[figure: p(x) and envelope c·q(x) in one dimension, x ∈ [−4, 4]]
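A one-line check of the last bullet (a sketch):

```python
import numpy as np
print(np.exp(-100 * np.log(1.1)))   # acceptance rate 1/c ~ 7.3e-05 < 1e-4
```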
Importance Sampling
a slightly less simple method
[figure: two panels illustrating importance sampling, with axes x and f(x), g(x)]
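The standard importance-sampling estimator behind this picture is E_p(f) ≈ (1/S) ∑_s w_s f(x_s) with weights w_s = p(x_s)/q(x_s) and x_s ∼ q; a sketch with my own toy choice of p, q, and f:

```python
import numpy as np

rng = np.random.default_rng(7)
S = 100_000

# E_p(f) for p = N(0,1) and f(x) = x^2 (true value 1), proposal q = N(0, 2^2)
p = lambda t: np.exp(-0.5 * t ** 2) / np.sqrt(2 * np.pi)
q = lambda t: np.exp(-0.5 * (t / 2) ** 2) / (2 * np.sqrt(2 * np.pi))

x = rng.normal(0.0, 2.0, size=S)     # x_s ~ q
w = p(x) / q(x)                      # importance weights

print(np.mean(w * x ** 2))           # ~1.0: importance-sampled estimate of E_p(x^2)
print(np.mean(w))                    # ~1.0: weights average to 1 for normalized p
```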
Sampling (Monte Carlo) Methods
Sampling is a way of performing rough probabilistic computations, in particular for expectations
(including marginalization).
▶ samples from a probability distribution can be used to estimate expectations, roughly
▶ uniformly distributed random numbers can be transformed into other distributions. In some cases this can be done efficiently, and it is worth looking for such a transformation
▶ Rejection sampling is a primitive but exact method that works with intractable models
▶ Importance sampling makes more efficient use of samples, but can have high variance (and this
may not be obvious)
Next Lecture:
▶ Markov Chain Monte Carlo methods are more elaborate ways of getting approximate answers to
intractable problems.