Machine Learning and Pattern Recognition: Minimal Stochastic Variational Inference Demo

This Python code demonstrates black-box stochastic variational inference applied to logistic regression. It defines functions for calculating the variational cost and gradients, and for the logistic regression negative log-likelihood and gradients. These are used in a stochastic gradient descent loop to learn the variational posterior approximation.

#!/usr/bin/env python
"""
This is a demo (not production code!) of black-box stochastic variational
inference applied to logistic regression.

The main black-box stochastic variational "engine" is svi_grad.

In principle it could be applied to any model with a differentiable negative
log-likelihood function. Although for large models you may want to make the
posterior approximation diagonal, and you may want to build a more structured
prior than one shared variance for all weights.

This demonstration is a quick line-by-line port of the Matlab/Octave code in
svi_minimal.m. The Matlab/Octave tar-ball has some more code with some checks,
which I haven't ported.

Iain Murray, November 2016, November 2017
"""

import numpy as np
import matplotlib.pyplot as plt

def svi_grad(mm, LL, Lsigma_w, neg_log_like_grad):
    """
    cost function and gradients for black-box stochastic variational inference

    Inputs:
                   mm D,    mean of variational posterior
                   LL D,D   lower-triangular Cholesky decomposition of
                            variational posterior, with diagonal log-transformed
             Lsigma_w scalar: log of prior standard deviation over weights
    neg_log_like_grad fn    -ve log-likelihood of model and gradients wrt weights
                            Could be an unbiased estimate based on a mini-batch,
                            we only get unbiased estimates of cost and gradients
                            anyway.

    Outputs:
                J scalar: estimate of variational cost function = -ELBO
           mm_bar D,     with derivatives wrt mm, ...
           LL_bar D,D    ...LL, ...
     Lsigma_w_bar        ...and Lsigma_w
    """

    # Unpack Cholesky factor of posterior covariance and prior variance
    # from their unconstrained forms.
    D = mm.size
    L = np.tril(LL)
    diag = np.diag_indices_from(L)
    L[diag] = np.exp(LL[diag])
    sigma2_w = np.exp(2*Lsigma_w)

    # The estimate of the variational cost function
    J1 = -0.5*D - np.sum(np.diag(LL)) # - D/2*log(2*pi)
    tmp = (np.sum(L*L) + np.dot(mm, mm)) / sigma2_w
    J2 = tmp/2.0 + D*Lsigma_w # + D/2*log(2*pi)
    nu = np.random.randn(D)
    ww = mm + np.dot(L, nu) # Using random weight
    J3, ww_bar = neg_log_like_grad(ww)
    J = J1 + J2 + J3
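    # J is an unbiased one-sample estimate of the variational cost: J1 and J2
    # are computed exactly, and averaging J3 over nu gives E_q[-log p(data|w)].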

    # The derivatives
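    # ww_bar = dJ3/dww feeds back through ww = mm + L @ nu (reparameterization):
    # dJ3/dmm = ww_bar and dJ3/dL = ww_bar nu'. The mm/sigma2_w and L/sigma2_w
    # terms come from J2. On the diagonal, the chain rule for the
    # log-transformed entries multiplies by L[diag], and J1 contributes -1.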
    mm_bar = mm/sigma2_w + ww_bar
    L_bar = L/sigma2_w + np.tril(np.dot(ww_bar[:,None], nu[None,:]))
    LL_bar = L_bar
    LL_bar[diag] = (L_bar[diag] * L[diag]) - 1
    Lsigma_w_bar = D - tmp

    return J, mm_bar, LL_bar, Lsigma_w_bar

def logreg_negLlike(ww, X, yy):
    """
    Negative log-likelihood and gradients of logistic regression

    negLlike, ww_bar = logreg_negLlike(ww, X, yy)

    There's no separate bias term. So X needs augmenting with a
    constant column to include a bias.

    Inputs:
        ww D,
         X N,D
        yy N,

    Outputs:
        negLlike scalar
          ww_bar D,
    """

    # Force targets to be +/- 1
    yy = 2*(yy==1) - 1

    # forward computation of error
    sigma = 1.0/(1.0 + np.exp(-yy*np.dot(X, ww)))
    negLlike = -np.sum(np.log(sigma))
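    # d(-log sigmoid(z))/dz = sigmoid(z) - 1, with z_n = yy_n * (X[n] @ ww),
    # so the gradient wrt ww is X' (yy * (sigma - 1)).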

    # reverse computation of gradients
    ww_bar = np.dot(X.T, yy*(sigma - 1))

    return negLlike, ww_bar

if __name__ == "__main__":
    # Generate synthetic dataset and note corresponding likelihood
    np.random.seed(0)
    D = 3
    ww = 5*np.random.randn(D)
    N = 20
    X = np.hstack((np.random.randn(N, D-1), np.ones((N,1))))
    yy = np.random.rand(N) < (1.0/(1.0 + np.exp(-np.dot(X,ww))))
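    # Labels are Bernoulli draws with p(y=1|x) = sigmoid(x'ww); the final
    # column of ones in X acts as the bias feature.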
    neg_log_like_grad = lambda w: logreg_negLlike(w, X, yy)

    # If you rerun the fitting with different seeds you'll see we don't
    # get quite the same answer each time. I'd need to set the learning rate
    # schedule better, and/or run for longer, to get better convergence.
    np.random.seed(2)

    # Simple stochastic steepest descent with decreasing learning rate.
    # Here each update includes the whole dataset because the dataset is
    # tiny. However, we still have stochastic updates, as each update
    # uses a different random weight drawn from the current variational
    # approximation to the posterior.
    Lsigma_w = np.log(10) # Initialize prior width broader than it actually
                          # was. (We'll learn to fix that.)
    mm = np.zeros(D)
    LL = np.zeros((D, D))
    diag = np.diag_indices_from(LL)
    LL[diag] = Lsigma_w
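    # The variational posterior starts equal to the (broad) prior: mean zero
    # and covariance exp(2*Lsigma_w)*I, since the log-diagonal of LL is set
    # to Lsigma_w.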
    eta0 = 0.1
    tau = 0.1
    for ii in range(10000):
        eta = eta0 / (1 + tau*eta0*ii)
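        # This ~1/ii decay gives steps that are square-summable but not
        # summable, the usual Robbins-Monro style schedule for stochastic
        # approximation.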
        J, mm_bar, LL_bar, Lsigma_w_bar = svi_grad(mm, LL, Lsigma_w,
                                                   neg_log_like_grad)
        mm = mm - eta*mm_bar
        LL = LL - eta*LL_bar
        Lsigma_w = Lsigma_w - eta*Lsigma_w_bar
        # hack to stop extreme settings of hyperparameters early in training:
        Lsigma_w = np.min([100.0, Lsigma_w])

    # Extract covariance of the variational posterior from its
    # unconstrained parameterization.
    L = LL
    L[diag] = np.exp(LL[diag])
    V = np.dot(L, L.T)

    # Plot data:
    plt.clf()
    plt.plot(X[yy==1, 0], X[yy==1, 1], 'bx')
    plt.plot(X[yy==0, 0], X[yy==0, 1], 'ro')
    plt.legend(('y=1', 'y=0'))

    # Overlay contour plot of approximate predictive distribution:
    x_grid = np.arange(-3, 3, 0.05)
    X1, X2 = np.meshgrid(x_grid, x_grid)
    NG = X1.size
    X_test = np.hstack((X1.reshape(NG,1), X2.reshape(NG,1), np.ones((NG,1))))
    kappa = 1.0 / np.sqrt(1 + (np.pi/8)*np.sum(np.dot(X_test,V)*X_test, 1))
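    # kappa implements the standard approximation for averaging the logistic
    # function over a Gaussian: with a ~ N(m, s2),
    #     E[sigmoid(a)] approx sigmoid(m / sqrt(1 + pi*s2/8)),
    # where here m = X_test @ mm and s2 = diag(X_test @ V @ X_test.T).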
    p_test = 1.0 / (1+np.exp(-np.dot(X_test,mm)*kappa))
    P = np.reshape(p_test, X1.shape)
    CS = plt.contour(X1, X2, P, [0.1,0.25,0.5,0.75,0.9])
    plt.clabel(CS)
    plt.xlabel('x_1')
    plt.ylabel('x_2')
    plt.title('Contours of p(y=1|x,D)')
    plt.show()
