Assignment 1
Suppose that in your coin flip experiment, you observed $\alpha_H$ heads and $\alpha_T$ tails. Let $\theta$ denote the probability of observing heads, whose prior distribution follows Beta$(\beta_H, \beta_T)$, where $\beta_H$ and $\beta_T$ are two positive parameters. Prove that the posterior distribution $P(\theta \mid D)$ ($D$ denotes the observed coin flips) follows Beta$(\beta_H + \alpha_H, \beta_T + \alpha_T)$. What is the mean of $P(\theta \mid D)$? What is the MAP estimator $\hat{\theta}_{MAP}$ of $\theta$?
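As a starting point, recall the Beta density and Bayes' rule (standard definitions, restated here for convenience):

$$P(\theta) = \frac{\theta^{\beta_H - 1} (1 - \theta)^{\beta_T - 1}}{B(\beta_H, \beta_T)}, \qquad P(\theta \mid D) \propto P(D \mid \theta)\, P(\theta).$$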
For this question, assume that $x_1, \ldots, x_N \in \mathbb{R}$ are i.i.d. samples drawn from the same underlying distribution, which is Gaussian $\mathcal{N}(\mu, \sigma^2)$.
1. (5 points) Let $\hat{\mu}_{MLE}$ denote the MLE estimator of $\mu$. Please prove that $\hat{\mu}_{MLE}$ is unbiased.
Hint: The bias of an estimator of the parameter $\mu$ is defined to be the difference between the expected value of the estimator and $\mu$.
2. (10 points) If the true value of $\mu$ is unknown, then the MLE estimator of $\sigma^2$ is as follows.

$$\hat{\sigma}^2_{MLE} = \frac{1}{N} \sum_{i=1}^{N} (x_i - \hat{\mu}_{MLE})^2$$

Please prove that $\hat{\sigma}^2_{MLE}$ is biased.
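Though not a substitute for the proof, a quick simulation makes the bias visible. This is only an illustrative sketch; the settings $\mu = 0$, $\sigma^2 = 1$, $N = 5$, and the trial count are arbitrary choices, not part of the assignment:

```python
import numpy as np

# Empirically estimate E[sigma^2_MLE] for Gaussian samples.
# Illustrative settings (assumed, not from the assignment): mu=0, sigma=1, N=5.
rng = np.random.default_rng(0)
mu, sigma, N, trials = 0.0, 1.0, 5, 100_000

x = rng.normal(mu, sigma, size=(trials, N))
mu_mle = x.mean(axis=1, keepdims=True)          # MLE of mu in each trial
sigma2_mle = ((x - mu_mle) ** 2).mean(axis=1)   # MLE of sigma^2 in each trial

# Averages to about 0.8 = (N-1)/N * sigma^2 rather than 1.0: the estimator is biased.
print(sigma2_mle.mean())
```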
Given the training data set shown in Figure 1, we train a Naïve Bayes classifier with it. Each row refers to a person, where the categorical features (age, income, etc.) and the class label (whether he/she buys a computer) are shown.
1. (5 points) How many independent parameters would there be for the Naïve Bayes classifier trained with this data? What are they? Justify your answers.
2. (10 points) Using standard MLE, what are the estimated values for these parameters?
3. (5 points) Given a new person with features $x = (\text{youth}, \text{medium}, \text{yes}, \text{fair})$, what is $P(y = \text{yes} \mid x)$? Would the Naïve Bayes classifier predict $y = \text{yes}$ or $y = \text{no}$ for this person?
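For part 3, recall the standard Naïve Bayes prediction rule (stated here for reference), where the product runs over the four features:

$$P(y \mid x) \propto P(y) \prod_{j} P(x_j \mid y), \qquad \hat{y} = \arg\max_{y \in \{\text{yes},\, \text{no}\}} P(y) \prod_{j} P(x_j \mid y).$$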
Figure 1: Training Data for Naïve Bayes Classifier
Suppose we have two positive examples $x_1 = (1, 0)$ and $x_2 = (0, -1)$ and two negative examples $x_3 = (0, 1)$ and $x_4 = (-1, 0)$. Apply the standard gradient ascent method to train a logistic regression classifier (without any regularization terms). Initialize the weight vector with two different values, setting $w_0^{(0)} = 0$ (e.g. $w^{(0)} = (0, 0, 0)'$, $w^{(0)} = (0, 1, 0)'$). Would the final weight vector $w^*$ be the same for the two different initial values? What are the values? Please explain your answer. You may assume the learning rate to be a positive real constant $\eta$.
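A minimal sketch of the update, assuming the usual conditional log-likelihood objective and a bias feature prepended to each example; the values of $\eta$ and the step count below are illustrative choices, not prescribed by the question:

```python
import numpy as np

# Each row is (1, x1, x2); the leading 1 carries the bias weight w0.
X = np.array([[1, 1, 0], [1, 0, -1],    # positive examples (y = 1)
              [1, 0, 1], [1, -1, 0]])   # negative examples (y = 0)
y = np.array([1, 1, 0, 0])

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gradient_ascent(w, eta=0.1, steps=10_000):
    # Gradient of the conditional log-likelihood: X^T (y - P(y=1|x)).
    for _ in range(steps):
        w = w + eta * X.T @ (y - sigmoid(X @ w))
    return w

print(gradient_ascent(np.zeros(3)))             # w^(0) = (0, 0, 0)'
print(gradient_ascent(np.array([0., 1., 0.])))  # w^(0) = (0, 1, 0)'
```

Watch how the weight vectors behave as `steps` grows; your written answer should explain what you observe.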
1. (5 points) Gaussian Naïve Bayes and Logistic Regression. Suppose a logistic regression model and a Gaussian Naïve Bayes classifier are trained for a binary classification task $f: X \to Y$, where $X = \langle X_1, \ldots, X_d \rangle \in \mathbb{R}^d$ is a vector of real-valued features and $Y = \{0, 1\}$ is the binary label. After training, we get the weight vector $w = \langle w_0, w_1, \ldots, w_d \rangle$ for the logistic regression model.
Recall that in Gaussian Naïve Bayes, each feature $X_i$ ($i = 1, \ldots, d$) is assumed to be conditionally independent given the label $Y$, so that $P(X_i \mid Y = k) = \mathcal{N}(\mu_{ik}, \sigma_{ik})$ ($k = 0, 1$; $i = 1, \ldots, d$). We assume that the marginal distribution of class labels $P(Y)$ follows a Bernoulli distribution with parameter $\theta$, i.e. $P(Y = 1) = \theta$ and $P(Y = 0) = 1 - \theta$.
– How many independent parameters are there in this Gaussian Naïve Bayes classifier? What are they?
– Can we translate $w$ into the parameters of an equivalent Gaussian Naïve Bayes classifier without any extra assumption? If so, justify your answer. Otherwise, please specify what extra assumption(s) you need to complete the translation and explain why.
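For the translation question, it may help to recall the posterior form that logistic regression assumes, which any candidate translation has to match:

$$P(Y = 1 \mid X) = \frac{1}{1 + \exp\!\left(-\left(w_0 + \sum_{i=1}^{d} w_i X_i\right)\right)}.$$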
2. (25 points) Implementation of Gaussian Naïve Bayes and Logistic Regression. Compare the two approaches on the bank note authentication dataset, which can be downloaded from http://archive.ics.uci.edu/ml/datasets/banknote+authentication. A complete description of the dataset can also be found on this webpage. In short, for each row the first four columns are the feature values and the last column is the class label (0 or 1). You will observe learning curves similar to those Dr. He mentioned in class. Implement a Gaussian Naïve Bayes classifier (recall the conditional independence assumption mentioned before) and a logistic regression classifier. Please write your own code from scratch and do NOT use existing functions or packages which can provide you the Naïve Bayes classifier/logistic regression class or fit/predict function (e.g. sklearn). But you can use some basic linear algebra/probability functions (e.g. numpy.sqrt(), numpy.random.normal()). For the Naïve Bayes classifier, assume that $P(x_i \mid y) \sim \mathcal{N}(\mu_{i,k}, \sigma_{i,k})$, where $x_i$ is a feature in the bank note data and $y$ is the class label. Use three-fold cross-validation to split the data and train/test your models. A minimal illustrative sketch of the Gaussian Naïve Bayes steps appears after the requirements below.
– (5 points) For each algorithm: briefly describe how you implement it by giving the pseudocode. The pseudocode must include equations for estimating the model parameters and for classifying a new example. Remember, this should not be a printout of your code, but a high-level outline description. Include the pseudocode in your pdf file (or .doc/.docx file). Submit the actual code as a single zip file named yourFirstName-yourLastName.zip IN ADDITION TO the pdf file (or .doc/.docx file).
– (10 points) Plot a learning curve: the accuracy vs. the size of the training set. Plot 6 points for the curve, using [.01 .02 .05 .1 .625 1] RANDOM fractions of your training set and testing on the full test set each time. Average your results over 5 runs for each random fraction (e.g. 0.05) of the training set. Plot both the Naïve Bayes and logistic regression learning curves on the same figure. For logistic regression, do not use any regularization term.
– (10 points) Show the power of the generative model: use your trained Naïve Bayes classifier (with the complete training set) to generate 400 examples from class $y = 1$. Report the mean and variance of the generated examples (for each fold, over 1 run) and compare them with the mean and variance of the corresponding training data (examples in the training set with $y = 1$). Try to explain what you observe in this comparison.
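As referenced above, here is a minimal, illustrative sketch of the Gaussian Naïve Bayes estimation and prediction steps under the assumptions in this question. It is not a template for your submission, and all function and variable names are hypothetical:

```python
import numpy as np

def fit_gnb(X, y):
    # MLE of the class prior and of the per-class, per-feature
    # Gaussian parameters mu_{i,k} and sigma_{i,k}^2.
    params = {}
    for k in (0, 1):
        Xk = X[y == k]
        params[k] = {
            "prior": len(Xk) / len(X),   # P(y = k)
            "mu": Xk.mean(axis=0),       # mean of each feature given y = k
            "var": Xk.var(axis=0),       # MLE variance of each feature given y = k
        }
    return params

def predict_gnb(params, X):
    # Pick the class maximizing log P(y) + sum_i log N(x_i; mu_{i,k}, sigma_{i,k}^2).
    scores = []
    for k in (0, 1):
        p = params[k]
        log_lik = -0.5 * (np.log(2 * np.pi * p["var"])
                          + (X - p["mu"]) ** 2 / p["var"]).sum(axis=1)
        scores.append(np.log(p["prior"]) + log_lik)
    return np.argmax(np.stack(scores, axis=1), axis=1)
```

Generating examples from class $y = 1$ for the last part then amounts to drawing numpy.random.normal(mu, sqrt(var)) for each feature, which uses only the basic functions permitted above.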