
Q3: Wally's Winery

CS 189 Spring 2022


Author: Sean Lin
Wally Walter and the Walter family run the most popular winery in Napa. This summer, UC
Berkeley students are flooding in to try the Walter Family's fine wines! Wally's children,
Wilma and Willy, are slowly taking over ownership of the family business from Wally. However,
they do not possess the quintessential Walter talent: identifying red and white wines based
on attributes! To help them, Wally has decided to train a machine learning classifier to
distinguish Walter's Red Wine from Walter's White Wine.
In this exercise, we will build a logistic regression classifier to classify whether a specific
wine is a white (class label 0) or red (class label 1) wine, based on its features.
Use of automatic logistic regression libraries/packages is prohibited for this question. If
you are coding in Python, it is better to use scipy.special.expit for evaluating logistic functions,
as its code is numerically stable and doesn't produce NaN values or overflow errors.
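For instance, a naive sigmoid implemented directly with np.exp overflows for large-magnitude inputs, while expit handles them cleanly. A minimal illustrative check (the naive_sigmoid helper below is hypothetical, written only for this comparison):

import numpy as np
from scipy.special import expit

def naive_sigmoid(z):
    # np.exp(-z) overflows to inf for large negative z, emitting a RuntimeWarning
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([-1000.0, 0.0, 1000.0])
print(naive_sigmoid(z))  # [0.  0.5 1. ], with an overflow RuntimeWarning
print(expit(z))          # [0.  0.5 1. ], no warning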
# Import Relevant Libraries

import scipy
from scipy import io
from scipy.special import expit  # numerically stable logistic (sigmoid) function
import numpy as np
import matplotlib.pyplot as plt
import time

# Load Data

data = scipy.io.loadmat('wine_data/wine.mat')
features = data['X']
labels = data['y']
num_features=len(features[0])
num_examples = len(features)

print("Wine data currently has {} features and {} data


points".format(num_features, num_examples))

Wine data currently has 12 features and 6000 data points

Q3.1 Preprocessing the Data


Now that we have loaded the data, we are ready to preprocess it. Preprocessing comes in
three steps:
1) Append an extra (bias) feature and the labels to the data points
2) Split the data into training and test sets
3) Normalize the features
Notice that we split the data into training and test sets before we normalize the features. If
we normalized the features first and then split the data into different sets, each individual
set would no longer have zero mean and unit variance on its own (see the formula below).
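Concretely, each feature column is standardized within its split as $x' = \frac{x - \mu}{\sigma}$, where $\mu$ and $\sigma$ are the mean and standard deviation of that column in that split.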
# Q3.1

def append_feature(features, labels, num_features, num_examples):
    '''
    Append an extra (bias) feature to the data set.
    Then, append the labels to the data set.
    The resulting data matrix should look like this: [X', y]

    Where X' is the feature matrix with an additional feature, and y are
    the labels

    Inputs:
    - features: n x d
    - labels: n x 1
    - num_features: d
    - num_examples: n

    Output:
    - augmented data matrix: n x (d + 2)
    '''

    ### YOUR CODE HERE ###

    # append an all-ones bias feature as the last feature column
    n, d = features.shape
    X0 = np.ones((n, 1))
    featuresNew = np.hstack((features, X0))

    # append the labels as the final column of the data matrix
    featuresNew = np.hstack((featuresNew, labels))
    return featuresNew

def split_data(augmented_data, val_size=1000):
    '''
    Split the data into training and validation sets

    Input:
    - augmented data matrix: n x (d + 2)
    - val_size: k
    Output:
    - Training Set: (n - k) x (d + 2)
    - Validation Set: k x (d + 2)
    '''

    ### YOUR CODE HERE ###

    # shuffle the row indices, then carve off the last val_size of them
    n, d = augmented_data.shape
    x = np.arange(0, n).astype(int)
    np.random.shuffle(x)

    training_size = n - val_size

    t_id = x[:training_size]
    v_id = x[training_size:]

    training = augmented_data[t_id, :]
    validation = augmented_data[v_id, :]

    return training, validation

def normalize_features(train_set, val_set, num_features):
    '''
    Normalize the data by shifting it to 0 mean and rescaling it to
    have variance 1

    Input:
    - Training Set: (n - k) x (d + 2)
    - Validation Set: k x (d + 2)
    - num_features: d

    Output:
    - Normalized Training Set: (n - k) x (d + 2)
    - Normalized Validation Set: k x (d + 2)
    '''

    ### YOUR CODE HERE ###

    # exclude the bias and label columns from normalization
    sub = train_set[:, :-2]

    # calculate the mean and standard deviation of each column
    mean_train = np.mean(sub, axis=0)
    dev_train = np.std(sub, axis=0)

    # normalize the matrix based on the mean and standard deviation
    train_set[:, :-2] = (sub - mean_train) / dev_train

    # repeat for the validation set, using its own statistics
    sub = val_set[:, :-2]
    mean_val = np.mean(sub, axis=0)
    dev_val = np.std(sub, axis=0)
    val_set[:, :-2] = (sub - mean_val) / dev_val

    return train_set, val_set

val_size = 1000
train_size = num_examples - val_size
# total number of features after appending the bias feature
total_features = num_features + 1

augmented_data = append_feature(features, labels, num_features, num_examples)
train_set, val_set = split_data(augmented_data, val_size)
train_set, val_set = normalize_features(train_set, val_set, num_features)

Q3.3 Training Your Classifier


Now it is time to build and train your classifier! Every machine learning pipeline consists of
4 components:
1) Model/Classifier – In this case, Logistic Regression
2) Optimizer – In this case, Batch Gradient Descent
3) Loss – In this case, the negative log-likelihood (binary cross-entropy)
4) Data – Wine Data
In this exercise, our model is a logistic regression model. We will begin by optimizing it
with batch gradient descent, and the loss is the binary cross-entropy (the negative
log-likelihood). In batch gradient descent, we calculate the gradient at each time step using
all of the data points in the training set.

Your tasks are as follows:


1) Implement the Logistic Regression Function
2) Implement training with Batch Gradient Descent

To train the model with batch gradient descent, you must:


1) Compute the predictions from the classifier
2) Compute the loss (negative of the log-likelihood)
3) Compute the gradient of the loss with respect to the weight vector w
4) Perform the gradient descent step
5) Loop through steps 1-4 num_iters times

The formulas for these steps are summarized below.
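As a reference for steps 1-4, here is one standard formulation (a sketch, assuming the $\ell_2$ regularization suggested by the reg_const argument below, with $\lambda$ = reg_const and $\eta$ = the learning rate):

$$L(w) = -\sum_{i=1}^{n}\left[y_i \ln s_i + (1 - y_i)\ln(1 - s_i)\right] + \lambda \lVert w \rVert_2^2, \qquad s_i = \sigma(x_i^\top w)$$

$$\nabla_w L(w) = X^\top(s - y) + 2\lambda w, \qquad w \leftarrow w - \eta\, \nabla_w L(w)$$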
# Logistic Regression function
def sigmoid(w, X):
    '''
    Implement the logistic regression function

    Inputs:
    - w: d x 1 weight vector
    - X: n x d feature matrix

    Outputs:
    - s: n x 1 output of sigmoid
    '''

    ### YOUR CODE HERE ###

    # expit(z) = 1 / (1 + exp(-z)), evaluated in a numerically stable way
    return expit(np.matmul(X, w))

# Q3.2

def batch_train(train_set, total_features, num_iter, lr=0.0001, reg_const=0.1):
    '''
    Learn a weight vector for the logistic regression classifier by
    iterating over the data and updating the weights using batch
    gradient descent

    Inputs:
    - train set: n x (d + 2)
    - total features: d + 1
    - # of training iterations
    - learning rate
    - l2 regularization constant

    Outputs:
    - w: (d + 1) x 1 weight vector
    - list of losses: (# iters + 1) x 1 vector
    '''

    # We want to keep track of the loss per iteration so that we can plot it later
    loss = np.zeros((num_iter+1,))

    # Initialize variables
    w = np.zeros((total_features,))
    grad = np.zeros((total_features,))

    ### YOUR CODE HERE ###
    # One possible completion (an assumption, not the only valid one):
    # L2-regularized binary cross-entropy loss minimized by batch gradient
    # descent. A small epsilon guards the logs against s = 0 or s = 1.
    X = train_set[:, :total_features]
    y = train_set[:, total_features]
    eps = 1e-12

    s = sigmoid(w, X)
    loss[0] = -np.sum(y * np.log(s + eps) + (1 - y) * np.log(1 - s + eps)) + reg_const * np.dot(w, w)

    for i in np.arange(num_iter):
        grad = np.matmul(X.T, s - y) + 2 * reg_const * w
        w = w - lr * grad
        s = sigmoid(w, X)

        loss[i+1] = -np.sum(y * np.log(s + eps) + (1 - y) * np.log(1 - s + eps)) + reg_const * np.dot(w, w)

        if i % 500 == 0:
            print("Loss at Iteration {} is {}:".format(i, loss[i+1]))
    return w, loss

def calc_time(start_time, end_time):
    '''
    Prints the time that a process takes to complete in a readable manner
    '''
    hours, rem = divmod(end_time - start_time, 3600)
    minutes, seconds = divmod(rem, 60)
    print("Training completed with elapsed time {:0>2}:{:0>2}:{:05.2f}".format(int(hours), int(minutes), seconds))
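A quick sanity check of the helper on a made-up duration:

calc_time(0, 3725.5)
# Training completed with elapsed time 01:02:05.50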

File "<ipython-input-130-b564bb3bb9d7>", line 45


s =
^
SyntaxError: invalid syntax

num_iter = 7000

start_time = time.time()

w, loss = batch_train(train_set, total_features, num_iter)

end_time = time.time()
calc_time(start_time, end_time)

# Plotting cost vs. iterations

plt.plot(np.arange(num_iter+1), loss)
plt.xlabel('Number of iterations in training')
plt.ylabel('Cost at the end of training')
plt.title('Training loss vs. Number of iterations for Batch Gradient Descent')
plt.savefig('Wine_GD.png')
plt.show()

# Checking on validation set
s_test = sigmoid(w, val_set[:, :total_features])
diffe = np.rint(s_test) - val_set[:, total_features]
accuracy = (np.true_divide(diffe.size - np.count_nonzero(diffe), val_size)) * 100
print(np.rint(s_test).sum())
print("Validation Accuracy is %.2f%%" % (accuracy))

You should expect to see the loss decrease significantly in the first 1000 iterations. The
validation accuracy should be greater than 99%.

Q3.5 Training Your Classifier using Stochastic Gradient Descent


Now, we will try to train the classifier using stochastic gradient descent. In stochastic
gradient descent, we calculate the gradient at each time step using a single, randomly
selected data point. Because we are calculating the gradient from one sample per iteration,
stochastic gradient descent serves only as an approximation of batch gradient descent.
However, it is much faster to compute, because we do not need to use the whole training
set to compute the gradient.
Implement training with stochastic gradient descent below.
Note: Because SGD only computes the gradient from one sample per iteration, the
magnitude of an SGD gradient will be n times smaller than the magnitude of a batch
gradient, where n is the number of data points. To mitigate this issue, make sure to
multiply the appropriate part of the gradient by n.
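Concretely, with that rescaling and the same regularized loss as in the batch case, one SGD iteration is (a sketch of one standard formulation):

$$\nabla_w L_j(w) = n\,(s_j - y_j)\,x_j + 2\lambda w, \qquad w \leftarrow w - \eta\, \nabla_w L_j(w)$$

where $j$ is drawn uniformly at random from the $n$ training points.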
# Q3.5
def stochastic_train(train_set, total_features, train_size, num_iter,
lr=1e-6, reg_const=0.1, decay=False):
'''
Learn a weight vector for the logistic regression classifier by
iterating over the data
and updating the weights using stochastic gradient descent

Inputs:
- train set: n x (d + 1)
- total features: d
- train_size: n
- # of training iterations
- learning rate
- l2 regularization constant
- decaying lr: boolean to determine whether or not we are
decaying the learning rate

Outputs:
- w: d x 1 weight vector
- list of losses: # iters x 1 vector
'''
# We want to keep track of the loss per iteration so that we can
plot it later
loss = np.zeros((num_iter+1,))

# Initialize variables
w = np.zeros((total_features,))
grad = np.zeros((total_features,))

### YOUR CODE HERE ###


s =
loss[0]=

original_lr = lr

for i in np.arange(num_iter):

if decay:
lr =

sample_index =
grad =
w =
s =

loss[i+1]=

if i % 500 == 0:
print("Loss at Iteration {} is {}:".format(i, loss[i+1]))

return w, loss

start_time = time.time()

w, loss = stochastic_train(train_set, total_features, train_size, num_iter, lr=1e-6)

end_time = time.time()
calc_time(start_time, end_time)

# Plotting cost vs. iterations

plt.plot(np.arange(num_iter+1), loss)
plt.xlabel('Number of iterations in training')
plt.ylabel('Cost at the end of training')
plt.title('Training loss vs. Number of iterations for Stochastic Gradient Descent')
plt.savefig('Wine_SGD.png')
plt.show()

# Checking on validation set

s_test = sigmoid(w, val_set[:, :total_features])
diffe = np.rint(s_test) - val_set[:, total_features]
accuracy = (np.true_divide(diffe.size - np.count_nonzero(diffe), val_size)) * 100
print(np.rint(s_test).sum())
print("SGD Validation Accuracy is %.2f%%" % (accuracy))

What do you notice about the speed of computation for the classifier trained on SGD vs
Batch Gradient Descent?
What do you notice about the accuracy?

Q3.6 SGD with a Decaying Learning Rate


Instead of using a constant step size (learning rate) in SGD, you could use a step size that
slowly shrinks from iteration to iteration. Run your SGD algorithm from question 3.5 with a
learning rate that decays in every iteration. The learning rate in every iteration should
decay as $lr = \frac{lr_0}{i+1}$, where $lr_0$ is the original learning rate. Begin with a
learning rate of 1e-4, and report your results.
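For example, with an initial learning rate of 1e-4, the first few step sizes under this schedule are (a quick illustrative check):

lr_0 = 1e-4
print([lr_0 / (i + 1) for i in range(4)])
# [0.0001, 5e-05, 3.3333333333333335e-05, 2.5e-05]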
start_time = time.time()
decay_w, decay_loss = stochastic_train(train_set, total_features, train_size, num_iter, lr=1e-4, decay=True)
end_time = time.time()

calc_time(start_time, end_time)

# Plotting cost vs. iterations

plt.plot(np.arange(num_iter+1), loss, "-r", label="no decay")
plt.plot(np.arange(num_iter+1), decay_loss, "-b", label="decay")
plt.xlabel('Number of iterations in training')
plt.ylabel('Cost at the end of training')
plt.legend(loc="upper left")
plt.title('SGD Training loss vs. iterations with decaying & const learning rate')
plt.savefig('Wine_SGD_combined.png')
plt.clf()
plt.close()

# Checking on validation set

s_test = sigmoid(w, val_set[:, :total_features])
ss_test = sigmoid(decay_w, val_set[:, :total_features])
diffe = np.rint(s_test) - val_set[:, total_features]
diffe_s = np.rint(ss_test) - val_set[:, total_features]
accuracy = (np.true_divide(diffe.size - np.count_nonzero(diffe), val_size)) * 100
accuracy_s = (np.true_divide(diffe_s.size - np.count_nonzero(diffe_s), val_size)) * 100
print("SGD Validation Accuracy (constant learning rate) is %.2f%%" % (accuracy))
print("SGD Validation Accuracy (decaying learning rate) is %.2f%%" % (accuracy_s))

Great Job Finishing! Here are some Key Takeaways from this Exercise:
1) Logistic Regression is a classifier that can predict between two classes. It can be
iteratively optimized via gradient descent.
2) Stochastic Gradient Descent runs computationally quicker than Batch Gradient Descent,
at the cost of some model performance.
3) One strategy to improve model training is to decay the learning rate of the optimizer
over time. While this can lead to good results, one must be careful when setting the decay
rate and the initial learning rate.
Wally is infinitely grateful for your help! Now that he has a wine classifier, he can pass
down the family business in peace :)
