classification_2_ex
R
# you may need the following packages for this exercise sheet:
library(mlr3)
library(mlr3learners)
library(ggplot2)
library(mlbench)
library(mlr3viz)
Python
# general
import numpy as np
import pandas as pd
from scipy.stats import norm
# plotting
import matplotlib.pyplot as plt
import seaborn as sns
# sklearn
from sklearn.naive_bayes import CategoricalNB # Naive Bayes classifier for categorical features
from sklearn.naive_bayes import GaussianNB # Naive Bayes classifier for normally distributed features
from sklearn.preprocessing import OrdinalEncoder
from sklearn.preprocessing import LabelEncoder
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis as LDA
from sklearn.discriminant_analysis import QuadraticDiscriminantAnalysis as QDA
from sklearn.inspection import DecisionBoundaryDisplay
from sklearn.metrics import confusion_matrix
from sklearn.metrics import precision_recall_fscore_support
Exercise 1: Naive Bayes
Learning goals
You are given the following table with the target variable Banana:
We want to use a Naive Bayes classifier to predict whether a new fruit is a Banana or not.
Estimate the posterior probability π̂(x∗) for a new observation x∗ = (yellow, round, imported).
How would you classify the object?
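As a sanity check for your hand calculation, here is a minimal sketch of the Naive Bayes posterior computation in Python. The prior and conditional probabilities below are placeholders for the relative frequencies you read off the table, not the exercise's actual values:

import numpy as np

# Placeholder estimates -- substitute the relative frequencies from the table
prior = {"yes": 0.5, "no": 0.5}          # P(Banana = k)
cond = {"yes": [0.8, 0.2, 0.3],          # P(yellow | k), P(round | k), P(imported | k)
        "no": [0.3, 0.6, 0.5]}

# Naive Bayes: posterior proportional to prior times product of feature likelihoods
unnorm = {k: prior[k] * np.prod(cond[k]) for k in prior}
posterior = {k: v / sum(unnorm.values()) for k, v in unnorm.items()}
print(posterior)  # classify x* as the class with the larger posterior probability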
Assume you have an additional feature Length that measures the length in cm. Describe in
1-2 sentences how you would handle this numeric feature with Naive Bayes.
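If you want to see this in code: a minimal sketch of the per-class Gaussian treatment (the assumption GaussianNB makes) using scipy.stats.norm, which is imported above. The Length values are made-up placeholders, not exercise data:

import numpy as np
from scipy.stats import norm

# Hypothetical Length measurements (cm) per class -- placeholders only
length_banana = np.array([17.0, 19.5, 21.0, 18.2])
length_other = np.array([7.0, 9.5, 8.1, 10.2])

# Fit a Gaussian per class: Length | Banana = k ~ N(mu_k, sigma_k^2)
mu_b, sd_b = length_banana.mean(), length_banana.std(ddof=1)
mu_o, sd_o = length_other.mean(), length_other.std(ddof=1)

# The density values enter the Naive Bayes product like any categorical factor
x_new = 18.0
print(norm.pdf(x_new, mu_b, sd_b), norm.pdf(x_new, mu_o, sd_o))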
Exercise 2: Discriminant analysis
Learning goals
[Scatter plot of the data set: feature x on the horizontal axis (roughly 0 to 8), target y on the vertical axis (roughly 2.0 to 4.0).]
The above plot shows 𝒟 = ((x⁽¹⁾, 𝑦⁽¹⁾), …, (x⁽ⁿ⁾, 𝑦⁽ⁿ⁾)), a data set with 𝑛 = 200 observations of a continuous target variable 𝑦 and a continuous, 1-dimensional feature variable x. In the following, we aim to predict 𝑦 with a machine learning model that takes x as input.
To prepare the data for classification, we categorize the target variable 𝑦 into 3 classes and call the transformed target variable 𝑧, as follows:
Estimate the class means 𝜇ₖ = 𝔼(x | 𝑧 = 𝑘) for each of the three classes 𝑘 ∈ {1, 2, 3} visually from the plot. Do not overcomplicate this; a rough estimate is sufficient here.
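If you would like to cross-check your visual estimates, the empirical class means can be computed directly; a sketch with placeholder data standing in for the exercise's x and z:

import numpy as np

rng = np.random.default_rng(0)
# Placeholder data -- not the data set from the plot
x = np.concatenate([rng.normal(m, 1.0, 50) for m in (1.0, 4.0, 7.0)])
z = np.repeat([1, 2, 3], 50)

for k in (1, 2, 3):
    print(k, x[z == k].mean())  # empirical estimate of mu_k = E(x | z = k)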
Make a plot that visualizes the different estimated densities per class.
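One way to do this in Python is to plug per-class mean and standard deviation estimates into normal densities, which is exactly the class-conditional model QDA assumes for a single feature. The parameter values below are placeholders, to be replaced by your own estimates:

import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

# Placeholder per-class estimates (mu_k, sigma_k) -- substitute your own
params = {1: (1.0, 0.5), 2: (4.0, 1.0), 3: (7.0, 1.5)}

grid = np.linspace(-2, 12, 400)
for k, (mu, sd) in params.items():
    plt.plot(grid, norm.pdf(grid, mu, sd), label=f"class {k}")
plt.xlabel("x")
plt.ylabel("estimated density")
plt.legend()
plt.show()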
How would your plot from ii) change if we used linear discriminant analysis (LDA) instead of
QDA? Explain your answer.
Why is QDA preferable to LDA for this data?
You are given two new observations x∗₁ = −10 and x∗₂ = 7. Assuming roughly equal class sizes, state the prediction for QDA and explain how you arrive at it.
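To check your answer programmatically, you could fit sklearn's QDA (imported above) and predict the two points; the training data here is only a placeholder for the exercise's (x, z):

import numpy as np
from sklearn.discriminant_analysis import QuadraticDiscriminantAnalysis as QDA

rng = np.random.default_rng(0)
# Placeholder training data -- not the exercise's data set
x = np.concatenate([rng.normal(m, s, 50) for m, s in ((1, 0.5), (4, 1.0), (7, 1.5))])
z = np.repeat([1, 2, 3], 50)

qda = QDA().fit(x.reshape(-1, 1), z)   # sklearn expects a 2D feature matrix
print(qda.predict([[-10.0], [7.0]]))   # predictions for x*_1 and x*_2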
Exercise 3: Decision boundaries
Learning goals
We will now visualize how well different learners classify the three-class mlbench::mlbench.cassini
data set.
Plot the learners’ decision boundaries. Can you spot differences in separation ability?
(Note that logistic regression cannot handle more than two classes and is therefore not listed
here.)
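If you prefer to reproduce the comparison in Python rather than mlr3: the cassini data is not shipped with sklearn, so the sketch below uses make_blobs as a three-class stand-in and DecisionBoundaryDisplay (imported above) to draw each learner's regions.

import matplotlib.pyplot as plt
from sklearn.datasets import make_blobs
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis as LDA
from sklearn.discriminant_analysis import QuadraticDiscriminantAnalysis as QDA
from sklearn.inspection import DecisionBoundaryDisplay
from sklearn.naive_bayes import GaussianNB

# Three-class stand-in for mlbench.cassini (placeholder data)
X, y = make_blobs(n_samples=300, centers=3, random_state=0)

fig, axes = plt.subplots(1, 3, figsize=(12, 4))
for ax, clf in zip(axes, (LDA(), QDA(), GaussianNB())):
    clf.fit(X, y)
    DecisionBoundaryDisplay.from_estimator(clf, X, ax=ax, alpha=0.4)
    ax.scatter(X[:, 0], X[:, 1], c=y, s=10)
    ax.set_title(type(clf).__name__)
plt.show()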