ML Hota Assign4

This document discusses using a Gaussian Naive Bayes classifier to predict student grades and classify iris flowers. It provides code snippets and instructions to split datasets into train and test, encode variables, train Gaussian NB models, calculate accuracy, and compare performance to other algorithms. Key tasks are to encode data, split datasets, train models on two datasets and evaluate performance by calculating metrics like accuracy and confusion matrices.

Uploaded by

f20211088

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

ML Hota Assign4

Uploaded by

f20211088

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Birla Institute of Technology and Science Pilani, Hyderabad Campus

2nd Semester 2023-24, BITS F464: Machine Learning

Assignment No: 4 (Gaussian Naïve Bayes Generative Model)
------------------------------------------------------------------------------------------------------------------------------------
Date Given: 26.03.2024 Max. Marks: 5 Submission date: 05.04.2024
The Naive Bayes Generative Classifier is a widely-used algorithm in machine learning, it operates on
the principles of Bayes' theorem and assumes independence among features, allowing it to make
predictions quickly and with minimal computational resources. It predicts the class for a given
instance based on the class probabilities computed using Bayes’ theorem. After calculating the
posterior probability of each class given the instance's features, the algorithm selects the class with
the highest probability as the predicted class for that instance. In other words, it assigns the class that
maximizes the posterior probability. As we discussed in the class, the maximum a posteriori (MAP)
that selects the best hypothesis for Naïve Bayes classifier is as given below:

Equation (1)

In Gaussian Naive Bayes, the likelihood P(xi∣y) is often modelled using a Gaussian (normal)
distribution. The probability density function (PDF) of a Gaussian distribution is:

Equation (2)

In practice, the parameters μ and σ2 are estimated from the training data for each feature xi and each
class y. Then, during classification, these parameters are used to compute the likelihood P(xi∣y) for
each feature given each class.

Your task in this assignment is to experiment with Gaussian NaiveBayes algorithm for the grading
file attached here (Data-NB.xlsx). Grading is based on the test scores. Below are the code snippets in
Scikit learn to import the classifier and other required libraries:
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score

To get first few records of the Pandas DataFrame, use the following:

(Page 1 of 3)
The output in your notebook should be as below:

The data distribution of the given xlsx file (Grading) is Gaussian as discussed in the class. Plot the
below pattern (Fig.1) to visualize it in your Python code. The Gaussian Naïve Bayes Classifier’s
performance metric for an 80-20 rule is as shown below (Fig.2).

(Fig.1)

(Fig.2)

The second part of this assignment is to classify flowers using iris.csv data file that is also attached with this
assignment using GaussianNB. There are 150 records in this file, and plot the flowers using
matplotlib.imshow method to view the flowers as shown below:

(Page 2 of 3)
Each record has features as Sepal length, Sepal width, Petal length, Petal width, and Species
(Categorical feature: Setosa, Versicolor, and Virginica). You may also import the in-built iris dataset
from sklearn learn as below:

The classification report is as given below with a prediction accuracy of 97% and other related metrics.

Complete the following tasks:

o In the Data-NB.xlsx file, few attributes like Gender, Attendance and Grade columns are nominal
variables and require encoding before the model training. Use appropriate encoding.
o Split the dataset into training and testing subsets (80-20 or 70-30). Train a Gaussian Naive Bayes
classifier on the training data and predict the grades in the test data. Calculate the accuracy and the
confusion matrix to assess the classifier's performance.
o Split the Iris dataset (iris.csv) into training and testing subsets, followed by training a Gaussian
Naive Bayes classifier on the training data. Evaluate the classifier's performance by plotting the
classification report as shown above.
o Compare and contrast the performance of the Naïve Bayes classifier built in this assignment with that of
Random Forest and Gradient Boosted Trees (developed in Assignment 2) on the identical datasets. Analyse
the reasons behind any observed differences in their performances.
o For both the datasets visualize the correlation matrix to check and verify the assumptions of Naïve
Bayes algorithm.
Submission Instructions: Same as that of earlier assignments. Any clarification on this coding
assignment may be emailed to I/C or Paryetri Banerjee ([email protected]) or
Anish Shandilya ([email protected]).

References: 1. https://scikit-learn.org/stable/modules/naive_bayes.html
2. https://towardsdatascience.com/the-naive-bayes-classifier-how-it-works-e229e7970b8

(Page 3 of 3) ---------------------------- ~ --------------------------

History of Artificial Intelligence
No ratings yet
History of Artificial Intelligence
5 pages
Naïve Bayes
No ratings yet
Naïve Bayes
15 pages
Homework3 Sol
No ratings yet
Homework3 Sol
5 pages
Best Practices For Managing Grocery Retail Supply Chains RELEX PDF
No ratings yet
Best Practices For Managing Grocery Retail Supply Chains RELEX PDF
66 pages
Practical-3 Ritesh
No ratings yet
Practical-3 Ritesh
5 pages
Pgm5 With Output
No ratings yet
Pgm5 With Output
13 pages
AML_4_non_evaluative_assignment_b6a8ba2cf711baa588629112ee1622ee
No ratings yet
AML_4_non_evaluative_assignment_b6a8ba2cf711baa588629112ee1622ee
1 page
Wa0001
No ratings yet
Wa0001
39 pages
Naive Bayes
No ratings yet
Naive Bayes
11 pages
Exp 3 Bi 30
No ratings yet
Exp 3 Bi 30
7 pages
DWM Exp5 C49
No ratings yet
DWM Exp5 C49
12 pages
Naïve Bayes Classifier Algorithm
No ratings yet
Naïve Bayes Classifier Algorithm
11 pages
Exp 4
No ratings yet
Exp 4
3 pages
Unit 2 AAM
No ratings yet
Unit 2 AAM
32 pages
Naive Bayes Classifiers - Parta
No ratings yet
Naive Bayes Classifiers - Parta
17 pages
Data Mining - Bayesian Classification
No ratings yet
Data Mining - Bayesian Classification
6 pages
Naive Biase
No ratings yet
Naive Biase
6 pages
ASSIGNMENT 3 - Probabilistic Models, GBDT, SVM
No ratings yet
ASSIGNMENT 3 - Probabilistic Models, GBDT, SVM
3 pages
ML Lab Experiments (1) - Pages-3
No ratings yet
ML Lab Experiments (1) - Pages-3
11 pages
Ass 6 DSBDL
No ratings yet
Ass 6 DSBDL
6 pages
5 ML NaiveBayes
No ratings yet
5 ML NaiveBayes
45 pages
LM3 - Naive Bayes Model
No ratings yet
LM3 - Naive Bayes Model
21 pages
2 Naive Bayes
No ratings yet
2 Naive Bayes
49 pages
Experiment No 6
No ratings yet
Experiment No 6
3 pages
Performance Comparison and Implementation of Bayesian Variants For Network Intrusion Detection
No ratings yet
Performance Comparison and Implementation of Bayesian Variants For Network Intrusion Detection
5 pages
Assignment No 2
No ratings yet
Assignment No 2
5 pages
Lecture 5 Bayesian Classification
No ratings yet
Lecture 5 Bayesian Classification
16 pages
Quantitative Methods Module 1
No ratings yet
Quantitative Methods Module 1
24 pages
CSL0777 L24
No ratings yet
CSL0777 L24
38 pages
ECE304_SP25_Assgn-1
No ratings yet
ECE304_SP25_Assgn-1
1 page
AI and ML Lab Manual
No ratings yet
AI and ML Lab Manual
29 pages
Assignment - 01
No ratings yet
Assignment - 01
4 pages
Naive-By
No ratings yet
Naive-By
23 pages
Ai 5
No ratings yet
Ai 5
7 pages
NBayes Log Reg
No ratings yet
NBayes Log Reg
18 pages
Generative and Discriminative Classifiers: Naive Bayes and Logistic Regression
No ratings yet
Generative and Discriminative Classifiers: Naive Bayes and Logistic Regression
17 pages
Naive Bayes Algorithm
No ratings yet
Naive Bayes Algorithm
46 pages
07_Naive_Bayes
No ratings yet
07_Naive_Bayes
6 pages
Mllabprog 5
No ratings yet
Mllabprog 5
6 pages
ML Lab1 pgm
No ratings yet
ML Lab1 pgm
4 pages
Naive_Bayes (1)
No ratings yet
Naive_Bayes (1)
4 pages
Supervised Classification 3601
No ratings yet
Supervised Classification 3601
39 pages
Prac4 AAM
No ratings yet
Prac4 AAM
2 pages
DM Lab Cycle 6 1
No ratings yet
DM Lab Cycle 6 1
5 pages
Lecture10 - Bayesian Classifier
No ratings yet
Lecture10 - Bayesian Classifier
40 pages
Part A Assignment 6
No ratings yet
Part A Assignment 6
2 pages
EX - No:6 Naive Bayesian Classifier
No ratings yet
EX - No:6 Naive Bayesian Classifier
2 pages
Exp 3 Bi
No ratings yet
Exp 3 Bi
12 pages
Data Mining - Module 7
No ratings yet
Data Mining - Module 7
8 pages
Bayesian Learning
No ratings yet
Bayesian Learning
58 pages
Naive Bayes
No ratings yet
Naive Bayes
9 pages
K - Nearest Neighbours Classifier / Regressor
No ratings yet
K - Nearest Neighbours Classifier / Regressor
35 pages
Naive Bayes Classifier in Machine Learning
No ratings yet
Naive Bayes Classifier in Machine Learning
16 pages
L10-Naive Bayes Continuous
No ratings yet
L10-Naive Bayes Continuous
16 pages
L6 - SLM Notes (Bayes Algorithm)
No ratings yet
L6 - SLM Notes (Bayes Algorithm)
28 pages
Unit 5 - Machine Learning - WWW - Rgpvnotes.in
No ratings yet
Unit 5 - Machine Learning - WWW - Rgpvnotes.in
12 pages
I239-5 Naive Bayes
No ratings yet
I239-5 Naive Bayes
35 pages
EXP-10
No ratings yet
EXP-10
9 pages
Exp-6
No ratings yet
Exp-6
5 pages
Naive Bayes Classifier: Fundamentals and Applications
From Everand
Naive Bayes Classifier: Fundamentals and Applications
Fouad Sabry
No ratings yet
Alternating Decision Tree: Fundamentals and Applications
From Everand
Alternating Decision Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Broad Agency Announcement Assured Neuro Symbolic Learning and Reasoning (ANSR) Information Innovation Office HR001122S0039 June 1, 2022
No ratings yet
Broad Agency Announcement Assured Neuro Symbolic Learning and Reasoning (ANSR) Information Innovation Office HR001122S0039 June 1, 2022
48 pages
Stress Detection Using Deep Neural Networks
No ratings yet
Stress Detection Using Deep Neural Networks
11 pages
Paper - Andrew NG - Statement
No ratings yet
Paper - Andrew NG - Statement
6 pages
The AI Economy Free Summary by Roger Bootle
No ratings yet
The AI Economy Free Summary by Roger Bootle
14 pages
Deep Learning For Power System Applications Case Studies Linking Artificial Intelligence And Power Systems Fangxing Li instant download
100% (1)
Deep Learning For Power System Applications Case Studies Linking Artificial Intelligence And Power Systems Fangxing Li instant download
39 pages
24년 10월 고1 교육청 모의고사 변형 (18~45번) 1차 검수 완료_241218_15_241218_164612
No ratings yet
24년 10월 고1 교육청 모의고사 변형 (18~45번) 1차 검수 완료_241218_15_241218_164612
29 pages
A SURVEY ON MACHINE LEARNING ALGORITHMS TECHNIQUES AND
No ratings yet
A SURVEY ON MACHINE LEARNING ALGORITHMS TECHNIQUES AND
6 pages
CMRIT B.tech Minor Honors Courses Regulations Syllabus
No ratings yet
CMRIT B.tech Minor Honors Courses Regulations Syllabus
75 pages
Deep Learning Rooted Potential Piloted RRT For Expeditious Path Planning
No ratings yet
Deep Learning Rooted Potential Piloted RRT For Expeditious Path Planning
8 pages
Perfect Crowd Counting Presentation
No ratings yet
Perfect Crowd Counting Presentation
13 pages
Three Types of Innovation
No ratings yet
Three Types of Innovation
1 page
1.applied Machine Learning in Health Care 1 New
No ratings yet
1.applied Machine Learning in Health Care 1 New
11 pages
CBAM: Convolutional Block Attention Module
100% (1)
CBAM: Convolutional Block Attention Module
17 pages
AI On Azure Custom Vision and Cognitive Services
No ratings yet
AI On Azure Custom Vision and Cognitive Services
60 pages
A Network Analysis of Cross-Occupational Skill Transferability For The Hospitality Industry
No ratings yet
A Network Analysis of Cross-Occupational Skill Transferability For The Hospitality Industry
22 pages
Plant Disease Detection Using Machine Learning
No ratings yet
Plant Disease Detection Using Machine Learning
16 pages
Ultimate Python Guide (2024)
100% (1)
Ultimate Python Guide (2024)
715 pages
Virtual Try On Documentation
No ratings yet
Virtual Try On Documentation
60 pages
AI Subfields
No ratings yet
AI Subfields
18 pages
Alisha Industrial Report
No ratings yet
Alisha Industrial Report
34 pages
VIBE IT Synopsis
No ratings yet
VIBE IT Synopsis
9 pages
IOE UNIT 1
No ratings yet
IOE UNIT 1
12 pages
短篇研究论文
100% (1)
短篇研究论文
7 pages
01ce0715 - Machine Learning
No ratings yet
01ce0715 - Machine Learning
4 pages
Mlops: 5 Steps To Operationalize Machine Learning Models
No ratings yet
Mlops: 5 Steps To Operationalize Machine Learning Models
17 pages
Object Detection With Deep Learning: A Review
No ratings yet
Object Detection With Deep Learning: A Review
21 pages
Areas of Artificial Intelligence Control in Chemical Process Industries
No ratings yet
Areas of Artificial Intelligence Control in Chemical Process Industries
3 pages
Artificial Intelligence, Machine Learning, and Deep Learning Applications in Smart and Sustainable Industry Transformation
No ratings yet
Artificial Intelligence, Machine Learning, and Deep Learning Applications in Smart and Sustainable Industry Transformation
25 pages

ML Hota Assign4

Uploaded by

ML Hota Assign4

Uploaded by

Birla Institute of Technology and Science Pilani, Hyderabad Campus

2nd Semester 2023-24, BITS F464: Machine Learning

Complete the following tasks:

(Page 3 of 3) ---------------------------- ~ --------------------------

You might also like