DWM Exp 4


import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from imblearn.over_sampling import SMOTE
import matplotlib.pyplot as plt
import seaborn as sns

# Load the dataset
df = pd.read_csv('/content/MOCK_DATA.csv')

# Explore the dataset
print(df.head())

              contract      gender  Churn
0   Five-year contract  non-binary      0
1   Five-year contract  non-binary      0
2  Three-year contract      female      1
3    Two-year contract  non-binary      0
4   Five-year contract  non-binary      1

# Handle missing values (if needed)
df = df.dropna()

# Encode categorical variables as integers
# (fit_transform refits the encoder, so one instance can be reused per column)
label_encoder = LabelEncoder()
df['gender'] = label_encoder.fit_transform(df['gender'])
df['contract'] = label_encoder.fit_transform(df['contract'])
# You may need to encode other categorical columns as well.
print(df['contract'])

# Define features (X) and target variable (y)
X = df.drop(columns=['Churn'])
y = df['Churn']

0 0
1 0
2 3
3 4
4 0
..
995 1
996 0
997 3
998 4
999 2
Name: contract, Length: 1000, dtype: int64

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Apply SMOTE (Synthetic Minority Over-sampling Technique) to the training
# set to address class imbalance
smote = SMOTE(random_state=42)
X_resampled, y_resampled = smote.fit_resample(X_train, y_train)

# Train the Naïve Bayes classifier
nb_classifier = GaussianNB()
nb_classifier.fit(X_resampled, y_resampled)

GaussianNB()

y_pred = nb_classifier.predict(X_test)

# Calculate accuracy
accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy:.2f}')

# Create a confusion matrix
confusion = confusion_matrix(y_test, y_pred)
print('Confusion Matrix:')
print(confusion)

# Display classification report
print('Classification Report:')
print(classification_report(y_test, y_pred))

Accuracy: 0.52
Confusion Matrix:
[[36 65]
 [32 67]]
Classification Report:
              precision    recall  f1-score   support

           0       0.53      0.36      0.43       101
           1       0.51      0.68      0.58        99

    accuracy                           0.52       200
   macro avg       0.52      0.52      0.50       200
weighted avg       0.52      0.52      0.50       200

# Create a confusion matrix (seaborn was already imported above)
confusion = confusion_matrix(y_test, y_pred)

# Visualize the confusion matrix using Seaborn
plt.figure(figsize=(8, 6))
sns.set(font_scale=1.2)  # Adjust the font size if needed
sns.heatmap(confusion, annot=True, fmt='d', cmap='Blues', cbar=False,
            xticklabels=['Not Churn', 'Churn'], yticklabels=['Not Churn', 'Churn'])

plt.title('Confusion Matrix')
plt.xlabel('Predicted')
plt.ylabel('Actual')
plt.show()

[Output: Seaborn heatmap of the confusion matrix]
Rishi Kokil | D12C | 38 | DWM

Experiment No. 4

Aim
Implementation of the Naïve Bayes classification algorithm.

Theory
Bayes’ Theorem describes the probability of an event based on prior knowledge of
conditions that might be related to the event. In other words, Bayes’ Theorem is an
extension of conditional probability.
With the help of conditional probability, one can find the probability of X given H,
denoted by P(X | H). Bayes’ Theorem states that if we know the conditional probability
P(X | H), then we can find P(H | X), provided that P(X) and P(H) are already known to us.
Bayes’ Theorem is named after Thomas Bayes, who first used conditional probability to
provide an algorithm that uses evidence to calculate limits on an unknown parameter.
Bayes’ Theorem involves two types of probabilities:

Prior Probability [P(H)]
Posterior Probability [P(H | X)]

Where,
X – a data tuple.
H – some hypothesis.

1. Prior Probability
The prior probability is the probability of an event before any new data is collected. It
is the best logical estimate of the probability of an outcome based on the present
knowledge of the event, before any inspection is performed.

2. Posterior Probability
When new data or information is collected, the prior probability of an event is revised
to produce a more accurate measure of a possible outcome. This revised probability is the
posterior probability, calculated using Bayes’ Theorem. The posterior probability is thus
the probability of hypothesis H holding given that the evidence X has been observed.


Formula
Bayes’ Theorem can be mathematically represented by the equation below:

P(H | X) = P(X | H) · P(H) / P(X)

Where,
H and X are events and P(X) ≠ 0.
P(H | X) – conditional probability of H given that X occurs (the posterior probability).
P(X | H) – conditional probability of X given that H occurs.
P(H) and P(X) – prior probabilities of H and X occurring independently of each other;
P(X) is also called the marginal probability.
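
To make the formula concrete, here is a small worked example in Python; the spam-filtering numbers (2%, 40%, 5%) are illustrative assumptions, not figures from this experiment:

# Hypothetical numbers for a spam filter:
p_h = 0.02          # P(H): prior probability that an email is spam
p_x_given_h = 0.40  # P(X|H): probability the word "offer" appears, given spam
p_x = 0.05          # P(X): probability the word "offer" appears in any email

# Bayes' Theorem: P(H|X) = P(X|H) * P(H) / P(X)
p_h_given_x = p_x_given_h * p_h / p_x
print(f'{p_h_given_x:.2f}')  # 0.16 -> posterior probability the email is spam

Seeing the word raises the probability of spam from the 2% prior to a 16% posterior.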

The Naive Bayes classifier assumes that the presence (or absence) of a particular feature
of a class is unrelated to the presence (or absence) of any other feature. For example, a
fruit may be considered to be an apple if it is red, round, and about 4" in diameter. Even
though these features may depend on one another, a naive Bayes classifier treats all of
these properties as contributing independently to the probability that this fruit is an
apple.
An advantage of the naive Bayes classifier is that it requires only a small amount of
training data to estimate the parameters (the means and variances of the variables)
necessary for classification. Because the variables are assumed independent, only the
variances of the variables for each class need to be determined, not the entire
covariance matrix.
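
In scikit-learn’s GaussianNB, used in the code above, these per-class means and variances are available as attributes after fitting. A minimal sketch, assuming the fitted nb_classifier from the experiment and a recent scikit-learn version (where the variance attribute is named var_):

# Parameters estimated by GaussianNB during fit:
print(nb_classifier.theta_)  # per-class mean of each feature
print(nb_classifier.var_)    # per-class variance of each feature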

Naïve Bayesian Classifier Algorithm

Step 1: Convert the data set into a frequency table.
Step 2: Create a likelihood table by finding the probabilities.


Step 3: Use the Naive Bayesian equation to calculate the posterior probability for each
class. The class with the highest posterior probability is the outcome of the prediction.
A short sketch of these three steps follows.
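
As an illustration, here is a minimal sketch of the three steps in Python on a hypothetical toy table (the contract values echo the experiment's dataset, but the rows are made up and this is not the MOCK_DATA.csv used above):

import pandas as pd

# Hypothetical toy table
data = pd.DataFrame({
    'contract': ['Five-year', 'Five-year', 'Three-year', 'Two-year', 'Five-year'],
    'Churn':    [0, 0, 1, 0, 1],
})

# Step 1: frequency table of each feature value per class
freq = pd.crosstab(data['contract'], data['Churn'])

# Step 2: likelihood table P(feature value | class)
likelihood = freq / freq.sum(axis=0)

# Step 3: posterior is proportional to P(x | class) * P(class);
# the class with the largest posterior is the prediction
prior = data['Churn'].value_counts(normalize=True)
posterior = likelihood.loc['Five-year'] * prior
print(posterior.idxmax())  # predicted Churn class for a 'Five-year' contract -> 0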

Applications of Bayes’ Theorem

Bayes’ Theorem and Bayesian classification in data mining have a wide range of
applications in many fields, including statistics, machine learning, artificial
intelligence, natural language processing, medical diagnosis, and image and speech
recognition. Here are some examples of its applications:

Spam filtering - Bayes’ Theorem is commonly used in email spam filtering, where it helps
identify emails that are likely to be spam based on the text content and other features.
Medical diagnosis - Bayes’ Theorem can be used to diagnose medical conditions based on
observed symptoms, test results, and prior knowledge about the prevalence and
characteristics of the disease.
Risk assessment - Bayes’ Theorem can be used to assess the risk of events such as
accidents, natural disasters, or financial market fluctuations based on historical data
and other relevant factors.
Natural language processing - Bayes’ Theorem can be used for document classification,
sentiment analysis, and topic modeling.
Recommendation systems - Bayes’ Theorem can be used in recommendation systems, such as
e-commerce websites, to suggest products or services to users based on their previous
behavior and preferences.
Fraud detection - Bayes’ Theorem can be used to detect fraudulent behavior, such as
credit card or insurance fraud, by analyzing patterns of transactions and other data.


Conclusion
In this experiment, a Gaussian Naïve Bayes classifier was implemented in Python on a mock
churn dataset, using label encoding for the categorical features and SMOTE to address
class imbalance. The classifier achieved an accuracy of 52%, which was examined further
through a confusion matrix and a classification report. The experiment provided valuable
insight into Bayesian inference, which offers a robust and principled framework for
handling uncertainty and making informed decisions across a wide range of applications.
