
Vidul Garg

Project Title:
Grapes to Greatness: Machine Learning in Wine Quality Prediction

Description:
Predicting wine quality using machine learning is a common and valuable application in the field
of data science and analytics. Wine quality prediction involves building a model that can assess
and predict the quality of a wine based on various input features, such as chemical composition,
sensory characteristics, and environmental factors.

Tasks:
Load the dataset, preprocess the data (including visualization), build machine learning models, evaluate the models, and test them with random observations.

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.preprocessing import MinMaxScaler
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.metrics import classification_report

df=pd.read_csv(r"D:\MachineLearning\DataScienceCourse\winequality-red.csv")
df

      fixed acidity  volatile acidity  citric acid  residual sugar  chlorides  \
0               7.4             0.700         0.00             1.9      0.076
1               7.8             0.880         0.00             2.6      0.098
2               7.8             0.760         0.04             2.3      0.092
3              11.2             0.280         0.56             1.9      0.075
4               7.4             0.700         0.00             1.9      0.076
...             ...               ...          ...             ...        ...
1594            6.2             0.600         0.08             2.0      0.090
1595            5.9             0.550         0.10             2.2      0.062
1596            6.3             0.510         0.13             2.3      0.076
1597            5.9             0.645         0.12             2.0      0.075
1598            6.0             0.310         0.47             3.6      0.067

      free sulfur dioxide  total sulfur dioxide  density    pH  sulphates  \
0                    11.0                  34.0  0.99780  3.51       0.56
1                    25.0                  67.0  0.99680  3.20       0.68
2                    15.0                  54.0  0.99700  3.26       0.65
3                    17.0                  60.0  0.99800  3.16       0.58
4                    11.0                  34.0  0.99780  3.51       0.56
...                   ...                   ...      ...   ...        ...
1594                 32.0                  44.0  0.99490  3.45       0.58
1595                 39.0                  51.0  0.99512  3.52       0.76
1596                 29.0                  40.0  0.99574  3.42       0.75
1597                 32.0                  44.0  0.99547  3.57       0.71
1598                 18.0                  42.0  0.99549  3.39       0.66

      alcohol  quality
0         9.4        5
1         9.8        5
2         9.8        5
3         9.8        6
4         9.4        5
...       ...      ...
1594     10.5        5
1595     11.2        6
1596     11.0        6
1597     10.2        5
1598     11.0        6

[1599 rows x 12 columns]


df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1599 entries, 0 to 1598
Data columns (total 12 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 fixed acidity 1599 non-null float64
1 volatile acidity 1599 non-null float64
2 citric acid 1599 non-null float64
3 residual sugar 1599 non-null float64
4 chlorides 1599 non-null float64
5 free sulfur dioxide 1599 non-null float64
6 total sulfur dioxide 1599 non-null float64
7 density 1599 non-null float64
8 pH 1599 non-null float64
9 sulphates 1599 non-null float64
10 alcohol 1599 non-null float64
11 quality 1599 non-null int64
dtypes: float64(11), int64(1)
memory usage: 150.0 KB

Checking null values


df.isnull().sum()

fixed acidity 0
volatile acidity 0
citric acid 0
residual sugar 0
chlorides 0
free sulfur dioxide 0
total sulfur dioxide 0
density 0
pH 0
sulphates 0
alcohol 0
quality 0
dtype: int64

df.describe()

       fixed acidity  volatile acidity  citric acid  residual sugar  \
count    1599.000000       1599.000000  1599.000000     1599.000000
mean        8.319637          0.527821     0.270976        2.538806
std         1.741096          0.179060     0.194801        1.409928
min         4.600000          0.120000     0.000000        0.900000
25%         7.100000          0.390000     0.090000        1.900000
50%         7.900000          0.520000     0.260000        2.200000
75%         9.200000          0.640000     0.420000        2.600000
max        15.900000          1.580000     1.000000       15.500000

         chlorides  free sulfur dioxide  total sulfur dioxide      density  \
count  1599.000000          1599.000000           1599.000000  1599.000000
mean      0.087467            15.874922             46.467792     0.996747
std       0.047065            10.460157             32.895324     0.001887
min       0.012000             1.000000              6.000000     0.990070
25%       0.070000             7.000000             22.000000     0.995600
50%       0.079000            14.000000             38.000000     0.996750
75%       0.090000            21.000000             62.000000     0.997835
max       0.611000            72.000000            289.000000     1.003690

                pH    sulphates      alcohol      quality
count  1599.000000  1599.000000  1599.000000  1599.000000
mean      3.311113     0.658149    10.422983     5.636023
std       0.154386     0.169507     1.065668     0.807569
min       2.740000     0.330000     8.400000     3.000000
25%       3.210000     0.550000     9.500000     5.000000
50%       3.310000     0.620000    10.200000     6.000000
75%       3.400000     0.730000    11.100000     6.000000
max       4.010000     2.000000    14.900000     8.000000

Data Visualization
plt.figure(figsize=(5,3))
df["quality"].value_counts().plot(kind='bar')
plt.xticks(rotation=0)

[Bar chart of quality value counts; categories in frequency order: 5, 6, 7, 4, 8, 3]
Wines with quality 5 and 6 are by far the most common, so the classes are heavily imbalanced.
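
Given the imbalance, it helps to look at the class proportions directly. A minimal sketch (an addition, not part of the recorded run):

# Each quality grade as a fraction of all 1599 wines
print(df["quality"].value_counts(normalize=True).round(3))

Passing stratify=y to train_test_split later on would preserve these proportions in both the training and test sets.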

plt.figure(figsize=(8,8))
l=["fixed acidity","volatile acidity","citric acid","residual sugar",
   "chlorides","free sulfur dioxide","total sulfur dioxide",
   "density","pH","sulphates","alcohol"]
for i in l:
    plt.subplot(4, 3, l.index(i) + 1)  # 4 rows, 3 columns
    sns.barplot(x=df["quality"],y=df[i])
plt.tight_layout()

# sns.barplot(x=df["quality"],y=df["alcohol"])
Correlation Check
plt.figure(figsize=(12, 8))
cor=df.corr()
sns.heatmap(cor,annot=True)

[Heatmap of the feature correlation matrix]
As the heatmap shows, no feature pair is correlated strongly enough to warrant dropping a feature.
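
To read the heatmap numerically, the correlations with the target can be ranked. A short sketch (an addition to the original run) using the cor matrix computed above:

# Rank features by their correlation with quality; alcohol is the strongest positive
print(cor["quality"].drop("quality").sort_values(ascending=False))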

Checking outliers
sns.boxplot(data=df, orient='h')  # 'orient' set to 'h' for horizontal box plots

plt.xlabel('Values')
plt.title('Box Plot of All Columns')

[Box plot of all columns, before outlier capping]


The box plots reveal many outliers across the features, so they are capped next.

l1=["fixed acidity","volatile acidity","citric acid","residual


sugar","chlorides","free sulfur dioxide","total sulfur
dioxide","density","pH","sulphates","alcohol"]
for i in l:
q1=df[i].quantile(0.25)
q3=df[i].quantile(0.75)
iqr=q3-q1
upperL=q3+1.5*iqr
lowerL=q1-1.5*iqr

df[i]=np.where(df[i]>upperL,upperL,np.where(df[i]<lowerL,lowerL,df[i])
)
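
The same capping can be written as one vectorized call. A sketch equivalent to the loop above, assuming the column list l defined earlier:

# Vectorized IQR capping with DataFrame.clip; axis=1 aligns the fences per column
q1, q3 = df[l].quantile(0.25), df[l].quantile(0.75)
iqr = q3 - q1
df[l] = df[l].clip(lower=q1 - 1.5*iqr, upper=q3 + 1.5*iqr, axis=1)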

sns.boxplot(data=df, orient='h')  # re-check the box plots after capping

plt.xlabel('Values')
plt.title('Box Plot of All Columns')

[Box plot of all columns, after outlier capping]


df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1599 entries, 0 to 1598
Data columns (total 12 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 fixed acidity 1599 non-null float64
1 volatile acidity 1599 non-null float64
2 citric acid 1599 non-null float64
3 residual sugar 1599 non-null float64
4 chlorides 1599 non-null float64
5 free sulfur dioxide 1599 non-null float64
6 total sulfur dioxide 1599 non-null float64
7 density 1599 non-null float64
8 pH 1599 non-null float64
9 sulphates 1599 non-null float64
10 alcohol 1599 non-null float64
11 quality 1599 non-null int64
dtypes: float64(11), int64(1)
memory usage: 150.0 KB
Splitting the data into dependent and independent variables
x=df.iloc[:,:11]
y=df.iloc[:,-1]
x.info()
y.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 1599 entries, 0 to 1598
Data columns (total 11 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 fixed acidity 1599 non-null float64
1 volatile acidity 1599 non-null float64
2 citric acid 1599 non-null float64
3 residual sugar 1599 non-null float64
4 chlorides 1599 non-null float64
5 free sulfur dioxide 1599 non-null float64
6 total sulfur dioxide 1599 non-null float64
7 density 1599 non-null float64
8 pH 1599 non-null float64
9 sulphates 1599 non-null float64
10 alcohol 1599 non-null float64
dtypes: float64(11)
memory usage: 137.5 KB
<class 'pandas.core.series.Series'>
RangeIndex: 1599 entries, 0 to 1598
Series name: quality
Non-Null Count Dtype
-------------- -----
1599 non-null int64
dtypes: int64(1)
memory usage: 12.6 KB

Train-Test Split

x_train,x_test,y_train,y_test=train_test_split(x,y,test_size=0.2,random_state=32)
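
One gap worth flagging: MinMaxScaler is imported but never applied, yet KNN is distance-based, so features on large scales (e.g. total sulfur dioxide) dominate the distance metric. A sketch of how scaling could be slotted in, fit on the training set only to avoid leakage (not part of the recorded run, so the accuracies below are for unscaled data):

# Scale features to [0, 1]; fit on train only so test information does not leak
scaler = MinMaxScaler()
x_train_scaled = scaler.fit_transform(x_train)
x_test_scaled = scaler.transform(x_test)
# A scaled KNN would then be:
# KNeighborsClassifier(n_neighbors=3).fit(x_train_scaled, y_train)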

Model Training
KNN Classifier
model1=KNeighborsClassifier(n_neighbors=3)
model1.fit(x_train, y_train)
y_pred1 = model1.predict(x_test)
print(classification_report(y_test, y_pred1))
print(confusion_matrix(y_test,y_pred1))

              precision    recall  f1-score   support

           3       0.00      0.00      0.00         1
           4       0.00      0.00      0.00         8
           5       0.46      0.58      0.51       120
           6       0.54      0.44      0.48       146
           7       0.38      0.30      0.33        40
           8       0.00      0.00      0.00         5

    accuracy                           0.46       320
   macro avg       0.23      0.22      0.22       320
weighted avg       0.47      0.46      0.46       320

[[ 0  0  1  0  0  0]
 [ 2  0  3  2  1  0]
 [ 0  6 70 37  7  0]
 [ 1  9 62 64 10  0]
 [ 0  0 15 13 12  0]
 [ 0  0  1  2  2  0]]

C:\Users\Vidul\AppData\Local\Programs\Python\Python311\Lib\site-packages\sklearn\metrics\_classification.py:1469: UndefinedMetricWarning: Precision and F-score are ill-defined and being set to 0.0 in labels with no predicted samples. Use `zero_division` parameter to control this behavior.
  _warn_prf(average, modifier, msg_start, len(result))
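
The warning fires whenever a class receives no predicted samples (class 8 here), which leaves its precision undefined. Passing zero_division=0, supported since scikit-learn 0.22, makes the zeros explicit and silences it:

# Declare the intended behavior for undefined precision instead of warning
print(classification_report(y_test, y_pred1, zero_division=0))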

Logistic Regression
model2=LogisticRegression(max_iter=5000)
model2.fit(x_train, y_train)
y_pred2 = model2.predict(x_test)
print(classification_report(y_test, y_pred2))
print(confusion_matrix(y_test,y_pred2))

              precision    recall  f1-score   support

           3       0.00      0.00      0.00         1
           4       0.00      0.00      0.00         8
           5       0.60      0.77      0.67       120
           6       0.55      0.56      0.56       146
           7       0.41      0.17      0.25        40
           8       0.00      0.00      0.00         5

    accuracy                           0.57       320
   macro avg       0.26      0.25      0.25       320
weighted avg       0.53      0.57      0.54       320

[[ 0  0  1  0  0  0]
 [ 0  0  2  6  0  0]
 [ 0  0 92 28  0  0]
 [ 0  0 57 82  7  0]
 [ 0  0  2 31  7  0]
 [ 0  0  0  2  3  0]]

C:\Users\Vidul\AppData\Local\Programs\Python\Python311\Lib\site-packages\sklearn\metrics\_classification.py:1469: UndefinedMetricWarning: Precision and F-score are ill-defined and being set to 0.0 in labels with no predicted samples. Use `zero_division` parameter to control this behavior.
  _warn_prf(average, modifier, msg_start, len(result))

Decision Tree Classifier
model3=DecisionTreeClassifier()
model3.fit(x_train, y_train)
y_pred3 = model3.predict(x_test)
print(classification_report(y_test, y_pred3))
print(confusion_matrix(y_test,y_pred3))

              precision    recall  f1-score   support

           3       0.00      0.00      0.00         1
           4       0.12      0.12      0.12         8
           5       0.62      0.75      0.68       120
           6       0.66      0.53      0.59       146
           7       0.36      0.40      0.38        40
           8       0.20      0.20      0.20         5

    accuracy                           0.58       320
   macro avg       0.33      0.33      0.33       320
weighted avg       0.59      0.58      0.58       320

[[ 0  0  0  1  0  0]
 [ 1  1  3  2  1  0]
 [ 1  4 90 22  3  0]
 [ 0  3 45 77 21  0]
 [ 0  0  7 13 16  4]
 [ 0  0  0  1  3  1]]
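
An unconstrained DecisionTreeClassifier grows until it fits the training data and breaks ties randomly, so results can vary between runs. A sketch of a more controlled variant (the name model3_tuned and max_depth=6 are illustrative choices, not tuned values):

# Fixing the seed gives reproducible trees; capping depth curbs overfitting
model3_tuned = DecisionTreeClassifier(random_state=32, max_depth=6)
model3_tuned.fit(x_train, y_train)
print(accuracy_score(y_test, model3_tuned.predict(x_test)))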

Accuracy Check
print("KNN Classifier Accuracy:", accuracy_score(y_test, y_pred1)*100)
print("Logistic Regression Accuracy:", accuracy_score(y_test, y_pred2)*100)
print("Decision Tree Accuracy:", accuracy_score(y_test, y_pred3)*100)

KNN Classifier Accuracy: 45.625
Logistic Regression Accuracy: 56.56250000000001
Decision Tree Accuracy: 57.8125
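
These numbers come from a single 80/20 split, so the ranking can shift with a different random_state. A sketch of a less split-dependent comparison using 5-fold cross-validation (an addition to the original workflow):

from sklearn.model_selection import cross_val_score

# Mean and spread of accuracy over 5 folds for each model
for name, m in [("KNN", model1), ("LogReg", model2), ("Tree", model3)]:
    scores = cross_val_score(m, x, y, cv=5, scoring="accuracy")
    print(name, round(scores.mean(), 3), "+/-", round(scores.std(), 3))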

Predicting with random values
sample_check=[[6.5, 0.6, 0.3, 2.2, 0.07, 15.0, 40.0, 0.996, 3.4, 0.6, 9.5],
              [8.0, 0.4, 0.4, 2.8, 0.085, 22.0, 55.0, 0.998, 3.2, 0.55, 11.2],
              [6.8, 0.55, 0.15, 2.4, 0.075, 25.0, 62.0, 0.9962, 3.1, 0.75, 9.0],
              [7.5, 0.45, 0.35, 2.5, 0.09, 30.0, 70.0, 0.9978, 3.5, 0.6, 11.5],
              [7.0, 0.5, 0.2, 2.5, 0.08, 20.0, 60.0, 0.997, 3.3, 0.7, 10.0]]

# A predicted quality of 6 or above is labelled "Good"
for i in sample_check:
    pred=model2.predict([i])
    if(pred>=6):
        print(pred, "--> Good")
    else:
        print(pred, "--> Not Good")

[5] --> Not Good
[6] --> Good
[5] --> Not Good
[6] --> Good
[5] --> Not Good

C:\Users\Vidul\AppData\Local\Programs\Python\Python311\Lib\site-packages\sklearn\base.py:464: UserWarning: X does not have valid feature names, but LogisticRegression was fitted with feature names
  warnings.warn(
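
The UserWarning appears because model2 was fitted on a DataFrame with named columns but is asked to predict on plain lists. A sketch of how to avoid it (an addition, not part of the recorded run):

# Wrapping the samples in a DataFrame with the training column names
# silences the feature-name warning
sample_df = pd.DataFrame(sample_check, columns=df.columns[:11])
print(model2.predict(sample_df))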
