0% found this document useful (0 votes)

38 views11 pages

PA Lab2

The document discusses predicting customer buying behavior using random forest and decision tree classification algorithms on a dataset containing customer information like age, annual salary and whether they purchased a product or not. It loads the dataset, splits it into training and test sets, performs feature scaling and fits a logistic regression classifier to the training set for prediction.

Uploaded by

syedkashif047

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views11 pages

PA Lab2

Uploaded by

syedkashif047

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

1 #Random Forest Classification, decision tree classification

1 import numpy as np
2 import matplotlib.pyplot as plt
3 import pandas as pd

1 df = pd.read_csv('car_data.csv')

1 display(df.head(5))

User ID Gender Age AnnualSalary Purchased

0 385 Male 35 20000 0

1 681 Male 40 43500 0

2 353 Male 49 74000 0

3 895 Male 40 107500 1

4 661 Male 25 79000 0

1 X = df.iloc[:400, [2,3]].values
2 y = df.iloc[:400, -1].values

1 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

3 print(X)

[[ 35 20000]
[ 40 43500]
[ 49 74000]
[ 40 107500]
[ 25 79000]
[ 47 33500]
[ 46 132500]
[ 42 64000]
[ 30 84500]
[ 41 52000]
[ 42 80000]
[ 47 23000]
[ 32 72500]
[ 27 57000]
[ 42 108000]
[ 33 149000]
[ 35 75000]
[ 35 53000]
[ 46 79000]
[ 39 134000]
[ 39 51500]
[ 49 39000]
[ 54 25500]
[ 41 61500]
[ 31 117500]
[ 24 58000]
[ 40 107000]
[ 40 97500]
[ 48 29000]
[ 38 147500]
[ 45 26000]
[ 32 67500]
[ 37 62000]
[ 41 79500]
[ 44 113500]
[ 47 41500]
[ 38 55000]
[ 39 114500]
[ 42 73000]
[ 26 15000]
[ 21 37500]
[ 59 39500]
[ 39 66500]
[ 43 80500]
[ 49 86000]
[ 37 75000]
[ 49 76500]
[ 28 123000]
[ 59 48500]
[ 40 60500]
[ 38 99500]
[ 51 35500]
[ 55 130000]
[ 23 56500]

2 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

[ 23 56500]
[ 49 43500]
[ 49 36000]
[ 48 21500]
[ 49 98500]

1 sampled_df = df.sample(n=400)
2 X = sampled_df.iloc[:, [2,3]].values
3 y = sampled_df.iloc[:, -1].values
4 print(X)

[[ 22 55000]
[ 26 23500]
[ 46 23000]
[ 26 35000]
[ 43 63500]
[ 30 62500]
[ 38 63500]
[ 36 21500]
[ 50 37500]
[ 38 72500]
[ 33 69000]
[ 32 86000]
[ 32 35500]
[ 51 45500]
[ 20 74000]
[ 38 74500]
[ 29 60500]
[ 35 22000]
[ 44 73500]
[ 35 47000]
[ 26 80500]
[ 35 57000]
[ 35 79000]
[ 57 61500]
[ 40 123500]
[ 45 82500]
[ 28 138500]
[ 32 72500]
[ 30 76500]
[ 38 65000]
[ 49 36500]
[ 20 86500]
[ 40 97500]
[ 36 51500]
[ 39 52500]
[ 25 90000]
[ 25 56500]
[ 26 17000]
[ 59 135500]
[ 49 119500]
[ 34 72000]
[ 28 59000]
[ 52 67500]
[ 50 45500]

3 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

[ 50 45500]
[ 40 148500]
[ 48 114500]
[ 20 77500]
[ 29 45500]
[ 26 118000]
[ 31 108500]
[ 41 60000]
[ 39 42000]
[ 59 42000]
[ 40 82500]
[ 60 124500]
[ 31 90500]
[ 51 35500]
[ 50 107500]

1 #splitting the dataset into the Training set and Test set
2 from sklearn.model_selection import train_test_split
3 X_train, X_test, y_train, y_test = train_test_split(X,y, test_size = 0.25, random_state =

1 #feature scaling
2 from sklearn.linear_model import LogisticRegression
3 from sklearn.preprocessing import StandardScaler
4 classifier = LogisticRegression()
5 sc = StandardScaler()
6 X_train = sc.fit_transform(X_train)
7 X_test = sc.transform(X_test)
8 y_train = y_train.ravel()
9 classifier.fit(X_train, y_train)

▾ LogisticRegression
LogisticRegression()

1 print(X_train)

[[ 1.85560654 2.30625342]
[ 0.41442093 -0.17358897]
[ 0.89481613 0.13639133]
[ 0.99089517 -0.14259094]
[-0.06597427 0.18288838]
[ 0.51049997 -0.45257124]
[-1.3150018 0.3068805 ]
[ 0.89481613 -0.99503676]
[ 0.89481613 1.99627313]
[-0.3542114 -0.79354957]
[-1.3150018 -1.21202297]
[ 0.12618381 0.24488444]
[ 0.03010477 1.11282927]
[-0.45029044 -0.43707222]
[ 0.41442093 -0.5145673 ]
[ 0.79873709 -1.13452789]
[ 1.08697421 -1.53750228]

4 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

[ 1.08697421 -1.53750228]
[-0.8346066 1.15932632]
[-1.41108084 0.44637163]
[ 0.31834189 0.29138148]
[-0.3542114 2.39924751]
[ 0.22226285 -0.54556533]
[ 0.03010477 0.16738936]
[-0.06597427 -1.10352986]
[ 0.31834189 -0.48356927]
[ 0.03010477 0.04339724]
[-0.45029044 0.24488444]
[ 0.60657901 -1.47550622]
[-1.12284372 -1.53750228]
[ 0.70265805 -1.16552592]
[-2.08363413 -0.09609389]
[-0.06597427 0.04339724]
[-1.21892276 0.52386671]
[ 0.03010477 1.99627313]
[ 0.22226285 -0.57656336]
[-0.3542114 -0.32857912]
[ 0.99089517 0.57036375]
[ 0.70265805 -0.32857912]
[-1.89147604 0.33787853]
[-0.45029044 -1.52200327]
[-0.93068564 0.4153736 ]
[-1.21892276 0.39987459]
[ 0.89481613 -1.07253183]
[ 1.18305325 0.16738936]
[ 0.51049997 -0.80904858]
[-0.93068564 -0.8245476 ]
[-1.3150018 0.27588247]
[ 0.79873709 -1.18102494]
[ 0.22226285 -0.18908798]
[-0.25813236 0.21388641]
[-0.06597427 -0.45257124]
[ 0.12618381 -0.39057518]
[ 2.2399227 -0.8245476 ]
[ 0.03010477 0.29138148]
[ 0.03010477 2.12026525]
[-1.50715988 -1.36701312]
[-0.3542114 0.66335784]
[-0.8346066 -0.40607419]

1 print(classifier.predict(sc.transform([[49, 74000]])))

[1]

1 y_pred = classifier.predict(X_test)
2 print(np.concatenate((y_pred.reshape(len(y_pred),1), y_test.reshape(len(y_test),1)),1))

[[0 0]
[0 0]
[1 1]
[0 1]

5 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

[0 0]
[1 1]
[1 1]
[1 1]
[0 0]
[0 1]
[0 0]
[1 1]
[1 1]
[1 1]
[0 0]
[1 1]
[0 1]
[0 0]
[1 1]
[0 0]
[1 1]
[1 1]
[1 1]
[1 1]
[0 0]
[0 0]
[0 0]
[0 0]
[1 1]
[0 0]
[0 0]
[1 0]
[0 0]
[1 1]
[0 0]
[0 0]
[0 0]
[0 0]
[1 1]
[1 1]
[0 0]
[0 0]
[0 0]
[0 0]
[1 1]
[1 1]
[0 0]
[0 1]
[1 0]
[1 1]
[0 1]
[1 1]
[0 0]
[0 0]
[0 0]
[0 0]
[1 0]
[0 0]

6 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

1 import seaborn as sns

2 import matplotlib.pyplot as plt
3 from sklearn.metrics import confusion_matrix
4 from sklearn.metrics import plot_confusion_matrix
5 from sklearn.metrics import accuracy_score
6 cm = confusion_matrix(y_test, y_pred)
7 sns.heatmap(cm, annot=True, fmt='g')
8 plt.xlabel('Predicted labels')
9 plt.ylabel('True labels')
10 plt.show()
11
12 accuracy = accuracy_score(y_test, y_pred)
13 print(cm)
14 print(f'Accuracy: {accuracy:.3f}')
15
16

---------------------------------------------------------------------------
ImportError Traceback (most recent call last)
<ipython-input-24-2a6c70da7013> in <module>
2 import matplotlib.pyplot as plt
3 from sklearn.metrics import confusion_matrix
----> 4 from sklearn.metrics import plot_confusion_matrix
5 from sklearn.metrics import accuracy_score
6 cm = confusion_matrix(y_test, y_pred)

ImportError: cannot import name 'plot_confusion_matrix' from 'sklearn.metrics'

(/usr/local/lib/python3.9/dist-packages/sklearn/metrics/__init__.py)

---------------------------------------------------------------------------
NOTE: If your import is failing due to a missing package, you can
manually install dependencies using either !pip or !apt.

To view examples of installing some common dependencies, click the

"Open Examples" button below.
---------------------------------------------------------------------------

7 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

1 from matplotlib.colors import ListedColormap

2 X_set, y_set = sc.inverse_transform(X_train), y_train
3 X1, X2 = np.meshgrid(np.arange(start = X_set[:, 0].min() - 10, stop = X_set[:, 0].max() +
4 np.arange(start = X_set[:, 1].min() - 1000, stop = X_set[:, 1].max() +
5 plt.contourf(X1, X2, classifier.predict(sc.transform(np.array([X1.ravel(), X2.ravel()]).T)).re
6 alpha = 0.75, cmap = ListedColormap(('slategrey', 'slateblue')))
7 plt.xlim(X1.min(), X1.max())
8 plt.ylim(X2.min(), X2.max())
9 for i, j in enumerate(np.unique(y_set)):
10 plt.scatter(X_set[y_set == j, 0], X_set[y_set == j, 1], color = ListedColormap(('slategrey
11 plt.title('Random Forest Classification (Train set)')
12 plt.xlabel('Age')
13 plt.ylabel('Estimated Salary')
14 plt.legend()
15 plt.show()

8 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

1 from matplotlib.colors import ListedColormap

2 X_set, y_set = sc.inverse_transform(X_test), y_test
3 X1, X2 = np.meshgrid(np.arange(start = X_set[:, 0].min() - 10, stop = X_set[:, 0].max() +
4 np.arange(start = X_set[:, 1].min() - 1000, stop = X_set[:, 1].max() +
5 plt.contourf(X1, X2, classifier.predict(sc.transform(np.array([X1.ravel(), X2.ravel()]).T)).re
6 alpha = 0.75, cmap = ListedColormap(('slategrey', 'slateblue')))
7 plt.xlim(X1.min(), X1.max())
8 plt.ylim(X2.min(), X2.max())
9 for i, j in enumerate(np.unique(y_set)):
10 plt.scatter(X_set[y_set == j, 0], X_set[y_set == j, 1], color = ListedColormap(('slategrey
11 plt.title('Random Forest Classification (Test set)')
12 plt.xlabel('Age')
13 plt.ylabel('Estimated Salary')
14 plt.legend()
15 plt.show()

1 # create a new data point to predict

2 new_data = [[35, 50000]]
3
4 # scale the new data using the same scaler used on the training data
5 new_data_scaled = sc.transform(new_data)
6
7 # make a prediction using the trained classifier
8 prediction = classifier.predict(new_data_scaled)

9 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

9
10 print(prediction)
11

[0]

1 new_data_2 = [[35, 50000], [42, 80700], [51, 65000], [23, 45600], [36, 42000], [41, 72000
2 df = pd.DataFrame(new_data_2, columns=[ 'Age', 'Annual Salary'])
3 df.index = df.index + 1
4 display(df)
5

Age Annual Salary

1 35 50000

2 42 80700

3 51 65000

4 23 45600

5 36 42000

6 41 72000

1 from tabulate import tabulate

2
3 results = []
4
5 for i, data in enumerate(new_data_2):
6 # scale the new data using the same scaler used on the training data
7 data_scaled = sc.transform([data])
8
9 # make a prediction using the trained classifier
10 prediction = classifier.predict(data_scaled)
11
12 # add the results to the table
13 if prediction == 1:
14 prediction_text = "Yes"
15 else:
16 prediction_text = "No"
17 results.append([i+1, data[0], data[1], prediction_text])
18
19 # print the table

10 of 11 21-09-2023, 13:01
PA Exp 2 Predicting buying customer_behaviour and Predict customer ... https://colab.research.google.com/drive/18ImfExgc2pyWKWnkMoEi...

20 headers = ["Customer", "Age", "Annual Salary", "Prediction"]

21 print(tabulate(results, headers=headers))
22

Customer Age Annual Salary Prediction

---------- ----- --------------- ------------
1 35 50000 No
2 42 80700 No
3 51 65000 Yes
4 23 45600 No
5 36 42000 No
6 41 72000 No

11 of 11 21-09-2023, 13:01

Regression Analysis - Cheatsheet
No ratings yet
Regression Analysis - Cheatsheet
9 pages
ml-batch(1)
No ratings yet
ml-batch(1)
36 pages
Udacity Machine Learning Analysis Supervised Learning
100% (1)
Udacity Machine Learning Analysis Supervised Learning
504 pages
ML Practical 205160694034
No ratings yet
ML Practical 205160694034
33 pages
Rajeek8 12
No ratings yet
Rajeek8 12
21 pages
Approachin190808095205 PDF
No ratings yet
Approachin190808095205 PDF
112 pages
ML pdf
No ratings yet
ML pdf
30 pages
DS-Food
No ratings yet
DS-Food
23 pages
Mercedes-Benz Greener Manufacturing Ai
0% (1)
Mercedes-Benz Greener Manufacturing Ai
16 pages
Final ML File
No ratings yet
Final ML File
34 pages
Lesson3
No ratings yet
Lesson3
5 pages
Train
No ratings yet
Train
17 pages
Exercise5 Solution
No ratings yet
Exercise5 Solution
22 pages
MLT 1 - 7 Kanish
No ratings yet
MLT 1 - 7 Kanish
24 pages
ml lab
No ratings yet
ml lab
23 pages
CP4252 Lab Manual(1)
No ratings yet
CP4252 Lab Manual(1)
13 pages
Big Data Review-3
No ratings yet
Big Data Review-3
13 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
47 pages
LAB-4 Report
No ratings yet
LAB-4 Report
21 pages
5) Randomforest - Ipynb - Colaboratory
No ratings yet
5) Randomforest - Ipynb - Colaboratory
12 pages
ML MANUAL WITH OUTPUTS (2)
No ratings yet
ML MANUAL WITH OUTPUTS (2)
30 pages
ML Shristi File
No ratings yet
ML Shristi File
49 pages
Ml Short Code_under Updating
No ratings yet
Ml Short Code_under Updating
4 pages
S6 - Data Mining Lab Experiments (Except 1)
No ratings yet
S6 - Data Mining Lab Experiments (Except 1)
6 pages
easy pract ml
No ratings yet
easy pract ml
7 pages
CP4252 Machine Learning Lab Manual
No ratings yet
CP4252 Machine Learning Lab Manual
26 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
23 pages
ml
No ratings yet
ml
17 pages
ML Complete Notes Hridoy.docx
No ratings yet
ML Complete Notes Hridoy.docx
5 pages
data-mining-lab-manual-CSE-VII-Sem
No ratings yet
data-mining-lab-manual-CSE-VII-Sem
63 pages
DA_Programs
No ratings yet
DA_Programs
44 pages
ML MANUAL
No ratings yet
ML MANUAL
24 pages
Mlda - Lab
No ratings yet
Mlda - Lab
35 pages
Progress of GRADIENT BOOSTING ALGORITHM FOR ELECTRICITY THEFT DETECTION IN POWER UTILITIES
No ratings yet
Progress of GRADIENT BOOSTING ALGORITHM FOR ELECTRICITY THEFT DETECTION IN POWER UTILITIES
10 pages
data preprocessing
No ratings yet
data preprocessing
9 pages
Coe Projects
No ratings yet
Coe Projects
7 pages
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 3
No ratings yet
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 3
30 pages
ML_Lab_01999676272
No ratings yet
ML_Lab_01999676272
12 pages
1
No ratings yet
1
13 pages
Classification
No ratings yet
Classification
3 pages
Machine learning lab manual
No ratings yet
Machine learning lab manual
22 pages
How To Choose The Right AI Foundation Model
No ratings yet
How To Choose The Right AI Foundation Model
20 pages
Datascience Pr 6 Veda
No ratings yet
Datascience Pr 6 Veda
6 pages
Shobit Sharma (2124399) ML lab file pdf
No ratings yet
Shobit Sharma (2124399) ML lab file pdf
19 pages
DSBDA05
No ratings yet
DSBDA05
5 pages
LAB MANUAL For Machine Learning
No ratings yet
LAB MANUAL For Machine Learning
15 pages
16BCB0126 VL2018195002535 Pe003
No ratings yet
16BCB0126 VL2018195002535 Pe003
40 pages
Unit2 ML Programs
No ratings yet
Unit2 ML Programs
7 pages
Price Opti Medium Code
No ratings yet
Price Opti Medium Code
15 pages
Case Study - Classifier
No ratings yet
Case Study - Classifier
5 pages
Anomaly Detection in Partical Physics
No ratings yet
Anomaly Detection in Partical Physics
179 pages
ML EXTERNAL XEROX
No ratings yet
ML EXTERNAL XEROX
1 page
Aiml Ex 4-7
No ratings yet
Aiml Ex 4-7
8 pages
dsbda_5
No ratings yet
dsbda_5
4 pages
Articles Xgboost Classification With Smote-Enn Algorithm
No ratings yet
Articles Xgboost Classification With Smote-Enn Algorithm
11 pages
Predictive_Analysis_of_Stock_Market_Trends_A_Machine_Learning_Approach
No ratings yet
Predictive_Analysis_of_Stock_Market_Trends_A_Machine_Learning_Approach
6 pages
Data analytics
No ratings yet
Data analytics
10 pages
Spam Detection in Emails Using Machine Learning
No ratings yet
Spam Detection in Emails Using Machine Learning
56 pages
Krakauer (2011)
No ratings yet
Krakauer (2011)
10 pages
Mlext
No ratings yet
Mlext
1 page
Capstone project_Jaro-Prof. Babji
No ratings yet
Capstone project_Jaro-Prof. Babji
5 pages
Logistic Regression vs Decision Tree
No ratings yet
Logistic Regression vs Decision Tree
2 pages
What Is Hadoop
No ratings yet
What Is Hadoop
162 pages
Model Evaluation and Selection Cheatsheet 1708023215
No ratings yet
Model Evaluation and Selection Cheatsheet 1708023215
7 pages
Untitled Document
No ratings yet
Untitled Document
19 pages
SRL-ACO A Text Augmentation Framework Based On Semantic Role
No ratings yet
SRL-ACO A Text Augmentation Framework Based On Semantic Role
18 pages
PCML Notes
No ratings yet
PCML Notes
249 pages
linear
No ratings yet
linear
2 pages
Neural Network Learning Rules
No ratings yet
Neural Network Learning Rules
33 pages
Harnessing AI For Smart Marketing
No ratings yet
Harnessing AI For Smart Marketing
9 pages
STC_AML_Py'25 Brochure_New
No ratings yet
STC_AML_Py'25 Brochure_New
2 pages
Learning Bayesian Network Structure Based On Ant Colony Optimization and Differential Evolution
No ratings yet
Learning Bayesian Network Structure Based On Ant Colony Optimization and Differential Evolution
24 pages
Neural Networks & Machine Learning: Worksheet 3
No ratings yet
Neural Networks & Machine Learning: Worksheet 3
3 pages
Book 7
No ratings yet
Book 7
35 pages
Class-Balanced Loss Based On Effective Number of Samples
No ratings yet
Class-Balanced Loss Based On Effective Number of Samples
11 pages
3170924 (1)
No ratings yet
3170924 (1)
2 pages
Enterprise Applications of Business Intelligence
No ratings yet
Enterprise Applications of Business Intelligence
12 pages
Data Mining Project
No ratings yet
Data Mining Project
5 pages
ML Algorithms
100% (1)
ML Algorithms
1 page
Society 5.0 and Artificial Intelligence With A Human Face
No ratings yet
Society 5.0 and Artificial Intelligence With A Human Face
15 pages
Vision Transformer Attention With Multi-Reservoir Echo State
No ratings yet
Vision Transformer Attention With Multi-Reservoir Echo State
17 pages
Sagar Resume
No ratings yet
Sagar Resume
1 page
SummaryofSubjects For B Eng
No ratings yet
SummaryofSubjects For B Eng
12 pages
1) s2.0 S277266222200011X Main
No ratings yet
1) s2.0 S277266222200011X Main
30 pages
Flexible Machine Learning-Based Cyberattack Detection Using Spatiotemporal
No ratings yet
Flexible Machine Learning-Based Cyberattack Detection Using Spatiotemporal
7 pages
XG Boost
No ratings yet
XG Boost
4 pages
Unit 2 AIML
No ratings yet
Unit 2 AIML
23 pages
Ankit: Academic Qualifications
No ratings yet
Ankit: Academic Qualifications
4 pages
A List of Factorial Math Constants
From Everand
A List of Factorial Math Constants
Archive Classics
No ratings yet
IBM System 360 RPG Debugging Template and Keypunch Card
From Everand
IBM System 360 RPG Debugging Template and Keypunch Card
Archive Classics
No ratings yet