
program-8

December 7, 2023

0.0.1 Implement the KMeans and DBSCAN algorithms using appropriate datasets.

[5]: import numpy as np import matplotlib.pyplot as plt import


pandas as pd from sklearn.cluster import DBSCAN data =
pd.read_csv("Mall_Customers.csv") data.head() print("Dataset
shape:", data.shape) data.isnull().any().any() x =
data.loc[:, ['Annual Income (k$)','Spending Score (1-
100)']].values
# cluster the data into five clusters
dbscan = DBSCAN(eps = 8, min_samples =
4).fit(x)
# fitting the model
labels = dbscan.labels_ # getting the labels
plt.scatter(x[:, 0], x[:,1], c = labels, cmap= "plasma")
# plotting the clusters
plt.xlabel("Income") # X-axis label
plt.ylabel("Spending Score") # Y-axis
label plt.show() # showing the plot

Dataset shape: (200, 5)
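
DBSCAN marks noise points with the label -1, so the number of clusters it found can be read directly from the label array. The lines below are a small optional sketch, not part of the original program, assuming labels and x from the cell above:

# optional summary of the DBSCAN result; label -1 marks noise points
import numpy as np
n_clusters = len(set(labels)) - (1 if -1 in labels else 0)   # clusters, excluding noise
n_noise = int(np.sum(labels == -1))                          # number of noise points
print("Estimated clusters:", n_clusters)
print("Noise points:", n_noise)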

[6]: import numpy as nm
import matplotlib.pyplot as mtp
import pandas as pd

# Importing the dataset
dataset = pd.read_csv('Mall_Customers.csv')

[7]: dataset

[7]: CustomerID Gender Age Annual Income (k$) Spending Score (1-100)
0 1 Male 19 15 39
1 2 Male 21 15 81
2 3 Female 20 16 6
3 4 Female 23 16 77
4 5 Female 31 17 40
.. … … … … …
195 196 Female 35 120 79
196 197 Female 45 126 28
197 198 Male 32 126 74
198 199 Male 32 137 18
199 200 Male 30 137 83
[200 rows x 5 columns]

[8]: x = dataset.iloc[:, [3, 4]].values

[9]: x

[9]: array([[ 15, 39],
[ 15, 81],
[ 16, 6],
[ 16, 77],
[ 17, 40],
[ 17, 76],
[ 18, 6],
[ 18, 94],
[ 19, 3],
[ 19, 72],
[ 19, 14],
[ 19, 99],
[ 20, 15],
[ 20, 77],
[ 20, 13],
[ 20, 79],
[ 21, 35],
[ 21, 66],
[ 23, 29],
[ 23, 98],
[ 24, 35],
[ 24, 73],
[ 25, 5],
[ 25, 73],
[ 28, 14],
[ 28, 82],
[ 28, 32],
[ 28, 61],
[ 29, 31],
[ 29, 87],
[ 30, 4],
[ 30, 73],
[ 33, 4],
[ 33, 92],
[ 33, 14],
[ 33, 81],
[ 34, 17],
[ 34, 73],
[ 37, 26],
[ 37, 75],
[ 38, 35],
[ 38, 92],
[ 39, 36],
[ 39, 61],
[ 39, 28],
[ 39, 65],
[ 40, 55],
[ 40, 47],
[ 40, 42],
[ 40, 42],
[ 42, 52],
[ 42, 60],
[ 43, 54],
[ 43, 60],
[ 43, 45],
[ 43, 41],
[ 44, 50],
[ 44, 46],
[ 46, 51],
[ 46, 46],
[ 46, 56],
[ 46, 55],
[ 47, 52],
[ 47, 59],
[ 48, 51],
[ 48, 59],
[ 48, 50],
[ 48, 48],
[ 48, 59],
[ 48, 47],
[ 49, 55],
[ 49, 42],
[ 50, 49],
[ 50, 56],
[ 54, 47],
[ 54, 54],
[ 54, 53],
[ 54, 48],
[ 54, 52],
[ 54, 42],
[ 54, 51],
[ 54, 55],
[ 54, 41],
[ 54, 44],
[ 54, 57],
[ 54, 46],
[ 57, 58],
[ 57, 55],
[ 58, 60],
[ 58, 46],
[ 59, 55],
[ 59, 41],
[ 60, 49],
[ 60, 40],
[ 60, 42],
[ 60, 52],
[ 60, 47],
[ 60, 50],
[ 61, 42],
[ 61, 49],
[ 62, 41],
[ 62, 48],
[ 62, 59],
[ 62, 55],
[ 62, 56],
[ 62, 42],
[ 63, 50],
[ 63, 46],
[ 63, 43],
[ 63, 48],
[ 63, 52],
[ 63, 54],
[ 64, 42],
[ 64, 46],
[ 65, 48],
[ 65, 50],
[ 65, 43],
[ 65, 59],
[ 67, 43],
[ 67, 57],
[ 67, 56],
[ 67, 40],
[ 69, 58],
[ 69, 91],
[ 70, 29],
[ 70, 77],
[ 71, 35],
[ 71, 95],
[ 71, 11],
[ 71, 75],
[ 71, 9],
[ 71, 75],
[ 72, 34],
[ 72, 71],
[ 73, 5],
[ 73, 88],
[ 73, 7],
[ 73, 73],
[ 74, 10],
[ 74, 72],
[ 75, 5],
[ 75, 93],
[ 76, 40],
[ 76, 87],
[ 77, 12],
[ 77, 97],
[ 77, 36],
[ 77, 74],
[ 78, 22],
[ 78, 90],
[ 78, 17],
[ 78, 88],
[ 78, 20],
[ 78, 76],
[ 78, 16],
[ 78, 89],
[ 78, 1],
[ 78, 78],
[ 78, 1],
[ 78, 73],
[ 79, 35],
[ 79, 83],
[ 81, 5],
[ 81, 93],
[ 85, 26],
[ 85, 75],
[ 86, 20],
[ 86, 95],
[ 87, 27],
[ 87, 63],
[ 87, 13],
[ 87, 75],
[ 87, 10],
[ 87, 92],
[ 88, 13],
[ 88, 86],
[ 88, 15],
[ 88, 69],
[ 93, 14],
[ 93, 90],
[ 97, 32],
[ 97, 86],
[ 98, 15],
[ 98, 88],
[ 99, 39],
[ 99, 97],
[101, 24],
[101, 68],
[103, 17],
[103, 85],
[103, 23],
[103, 69],
[113, 8],
[113, 91],
[120, 16],
[120, 79],
[126, 28],
[126, 74],
[137, 18],
[137, 83]], dtype=int64)
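
The positional selection above picks columns 3 and 4 of the dataframe, i.e. Annual Income (k$) and Spending Score (1-100), the same two features the DBSCAN cell selected by name. As an optional, equivalent alternative (not in the original notebook), the columns can be selected by label, which is slightly more robust if the column order ever changes:

x = dataset.loc[:, ['Annual Income (k$)', 'Spending Score (1-100)']].values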

[10]: # finding the optimal number of clusters using the elbow method
from sklearn.cluster import KMeans

wcss_list = []   # initializing the list for the values of WCSS

# using a for loop for iterations from 1 to 10
for i in range(1, 11):
    kmeans = KMeans(n_clusters=i, init='k-means++', random_state=42)
    kmeans.fit(x)
    wcss_list.append(kmeans.inertia_)

mtp.plot(range(1, 11), wcss_list)
mtp.title('The Elbow Method Graph')
mtp.xlabel('Number of clusters (k)')
mtp.ylabel('wcss_list')
mtp.show()

C:\Users\shilpa\anaconda3\Lib\site-packages\sklearn\cluster\_kmeans.py:1412:
FutureWarning: The default value of `n_init` will change from 10 to 'auto' in
1.4. Set the value of `n_init` explicitly to suppress the warning
  super()._check_params_vs_input(X, default_n_init=10)
C:\Users\shilpa\anaconda3\Lib\site-packages\sklearn\cluster\_kmeans.py:1436:
UserWarning: KMeans is known to have a memory leak on Windows with MKL, when
there are less chunks than available threads. You can avoid it by setting the
environment variable OMP_NUM_THREADS=1.
  warnings.warn(
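
This pair of warnings is emitted once for every KMeans fit in the loop, which is why it repeats in the output. Following the advice in the messages themselves, a small optional sketch (not part of the original program) that keeps the output quiet: set OMP_NUM_THREADS before the numeric libraries spin up their thread pools, ideally at the very top of the notebook, and pass n_init explicitly.

import os
os.environ["OMP_NUM_THREADS"] = "1"   # avoids the MKL memory-leak warning on Windows

from sklearn.cluster import KMeans

# passing n_init explicitly silences the FutureWarning about its changing default
kmeans = KMeans(n_clusters=5, init='k-means++', n_init=10, random_state=42)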

[11]: # training the K-means model on the dataset
kmeans = KMeans(n_clusters=5, init='k-means++', random_state=42)
y_predict = kmeans.fit_predict(x)

# visualizing the clusters
mtp.scatter(x[y_predict == 0, 0], x[y_predict == 0, 1], s=100, c='blue', label='Cluster 1')     # first cluster
mtp.scatter(x[y_predict == 1, 0], x[y_predict == 1, 1], s=100, c='green', label='Cluster 2')    # second cluster
mtp.scatter(x[y_predict == 2, 0], x[y_predict == 2, 1], s=100, c='red', label='Cluster 3')      # third cluster
mtp.scatter(x[y_predict == 3, 0], x[y_predict == 3, 1], s=100, c='cyan', label='Cluster 4')     # fourth cluster
mtp.scatter(x[y_predict == 4, 0], x[y_predict == 4, 1], s=100, c='magenta', label='Cluster 5')  # fifth cluster
mtp.scatter(kmeans.cluster_centers_[:, 0], kmeans.cluster_centers_[:, 1], s=300, c='yellow', label='Centroid')

mtp.title('Clusters of customers')
mtp.xlabel('Annual Income (k$)')
mtp.ylabel('Spending Score (1-100)')
mtp.legend()
mtp.show()

C:\Users\shilpa\anaconda3\Lib\site-packages\sklearn\cluster\_kmeans.py:1412:
FutureWarning: The default value of `n_init` will change from 10 to 'auto' in
1.4. Set the value of `n_init` explicitly to suppress the warning
  super()._check_params_vs_input(X, default_n_init=10)
C:\Users\shilpa\anaconda3\Lib\site-packages\sklearn\cluster\_kmeans.py:1436:
UserWarning: KMeans is known to have a memory leak on Windows with MKL, when
there are less chunks than available threads. You can avoid it by setting the
environment variable OMP_NUM_THREADS=1.
  warnings.warn(
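
The program evaluates the clustering visually. As an optional extra check, not part of the original program, the silhouette score gives a single number summarizing how well separated the five KMeans clusters are (values closer to 1 are better):

from sklearn.metrics import silhouette_score

# silhouette score for the 5-cluster KMeans result on the two selected features
print("KMeans silhouette:", silhouette_score(x, y_predict))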

[ ]:
