SlideShare a Scribd company logo
UNSUPERVISED
MACHINE LEARNING
presented by- Pravinkumar Landge
1
• Introduction
• Clustering
• K-means clustering
• Hierarchical clustering
• Comparison between K-means and Hierarchical
• DBSCAN clustering
2
Introduction
• Unsupervised learning is a type of machine
learning algorithm used to draw inferences from datasets
consisting of input data without labeled responses. The
most common unsupervised learning method is cluster
analysis, which is used for exploratory data analysis to
find hidden patterns or grouping in data.
3
What is clustering?
• A group of objects that are similar to other objects in the
cluster, and dissimilar to data points in other clusters.
4
Use of clustering
Clustering has been widely used across industries for
years:
• Biology - for genetic and species grouping;
• Medical imaging - for distinguishing between different
kinds of tissues;
• Market research - for differentiating groups of customers
based on some attributes
• Recommender systems - giving you better Amazon
purchase suggestions or Netflix movie matches.
5
Clustering algorithms
• Partition-based clustering
• Relatively efficient
• E.g. k-means
• Hierarchical clustering
• Produces trees of clusters
• E.g. Agglomerative, Divisive
• Density-based clustering
• Produces arbitrary shaped clusters
• E.g. DBSCAN
6
K-means clustering
• k-means is a partitioning clustering
• K-means divides the data into non-overlapping subsets
(clusters) without any cluster-internal structure
• Examples within a cluster are very similar
• Examples across different clusters are very different
7
Determine the similarity or dissimilarity
8
1-dimensional similarity/distance
9
2-dimensional similarity/distance
10
How does k-means clustering works?
1. Randomly place k centroids, one for each cluster
2. Calculate the distance of each point from each centroid
3. Assign each data point(object) to the closest centroid,
creating a cluster
4. Recalculate the position of the k centroids
5. Repeat the steps 2-4, until the centroids no longer
move
11
12
Choosing k
13
• K-means is partitioning algorithm relatively efficient for
medium and large sized databases
• Produces sphere-like clusters
• Needs number of clusters (k)
14
Hierarchical clustering
• Hierarchical clustering algorithms build a hierarchy of
clusters where each node is a cluster consists of the
clusters of its daughter nodes.
• Hierarchical clustering strategies
• Divisive (top down)
• Agglomerative (bottom up)
15
Agglomerative algorithm
1. Create n clusters, one for each data point
2. Compute the proximity matrix
3. Repeat
1. Merge the two closest clusters
2. Update the proximity matrix
4. Until only a single cluster remains
16
Similarity/Distance
17
Distance between clusters
• Single-Linkage clustering
• Minimum distance between clusters
• Complete-Linkage Clustering
• Maximum distance between clusters
• Average linkage clustering
• Average distance between clusters
• Centroid linkage clustering
• Distance between cluster centroids
18
• Advantages
• Doesn’t required number of clusters to be specified
• Easy to implement
• Produces a dendrogram, which helps with understanding the data
19
• Disadvantages
• Can never undo any previous steps throughout the algorithm
• Generally has long runtimes
• Sometimes difficult to identify the number of clusters by the
dendrogram
20
Hierarchical clustering Vs. K-means
K-means Hierarchical Clustering
Much more efficient Can be slow for large datasets
Requires the number of clusters to be
specified
Does not require the number of
clusters to run
Gives only one partitioning of the data
based on the predefined number of
clusters
Gives more than one partitioning
depending on the resolution
Potentially returns different clusters
each time it is run due to random
initialization of centroids
Always generates the same clusters
21
DBSCAN clustering
• When applied to tasks with arbitrary shaped clusters or
clusters within clusters, traditional techniques might not
be able to achieve good results
• Partitioning based algorithms has no notion of outliers that
is, all points are assigned to a cluster even if they do not
belong in any
• In contrast, density-based clustering locates regions
of high density that are separated from one another by
regions of low density. Density in this context is defined as
the number of points within a specified radius.
22
23
K-means vs density based clustering
24
What is DBSCAN?
• DBSCAN (Density-Based Spatial Clustering of
Applications with Noise)
• Is one of the most common clustering algorithms
• Works based on density of objects
• R (Radius of neighborhood)
• Radius (R) that if includes enough number
of points within, we call it a dense area
• M (Min number of neighbors)
• The minimum number of data points
we want in a neighborhood to define a cluster
25
How DBSCAN works?
26
DBSCAN algorithm- core point
• R=2 units M=6
27
DBSCAN algorithm- border point
• R=2 unit M=6
28
29
DBSCAN algorithm- outliers
30
DBSCAN algorithm- identify all points
31
DBSCAN algorithm- clusters?
32
Advantages of DBSCAN
1. Arbitrarily shaped clusters
2. Robust to outliers
3. Does not require specification
of the number of clusters
33
34
Ad

More Related Content

What's hot (20)

Decision trees in Machine Learning
Decision trees in Machine Learning Decision trees in Machine Learning
Decision trees in Machine Learning
Mohammad Junaid Khan
 
Support Vector Machine - How Support Vector Machine works | SVM in Machine Le...
Support Vector Machine - How Support Vector Machine works | SVM in Machine Le...Support Vector Machine - How Support Vector Machine works | SVM in Machine Le...
Support Vector Machine - How Support Vector Machine works | SVM in Machine Le...
Simplilearn
 
Density Based Clustering
Density Based ClusteringDensity Based Clustering
Density Based Clustering
SSA KPI
 
Support vector machine
Support vector machineSupport vector machine
Support vector machine
zekeLabs Technologies
 
Ensemble methods
Ensemble methods Ensemble methods
Ensemble methods
zekeLabs Technologies
 
Ensemble methods in machine learning
Ensemble methods in machine learningEnsemble methods in machine learning
Ensemble methods in machine learning
SANTHOSH RAJA M G
 
Ensemble methods
Ensemble methodsEnsemble methods
Ensemble methods
Christopher Marker
 
K means clustering
K means clusteringK means clustering
K means clustering
keshav goyal
 
Ensemble learning Techniques
Ensemble learning TechniquesEnsemble learning Techniques
Ensemble learning Techniques
Babu Priyavrat
 
Multiclass classification of imbalanced data
Multiclass classification of imbalanced dataMulticlass classification of imbalanced data
Multiclass classification of imbalanced data
SaurabhWani6
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
Knoldus Inc.
 
Density based clustering
Density based clusteringDensity based clustering
Density based clustering
YaswanthHariKumarVud
 
05 Clustering in Data Mining
05 Clustering in Data Mining05 Clustering in Data Mining
05 Clustering in Data Mining
Valerii Klymchuk
 
Support Vector Machine ppt presentation
Support Vector Machine ppt presentationSupport Vector Machine ppt presentation
Support Vector Machine ppt presentation
AyanaRukasar
 
Introduction to Machine Learning Classifiers
Introduction to Machine Learning ClassifiersIntroduction to Machine Learning Classifiers
Introduction to Machine Learning Classifiers
Functional Imperative
 
K mean-clustering
K mean-clusteringK mean-clustering
K mean-clustering
Afzaal Subhani
 
3.5 model based clustering
3.5 model based clustering3.5 model based clustering
3.5 model based clustering
Krish_ver2
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Rahul Jain
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Simplilearn
 
K-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierK-Nearest Neighbor Classifier
K-Nearest Neighbor Classifier
Neha Kulkarni
 
Decision trees in Machine Learning
Decision trees in Machine Learning Decision trees in Machine Learning
Decision trees in Machine Learning
Mohammad Junaid Khan
 
Support Vector Machine - How Support Vector Machine works | SVM in Machine Le...
Support Vector Machine - How Support Vector Machine works | SVM in Machine Le...Support Vector Machine - How Support Vector Machine works | SVM in Machine Le...
Support Vector Machine - How Support Vector Machine works | SVM in Machine Le...
Simplilearn
 
Density Based Clustering
Density Based ClusteringDensity Based Clustering
Density Based Clustering
SSA KPI
 
Ensemble methods in machine learning
Ensemble methods in machine learningEnsemble methods in machine learning
Ensemble methods in machine learning
SANTHOSH RAJA M G
 
K means clustering
K means clusteringK means clustering
K means clustering
keshav goyal
 
Ensemble learning Techniques
Ensemble learning TechniquesEnsemble learning Techniques
Ensemble learning Techniques
Babu Priyavrat
 
Multiclass classification of imbalanced data
Multiclass classification of imbalanced dataMulticlass classification of imbalanced data
Multiclass classification of imbalanced data
SaurabhWani6
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
Knoldus Inc.
 
05 Clustering in Data Mining
05 Clustering in Data Mining05 Clustering in Data Mining
05 Clustering in Data Mining
Valerii Klymchuk
 
Support Vector Machine ppt presentation
Support Vector Machine ppt presentationSupport Vector Machine ppt presentation
Support Vector Machine ppt presentation
AyanaRukasar
 
Introduction to Machine Learning Classifiers
Introduction to Machine Learning ClassifiersIntroduction to Machine Learning Classifiers
Introduction to Machine Learning Classifiers
Functional Imperative
 
K mean-clustering
K mean-clusteringK mean-clustering
K mean-clustering
Afzaal Subhani
 
3.5 model based clustering
3.5 model based clustering3.5 model based clustering
3.5 model based clustering
Krish_ver2
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Rahul Jain
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Simplilearn
 
K-Nearest Neighbor Classifier
K-Nearest Neighbor ClassifierK-Nearest Neighbor Classifier
K-Nearest Neighbor Classifier
Neha Kulkarni
 

Similar to Unsupervised learning (clustering) (20)

DS9 - Clustering.pptx
DS9 - Clustering.pptxDS9 - Clustering.pptx
DS9 - Clustering.pptx
JK970901
 
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
NANDHINIS900805
 
Unsupervised%20Learninffffg (2).pptx. application
Unsupervised%20Learninffffg (2).pptx. applicationUnsupervised%20Learninffffg (2).pptx. application
Unsupervised%20Learninffffg (2).pptx. application
ShabirAhmad625218
 
Machine Learning : Clustering - Cluster analysis.pptx
Machine Learning : Clustering - Cluster analysis.pptxMachine Learning : Clustering - Cluster analysis.pptx
Machine Learning : Clustering - Cluster analysis.pptx
tecaviw979
 
Clustering on DSS
Clustering on DSSClustering on DSS
Clustering on DSS
Enaam Alotaibi
 
Data mining techniques unit v
Data mining techniques unit vData mining techniques unit v
Data mining techniques unit v
malathieswaran29
 
26-Clustering MTech-2017.ppt
26-Clustering MTech-2017.ppt26-Clustering MTech-2017.ppt
26-Clustering MTech-2017.ppt
vikassingh569137
 
machine learning - Clustering in R
machine learning - Clustering in Rmachine learning - Clustering in R
machine learning - Clustering in R
Sudhakar Chavan
 
algoritma klastering.pdf
algoritma klastering.pdfalgoritma klastering.pdf
algoritma klastering.pdf
bintis1
 
Clustering as a unsupervised learning method inin machine learning
Clustering as a unsupervised learning method inin machine learningClustering as a unsupervised learning method inin machine learning
Clustering as a unsupervised learning method inin machine learning
tanishqgujari
 
clustering using different methods in .pdf
clustering using different methods in .pdfclustering using different methods in .pdf
clustering using different methods in .pdf
officialnovice7
 
UNIT_V_Cluster Analysis.pptx
UNIT_V_Cluster Analysis.pptxUNIT_V_Cluster Analysis.pptx
UNIT_V_Cluster Analysis.pptx
sandeepsandy494692
 
Clustering in Data Mining
Clustering in Data MiningClustering in Data Mining
Clustering in Data Mining
Archana Swaminathan
 
PPT s10-machine vision-s2
PPT s10-machine vision-s2PPT s10-machine vision-s2
PPT s10-machine vision-s2
Binus Online Learning
 
Clustering
ClusteringClustering
Clustering
Dr. C.V. Suresh Babu
 
Clusteryanam
ClusteryanamClusteryanam
Clusteryanam
Nagasuri Bala Venkateswarlu
 
Advanced database and data mining & clustering concepts
Advanced database and data mining & clustering conceptsAdvanced database and data mining & clustering concepts
Advanced database and data mining & clustering concepts
NithyananthSengottai
 
CSA 3702 machine learning module 3
CSA 3702 machine learning module 3CSA 3702 machine learning module 3
CSA 3702 machine learning module 3
Nandhini S
 
Algorithms used in AIML and the need for aiml basic use cases
Algorithms used in AIML and the need for aiml basic use casesAlgorithms used in AIML and the need for aiml basic use cases
Algorithms used in AIML and the need for aiml basic use cases
Bhagirathi12
 
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Maninda Edirisooriya
 
DS9 - Clustering.pptx
DS9 - Clustering.pptxDS9 - Clustering.pptx
DS9 - Clustering.pptx
JK970901
 
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
NANDHINIS900805
 
Unsupervised%20Learninffffg (2).pptx. application
Unsupervised%20Learninffffg (2).pptx. applicationUnsupervised%20Learninffffg (2).pptx. application
Unsupervised%20Learninffffg (2).pptx. application
ShabirAhmad625218
 
Machine Learning : Clustering - Cluster analysis.pptx
Machine Learning : Clustering - Cluster analysis.pptxMachine Learning : Clustering - Cluster analysis.pptx
Machine Learning : Clustering - Cluster analysis.pptx
tecaviw979
 
Clustering on DSS
Clustering on DSSClustering on DSS
Clustering on DSS
Enaam Alotaibi
 
Data mining techniques unit v
Data mining techniques unit vData mining techniques unit v
Data mining techniques unit v
malathieswaran29
 
26-Clustering MTech-2017.ppt
26-Clustering MTech-2017.ppt26-Clustering MTech-2017.ppt
26-Clustering MTech-2017.ppt
vikassingh569137
 
machine learning - Clustering in R
machine learning - Clustering in Rmachine learning - Clustering in R
machine learning - Clustering in R
Sudhakar Chavan
 
algoritma klastering.pdf
algoritma klastering.pdfalgoritma klastering.pdf
algoritma klastering.pdf
bintis1
 
Clustering as a unsupervised learning method inin machine learning
Clustering as a unsupervised learning method inin machine learningClustering as a unsupervised learning method inin machine learning
Clustering as a unsupervised learning method inin machine learning
tanishqgujari
 
clustering using different methods in .pdf
clustering using different methods in .pdfclustering using different methods in .pdf
clustering using different methods in .pdf
officialnovice7
 
UNIT_V_Cluster Analysis.pptx
UNIT_V_Cluster Analysis.pptxUNIT_V_Cluster Analysis.pptx
UNIT_V_Cluster Analysis.pptx
sandeepsandy494692
 
Advanced database and data mining & clustering concepts
Advanced database and data mining & clustering conceptsAdvanced database and data mining & clustering concepts
Advanced database and data mining & clustering concepts
NithyananthSengottai
 
CSA 3702 machine learning module 3
CSA 3702 machine learning module 3CSA 3702 machine learning module 3
CSA 3702 machine learning module 3
Nandhini S
 
Algorithms used in AIML and the need for aiml basic use cases
Algorithms used in AIML and the need for aiml basic use casesAlgorithms used in AIML and the need for aiml basic use cases
Algorithms used in AIML and the need for aiml basic use cases
Bhagirathi12
 
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Lecture 11 - KNN and Clustering, a lecture in subject module Statistical & Ma...
Maninda Edirisooriya
 
Ad

Recently uploaded (20)

new ppt artificial intelligence historyyy
new ppt artificial intelligence historyyynew ppt artificial intelligence historyyy
new ppt artificial intelligence historyyy
PianoPianist
 
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G..."Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
Infopitaara
 
Mathematical foundation machine learning.pdf
Mathematical foundation machine learning.pdfMathematical foundation machine learning.pdf
Mathematical foundation machine learning.pdf
TalhaShahid49
 
Level 1-Safety.pptx Presentation of Electrical Safety
Level 1-Safety.pptx Presentation of Electrical SafetyLevel 1-Safety.pptx Presentation of Electrical Safety
Level 1-Safety.pptx Presentation of Electrical Safety
JoseAlbertoCariasDel
 
Oil-gas_Unconventional oil and gass_reseviours.pdf
Oil-gas_Unconventional oil and gass_reseviours.pdfOil-gas_Unconventional oil and gass_reseviours.pdf
Oil-gas_Unconventional oil and gass_reseviours.pdf
M7md3li2
 
Explainable-Artificial-Intelligence-XAI-A-Deep-Dive (1).pptx
Explainable-Artificial-Intelligence-XAI-A-Deep-Dive (1).pptxExplainable-Artificial-Intelligence-XAI-A-Deep-Dive (1).pptx
Explainable-Artificial-Intelligence-XAI-A-Deep-Dive (1).pptx
MahaveerVPandit
 
Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...
Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...
Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...
Journal of Soft Computing in Civil Engineering
 
Data Structures_Introduction to algorithms.pptx
Data Structures_Introduction to algorithms.pptxData Structures_Introduction to algorithms.pptx
Data Structures_Introduction to algorithms.pptx
RushaliDeshmukh2
 
QA/QC Manager (Quality management Expert)
QA/QC Manager (Quality management Expert)QA/QC Manager (Quality management Expert)
QA/QC Manager (Quality management Expert)
rccbatchplant
 
Avnet Silica's PCIM 2025 Highlights Flyer
Avnet Silica's PCIM 2025 Highlights FlyerAvnet Silica's PCIM 2025 Highlights Flyer
Avnet Silica's PCIM 2025 Highlights Flyer
WillDavies22
 
The Gaussian Process Modeling Module in UQLab
The Gaussian Process Modeling Module in UQLabThe Gaussian Process Modeling Module in UQLab
The Gaussian Process Modeling Module in UQLab
Journal of Soft Computing in Civil Engineering
 
Introduction to Zoomlion Earthmoving.pptx
Introduction to Zoomlion Earthmoving.pptxIntroduction to Zoomlion Earthmoving.pptx
Introduction to Zoomlion Earthmoving.pptx
AS1920
 
DT REPORT by Tech titan GROUP to introduce the subject design Thinking
DT REPORT by Tech titan GROUP to introduce the subject design ThinkingDT REPORT by Tech titan GROUP to introduce the subject design Thinking
DT REPORT by Tech titan GROUP to introduce the subject design Thinking
DhruvChotaliya2
 
Data Structures_Searching and Sorting.pptx
Data Structures_Searching and Sorting.pptxData Structures_Searching and Sorting.pptx
Data Structures_Searching and Sorting.pptx
RushaliDeshmukh2
 
"Boiler Feed Pump (BFP): Working, Applications, Advantages, and Limitations E...
"Boiler Feed Pump (BFP): Working, Applications, Advantages, and Limitations E..."Boiler Feed Pump (BFP): Working, Applications, Advantages, and Limitations E...
"Boiler Feed Pump (BFP): Working, Applications, Advantages, and Limitations E...
Infopitaara
 
Compiler Design Unit1 PPT Phases of Compiler.pptx
Compiler Design Unit1 PPT Phases of Compiler.pptxCompiler Design Unit1 PPT Phases of Compiler.pptx
Compiler Design Unit1 PPT Phases of Compiler.pptx
RushaliDeshmukh2
 
Artificial Intelligence (AI) basics.pptx
Artificial Intelligence (AI) basics.pptxArtificial Intelligence (AI) basics.pptx
Artificial Intelligence (AI) basics.pptx
aditichinar
 
Smart_Storage_Systems_Production_Engineering.pptx
Smart_Storage_Systems_Production_Engineering.pptxSmart_Storage_Systems_Production_Engineering.pptx
Smart_Storage_Systems_Production_Engineering.pptx
rushikeshnavghare94
 
introduction to machine learining for beginers
introduction to machine learining for beginersintroduction to machine learining for beginers
introduction to machine learining for beginers
JoydebSheet
 
Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...
Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...
Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...
Journal of Soft Computing in Civil Engineering
 
new ppt artificial intelligence historyyy
new ppt artificial intelligence historyyynew ppt artificial intelligence historyyy
new ppt artificial intelligence historyyy
PianoPianist
 
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G..."Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
Infopitaara
 
Mathematical foundation machine learning.pdf
Mathematical foundation machine learning.pdfMathematical foundation machine learning.pdf
Mathematical foundation machine learning.pdf
TalhaShahid49
 
Level 1-Safety.pptx Presentation of Electrical Safety
Level 1-Safety.pptx Presentation of Electrical SafetyLevel 1-Safety.pptx Presentation of Electrical Safety
Level 1-Safety.pptx Presentation of Electrical Safety
JoseAlbertoCariasDel
 
Oil-gas_Unconventional oil and gass_reseviours.pdf
Oil-gas_Unconventional oil and gass_reseviours.pdfOil-gas_Unconventional oil and gass_reseviours.pdf
Oil-gas_Unconventional oil and gass_reseviours.pdf
M7md3li2
 
Explainable-Artificial-Intelligence-XAI-A-Deep-Dive (1).pptx
Explainable-Artificial-Intelligence-XAI-A-Deep-Dive (1).pptxExplainable-Artificial-Intelligence-XAI-A-Deep-Dive (1).pptx
Explainable-Artificial-Intelligence-XAI-A-Deep-Dive (1).pptx
MahaveerVPandit
 
Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...
Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...
Development of MLR, ANN and ANFIS Models for Estimation of PCUs at Different ...
Journal of Soft Computing in Civil Engineering
 
Data Structures_Introduction to algorithms.pptx
Data Structures_Introduction to algorithms.pptxData Structures_Introduction to algorithms.pptx
Data Structures_Introduction to algorithms.pptx
RushaliDeshmukh2
 
QA/QC Manager (Quality management Expert)
QA/QC Manager (Quality management Expert)QA/QC Manager (Quality management Expert)
QA/QC Manager (Quality management Expert)
rccbatchplant
 
Avnet Silica's PCIM 2025 Highlights Flyer
Avnet Silica's PCIM 2025 Highlights FlyerAvnet Silica's PCIM 2025 Highlights Flyer
Avnet Silica's PCIM 2025 Highlights Flyer
WillDavies22
 
Introduction to Zoomlion Earthmoving.pptx
Introduction to Zoomlion Earthmoving.pptxIntroduction to Zoomlion Earthmoving.pptx
Introduction to Zoomlion Earthmoving.pptx
AS1920
 
DT REPORT by Tech titan GROUP to introduce the subject design Thinking
DT REPORT by Tech titan GROUP to introduce the subject design ThinkingDT REPORT by Tech titan GROUP to introduce the subject design Thinking
DT REPORT by Tech titan GROUP to introduce the subject design Thinking
DhruvChotaliya2
 
Data Structures_Searching and Sorting.pptx
Data Structures_Searching and Sorting.pptxData Structures_Searching and Sorting.pptx
Data Structures_Searching and Sorting.pptx
RushaliDeshmukh2
 
"Boiler Feed Pump (BFP): Working, Applications, Advantages, and Limitations E...
"Boiler Feed Pump (BFP): Working, Applications, Advantages, and Limitations E..."Boiler Feed Pump (BFP): Working, Applications, Advantages, and Limitations E...
"Boiler Feed Pump (BFP): Working, Applications, Advantages, and Limitations E...
Infopitaara
 
Compiler Design Unit1 PPT Phases of Compiler.pptx
Compiler Design Unit1 PPT Phases of Compiler.pptxCompiler Design Unit1 PPT Phases of Compiler.pptx
Compiler Design Unit1 PPT Phases of Compiler.pptx
RushaliDeshmukh2
 
Artificial Intelligence (AI) basics.pptx
Artificial Intelligence (AI) basics.pptxArtificial Intelligence (AI) basics.pptx
Artificial Intelligence (AI) basics.pptx
aditichinar
 
Smart_Storage_Systems_Production_Engineering.pptx
Smart_Storage_Systems_Production_Engineering.pptxSmart_Storage_Systems_Production_Engineering.pptx
Smart_Storage_Systems_Production_Engineering.pptx
rushikeshnavghare94
 
introduction to machine learining for beginers
introduction to machine learining for beginersintroduction to machine learining for beginers
introduction to machine learining for beginers
JoydebSheet
 
Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...
Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...
Structural Response of Reinforced Self-Compacting Concrete Deep Beam Using Fi...
Journal of Soft Computing in Civil Engineering
 
Ad

Unsupervised learning (clustering)

  • 2. • Introduction • Clustering • K-means clustering • Hierarchical clustering • Comparison between K-means and Hierarchical • DBSCAN clustering 2
  • 3. Introduction • Unsupervised learning is a type of machine learning algorithm used to draw inferences from datasets consisting of input data without labeled responses. The most common unsupervised learning method is cluster analysis, which is used for exploratory data analysis to find hidden patterns or grouping in data. 3
  • 4. What is clustering? • A group of objects that are similar to other objects in the cluster, and dissimilar to data points in other clusters. 4
  • 5. Use of clustering Clustering has been widely used across industries for years: • Biology - for genetic and species grouping; • Medical imaging - for distinguishing between different kinds of tissues; • Market research - for differentiating groups of customers based on some attributes • Recommender systems - giving you better Amazon purchase suggestions or Netflix movie matches. 5
  • 6. Clustering algorithms • Partition-based clustering • Relatively efficient • E.g. k-means • Hierarchical clustering • Produces trees of clusters • E.g. Agglomerative, Divisive • Density-based clustering • Produces arbitrary shaped clusters • E.g. DBSCAN 6
  • 7. K-means clustering • k-means is a partitioning clustering • K-means divides the data into non-overlapping subsets (clusters) without any cluster-internal structure • Examples within a cluster are very similar • Examples across different clusters are very different 7
  • 8. Determine the similarity or dissimilarity 8
  • 11. How does k-means clustering works? 1. Randomly place k centroids, one for each cluster 2. Calculate the distance of each point from each centroid 3. Assign each data point(object) to the closest centroid, creating a cluster 4. Recalculate the position of the k centroids 5. Repeat the steps 2-4, until the centroids no longer move 11
  • 12. 12
  • 14. • K-means is partitioning algorithm relatively efficient for medium and large sized databases • Produces sphere-like clusters • Needs number of clusters (k) 14
  • 15. Hierarchical clustering • Hierarchical clustering algorithms build a hierarchy of clusters where each node is a cluster consists of the clusters of its daughter nodes. • Hierarchical clustering strategies • Divisive (top down) • Agglomerative (bottom up) 15
  • 16. Agglomerative algorithm 1. Create n clusters, one for each data point 2. Compute the proximity matrix 3. Repeat 1. Merge the two closest clusters 2. Update the proximity matrix 4. Until only a single cluster remains 16
  • 18. Distance between clusters • Single-Linkage clustering • Minimum distance between clusters • Complete-Linkage Clustering • Maximum distance between clusters • Average linkage clustering • Average distance between clusters • Centroid linkage clustering • Distance between cluster centroids 18
  • 19. • Advantages • Doesn’t required number of clusters to be specified • Easy to implement • Produces a dendrogram, which helps with understanding the data 19
  • 20. • Disadvantages • Can never undo any previous steps throughout the algorithm • Generally has long runtimes • Sometimes difficult to identify the number of clusters by the dendrogram 20
  • 21. Hierarchical clustering Vs. K-means K-means Hierarchical Clustering Much more efficient Can be slow for large datasets Requires the number of clusters to be specified Does not require the number of clusters to run Gives only one partitioning of the data based on the predefined number of clusters Gives more than one partitioning depending on the resolution Potentially returns different clusters each time it is run due to random initialization of centroids Always generates the same clusters 21
  • 22. DBSCAN clustering • When applied to tasks with arbitrary shaped clusters or clusters within clusters, traditional techniques might not be able to achieve good results • Partitioning based algorithms has no notion of outliers that is, all points are assigned to a cluster even if they do not belong in any • In contrast, density-based clustering locates regions of high density that are separated from one another by regions of low density. Density in this context is defined as the number of points within a specified radius. 22
  • 23. 23
  • 24. K-means vs density based clustering 24
  • 25. What is DBSCAN? • DBSCAN (Density-Based Spatial Clustering of Applications with Noise) • Is one of the most common clustering algorithms • Works based on density of objects • R (Radius of neighborhood) • Radius (R) that if includes enough number of points within, we call it a dense area • M (Min number of neighbors) • The minimum number of data points we want in a neighborhood to define a cluster 25
  • 27. DBSCAN algorithm- core point • R=2 units M=6 27
  • 28. DBSCAN algorithm- border point • R=2 unit M=6 28
  • 29. 29
  • 31. DBSCAN algorithm- identify all points 31
  • 33. Advantages of DBSCAN 1. Arbitrarily shaped clusters 2. Robust to outliers 3. Does not require specification of the number of clusters 33
  • 34. 34