Unit IV covers ensemble techniques and unsupervised learning, focusing on model combination schemes like bagging, boosting, and stacking, as well as K-means clustering. The K-means algorithm is an iterative method that groups unlabeled data into predefined clusters based on similarity, with the goal of minimizing distances between data points and their centroids. The Elbow method is introduced as a technique to determine the optimal number of clusters by analyzing the Within Cluster Sum of Squares (WCSS).


UNIT IV: ENSEMBLE TECHNIQUES AND UNSUPERVISED LEARNING
Combining multiple learners: Model combination schemes, Voting; Ensemble Learning: bagging, boosting, stacking; Unsupervised learning: K-means; Instance-Based Learning: KNN; Gaussian mixture models and Expectation Maximization.
Course Objective: Study ensemble and unsupervised learning algorithms.
Course Outcome CO4: Build ensemble and unsupervised models.
K-Means clustering algorithm
• K-Means Clustering is an Unsupervised Learning algorithm that groups an unlabeled dataset into different clusters. Here K defines the number of pre-defined clusters to be created in the process: if K=2 there will be two clusters, for K=3 there will be three clusters, and so on.
• It is an iterative, centroid-based algorithm that divides the unlabeled dataset into K clusters in such a way that each data point belongs to only one group of points with similar properties.
• Each cluster is associated with a centroid. The main aim of the algorithm is to minimize the sum of distances between the data points and their corresponding centroids.
• Each cluster contains data points with some commonalities and is well separated from the other clusters.

Source: https://www.javatpoint.com/k-means-clustering-algorithm-in-machine-learning
• The algorithm determines the best positions for the K center points (centroids) through an iterative process.
• It assigns each data point to its closest centroid; the data points nearest a particular centroid form a cluster.
How does the K-Means Algorithm Work?

• Step-1: Select the number K to decide the number of clusters.
• Step-2: Select K random points as the initial centroids (they need not be points from the input dataset).
• Step-3: Assign each data point to its closest centroid, which forms the predefined K clusters.
• Step-4: Recompute the centroid of each cluster as the mean of the points assigned to it.
• Step-5: Repeat Step-3, i.e. reassign each data point to the new closest centroid.
• Step-6: If any reassignment occurred, go to Step-4; otherwise go to FINISH.
• Step-7: The model is ready. (A code sketch of these steps follows below.)
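
To make the steps concrete, here is a minimal from-scratch sketch in NumPy; the dataset X, K=3, and the random initialization below are illustrative assumptions, not part of the original notes.

import numpy as np

def k_means(X, k, max_iters=100, seed=0):
    """Group the rows of X into k clusters, following Steps 1-7 above."""
    rng = np.random.default_rng(seed)
    # Step-2: pick k random points from the dataset as the initial centroids
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(max_iters):
        # Step-3/5: assign every point to its closest centroid
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step-4: recompute each centroid as the mean of its assigned points
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        # Step-6: stop when the centroids no longer move (no reassignments change)
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids

# Example usage on synthetic 2-D data (assumed for illustration)
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(loc=c, size=(50, 2)) for c in (0, 5, 10)])
labels, centroids = k_means(X, k=3)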
Elbow method
• The Elbow method is one of the most popular ways to find the optimal number of clusters. This method uses the concept of the WCSS value.
• WCSS (Within Cluster Sum of Squares) defines the total variation within the clusters. For K = 3 clusters it can be written as

  WCSS = ∑_{Pi in Cluster1} distance(Pi, C1)² + ∑_{Pi in Cluster2} distance(Pi, C2)² + ∑_{Pi in Cluster3} distance(Pi, C3)²

  where ∑_{Pi in Cluster1} distance(Pi, C1)² is the sum of the squared distances between each data point in Cluster 1 and its centroid C1, and likewise for the other terms.
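
As a quick check of the formula, here is a small helper (the names wcss, X, labels, and centroids are hypothetical) that computes the WCSS for a given clustering:

import numpy as np

def wcss(X, labels, centroids):
    """Sum of squared distances between each data point and its cluster centroid."""
    total = 0.0
    for j, c in enumerate(centroids):
        diffs = X[labels == j] - c      # points assigned to cluster j, minus centroid Cj
        total += np.sum(diffs ** 2)     # squared distances summed over the cluster
    return total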
Elbow method - Steps
• Execute K-means clustering on the given dataset for different K values (ranging from 1 to 10).
• For each value of K, calculate the WCSS value.
• Plot a curve of the calculated WCSS values against the number of clusters K.
• The sharp point of bend, where the plot looks like an arm (the elbow), is taken as the best value of K.
• WCSS becomes zero at the endpoint of the plot, where every data point forms its own cluster. (A code sketch of these steps follows below.)
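
Below is a sketch of these steps using scikit-learn's KMeans, whose inertia_ attribute is exactly the WCSS; the dataset X is a placeholder assumption.

import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=c, size=(100, 2)) for c in (0, 5, 10)])  # placeholder data

wcss_values = []
for k in range(1, 11):                          # K values from 1 to 10, as in the steps above
    km = KMeans(n_clusters=k, init='k-means++', n_init=10, random_state=42)
    km.fit(X)
    wcss_values.append(km.inertia_)             # inertia_ = WCSS for this K

plt.plot(range(1, 11), wcss_values, marker='o')
plt.xlabel('Number of clusters K')
plt.ylabel('WCSS')
plt.title('Elbow method')
plt.show()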
Python Implementation
• Data Pre-processing
• Finding the optimal number of clusters using the elbow method
• Training the K-means algorithm on the training dataset
• Visualizing the clusters (a sketch covering these steps follows the imports below)

# importing libraries
import numpy as nm
import matplotlib.pyplot as mtp
import pandas as pd
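
Continuing with the aliases above, here is a hedged sketch of the remaining steps; the synthetic DataFrame and its column names are assumptions standing in for a real dataset.

import numpy as nm                      # repeated from above for completeness
import matplotlib.pyplot as mtp
import pandas as pd
from sklearn.cluster import KMeans

# Data pre-processing: a synthetic stand-in for a real dataset loaded with pandas
rng = nm.random.default_rng(0)
df = pd.DataFrame(nm.vstack([rng.normal(loc=c, size=(50, 2)) for c in (0, 5, 10)]),
                  columns=['feature_1', 'feature_2'])
x = df.values

# Training the K-means algorithm on the dataset (K = 3 assumed chosen via the elbow method)
kmeans = KMeans(n_clusters=3, init='k-means++', n_init=10, random_state=42)
y_pred = kmeans.fit_predict(x)

# Visualizing the clusters and their centroids
for j in range(3):
    mtp.scatter(x[y_pred == j, 0], x[y_pred == j, 1], label=f'Cluster {j + 1}')
mtp.scatter(kmeans.cluster_centers_[:, 0], kmeans.cluster_centers_[:, 1],
            s=200, c='black', marker='x', label='Centroids')
mtp.xlabel('feature_1')
mtp.ylabel('feature_2')
mtp.legend()
mtp.show()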
[Figure: example output of the K-means algorithm, showing the data points, cluster centroids, and cluster assignments at the 3rd iteration.]
Assignment Question
