
Applying Unsupervised Learning

When to Consider Unsupervised Learning

Unsupervised learning is useful when you want to explore your data but
don’t yet have a specific goal or are not sure what information the data
contains. It’s also a good way to reduce the dimensions of your data.
Unsupervised Learning Techniques

As we saw in section 1, most unsupervised learning techniques are a form of cluster analysis.

In cluster analysis, data is partitioned into groups based on some measure of similarity or shared characteristic. Clusters are formed so that objects in the same cluster are very similar and objects in different clusters are very distinct.

Clustering algorithms fall into two broad groups:

• Hard clustering, where each data point belongs to only one cluster

• Soft clustering, where each data point can belong to more than one cluster

[Figure: Gaussian mixture model used to separate data into two clusters.]

You can use hard or soft clustering techniques if you already know the possible data groupings.

If you don’t yet know how the data might be grouped:

• Use self-organizing feature maps or hierarchical clustering to look for possible structures in the data.

• Use cluster evaluation to look for the “best” number of groups for a given clustering algorithm, as in the sketch below.
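
As an illustration of cluster evaluation, here is a minimal MATLAB sketch (assuming the Statistics and Machine Learning Toolbox; the data X is synthetic stand-in data, not from the examples in this ebook):

    % Compare candidate numbers of clusters with the silhouette criterion.
    rng(1);                                  % reproducible k-means restarts
    X = [randn(100,2); randn(100,2) + 4];    % synthetic data with two groups
    eva = evalclusters(X, 'kmeans', 'silhouette', 'KList', 1:6);
    eva.OptimalK                             % suggested "best" number of groups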



Common Hard Clustering Algorithms

k-Means

How It Works: Partitions data into k mutually exclusive clusters. How well a point fits into a cluster is determined by the distance from that point to the cluster’s center.

Best Used...
• When the number of clusters is known
• For fast clustering of large data sets

Result: Cluster centers

k-Medoids

How It Works: Similar to k-means, but with the requirement that the cluster centers coincide with points in the data.

Best Used...
• When the number of clusters is known
• For fast clustering of categorical data
• To scale to large data sets

Result: Cluster centers that coincide with data points
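
To make the k-means/k-medoids contrast concrete, a minimal sketch follows (it assumes MATLAB’s Statistics and Machine Learning Toolbox and uses synthetic data):

    % k-means vs. k-medoids on the same two-group data.
    rng(2);
    X = [randn(100,2); randn(100,2) + 5];    % synthetic two-cluster data
    [idxKm,  Ckm ] = kmeans(X, 2);           % centers are means of each cluster
    [idxMed, Cmed] = kmedoids(X, 2);         % centers are actual observations
    ismember(Cmed, X, 'rows')                % confirms medoids coincide with data points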



Common Hard Clustering Algorithms continued

Hierarchical Clustering

How It Works: Produces nested sets of clusters by analyzing similarities between pairs of points and grouping objects into a binary, hierarchical tree.

Best Used...
• When you don’t know in advance how many clusters are in your data
• When you want visualization to guide your selection

Result: Dendrogram showing the hierarchical relationship between clusters

Self-Organizing Map

How It Works: Neural-network-based clustering that transforms a dataset into a topology-preserving 2D map.

Best Used...
• To visualize high-dimensional data in 2D or 3D
• To deduce the dimensionality of data by preserving its topology (shape)

Result: Lower-dimensional (typically 2D) representation
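
A minimal sketch of both techniques in MATLAB (linkage, cluster, and dendrogram assume the Statistics and Machine Learning Toolbox; selforgmap assumes the Deep Learning Toolbox; the data is synthetic):

    % Hierarchical clustering: build a binary tree, visualize it, then cut it.
    rng(3);
    X = [randn(30,2); randn(30,2) + 4];      % synthetic data
    Z = linkage(X, 'ward');                  % pairwise similarities -> hierarchical tree
    dendrogram(Z)                            % shows the nested cluster relationships
    idx = cluster(Z, 'MaxClust', 2);         % cut the tree into two groups

    % Self-organizing map on the same data:
    net = selforgmap([8 8]);                 % 8-by-8 topology-preserving 2D map
    net = train(net, X');                    % selforgmap expects features-by-samples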



Common Hard Clustering Algorithms continued

Example: Using k-Means Clustering to Site Cell Phone Towers

A cell phone company wants to know the number and placement of cell phone towers that will provide the most reliable service. For optimal signal reception, the towers must be located within clusters of people.

The workflow begins with an initial guess at the number of clusters that will be needed. To evaluate this guess, the engineers compare service with three towers and four towers to see how well the data clusters in each scenario (in other words, how well the towers provide service).

A phone can talk to only one tower at a time, so this is a hard clustering problem. The team uses k-means clustering because k-means treats each observation in the data as an object having a location in space. It finds a partition in which objects within each cluster are as close to each other as possible, and as far from objects in other clusters as possible.

After running the algorithm, the team can accurately determine the results of partitioning the data into three and four clusters.
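
The comparison the team runs might look like the following sketch (hedged: synthetic subscriber coordinates stand in for the real location data):

    % Compare three- and four-tower partitions of subscriber locations.
    rng(4);
    loc = [randn(200,2); randn(200,2) + 6; randn(200,2)*0.5 + 3];  % synthetic positions
    for k = [3 4]
        idx = kmeans(loc, k, 'Replicates', 5);   % restarts avoid poor local minima
        s   = silhouette(loc, idx);              % how well each point fits its cluster
        fprintf('k = %d: mean silhouette = %.3f\n', k, mean(s));
    end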

Common Soft Clustering Algorithms

Fuzzy c-Means

How It Works: Partition-based clustering when data points may belong to more than one cluster.

Best Used...
• When the number of clusters is known
• For pattern recognition
• When clusters overlap

Result: Cluster centers (similar to k-means) but with fuzziness so that points may belong to more than one cluster

Gaussian Mixture Model

How It Works: Partition-based clustering where data points come from different multivariate normal distributions with certain probabilities.

Best Used...
• When a data point might belong to more than one cluster
• When clusters have different sizes and correlation structures within them

Result: A model of Gaussian distributions that give probabilities of a point being in a cluster
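
A minimal Gaussian mixture sketch in MATLAB (assuming the Statistics and Machine Learning Toolbox; the two component distributions are synthetic):

    % Fit a two-component Gaussian mixture and recover soft assignments.
    rng(5);
    X = [mvnrnd([0 0], eye(2), 150); mvnrnd([3 3], [1 .5; .5 1], 150)];
    gm  = fitgmdist(X, 2);      % two multivariate normal components
    p   = posterior(gm, X);     % probability of each point belonging to each cluster
    idx = cluster(gm, X);       % hard assignment, if one is needed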



Common Soft Clustering Algorithms continued

Example: Using Fuzzy c-Means Clustering to Analyze Gene Expression Data

A team of biologists is analyzing gene expression data from microarrays to better understand the genes involved in normal and abnormal cell division. (A gene is said to be “expressed” if it is actively involved in a cellular function such as protein production.)

The microarray contains expression data from two tissue samples. The researchers want to compare the samples to determine whether certain patterns of gene expression are implicated in cancer proliferation.

After preprocessing the data to remove noise, they cluster the data. Because the same genes can be involved in several biological processes, no single gene is likely to belong to one cluster only. The researchers apply a fuzzy c-means algorithm to the data. They then visualize the clusters to identify groups of genes that behave in a similar way.
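
A hedged sketch of that step (fcm is in the Fuzzy Logic Toolbox; the expression matrix here is a random placeholder for the preprocessed microarray data):

    % Fuzzy c-means: every gene receives a membership grade in every cluster.
    rng(6);
    data = rand(300, 2);               % placeholder genes-by-measurements matrix
    [centers, U] = fcm(data, 4);       % 4 clusters; U holds memberships (4-by-300)
    multiRole = sum(U > 0.3) > 1;      % genes strongly belonging to several clusters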

Improving Models with Dimensionality Reduction

Machine learning is an effective method for finding patterns in big datasets. But bigger data brings added complexity. As datasets get bigger, you frequently need to reduce the number of features, or dimensionality.

Example: EEG Data Reduction

Suppose you have electroencephalogram (EEG) data that captures electrical activity of the brain, and you want to use this data to predict a future seizure. The data was captured using dozens of leads, each corresponding to a variable in your original dataset. Each of these variables contains noise. To make your prediction algorithm more robust, you use dimensionality reduction techniques to derive a smaller number of features. Because these features are calculated from multiple sensors, they will be less susceptible to noise in an individual sensor than would be the case if you used the raw data directly.



Common Dimensionality Reduction Techniques

The three most commonly used dimensionality reduction techniques are:

Principal component analysis (PCA)—performs a linear transformation on the data so that most of the variance or information in your high-dimensional dataset is captured by the first few principal components. The first principal component will capture the most variance, followed by the second principal component, and so on.

Factor analysis—identifies underlying correlations between variables in your dataset to provide a representation in terms of a smaller number of unobserved latent, or common, factors.

Nonnegative matrix factorization—used when model terms must represent nonnegative quantities, such as physical quantities.



Using Principal Component Analysis

In datasets with many variables, groups of variables often move together. PCA takes advantage of this redundancy of information by generating new variables via linear combinations of the original variables so that a small number of new variables captures most of the information. Each principal component is a linear combination of the original variables. Because all the principal components are orthogonal to each other, there is no redundant information.

Example: Engine Health Monitoring

You have a dataset that includes measurements for different sensors on an engine (temperatures, pressures, emissions, and so on). While much of the data comes from a healthy engine, the sensors have also captured data from the engine when it needs maintenance.

You cannot see any obvious abnormalities by looking at any individual sensor. However, by applying PCA, you can transform this data so that most variations in the sensor measurements are captured by a small number of principal components. It is easier to distinguish between a healthy and unhealthy engine by inspecting these principal components than by looking at the raw sensor data.
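
In MATLAB, this transformation is a short sketch (assuming the Statistics and Machine Learning Toolbox; the sensor matrix is a random placeholder):

    % PCA on multivariate sensor data.
    rng(7);
    sensors = randn(500, 12);          % placeholder: 500 samples, 12 sensor channels
    [coeff, score, ~, ~, explained] = pca(sensors);
    cumsum(explained(1:3))             % % of variance captured by first 3 components
    scatter(score(:,1), score(:,2))    % healthy vs. faulty data often separates here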



Using Factor Analysis

Your dataset might contain measured variables that overlap, meaning that they are dependent on one another. Factor analysis lets you fit a model to multivariate data to estimate this sort of interdependence. In a factor analysis model, the measured variables depend on a smaller number of unobserved (latent) factors. Because each factor might affect several variables, it is known as a common factor. Each variable is assumed to be dependent on a linear combination of the common factors.

Example: Tracking Stock Price Variation

Over the course of 100 weeks, the percent change in stock prices
has been recorded for ten companies. Of these ten, four are
technology companies, three are financial, and a further three
are retail. It seems reasonable to assume that the stock prices
for companies in the same sector will vary together as economic
conditions change. Factor analysis can provide quantitative
evidence to support this premise.
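
A minimal sketch of that analysis (hedged: factoran assumes the Statistics and Machine Learning Toolbox, and the returns matrix is a random placeholder for the recorded price changes):

    % Factor analysis: explain 10 stocks with 3 latent sector factors.
    rng(8);
    returns = randn(100, 10);            % placeholder: weekly % change, 10 stocks
    [loadings, psi] = factoran(returns, 3);
    disp(loadings)                       % stocks in one sector should load on one factor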



Using Nonnegative Matrix Factorization

This dimension reduction technique is based on a low-rank approximation of the feature space. In addition to reducing the number of features, it guarantees that the features are nonnegative, producing models that respect features such as the nonnegativity of physical quantities.

Example: Text Mining

Suppose you want to explore variations in vocabulary and style among several web pages. You create a matrix where each row corresponds to an individual web page and each column corresponds to a word (“the”, “a”, “we”, and so on). The data will be the number of times a particular word occurs on a particular page.

Since there are more than a million words in the English language, you apply nonnegative matrix factorization to create an arbitrary number of features that represent higher-level concepts rather than individual words. These concepts make it easier to distinguish between, say, news, educational content, and online retail content.
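
A minimal sketch of this factorization (assuming MATLAB’s nnmf from the Statistics and Machine Learning Toolbox; the count matrix is a random placeholder):

    % Nonnegative matrix factorization of a pages-by-words count matrix.
    rng(9);
    A = randi([0 20], 50, 1000);         % placeholder word counts: 50 pages, 1000 words
    [W, H] = nnmf(A, 5);                 % A ~ W*H with all entries nonnegative
    % W (50-by-5) scores each page on 5 "concepts";
    % H (5-by-1000) shows which words define each concept.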



Next Steps

In this section we took a closer look at hard and soft clustering algorithms for unsupervised learning, offered some tips on selecting the right algorithm for your data, and showed how reducing the number of features in your dataset improves model performance.

As for your next steps:

• Unsupervised learning might be your end goal. For example, if you are doing market research and want to segment consumer groups to target based on web site behavior, a clustering algorithm will almost certainly give you the results you’re looking for.

• On the other hand, you might want to use unsupervised learning as a preprocessing step for supervised learning. For example, apply clustering techniques to derive a smaller number of features, and then use those features as inputs for training a classifier.

In section 4 we’ll explore supervised learning algorithms and techniques, and see how to improve models with feature selection, feature reduction, and parameter tuning.

[Figure: workflow diagram: lots of data → unsupervised learning → clusters and lower-dimensional data → results, or feature selection → supervised learning → model]



Learn More

Ready for a deeper dive? Explore these unsupervised learning resources.

Clustering Algorithms and Techniques

k-Means
• Use K-Means and Hierarchical Clustering to Find Natural Patterns in Data
• Cluster Genes Using K-Means and Self-Organizing Maps
• Color-Based Segmentation Using K-Means Clustering

Hierarchical Clustering
• Connectivity-Based Clustering
• Iris Clustering

Self-Organizing Maps
• Cluster Data with a Self-Organizing Map

Fuzzy C-Means
• Cluster Quasi-Random Data Using Fuzzy C-Means Clustering
• Model Suburban Commuting Using Subtractive Clustering

Gaussian Mixture Models
• Gaussian Process Regression Models
• Cluster Data from Mixture of Gaussian Distributions
• Cluster Gaussian Mixture Data Using Soft Clustering
• Tune Gaussian Mixture Models
• Image Processing Example: Detecting Cars with Gaussian Mixture Models

Dimensionality Reduction
• Analyze Quality of Life in U.S. Cities Using PCA
• Analyze Stock Prices Using Factor Analysis

Nonnegative Matrix Factorization
• Perform Nonnegative Matrix Factorization

© 2016 The MathWorks, Inc. MATLAB and Simulink are registered trademarks of The MathWorks, Inc. See mathworks.com/trademarks for a list of additional trademarks.

Other product or brand names may be trademarks or registered trademarks of their respective holders.
80823v00
