Machine Learning (CSO851) - Lecture 05
https://monkeylearn.com/blog/introduction-to-support-vector-machines-svm/
https://math.mit.edu/~edelman/publications/support_vector.pdf
Introduction to SVM
Support vector machines (SVMs) are supervised learning methods used for classification, regression, and outlier detection.
How Does SVM Work?
• Let’s imagine we have two tags, red and blue, and our data has two features, x and y.
How Does SVM Work?
• A support vector machine takes these data points and generates a hyperplane that best separates the tags.
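As a minimal sketch of this idea (assuming scikit-learn is available; the data and variable names below are illustrative, not from the slides), we can fit a linear SVM to two-dimensional, two-class data and read off the separating hyperplane:

import numpy as np
from sklearn.svm import SVC

# Illustrative 2-D data: class "red" (label 0) and class "blue" (label 1)
rng = np.random.default_rng(0)
X_red = rng.normal([-2, -2], 0.8, (50, 2))
X_blue = rng.normal([2, 2], 0.8, (50, 2))
X = np.vstack([X_red, X_blue])
y = np.array([0] * 50 + [1] * 50)

# A linear SVM learns the hyperplane w . x + b = 0 that best separates the tags
clf = SVC(kernel="linear", C=1.0)
clf.fit(X, y)

w, b = clf.coef_[0], clf.intercept_[0]
print("hyperplane parameters w, b:", w, b)        # w . x + b = 0
print("prediction for a new point:", clf.predict([[1.5, 2.0]]))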
Linear SVM
If we have a dataset comprising observations, these span a feature space V. Here, |x| is the dimensionality of the vector of features for a given observation x.
In this feature space V, an SVM identifies the hyperplane that maximizes the distance between itself and the closest points (or sets of points) belonging to the distinct categories. If one such hyperplane exists, we say that the observations are linearly separable in the feature space V.
Separating Hyperplane & Support Vectors
If any pair of observations belongs to two different classes, then the hyperplane lies somewhere between them.
These informative observations support the identification of the decision boundary by the SVM. For this reason, we call the feature vectors located in proximity to observations of the other class “support vectors”.
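To make this concrete, here is a sketch of the standard notation (assumed here, and consistent with the distance formula d = 2/||w|| used in the next section): the decision boundary is the hyperplane w·x + b = 0, and the support vectors lie on the two parallel margin hyperplanes.

\[
w^\top x + b = 0 \quad \text{(separating hyperplane)}, \qquad
w^\top x + b = \pm 1 \quad \text{(margin hyperplanes through the support vectors)},
\]
\[
d = \frac{2}{\lVert w \rVert} \quad \text{(distance between the margin hyperplanes).}
\]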
Decision Boundary
How can we find the optimal separating hyperplane?
We know that the distance between the two parallel hyperplanes in (2.1) is d = 2/||w||. Therefore, given any N points belonging to two classes, we can formulate finding the optimal separating hyperplane as the following linearly constrained QP problem:
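What follows is a sketch of the standard hard-margin formulation, assumed here to correspond to the problem referenced below as (2.9): maximizing d = 2/||w|| is equivalent to minimizing ||w||²/2 under the separation constraints.

\[
\min_{w,\,b}\ \frac{1}{2}\lVert w \rVert^{2}
\quad \text{subject to} \quad
y_i\,(w^\top x_i + b) \ge 1, \qquad i = 1, \dots, N,
\]
where y_i ∈ {−1, +1} is the class label of observation x_i.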
Decision Boundary
We derive the dual problem of the linearly constrained QP problem (2.9), which is the one actually solved in practice. We also give conditions for judging whether a solution is optimal.
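A sketch of that dual (the standard Wolfe dual of the hard-margin problem, assumed here), obtained by introducing a Lagrange multiplier α_i ≥ 0 for each constraint:

\[
\max_{\alpha}\ \sum_{i=1}^{N} \alpha_i
- \frac{1}{2} \sum_{i=1}^{N}\sum_{j=1}^{N} \alpha_i \alpha_j\, y_i y_j\, x_i^\top x_j
\quad \text{subject to} \quad
\alpha_i \ge 0, \qquad \sum_{i=1}^{N} \alpha_i y_i = 0.
\]
At the optimum, α_i > 0 only for the support vectors, and the Karush–Kuhn–Tucker (KKT) conditions provide the optimality check mentioned above.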
Training: Parameter Estimation
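As a sketch of how the parameters are typically estimated from a dual solution α (assumed here to match the dual given above): once the optimal α is found, the primal parameters follow directly.

\[
w = \sum_{i=1}^{N} \alpha_i\, y_i\, x_i, \qquad
b = y_k - w^\top x_k \quad \text{for any support vector } x_k \ (\alpha_k > 0),
\]
and a new point is classified by f(x) = \operatorname{sign}(w^\top x + b).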
SVM With Hard Margin
When the data is linearly separable and we do not want any misclassifications, we use an SVM with a hard margin. However, when a linear boundary is not feasible, or we want to allow some misclassifications in the hope of achieving better generalization, we can opt for a soft margin for our classifier.
SVM With Soft Margin
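A sketch of the standard soft-margin formulation (assumed here): slack variables ξ_i ≥ 0 allow points to violate the margin, and a penalty parameter C controls how much such violations cost.

\[
\min_{w,\,b,\,\xi}\ \frac{1}{2}\lVert w \rVert^{2} + C \sum_{i=1}^{N} \xi_i
\quad \text{subject to} \quad
y_i\,(w^\top x_i + b) \ge 1 - \xi_i, \qquad \xi_i \ge 0, \qquad i = 1, \dots, N.
\]
A large C pushes the solution toward hard-margin behaviour, while a small C tolerates more margin violations.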
Hard Margin vs Soft Margin
• The difference between a hard margin and a soft margin in SVMs lies in the separability of the data.
• If our data is linearly separable, we go for a hard margin. However, if this is not the case, it won’t be feasible to do that. In the presence of data points that make it impossible to find a linear classifier, we would have to be more lenient and let some of the data points be misclassified. In this case, a soft-margin SVM is appropriate.
• Sometimes, the data is linearly separable, but the margin is so small that the model becomes prone to overfitting or to being too sensitive to outliers.
• Also, in this case, we can opt for a larger margin by using a soft-margin SVM in order to help the model generalize better, as sketched in the example below.
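A minimal sketch of this trade-off (assuming scikit-learn; the data and parameter values below are illustrative, not from the slides): the regularization parameter C of SVC controls the softness of the margin, and a very large C approximates a hard margin.

import numpy as np
from sklearn.svm import SVC

# Illustrative 2-D data with a small amount of class overlap
rng = np.random.default_rng(1)
X = np.vstack([rng.normal([-1, -1], 1.0, (100, 2)),
               rng.normal([1, 1], 1.0, (100, 2))])
y = np.array([0] * 100 + [1] * 100)

# Very large C ~ hard margin: tries hard to classify every training point correctly
hard = SVC(kernel="linear", C=1e6).fit(X, y)

# Small C = soft margin: tolerates some misclassifications for a wider margin
soft = SVC(kernel="linear", C=0.01).fit(X, y)

for name, model in [("hard-ish (C=1e6)", hard), ("soft (C=0.01)", soft)]:
    margin = 2.0 / np.linalg.norm(model.coef_)      # d = 2 / ||w||
    print(name, "margin width:", margin,
          "number of support vectors:", model.n_support_.sum())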
Lagrange Multiplier
• Suppose we are given a function f(x, y, z, …) for which we want to find extrema, subject to the condition g(x, y, z, …) = k.
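A sketch of the standard method (assumed here): introduce a multiplier λ and look for stationary points of the Lagrangian.

\[
\mathcal{L}(x, y, z, \dots, \lambda) = f(x, y, z, \dots) - \lambda \bigl( g(x, y, z, \dots) - k \bigr),
\]
so the candidate extrema satisfy
\[
\nabla f = \lambda \nabla g, \qquad g(x, y, z, \dots) = k.
\]
In the SVM derivation, one multiplier α_i plays this role for each constraint y_i(w^\top x_i + b) ≥ 1, which is how the dual problem above is obtained.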