
Support Vector Machines

Dr.G.JOHN BABU

November 3, 2024



Classification Problem

(Figure slides illustrating the classification problem.)


Support Vector Machines

Support Vector Machines (SVM) are powerful supervised learning algorithms used for classification and regression.
Effective for complex datasets where traditional linear classifiers may fail.
Gained popularity due to robustness and versatility across various applications.



History and Context

Developed by Vladimir Vapnik in the early 1990s.
Originated from statistical learning theory.
Initially designed for binary classification by finding an optimal hyperplane.
The kernel trick, introduced in the late 1990s, expanded SVM’s capabilities to non-linear classification.



Principles of Support Vector Machines

Hyperplane: A subspace that divides the space into two half-spaces.
Margin: Distance between the hyperplane and the closest data points of each class.
Support Vectors: Critical data points that influence the position of the hyperplane.



Optimization Problem Formulation



Applications of Support Vector Machines

SVMs have numerous applications across fields:


Image Classification: For tasks such as face detection and object recognition.
Text Classification: Useful for spam detection, sentiment analysis.
Bioinformatics: Applied in gene expression and protein structure prediction.
Finance: Credit scoring, fraud detection.
Handwriting Recognition: Enhancing OCR accuracy.
Speech Recognition: Classifying spoken words or phrases.



Optimal Separation

Optimal separation aims to find the best hyperplane that maximizes the
margin between two classes, improving the generalization of the classifier.
A larger margin reduces the likelihood of misclassification for new
data points.
Support vectors are crucial in defining the optimal hyperplane.





Mathematical Formulation for Optimal Separation

\[
\min_{w,\,b}\ \frac{1}{2}\,\|w\|^2 \quad \text{subject to} \quad y_i\,(w \cdot x_i + b) \ge 1, \quad \forall i
\]
where:
w and b define the hyperplane,
y_i denotes the class labels, ensuring that points of one class yield positive results and points of the other yield negative results.
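
To make this formulation concrete, here is a minimal sketch of the hard-margin primal handed to a generic convex solver. The tiny dataset, the variable names, and the use of the cvxpy library are illustrative assumptions, not part of the slides.

# Minimal sketch: hard-margin SVM primal with cvxpy (illustrative toy data).
import numpy as np
import cvxpy as cp

# Two linearly separable clusters; labels y_i in {-1, +1}.
X = np.array([[2.0, 2.0], [3.0, 3.0], [2.5, 3.5],
              [-1.0, -1.0], [-2.0, -1.5], [-1.5, -2.5]])
y = np.array([1, 1, 1, -1, -1, -1])

w = cp.Variable(2)
b = cp.Variable()

# minimize (1/2)||w||^2  subject to  y_i (w . x_i + b) >= 1 for all i
constraints = [cp.multiply(y, X @ w + b) >= 1]
problem = cp.Problem(cp.Minimize(0.5 * cp.sum_squares(w)), constraints)
problem.solve()

print("w =", w.value, "b =", b.value)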



Lagrange Multipliers

Lagrange multipliers are a method in optimization to maximize or minimize a function under certain constraints.
Objective Function: The function to be optimized.
Constraints: The conditions that must be met.
Lagrangian Function:
\[
L(x, y, \lambda) = f(x, y) + \lambda\, g(x, y)
\]



Example with Lagrange Multipliers

Maximize f(x, y) = xy subject to g(x, y) = x + y - 10 = 0.
Lagrangian Function:
\[
L(x, y, \lambda) = xy + \lambda\,(10 - x - y)
\]



Partial Derivatives of the Lagrangian

Taking partial derivatives:
\[
\frac{\partial L}{\partial x} = y - \lambda = 0, \qquad
\frac{\partial L}{\partial y} = x - \lambda = 0, \qquad
\frac{\partial L}{\partial \lambda} = 10 - x - y = 0
\]
From these, we find x = 5, y = 5.
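
As a quick check of the arithmetic above, the same stationary-point equations can be solved symbolically. This is only a verification sketch; the use of sympy is an assumption for illustration.

# Verify the Lagrange-multiplier example: maximize xy subject to x + y = 10.
from sympy import symbols, diff, solve

x, y, lam = symbols('x y lam')
L = x * y + lam * (10 - x - y)      # Lagrangian L(x, y, lambda)

# Set all partial derivatives to zero and solve the resulting system.
stationary = solve([diff(L, v) for v in (x, y, lam)], [x, y, lam], dict=True)
print(stationary)                   # [{x: 5, y: 5, lam: 5}]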





Illustrative Example

To visualize the concept of optimal separation, consider a simple example with two classes represented by blue and red points in a two-dimensional space.
Suppose the data points can be separated by multiple lines. Some
lines may run closely against the data points, while others may run
further away. The line that runs equidistant from the nearest points
of both classes is the optimal hyperplane.
If we visualize the distance from the hyperplane to the nearest points
of each class, the goal is to maximize this distance. This ensures that
there is greater separation between classes, reducing the likelihood of
misclassification.
By selecting the hyperplane that maximizes the margin, the SVM is
more robust against noise and can handle small variations in new data
points effectively.



A Constrained Optimization Problem

We define a set of constraints under which the classifier makes correct predictions. By assigning the target values for the two classes as ±1 instead of 0 and 1, we can write down the product of the target t_i and the predicted output y_i. This product is positive if the predicted class matches the target, and negative otherwise.
Thus, we can formulate the classifier’s condition as:
\[
t_i\,(w^T x_i + b) \ge 1,
\]
ensuring correct classification.

The full optimization problem is then:
\[
\min_{w,\,b}\ \frac{1}{2}\, w^T w \quad \text{subject to} \quad t_i\,(w^T x_i + b) \ge 1, \quad \forall i = 1, \dots, n.
\]
This optimization problem involves minimizing the norm of the weight vector w, while ensuring that each datapoint satisfies the given constraint.

Quadratic Programming Solution

The problem is solved by quadratic programming: it is both quadratic (involving the square of the weight vector) and convex (the minimization problem has a unique solution).
The Karush–Kuhn–Tucker (KKT) conditions define the optimal solution as follows, for all values of i:
\[
\lambda_i^*\,\bigl(1 - t_i\,(w^{*T} x_i + b^*)\bigr) = 0,
\]
\[
1 - t_i\,(w^{*T} x_i + b^*) \le 0,
\]
\[
\lambda_i^* \ge 0.
\]
Here, the λ_i are Lagrange multipliers, which allow us to solve constrained optimization problems. The first condition implies that if λ_i ≠ 0, then t_i (w^{*T} x_i + b^*) = 1, meaning that the constraint holds as an equality for the support vectors. These support vectors lie on the boundary of the margin, and their constraints hold as equalities, reducing the number of datapoints that need to be considered.



Lagrangian Function

We define the Lagrangian for the problem as:
\[
L(w, b, \lambda) = \frac{1}{2}\, w^T w + \sum_{i=1}^{n} \lambda_i\,\bigl(1 - t_i\,(w^T x_i + b)\bigr).
\]
Differentiating this with respect to w and b, we obtain:
\[
\frac{\partial L}{\partial w} = w - \sum_{i=1}^{n} \lambda_i t_i x_i,
\qquad
\frac{\partial L}{\partial b} = -\sum_{i=1}^{n} \lambda_i t_i.
\]



Setting these derivatives equal to zero gives us the optimal values for w and b:
\[
w = \sum_{i=1}^{n} \lambda_i t_i x_i, \qquad \sum_{i=1}^{n} \lambda_i t_i = 0.
\]
Substituting these values into the Lagrangian function yields the dual problem, where we aim to maximize the following with respect to the λ_i:
\[
L(w^*, b^*, \lambda) = \sum_{i=1}^{n} \lambda_i - \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} \lambda_i \lambda_j t_i t_j\, x_i^T x_j,
\]
subject to λ_i ≥ 0 and \(\sum_{i=1}^{n} \lambda_i t_i = 0\).
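
For illustration, the dual can also be handed directly to a convex solver. The sketch below assumes a tiny toy dataset and the cvxpy library (neither is from the slides), and rewrites the quadratic term as one half the squared norm of the vector Σ_i λ_i t_i x_i so the problem stays in standard convex form.

# Minimal sketch: solve the SVM dual for a tiny linearly separable dataset.
import numpy as np
import cvxpy as cp

X = np.array([[2.0, 2.0], [3.0, 3.0], [-1.0, -1.0], [-2.0, -1.5]])
t = np.array([1.0, 1.0, -1.0, -1.0])
n = len(t)

lam = cp.Variable(n, nonneg=True)                       # lambda_i >= 0

# sum_i lambda_i  -  (1/2) || sum_i lambda_i t_i x_i ||^2
objective = cp.Maximize(cp.sum(lam)
                        - 0.5 * cp.sum_squares(X.T @ cp.multiply(lam, t)))
constraints = [cp.sum(cp.multiply(lam, t)) == 0]        # sum_i lambda_i t_i = 0
cp.Problem(objective, constraints).solve()

w = X.T @ (lam.value * t)                               # w = sum_i lambda_i t_i x_i
print("lambda =", lam.value, "w =", w)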



Slack Variables for Non-Linearly Separable Problems

In the case where the dataset is not linearly separable, we introduce slack variables η_i ≥ 0 to relax the constraints:
\[
t_i\,(w^T x_i + b) \ge 1 - \eta_i.
\]
Here, η_i = 0 for correctly classified points, and η_i > 0 for misclassified points.
The objective function now becomes:
\[
L(w, \eta) = w^T w + C \sum_{i=1}^{n} \eta_i,
\]
where C is a parameter that balances the trade-off between minimizing the classification error and maximizing the margin. This transforms the classifier into a soft-margin classifier.
The KKT conditions for this problem are:
\[
\lambda_i^*\,\bigl(1 - t_i\,(w^{*T} x_i + b^*) - \eta_i\bigr) = 0,
\qquad
(C - \lambda_i^*)\,\eta_i = 0,
\qquad
\sum_{i=1}^{n} \lambda_i t_i = 0.
\]
Finally, we compute the optimal bias b* by averaging over the support vectors.
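
In practice the soft-margin problem is usually solved with a library rather than by hand. A minimal sketch with scikit-learn's SVC follows, where the C argument plays exactly the trade-off role described above; the toy data are an assumption for illustration.

# Minimal sketch: soft-margin (C-controlled) linear SVM with scikit-learn.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=2.0, size=(20, 2)),
               rng.normal(loc=-2.0, size=(20, 2))])
t = np.array([1] * 20 + [-1] * 20)

# Small C tolerates more margin violations (larger slack); large C penalizes them.
clf = SVC(kernel='linear', C=1.0).fit(X, t)
print("number of support vectors:", len(clf.support_vectors_))
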
Prediction for a New Data Point

For a new point z, the prediction can be made using:
\[
w^{*T} z + b^* = \sum_{i=1}^{n} \lambda_i t_i\, x_i^T z + b^*.
\]
Thus, classification of a new point involves computing the inner product between the point and the support vectors.
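
This is also how a fitted model predicts in practice: only the support vectors and their λ_i t_i weights are needed. The sketch below assumes the scikit-learn model clf fitted in the previous sketch (an assumption, since the slides name no library) and reproduces the decision value for a new point z from those stored pieces.

# Sketch: reproduce the SVM decision value from the support vectors only.
import numpy as np

z = np.array([0.5, 1.5])

# scikit-learn stores lambda_i * t_i for each support vector in dual_coef_.
manual = clf.dual_coef_ @ (clf.support_vectors_ @ z) + clf.intercept_
print(manual, clf.decision_function(z.reshape(1, -1)))   # the two values agree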



Kernel Trick in SVM

The kernel trick in SVM is used to handle data that is not linearly
separable by transforming it into a higher-dimensional space.
Instead of computing this transformation directly, the kernel trick
allows us to compute the inner product of transformed vectors in the
original space.
This reduces computational complexity, making algorithms efficient
even with complex mappings.



Need for a Kernel in SVM

When we cannot linearly separate data in the original feature space, modifying the features can help. This idea is similar to the XOR problem we encountered earlier. By transforming the data into a higher-dimensional space, we might find a linear decision boundary that separates the classes. To achieve this, we introduce new functions ϕ(x) based on the input features.
The key idea is to transform the input x_i into a new form ϕ(x_i), while still being able to use the SVM algorithm. Specifically, the prediction equation remains valid, but with x_i replaced by ϕ(x_i). The resulting prediction equation becomes:
\[
w^T \phi(z) + b = \sum_{i=1}^{n} \lambda_i t_i\, \phi(x_i)^T \phi(z) + b.
\]
The choice of functions ϕ(x) is critical.



How the Kernel Trick Works

1. Problem with Non-Linear Data: In SVM, we want to classify data with a separating hyperplane, but non-linear data cannot be separated in the original space.
2. Mapping to a Higher-Dimensional Space: A transformation function ϕ(x) maps each data point to a higher-dimensional space, making separation possible. However, directly computing ϕ(x) is computationally expensive.
3. Role of the Kernel Trick: The kernel function K(x, y) = ϕ(x) · ϕ(y) allows us to compute similarity between data points in the original space, avoiding direct computation in higher dimensions (see the sketch after this list).
4. Computational Advantage: This avoids high-dimensional transformation calculations, reducing computational load.
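
As referenced in point 3, here is a small numerical check, included only as an illustration, that the kernel value computed in the original space equals the explicit inner product after mapping, using K(x, y) = (x · y)^2 with the feature map ϕ(x) = (x_1^2, √2 x_1 x_2, x_2^2).

# Check that K(x, y) = (x . y)^2 equals phi(x) . phi(y) for an explicit phi.
import numpy as np

def phi(v):
    # Explicit degree-2 feature map for 2-D input.
    return np.array([v[0]**2, np.sqrt(2) * v[0] * v[1], v[1]**2])

x = np.array([1.0, 2.0])
y = np.array([3.0, -1.0])

print((x @ y) ** 2)          # kernel computed in the original 2-D space
print(phi(x) @ phi(y))       # same value via the 3-D feature map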



Polynomial Kernel

The polynomial kernel function maps input data into a polynomial feature space:
\[
K(x, y) = (x \cdot y + c)^d
\]
where:
d is the degree of the polynomial,
c is a constant controlling the influence of higher-dimensional features.
By applying a polynomial kernel, we can capture interactions of features up to the d-th degree, helping classify data that has non-linear relationships.
Example: A polynomial kernel of degree 2 can separate data that requires a quadratic boundary.
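
A minimal usage sketch with scikit-learn's polynomial kernel follows; in that library the degree d and the constant c correspond to the degree and coef0 parameters, and the circle-shaped toy data are an illustrative assumption.

# Sketch: degree-2 polynomial kernel for data needing a quadratic boundary.
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, t = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

# scikit-learn's polynomial kernel is (gamma * x.y + coef0)^degree.
clf = SVC(kernel='poly', degree=2, coef0=1.0, gamma='scale').fit(X, t)
print("training accuracy:", clf.score(X, t))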



Sigmoid Kernel in SVM

The sigmoid kernel function is defined as:
\[
K(x, y) = \tanh\bigl(\alpha\,(x \cdot y) + c\bigr)
\]
where α and c control the shape of the decision boundary.
Inspired by neural networks, the sigmoid kernel behaves similarly to neuron activation functions.
Example: With appropriate parameters, the sigmoid kernel maps data to a curved decision boundary, capturing non-linear relationships.
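
A short sketch of the same kernel, once by hand and once through scikit-learn, whose gamma and coef0 parameters play the roles of α and c here; the specific values are assumptions for illustration.

# Sketch: sigmoid kernel value by hand and the corresponding SVC setup.
import numpy as np
from sklearn.svm import SVC

alpha, c = 0.5, 1.0
x, y = np.array([1.0, 2.0]), np.array([0.5, -1.0])
print(np.tanh(alpha * (x @ y) + c))                 # K(x, y) = tanh(alpha (x.y) + c)

clf = SVC(kernel='sigmoid', gamma=alpha, coef0=c)   # same kernel used inside SVC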



Radial Basis Function (RBF) Kernel in SVM

The RBF kernel (or Gaussian kernel) is widely used for non-linear classification:
\[
K(x, y) = \exp\!\left(-\frac{\|x - y\|^2}{2\sigma^2}\right)
\]
where σ determines the spread of the kernel.
It measures the "distance" between points, with closer points having higher similarity.
Example: For data forming concentric circles, the RBF kernel allows SVM to classify these clusters by mapping them into separable regions in the transformed space.
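
A brief sketch of the concentric-circles example follows. Note that scikit-learn parametrizes the RBF kernel with gamma = 1/(2σ^2), so the assumed σ is converted accordingly.

# Sketch: RBF-kernel SVM on concentric circles (non-linear clusters).
from sklearn.datasets import make_circles
from sklearn.svm import SVC

X, t = make_circles(n_samples=200, factor=0.4, noise=0.05, random_state=0)

sigma = 1.0
clf = SVC(kernel='rbf', gamma=1.0 / (2 * sigma**2)).fit(X, t)   # gamma = 1/(2 sigma^2)
print("training accuracy:", clf.score(X, t))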



Mercer’s Theorem

Mercer’s theorem is fundamental in validating the use of kernels in SVM.
It states that a function K(x, y) is a valid kernel if it corresponds to an inner product in some higher-dimensional space.
This means that if K(x, y) is positive semi-definite, it can be used as a kernel in SVM.
Implication: Mercer’s theorem ensures that we can apply the kernel trick with confidence, knowing that K(x, y) represents a genuine inner product.
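
One practical way to spot-check Mercer's condition on a finite sample is to confirm that the Gram matrix has no significantly negative eigenvalues. The sketch below does this for the RBF kernel on random points; the data, gamma value, and round-off tolerance are assumptions.

# Sketch: check that an RBF Gram matrix on sample points is positive semi-definite.
import numpy as np
from sklearn.metrics.pairwise import rbf_kernel

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))

K = rbf_kernel(X, X, gamma=0.5)               # Gram matrix K_ij = K(x_i, x_j)
eigvals = np.linalg.eigvalsh(K)               # symmetric matrix, so use eigvalsh
print("smallest eigenvalue:", eigvals.min())  # non-negative up to round-off error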



Summary of Kernel Trick

The kernel trick allows SVM to compute similarity directly in the input space, reducing computational complexity.
Key Kernels:
Polynomial Kernel: Maps data into polynomial feature space.
Sigmoid Kernel: Inspired by neural networks, creates curved boundaries.
RBF Kernel: Suitable for data with clusters or curved boundaries.
The choice of kernel function depends on the data structure and the separation required for effective classification.

