ML-II UNIT-1
Support Vector Machine (SVM)
Support Vector Machine (SVM) is a supervised machine learning algorithm used for both
classification and regression tasks. While it can handle regression problems, SVM is particularly
well-suited for classification tasks.
SVM aims to find the optimal hyperplane in an N-dimensional space to separate data points into
different classes. The algorithm maximizes the margin between the closest points of different
classes.
• Support Vectors: The closest data points to the hyperplane, crucial for determining the
hyperplane and margin in SVM.
• Margin: The distance between the hyperplane and the support vectors. SVM aims to
maximize this margin for better classification performance.
• Hard Margin: A maximum-margin hyperplane that perfectly separates the data without
misclassifications.
• Dual Problem: Reformulates the optimization in terms of Lagrange multipliers attached to the
training points; only the support vectors receive non-zero multipliers, which enables the kernel
trick and efficient computation (a standard form of the dual is sketched after this list).
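For reference, one standard textbook formulation of the hard-margin dual problem (not taken
verbatim from these notes) is:

\[
\max_{\alpha}\; \sum_{i=1}^{n} \alpha_i \;-\; \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} \alpha_i \alpha_j \, y_i y_j \, K(x_i, x_j)
\quad \text{subject to} \quad \alpha_i \ge 0, \;\; \sum_{i=1}^{n} \alpha_i y_i = 0,
\]

where the \(\alpha_i\) are the Lagrange multipliers, \(y_i \in \{-1, +1\}\) are the class labels, and
\(K(x_i, x_j)\) is the kernel function (for a linear SVM, \(K(x_i, x_j) = x_i \cdot x_j\)). Only the
support vectors end up with \(\alpha_i > 0\); every other multiplier is zero.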
The best hyperplane, also known as the maximum-margin or "hard margin" hyperplane, is the one
that maximizes the distance between the hyperplane and the nearest data points from both classes.
This ensures a clear separation between the classes.
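A minimal sketch in scikit-learn (the toy data and the value of C are assumptions for illustration,
not from the notes): fit a linear SVM on separable data, read off the support vectors, and compute
the margin width 2 / ||w||.

import numpy as np
from sklearn.svm import SVC

# Two linearly separable clusters -- assumed toy data.
X = np.array([[1.0, 1.0], [1.5, 2.0], [2.0, 1.5],   # class 0
              [4.0, 4.0], [4.5, 5.0], [5.0, 4.5]])  # class 1
y = np.array([0, 0, 0, 1, 1, 1])

# A very large C approximates the hard-margin SVM: misclassifications become
# prohibitively expensive, so the margin is maximized while classifying every
# training point correctly.
clf = SVC(kernel="linear", C=1e6).fit(X, y)

w = clf.coef_[0]                        # normal vector of the separating hyperplane
margin_width = 2.0 / np.linalg.norm(w)  # distance between the two margin boundaries

print("Support vectors:\n", clf.support_vectors_)
print("Margin width:", margin_width)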
When data is not linearly separable (i.e., it can’t be divided by a straight line), SVM uses a
technique called kernels to map the data into a higher-dimensional space where it becomes
separable. This transformation helps SVM find a decision boundary even for non-linear data.
A kernel is a function that maps data points into a higher-dimensional space without explicitly
computing the coordinates in that space. This allows SVM to work efficiently with non-linear
data by implicitly performing the mapping.
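As a concrete illustration of that claim, here is a small sketch (the vectors and the degree-2
polynomial kernel are assumed examples, not from the notes): the kernel value computed directly in
the original 2-D space equals the dot product under an explicit feature map into 3-D, so the
mapping never has to be carried out.

import numpy as np

def phi(v):
    """Explicit degree-2 feature map for a 2-D vector: (x1^2, sqrt(2)*x1*x2, x2^2)."""
    x1, x2 = v
    return np.array([x1 ** 2, np.sqrt(2) * x1 * x2, x2 ** 2])

def poly_kernel(x, z):
    """Implicit computation: K(x, z) = (x . z)^2, evaluated in the original 2-D space."""
    return np.dot(x, z) ** 2

x = np.array([1.0, 2.0])
z = np.array([3.0, 4.0])

print(np.dot(phi(x), phi(z)))  # 121.0 -- dot product after the explicit mapping
print(poly_kernel(x, z))       # 121.0 -- same value, no explicit mapping needed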
For example, consider data points that are not linearly separable. By applying a kernel function,
SVM transforms the data points into a higher-dimensional space where they become linearly
separable.
• Radial Basis Function (RBF) Kernel: Maps data based on the distances between points, using
K(x, z) = exp(-γ ||x - z||²); it is a common default choice for non-linear data.
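A minimal sketch (the generated dataset and parameter values are assumptions for illustration):
two concentric circles that no straight line can separate. A linear kernel performs near chance
level, while the RBF kernel recovers the circular boundary.

from sklearn.datasets import make_circles
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Concentric circles: not linearly separable in the original 2-D space.
X, y = make_circles(n_samples=300, factor=0.4, noise=0.05, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

linear_svm = SVC(kernel="linear").fit(X_train, y_train)
rbf_svm = SVC(kernel="rbf", gamma="scale").fit(X_train, y_train)

print("Linear kernel accuracy:", linear_svm.score(X_test, y_test))  # roughly chance level
print("RBF kernel accuracy:   ", rbf_svm.score(X_test, y_test))     # close to 1.0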
SVM Disadvantages
• Training can be slow and memory-intensive on very large datasets.
• Performance depends heavily on the choice of kernel and hyperparameters (such as C and gamma).
• Less effective when classes overlap heavily or the data is very noisy.
• Does not directly provide probability estimates; these require additional calibration.
SVM Applications
• Text classification and spam filtering.
• Image classification and handwritten digit recognition.
• Bioinformatics tasks such as protein and gene classification.
• Face detection and other detection problems.
Vapnik-Chervonenkis Dimension
The Vapnik-Chervonenkis (VC) dimension is a measure of the capacity of a hypothesis set to fit
different data sets. It was introduced by Vladimir Vapnik and Alexey Chervonenkis in the 1970s
and has become a fundamental concept in statistical learning theory. As a measure of model
complexity, the VC dimension helps us understand how expressive a model is and therefore how
well it can fit different data sets.
The VC dimension of a hypothesis set H is the largest number of points that can be shattered by
H. A hypothesis set H shatters a set of points S if, for every possible labeling of the points in S,
there exists a hypothesis in H that correctly classifies the points. In other words, a hypothesis set
shatters a set of points if it can fit any possible labeling of those points.
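The classic example is that a straight-line classifier in the plane can shatter any 3 points in
general position but no set of 4 points, so its VC dimension is 3. The sketch below (the point
configuration and the use of a linear-kernel SVC are assumptions for illustration) checks the first
half of that claim by fitting a linear classifier to every possible labeling of 3 non-collinear
points.

from itertools import product

import numpy as np
from sklearn.svm import SVC

# Three non-collinear points in the plane -- assumed example configuration.
points = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])

shattered = True
for labels in product([0, 1], repeat=len(points)):
    if len(set(labels)) == 1:
        # A single-class labeling is trivially realizable by a constant classifier.
        continue
    y = np.array(labels)
    clf = SVC(kernel="linear", C=1e6)  # large C approximates a hard margin
    clf.fit(points, y)
    if not np.array_equal(clf.predict(points), y):
        shattered = False
        break

print("Every labeling of the 3 points is realized by a line:", shattered)  # expected: True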
Bounds of the VC Dimension
The VC dimension provides both upper and lower bounds on the number of training examples
required to achieve a given level of accuracy. Both bounds grow essentially linearly with the VC
dimension (the upper bound carries extra logarithmic factors), so a richer hypothesis class needs
more data to generalize reliably.
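For reference, the standard PAC-learning sample-complexity bounds for a hypothesis class of VC
dimension \(d\) in the realizable setting (a textbook statement, not taken from these notes) are:

\[
m(\varepsilon, \delta) \;=\; O\!\left(\frac{1}{\varepsilon}\left(d \log\frac{1}{\varepsilon} + \log\frac{1}{\delta}\right)\right),
\qquad
m(\varepsilon, \delta) \;=\; \Omega\!\left(\frac{d + \log\frac{1}{\delta}}{\varepsilon}\right),
\]

where \(\varepsilon\) is the error tolerance and \(\delta\) the allowed failure probability, so the
number of training examples needed to reach accuracy \(\varepsilon\) with confidence \(1 - \delta\)
is sandwiched between these two expressions.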
Applications of the VC Dimension
The VC dimension has a wide range of applications in machine learning and statistics. For
example, it is used to analyze the complexity of neural networks, support vector machines, and
decision trees. The VC dimension can also be used to design new learning algorithms that are
robust to noise and can generalize well to unseen data.
The VC dimension can be extended to more complex learning scenarios, such as multiclass
classification and regression. The concept of the VC dimension can also be applied to other areas
of computer science, such as computational geometry and graph theory.