
CS 725 : Foundations of Machine Learning Autumn 2011

Lecture 27: SVM: Dual Formulation, Notion of Kernel


Instructor: Ganesh Ramakrishnan Date: 05/11/2011
Computer Science & Engineering Indian Institute of Technology, Bombay

21.2 SVM : Dual Formulation


Primal formulation

    p∗ = min_{x ∈ D} f(x)                                        (49)
    s.t. gi(x) ≤ 0,   i = 1, . . . , m                           (50)

Dual Formulation

    d∗ = max_{λ ∈ R^m}  min_{x ∈ D}  ( f(x) + Σ_{i=1}^{m} λi gi(x) )        (53)
    s.t. λi ≥ 0,   i = 1, . . . , m                                         (54)

Equation 53 is a convex optimization problem: the inner minimum over x is a pointwise minimum of functions that are affine in λ, hence concave in λ, so the outer problem maximizes a concave function. Also, d∗ ≤ p∗, and (p∗ − d∗) is called the duality gap.
If for some (x∗, λ∗), where x∗ is primal feasible and λ∗ is dual feasible, the KKT conditions are satisfied and f and all gi are convex, then x∗ is an optimal solution to the primal and λ∗ to the dual.
Also, the dual optimization problem becomes,

    d∗ = max_{λ ∈ R^m}  L∗(λ)                                               (55)
    s.t. λi ≥ 0  ∀i                                                         (56)

    where  L(x, λ) = f(x) + Σ_{i=1}^{m} λi gi(x)                            (57)

           L∗(λ) = min_{x ∈ D} L(x, λ)                                      (58)
                 = min_{x satisfying the KKT conditions} L(x, λ),   λi ≥ 0 ∀i    (59)

In this case, it turns out that strong duality holds:

    p∗ = d∗                                                      (61)


21.3 Duality theory applied to KKT


    L(w̄, ξ̄, w0, ᾱ, λ̄) = (1/2)||w||^2 + c Σ_{i=1}^{m} ξi
                          + Σ_{i=1}^{m} αi ( 1 − ξi − yi (φT(xi) w + w0) )
                          − Σ_{i=1}^{m} λi ξi                               (62)

Now we check the KKT conditions at the point of optimality.

KKT 1.a

    ∇w L = 0                                                      (63)
    =⇒  w − Σ_{j=1}^{n} αj yj φ(xj) = 0                           (64)

KKT 1.b

    ∇ξi L = 0                                                     (65)
    =⇒  c − αi − λi = 0                                           (66)

KKT 1.c

    ∇w0 L = 0                                                     (67)
    =⇒  Σ_{i=1}^{n} αi yi = 0                                     (68)

KKT 2

    ∀i:  yi (φT(xi) w + w0) ≥ 1 − ξi                              (70)
         ξi ≥ 0                                                   (71)

KKT 3

    αj ≥ 0  and  λk ≥ 0,   ∀ j, k = 1, . . . , n                  (72)

KKT 4

    αj ( yj (φT(xj) w + w0) − 1 + ξj ) = 0                        (74)
    λk ξk = 0                                                     (75)

(a)

    w∗ = Σ_{j=1}^{m} αj yj φ(xj)                                  (76)

Thus w∗ is a weighted linear combination of the mapped points φ(xj).

(b)

If 0 < αj < c then, by Equation 66, 0 < λj < c; by Equation 75, ξj = 0; and so yj (φT(xj) w + w0) = 1, i.e. xj lies exactly on the margin.

If, however, αj = c then λj = 0, ξj may be positive, and yj (φT(xj) w + w0) ≤ 1, i.e. xj lies on or inside the margin (and is misclassified if ξj > 1).

If αj = 0 then λj = c, hence ξj = 0, and we get yj (φT(xj) w + w0) ≥ 1, i.e. xj lies on or outside the margin and does not contribute to w∗.
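This case analysis can be applied directly to a solved α vector. The sketch below (a numpy illustration with hypothetical variable names; a numerical tolerance replaces exact comparisons) separates the non-support vectors, the margin support vectors, and the bound support vectors:

    import numpy as np

    def categorize_alphas(alpha, c, tol=1e-6):
        # Solvers return approximate alphas, so compare up to a tolerance.
        alpha = np.asarray(alpha)
        non_sv    = np.where(alpha <= tol)[0]                        # alpha_j = 0
        margin_sv = np.where((alpha > tol) & (alpha < c - tol))[0]   # 0 < alpha_j < c
        bound_sv  = np.where(alpha >= c - tol)[0]                    # alpha_j = c
        return non_sv, margin_sv, bound_sv

    # e.g. categorize_alphas([0.0, 0.3, 1.0], c=1.0) puts index 0 among the
    # non-support vectors, index 1 on the margin, and index 2 at the bound.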

21.4 SVM dual


SVM can be formulated as the following optimization problem,
    min_{w, w0, ξ}  { (1/2)||w||^2 + C Σ_{i=1}^{m} ξi }

subject to the constraints,

    ∀i :  yi (φT(xi) w + w0) ≥ 1 − ξi,    ξi ≥ 0
The dual of the SVM optimization problem can be stated as,
    max_{α} { − (1/2) Σ_{i=1}^{m} Σ_{j=1}^{m} yi yj αi αj φT(xi) φ(xj) + Σ_{j=1}^{m} αj }

subject to the constraints,

    Σ_i αi yi = 0
    ∀i :  0 ≤ αi ≤ c
The duality gap = f(x∗) − L∗(λ∗) = 0, as shown in the last lecture. Thus, as is evident from the solution of the dual problem,

    w∗ = Σ_{i=1}^{m} αi∗ yi φ(xi)

To obtain w0∗, we can use the fact (shown in the last lecture) that if αi ∈ (0, C), then yi (φT(xi) w + w0) = 1. Thus, for any point xi with αi ∈ (0, C), that is, any xi lying on the margin,

    w0∗ = ( 1 − yi φT(xi) w∗ ) / yi
        = yi − φT(xi) w∗              (since yi ∈ {−1, +1}, so 1/yi = yi)

The decision function,


    g(x) = φT(x) w∗ + w0∗
         = Σ_{i=1}^{m} αi yi φT(x) φ(xi) + w0∗
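This decision function translates directly into code. The sketch below is a minimal numpy illustration assuming the dual solution (alpha, w0) and a kernel function k(·, ·) are already in hand; the names are our own. Only the support vectors (αi > 0) contribute to the sum.

    import numpy as np

    def decision_function(x, X_train, y_train, alpha, w0, kernel):
        # g(x) = sum_i alpha_i y_i k(x, x_i) + w0; terms with alpha_i = 0 vanish,
        # so only the support vectors are evaluated.
        sv = alpha > 1e-8
        k_vals = np.array([kernel(x, xi) for xi in X_train[sv]])
        return np.sum(alpha[sv] * y_train[sv] * k_vals) + w0

    # The predicted label is the sign of g(x), e.g. with a linear kernel:
    # y_hat = np.sign(decision_function(x_new, X, y, alpha, w0, lambda a, b: a @ b))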

21.5 Kernel Matrix


The kernel matrix is

    K = [ φT(x1)φ(x1)   φT(x1)φ(x2)   . . .   φT(x1)φ(xn) ]
        [ φT(x2)φ(x1)   φT(x2)φ(x2)   . . .   φT(x2)φ(xn) ]
        [     . . .         . . .     . . .       . . .   ]
        [ φT(xn)φ(x1)   φT(xn)φ(x2)   . . .   φT(xn)φ(xn) ]

In other words, Kij = φT(xi) φ(xj). The SVM dual can now be re-written as,
    max_{α} { − (1/2) αT Ky α + αT 1 }

where (Ky)ij = yi yj Kij and 1 = ones(m, 1), subject to the constraints,

    Σ_i αi yi = 0
    0 ≤ αi ≤ c

Thus, for any i with αi ∈ (0, C),

    w0∗ = yi − φT(xi) w∗
        = yi − Σ_{j=1}^{m} αj∗ yj φT(xi) φ(xj)
        = yi − Σ_{j=1}^{m} αj∗ yj Kij
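Note that this last expression needs only the kernel matrix, never φ itself. Below is a minimal numpy sketch (function names are our own, and at least one margin point is assumed to exist) that builds K from a kernel function and recovers w0∗ from one such point. In practice the estimate is often averaged over all indices with αi ∈ (0, C) to reduce numerical error.

    import numpy as np

    def kernel_matrix(X, kernel):
        # K_ij = k(x_i, x_j) for the training points (rows of X).
        m = X.shape[0]
        K = np.empty((m, m))
        for i in range(m):
            for j in range(m):
                K[i, j] = kernel(X[i], X[j])
        return K

    def bias_from_margin_point(K, y, alpha, c, tol=1e-6):
        # w0* = y_i - sum_j alpha_j y_j K_ij for any i with 0 < alpha_i < c.
        i = int(np.where((alpha > tol) & (alpha < c - tol))[0][0])
        return y[i] - np.sum(alpha * y * K[i, :])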

Generation of φ space
For a given x = [x1, x2, . . . , xn], the degree-d feature map is φ(x) = [x1^d, x2^d, x3^d, . . . , x1^{d−1} x2, . . . ], containing all ordered monomials of degree d.
For n = 2, d = 2, φ(x) = [x1^2, x1 x2, x2 x1, x2^2], thus

    φT(x) · φ(x̄) = Σ_{i=1}^{n} Σ_{j=1}^{n} xi xj x̄i x̄j
                  = ( Σ_{i=1}^{n} xi x̄i ) ( Σ_{j=1}^{n} xj x̄j )
                  = ( Σ_{i=1}^{n} xi x̄i )^2
                  = (xT x̄)^2

In general, for n ≥ 1 and d ≥ 1, φT(x) · φ(x̄) = (xT x̄)^d.

A polynomial kernel, in general, is defined as Kij = (xiT xj)^d.
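This identity for n = 2, d = 2 is easy to verify numerically with the explicit map φ(x) = [x1^2, x1 x2, x2 x1, x2^2]; the test values below are arbitrary:

    import numpy as np

    def phi(x):
        # Explicit feature map for n = 2, d = 2: [x1^2, x1*x2, x2*x1, x2^2]
        return np.array([x[0]**2, x[0]*x[1], x[1]*x[0], x[1]**2])

    x, xbar = np.array([1.0, 2.0]), np.array([3.0, -1.0])
    lhs = phi(x) @ phi(xbar)      # inner product computed in feature space
    rhs = (x @ xbar) ** 2         # kernel evaluated directly in input space
    assert np.isclose(lhs, rhs)   # both equal (1*3 + 2*(-1))^2 = 1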
CS 725 : Foundations of Machine Learning Autumn 2011

Lecture 28: SVM: Kernel Methods, Algorithms for solving Dual


Instructor: Ganesh Ramakrishnan Date: 07/11/2011
Computer Science & Engineering Indian Institute of Technology, Bombay

21.6 Requirements of Kernel


1. Since

       Kij = φT(xi) φ(xj) = φT(xj) φ(xi) = Kji,

   K should be a symmetric matrix.


2. By the Cauchy-Schwarz inequality,

       ( φT(x) φ(x̄) )^2 ≤ ||φ(x)||^2 ||φ(x̄)||^2
       ⇒  Kij^2 ≤ Kii Kjj

3. Positivity of the diagonal. Write the eigendecomposition

       K = V Λ V^T,

   where V is the eigenvector matrix (an orthogonal matrix) and Λ is the diagonal matrix of eigenvalues. The goal is to construct a φ that realizes K. Taking φ(xi) = Λ^{1/2} vi, where vi is the i-th row of V, gives

       φT(xi) φ(xj) = viT Λ vj = Kij,

   which is possible only if every eigenvalue satisfies λk ≥ 0. In particular, Kii = Σ_k λk Vik^2 ≥ 0.

Hence K must be
1. Symmetric.
2. Positive Semi Definite.
3. Having non-negative Diagonal Entries.
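These three necessary conditions are straightforward to test numerically for a candidate Gram matrix. A minimal numpy sketch (the function name and tolerance are our own choices):

    import numpy as np

    def is_valid_kernel_matrix(K, tol=1e-8):
        # Checks symmetry, positive semi-definiteness, and a non-negative
        # diagonal (the last is in fact implied by the first two).
        K = np.asarray(K, dtype=float)
        symmetric = np.allclose(K, K.T, atol=tol)
        psd = np.all(np.linalg.eigvalsh((K + K.T) / 2.0) >= -tol)
        nonneg_diag = np.all(np.diag(K) >= -tol)
        return symmetric and psd and nonneg_diag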


Examples of Kernels
1. Kij = (xiT xj)^d

2. Kij = (xiT xj + 1)^d

3. Gaussian or Radial Basis Function (RBF):
   Kij = exp( − ||xi − xj||^2 / (2σ^2) )      (σ ∈ R, σ ≠ 0)

4. The hyperbolic tangent function:
   Kij = tanh(σ xiT xj + c)
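For concreteness, the four kernels above written as plain functions. This is a numpy sketch with our own names and argument conventions; the Gaussian kernel is written with the squared norm, as in the standard definition.

    import numpy as np

    def poly_kernel(x, z, d):            # 1. K_ij = (x_i^T x_j)^d
        return (x @ z) ** d

    def poly_affine_kernel(x, z, d):     # 2. K_ij = (x_i^T x_j + 1)^d
        return (x @ z + 1.0) ** d

    def rbf_kernel(x, z, sigma):         # 3. K_ij = exp(-||x_i - x_j||^2 / (2 sigma^2))
        return np.exp(-np.sum((x - z) ** 2) / (2.0 * sigma ** 2))

    def tanh_kernel(x, z, sigma, c):     # 4. K_ij = tanh(sigma * x_i^T x_j + c)
        return np.tanh(sigma * (x @ z) + c)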

Properties of Kernel Functions


If K′ and K′′ are kernels, then K is also a kernel if any of the following holds:
1. Kij = K′ij + K′′ij
2. Kij = αK′ij (α ≥ 0)
3. Kij = K′ij K′′ij
Proof : (1) and (2) are left as an exercise.
(3)

    Kij = K′ij K′′ij
        = ( φ′T(xi) φ′(xj) ) ( φ′′T(xi) φ′′(xj) )

Define φ(xi) = φ′(xi) ⊗ φ′′(xi), the tensor product of the two feature vectors, whose components are all products φ′k(xi) φ′′l(xi). Then φT(xi) φ(xj) = ( φ′T(xi) φ′(xj) ) ( φ′′T(xi) φ′′(xj) ) = Kij.
Hence, K is a valid kernel.
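The closure under elementwise products can also be checked numerically. The sketch below (arbitrary random data, our own variable names) forms two Gram matrices on the same points, multiplies them entrywise, and confirms that the result is still positive semi-definite:

    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.normal(size=(20, 3))                 # 20 arbitrary points in R^3

    K1 = (X @ X.T) ** 2                          # polynomial kernel, d = 2
    sq = np.sum(X ** 2, axis=1)
    K2 = np.exp(-(sq[:, None] + sq[None, :] - 2 * X @ X.T) / 2.0)   # RBF, sigma = 1

    K = K1 * K2                                  # elementwise (Schur) product, property 3
    min_eig = np.linalg.eigvalsh((K + K.T) / 2.0).min()
    print(min_eig >= -1e-8)                      # True: the product is still PSD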

21.7 Algorithms for solving the dual


Duality offers multiple alternative checkpoints to see whether a solution is optimal:
1. The KKT conditions are satisfied ∀i.
2. The primal objective ≈ the dual objective (zero duality gap).
We prefer solving the dual since we can work with the kernel and avoid computing a complex φ explicitly.
(For the linear kernel K(x, x̄) = xT x̄, i.e. φ(x) = x, the map φ is simple and the problem could equally well be solved in primal form.)

Sequential Minimal Optimization Algorithm (SMO)


It turns out that in most solutions, most αi = 0, so general LCQP (linearly constrained quadratic programming) solvers are overkill. To exploit this, we use batch coordinate-wise ascent. One of the best performers is the Sequential Minimal Optimization (SMO) algorithm, which optimizes 2 α's at a time. The steps of the algorithm are:

1. Start with all αi = 0

2. Select any 2 α's, say α1 and α2, that violate the KKT conditions


3. Solve for α1 and α2

       min_{α1, α2}   − α1 − α2 − Σ_{i≠1,2} αi
                      + (1/2) ( α1^2 K11 + α2^2 K22 + 2 α1 α2 y1 y2 K12 )
                      + α1 y1 Σ_{i≠1,2} K1i αi yi  +  α2 y2 Σ_{i≠1,2} K2i αi yi           (77)

       s.t.  α1 y1 + α2 y2 = − Σ_{j≠1,2} αj yj = α1old y1 + α2old y2

             α1, α2 ∈ [0, c]

4. From the equality constraint above, we can write α1 in terms of α2:

       α1 = − (y2/y1) α2 + α1old + (y2/y1) α2old

   The objective then becomes a function of α2 alone; call it −D(α2). The program reduces to

       min_{α2}  − D(α2)
       s.t.  α2 ∈ [0, c]

   Find α2∗ such that ∂D(α2)/∂α2 = 0. We also have to ensure that α1 ∈ [0, c], so we may have to clip α2, i.e. shift it into a certain interval. The condition is

       0 ≤ − (y2/y1) α2 + α1old + (y2/y1) α2old ≤ c

5. • Case 1: y1 = y2

       α2 ∈ [ max(0, α1old + α2old − c),  min(c, α1old + α2old) ]

   • Case 2: y1 = −y2

       α2 ∈ [ max(0, α2old − α1old),  min(c, c − α1old + α2old) ]

   If α2 is already in this interval, there is no problem. If it exceeds the upper limit, reset it to the upper limit; similarly, if it falls below the lower limit, reset it to the lower limit. This clipping yields the optimum of the objective subject to the box constraint.
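A minimal sketch of this clipping step, following the two cases above (helper names are our own):

    def alpha2_bounds(y1, y2, a1_old, a2_old, c):
        # Interval [L, H] that alpha_2 must lie in so that both alpha_1 and
        # alpha_2 stay in [0, c] under the equality constraint.
        if y1 == y2:                              # case 1: same labels
            return max(0.0, a1_old + a2_old - c), min(c, a1_old + a2_old)
        else:                                     # case 2: opposite labels
            return max(0.0, a2_old - a1_old), min(c, c - a1_old + a2_old)

    def clip(a2_new, L, H):
        # Reset the unconstrained optimizer of -D(alpha_2) to the feasible interval.
        return max(L, min(H, a2_new))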

Chunking and Decomposition Methods


We are interested in solving the dual of the objective because, as we have already seen, most of the dual variables will be zero in the solution, so the dual yields a sparse solution (based on the KKT conditions).

    Dual:   min_{α}   − Σ_i αi + (1/2) Σ_i Σ_j αi αj yi yj Kij                  (78)

            s.t.  Σ_i αi yi = 0

                  αi ∈ [0, c]

The above program is a quadratic program. Any quadratic solver can be used for (78), but a generic solver does not take the special structure of the solution into account and may not be efficient. One way to solve (78) is to use projection methods (also called the kernel adatron). Two other ways of solving it are chunking methods and decomposition methods.
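Before turning to those, it may help to see how (78) maps onto a generic quadratic-programming solver. The following is only a minimal sketch, assuming the cvxopt package is available and K is a dense numpy kernel matrix; the function name is our own, and, as noted above, a generic solve like this ignores the sparsity of the solution.

    import numpy as np
    from cvxopt import matrix, solvers   # assumes cvxopt is installed

    def solve_svm_dual_qp(K, y, c):
        # Dual (78) as a standard QP: minimize 1/2 a^T Q a - 1^T a with
        # Q_ij = y_i y_j K_ij, subject to 0 <= a_i <= c and y^T a = 0.
        m = len(y)
        y = np.asarray(y, dtype=float)
        Q = matrix(np.outer(y, y) * K)
        q = matrix(-np.ones(m))
        G = matrix(np.vstack([-np.eye(m), np.eye(m)]))      # encodes -a <= 0 and a <= c
        h = matrix(np.hstack([np.zeros(m), c * np.ones(m)]))
        A = matrix(y.reshape(1, -1))
        b = matrix(0.0)
        sol = solvers.qp(Q, q, G, h, A, b)
        return np.ravel(np.array(sol['x']))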
The chunking method is as follows

1. Initialize the αi's arbitrarily.

2. Choose the points (that is, the components αi) that violate the KKT conditions (one way to check this is sketched after the decomposition paragraph below).

3. Consider only the working set (WS) and solve the dual for the variables in the working set:

       min_{αi : i ∈ WS}   − Σ_{i ∈ WS} αi + (1/2) Σ_{i ∈ WS} Σ_{j ∈ WS} αi αj yi yj Kij        (79)

       s.t.  Σ_{i ∈ WS} αi yi = − Σ_{j ∉ WS} αj yj

             αi ∈ [0, c]   ∀ i ∈ WS

4. Set αnew = [αWS_new, αnonWS_old], i.e. update the working-set components and keep the remaining components unchanged.

Decomposition methods follow almost the same procedure, except that in step 2 we always take a fixed number of points, namely those that violate the KKT conditions the most.
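A minimal sketch of the KKT-violation check used in step 2 of the chunking method (and in the selection step of decomposition methods). The function name, tolerance, and the assumption that the current bias w0 is available are our own choices; the three clauses mirror the case analysis of Section 21.3.

    import numpy as np

    def kkt_violators(alpha, y, K, w0, c, tol=1e-3):
        # g(x_i) = sum_j alpha_j y_j K_ij + w0 for every training point.
        g = (alpha * y) @ K + w0
        yg = y * g
        viol = ((alpha < tol) & (yg < 1 - tol)) | \
               ((alpha > c - tol) & (yg > 1 + tol)) | \
               ((alpha > tol) & (alpha < c - tol) & (np.abs(yg - 1) > tol))
        return np.where(viol)[0]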

Further Reading
For SVMs in general and kernel methods in particular, read the book An Introduction to Support Vector Machines and Other Kernel-based Learning Methods by Nello Cristianini and John Shawe-Taylor, uploaded on Moodle.
CS 725 : Foundations of Machine Learning Autumn 2011

Lecture 29: Support Vector Regression, Attribute Selection


Instructor: Ganesh Ramakrishnan Date: 11/11/2011
Computer Science & Engineering Indian Institute of Technology, Bombay

22 Support Vector Regression


Please refer to previous years' notes (http://www.cse.iitb.ac.in/~cs725/notes/classNotes/lecturenote_2010.pdf), Section 22.2, for this topic.

23 Attribute Selection and Transformation


Please refer to the following material for this topic:

1. Chapter 7 of the book Data Mining by I. H. Witten and E. Frank

2. Slides at http://www.cse.iitb.ac.in/~cs725/notes/classNotes/dataprocessing.pdf
