0% found this document useful (0 votes)

18 views

ZhuoLiu SVclustering

Support vector clustering is an unsupervised machine learning technique that maps data points into a Hilbert space and finds a minimal enclosing sphere to determine clusters. It addresses limitations of K-means by allowing clusters of arbitrary shapes and densities. The algorithm uses a Gaussian kernel to map points into a higher dimensional space where a minimal enclosing sphere is found. Points inside the sphere are clustered together, while those outside are outliers. The technique was shown to outperform other clustering methods on benchmark datasets like iris data.

Uploaded by

josephashwinkurian

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views

ZhuoLiu SVclustering

Uploaded by

josephashwinkurian

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 28

Support Vector

Clustering
A SA BE N- H U R , D AVI D H O R N, H AVA T. SI E GE L M A N N,
VL A D I M I R VA P NI K

Zhuo Liu
Clustering
• Grouping a set of objects which are similar
• Similarity: distance, density, statistical distribution
• Unsupervised learning
Limitation of K-means: Differing Density

Original Data K-means (3 Clusters)

Limitation of K-means: Non-globular Shapes

Original Data K-means (2 Clusters)

Support Vector Clustering
• Data points are mapped by Gaussian kernel (NOT polynomial kernel
or linear kernel) to a Hilbert space
• Find minimal enclosing sphere in Hilbert space
• Map back the sphere back to data space, cluster forms
• Procedure to find this sphere is called the support vector domain
description (SVDD)
• SVDD is mainly used for outlier detection or novelty detection
• SVC is a unsupervised learning method
Support Vector Domain Description (SVDD)
• is a data set of N points
• Φ is a nonlinear transformation from to a Hilbert space
• Task:
minimize , with constraint
Support Vector Domain Description (SVDD)
• Lagrangian:

where , are Lagrange multipliers, is a constant, is the penalty term.

Support Vector Domain Description (SVDD)
Take partial derivatives and set them to be zeroes:

And KKT complementarity conditions of Fletcher (1987) result in:

Support Vector Domain Description (SVDD)
• If , then , then , then , so point lies outside the sphere, it is called a
bounded support vector or BSV.
• If and , then , it is inside the sphere.
• If and , then , it lies on the surface of the sphere. Such a point will be
referred to as a support vector or SV.
• Note that when no BSVs exist.
Support Vector

Bounded Support
Vector

Inner Point
Support Vector Domain Description (SVDD)
• Wolfe dual form:

with constraints:
• Now, we can introduce kernel function such that

• How does different kernel work?

Polynomial Kernel
𝑑
𝐾 ( 𝑥 𝑖 , 𝑥 𝑗 ) =(𝑥 𝑖 .𝑥 𝑗 +1)
Gaussian Kernel
2
𝐾 ( 𝑥𝑖 ,𝑥 𝑗 )=𝑒𝑥𝑝(−(𝑥 𝑖−𝑥 𝑗 )¿¿2/𝑠 )¿
Cluster Assignment
• Generating adjacency matrix
• has component with value either 0 or 1
• 0: line segment between and cross out the sphere
1: line segment between and is always in the sphere
• Clustering based on graph-based model
[ ] [ ]
1 1 1 0 0 0 2 −1 −1 0 0 0 Second Smallest Eigenvalue
1 1 1 0 0 0 −1 2 −1 0 0 0 for Laplacian:
1 1 1 0 0 0 −1 −1 2 0 0 0
𝐴= 𝐿𝑎𝑝𝑙𝑎𝑐𝑖𝑎𝑛=
0 0 0 1 1 1 0 0 0 2 −1 −1 So there are two clusters.
0 0 0 1 1 1 0 0 0 −1 2 −1
0 0 0 1 1 1 0 0 0 −1 −1 2
Example

2
𝐾 ( 𝑥𝑖 ,𝑥 𝑗 )=exp(−𝑞‖𝑥𝑖 −𝑥 𝑗‖ )
Example with BSVs
• In real data, clusters are usually not as well separated as in previous
example, so we need to allow some BSVs.
• BSVs are assigned to the cluster that they are closest to.
• An important parameter - upper bound on the fraction of BSVs:

where is number of points, is the coefficient for penalty term.

• Asymptotically (for large ), the fraction of outliers tends to .
Example with BSVs
Clusters with Overlapping Density Functions
Experiment on Iris Data
• There are three types of flowers, represented by 50 instances each
• First two principal components space:
1. q = 6 p = 0.6
2. the third cluster split into two
3. When these two clusters are considered together, the result is 2 misclassifications
• First three principal component space:
1. q = 7.0 p = 0.70
2. four misclassifications
• First four principal component space:
1. q = 9.0 p = 0.75
2. 14 misclassifications
• # of SVs: 18 in 2D, 23 in 3D, 34 in 4D
• Reason for improvement in 2d and 3d: PCA reduces noise
Experiment on Iris Data
Compare with Other Non-Parametric
Clustering Algorithms
• The information theoretic approach of Tishby and Slonim (2001) : 5
misclassifications.
• The SPC algorithm of Blatt et al. (1997), when applied to the dataset
in the original data-space: 15 misclassifications.
• SVC: 2 misclassification in first two PCs space, 4 misclassification in
first three PCs space.
Principle to Choose Parameter
• Starting from a small value of q and increasing it. Initial value can be chosen
as:

which will result in a single cluster, so no outliers are needed, hence choose .
• Criteria : a low number of SVs guarantees smooth boundaries.
• If the number of SVs is excessive, or a number of singleton clusters form, one
should increase to allow SVs to turn into BSVs, and smooth cluster
boundaries emerge.
• In other words, we need to systematically increase q and p along a direction
that guarantees a minimal number of SVs.
Complexity
• SMO algorithm of Platt (1999) to solve the quadratic programming
problem – very efficient
• Labeling part:
• If # of SVs is O(1), labeling part:
• Memory usage: O(1).
• In overall, SVC is useful even for very large datasets
Conclusion
• SVC has no explicit bias of either the number, or the shape of clusters
• SVC is a unsupervised clustering algorithm
• Two parameters:
q: when it increases, clusters begin to split
p: soft margin constant that controls the number of outliers
• A unique advantage: cluster boundaries can be of arbitrary shape,
whereas other algorithms are most often limited to hyper-ellipsoids
References
A. Ben-Hur, A. Elisseeff, and I. Guyon. A stability based method for discovering structure in clustered data. in Pacific Symposium on
Biocomputing, 2002.

A. Ben-Hur, D. Horn, H.T. Siegelmann, and V. Vapnik. A support vector clustering method. in International Conference on Pattern
Recognition, 2000.

A. Ben-Hur, D. Horn, H.T. Siegelmann, and V. Vapnik. A support vector clustering method. in Advances in Neural Information
Processing Systems 13: Proceedings of the 2000 Conference, Todd K. Leen, Thomas G. Dietterich and Volker Tresp eds., 2001.

C.L. Blake and C.J. Merz. Uci repository of machine learning databases, 1998.

Marcelo Blatt, Shai Wiseman, and Eytan Domany. Data clustering using a model granular magnet. Neural Computation, 9(8):1805–
1842, 1997.

R.O. Duda, P.E. Hart, and D.G. Stork. Pattern Classification. John Wiley & Sons, New York, 2001. R.A. Fisher. The use of multiple
measurments in taxonomic problems. Annals of Eugenics, 7:179–188, 1936.

R. Fletcher. Practical Methods of Optimization. Wiley-Interscience, Chichester, 1987.

K. Fukunaga. Introduction to Statistical Pattern Recognition. Academic Press, San Diego, CA, 1990.

A.K. Jain and R.C. Dubes. Algorithms for clustering data. Prentice Hall, Englewood Cliffs, NJ, 1988.

H. Lipson and H.T. Siegelmann. Clustering irregular shapes using high-order neurons. Neural Computation, 12:2331–2353,
2000.
References
J. MacQueen. Some methods for classification and analysis of multivariate observations. In Proc. 5th Berkeley
Symposium on Mathematical Statistics and Probability, Vol. 1, 1965.
G.W. Milligan and M.C. Cooper. An examination of procedures for determining the number of clusters in a data set.
Psychometrika, 50:159–179, 1985.
J. Platt. Fast training of support vector machines using sequential minimal optimization. In Advances in Kernel
Methods — Support Vector Learning, B. Schölkopf, C. J. C. Burges, and A. J. Smola, editors, 1999.
B.D. Ripley. Pattern recognition and neural networks. Cambridge University Press, Cambridge, 1996.
S.J. Roberts. Non-parametric unsupervised cluster analysis. Pattern Recognition, 30(2): 261–272, 1997.
B. Schölkopf, R.C. Williamson, A.J. Smola, J. Shawe-Taylor, and J. Platt. Support vector method for novelty
detection. in Advances in Neural Information Processing Systems 12: Proceedings of the 1999 Conference, Sara A.
Solla, Todd K. Leen and Klaus-Robert Muller eds., 2000.
Bernhard Schölkopf, John C. Platt, John Shawe-Taylor, , Alex J. Smola, and Robert C. Williamson. Estimating the
support of a high-dimensional distribution. Neural Computation, 13:1443–1471, 2001.
R. Shamir and R. Sharan. Algorithmic approaches to clustering gene expression data. In T. Jiang, T. Smith, Y. Xu, and
M.Q. Zhang, editors, Current Topics in Computational Biology, 2000.
D.M.J. Tax and R.P.W. Duin. Support vector domain description. Pattern Recognition Letters, 20:1991–1999, 1999.
N. Tishby and N. Slonim. Data clustering by Markovian relaxation and the information bottleneck method. in
Advances in Neural Information Processing Systems 13: Proceedings of the 2000 Conference, Todd K. Leen, Thomas
G. Dietterich and Volker Tresp eds., 2001.
V. Vapnik. The Nature of Statistical Learning Theory. Springer, New York, 1995.
Thanks!

Literature Review
89% (9)
Literature Review
24 pages
Support Vector Machine
100% (2)
Support Vector Machine
11 pages
NURS 208 Final
No ratings yet
NURS 208 Final
11 pages
Proposal Writing PDF
No ratings yet
Proposal Writing PDF
5 pages
Support Vector Clustering: Journal of Machine Learning Research 2 (2001) 125-137 Submitted 3/04 Published 12/01
No ratings yet
Support Vector Clustering: Journal of Machine Learning Research 2 (2001) 125-137 Submitted 3/04 Published 12/01
13 pages
Artificial Intelligence and Machine Learning: T.A. Silvia Bucci
No ratings yet
Artificial Intelligence and Machine Learning: T.A. Silvia Bucci
78 pages
Support Vector Machine Classification For Large Data Sets Via Minimum Enclosing Ball Clustering
No ratings yet
Support Vector Machine Classification For Large Data Sets Via Minimum Enclosing Ball Clustering
9 pages
Support Vector Machine in R Paper
No ratings yet
Support Vector Machine in R Paper
28 pages
Support Vector Machine
No ratings yet
Support Vector Machine
12 pages
AP for NLP-LO2
No ratings yet
AP for NLP-LO2
38 pages
Support Vector Machines
100% (5)
Support Vector Machines
14 pages
SML Unit 4
No ratings yet
SML Unit 4
61 pages
A HSC-based Sample Selection Method For Support Vector Machine
No ratings yet
A HSC-based Sample Selection Method For Support Vector Machine
6 pages
Hearst SVM
No ratings yet
Hearst SVM
12 pages
Support Vector Machine - Wikipedia, The Free Encyclopedia
No ratings yet
Support Vector Machine - Wikipedia, The Free Encyclopedia
12 pages
A Comprehensive Survey On Support Vector Machine in Data Mining Tasks: Applications & Challenges
No ratings yet
A Comprehensive Survey On Support Vector Machine in Data Mining Tasks: Applications & Challenges
18 pages
6. Support Vector Machine for Classification
No ratings yet
6. Support Vector Machine for Classification
38 pages
A New Heuristic of The Decision Tree Induction: Ning Li, Li Zhao, Ai-Xia Chen, Qing-Wu Meng, Guo-Fang Zhang
No ratings yet
A New Heuristic of The Decision Tree Induction: Ning Li, Li Zhao, Ai-Xia Chen, Qing-Wu Meng, Guo-Fang Zhang
6 pages
Multi-Class Classification Using Support Vector Ma
No ratings yet
Multi-Class Classification Using Support Vector Ma
7 pages
Multi-Class Classification Using Support Vector Machines in Binary Tree Architecture
No ratings yet
Multi-Class Classification Using Support Vector Machines in Binary Tree Architecture
6 pages
SVM - Hype or Hallelujah
No ratings yet
SVM - Hype or Hallelujah
13 pages
Basic of SVM Algorithm
No ratings yet
Basic of SVM Algorithm
10 pages
Lec5 Support vector machine
No ratings yet
Lec5 Support vector machine
28 pages
2024-SCU-ML-2-1-SVM
No ratings yet
2024-SCU-ML-2-1-SVM
36 pages
SVM Explained PDF
No ratings yet
SVM Explained PDF
19 pages
K-SVM: An Effective SVM Algorithm Based On K-Means Clustering
No ratings yet
K-SVM: An Effective SVM Algorithm Based On K-Means Clustering
8 pages
Module10 - Support Vector Machine
No ratings yet
Module10 - Support Vector Machine
23 pages
Supervised Learning - Support Vector Machines and Feature Reduction
No ratings yet
Supervised Learning - Support Vector Machines and Feature Reduction
11 pages
MergedPDF Iml
No ratings yet
MergedPDF Iml
114 pages
Support Vector Machine (SVM) : Basic Terminologies
100% (1)
Support Vector Machine (SVM) : Basic Terminologies
2 pages
Support Vector Machines: Constantin F. Aliferis & Ioannis Tsamardinos
No ratings yet
Support Vector Machines: Constantin F. Aliferis & Ioannis Tsamardinos
37 pages
TAZ-TFG-2016-2057
No ratings yet
TAZ-TFG-2016-2057
52 pages
data mining techniques
No ratings yet
data mining techniques
27 pages
SVM-1
No ratings yet
SVM-1
36 pages
9 Svm-Handout PDF
No ratings yet
9 Svm-Handout PDF
21 pages
Maximal Margin Hyper-Sphere SVM For Binary Pattern Classification
No ratings yet
Maximal Margin Hyper-Sphere SVM For Binary Pattern Classification
23 pages
An SVM-Based Face Detection System
No ratings yet
An SVM-Based Face Detection System
7 pages
An Improved Training Algorithm For Support Vector Machines
No ratings yet
An Improved Training Algorithm For Support Vector Machines
10 pages
Unit-4 AI - SVM
No ratings yet
Unit-4 AI - SVM
21 pages
History: 1 Support Vector Machines: History
No ratings yet
History: 1 Support Vector Machines: History
6 pages
L5-Support Vector Machine
No ratings yet
L5-Support Vector Machine
61 pages
Introduction To Support Vector Machines
No ratings yet
Introduction To Support Vector Machines
46 pages
UBICC Article 522 522
No ratings yet
UBICC Article 522 522
8 pages
Fundamental Knowledge of Machine Learning: Abstract This Chapter Introduces The Basic Concepts and Methods of Machine
No ratings yet
Fundamental Knowledge of Machine Learning: Abstract This Chapter Introduces The Basic Concepts and Methods of Machine
14 pages
Support Vector Machines: (Vapnik, 1979)
No ratings yet
Support Vector Machines: (Vapnik, 1979)
34 pages
SVM Class
No ratings yet
SVM Class
33 pages
A Geometric Approach To Support Vector Machine SVM Classification
No ratings yet
A Geometric Approach To Support Vector Machine SVM Classification
12 pages
Discriminative and Generative Methods For Bags of Features: Zebra Non-Zebra
No ratings yet
Discriminative and Generative Methods For Bags of Features: Zebra Non-Zebra
40 pages
Support Vector Machines: Jeff Wu
No ratings yet
Support Vector Machines: Jeff Wu
35 pages
Learning Theory: y For Examples
No ratings yet
Learning Theory: y For Examples
11 pages
Articol Informatica Economica
No ratings yet
Articol Informatica Economica
10 pages
Unec 1705121586
No ratings yet
Unec 1705121586
33 pages
Unit 2
No ratings yet
Unit 2
47 pages
Introduction To Support Vector Machines: 1 Description
No ratings yet
Introduction To Support Vector Machines: 1 Description
15 pages
Support Vector Machine
No ratings yet
Support Vector Machine
19 pages
This Is
No ratings yet
This Is
7 pages
C. Cifarelli Et Al - Incremental Classification With Generalized Eigenvalues
No ratings yet
C. Cifarelli Et Al - Incremental Classification With Generalized Eigenvalues
25 pages
Chapter 07
No ratings yet
Chapter 07
18 pages
Presentation On Support Vector Machine (SVM)
100% (2)
Presentation On Support Vector Machine (SVM)
22 pages
Ijetae 0812 11
No ratings yet
Ijetae 0812 11
4 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Differential Equations
From Everand
Differential Equations
Harry Hochstadt
3.5/5 (2)
Physics Project
No ratings yet
Physics Project
12 pages
Department Order No. 245 24 Implementing Guidelines of The Career Development Support Program
No ratings yet
Department Order No. 245 24 Implementing Guidelines of The Career Development Support Program
5 pages
Theories in Second Language Acquisition An Introduction 3rd Edition Bill Vanpatten - The ebook is ready for download, no waiting required
No ratings yet
Theories in Second Language Acquisition An Introduction 3rd Edition Bill Vanpatten - The ebook is ready for download, no waiting required
86 pages
Krihsna As CEO
No ratings yet
Krihsna As CEO
6 pages
Lesson Plan Science
No ratings yet
Lesson Plan Science
4 pages
Complete Download Shaping the Digital Dissertation Knowledge Production in the Arts and Humanities 1st Edition Virginia Kuhn Anke Finger PDF All Chapters
No ratings yet
Complete Download Shaping the Digital Dissertation Knowledge Production in the Arts and Humanities 1st Edition Virginia Kuhn Anke Finger PDF All Chapters
40 pages
Class Policies-Students'Copy Class Policies-Professor'Scopy: ST ST
No ratings yet
Class Policies-Students'Copy Class Policies-Professor'Scopy: ST ST
2 pages
(Ebook) Illusions in motion media archaeology of the moving panorama and related spectacles by Huhtamo, Erkki ISBN 9780262018517, 0262018519 All Chapters Instant Download
100% (4)
(Ebook) Illusions in motion media archaeology of the moving panorama and related spectacles by Huhtamo, Erkki ISBN 9780262018517, 0262018519 All Chapters Instant Download
71 pages
Teaching Matters 2nd Edition Todd Whitaker & Beth Whitaker - The ebook is ready for download to explore the complete content
100% (3)
Teaching Matters 2nd Edition Todd Whitaker & Beth Whitaker - The ebook is ready for download to explore the complete content
55 pages
Narrative Report TVL Sample
No ratings yet
Narrative Report TVL Sample
52 pages
Blindfold Chess: Blindfold Chess (Also Known As Sans Voir) Is A Form of Chess Play
100% (1)
Blindfold Chess: Blindfold Chess (Also Known As Sans Voir) Is A Form of Chess Play
5 pages
Harley Therapy Wellbeing Booklet PDF
100% (1)
Harley Therapy Wellbeing Booklet PDF
15 pages
Grade 9 - Badminton Summative Assessment
100% (1)
Grade 9 - Badminton Summative Assessment
6 pages
Canadian Aikido Federation Test Requirements
No ratings yet
Canadian Aikido Federation Test Requirements
4 pages
BTech. 3rd Year - CSE - Hindi - 2023-24
No ratings yet
BTech. 3rd Year - CSE - Hindi - 2023-24
33 pages
Phys 2041
No ratings yet
Phys 2041
7 pages
Tuke Sop
No ratings yet
Tuke Sop
1 page
Experimental Manipulations of Self-Affirmation A Systematic Review
No ratings yet
Experimental Manipulations of Self-Affirmation A Systematic Review
67 pages
Selection - Interviewing Notes - March 2020
No ratings yet
Selection - Interviewing Notes - March 2020
11 pages
Teaching English language skills (Reading & Listening)
No ratings yet
Teaching English language skills (Reading & Listening)
59 pages
Episode 3
No ratings yet
Episode 3
11 pages
Shaver & Mikulincer (2005) Attachment Theory and Research - Resurrection of The Psychodinamic Approach To Personality
67% (3)
Shaver & Mikulincer (2005) Attachment Theory and Research - Resurrection of The Psychodinamic Approach To Personality
24 pages
Writing Assignment Rubric - 2024
No ratings yet
Writing Assignment Rubric - 2024
1 page
HRGP Module 1 (Week 1-2) G7
No ratings yet
HRGP Module 1 (Week 1-2) G7
2 pages
Sjit Gs Research Project Concept Note
No ratings yet
Sjit Gs Research Project Concept Note
2 pages
Ahsan Raza_Resume
No ratings yet
Ahsan Raza_Resume
2 pages
unit test 1+2
No ratings yet
unit test 1+2
8 pages

ZhuoLiu SVclustering

Uploaded by

ZhuoLiu SVclustering

Uploaded by

Support Vector

Original Data K-means (3 Clusters)

Original Data K-means (2 Clusters)

where , are Lagrange multipliers, is a constant, is the penalty term.

And KKT complementarity conditions of Fletcher (1987) result in:

• How does different kernel work?

where is number of points, is the coefficient for penalty term.

R. Fletcher. Practical Methods of Optimization. Wiley-Interscience, Chichester, 1987.

You might also like