Pattern Recognition Linear Classifier by Zaheer Ahmad
Zaheer Ahmad PhD Scholar [email protected] Department of Computer Science University of Peshawar
Agenda
Pattern Recognition
Features and Patterns; Classifier Approaches; Design Cycle
Linear Classification
Linear Discriminant Functions; Linear Separability; Fisher Discriminant Functions; Support Vector Machines (SVMs)
Pattern recognition is "the process of giving names to observations x" (Schürmann). Pattern recognition is concerned with answering the question "What is this?" (Morse).
Applications of PR
Image processing, computer vision, speech recognition, data mining, automated target recognition, optical character recognition, seismic analysis, man and machine diagnostics, fingerprint identification, industrial inspection, financial forecasting, medical diagnosis, ECG signal analysis
Terminology
Recognition: during recognition (or classification), given objects are assigned to prescribed classes. Classification is the problem of identifying which of a set of categories (sub-populations) a new observation belongs to, on the basis of a training set of data containing observations (or instances) whose category membership is known. An algorithm that implements classification, especially in a concrete implementation, is known as a classifier; a classifier is a machine which performs classification.
Features
A feature is any distinctive aspect, quality, or characteristic of an object. Features may be symbolic (e.g., color) or numeric (e.g., height). The combination of d features is a d-dimensional column vector called a feature vector. The d-dimensional space defined by the feature vector is called the feature space.
Objects are represented as points in feature space; the result is a scatter plot
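A minimal sketch of this idea (the two classes and their feature values are assumed for illustration): each row is a feature vector, and plotting the vectors as points gives the scatter plot.

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)
# Hypothetical 2-D feature vectors (e.g., height, weight) for two classes
class_a = rng.normal(loc=[5.0, 60.0], scale=[0.3, 5.0], size=(50, 2))
class_b = rng.normal(loc=[6.0, 80.0], scale=[0.3, 5.0], size=(50, 2))

# Each object is a point in feature space
plt.scatter(class_a[:, 0], class_a[:, 1], label="class A")
plt.scatter(class_b[:, 0], class_b[:, 1], label="class B")
plt.xlabel("feature 1")
plt.ylabel("feature 2")
plt.legend()
plt.show()
```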
Pattern Classes
A pattern class (or category) is a set of patterns sharing common attributes and usually originating from the same source; that is, a class is a set of objects having some important properties in common.
Decision Boundary/Surface
A line or curve separating the classes is a decision boundary. The equation g(x) = 0 defines the decision surface that separates points assigned to category 1 from points assigned to category 2. When g(x) = wᵀx + w0 is linear, the decision surface is a hyperplane. If x1 and x2 are both on the hyperplane, then wᵀx1 + w0 = wᵀx2 + w0 = 0, so wᵀ(x1 − x2) = 0: the weight vector w is normal to the hyperplane.
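A numerical check of this property (the weight vector, bias, and the two points are assumptions for illustration):

```python
import numpy as np

w = np.array([2.0, -1.0])   # assumed weight vector
w0 = 0.5                    # assumed bias

def g(x):
    return w @ x + w0       # linear discriminant g(x) = w^T x + w0

# Two points chosen to lie on the decision surface g(x) = 0
x1 = np.array([0.0, 0.5])
x2 = np.array([1.0, 2.5])
print(g(x1), g(x2))    # both 0.0: x1 and x2 are on the hyperplane
print(w @ (x1 - x2))   # 0.0: w is orthogonal to the hyperplane
```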
Decision Boundary
Slope-intercept form of a straight line: the equation of a line with slope m and y-intercept b can be written as y = mx + b.
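To connect the two forms: in 2-D, a hyperplane w1·x + w2·y + w0 = 0 with w2 ≠ 0 can be solved for y, giving the slope-intercept form. A small sketch (coefficients assumed for illustration):

```python
# w1*x + w2*y + w0 = 0  =>  y = (-w1/w2)*x + (-w0/w2) = m*x + b
w1, w2, w0 = 2.0, -1.0, 0.5   # assumed hyperplane coefficients
m = -w1 / w2                  # slope
b = -w0 / w2                  # intercept
print(f"y = {m}x + {b}")      # y = 2.0x + 0.5
```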
The task of a classifier is to partition feature space into class-labeled decision regions. Borders between decision regions are called decision boundaries. Classifying a feature vector consists of determining which decision region it belongs to and assigning it to that class.
Classifiers
Neural: very attractive since it requires minimal a priori knowledge; with enough layers and neurons, ANNs can create any complex decision region. Syntactic: patterns are classified based on measures of structural similarity; knowledge is represented by means of formal grammars or relational descriptions (graphs); used not only for classification but also for description. Typically, syntactic approaches formulate hierarchical descriptions of complex patterns built up from simpler subpatterns.
Design Cycle
Model choice: statistical, neural, and structural approaches; parameter settings. Requires basic prior knowledge.
Training: given a feature set and a blank model, adapt the model to explain the data (supervised, unsupervised, or reinforcement learning).
Evaluation: how well does the trained model do? Overfitting vs. generalization, as sketched below.
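A minimal training-and-evaluation sketch using scikit-learn (the synthetic dataset and the choice of a perceptron are assumptions for illustration); comparing train and test accuracy is one simple check of generalization:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import Perceptron
from sklearn.model_selection import train_test_split

# Assumed synthetic 2-D dataset with two classes
X, y = make_classification(n_samples=200, n_features=2, n_redundant=0,
                           n_informative=2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3,
                                                    random_state=0)

# Training: adapt the model to explain the training data (supervised)
clf = Perceptron().fit(X_train, y_train)

# Evaluation: accuracy on held-out data measures generalization
print("train accuracy:", clf.score(X_train, y_train))
print("test accuracy: ", clf.score(X_test, y_test))
```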
Linear Classification
Classification in which the decision boundary in the feature (input) space is linear: the input space is split by hyperplanes into regions, each with an assigned class.
Linear Separability
If a hyperplanar decision boundary exists that correctly classifies all the training samples for a c = 2 class problem, the samples are said to be linearly separable.
Such a boundary can be written as g(x) = wᵀx + w0 = 0, where w is the weight vector and w0 is the bias (or threshold weight).
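A minimal sketch of classifying by the sign of g(x) (the weights, bias, and test points are assumptions for illustration):

```python
import numpy as np

def linear_classifier(x, w, w0):
    """Assign class 1 if g(x) = w^T x + w0 > 0, otherwise class 2."""
    return 1 if w @ x + w0 > 0 else 2

w = np.array([1.0, -2.0])   # assumed weight vector
w0 = 0.25                   # assumed bias
print(linear_classifier(np.array([3.0, 1.0]), w, w0))  # g = 1.25  -> class 1
print(linear_classifier(np.array([0.0, 1.0]), w, w0))  # g = -1.75 -> class 2
```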
Linear Classifiers
A linear classifier is a mapping that partitions feature space using a linear function (a straight line in 2-D, or a hyperplane in general). It is one of the simplest classifiers we can imagine: separate the two classes using a straight line in feature space.
Fisher Linear Discriminant
Classes well separated in D-dimensional space may strongly overlap when projected onto one dimension. Adjust the components of the weight vector w and select the projection that maximizes class separation.
When projected onto the line joining the class means, the classes are not well separated.
Fisher chooses a direction that makes the projected classes much tighter, even though their projected means are less far apart.
Project each sample onto a line: $y = \mathbf{w}^T \mathbf{x}$.

The separation of the projected class means suggests choosing $\mathbf{w} \propto \mathbf{m}_2 - \mathbf{m}_1$.

Within-class scatter of the projected samples from class $C_k$:
$$s_k^2 = \sum_{n \in C_k} (y_n - m_k)^2, \qquad k = 1, 2$$

Fisher criterion (between-class over within-class scatter):
$$J(\mathbf{w}) = \frac{(m_2 - m_1)^2}{s_1^2 + s_2^2} = \frac{\mathbf{w}^T \mathbf{S}_B \mathbf{w}}{\mathbf{w}^T \mathbf{S}_W \mathbf{w}}$$

where
$$\mathbf{S}_B = (\mathbf{m}_2 - \mathbf{m}_1)(\mathbf{m}_2 - \mathbf{m}_1)^T$$
$$\mathbf{S}_W = \sum_{n \in C_1} (\mathbf{x}_n - \mathbf{m}_1)(\mathbf{x}_n - \mathbf{m}_1)^T + \sum_{n \in C_2} (\mathbf{x}_n - \mathbf{m}_2)(\mathbf{x}_n - \mathbf{m}_2)^T$$

Optimal solution: $\mathbf{w} \propto \mathbf{S}_W^{-1}(\mathbf{m}_2 - \mathbf{m}_1)$.
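A direct implementation of this solution (the two Gaussian clouds are assumed test data): compute the class means, accumulate the within-class scatter matrix, and solve S_W w = m2 − m1.

```python
import numpy as np

def fisher_direction(X1, X2):
    """Fisher discriminant direction w ∝ S_W^{-1} (m2 - m1)."""
    m1, m2 = X1.mean(axis=0), X2.mean(axis=0)
    # Within-class scatter: sum of outer products of centered samples
    S_W = (X1 - m1).T @ (X1 - m1) + (X2 - m2).T @ (X2 - m2)
    w = np.linalg.solve(S_W, m2 - m1)
    return w / np.linalg.norm(w)

rng = np.random.default_rng(0)
X1 = rng.normal([0.0, 0.0], [1.0, 0.3], size=(100, 2))  # assumed class 1
X2 = rng.normal([2.0, 1.0], [1.0, 0.3], size=(100, 2))  # assumed class 2
w = fisher_direction(X1, X2)
y1, y2 = X1 @ w, X2 @ w  # 1-D projections y = w^T x
print("projected means:", y1.mean(), y2.mean())
```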
Separating Hyperplane
[Figure: two classes, labeled $y_i = +1$ and $y_i = -1$, in the $(x_1, x_2)$ plane.]
A separating hyperplane: $\mathbf{w} \cdot \mathbf{x} + b = 0$.
Separating Hyperplanes
[Figure: candidate hyperplanes between the two classes ($y_i = +1$, $y_i = -1$); the maximum-margin hyperplane gives good generalization.]
The SVM idea is to maximize the distance between the hyperplane and the closest sample points. In the optimal hyperplane, the closest samples $\mathbf{x}_i$ (the support vectors) lie at equal distance on either side, and this margin is as large as possible, as sketched below.
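A minimal linear-SVM sketch with scikit-learn (the synthetic data and the large C, approximating a hard margin, are assumptions for illustration); the margin width is 2/||w||:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2.0, 1.0, size=(50, 2)),
               rng.normal(+2.0, 1.0, size=(50, 2))])
y = np.array([-1] * 50 + [+1] * 50)

clf = SVC(kernel="linear", C=1e3).fit(X, y)  # large C ~ hard margin
w, b = clf.coef_[0], clf.intercept_[0]
print("hyperplane:", w, b)                   # w . x + b = 0
print("margin width:", 2.0 / np.linalg.norm(w))
print("number of support vectors:", len(clf.support_vectors_))
```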