Lecture 8: KNN (Part 1)
Instance-Based Classifiers
• Store the training records (a set of stored cases) and use them directly to predict the class label of unseen cases
• Example:
– Nearest neighbor
• Uses k “closest” points (nearest neighbors) for performing classification
Nearest Neighbor Classifiers
• Basic idea:
– If it walks like a duck, quacks like a duck, then it’s probably a duck
[Figure: the distance between a test record and each training record is computed; the nearest records determine the test record’s class]
• Compute the distance between two points p and q:
– Manhattan distance
$d(p, q) = \sum_i |p_i - q_i|$
– q-norm (Minkowski) distance
$d(p, q) = \left( \sum_i |p_i - q_i|^q \right)^{1/q}$
• q = 1 recovers the Manhattan distance; q = 2 gives the familiar Euclidean distance
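To make these metrics concrete, here is a minimal NumPy sketch; the helper name minkowski_distance and its exponent parameter r are our own (r stands in for the slide’s q, which already names one of the points):

import numpy as np

def minkowski_distance(p, q, r=2):
    # General q-norm distance: r=1 gives Manhattan, r=2 gives Euclidean
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    return np.sum(np.abs(p - q) ** r) ** (1.0 / r)

p, q = [1.0, 2.0, 3.0], [4.0, 0.0, 3.0]
print(minkowski_distance(p, q, r=1))  # Manhattan: |1-4| + |2-0| + |3-3| = 5.0
print(minkowski_distance(p, q, r=2))  # Euclidean: sqrt(9 + 4 + 0) ≈ 3.6056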
• Determine the class from the nearest-neighbor list
– Take the majority vote of class labels among the k nearest neighbors:
$y' = \operatorname{argmax}_v \sum_{(\mathbf{x}_i, y_i) \in D_z} I(v = y_i)$
where $D_z$ is the set of k closest training examples to z and $I(\cdot)$ is the indicator function.
– Or weigh each vote according to distance:
$y' = \operatorname{argmax}_v \sum_{(\mathbf{x}_i, y_i) \in D_z} w_i \times I(v = y_i)$
• weight factor: $w_i = 1 / d(\mathbf{x}', \mathbf{x}_i)^2$
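A small sketch of the distance-weighted vote, assuming the k nearest neighbors arrive as (distance, label) pairs; weighted_vote and the epsilon guard against zero distance are our own additions:

from collections import defaultdict

def weighted_vote(neighbors):
    # neighbors: list of (distance, label) pairs for the k nearest examples
    scores = defaultdict(float)
    for d, label in neighbors:
        scores[label] += 1.0 / (d ** 2 + 1e-12)  # w_i = 1/d^2, epsilon avoids division by zero
    return max(scores, key=scores.get)

print(weighted_vote([(0.5, "duck"), (2.0, "goose"), (2.5, "duck")]))  # duck (score 4.16 vs 0.25)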
The KNN classification algorithm
Let k be the number of nearest neighbors and D be the set of training examples.
1. for each test example z = (x', y') do
2.   Compute d(x', x), the distance between z and every example (x, y) ∈ D
3.   Select D_z ⊆ D, the set of k closest training examples to z
4.   y' = argmax_v Σ_{(x_i, y_i) ∈ D_z} I(v = y_i)
5. end for
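The pseudocode translates almost line for line into Python. This is a minimal sketch assuming Euclidean distance and an unweighted majority vote; knn_classify and the toy loan data are our own:

import numpy as np
from collections import Counter

def knn_classify(X_train, y_train, X_test, k=3):
    predictions = []
    for z in X_test:                                       # step 1: each test example
        dists = np.sqrt(((X_train - z) ** 2).sum(axis=1))  # step 2: distance to every example
        nearest = np.argsort(dists)[:k]                    # step 3: indices of D_z
        predictions.append(Counter(y_train[nearest]).most_common(1)[0][0])  # step 4: majority vote
    return np.array(predictions)

X_train = np.array([[25, 40_000], [30, 60_000], [45, 150_000], [50, 200_000]], dtype=float)
y_train = np.array(["Default", "Default", "Non-Default", "Non-Default"])
print(knn_classify(X_train, y_train, np.array([[28.0, 50_000.0]]), k=3))  # ['Default']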
KNN Classification
[Figure: scatter plot of loan amount (Loan$, $0 to $2,50,000) against Age (0 to 70), with points labeled Default and Non-Default]
Nearest Neighbor Classification…
• Choosing the value of k (the sketch below illustrates the trade-off):
– If k is too small, the classifier is sensitive to noise points
– If k is too large, the neighborhood may include points from other classes
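A toy demonstration of the trade-off, with one deliberately mislabeled “noise” point planted inside class B; the data and names here are invented for illustration:

import numpy as np
from collections import Counter

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (20, 2)),   # class A, clustered near (0, 0)
               rng.normal(4.0, 1.0, (20, 2)),   # class B, clustered near (4, 4)
               [[4.0, 4.0]]])                   # noise point inside B, mislabeled A
y = np.array(["A"] * 20 + ["B"] * 20 + ["A"])

def predict(z, k):
    d = np.linalg.norm(X - z, axis=1)
    return Counter(y[np.argsort(d)[:k]]).most_common(1)[0][0]

z = np.array([4.0, 4.0])   # query lands exactly on the noise point
print(predict(z, k=1))     # "A": k too small, the lone noise point decides
print(predict(z, k=5))     # "B": a larger neighborhood outvotes the noise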
Nearest Neighbor Classification…
• Scaling issues
– Attributes may have to be scaled to prevent distance measures from being dominated by one of the attributes (see the sketch after this list)
– Example:
• height of a person may vary from 1.5 m to 1.8 m
• weight of a person may vary from 60 kg to 100 kg
• income of a person may vary from Rs 10K to Rs 2 Lakh
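A short sketch of why scaling matters, using the three attributes from the example above (min-max ranges taken straight from the slide):

import numpy as np

# Two people: (height in m, weight in kg, income in Rs)
a = np.array([1.5, 60.0, 200_000.0])
b = np.array([1.8, 100.0, 10_000.0])

# Raw Euclidean distance is dominated entirely by income
print(np.linalg.norm(a - b))            # ~190000.0

# Min-max scale each attribute to [0, 1] using the ranges from the slide
mins = np.array([1.5, 60.0, 10_000.0])
maxs = np.array([1.8, 100.0, 200_000.0])
a_s, b_s = (a - mins) / (maxs - mins), (b - mins) / (maxs - mins)
print(np.linalg.norm(a_s - b_s))        # ~1.732: all three attributes now contribute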
Nearest Neighbor Classification…
• Problem with Euclidean measure:
– High dimensional data
• curse of dimensionality: all vectors are almost equidistant to the query vector
– Can produce undesirable results, e.g. for binary vectors (demonstrated in the sketch below):
111111111110 vs 011111111111 → d = 1.4142
100000000000 vs 000000000001 → d = 1.4142
• Each pair differs in exactly two bits, so the almost-identical first pair and the completely disjoint second pair come out equally far apart.
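A quick experiment confirming the concentration effect: as dimensionality grows, the nearest and farthest of 1000 random points end up almost the same distance from the query (the setup is our own):

import numpy as np

rng = np.random.default_rng(42)
for dim in (2, 10, 100, 1000):
    points = rng.random((1000, dim))
    query = rng.random(dim)
    d = np.linalg.norm(points - query, axis=1)
    # relative spread shrinks toward 0 as dim grows: everything is "equidistant"
    print(f"dim={dim:4d}  (max-min)/min = {(d.max() - d.min()) / d.min():.3f}")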