ML-Lec7
Overview
• Introduction to instance-based learning
• K-Nearest Neighbor

Instance-based Learning - Overview
• In contrast to learning methods that construct a general, explicit description of the target function when training examples are provided, instance-based learning methods simply store the training examples.

Instance-based Learning - Overview
1. Instance-based learning methods are sometimes referred to as delayed/lazy learning methods because they delay processing until a new instance must be classified.
2. A key advantage of delayed/lazy learning is that instead of estimating the target function once for the entire instance space, these methods can estimate it locally and differently for each new instance to be classified.

Instance-based Learning - Overview
• Instance-based learning methods such as Nearest Neighbor are conceptually straightforward approaches to approximating real-valued or discrete-valued target functions.
1. Learning in these algorithms consists of simply storing the presented training data.
2. When a new query instance is encountered, a set of similar related instances is retrieved from memory and used to classify the new query instance.
Instance-based Learning - Overview
• Many techniques construct only a local approximation to the target function that applies in the neighborhood of the new query instance, and never construct an approximation designed to perform well over the entire instance space.
• This has significant advantages when the target function is very complex, but can still be described by a collection of less complex local approximations.

Instance-based Learning - Disadvantages
1. One disadvantage of instance-based approaches is that the cost of classifying new instances can be high.
• This is because nearly all computation takes place at classification time rather than when the training examples are first encountered.
K-nearest Neighbor Learning (Euclidean distance)
• Let an arbitrary instance x be described by the feature vector ⟨a1(x), a2(x), ..., an(x)⟩, where ar(x) denotes the value of the r-th attribute of instance x.
• The distance between two instances xi and xj is then defined to be

$$d(x_i, x_j) = \sqrt{\sum_{r=1}^{n} \bigl(a_r(x_i) - a_r(x_j)\bigr)^2}$$

K-nearest Neighbor Learning (output type)
• In nearest-neighbor learning the target function may be either discrete-valued or real-valued.
• The algorithm for approximating a discrete-valued target function is given on the next page ->
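To make the distance measure concrete, here is a minimal Python sketch of the Euclidean distance above; the function name euclidean_distance and the tuple representation of instances are illustrative choices, not part of the lecture.

```python
import math

def euclidean_distance(xi, xj):
    # d(xi, xj) = sqrt of the sum over r of (a_r(xi) - a_r(xj))^2,
    # where each instance is a tuple of its n attribute values.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(xi, xj)))

# Example: sqrt((1-4)^2 + (2-6)^2 + (3-3)^2) = 5.0
print(euclidean_distance((1.0, 2.0, 3.0), (4.0, 6.0, 3.0)))
```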
K-nearest Neighbor Algorithm for approximating a discrete-valued function f : ℝⁿ → V
• Training algorithm:
  • For each training example ⟨x, f(x)⟩, add the example to the list training_examples.
• Classification algorithm:
  • Given a query instance xq to be classified,
  • Let x1, ..., xk denote the k instances from training_examples that are nearest to xq.
  • Return

$$\hat{f}(x_q) \leftarrow \underset{v \in V}{\operatorname{argmax}} \sum_{i=1}^{k} \delta(v, f(x_i))$$

where δ(a, b) = 1 if a = b and δ(a, b) = 0 otherwise.

K-nearest Neighbor Algorithm for approximating a discrete-valued function f : ℝⁿ → V
• As shown there, the value f̂(xq) returned by this algorithm as its estimate of f(xq) is simply the most common value of f among the k training examples nearest to xq.
• If we choose k = 1, the 1-NEAREST NEIGHBOR algorithm assigns to f̂(xq) the value f(xi), where xi is the training instance nearest to xq.
• For larger values of k, the algorithm assigns the most common value among the k nearest training examples.
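A minimal sketch of the discrete-valued algorithm above, reusing the euclidean_distance helper from the earlier sketch. Here training_examples is assumed to be a list of ⟨x, f(x)⟩ pairs, and ties in the vote are broken arbitrarily.

```python
from collections import Counter

def knn_classify(training_examples, xq, k):
    # Keep the k training examples <x, f(x)> nearest to the query xq.
    neighbors = sorted(training_examples,
                       key=lambda ex: euclidean_distance(ex[0], xq))[:k]
    # argmax over v in V of sum_i delta(v, f(xi)) is a majority vote.
    votes = Counter(fx for _, fx in neighbors)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: two clusters labeled "+" and "-".
data = [((1.0, 1.0), "+"), ((1.2, 0.8), "+"),
        ((4.0, 4.0), "-"), ((4.2, 3.9), "-")]
print(knn_classify(data, (1.1, 1.0), k=3))  # -> "+"
```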
K-nearest Neighbor Algorithm for approximating a real-valued function
• The algorithm is easily adapted to approximating a continuous-valued target function: we replace the final step above by the mean of the f values of the k nearest training examples:

$$\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} f(x_i)}{k}$$
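A sketch of this real-valued variant, again assuming the euclidean_distance helper and the ⟨x, f(x)⟩ pair format from the previous sketches:

```python
def knn_regress(training_examples, xq, k):
    # f_hat(xq) = mean of f(xi) over the k nearest neighbors.
    neighbors = sorted(training_examples,
                       key=lambda ex: euclidean_distance(ex[0], xq))[:k]
    return sum(fx for _, fx in neighbors) / k

# Hypothetical 1-D data sampled from f(x) = x^2.
data = [((0.0,), 0.0), ((1.0,), 1.0), ((2.0,), 4.0), ((3.0,), 9.0)]
print(knn_regress(data, (1.4,), k=2))  # mean of f at x=1 and x=2 -> 2.5
```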
Distance-Weighted Nearest Neighbor
• One refinement of the k-NEAREST NEIGHBOR algorithm is to weight the contribution of each of the k neighbors according to their distance to the query point xq, giving greater weight to closer neighbors.
• For a discrete-valued target function:

$$\hat{f}(x_q) \leftarrow \underset{v \in V}{\operatorname{argmax}} \sum_{i=1}^{k} w_i \, \delta(v, f(x_i)), \quad \text{where } w_i = \frac{1}{d(x_q, x_i)^2}$$

• We can distance-weight the instances for real-valued target functions in a similar way:

$$\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} w_i f(x_i)}{\sum_{i=1}^{k} w_i}, \quad \text{where } w_i = \frac{1}{d(x_q, x_i)^2}$$

• If xq exactly matches one of the training instances xi, the denominator d(xq, xi)² is zero; in this case we assign f̂(xq) to be f(xi).
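The two distance-weighted formulas translate directly into code. A sketch under the same assumptions as before (euclidean_distance helper, ⟨x, f(x)⟩ pairs); the exact-match branches implement the zero-denominator rule from the slide:

```python
from collections import defaultdict

def dw_knn_classify(training_examples, xq, k):
    # f_hat(xq) = argmax_v sum_i w_i * delta(v, f(xi)), w_i = 1/d(xq, xi)^2.
    neighbors = sorted(training_examples,
                       key=lambda ex: euclidean_distance(ex[0], xq))[:k]
    scores = defaultdict(float)
    for x, fx in neighbors:
        d2 = euclidean_distance(x, xq) ** 2
        if d2 == 0.0:
            return fx          # xq matches xi exactly: f_hat(xq) = f(xi)
        scores[fx] += 1.0 / d2
    return max(scores, key=scores.get)

def dw_knn_regress(training_examples, xq, k):
    # f_hat(xq) = sum_i w_i * f(xi) / sum_i w_i, w_i = 1/d(xq, xi)^2.
    neighbors = sorted(training_examples,
                       key=lambda ex: euclidean_distance(ex[0], xq))[:k]
    num = den = 0.0
    for x, fx in neighbors:
        d2 = euclidean_distance(x, xq) ** 2
        if d2 == 0.0:
            return fx          # zero denominator: return f(xi) directly
        w = 1.0 / d2
        num += w * fx
        den += w
    return num / den
```

Because distant examples receive vanishing weights, the same code would also work if the neighbor cutoff were dropped and all training examples were considered, at extra cost; this is the global variant discussed in the remark below.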
Remark
• Note that all of the above variants of the k-NEAREST NEIGHBOR algorithm consider only the k nearest neighbors to classify the query point.
• Once we add distance weighting, there is no harm in allowing all training examples to have an influence, because very distant examples will have very little effect on f̂(xq).
• The only disadvantage of considering all examples is that the classifier will run more slowly.
• If all training examples are considered when classifying a new query instance, we call the algorithm a global method.
• If only the nearest training examples are considered, we call it a local method.

Remark
• The distance-weighted k-NEAREST NEIGHBOR algorithm is a highly effective inductive inference method for many practical problems.
• It is robust to noisy training data and quite effective when it is provided a sufficiently large set of training data.
• Note that by taking the weighted average of the k neighbors nearest to the query point, it can smooth out the impact of isolated noisy training examples.