
Neighbourhood components analysis

Neighbourhood components analysis is a supervised learning method for classifying multivariate data
into distinct classes according to a given distance metric over the data. Functionally, it serves the same
purposes as the K-nearest neighbors algorithm, and makes direct use of a related concept termed stochastic
nearest neighbours.

Definition
Neighbourhood components analysis aims at "learning" a distance metric by finding a linear transformation of input data such that the average leave-one-out (LOO) classification performance is maximized in the transformed space. The key insight to the algorithm is that a matrix $A$ corresponding to the transformation can be found by defining a differentiable objective function for $A$, followed by the use of an iterative solver such as conjugate gradient descent. One of the benefits of this algorithm is that the number of classes $k$ can be determined as a function of $A$, up to a scalar constant. This use of the algorithm therefore addresses the issue of model selection.

Explanation
In order to define $A$, we define an objective function describing classification accuracy in the transformed space and try to determine $A^*$ such that this objective function is maximized:

$$A^* = \operatorname*{arg\,max}_A f(A)$$

Leave-one-out (LOO) classification

Consider predicting the class label of a single data point by consensus of its $k$-nearest neighbours with a given distance metric. This is known as leave-one-out classification. However, the set of nearest neighbours can be quite different after passing all the points through a linear transformation. Specifically, the set of neighbours for a point can undergo discrete changes in response to smooth changes in the elements of $A$, implying that any objective function $f(\cdot)$ based on the neighbours of a point will be piecewise-constant, and hence not differentiable.

Solution

We can resolve this difficulty by using an approach inspired by stochastic gradient descent. Rather than considering the $k$-nearest neighbours at each transformed point in LOO-classification, we'll consider the entire transformed data set as stochastic nearest neighbours. We define these using a softmax function of the squared Euclidean distance between a given LOO-classification point and each other point in the transformed space:

$$p_{ij} = \begin{cases} \dfrac{e^{-\|Ax_i - Ax_j\|^2}}{\sum_{k \neq i} e^{-\|Ax_i - Ax_k\|^2}}, & j \neq i \\ 0, & j = i \end{cases}$$

The probability of correctly classifying data point $i$ is the probability of classifying the points of each of its neighbours $C_i = \{ j \mid c_j = c_i \}$, the set of points sharing the same class $c_i$:

$$p_i = \sum_{j \in C_i} p_{ij}$$

where $p_{ij}$ is the probability of classifying neighbour $j$ of point $i$.
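Concretely, these quantities can be sketched in a few lines of NumPy. This is a minimal illustration rather than the authors' reference code; the names (neighbour_probabilities, p_correct, the transformation A, data X, labels y) are assumptions of this sketch:

import numpy as np

def neighbour_probabilities(A, X):
    # Transform all points: z_i = A x_i  (A: (d_out, d), X: (n, d)).
    Z = X @ A.T
    # Squared Euclidean distances between every pair of transformed points.
    d2 = np.sum((Z[:, None, :] - Z[None, :, :]) ** 2, axis=-1)
    # A point is never its own stochastic neighbour: force p_ii = 0.
    np.fill_diagonal(d2, np.inf)
    # Softmax over all other points (a robust version would subtract the
    # row-wise minimum distance before exponentiating to avoid underflow).
    e = np.exp(-d2)
    return e / np.sum(e, axis=1, keepdims=True)

def p_correct(A, X, y):
    # p_i: total probability mass on neighbours sharing point i's class,
    # i.e. the sum of p_ij over j in C_i.
    p = neighbour_probabilities(A, X)
    same_class = y[:, None] == y[None, :]
    return np.sum(p * same_class, axis=1)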

Define the objective function using LOO classification, this time using the entire data set as stochastic nearest neighbours:

$$f(A) = \sum_i \sum_{j \in C_i} p_{ij} = \sum_i p_i$$

Note that under stochastic nearest neighbours, the consensus class for a single point $i$ is the expected value of a point's class in the limit of an infinite number of samples drawn from the distribution over its neighbours $j \in C_i$, i.e.: $P(\mathrm{Class}(X_i) = \mathrm{Class}(X_j)) = p_{ij}$. Thus the predicted class is an affine combination of the classes of every other point, weighted by the softmax function for each $j$, where $C_i$ is now the entire transformed data set.

This choice of objective function is preferable as it is differentiable with respect to $A$ (denote $x_{ij} = x_i - x_j$):

$$\frac{\partial f}{\partial A} = -2A \sum_i \sum_{j \in C_i} p_{ij} \left( x_{ij} x_{ij}^\top - \sum_k p_{ik} x_{ik} x_{ik}^\top \right) = 2A \sum_i \left( p_i \sum_k p_{ik} x_{ik} x_{ik}^\top - \sum_{j \in C_i} p_{ij} x_{ij} x_{ij}^\top \right)$$
Obtaining a gradient for $f$ means that it can be found with an iterative solver such as conjugate gradient descent. Note that in practice, most of the innermost terms of the gradient contribute insignificantly, because the influence of points distant from the point of interest diminishes rapidly. This means that the inner sum of the gradient can be truncated, resulting in reasonable computation times even for large data sets.

Alternative formulation

"Maximizing is equivalent to minimizing the -distance between the predicted class distribution and
the true class distribution (ie: where the induced by are all equal to 1). A natural alternative is the KL-
divergence, which induces the following objective function and gradient:" (Goldberger 2005)
In practice, optimization of using this function tends to give similar performance results as with the
original.
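Under the same assumptions as the earlier sketches (reusing p_correct and NumPy as np), this variant only swaps the objective; the guard against log(0) is an implementation detail of the sketch, not of the paper:

def nca_log_objective(a_flat, X, y, d_out):
    # g(A) = sum_i log p_i, negated for a minimizer; without an analytic
    # gradient, a solver would fall back to numerical differentiation.
    A = a_flat.reshape(d_out, X.shape[1])
    p_i = p_correct(A, X, y)
    return -np.sum(np.log(p_i + 1e-12))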

History and background


Neighbourhood components analysis was developed by Jacob Goldberger, Sam Roweis, Ruslan Salakhutdinov, and Geoff Hinton at the University of Toronto's department of computer science in 2004.

See also
Spectral clustering
Large margin nearest neighbor

References
J. Goldberger, G. Hinton, S. Roweis, R. Salakhutdinov (2005). "Neighbourhood Components Analysis" (http://www.csri.utoronto.ca/~roweis/papers/ncanips.pdf). Advances in Neural Information Processing Systems 17, 513–520.

External links

Software
The MLPACK library contains a C++ implementation
nca (https://github.com/vomjom/nca) (C++)
scikit-learn's "NeighborhoodComponentsAnalysis" implementation (https://scikit-learn.org/stable/modules/generated/sklearn.neighbors.NeighborhoodComponentsAnalysis.html) (Python)
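For practical use, the scikit-learn estimator can be placed ahead of a k-nearest-neighbours classifier in a pipeline. A minimal usage sketch (the data set and hyperparameters here are arbitrary choices, not recommendations):

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier, NeighborhoodComponentsAnalysis
from sklearn.pipeline import Pipeline

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
pipe = Pipeline([
    ("nca", NeighborhoodComponentsAnalysis(n_components=2, random_state=0)),
    ("knn", KNeighborsClassifier(n_neighbors=3)),
])
pipe.fit(X_train, y_train)          # learns the transformation, then fits k-NN in the new space
print(pipe.score(X_test, y_test))   # k-NN accuracy in the learned space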
