Lecture 3
by
Rajdip Nayek
Assistant Professor,
Applied Mechanics Department,
IIT Delhi
End goal will be to construct an output prediction 𝑦ො(𝐱∗) for an unseen input 𝐱∗ so that it is close to 𝑦∗
▪ Bias makes algorithms easier to understand but generally less flexible
▪ Low bias: Suggests fewer assumptions about the function 𝑓
▪ High bias: Suggests more assumptions about the function 𝑓
▪ Machine learning algorithms that have a high variance are strongly influenced by the specifics
of the training data
▪ Low variance: Suggests small changes to the estimated function 𝑓 with changes to the training dataset
▪ High variance: Suggests large changes to the estimated function 𝑓 with changes to the training dataset
Overfit vs Underfit
▪ Overfitting refers to the phenomenon when a model fits the training data “too well”
▪ It happens when a model learns the details and noise in the training data. This means that the noise or random
fluctuations in the training data are picked up and learned as concepts by the model.
▪ Models that have high variance and low bias lead to overfitting
▪ Does not generalize to new unseen data well
▪ Underfitting refers to the phenomenon when a model is unable to fit to the training data
▪ It happens when a model is “too rigid”
▪ Models that have high bias and low variance lead to underfitting
▪ Does not generalize to new unseen data well
Introduction to 𝑘-Nearest Neighbours (𝑘-NN)
▪ We will start with the relatively simple 𝑘-nearest neighbours (𝑘-NN) method.
Can be used for both regression and classification
▪ Most ML algorithms are based on the intuition that if the unseen data point 𝐱∗ is close to a training data point 𝐱^(𝑖), then
the prediction 𝑦ො(𝐱∗) should be close to 𝑦^(𝑖).
▪ A simple way to implement this idea is to find the “nearest” training data point
▪ Compute the Euclidean distance between the unseen input and all training inputs.
▪ Find the data point 𝐱^(𝑗) with the shortest distance to 𝐱∗, and use its output as the prediction, 𝑦ො(𝐱∗) = 𝑦^(𝑗)
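A minimal sketch of this nearest-neighbour rule in Python with NumPy (the array names X_train, y_train and x_star are illustrative, not from the slides):

    import numpy as np

    def one_nn_predict(X_train, y_train, x_star):
        """Return the label of the training point closest to x_star."""
        # Euclidean distance from x_star to every training input (length-N vector)
        dists = np.linalg.norm(X_train - x_star, axis=1)
        j = np.argmin(dists)   # index of the closest training point x^(j)
        return y_train[j]      # prediction y_hat(x_star) = y^(j)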
▪ Shortcoming: 1-nearest neighbour algorithm is sensitive to noise in data and mis-labelled data
[Two figure panels: “Every test example in the blue shaded area will be mis-classified as the blue class” and “Every test example in the blue shaded area will be classified as the red class”]
▪ How to improve: Use 𝑘-nearest neighbours to obtain a majority vote (or take an average)
Introduction to 𝑘-Nearest Neighbours (𝑘-NN)
𝒌-NN algorithm
Data: Training data {(𝐱^(𝑖), 𝑦^(𝑖))}, 𝑖 = 1, …, 𝑁, and an unseen (test) input 𝐱∗
▪ 𝑘-NN is a non-parametric algorithm; makes no assumptions about the functional form and has no fixed set
of parameters. Uses the entire training data when making predictions
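Since the algorithm box on this slide did not survive extraction, here is a sketch of the standard 𝑘-NN prediction rule it refers to (Python/NumPy; a majority vote for classification, an average for regression; the names are illustrative):

    import numpy as np
    from collections import Counter

    def knn_predict(X_train, y_train, x_star, k=3, classification=True):
        """k-NN prediction for a single test input x_star."""
        # 1. Compute the Euclidean distance to all N training inputs
        dists = np.linalg.norm(X_train - x_star, axis=1)
        # 2. Pick out the k nearest training points
        nn_idx = np.argsort(dists)[:k]
        nn_outputs = [y_train[i] for i in nn_idx]
        if classification:
            # 3a. Classification: majority vote among the k nearest labels
            return Counter(nn_outputs).most_common(1)[0][0]
        # 3b. Regression: average of the k nearest outputs
        return float(np.mean(nn_outputs))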
Example of 𝑘NN for binary classification
Training data:
𝑖   𝑥1   𝑥2   𝑦
1   -1    3   Red
2    2    1   Blue
3   -2    2   Red
4   -1    2   Blue
5   -1    0   Blue
6    1    1   Red

Squared Euclidean distance ‖𝐱^(𝑖) − 𝐱∗‖² of each training point to the test input 𝐱∗, sorted from nearest to farthest:
𝑖   ‖𝐱^(𝑖) − 𝐱∗‖²   𝑦^(𝑖)
6    1   Red
2    2   Blue
4    4   Blue
1    5   Red
5    8   Blue
3    9   Red
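▪ Reading off the sorted table: with 𝑘 = 1 the nearest neighbour is 𝑖 = 6 (Red), so 𝐱∗ is predicted Red
▪ With 𝑘 = 3 the nearest neighbours are 𝑖 = 6 (Red), 𝑖 = 2 (Blue) and 𝑖 = 4 (Blue), so the majority vote predicts Blue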
Decision boundary of a classifier
▪ Decision boundaries are the points in input space where the class prediction changes, that is, the borders between different classes

▪ They can help to understand a classifier and give a concise summary of a classifier

Training data:
𝑖   𝑥1   𝑥2   𝑦
1   -1    3   Red
2    2    1   Blue
3   -2    2   Red
4   -1    2   Blue
5   -1    0   Blue
6    1    1   Red
How to choose 𝑘?
▪ The number of neighbours 𝑘 is chosen by the user
[Figure: 𝑘-NN decision boundaries for 𝑘 = 1 and 𝑘 = 15]
▪ The choice of hyperparameter 𝑘 has a big impact on the predictions made by 𝑘-NN
▪ Small 𝑘
• Good at capturing fine-grained patterns
• May overfit, i.e. be sensitive to random errors in the training data
▪ Large 𝑘
• Makes stable predictions by averaging over lots of samples
• May underfit, i.e. fail to capture important patterns
▪ Balancing 𝑘 (trade-off between flexibility and rigidity)
• Optimal choice of 𝑘 depends on the number of data points 𝑁
• Rule of thumb: choose 𝑘 < √𝑁
• We can choose 𝑘 using cross-validation
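As a concrete illustration, one way to choose 𝑘 by cross-validation using scikit-learn (the library and the candidate values of 𝑘 are assumptions, not prescribed by the slides):

    import numpy as np
    from sklearn.model_selection import cross_val_score
    from sklearn.neighbors import KNeighborsClassifier

    def choose_k(X_train, y_train, candidate_ks=(1, 3, 5, 7, 9, 15), n_folds=5):
        """Return the k with the highest mean cross-validated accuracy."""
        mean_scores = {}
        for k in candidate_ks:
            knn = KNeighborsClassifier(n_neighbors=k)
            # Mean accuracy over n_folds cross-validation folds of the training data
            mean_scores[k] = cross_val_score(knn, X_train, y_train, cv=n_folds).mean()
        return max(mean_scores, key=mean_scores.get)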
Validation and Test sets
▪ We can tune the hyperparameters using a validation set
▪ The test set is used only at the very end, to measure the generalization performance of the algorithm.
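A sketch of this protocol on toy data (scikit-learn and the split fractions are assumptions; the key point is that the test set is touched only once):

    import numpy as np
    from sklearn.model_selection import train_test_split
    from sklearn.neighbors import KNeighborsClassifier

    # Toy dataset, used only to make the sketch runnable
    rng = np.random.default_rng(0)
    X = rng.normal(size=(300, 2))
    y = (X[:, 0] + X[:, 1] > 0).astype(int)

    # 60% train / 20% validation / 20% test (illustrative split)
    X_tmp, X_test, y_tmp, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
    X_train, X_val, y_train, y_val = train_test_split(X_tmp, y_tmp, test_size=0.25, random_state=0)

    # Tune the hyperparameter k on the validation set only
    best_k, best_acc = None, -1.0
    for k in (1, 3, 5, 7, 9, 15):
        acc = KNeighborsClassifier(n_neighbors=k).fit(X_train, y_train).score(X_val, y_val)
        if acc > best_acc:
            best_k, best_acc = k, acc

    # The test set is used exactly once, at the very end
    test_acc = KNeighborsClassifier(n_neighbors=best_k).fit(X_train, y_train).score(X_test, y_test)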
Pitfalls of 𝑘NN: Curse of dimensionality
▪ 𝑘NN works well when the input dimension is small (e.g. 2-3), but struggles when the input dimension is high
▪ In high dimensions, “most” points are far apart and are approximately at the same distance
▪ Hence, our intuition that works for distances in 2- and 3- dimensional spaces breaks down in higher dimensions
▪ We can show this by applying the rules of expectation and covariance of random variables (HW maybe)
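A small numerical check of this claim (a sketch; the number of points and the dimensions are arbitrary choices): as the dimension 𝑝 grows, the nearest and farthest neighbours of a point end up at nearly the same distance.

    import numpy as np

    rng = np.random.default_rng(0)
    for p in (2, 10, 100, 1000):
        X = rng.uniform(size=(1000, p))           # 1000 points uniform in the unit hypercube
        d = np.linalg.norm(X[1:] - X[0], axis=1)  # distances from the first point to all others
        print(f"p={p:4d}  nearest={d.min():.3f}  farthest={d.max():.3f}  ratio={d.max()/d.min():.2f}")
    # The farthest/nearest ratio shrinks towards 1 as p grows: "most" points are roughly equidistant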
Pitfalls of 𝑘NN: Normalization
▪ 𝑘NN can be quite sensitive to the range of the input features
▪ Example: 𝐱 = [𝑥1, 𝑥2]ᵀ, where 𝑥1 is in the range [100, 1000] and 𝑥2 is in the range [0, 1] (or vice-versa)
▪ The Euclidean distance between a test point 𝐱∗ and a training data point 𝐱^(𝑖) is √[(𝑥1^(𝑖) − 𝑥1∗)² + (𝑥2^(𝑖) − 𝑥2∗)²]
▪ The Euclidean distance is dominated by the first term (𝑥1^(𝑖) − 𝑥1∗)², simply due to the larger magnitude of 𝑥1
▪ Thus, the variable 𝑥1 is treated as much more important than 𝑥2 by 𝑘NN
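▪ For instance (hypothetical numbers), with 𝐱^(𝑖) = [600, 0.2]ᵀ and 𝐱∗ = [400, 0.9]ᵀ the squared distance is 200² + 0.7² = 40000.49, so the contribution of 𝑥2 is negligible regardless of how informative it is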
Pitfalls of 𝑘NN: Normalization
▪ 𝑘NN can be sensitive to the ranges of the input features
▪ Simple fix: Standardize each dimension using mean and standard deviation of data
▪ 𝑥ҧ𝑗^(𝑖) = (𝑥𝑗^(𝑖) − 𝜇𝑗) / 𝜎𝑗 for all 𝑖 = 1, 2, ⋯ , 𝑁 and 𝑗 = 1, 2, ⋯ , 𝑝
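A sketch of this standardization in NumPy (array names are illustrative; 𝜇𝑗 and 𝜎𝑗 are computed from the training inputs and the same transform is applied to the test inputs):

    import numpy as np

    def standardize(X_train, X_test):
        """Standardize each feature using the training-set mean and standard deviation."""
        mu = X_train.mean(axis=0)      # mu_j for each feature j
        sigma = X_train.std(axis=0)    # sigma_j for each feature j (assumed non-zero)
        return (X_train - mu) / sigma, (X_test - mu) / sigma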
Pitfalls of 𝑘NN: Computationally costly
▪ Computational cost at training time: zero (𝑘NN does not learn a model during training)
▪ Computational cost at test time, per test data point
▪ Calculate 𝑝-dimensional Euclidean distances with 𝑁 data points: 𝒪(𝑁𝑝)
▪ Sort the distances: 𝒪(𝑁 log 𝑁)
▪ This must be done for each test data point, which is very expensive by the standards of a learning algorithm!
▪ Need to store the entire dataset in memory!
▪ Gives decent accuracy when there is lots of data
▪ 𝑘NN stores the entire training dataset in memory which it uses as its representation
▪ 𝑘NN does not learn any model
▪ 𝑘NN makes predictions just-in-time by calculating the similarity between a test input and each training sample
▪ There are many distance measures to choose from to match the structure of your input data
▪ It is a good idea to rescale your data, such as using normalization, when using 𝑘NN