Lec16 - Autoencoders

Autoencoders are unsupervised neural networks designed to reproduce their input. They consist of an encoder that compresses the input into a latent-space representation and a decoder that reconstructs the input from the latent space. Variations include denoising autoencoders, which add noise to the input to learn a more robust representation, and sparse autoencoders, which add regularization so that only a few nodes in the latent space activate. Contractive autoencoders constrain the latent representation to be insensitive to small perturbations of the input.

Autoencoders

• Supervised learning uses explicit labels/correct outputs
in order to train a network.
• E.g., classification of images.

• Unsupervised learning relies on the data only.
• E.g., CBOW and skip-gram word embeddings: the output is
determined implicitly from word order in the input data.
• Key point is to produce a useful embedding of words.
• The embedding encodes structure such as word similarity
and some relationships.
• Still need to define a loss – this is an implicit form of supervision.
Autoencoders
• Autoencoders are designed to reproduce their
input, especially for images.
• Key point is to reproduce the input from a learned
encoding.

https://www.edureka.co/blog/autoencoders-tutorial/
Autoencoders
• Compare with PCA/SVD:
• PCA takes a collection of vectors (images) and produces a
usually smaller set of vectors that can be used to
approximate the input vectors via linear combination.
• Very efficient for certain applications.
• Fourier and wavelet compression is similar.

• Neural network autoencoders
• Can learn nonlinear dependencies
• Can use convolutional layers
• Can use transfer learning

https://www.edureka.co/blog/autoencoders-tutorial/
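For comparison, a minimal sketch (not from the slides) of PCA-style compression and linear reconstruction via truncated SVD; the random data matrix and the choice of k = 32 components are purely illustrative.

import numpy as np

# X: n_images x n_pixels matrix of flattened images (rows are samples)
X = np.random.rand(100, 784)           # stand-in data for illustration
X_mean = X.mean(axis=0)
X_centered = X - X_mean                # PCA operates on mean-centered data

# Truncated SVD: keep the top k right singular vectors as the "latent space"
k = 32
U, S, Vt = np.linalg.svd(X_centered, full_matrices=False)
codes = X_centered @ Vt[:k].T          # encode: project onto the top-k components
X_approx = codes @ Vt[:k] + X_mean     # decode: linear combination of components

print("mean squared reconstruction error:", np.mean((X - X_approx) ** 2))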
Autoencoders: structure
• Encoder: compress the input into a latent space of
usually smaller dimension: h = f(x).
• Decoder: reconstruct the input from the latent space:
r = g(f(x)), with r as close to x as possible.

https://towardsdatascience.com/deep-inside-autoencoders-7e41f319999f
Autoencoders: applications
• Denoising: input clean image + noise and train to
reproduce the clean image.

https://www.edureka.co/blog/autoencoders-tutorial/
Autoencoders: Applications
• Image colorization: input black-and-white images and train
to produce color images.

https://www.edureka.co/blog/autoencoders-tutorial/
Autoencoders: Applications
• Watermark removal

https://www.edureka.co/blog/autoencoders-tutorial/
Properties of Autoencoders
• Data-specific: Autoencoders are only able to
compress data similar to what they have been
trained on.
• Lossy: The decompressed outputs will be degraded
compared to the original inputs.
• Learned automatically from examples: It is easy to
train specialized instances of the algorithm that will
perform well on a specific type of input.

https://www.edureka.co/blog/autoencoders-tutorial/
Capacity
• As with other NNs, overfitting is a problem when
capacity is too large for the data.

• Autoencoders address this through some combination of:
• Bottleneck layer – fewer degrees of freedom than there are
possible outputs.
• Training to denoise.
• Sparsity through regularization.
• Contractive penalty.
Bottleneck layer (undercomplete)
• Suppose input images are n×n and the latent space has
dimension m < n×n.
• Then the latent space is not sufficient to reproduce
all images exactly.
• The network needs to learn an encoding that captures the
important features in the training data, sufficient for
approximate reconstruction.
Simple bottleneck layer in Keras
from keras.layers import Input, Dense
from keras.models import Model

input_img = Input(shape=(784,))                              # flattened 28x28 image
encoding_dim = 32                                            # size of the latent space
encoded = Dense(encoding_dim, activation='relu')(input_img)  # encoder
decoded = Dense(784, activation='sigmoid')(encoded)          # decoder
autoencoder = Model(input_img, decoded)
• The encoder maps flattened 28x28 images (784 values) into a 32-dimensional vector.
• Can also use more layers and/or convolutions.

https://blog.keras.io/building-autoencoders-in-keras.html
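A minimal training sketch for the model above, in the style of the Keras blog tutorial linked here; the optimizer, loss, and epoch count are illustrative choices rather than values from the slides.

from keras.datasets import mnist

# Load MNIST, scale pixels to [0, 1], and flatten each 28x28 image to 784 values
(x_train, _), (x_test, _) = mnist.load_data()
x_train = x_train.astype('float32') / 255.0
x_test = x_test.astype('float32') / 255.0
x_train = x_train.reshape((len(x_train), 784))
x_test = x_test.reshape((len(x_test), 784))

# Train the autoencoder defined above to reproduce its own input
autoencoder.compile(optimizer='adam', loss='binary_crossentropy')
autoencoder.fit(x_train, x_train,
                epochs=50, batch_size=256, shuffle=True,
                validation_data=(x_test, x_test))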
Denoising autoencoders
• Basic autoencoder trains to minimize the loss
between x and the reconstruction g(f(x)).
• Denoising autoencoders train to minimize the loss
between x and g(f(x+w)), where w is random noise.
• Same possible architectures, different training data.

• Kaggle has a dataset on damaged documents.

https://blog.keras.io/building-autoencoders-in-keras.html
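A minimal sketch of the corresponding change in training data, assuming the autoencoder model and the x_train / x_test arrays from the earlier sketch; the noise level 0.5 is an illustrative choice.

import numpy as np

# Corrupt the inputs with Gaussian noise, but keep the clean images as targets
noise_factor = 0.5
x_train_noisy = np.clip(x_train + noise_factor * np.random.normal(size=x_train.shape), 0.0, 1.0)
x_test_noisy = np.clip(x_test + noise_factor * np.random.normal(size=x_test.shape), 0.0, 1.0)

# Same architecture as before: only the training pairs change, (x + w) -> x
autoencoder.fit(x_train_noisy, x_train,
                epochs=50, batch_size=256, shuffle=True,
                validation_data=(x_test_noisy, x_test))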
Denoising autoencoders
• Denoising autoencoders can’t simply memorize the
input–output relationship.
• Intuitively, a denoising autoencoder learns a
projection from a neighborhood of our training
data back onto the training data.

https://ift6266h17.files.wordpress.com/2017/03/14_autoencoders.pdf
Sparse autoencoders
• Construct a loss function to penalize activations
within a layer.
• Note that we usually regularize the weights of a network,
not the activations.
• Which individual nodes of a trained model activate
is data-dependent.
• Different inputs will result in activations of different
nodes through the network.
• The network selectively activates regions
depending on the input data.

https://www.jeremyjordan.me/autoencoders/
Sparse autoencoders
• Construct a loss function to penalize activations in the
network.
• L1 regularization: penalize the absolute values of the
vector of activations a in layer h for observation i.
• KL divergence: use the cross-entropy between the average
activation and the desired activation.
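In standard notation (reconstructed here rather than copied from the slide), with \hat{x} the reconstruction, a_i^{(h)} the activations of layer h, \rho the desired average activation, and \hat{\rho}_j the average activation of hidden unit j over the data, the two penalized losses are:

L1 sparsity penalty:  \mathcal{L}(x, \hat{x}) + \lambda \sum_i \left| a_i^{(h)} \right|
KL sparsity penalty:  \mathcal{L}(x, \hat{x}) + \sum_j \mathrm{KL}\left( \rho \,\|\, \hat{\rho}_j \right)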

https://www.jeremyjordan.me/autoencoders/
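A minimal Keras sketch of the L1 variant, in the style of the Keras blog tutorial; the penalty weight 1e-5 and latent size 32 are illustrative choices.

from keras.layers import Input, Dense
from keras.models import Model
from keras import regularizers

input_img = Input(shape=(784,))
# activity_regularizer penalizes the encoded activations themselves,
# pushing most latent units toward zero for any given input
encoded = Dense(32, activation='relu',
                activity_regularizer=regularizers.l1(1e-5))(input_img)
decoded = Dense(784, activation='sigmoid')(encoded)
sparse_autoencoder = Model(input_img, decoded)
sparse_autoencoder.compile(optimizer='adam', loss='binary_crossentropy')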
Contractive autoencoders
• Arrange for similar inputs to have similar activations.
• I.e., the derivatives of the hidden layer activations are
small with respect to the input.
• Denoising autoencoders make the reconstruction function
(encoder + decoder) resist small perturbations of the input.
• Contractive autoencoders make the feature extraction
function (i.e., the encoder) resist infinitesimal perturbations of
the input.

https://www.jeremyjordan.me/autoencoders/
Contractive autoencoders
• Contractive autoencoders make the feature
extraction function (i.e., the encoder) resist infinitesimal
perturbations of the input.

https://ift6266h17.files.wordpress.com/2017/03/14_autoencoders.pdf
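The contractive penalty is usually written as the squared Frobenius norm of the Jacobian of the encoder f with respect to the input, added to the reconstruction loss (standard form from Rifai et al. 2011, reconstructed here rather than taken from the slide):

\mathcal{L}(x, g(f(x))) + \lambda \left\| \frac{\partial f(x)}{\partial x} \right\|_F^2
  = \mathcal{L}(x, g(f(x))) + \lambda \sum_{ij} \left( \frac{\partial h_j}{\partial x_i} \right)^2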
Autoencoders
• Both the denoising and contractive autoencoder can
perform well.
• Advantage of the denoising autoencoder: simpler to implement.
It requires adding only one or two lines of code to a regular
autoencoder, and there is no need to compute the Jacobian of the
hidden layer.
• Advantage of the contractive autoencoder: the gradient is
deterministic. One can use second-order optimizers (conjugate
gradient, L-BFGS, etc.), and it might be more stable than the
denoising autoencoder, which uses a sampled gradient.
• To learn more on contractive autoencoders:
• Contractive Auto-Encoders: Explicit Invariance During Feature
Extraction. Salah Rifai, Pascal Vincent, Xavier Muller, Xavier
Glorot and Yoshua Bengio, 2011.

https://ift6266h17.files.wordpress.com/2017/03/14_autoencoders.pdf
