Understanding Autoencoders in Deep Learning

Uploaded by

atharvagolwalkar16

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views15 pages

Understanding Autoencoders in Deep Learning

Uploaded by

atharvagolwalkar16

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Deep Learning

Dr. Arundhati Das

©Usage of
these
slides on
any media
without
permission
of Dr
Arundhati
Image Courtesy: Internet Das is
strictly
prohibited
Module III: Autoencoders: Unsupervised
Learning
3.1 Introduction, Linear Autoencoder, Undercomplete
Autoencoder, Overcomplete Autoencoders, Regularization in
Autoencoders
3.2 Denoising Autoencoders, Sparse Autoencoders,
Contractive Autoencoders
3.3 Application of Autoencoders: Image Compression
Introduction: Auto-encoders
• An autoencoder is a special type of deep feed forward neural
network which does the following:
• Encodes its input x into a hidden representation h
• Decodes the input again from this hidden representation
• The model is trained to minimize a certain loss function
which will ensure that x’ is close to x
• Basically, an autoencoder contains an encoder and
decoder. These two parts function automatically and
give rise to the name “autoencoder”.
• The basic idea behind autoencoders is to encode the input
data into a lower-dimensional representation (i.e. called
latent space) and then decode it back into the original
format, with the objective of minimizing the reconstruction
error.

• An autoencoder is an unsupervised learning algorithm that

applies backpropagation setting the target values to be equal
to the inputs.
• Applications: dimensionality reduction, data compression as
well as for data reconstruction (data denoising) tasks. x h x’
3
Architecture and components of auto-
encoders (encoder and decoder)
• Autoencoders are simple network, where their
output (target feature) is their input.
• Their goal is to learn how to reconstruct the
input-data.
• The first part of the network is what we refer to
as the Encoder.
• It receives the input and it encodes it in a latent
space of a lower dimension.
• The second part (the Decoder) takes that vector
and decode it in order to produce the original
input.

4
Architecture-encoder, latent space,
decoder
• ENCODER:
• Encoding is achieved by the encoder part of the
network which has a decreasing number of hidden
units in each layer.
• In this way, this part is forced to pick up only the
most significant and representative features of the
data.
• We can implement this phenomenon by connecting
a series of pooling layers, each one reducing the
number of dimensions that are present in the data.
• LATENT SPACE:
• Thus, encoder transforms high-dimensional input
into lower-dimension (latent state, where the input is
more compressed).
5
Architecture-encoder, latent space,
decoder
• The latent vector in the middle is
important and crucial, as it is
a compressed representation of the input.
• It gives plenty of applications for
compression and dimensionality
reduction.
• DECODER:
• The latent vector can now further be used
to reproduce the same but slightly
different or better data. This gives rise to
applications for data denoising and data
augmentation.
6
Components of auto-encoders
• Autoencoder basically comprises of the components called of
encoder, the decoder, latent space and Loss function.
• 1. Encoder: The input data is first passed through an encoder
network, which consists of one or more layers of neurons.
These layers progressively reduce the dimensionality of the
data, creating a compressed representation (latent space) of
the input. The last layer of the encoder typically has fewer
neurons than the input layer, forcing it to capture essential
features and patterns in the data.
• 2. Latent Space: The compressed representation in the latent
space is a lower-dimensional representation of the input data.
This representation should ideally capture the most salient
features of the data. This is also known as bottleneck or code.
• 3. Decoder: The compressed representation is then passed through a decoder network, which aims to
reconstruct the original input from the compressed representation. Like the encoder, the decoder network
consists of one or more layers, and the final layer's output should match the input data's dimensions.
• 4. Loss Function: The performance of the autoencoder is evaluated using a loss function, which quantifies how well
the reconstructed output matches the input. Common loss functions include mean squared error (MSE) or cross-
entropy, depending on the nature of the data.
• A smaller loss means the autoencoder is learning to represent data more accurately.
7
Undercomplete Autoencoder ( )

• It is an autoencoder where the hidden layer has fewer units than the input
layer. The model compresses the input data into a lower-dimensional space
and then attempts to reconstruct the original input from this compressed
representation.
• If we are able to reconstruct perfectly from h, then h can be termed as loss-
free encoding of ; meaning h can capture all the characteristics of
Overcomplete Autoencoder ( )
• It is an autoencoder where the hidden layer has more units than the
input layer. The model expands the input data into a higher-
dimensional space, which allows for a potentially richer and detailed
representation of the data.
• Overcomplete autoencoders do make sense, but only when
combined with constraints that stop them from just copying the
input. With proper regularization, they can capture more nuanced,
high-dimensional structure in the data than an undercomplete
one.
• Encourage only a few neurons in the latent vector to be active for
any given input.
• This forces the network to learn a distributed, compressed-like
representation.
Auto-encoder advantages
while doing data compression
• Considering the applications for data-compression, autoencoders are preferred over
PCA.
• PCA makes one stringent but powerful assumption that is linearity i.e. there must
be linearity in the data set; which is not the case in real-life datasets.
• PCA is linear because it can only represent data transformations as linear combinations of the
original features
• However, an autoencoder can learn non-linear transformations with a non-linear
activation function and multiple layers.

• It can make use of pre-trained layers from another model to apply transfer learning
to enhance the encoder/decoder.
10
Autoencoder vs PCA
1. A type of neural network trained to 1. A linear statistical method that
learn an efficient compressed projects data into a lower-
representation (encoding) of the input dimensional space while preserving
data, and then reconstruct it. as much variance as possible.
2. Can model non-linear relationships 2. Always linear, transforms data
between features (if using non-linear using orthogonal basis vectors
activations like ReLU, Sigmoid, Tanh). (principal components).
3. Requires training using 3. No training, computed directly
backpropagation to minimize using eigen decomposition or SVD.
reconstruction error. 4. Produces principal components
4. Produces learned features (codes) in (uncorrelated features) ranked by
the bottleneck layer. variance explained.
5. More computationally expensive 5. Less computationally expensive
(gradient descent, multiple epochs). (closed-form solution).
Note: A linear autoencoder with no activation function and mean squared error loss will learn the same subspace
as PCA.
Q1.
Q2.

Q3.

Q4.
Choice of Activation functions
• Choice of f(xi) and g(xi)
Q

Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
40 pages
Applications of Autoencoders in Anomaly Detection
No ratings yet
Applications of Autoencoders in Anomaly Detection
13 pages
Understanding Autoencoders: Types & Uses
No ratings yet
Understanding Autoencoders: Types & Uses
20 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
79 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
42 pages
Recurrent Networks and Autoencoders
No ratings yet
Recurrent Networks and Autoencoders
53 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
248 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
16 pages
Understanding Auto-Encoders in AI
No ratings yet
Understanding Auto-Encoders in AI
47 pages
Module 3 DL
No ratings yet
Module 3 DL
57 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
39 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
58 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
39 pages
Auto Encoder S
No ratings yet
Auto Encoder S
57 pages
Understanding Autoencoders in AI
No ratings yet
Understanding Autoencoders in AI
17 pages
Understanding Autoencoders in AI
No ratings yet
Understanding Autoencoders in AI
12 pages
Understanding Autoencoders in Machine Learning
No ratings yet
Understanding Autoencoders in Machine Learning
39 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
27 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
11 pages
Understanding Autoencoders in Deep Learning
100% (1)
Understanding Autoencoders in Deep Learning
4 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
52 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
35 pages
Understanding Autoencoder Architecture
No ratings yet
Understanding Autoencoder Architecture
36 pages
Autoencoders and Generative Models Overview
No ratings yet
Autoencoders and Generative Models Overview
19 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
51 pages
DL Module 4
No ratings yet
DL Module 4
34 pages
Physics-Informed Neural Networks
No ratings yet
Physics-Informed Neural Networks
53 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
62 pages
7 AIntroductionto Autoencoder
No ratings yet
7 AIntroductionto Autoencoder
10 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
57 pages
Twitter Spam Detection with Autoencoders
No ratings yet
Twitter Spam Detection with Autoencoders
26 pages
DL 2
No ratings yet
DL 2
30 pages
Autoencoder Architecture and Hyperparameters
No ratings yet
Autoencoder Architecture and Hyperparameters
16 pages
Chapter # 3
No ratings yet
Chapter # 3
18 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
43 pages
Understanding Autoencoder Structure
No ratings yet
Understanding Autoencoder Structure
4 pages
AAI Module 3 Types of Auoencoder
No ratings yet
AAI Module 3 Types of Auoencoder
13 pages
01-Unit 3
No ratings yet
01-Unit 3
17 pages
Overview of Undercomplete Autoencoders
No ratings yet
Overview of Undercomplete Autoencoders
20 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
23 pages
Autoencoders: Long-Term Dependency Optimization
No ratings yet
Autoencoders: Long-Term Dependency Optimization
3 pages
Unit 4 Autoencoders
No ratings yet
Unit 4 Autoencoders
23 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
138 pages
Autoencoders and Their Applications
No ratings yet
Autoencoders and Their Applications
25 pages
Autoencoders and Regularization Techniques
No ratings yet
Autoencoders and Regularization Techniques
23 pages
Overview of Autoencoders in ML
No ratings yet
Overview of Autoencoders in ML
11 pages
Autoencoders for Dimensionality Reduction
No ratings yet
Autoencoders for Dimensionality Reduction
103 pages
Understanding Autoencoders for Dimensionality Reduction
No ratings yet
Understanding Autoencoders for Dimensionality Reduction
103 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
22 pages
Autoencoders: Applications and Architecture
No ratings yet
Autoencoders: Applications and Architecture
22 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
26 pages
Denoising Autoencoders Explained
No ratings yet
Denoising Autoencoders Explained
7 pages
Understanding Autoencoders Basics
No ratings yet
Understanding Autoencoders Basics
15 pages
Autoencoders and Generative Models Overview
No ratings yet
Autoencoders and Generative Models Overview
31 pages
Understanding Undercomplete Autoencoders
No ratings yet
Understanding Undercomplete Autoencoders
32 pages
Module 4 - Autoencoders (Ae) & Variational Autoencoders (Vae)
No ratings yet
Module 4 - Autoencoders (Ae) & Variational Autoencoders (Vae)
11 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
29 pages
Understanding Autoencoders: Types and Functions
No ratings yet
Understanding Autoencoders: Types and Functions
52 pages
Autoencoders: Types and Applications
No ratings yet
Autoencoders: Types and Applications
31 pages
PERT: Benefits and Challenges Explained
No ratings yet
PERT: Benefits and Challenges Explained
32 pages
Mathematical Models in Control Systems
100% (2)
Mathematical Models in Control Systems
262 pages
Machine Learning and Deep Learning Course
No ratings yet
Machine Learning and Deep Learning Course
5 pages
One-Time Pad and Public-Key Encryption
No ratings yet
One-Time Pad and Public-Key Encryption
4 pages
Trends in ML for Computational Fluid Dynamics
No ratings yet
Trends in ML for Computational Fluid Dynamics
8 pages
Spark ML Pipeline for Classifying Reviews
No ratings yet
Spark ML Pipeline for Classifying Reviews
11 pages
Understanding Transition Graphs in Automata
No ratings yet
Understanding Transition Graphs in Automata
32 pages
Correlation and Regression Analysis Guide
No ratings yet
Correlation and Regression Analysis Guide
1 page
Bairstow Method Example Math
No ratings yet
Bairstow Method Example Math
4 pages
Escape Room Clue Decoding Guide
No ratings yet
Escape Room Clue Decoding Guide
21 pages
Fake News Detection with Machine Learning
No ratings yet
Fake News Detection with Machine Learning
65 pages
Software Risk and Estimation Techniques
No ratings yet
Software Risk and Estimation Techniques
114 pages
Journal Pone 0284318
No ratings yet
Journal Pone 0284318
15 pages
Overview of Partitional Clustering Techniques
No ratings yet
Overview of Partitional Clustering Techniques
11 pages
Intelligent Methods For Intrusion Detection in Local Area Networks
No ratings yet
Intelligent Methods For Intrusion Detection in Local Area Networks
12 pages
Non-Parametric News Impact Curve Model
No ratings yet
Non-Parametric News Impact Curve Model
45 pages
Intro To CNN
No ratings yet
Intro To CNN
17 pages
NP-Completeness and Approximation Methods
No ratings yet
NP-Completeness and Approximation Methods
11 pages
FIR Filter Design on C6713 DSK
No ratings yet
FIR Filter Design on C6713 DSK
30 pages
Big Data & Machine Learning Prodegree
No ratings yet
Big Data & Machine Learning Prodegree
6 pages
Training An Artificial Neural Network To Play Tic Tac Toe PDF
No ratings yet
Training An Artificial Neural Network To Play Tic Tac Toe PDF
16 pages
Understanding Numerical Errors in Engineering
No ratings yet
Understanding Numerical Errors in Engineering
28 pages
Data and Network Security Quiz Answers
No ratings yet
Data and Network Security Quiz Answers
4 pages
Two-Port Network Parameter Analysis
No ratings yet
Two-Port Network Parameter Analysis
3 pages
Dynamic Programming Introduction - Tutorial (Updated)
No ratings yet
Dynamic Programming Introduction - Tutorial (Updated)
6 pages
Lagrange Multipliers: Max/Min Problems
No ratings yet
Lagrange Multipliers: Max/Min Problems
2 pages
Dirichlet Problem for Degenerate Elliptic PDEs
No ratings yet
Dirichlet Problem for Degenerate Elliptic PDEs
51 pages
Linear Programming Evaluation Questions
No ratings yet
Linear Programming Evaluation Questions
10 pages
CascadedGaze: Efficient Image Restoration
No ratings yet
CascadedGaze: Efficient Image Restoration
16 pages
Unit 6 MCQs on Classification Metrics
No ratings yet
Unit 6 MCQs on Classification Metrics
6 pages

Understanding Autoencoders in Deep Learning

Uploaded by

Understanding Autoencoders in Deep Learning

Uploaded by

Deep Learning

Dr. Arundhati Das

• An autoencoder is an unsupervised learning algorithm that

You might also like