GenAI Autoencoder

Module 5 covers generative AI models, focusing on their ability to learn patterns from training data to generate new content. It details autoencoders, including their architecture, types, and applications such as dimensionality reduction, feature extraction, image denoising, and compression. The module also discusses various types of autoencoders, including vanilla, denoising, and stacked autoencoders.

Uploaded by

febinsunny2004

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views23 pages

GenAI Autoencoder

Uploaded by

febinsunny2004

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Module 5

Generative AI Models

Contents
➢ Introduction to Generative Models: Overview of generative models,
Types of generative models (e.g., GANs, VAEs)
➢ Autoencoders: Basics of autoencoders, Variational Autoencoders (VAEs)
➢ Generative Adversarial Networks (GANs): Introduction to GAN
architecture, Training GANs, Applications of GANs
Introduction to Generative Models

➢ Generative models are a class of artificial intelligence (AI) that learn the
underlying patterns, structure, and probability distribution of training data to
generate new, original samples—such as images, text, audio, or 3D models—that
resemble the input data.
➢ A generative model is a machine learning model designed to create new data that
is similar to its training data. Generative artificial intelligence (AI) models learn
the patterns and distributions of the training data, then apply those
understandings to generate novel content in response to new input data.
- IBM
➢ Major types including Variational Autoencoders (VAEs), Generative Adversarial
Networks (GANs), Diffusion Models, and Transformers.
Autoencoders
● Autoencoders are a class of unsupervised neural network (since they don't
need explicit labels to train on) where the input is same as the output.
● The aim of an autoencoder is to learn a lower-dimensional representation
(encoding) for a higher-dimensional data.
● They compress the input into a lower dimensional code and then
reconstruct the output from this representation.
The architecture of autoencoders:

Autoencoders consist of 3 parts:

1. Encoder: A module that compresses the train-validate-test set input data

into an encoded representation that is typically several orders of magnitude
smaller than the input data. It compress and produces the code

2. Bottleneck: A module that contains the compressed knowledge

representations and is therefore the most important part of the network.

3. Decoder: A module that helps the network“decompress” the knowledge

representations and reconstructs the data back from its encoded form. The
output is then compared with a ground truth.
The relationship between the Encoder, Bottleneck, and Decoder
Encoder

The encoder is a set of convolutional blocks followed by pooling modules that compress
the input to the model into a compact section called the bottleneck.

Bottleneck
● The most important part of the neural network, and ironically the smallest one, is
the bottleneck.
● The bottleneck is designed in such a way that the maximum information possessed
by an image is captured in it, we can say that the bottleneck helps us form a
knowledge-representation of the input.
● A bottleneck as a compressed representation of the input further prevents the
neural network from memorising the input and overfitting on the data
Decoder
The decoder is a set of upsampling and convolutional blocks that reconstructs the
bottleneck's output.

The number of hidden units in the autoencoder is typically less than the number of input (and
output) units. This forces the encoder to learn a compressed representation of the input, which
the decoder reconstructs.
• If there is a structure in the input data in the form of correlations between input
features, then the autoencoder will discover some of these correlations, and end up
learning a low-dimensional representation of the data similar to that learned using
principal component analysis (PCA).

• Once the autoencoder is trained, we would typically just discard the decoder
component and use the encoder component to generate compact representations of
the input.

• Alternatively, we could use the encoder as a feature detector that generates a

compact, semantically rich representation of our input and build a classifier by
attaching a softmax classifier to the hidden layer.
Applications of Autoencoder
1. Dimensionality Reduction
Autoencoders train the network to explain the natural structure in the
data into efficient lower-dimensional representation. It does this by
using decoding and encoding strategy to minimize the reconstruction
error

The input and the output

dimension have 3000
dimensions, and the desired
reduced dimension is 200.
2. Feature Extraction
● Autoencoders can be used as a feature extractor for classification or
regression tasks.
● Autoencoders take unlabeled data and learn efficient codings about
the structure of the data that can be used for supervised learning
tasks.
● After training an autoencoder network using a sample of training
data, we can ignore the decoder part of the autoencoder, and only use
the encoder to convert raw input data of higher dimension to a lower
dimension encoded space.
● This lower dimension of data can be used as a feature for supervised
tasks.
3. Image Denoising

● The real-world raw input data is often noisy in nature, and to train a
robust supervised model requires cleaned and noiseless data.
Autoencoders can be used to denoise the data.

● Image denoising is one of the popular applications where the

autoencoders try to reconstruct the noiseless image from a noisy input
image.
4. Image Compression:

● Image compression is another application of an autoencoder network.

● The raw input image can be passed to the encoder network and obtained a
compressed dimension of encoded data.
● The autoencoder network weights can be learned by reconstructing the
image from the compressed encoding using a decoder network.
• We can think of autoencoders as consisting of two cascaded networks.

• The first network is an encoder, it takes the input x, and encodes it using a
transformation h to an encoded signal y, that is:

• The second network uses the encoded signal y as its input and performs
another transformation f to get a reconstructed signal r, that is:

• We define error, e, as the difference between the original input x and the
reconstructed signal r, e= x- r.

• The network then learns by reducing the loss function (for example mean
squared error (MSE)), and the error is propagated backwards to the hidden
layers as in the case of MLPs.
● Depending upon the actual dimensions of the encoded layer with respect to
the input, the loss function, and constraints, there are various types of
autoencoders:

■ Vanilla Autoencoders
■ Denoising autoencoders,
■ Stacked autoencoders
■ Sparse autoencoders
■ Variational Autoencoders
Vanilla autoencoders
• The Vanilla autoencoder, as proposed by Hinton in his 2006 paper Reducing
the Dimensionality of Data with Neural Networks, consists of one hidden
layer only.

• The number of neurons in the hidden layer are less than the number of
neurons in the input (or output) layer.

• This results in producing a bottleneck effect in the flow of information in the

network. The hidden layer in between is also called the "bottleneck layer.“

• Learning in the autoencoder consists of developing a compact representation

of the input signal at the hidden layer so that the output layer can faithfully
reproduce the original input.
Denoising autoencoders
• A denoising autoencoder learns from a corrupted (noisy) input; it feed its
encoder network the noisy input, and then the reconstructed image from the
decoder is compared with the original input.
• The idea is that this will help the network learn how to denoise an input.
• It will no longer just make pixel-wise comparisons, but in order to denoise it
will learn the information of neighboring pixels as well.
• The corruption process typically follows one of two approaches.
• Approach 1:
○ We can randomly set some of the inputs (as many as half of them) to
zero or one; most commonly it is setting random values to zero to imply
missing [Link] can be done by manually inputting zeros or ones
into the inputs or adding a dropout layer between the inputs and first
hidden layer.
• Approach 2: adding pure Gaussian noise
○ Training a denoising autoencoder is nearly the same process as training
a regular autoencoder. The only difference is we supply our corrupted
inputs to training_frame and supply the non-corrupted inputs
to validation_frame.
Results
Stacked autoencoder
• Until now we have restricted ourselves to autoencoders with only one hidden
layer.
• We can build Deep autoencoders by stacking many layers of both
encoder and decoder; such an autoencoder is called a Stacked autoencoder.
• The stacked autoencoder can be trained as a whole network with an aim to
minimize the reconstruction error.
● Thus stacked autoencoders are nothing but Deep autoencoders having multiple hidden
layers. With more hidden layers, the autoencoders can learns more complex coding.
● When the deep autoencoder network is a convolutional network, we call it a
Convolutional Autoencoder

Convolutional autoencoder for removing noise from images

Unit 4
No ratings yet
Unit 4
31 pages
Autoencoders and Generative Models Overview
No ratings yet
Autoencoders and Generative Models Overview
27 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
79 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
11 pages
Understanding Autoencoder Architecture
No ratings yet
Understanding Autoencoder Architecture
36 pages
4.3 Auto Encoders
No ratings yet
4.3 Auto Encoders
4 pages
DL Module 4
No ratings yet
DL Module 4
34 pages
Understanding Undercomplete Autoencoders
No ratings yet
Understanding Undercomplete Autoencoders
32 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
23 pages
Understanding Autoencoders in Deep Learning
100% (1)
Understanding Autoencoders in Deep Learning
4 pages
Understanding Autoencoders in AI
No ratings yet
Understanding Autoencoders in AI
17 pages
Understanding Autoencoders in AI
No ratings yet
Understanding Autoencoders in AI
12 pages
Understanding Autoencoders in Deep Learning
100% (1)
Understanding Autoencoders in Deep Learning
4 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
16 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
37 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
39 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
42 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
35 pages
Understanding Autoencoders in Machine Learning
No ratings yet
Understanding Autoencoders in Machine Learning
39 pages
Autoencoders and Their Applications
No ratings yet
Autoencoders and Their Applications
25 pages
Unit 3 High Dimensional Object
No ratings yet
Unit 3 High Dimensional Object
13 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
65 pages
01-Unit 3
No ratings yet
01-Unit 3
17 pages
Autoencoders: Applications and Architecture
No ratings yet
Autoencoders: Applications and Architecture
22 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
22 pages
Autoencoders: Types and Applications
No ratings yet
Autoencoders: Types and Applications
31 pages
Twitter Spam Detection with Autoencoders
No ratings yet
Twitter Spam Detection with Autoencoders
26 pages
Understanding Autoencoders and VAEs
100% (1)
Understanding Autoencoders and VAEs
22 pages
Recurrent Networks and Autoencoders
No ratings yet
Recurrent Networks and Autoencoders
53 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
248 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
62 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
39 pages
AAI Module 3 Types of Auoencoder
No ratings yet
AAI Module 3 Types of Auoencoder
13 pages
Understanding Autoencoders: Types & Uses
No ratings yet
Understanding Autoencoders: Types & Uses
11 pages
Understanding Auto-Encoders in AI
No ratings yet
Understanding Auto-Encoders in AI
47 pages
Generative AI: Autoencoders Overview
No ratings yet
Generative AI: Autoencoders Overview
15 pages
Autoencoders and Generative Models Overview
No ratings yet
Autoencoders and Generative Models Overview
16 pages
Understanding Autoencoders and GANs
No ratings yet
Understanding Autoencoders and GANs
29 pages
Autoencoders and Generative Models Overview
No ratings yet
Autoencoders and Generative Models Overview
19 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
51 pages
Module 3 DL
No ratings yet
Module 3 DL
57 pages
Auto Encoder S
No ratings yet
Auto Encoder S
57 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
23 pages
Overview of Autoencoders
No ratings yet
Overview of Autoencoders
22 pages
Lecture 9 Autoencoders 09122022 032236pm
No ratings yet
Lecture 9 Autoencoders 09122022 032236pm
23 pages
Unit - 3 DL
No ratings yet
Unit - 3 DL
20 pages
Denoising Autoencoders Explained
No ratings yet
Denoising Autoencoders Explained
7 pages
Autoencoders and Regularization Techniques
No ratings yet
Autoencoders and Regularization Techniques
23 pages
Clothing Translatio1
No ratings yet
Clothing Translatio1
4 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
27 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
52 pages
JNTUK R20 Deep Learning Notes PDF
No ratings yet
JNTUK R20 Deep Learning Notes PDF
61 pages
7 AIntroductionto Autoencoder
No ratings yet
7 AIntroductionto Autoencoder
10 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
40 pages
DL Lecture 18 Autoencoders
No ratings yet
DL Lecture 18 Autoencoders
40 pages
Contractive Autoencoder Overview
No ratings yet
Contractive Autoencoder Overview
21 pages
Deep Learning for Image Captioning
No ratings yet
Deep Learning for Image Captioning
6 pages
AI and Soft Computing Course Overview
No ratings yet
AI and Soft Computing Course Overview
2 pages
Advances in Text-to-Image Synthesis
No ratings yet
Advances in Text-to-Image Synthesis
16 pages
Multistage Model for Fake Profile Detection
No ratings yet
Multistage Model for Fake Profile Detection
15 pages
Latest Advancements in AI Technology
No ratings yet
Latest Advancements in AI Technology
8 pages
Survey of Vision Language Models
No ratings yet
Survey of Vision Language Models
22 pages
Deep Learning in Educational Data Science
No ratings yet
Deep Learning in Educational Data Science
18 pages
Understanding Shallow Neural Networks
No ratings yet
Understanding Shallow Neural Networks
10 pages
AI Engineer Learning Roadmap 2025
No ratings yet
AI Engineer Learning Roadmap 2025
4 pages
A Novel Deep Learning Approach For Deepfake Image Detection
No ratings yet
A Novel Deep Learning Approach For Deepfake Image Detection
76 pages
MAGNUM: Modular Multimodal Learning
No ratings yet
MAGNUM: Modular Multimodal Learning
8 pages
AI & Machine Learning Expertise Overview
No ratings yet
AI & Machine Learning Expertise Overview
2 pages
Deep Learning Study Guide Overview
No ratings yet
Deep Learning Study Guide Overview
11 pages
Anna University Student Attendance Report
No ratings yet
Anna University Student Attendance Report
1 page
IISc Deep Learning Course & Study Plan
No ratings yet
IISc Deep Learning Course & Study Plan
7 pages
Neural Network Components in MATLAB
No ratings yet
Neural Network Components in MATLAB
7 pages
Gender Classification in Images Using CNN
No ratings yet
Gender Classification in Images Using CNN
60 pages
RNN Model for Jena Climate Analysis
No ratings yet
RNN Model for Jena Climate Analysis
57 pages
Call for Papers: IJSCAI Journal
No ratings yet
Call for Papers: IJSCAI Journal
3 pages
Summary of "Attention Is All You Need"
100% (1)
Summary of "Attention Is All You Need"
2 pages
Deep Learning vs Machine Learning Guide
No ratings yet
Deep Learning vs Machine Learning Guide
133 pages
AI in Predicting Concrete Strength
No ratings yet
AI in Predicting Concrete Strength
4 pages
Neural Networks & Fuzzy Systems Model Paper
No ratings yet
Neural Networks & Fuzzy Systems Model Paper
4 pages
Knowledge-Guided Attention Networks for NLU
No ratings yet
Knowledge-Guided Attention Networks for NLU
11 pages
Enhancing Smart City Security with AI
No ratings yet
Enhancing Smart City Security with AI
13 pages
Paraphrasing Techniques in NLP
No ratings yet
Paraphrasing Techniques in NLP
11 pages
Introduction to Artificial Intelligence
No ratings yet
Introduction to Artificial Intelligence
5 pages
Introduction to Soft Computing Concepts
No ratings yet
Introduction to Soft Computing Concepts
21 pages
M.Tech in AI & ML for Working Professionals
No ratings yet
M.Tech in AI & ML for Working Professionals
31 pages