0% found this document useful (0 votes)

22 views49 pages

Deep Generative Models Overview

The document provides an overview of deep generative models, focusing on unsupervised learning techniques such as PCA, auto-encoders, VAEs, and GANs. It discusses the objectives and applications of unsupervised learning, including density estimation, clustering, and feature learning. Additionally, it compares GANs and VAEs, highlighting their strengths and weaknesses in generating new samples from training data.

Uploaded by

imran khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views49 pages

Deep Generative Models Overview

Uploaded by

imran khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Deep Generative Models

Shenlong Wang
● Why unsupervised learning?

● Old-school unsupervised learning

Overview ○ PCA, Auto-encoder, KDE, GMM

● Deep generative models

○ VAEs, GANs
Unsupervised Learning
● No labels are provided during training
● General objective: inferring a function to describe hidden structure from
unlabeled data
○ Density estimation (continuous probability)
○ Clustering (discrete labels)
○ Feature learning / representation learning (continuous vectors)
○ Dimension reduction (lower-dimensional representation)
○ etc.
Why Unsupervised Learning?
● Density estimation: estimate the probability density function p(x) of a random
variable x, given a bunch of observations {X1, X2, ...}

2D density estimation of
Stephen Curry’s
shooting position

Credit: BallR
Why Unsupervised Learning?
● Clustering: grouping a set of input {X1, X2, ...} in such a way that objects in
the same group (called a cluster) are more similar

Clustering analysis of Hall-of-fame players in NBA

Credit: BallR
Why Unsupervised Learning?
● Feature learning: a transformation of raw data input to a representation that
can be effectively exploited in machine learning tasks

2D topological visualization given the input how similar players

are with regard to points, rebounds, assists, steals, rebounds,
blocks, turnovers and fouls

Credit: Ayasdi
Why Unsupervised Learning?
● Dimension reduction: reducing the number of random variables under
consideration, via obtaining a set of principal variables

Principle component analysis over players trajectory data

Credit: Bruce, Arxiv 2016

Principle Component Analysis (PCA)
An algorithm that conducts dimension reduction

Intuition:

● Finds the lower-dimension projection that minimizes

reconstruction error
● Keep the most information (maximize variance)

See more details in Raquel’s CSC411 slides:

[Link]
Principle Component Analysis (PCA)
An algorithm that conducts dimension reduction

Intuition:

● Finds the lower-dimension projection that minimizes

reconstruction error
● Keep the most information (maximize variance)

Algorithm:

● Conduct eigen decomposition

● Find K-largest eigenvectors
● Linear projection with the matrix composed of K
eigenvectors
See more details in Raquel’s CSC411 slides:
[Link]
Auto-encoder
A neural network that the output is the input itself.

Intuition:

● A good representation should keep the information well (reconstruction error)

● Deep + nonlinearity might help enhance the representation power
Auto-encoder
A neural network that the output is the input itself.

Intuition:

● A good representation should keep the information well (reconstruction error)

● Deep + nonlinearity might help enhance the representation power

Credit: LeCun
Learnt representation
Auto-encoder
A neural network that the output is the input itself.

10-dimensional Auto-encoder feature embedding based on players shooting tendency

Credit: Wang et al. 2016 Sloan Sports Conference

Kernel Density Estimation (KDE)
A nonparametric way to estimate the probability density function of a random variable

Intuition:

● Point with more neighbouring samples have higher density

● Smoothed histogram, centered at data point

Kernel function, measures the similarity

Credit: Wikipedia
Kernel Density Estimation (KDE)
A nonparametric way to estimate the probability density function of a random variable

Applications:

● Visualization
● Sampling

Shooting heat map of Lamarcus Aldridge

2015-2016. Credit: Squared Statistics
Generative models
Task: generate new samples follows the same probabilistic distribution of a given a training dataset
Generative models
Task: generate new samples follows the same probabilistic distribution of a given a training dataset
Generative models
Task: generate new samples follows the same probabilistic distribution of a given a training dataset

Training samples Generated samples

Credit: Kingma

Note: sometimes it’s fine if we cannot estimate the explicit form of p(x), since it might be over complicated
Variational Auto-encoder (VAE)
Intuition: given a bunch of random variables that can be sampled easily, we can generate random
samples following other distributions, through a complicated non-linear mapping x = f(z)

Image Credit: Doersch 2016

Variational Auto-encoder (VAE)
Intuition: given a bunch of random variables that can be sampled easily, we can generate some new
random samples through a complicated non-linear mapping x = f(z)

Image Credit: Doersch 2016

Variational Auto-encoder (VAE)
Intuition: given a bunch of random variables, we can generate some new random samples through a
complicated non-linear mapping x = f(z)

Gaussian
NN

Image Credit: Doersch 2016

Variational Auto-encoder (VAE)
You can consider it as a decoder!

Decoder
network

Code z Gaussian
parameters
Variational Auto-encoder (VAE)
How do we learn the parameters?

Decoder
network

Code z Gaussian
parameters
Variational Auto-encoder (VAE)
Graphical model

Image Credit: Doersch 2016

Variational Auto-encoder (VAE)
Learning objective: maximize the log-probability

many sampled z will have a close-to-zero p(x|z)

Quiz: Why not doing this?

Image Credit: Doersch 2016
Variational Auto-encoder (VAE)
Learning objective: maximize variational lower-bound

Variational lower-bound

Quiz: How to choose a good proposal distribution?

Proposal distribution
Variational Auto-encoder (VAE)
Learning objective: maximize variational lower-bound

Variational lower-bound

Proposal distribution
Quiz: How to choose a good proposal distribution?

● Easy to sample
● Differentiable
● Given a training sample X, the sampled z is likely to have a non-zero p(x|z)
Variational Auto-encoder (VAE)
Learning objective: maximize variational lower-bound

Answer: Another neural network + Gaussian to approximate the posterior!

Variational Auto-encoder (VAE)
Learning objective: maximize variational lower-bound

Reconstruction error: Prior:

● Training samples have higher probability ● Proposal distribution should be like Gaussian
Variational Auto-encoder (VAE)
Learning objective: maximize variational lower-bound

● KL-Divergence: closed-form and differentiable if

both are Gaussians
● Reconstruction error: approximate by just
sampling one z

Computation graph
Credit: Doersch
Variational Auto-encoder (VAE)
Why it is the variational lower-bound?

Jenson inequality

Kingma et al. 2014

Variational Auto-encoder (VAE)
The whole learning structure
KL-Divergence

Reconstruction Loss

Encoder Decoder
network network

Input Image x Code z Reconstruction

Variational Auto-encoder (VAE)
Results

Kingma et al. 2014

Generative Adversarial Network (GAN)

Generator

Code z Generated
Image
Generative Adversarial Network (GAN)

Generator

Fake

Code z Generated Image Discriminator

Real

Training Image
Generative Adversarial Network (GAN)
Intuitions

Crook

Google
Generative Adversarial Network (GAN)
Intuitions

Generator

Teller

Google
Generative Adversarial Network (GAN)
Intuitions

Crook

Teller

Google
Generative Adversarial Network (GAN)
Intuitions:

● Generator tries the best to cheat

the discriminator by generating Generator
more realistic images
Fake
Code z Generated Image Discriminator Real

● Discriminator tries the best to

distinguish whether the image is
generated by computers or not
Training Image
Generative Adversarial Network (GAN)
Objective function:

For each iteration:

● Sample a mini-batch of fake images and true images

● Update G using back-prop
● Update D using back-prop

Very difficult to optimize:

● Min-max problem: finding a saddle point instead of a local optimum, unstable

Generative Adversarial Network (GAN)

Generator

Code z Generated Image Discriminator Cross-Entropy

Training Image
GANs for face and bedroom

Credit: Denton
GANs for Japanese Anime

Credit: Radford
GAN for videos

Credit: Vondrick
Generative Adversarial Network (GAN)
Extensions:

● DCGANs: some hacks that work well

● LAPGANs: coarse-to-fine conditional generation through Laplacian pyramids
● f-GANs: more general GANs with different loss other than cross-entropy
● infoGANs: additional objective that maximize mutual-information between the latent and the sample
● EBGANs: Discriminative as energy functions
● GVMs: using GANs as an energy term for interactive image manipulation
● Conditional GANs: not random z, instead z is some data from other domain
● ...
Generative Adversarial Network (GAN)
Hacks:

● How to train a GAN?

● 17 hacks that make the training work.
● [Link]
GANs vs VAEs
GANs:

● High-quality visually appealing result

● Difficult to train
● The idea of adversarial training can be applied in many other domains

VAEs:

● Easy to train
● Blurry result due to minimizing the MSE based reconstruction error
● Nice probabilistic formulation, easy to introduce prior
Demos
VAEs:

● [Link]

GANs:

● [Link]
● [Link]
References
[1] Goodfellow and Bengio, “Deep learning ” 2016
[2] Goodfellow, Ian, et al. "Generative adversarial nets." Advances in neural information processing systems. 2014.
[3] Kingma, Diederik P., and Max Welling. "Auto-encoding variational bayes." arXiv preprint arXiv:1312.6114 (2013).
[4] Radford, Alec, Luke Metz, and Soumith Chintala. "Unsupervised representation learning with deep convolutional generative adversarial
networks." arXiv preprint arXiv:1511.06434 (2015).
[5] Nowozin, Sebastian, Botond Cseke, and Ryota Tomioka. "f-GAN: Training generative neural samplers using variational divergence
minimization." Advances in Neural Information Processing Systems. 2016.
[6] Denton, Emily L., Soumith Chintala, and Rob Fergus. "Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks."
Advances in neural information processing systems. 2015.
[7] Chen, Xi, et al. "Infogan: Interpretable representation learning by information maximizing generative adversarial nets." Advances in Neural
Information Processing Systems. 2016.
[8] Zhao, Junbo, Michael Mathieu, and Yann LeCun. "Energy-based generative adversarial network." arXiv preprint arXiv:1609.03126 (2016).
[9] Doersch, Carl. "Tutorial on variational autoencoders." arXiv preprint arXiv:1606.05908 (2016).
[10] Wang, K-C., and Richard Zemel. "classifying NBA offensive plays using neural networks." MIT Sloan Sports Analytics Conference, 2016.
[11] Wikipedia “Kernel density estimation”
[12] Wikipedia “Principal component analysis”
[13] Wikipedia “Autoencoder”
Thanks

Understanding Variational Autoencoders
No ratings yet
Understanding Variational Autoencoders
44 pages
Understanding Generative AI Models
No ratings yet
Understanding Generative AI Models
36 pages
Understanding Autoencoders and VAEs
No ratings yet
Understanding Autoencoders and VAEs
11 pages
Generative Modeling Techniques Overview
No ratings yet
Generative Modeling Techniques Overview
111 pages
Giovanni Iacca on Generative Models
No ratings yet
Giovanni Iacca on Generative Models
51 pages
4-VAEs Final 3
No ratings yet
4-VAEs Final 3
29 pages
Deep Learning: Autoencoders & GANs
No ratings yet
Deep Learning: Autoencoders & GANs
22 pages
VAE vs GAN: Key Differences Explained
100% (1)
VAE vs GAN: Key Differences Explained
3 pages
Variational Autoencoders Overview
No ratings yet
Variational Autoencoders Overview
31 pages
Variational Autoencoders and GANs Explained
No ratings yet
Variational Autoencoders and GANs Explained
10 pages
Understanding Variational Autoencoders
No ratings yet
Understanding Variational Autoencoders
57 pages
Variational Autoencoders and GANs Explained
No ratings yet
Variational Autoencoders and GANs Explained
14 pages
Unit - 2 - GenAI - Final Notes - KR23
No ratings yet
Unit - 2 - GenAI - Final Notes - KR23
39 pages
Understanding Variational Autoencoders
No ratings yet
Understanding Variational Autoencoders
48 pages
Generative Models: Understanding VAEs
No ratings yet
Generative Models: Understanding VAEs
10 pages
Deep Learning Mod 5
No ratings yet
Deep Learning Mod 5
60 pages
Generative Models
No ratings yet
Generative Models
93 pages
VAEs vs GANs: Generative Models Explained
No ratings yet
VAEs vs GANs: Generative Models Explained
19 pages
Autoencoder Architecture for Image Denoising
No ratings yet
Autoencoder Architecture for Image Denoising
11 pages
Autoencoders and Generative Models Explained
No ratings yet
Autoencoders and Generative Models Explained
4 pages
Generative AI with Python & PyTorch Guide
No ratings yet
Generative AI with Python & PyTorch Guide
13 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
65 pages
DL Module 3
No ratings yet
DL Module 3
17 pages
Autoencoders and GANs Explained
No ratings yet
Autoencoders and GANs Explained
7 pages
Understanding Autoencoders and GANs
No ratings yet
Understanding Autoencoders and GANs
29 pages
Understanding Variational Autoencoders
No ratings yet
Understanding Variational Autoencoders
37 pages
Week 14
No ratings yet
Week 14
19 pages
Understanding Autoencoders and Their Applications
No ratings yet
Understanding Autoencoders and Their Applications
10 pages
Understanding Autoencoders in Unsupervised Learning
No ratings yet
Understanding Autoencoders in Unsupervised Learning
35 pages
Lec 51
No ratings yet
Lec 51
12 pages
VAE Applications in Image Generation
No ratings yet
VAE Applications in Image Generation
10 pages
DL Unit-5
No ratings yet
DL Unit-5
30 pages
Overview of Autoencoders
No ratings yet
Overview of Autoencoders
22 pages
Generative AI: VAE, GAN, and Transformers
No ratings yet
Generative AI: VAE, GAN, and Transformers
36 pages
Understanding Variational Autoencoders
No ratings yet
Understanding Variational Autoencoders
24 pages
Understanding Autoencoders and VAEs
No ratings yet
Understanding Autoencoders and VAEs
21 pages
Understanding Variational Autoencoders
No ratings yet
Understanding Variational Autoencoders
18 pages
VAEs and GANs in Deep Learning
No ratings yet
VAEs and GANs in Deep Learning
46 pages
Auto Encoder
No ratings yet
Auto Encoder
22 pages
Machine Learning Algorithms and Concepts
No ratings yet
Machine Learning Algorithms and Concepts
9 pages
Deep Learning Unit4
No ratings yet
Deep Learning Unit4
11 pages
Supervised vs Unsupervised Learning Guide
No ratings yet
Supervised vs Unsupervised Learning Guide
41 pages
AAI Cheatsheet
No ratings yet
AAI Cheatsheet
6 pages
Visual Information Interpretation: Transformers & Generative Models
No ratings yet
Visual Information Interpretation: Transformers & Generative Models
36 pages
Understanding GANs: Generative Models Explained
No ratings yet
Understanding GANs: Generative Models Explained
65 pages
Variational Autoencoders
No ratings yet
Variational Autoencoders
2 pages
Deep Learning: VAEs and GANs Overview
No ratings yet
Deep Learning: VAEs and GANs Overview
52 pages
Encoder-Decoder Models in Deep Learning
No ratings yet
Encoder-Decoder Models in Deep Learning
22 pages
Deep Generative Models Overview
No ratings yet
Deep Generative Models Overview
11 pages
Module IICore Generative Model Families
No ratings yet
Module IICore Generative Model Families
23 pages
DL2 DensityEstimation Slides
No ratings yet
DL2 DensityEstimation Slides
35 pages
Variational Autoencoders Explained
No ratings yet
Variational Autoencoders Explained
136 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
138 pages
DL Lecture 18 Autoencoders
No ratings yet
DL Lecture 18 Autoencoders
40 pages
Deep Generative Models Review 2023
No ratings yet
Deep Generative Models Review 2023
9 pages
Deep Learning Optimization Techniques
No ratings yet
Deep Learning Optimization Techniques
51 pages
Deep Learning Fundamentals and Backpropagation
No ratings yet
Deep Learning Fundamentals and Backpropagation
26 pages
Lenovo In-Plant Training Request
No ratings yet
Lenovo In-Plant Training Request
1 page
Dr. Sofia: ECE Assistant Professor Profile
No ratings yet
Dr. Sofia: ECE Assistant Professor Profile
2 pages
Paper Chromatography MCQs and Concepts
No ratings yet
Paper Chromatography MCQs and Concepts
8 pages
Khargone Thermal Power Project Design
No ratings yet
Khargone Thermal Power Project Design
30 pages
NEMA PE-5 Battery Chargers Overview
No ratings yet
NEMA PE-5 Battery Chargers Overview
37 pages
Understanding %lu in C Programming
No ratings yet
Understanding %lu in C Programming
50 pages
Fluid Dynamics Calculations and Graphs
No ratings yet
Fluid Dynamics Calculations and Graphs
3 pages
Diploma Marks Certificate - Electronics Engineering
No ratings yet
Diploma Marks Certificate - Electronics Engineering
1 page
Understanding Generative AI Concepts
No ratings yet
Understanding Generative AI Concepts
22 pages
Class 10 Physics Practical: Light Refraction
No ratings yet
Class 10 Physics Practical: Light Refraction
2 pages
Vaccine Vial Monitor Performance Specs
No ratings yet
Vaccine Vial Monitor Performance Specs
10 pages
Critique of Ruling Elite Theory
No ratings yet
Critique of Ruling Elite Theory
8 pages
Sheikh Sarai Housing Case Study
100% (1)
Sheikh Sarai Housing Case Study
7 pages
Manual Escaner Estructural
No ratings yet
Manual Escaner Estructural
37 pages
English 5 Lesson Plans Overview
No ratings yet
English 5 Lesson Plans Overview
12 pages
Flender N Eupex, Rupex and N Bipex
No ratings yet
Flender N Eupex, Rupex and N Bipex
116 pages
Delegate List for Adam Davie Academy
No ratings yet
Delegate List for Adam Davie Academy
10 pages
Grade 7 Math: Polygons Week 3 DLL
No ratings yet
Grade 7 Math: Polygons Week 3 DLL
6 pages
JEE Main & Advanced 2024 Test Instructions
No ratings yet
JEE Main & Advanced 2024 Test Instructions
24 pages
Algerian High School Interview Project
No ratings yet
Algerian High School Interview Project
35 pages
Music Therapy for Adolescents: Insights & Recommendations
No ratings yet
Music Therapy for Adolescents: Insights & Recommendations
9 pages
Germany's Cultural Profile in VET
No ratings yet
Germany's Cultural Profile in VET
27 pages
MARKESUNITY CIA Advantage XR Spec Sheet v12
No ratings yet
MARKESUNITY CIA Advantage XR Spec Sheet v12
5 pages
GARDEN 2T Safety Data Sheet (REACH)
No ratings yet
GARDEN 2T Safety Data Sheet (REACH)
8 pages
Account Statement: A/c XXXXXXX18
No ratings yet
Account Statement: A/c XXXXXXX18
2 pages
Python List Operations Guide
No ratings yet
Python List Operations Guide
4 pages
Understanding Gender Criticism in Literature
No ratings yet
Understanding Gender Criticism in Literature
23 pages
Circular
No ratings yet
Circular
5 pages
JCP Water Treatment System Specs
No ratings yet
JCP Water Treatment System Specs
1 page
Assertion and Reactions Lesson Plan
No ratings yet
Assertion and Reactions Lesson Plan
7 pages
Sport Chek Purchase Receipt
No ratings yet
Sport Chek Purchase Receipt
2 pages
Loan Management Database Lab Tasks
No ratings yet
Loan Management Database Lab Tasks
17 pages

Deep Generative Models Overview

Uploaded by

Deep Generative Models Overview

Uploaded by

Deep Generative Models

● Old-school unsupervised learning

Overview ○ PCA, Auto-encoder, KDE, GMM

● Deep generative models

Clustering analysis of Hall-of-fame players in NBA

2D topological visualization given the input how similar players

Principle component analysis over players trajectory data

Credit: Bruce, Arxiv 2016

● Finds the lower-dimension projection that minimizes

See more details in Raquel’s CSC411 slides:

● Finds the lower-dimension projection that minimizes

● Conduct eigen decomposition

● A good representation should keep the information well (reconstruction error)

● A good representation should keep the information well (reconstruction error)

10-dimensional Auto-encoder feature embedding based on players shooting tendency

Credit: Wang et al. 2016 Sloan Sports Conference

● Point with more neighbouring samples have higher density

Kernel function, measures the similarity

Shooting heat map of Lamarcus Aldridge

Training samples Generated samples

Image Credit: Doersch 2016

Image Credit: Doersch 2016

Image Credit: Doersch 2016

Image Credit: Doersch 2016

many sampled z will have a close-to-zero p(x|z)

Quiz: Why not doing this?

Quiz: How to choose a good proposal distribution?

Answer: Another neural network + Gaussian to approximate the posterior!

Reconstruction error: Prior:

● KL-Divergence: closed-form and differentiable if

Kingma et al. 2014

Input Image x Code z Reconstruction

Kingma et al. 2014

Code z Generated Image Discriminator

● Generator tries the best to cheat

● Discriminator tries the best to

For each iteration:

● Sample a mini-batch of fake images and true images

Very difficult to optimize:

● Min-max problem: finding a saddle point instead of a local optimum, unstable

Code z Generated Image Discriminator Cross-Entropy

● DCGANs: some hacks that work well

● How to train a GAN?

● High-quality visually appealing result

You might also like