0% found this document useful (0 votes)

13 views138 pages

Unsupervised Learning

The document discusses unsupervised learning techniques, focusing on Self-Organizing Maps (SOM) and Autoencoders. SOM is an artificial neural network that maps high-dimensional data to lower dimensions for clustering and dimensionality reduction, while Autoencoders are designed to compress and reconstruct data, learning representations through a bottleneck architecture. Both methods are used for tasks such as generation, denoising, and anomaly detection, with Autoencoders being capable of learning non-linear relationships unlike PCA.

Uploaded by

meghdaves4ds

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views138 pages

Unsupervised Learning

Uploaded by

meghdaves4ds

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Unsupervised Learning Algorithm

►Self Organizing Maps

►Kohonen Self-Organizing Feature
Maps
Unsupervised Learning
In many cases in which we do not have labeled data and need to find the
hidden patterns from the given dataset.
So, to solve such types of cases in machine learning, we need unsupervised
learning techniques.
Self Organizing Map (or Kohonen Map or SOM)
• Type of Artificial Neural Network
• Inspired by biological models of neural systems form the 1970’s.
• It follows an unsupervised learning approach
• Trained its network through a competitive learning algorithm.
• SOM is used for clustering and mapping (or dimensionality
reduction) techniques
• Maps multidimensional data onto lower-dimensional which allows
people to reduce complex problems for easy interpretation.
• SOM has two layers, one is the Input layer and the other one is the
Output layer.
The Algorithm:

1. Each node’s weights are initialized.

2. A vector is chosen at random from the set of training data.

3. Every node is examined to calculate which one’s weights are

most like the input vector. The winning node is commonly

known as the Best Matching Unit (BMU).

4. Then the neighbourhood of the BMU is calculated. The amount

of neighbors decreases over time.

[Link] winning weight is rewarded with becoming more like the
sample vector. The nighbors also become more like the sample
vector. The closer a node is to the BMU, the more its weights get
altered and the farther away the neighbor is from the BMU, the less
it learns.

[Link] step 2 for N iterations.

Best Matching Unit is a technique which calculates the distance from
each weight to the sample vector, by running through all weight
vectors.
The weight with the shortest distance is the winner.
There are numerous ways to determine the distance, however, the most
commonly used method is the Euclidean Distance .
A self-organizing map (SOM) is a type of artificial neural network
(ANN) that is trained using unsupervised learning to produce a low-
dimensional (typically two-dimensional), discretized representation
of the input space of the training samples, called a map, and is
therefore a method to do dimensionality reduction.

Self-organizing maps differ from other artificial neural networks as

they apply competitive learning as opposed to error-correction
learning (such as backpropagation with gradient descent), and in the
sense that they use a neighborhood function to preserve the
topological properties of the input space.
What really happens in SOM ?

Each data point in the data set recognizes themselves by competing for
representation.
SOM mapping steps starts from initializing the weight vectors.
From there a sample vector is selected randomly and the map of weight
vectors is searched to find which weight best represents that sample.
Each weight vector has neighboring weights that are close to it.
The weight that is chosen is rewarded by being able to become more like
that randomly selected sample vector.
The neighbors of that weight are also rewarded by being able to become
more like the chosen sample vector.
This allows the map to grow and form different shapes.
Most generally, they form square/rectangular/hexagonal/L shapes in 2D
feature space.
D(1)=0.4

D(2)=2.04
1 W11=0.1
Y1
W21=0.2

0 W31=0.8
0.9

0 W12=0.9
W22=0.7
W32=0.5
Y2
0
W42=0.3
H.W.
neighborhood Over time the neighborhood will
shrink to the size of just one
node
Autoencoders
Autoencoders

Unsupervised learning
Autoencoders

Unsupervised learning

Representation learning
Autoencoders: The sneaky idea

Create an architecture with a bottleneck,

which ensures a lower-dimensional
representation of the original data.
Autoencoders: The sneaky idea

Create an architecture with a bottleneck,

which ensures a lower-dimensional
representation of the original data.
Input layer Output layer

x1
Bottleneck x’1

x2
x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5
Autoencoder = Encoder + Decoder
Input layer Output layer

x1 x’1

Bottleneck
x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5

Encoder = compress data into lower-dimensional representation (latent space)

Necessary condition to learn a representation

● Data should have dependencies across

dimensions
Necessary condition to learn a representation

● Data should have dependencies across

dimensions

● If dimensions are all independent -> impossible to learn

lower-dimensional representation
Input layer Output layer

x1 x’1

Bottleneck
x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5

Encoder = compress data into lower-dimensional representation (latent space)

Input layer Output layer

x1 x’1

Bottleneck
x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5

Decoder = Decompress representation back to original domain

Input layer Output layer

x1 x’1

Bottleneck
x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5

Decoder = Decompress representation back to original domain

Original data Reconstruction

How can we train an autoencoder?

● Backpropagation

● Minimise reconstruction error

How can we train an autoencoder?

● Backpropagation

● Minimise reconstruction error

Original data Reconstructed data

Expectation from autoencoder...

● Sensitive enough to input data to reconstruct it

● Insensitive enough to input data not to overfit it

Expectation from autoencoder......

● Sensitive enough to input data to reconstruct it

● Insensitive enough to input data not to overfit it

Deep Autoencoder

v2
Deep Convolutional Autoencoder

● Similar architecture to AE

● Convolutional layers
● Encoder: Convolution + Leaky Relu +Batch
normalization

● Decoder: Convolution transpose + Leaky Relu + Batch

normalization
What is the point of compressing and decompressing data?

The latent space keeps the most important

attributes of the input data

We can leverage the latent space to perform

several interesting tasks
Autoencoder applications
● Generation

● Denoising

● Anomaly detection

● ...
Generation with AEs

x1 x’1

x2 x’2

x3 x’3

x4 x’4

x5 x’5
Generation with AEs

x1 x’1

x2 x’2

x3 x’3

x4 x’4

x5 x’5
Generation with AEs

x1 x’1

x2 x’2

x3 x’3

x4 x’4

x5 x’5
Generation with AEs

x1 x’1

x2 x’2

x3 x’3

x4 x’4

x5 x’5
Generation with AEs

x1 x’1

x2 x’2

x3 x’3

x4 x’4

x5 x’5
Generation with AEs

x1 x’1

x2 x’2

x3 x’3

x4 x’4

x5 x’5
Generation with AEs

x1 x’1

x2 x’2

x3 x’3

x4 x’4

x5 x’5
Generation with AEs

x1 x’1

x2 x’2

x3 x’3

x4 x’4

x5 x’5
Generation with AEs

x’1

x’2

x’3

x’4

x’5
Generation with AEs

x’1
Sample a point in the latent
space and pass it through
x’2
the decoder

x’3

x’4

x’5
Denoising with AEs

x1 x’1

x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5
Denoising with AEs

x1 x’1

x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5
Denoising with AEs

x1 x’1

Add noise to audio

x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5
Denoising with AEs

x1 x’1

Add noise to audio

x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5
Denoising with AEs

x1 x’1

Add noise to audio

x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5
Denoising with AEs

Measure
reconstruction error
x1 x’1
of output against
Add noise to audio original audio
x2 x’2
v1

x3 x’3
v2

x4 x’4

x5 x’5
Anomaly detection with AEs
Auto encoders
Autoencoder

An autoencoder is a special type of

xˆi feed forward neural network which
does the following
W∗
Encodes its input x i into a hidden
h representation h
Decodes the input again from this
W
hidden representation
xi
The model is trained to minimize a
certain loss function which will ensure
that xˆi is close to x i (we will see some
h = g(W x i + b)
such loss functions soon)
x̂ i = f (W ∗ h + c)

3/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
Undercomplete Autoencoder

Let us consider the case where dim(h) < dim(x i )

xˆi
If we are still able to reconstruct xˆi
W∗
perfectly from h, then what does it
h say about h?
h is a loss-free encoding of x i . It cap-
W
tures all the important characteristics
xi of x i
Do you see an analogy with P C A ?
h = g(W x i + b)
x̂ i = f (W ∗ h + c)

An autoencoder where dim(h) < dim(x i ) is

called an under complete autoencoder

4/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
Overcomplete Autoencoder

Let us consider the case when

xˆi dim(h) ≥ dim(x i )
W∗
h
W
xi

h = g(W x i + b)
x̂ i = f (W ∗ h + c)

5/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
Let us consider the case when dim(h) ≥ dim(x i )

xˆi
In such a case the autoencoder
W∗
could learn a trivial encoding by
h simply copying x i into h
and then copying h into xˆi
W
Such an identity encoding is useless in
xi practice as it does not really tell
us anything about the important
h = g(W x i + b) characteristics of the data
x̂ i = f (W ∗ h + c)

An autoencoder where dim(h) ≥ dim(x i ) is

called an over complete autoencoder

5/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
PCA vs Encoders

● Both perform dimensionality reduction

PCA vs Encoders

● Both perform dimensionality reduction

● PCA learns linear relationships

PCA vs Encoders

● Both perform dimensionality reduction

● PCA learns linear relationships

● Encoders can learn non-linear relationships

PCA vs Encoders

Encoder
PCA
PCA vs Encoders

● Both perform dimensionality reduction

● PCA learns linear relationships

● Encoders can learn non-linear relationships

● Encoder = PCA, if it uses linear activation functions
Autoencoder is equivalent to PCA if
xˆi y PCA

use a linear encoder

h ≡ u1 u2 use a linear decoder
use squared error loss function
normalize the inputs to
x
xi PT X T X P = D
m
xˆij = √ 1 x i j − 1 Σ x k j !
m m
k =1

16/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
Regularization in Autoencoder
Denoising Autoencoder
A denoising encoder simply corrupts the
input data using a probabilistic
xˆi
process (P (x˜i j |x i j )) before feeding it
to the network

h P (x˜i j = 0 |xi j ) = q
P (x̃ i j = x i j |x i j ) = 1 − q

x˜i Aim: N/w should not memories training data

Hence it will work well on unseen data
P (x̃ i j |x i j )
xi How does this help ?
This helps because the objective is
still to reconstruct the original (un-
corrupted) x i
m n
1
L=min Σ Σ (x̂ i j − x i j ) 2
m i=1 j = 1
28/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
Regularization need in Autoencoder
While poor generalization could happen
xˆi even in undercomplete autoencoders it is
an even more serious problem for
W∗ overcomplete auto encoders
h Here, (as stated earlier) the model can
simply learn to copy x i to h and then h
W to xˆi
xi To avoid poor generalization, we need to
introduce regularization

24/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
The simplest solution is to add a L 2 -
xˆi regularization term to the
objective function
W∗
m n
1 2 2
h min
∗
Σ Σ (x̂ i j − x i j ) + λ ǁθǁ
θ,w,w ,b,c m
i=1 j=1
W
This is very easy to implement and
xi
just adds a term λW to the gradient
∂L ( θ)
∂W (and similarly for other para-
meters)

25/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
Another trick is to tie the weights
xˆi of the encoder and decoder i.e., W ∗ =W T
W∗
This effectively reduces the capacity
h of Autoencoder and acts as a regular
izer
W

26/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
Sparse autoencoder
A hidden neuron with sigmoid activation will
xˆi
have values between 0 and 1
We say that the neuron is activated when its
h output is close to 1 and not activated when
its output is close to 0.
A sparse autoencoder tries to ensure
xi the hidden neuron is inactive most of the
times.

41/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
xˆi If the neuron l is sparse (i.e. mostly inactive)
then ρˆl→ 0

The average value of the

activation of a neuron l is given
by
m
1
ρˆl = Σ h(x i ) l
m
i= 1

42/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
xˆi If the neuron l is sparse (i.e. mostly inactive)
then ρˆl→ 0
A sparse autoencoder uses a sparsity para-
meter ρ (typically very close to 0, say,
h
0.005) and tries to enforce the constraint ρˆl =
ρ
xi One way of ensuring this is to add the follow-
ing term to the objective function

The average value of the

k
ρ 1−ρ
Ω(θ) = Σ ρ log + (1 − ρ) log
activation of a neuron l is given ρˆl 1 − ρ̂l
l= 1
by
m
1 When will this term reach its minimum
ρˆl = Σ h(x i ) l value and what is the minimum value?
m
i= 1

42/1
CS7015 (Deep Learning) : Lecture 7

The function will reach its minimum

Mitesh M . value(s)
K h a p r a when ρˆl = ρ.
Contractive autoencoder
A contractive autoencoder also tries to
prevent an overcomplete autoencoder from
learning the identity function. x̂
1.................................n

It does so by adding the following reg-

ularization term to the loss function h
l=1..............................................k

Ω(θ) = ǁ J x(h)ǁ 2
x
where J x ( h ) is the Jacobian of the encoder. J=1.................................n

47/1
CS7015 (Deep Learning) : Lecture 7

th
Mitesh M . K h a p r a
n k
∂h l 2
B y putting these two contradicting ǁJ 2
x (h)ǁ F = Σ Σ
∂xj
objectives against each other we en- j = 1 l=1

sure that h is sensitive to only very

important variations as observed in
the training data. x̂
L(θ) - capture important variations
in data
h
Ω(θ) - do not capture variations in
data
Tradeoff - capture only very import- x
ant variations in the data

50/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
xˆi
Regularization
2
h Ω(θ) = λǁθǁ Weight decaying
Σk ρ 1−ρ
Ω(θ) = ρ log + (1 − ρ) log Sparse
ρ̂l 1 − ρ̂l
l= 1
x˜i Σn Σk ∂h l 2
Ω(θ) = Contractive
P (x̃ i j |x i j ) j=1 l=1
∂xj

55/1
CS7015 (Deep Learning) : Lecture 7

Mitesh M . K h a p r a
applications of auto-encoder
► Dimensionality Reduction.
► Feature Extraction.
► Image Denoising.
► Image Compression.
► Anomaly Detection.
Image Search:

•Missing Value Imputation.

GAN Architecture
► Generator function

► Discriminator function
Linear factor methods such as probablistic PCA
and Factor analysis,
Independent component analysis
► Using average x and y translation we can measure
centroid of data
► Shift centroid to origin
► Find line passing through origin fitting all data point
Thank you !

129
Let us look at the full
D ec o der
architecture
A man ⟨ stop ⟩
throwing A CNN is first used to encode
. . . yt P ( y t = j |yt− 1
, I)
1
the image
V V V V
A RNN is then used to decode
(generate) a sentence from the
W W W
. . .W sT encoding
Encoder This is a typical encoder
U U U U
h0 decoder architecture
. . . xT
Both the encoder and decoder
< GO> A man park
use a neural network
CNN

C S 7 0 1 5 (Deep Learning) : Lecture 1 6

D ec o der Let us look
architecture at the full
A man ⟨ stop ⟩
throwing A CNN is first used to encode
. . . yt P ( y t = j |yt− 1
, I)
1
the image
V V V V
A RNN is then used to decode
(generate) a sentence from the
W W W W
sT encoding
Encoder This is a typical encoder
U U U U
h0 decoder architecture
. . . xT
Both the encoder and decoder
<GO> A man park
use a neural network
CNN
Alternatively, the encoder’s
output can be fed to every step
of the decoder

C S 7 0 1 5 (Deep Learning) : Lecture 1 6

Task: Image captioning
t− 1
Deco der L t ( θ ) = − l o g P ( yt = j|y 1 , f c 7 )

A man throwing . . . ⟨ stop ⟩

Data: {xi = image i , yi = caption i }Ni = 1
. . . yt P(y t =
t− 1
j|y1 , fc 7 ) Model:

V V V V
Encoder:
s0 = CNN(xi)
W W W
. . .W sT

E n co der
Decoder:
U U U U
h0
st = RNN(st− 1 , e(ŷt− 1 ))
. . . xT

< GO> A man park

P(y t|y t− 1
1 , I) = softmax(Vs t + b)
CNN
Parameters: Udec , V, W dec , W conv , b
Loss:
∑T ∑T
L (θ) = L t (θ) = − log P(yt = ℓ t|y t−
1
1
, I)
i= 1 t= 1

Algorithm: Gradient descent with

backpropagation

C S 7 0 1 5 (Deep Learning) : Lecture 1 6

o/p: Surya Namaskar
Task: Video Classification
S u r ya n a m a sk a r
Data: {xi = Video i , yi = Activity i} N
i= 1

Model:
Encoder:
ht = RNN(ht− 1 , CNN(xit ))
. . .
Decoder:
s = hT
. . . P(y|I) = softmax(Vs + b)
Parameters: V, b, W conv , Uenc , W enc , b
CNN CNN . . . CNN
Loss:
L (θ) = − log P(y = ℓ|Video)
. . .
Algorithm: Gradient descent with
backpropagation

C S 7 0 1 5 (Deep Learning) : Lecture 1 6

Critic =realness score
Type of distances
• JS divergence
• KL divergence
• Wassertein Distance
• Total Variation distance

Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
43 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
15 pages
Autoencoders for Dimensionality Reduction
No ratings yet
Autoencoders for Dimensionality Reduction
103 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
27 pages
Understanding Autoencoders in AI
No ratings yet
Understanding Autoencoders in AI
12 pages
Understanding Autoencoders in AI
No ratings yet
Understanding Autoencoders in AI
17 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
40 pages
Twitter Spam Detection with Autoencoders
No ratings yet
Twitter Spam Detection with Autoencoders
26 pages
Module 3 DL
No ratings yet
Module 3 DL
57 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
51 pages
Auto Encoder S
No ratings yet
Auto Encoder S
57 pages
Physics-Informed Neural Networks
No ratings yet
Physics-Informed Neural Networks
53 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
79 pages
Understanding Autoencoders: Types & Uses
No ratings yet
Understanding Autoencoders: Types & Uses
20 pages
Understanding Autoencoder Architecture
No ratings yet
Understanding Autoencoder Architecture
36 pages
Applications of Autoencoders in Anomaly Detection
No ratings yet
Applications of Autoencoders in Anomaly Detection
13 pages
Unsupervised Deep Learning Techniques
No ratings yet
Unsupervised Deep Learning Techniques
26 pages
Unit 4 Autoencoders
No ratings yet
Unit 4 Autoencoders
23 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
52 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
11 pages
AE Tutoriaal 1
No ratings yet
AE Tutoriaal 1
14 pages
Recurrent Networks and Autoencoders
No ratings yet
Recurrent Networks and Autoencoders
53 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
248 pages
DL Module 4
No ratings yet
DL Module 4
34 pages
Understanding Autoencoders and VAEs
100% (1)
Understanding Autoencoders and VAEs
22 pages
7 AIntroductionto Autoencoder
No ratings yet
7 AIntroductionto Autoencoder
10 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
35 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
39 pages
Understanding Auto-Encoders in AI
No ratings yet
Understanding Auto-Encoders in AI
47 pages
Unsupervised Learning with Autoencoders
No ratings yet
Unsupervised Learning with Autoencoders
214 pages
DL Lecture 18 Autoencoders
No ratings yet
DL Lecture 18 Autoencoders
40 pages
Understanding Autoencoders in Machine Learning
No ratings yet
Understanding Autoencoders in Machine Learning
39 pages
Understanding Autoencoders in ML
No ratings yet
Understanding Autoencoders in ML
42 pages
Lecture 9 Autoencoders 09122022 032236pm
No ratings yet
Lecture 9 Autoencoders 09122022 032236pm
23 pages
Unit 3 High Dimensional Object
No ratings yet
Unit 3 High Dimensional Object
13 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
39 pages
Understanding Autoencoders in Deep Learning
100% (1)
Understanding Autoencoders in Deep Learning
4 pages
Understanding Autoencoders in Unsupervised Learning
No ratings yet
Understanding Autoencoders in Unsupervised Learning
35 pages
Understanding Autoencoders and Their Types
No ratings yet
Understanding Autoencoders and Their Types
45 pages
Understanding Autoencoders for Dimensionality Reduction
No ratings yet
Understanding Autoencoders for Dimensionality Reduction
103 pages
Autoencoders and Generative Models Overview
No ratings yet
Autoencoders and Generative Models Overview
27 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
37 pages
Autoencoders and Generative Models Overview
No ratings yet
Autoencoders and Generative Models Overview
19 pages
Understanding Autoencoders Basics
No ratings yet
Understanding Autoencoders Basics
15 pages
Autoencoder Architecture and Hyperparameters
No ratings yet
Autoencoder Architecture and Hyperparameters
16 pages
Autoencoder Applications in Deep Learning
No ratings yet
Autoencoder Applications in Deep Learning
7 pages
01-Unit 3
No ratings yet
01-Unit 3
17 pages
Autoencoders: Dimensionality Reduction Techniques
No ratings yet
Autoencoders: Dimensionality Reduction Techniques
74 pages
Understanding Autoencoders: Types and Functions
No ratings yet
Understanding Autoencoders: Types and Functions
52 pages
Autoencoders and Their Applications
No ratings yet
Autoencoders and Their Applications
25 pages
Unit 4
No ratings yet
Unit 4
31 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
26 pages
Understanding Autoencoder Structure
No ratings yet
Understanding Autoencoder Structure
4 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
23 pages
Understanding Autoencoders in Deep Learning
No ratings yet
Understanding Autoencoders in Deep Learning
16 pages
Understanding Autoencoders in Neural Networks
No ratings yet
Understanding Autoencoders in Neural Networks
16 pages
Autoencoders for Representation Learning
No ratings yet
Autoencoders for Representation Learning
18 pages
Factors Influencing Student Engagement
No ratings yet
Factors Influencing Student Engagement
66 pages
Teaching Procedure Texts: Food Recipes
No ratings yet
Teaching Procedure Texts: Food Recipes
7 pages
AI's Impact on Education and Business
No ratings yet
AI's Impact on Education and Business
6 pages
Powers of 10: Multiplication & Division
No ratings yet
Powers of 10: Multiplication & Division
21 pages
English Lesson Plans For Grade 9: Lessons in This Section
No ratings yet
English Lesson Plans For Grade 9: Lessons in This Section
18 pages
The Nature of Curiosity and Discovery
No ratings yet
The Nature of Curiosity and Discovery
2 pages
The Role of Age and Emotional Valence in Word Recognicition
No ratings yet
The Role of Age and Emotional Valence in Word Recognicition
13 pages
Benefits of Traveling: Personal Growth
No ratings yet
Benefits of Traveling: Personal Growth
2 pages
Color Poem Creation Guide
No ratings yet
Color Poem Creation Guide
8 pages
Individual vs. Group Decision Making
No ratings yet
Individual vs. Group Decision Making
9 pages
Format Abstrak untuk Penyelidikan
No ratings yet
Format Abstrak untuk Penyelidikan
2 pages
Biliteracy Framework for 4th Grade Science
No ratings yet
Biliteracy Framework for 4th Grade Science
7 pages
Enhancing Multicultural English Teaching
No ratings yet
Enhancing Multicultural English Teaching
15 pages
English Curriculum Overview Year 3
No ratings yet
English Curriculum Overview Year 3
18 pages
Overview of Raven's Progressive Matrices
No ratings yet
Overview of Raven's Progressive Matrices
5 pages
Yearly Lesson Plan for English Form 1
100% (2)
Yearly Lesson Plan for English Form 1
23 pages
Task Force on Dyslexia Report 2001
No ratings yet
Task Force on Dyslexia Report 2001
153 pages
Writing Effective Online Exam Questions
No ratings yet
Writing Effective Online Exam Questions
8 pages
Customer Service Training Guide
100% (5)
Customer Service Training Guide
22 pages
Book-Introduction To Linguistics 2019
No ratings yet
Book-Introduction To Linguistics 2019
153 pages
Intelligent Traffic Management System Report
100% (1)
Intelligent Traffic Management System Report
46 pages
Fine Motor Skills Development in Kids
No ratings yet
Fine Motor Skills Development in Kids
28 pages
Addition Sentences Lesson Plan
No ratings yet
Addition Sentences Lesson Plan
13 pages
Trainee Supervision and Training Plan
No ratings yet
Trainee Supervision and Training Plan
25 pages
Gad Elbeheri, Eric Q. Tridas - Assessing Dyslexia - A Teacher's Guide To Understanding and Evaluating Their Pupils' Needs-Routledge (2023)
No ratings yet
Gad Elbeheri, Eric Q. Tridas - Assessing Dyslexia - A Teacher's Guide To Understanding and Evaluating Their Pupils' Needs-Routledge (2023)
95 pages
Types and Importance of Delegated Authority
No ratings yet
Types and Importance of Delegated Authority
11 pages
GE1701 - Syllabus and Course Outline
100% (1)
GE1701 - Syllabus and Course Outline
9 pages
Deep Learning for Image Classification Review
No ratings yet
Deep Learning for Image Classification Review
8 pages
Understanding Artificial Intelligence
No ratings yet
Understanding Artificial Intelligence
16 pages
Understanding Stream of Consciousness
No ratings yet
Understanding Stream of Consciousness
10 pages

Unsupervised Learning

Uploaded by

Unsupervised Learning

Uploaded by

Unsupervised Learning Algorithm

►Self Organizing Maps

1. Each node’s weights are initialized.

2. A vector is chosen at random from the set of training data.

3. Every node is examined to calculate which one’s weights are

most like the input vector. The winning node is commonly

known as the Best Matching Unit (BMU).

4. Then the neighbourhood of the BMU is calculated. The amount

of neighbors decreases over time.

[Link] step 2 for N iterations.

Self-organizing maps differ from other artificial neural networks as

Create an architecture with a bottleneck,

Create an architecture with a bottleneck,

Encoder = compress data into lower-dimensional representation (latent space)

● Data should have dependencies across

● Data should have dependencies across

● If dimensions are all independent -> impossible to learn

Encoder = compress data into lower-dimensional representation (latent space)

Decoder = Decompress representation back to original domain

Decoder = Decompress representation back to original domain

Original data Reconstruction

● Minimise reconstruction error

● Minimise reconstruction error

Original data Reconstructed data

● Sensitive enough to input data to reconstruct it

● Insensitive enough to input data not to overfit it

● Sensitive enough to input data to reconstruct it

● Insensitive enough to input data not to overfit it

● Decoder: Convolution transpose + Leaky Relu + Batch

The latent space keeps the most important

We can leverage the latent space to perform

Add noise to audio

Add noise to audio

Add noise to audio

An autoencoder is a special type of

Let us consider the case where dim(h) < dim(x i )

An autoencoder where dim(h) < dim(x i ) is

Let us consider the case when

An autoencoder where dim(h) ≥ dim(x i ) is

● Both perform dimensionality reduction

● Both perform dimensionality reduction

● PCA learns linear relationships

● Both perform dimensionality reduction

● PCA learns linear relationships

● Encoders can learn non-linear relationships

● Both perform dimensionality reduction

● PCA learns linear relationships

● Encoders can learn non-linear relationships

use a linear encoder

x˜i Aim: N/w should not memories training data

The average value of the

The average value of the

The function will reach its minimum

It does so by adding the following reg-

sure that h is sensitive to only very

•Missing Value Imputation.

C S 7 0 1 5 (Deep Learning) : Lecture 1 6

C S 7 0 1 5 (Deep Learning) : Lecture 1 6

A man throwing . . . ⟨ stop ⟩

< GO> A man park

Algorithm: Gradient descent with

C S 7 0 1 5 (Deep Learning) : Lecture 1 6

C S 7 0 1 5 (Deep Learning) : Lecture 1 6

You might also like