0% found this document useful (0 votes)

15 views26 pages

Understanding RNNs, LSTMs, and GRUs

Uploaded by

Indoritwist

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views26 pages

Understanding RNNs, LSTMs, and GRUs

Uploaded by

Indoritwist

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

Deep Learning

1
Recurrent Neural Network
Foods History Sentence History…

.... working with her.

.... working with him.

2
3
Recurrent Neural Network (RNN)
Recurrent Neural Network is a generalization of feedforward
neural network that has an internal memory. RNN is recurrent in
nature as it performs the same function for every input of data
while the output of the current input depends on the past one
computation. After producing the output, it is copied and sent
back into the recurrent network. For making a decision, it
considers the current input and the output that it has learned
from the previous input.
Unlike feedforward neural networks, RNNs can use their internal
state (memory) to process sequences of inputs. This makes them
applicable to tasks such as unsegmented, connected handwriting
recognition or speech recognition. In other neural networks, all
the inputs are independent of each other. But in RNN, all the
inputs are related to each other. 4
RNN Architecture and Working…

5
Unfold the RNN Layers

6
Examples of RNN with respect to the relationships

/
7
What is Time series Analysis, How relate it is RNN to
A time series is a series of data points
indexed in time order. Most commonly, a time
series is a sequence taken at successive
equally spaced points in time. Thus it is a
sequence of discrete-time data

Time series model is purely dependent on the idea

that past behavior and price patterns can be used
to predict future price behavior.

8
Why RNN and what is difference between
ANN & RNN

This is a cat, and _____

is a good pet animal

9
Vanishing gradient problem
The vanishing gradient makes the gradient very close to zero, so it's difficult to know
where to move in the state space; the exploding gradient makes the gradient a very
large value, so it makes learning unstable. This problem is more pronounced in
recurrent networks since they use the same matrix at each time step.

10
11
Exploding Gradient: Vanishing Gradient:
The working of the exploding When making use of back-
gradient is similar but the weights propagation the goal is to calculate the error which
here change drastically instead is actually found out by finding
of negligible change. Notice the small out the difference between the actual output and
the model output and raising.
change.

12
Exploding Gradient: Vanishing Gradient:
The working of the exploding When making use of back-
gradient is similar but the weights propagation the goal is to calculate the error which
here change drastically instead is actually found out by finding
of negligible change. Notice the small out the difference between the actual output and
the model output and raising.
change.

13
14
Basic LSTM

Long short-term memory network was first introduced in

1997 by Sepp Hochreiter and his supervisor for a Ph.D.
thesis.
LSTM is a special kind of RNN, capable of
learning long term dependencies.
Remembering information for long period of time is it’s
default behaviour.
Long short-term memory (LSTM) network is the most
popular solution to the vanishing gradient problem.

15
16
First Understand the RNN Works

This is a cat, and _____ is a good pet animal

17
Looking More Clearly

18
Looking More Clearly

19
LSTM’s and GRU’s as a solution
LSTM ’s and GRU’s were created as the solution to short-term
memory. They have internal mechanisms called gates that can
regulate the flow of information.

20
LSTM’s as a solution

21
LSTM’s as a solution (steps)
1. First, the previous hidden state and the current input get concatenated. We’ll call it combine.

2. Combine get’s fed into the forget layer. This layer removes non-relevant data.

4. A candidate layer is created using combine. The candidate holds possible values to add to the
cell state.

3. Combine also get’s fed into the input layer. This layer decides what data from the candidate
should be added to the new cell state.

5. After computing the forget layer, candidate layer, and the input layer, the cell state is
calculated using those vectors and the previous cell state.

6. The output is then computed.

7. Pointwise multiplying the output and the new cell state gives us the new hidden state. 22
GRU’s () Gated Recurrent Unit, as a solution
Now we know how an LSTM work, let’s briefly look at the GRU. The GRU is the newer
generation of Recurrent Neural networks and is pretty similar to an LSTM. GRU’s got rid of the
cell state and used the hidden state to transfer information. It also only has two gates, a reset
gate and update gate.

23
RNN vs LSTM vs GRU

24
RNN vs LSTM vs GRU

The key difference between a GRU and an LSTM is that a GRU has two gates
(reset and update gates) whereas an LSTM has three gates (namely input,
output and forget gates).

GRUs train faster and perform better than LSTMs on less training data if you are
doing language modeling (not sure about other tasks).

GRUs are simpler and thus easier to modify, for example adding new gates in
case of additional input to the network. It's just less code in general.

LSTMs should in theory remember longer sequences than GRUs and outperform
25
them in tasks requiring modeling long-distance relations.
Thanks
26

Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
20 pages
Chapter 5
No ratings yet
Chapter 5
48 pages
Overview of Recurrent Neural Networks
100% (1)
Overview of Recurrent Neural Networks
14 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
29 pages
Autoregressive Models & RNNs Explained
No ratings yet
Autoregressive Models & RNNs Explained
40 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
144 pages
LSTM Overview and Applications
No ratings yet
LSTM Overview and Applications
72 pages
RNNs for Time Series Prediction in Finance
100% (1)
RNNs for Time Series Prediction in Finance
35 pages
Advanced NLP: LSTM & GRU Explained
No ratings yet
Advanced NLP: LSTM & GRU Explained
68 pages
Simple CNN and RNN Model Overview
100% (3)
Simple CNN and RNN Model Overview
20 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
7 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
36 pages
RNN and LSTM Applications Overview
No ratings yet
RNN and LSTM Applications Overview
35 pages
LSTM and GRU: Illustrated Guide
No ratings yet
LSTM and GRU: Illustrated Guide
15 pages
Lecture6 RNN
No ratings yet
Lecture6 RNN
40 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
126 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
83 pages
LSTM and GRU Explained: A Visual Guide
No ratings yet
LSTM and GRU Explained: A Visual Guide
10 pages
Module3 Notes
No ratings yet
Module3 Notes
8 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
51 pages
Understanding RNNs, LSTMs, and GRUs
No ratings yet
Understanding RNNs, LSTMs, and GRUs
58 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
47 pages
Module 5 - Recurrent Neural Networks (RNN), LSTM and Gru
No ratings yet
Module 5 - Recurrent Neural Networks (RNN), LSTM and Gru
12 pages
Recurrent Neural Networks
No ratings yet
Recurrent Neural Networks
37 pages
Week - 19 (1) 3
No ratings yet
Week - 19 (1) 3
60 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
33 pages
Tugas Modul 6: Pembelajaran RNN
No ratings yet
Tugas Modul 6: Pembelajaran RNN
5 pages
RNN, LSTM, and GRU Architectures Explained
No ratings yet
RNN, LSTM, and GRU Architectures Explained
9 pages
UNIT IV Deep Learing
No ratings yet
UNIT IV Deep Learing
31 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
47 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
28 pages
Unit 4 DLA
No ratings yet
Unit 4 DLA
22 pages
RNN
No ratings yet
RNN
20 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
7 pages
Unfolding Computational Graphs in RNNs
No ratings yet
Unfolding Computational Graphs in RNNs
17 pages
Recurrent Neural Networks Overview
No ratings yet
Recurrent Neural Networks Overview
57 pages
UNIT-V NNDL New
No ratings yet
UNIT-V NNDL New
17 pages
Understanding RNNs and LSTMs
No ratings yet
Understanding RNNs and LSTMs
36 pages
Sequence Models in Deep Learning
No ratings yet
Sequence Models in Deep Learning
49 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
16 pages
Understanding RNN, LSTM, and GRU Concepts
No ratings yet
Understanding RNN, LSTM, and GRU Concepts
11 pages
RNNs and Sequence Modeling Techniques
No ratings yet
RNNs and Sequence Modeling Techniques
26 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
36 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
19 pages
Introduction to Recurrent Neural Networks
No ratings yet
Introduction to Recurrent Neural Networks
11 pages
RNN Architectures: LSTM vs GRU vs Transformer
0% (1)
RNN Architectures: LSTM vs GRU vs Transformer
123 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
31 pages
RNN and LSTM: Sequence Modeling Insights
No ratings yet
RNN and LSTM: Sequence Modeling Insights
42 pages
RNN Unrolling and Training Insights
No ratings yet
RNN Unrolling and Training Insights
60 pages
RNN and LSTM Overview and Applications
No ratings yet
RNN and LSTM Overview and Applications
43 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
12 pages
NLP Techniques: RNNs, LSTM, GRU, GANs
No ratings yet
NLP Techniques: RNNs, LSTM, GRU, GANs
37 pages
GRU vs LSTM: Pros and Cons in NLP
No ratings yet
GRU vs LSTM: Pros and Cons in NLP
59 pages
Understanding Recurrent Neural Networks
No ratings yet
Understanding Recurrent Neural Networks
14 pages
RNN and LSTM in Natural Language Processing
No ratings yet
RNN and LSTM in Natural Language Processing
36 pages
Understanding RNN and LSTM Models
No ratings yet
Understanding RNN and LSTM Models
51 pages
Understanding LSTM Gates and Equations
No ratings yet
Understanding LSTM Gates and Equations
22 pages
Sociology 102 Course Syllabus
No ratings yet
Sociology 102 Course Syllabus
2 pages
Document
No ratings yet
Document
19 pages
Sequence and Coding Challenges
No ratings yet
Sequence and Coding Challenges
3 pages
300 Vocabulary Words
No ratings yet
300 Vocabulary Words
52 pages
Newton's Polynomials Difference Formulas
No ratings yet
Newton's Polynomials Difference Formulas
8 pages
Monitoring Oracle Tablespace Growth
No ratings yet
Monitoring Oracle Tablespace Growth
32 pages
Problems On Variation - Word Problems of Constant Variation - Inverse Variation
No ratings yet
Problems On Variation - Word Problems of Constant Variation - Inverse Variation
1 page
Essay Guidelines for Historical Figures
No ratings yet
Essay Guidelines for Historical Figures
2 pages
IELTS Writing: Causes and Solutions Guide
No ratings yet
IELTS Writing: Causes and Solutions Guide
9 pages
Mahesh Chand: Chemical Engineer CV
No ratings yet
Mahesh Chand: Chemical Engineer CV
4 pages
Revised Six-Kingdom System of Life
No ratings yet
Revised Six-Kingdom System of Life
64 pages
Understanding Software Quality Concepts
No ratings yet
Understanding Software Quality Concepts
70 pages
Geostatistical Methods for Spatial Prediction
No ratings yet
Geostatistical Methods for Spatial Prediction
35 pages
Understanding Informal Assessment
No ratings yet
Understanding Informal Assessment
25 pages
CS1 Actuarial Science Study Notes
No ratings yet
CS1 Actuarial Science Study Notes
16 pages
Amendments to Pharmacy Law in the Philippines
No ratings yet
Amendments to Pharmacy Law in the Philippines
3 pages
Exploring Science SB2
100% (5)
Exploring Science SB2
197 pages
Pore Pressure Engineering Service
No ratings yet
Pore Pressure Engineering Service
1 page
Youtube Rhetorical Review - Video Presentation Rubric
No ratings yet
Youtube Rhetorical Review - Video Presentation Rubric
2 pages
Timetable Igcse.
No ratings yet
Timetable Igcse.
20 pages
QCAA Assessment Terms Glossary
No ratings yet
QCAA Assessment Terms Glossary
14 pages
Greek Essay Writing Guidelines
100% (1)
Greek Essay Writing Guidelines
2 pages
Language Functions and Grammar Quiz
No ratings yet
Language Functions and Grammar Quiz
2 pages
Advantages of Remedial Activities
100% (1)
Advantages of Remedial Activities
2 pages
PSR and PET Module Overview
No ratings yet
PSR and PET Module Overview
3 pages
Overview of Sample and Hold Circuits
No ratings yet
Overview of Sample and Hold Circuits
24 pages
Teacher 1 Work Experience Sheet
100% (1)
Teacher 1 Work Experience Sheet
2 pages
Exponential Growth and Decay Models
No ratings yet
Exponential Growth and Decay Models
35 pages
Understanding Computer Integrated Manufacturing
No ratings yet
Understanding Computer Integrated Manufacturing
117 pages
Work-Life Balance Strategies Guide
No ratings yet
Work-Life Balance Strategies Guide
21 pages

Understanding RNNs, LSTMs, and GRUs

Uploaded by

Understanding RNNs, LSTMs, and GRUs

Uploaded by

Deep Learning

.... working with her.

.... working with him.

Time series model is purely dependent on the idea

This is a cat, and _____

Long short-term memory network was first introduced in

This is a cat, and _____ is a good pet animal

6. The output is then computed.

You might also like