Understanding ResNet Architecture

The document outlines the ResNet architecture, introduced by Microsoft Research in 2015, which effectively addresses the vanishing and exploding gradient problems in deep networks through the use of residual blocks and skip connections. It highlights the advantages of ResNet in enabling the training of very deep networks, achieving significant performance improvements in image recognition tasks, and setting new benchmarks in computer vision. The session also includes learning outcomes, activities, and a brief overview of related architectures and applications.

Uploaded by

The Greatest

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views30 pages

Understanding ResNet Architecture

Uploaded by

The Greatest

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

ResNet Architectures

Session No.:2
Course NSession No.:24
Course Name: Deep Learning
Course Code: R1UC604C
Instructor Name: Dr. JAYAPRAKASH C
Review of the key concepts of session no. 23

• VGGNet showed the power of deep CNNs for

image recognition
• VGG16 & VGG19 are powerful pre-trained
models for transfer learning
• Simple, uniform design—computationally heavy
but conceptually clear
• Provide a strong baseline for comparing with
modern architectures
Ask Questions

How is Resnet ? Why Resnet needed

then AlexNet ?
At the end of this session students will be able to

Learning Outcome 1:
Describe ResNet and
Architecure

Learning Outcome 2:
To understand advantages
and key features of ResNet.
Session Outline
1 Residual Networks (ResNet) - Deep Learning
2 Introduction to ResNet
3 Key features

4 Activity 1

5 ResNet-34 Architecture

6 Activity 2

7 Conclusion
Residual Networks (ResNet) - Deep
Learning
• ResNet introduced by Microsoft Research in 2015
• Addressed vanishing/exploding gradient issues in deep networks
• Enabled very deep CNNs (up to 1000 layers) to train effectively
• ResNet (residual network) is a type of neural network
architecture that enables the training of very deep
networks by using "skip connections" to bypass
layers. This allows the network to learn residual
functions—the difference between the input and
output—making it easier to train and solving the
vanishing gradient problem common in deep
networks. Introduced in 2015, ResNet won the
ImageNet challenge and has been widely used for
tasks like image classification and object detection
Background: Increasing Network Depth
• After AlexNet (ImageNet 2012), deeper
networks were used to reduce error rates
• However, very deep networks suffered from
vanishing/exploding gradients
• Result: Increasing layers beyond a limit led to
higher training and testing errors
Problem: Vanishing/Exploding Gradient

• In deep neural networks, gradients can

become very small (vanish) or too large
(explode)
• This leads to unstable training and poor
convergence
• Example: 56-layer CNN performed worse than
a 20-layer CNN on ImageNet
Introduction to ResNet
• Proposed in 2015 by Kaiming He et al.
(Microsoft Research)
• Introduced a new architecture called Residual
Networks
• Key innovation: Residual Blocks using skip
(shortcut) connections
• Allows deeper networks to train effectively
without gradient issues
Key features
• Residual blocks: The core of ResNet is the residual block, which takes an
input, passes it through a few layers, and then adds the original input to
the output of those layers.
• Skip connections: These are the "shortcut" paths that allow the input to
be directly added to the output of a block. This enables gradients to flow
back to earlier layers more easily, preventing the vanishing gradient
problem.
• Learning the residual: Instead of learning the entire mapping from input
to output (\(Y=F(X)\)), ResNet learns the residual or the difference (\
(Y=F(X)+X\)). This is more efficient because much of the information from
the input is often already present in the output.
• Training very deep networks: By mitigating the vanishing gradient
problem, ResNet allows for the creation of networks with hundreds or
even thousands of layers, which was previously difficult or impossible.
Residual Blocks: Core Idea
• Instead of learning direct mapping H(x),
network learns residual mapping F(x):
• F(x) = H(x) - x (residual function)
• Final output: H(x) = F(x) + x
• This is achieved using skip connections that
bypass one or more layers
Skip (Shortcut) Connections
• Skip connections link earlier activations
directly to later layers
• If a layer harms performance, the skip
connection bypasses it
• This improves gradient flow and stabilizes
deep network training
• Allows training of networks with 100–1000
layers without vanishing gradients
Learning Activity 1:
Activity 1:
(pen and paper)

• Think–Pair–Share Activity
• Step 1 (Think):
Students individually write down one advantage and one disadvantage of Dropout
and Early Stopping each.
• Step 2 (Pair):
Share your answers with a partner. Discuss which regularization method seems
more suitable for a small dataset and why.
• Step 3 (Share):
A few pairs present their conclusions to the class.
Advantages of Skip Connections
• Improves gradient flow during
backpropagation
• Prevents degradation in very deep networks
• Faster convergence and better accuracy
• Supports flexible network depth (can scale
easily)
Related Architecture: Highway Networks

• Similar to ResNet but includes parametric

gates in skip paths
• Inspired by LSTMs – gates control how much
information passes through
• Highway Networks did not achieve better
accuracy than ResNet
ResNet-34 Architecture
• Based on a 34-layer plain network (inspired by
VGG-19)
• Shortcut connections convert this into a
residual network
• Each block consists of convolution, batch
normalization, ReLU, and skip addition
• Used as the baseline ResNet model
Schematic of ResNet-34. (a) The overall structure of
ResNet-34. It consists of a convolutional unit (Conv1) and
four residual blocks (Conv2_x, Conv3_x, Conv4_x and
Conv5_x), which generate the final probability vectors
and output the category labels by average pooling layer
and fully connected layer;

(b) Internal structure of the four residual blocks. Dashed

and solid lines between layers represented two different
short-circuit connection mechanisms of the residual units;

(c) The short-circuit connection mechanism (taking

Conv3_x block as an example) indicates that the number
of input and output channels are different, corresponding
with dashed lines in (b);

(d) The short-circuit connection mechanism (taking

Conv3_x block as an example) indicates that the number
of input and output channels are the same, corresponding
with solid lines in (b).
Implementation using TensorFlow & Keras
• Dataset: CIFAR-10 (60,000 color images, 32×32
pixels, 10 classes)
• Keras datasets API used to load CIFAR-10
• ResNet implemented by stacking multiple
residual blocks
• Can easily extend to deeper versions like
ResNet-50, ResNet-101, etc.
Impact and applications
• Image recognition: ResNet won the ImageNet Large
Scale Visual Recognition Challenge in 2015 and has
become a fundamental part of many computer vision
systems.
• Benchmark performance: It set a new standard for
performance on benchmark problems like ImageNet,
with models achieving substantial improvements over
previous methods.
• Versatility: Beyond image classification, ResNet is used in
other applications such as image super-resolution and
modeling physical systems.
Learning Activity 2:
Activity 2: (pen and paper)
• Think–Pair–Share Activity
• Step 1 (Think):
Students individually write down one advantage and one disadvantage of Dropout
and Early Stopping each.
• Step 2 (Pair):
Share your answers with a partner. Discuss which regularization method seems
more suitable for a small dataset and why.
• Step 3 (Share):
A few pairs present their conclusions to the class.
Summary and Conclusion
• ResNet revolutionized deep learning by
addressing vanishing gradients
• Residual connections make it easier to train
very deep models
• ResNet models (ResNet-34, 50, 101, 152)
remain foundational in vision tasks
• Inspired later architectures such as DenseNet,
EfficientNet, and Transformers
At the end of this session students will be able to

Learning Outcome 1:
Describe ResNet and
Architecure

Learning Outcome 2:
To understand advantages
and key features of ResNet.
• Post session activities
• Information to next topic of the
course

session no. 25

Transfer Learning Basics and Fine-

Tuning Strategies
Review and Reflection
from students

Galgotias University 30

Res Net
No ratings yet
Res Net
2 pages
ResNet Architecture and Applications Explained
No ratings yet
ResNet Architecture and Applications Explained
11 pages
ResNet: Transforming Deep Learning
No ratings yet
ResNet: Transforming Deep Learning
27 pages
Understanding CNN Architectures and ResNet
No ratings yet
Understanding CNN Architectures and ResNet
35 pages
ResNet and ResNeXt Overview
No ratings yet
ResNet and ResNeXt Overview
26 pages
ResNet: Deep Residual Networks Explained
No ratings yet
ResNet: Deep Residual Networks Explained
13 pages
ResNet: Deep Learning for Vision Tasks
No ratings yet
ResNet: Deep Learning for Vision Tasks
5 pages
ResNet Architecture and Skip Connections
No ratings yet
ResNet Architecture and Skip Connections
4 pages
ImageNet ConvNet Architectures Overview
No ratings yet
ImageNet ConvNet Architectures Overview
13 pages
ResNet Architecture and Implementation
No ratings yet
ResNet Architecture and Implementation
17 pages
Residual
No ratings yet
Residual
8 pages
Overview of CNN Architectures and Models
No ratings yet
Overview of CNN Architectures and Models
59 pages
Understanding Residual Networks (ResNet)
No ratings yet
Understanding Residual Networks (ResNet)
11 pages
Deep Convolutional Neural Network Architectures
No ratings yet
Deep Convolutional Neural Network Architectures
66 pages
CO2 Session12
No ratings yet
CO2 Session12
27 pages
Residual Networks as Shallow Ensembles
No ratings yet
Residual Networks as Shallow Ensembles
40 pages
Convolutional Neural Networks Overview
No ratings yet
Convolutional Neural Networks Overview
117 pages
Machine Learning Techniques Overview
No ratings yet
Machine Learning Techniques Overview
61 pages
Residual Networks Deep Learning
No ratings yet
Residual Networks Deep Learning
15 pages
ResNet and VGGNet Architecture Overview
No ratings yet
ResNet and VGGNet Architecture Overview
44 pages
Residual
No ratings yet
Residual
9 pages
Deep Residual Learning for Image Recognition
No ratings yet
Deep Residual Learning for Image Recognition
46 pages
Deep Residual Learning For Image Recognition: Te-Comps-A: 62-Swayam Mhaske 63-Kamlesh Mistry 70-Santosh Mahato
No ratings yet
Deep Residual Learning For Image Recognition: Te-Comps-A: 62-Swayam Mhaske 63-Kamlesh Mistry 70-Santosh Mahato
15 pages
ResNet: Revolutionizing Deep Learning
100% (1)
ResNet: Revolutionizing Deep Learning
8 pages
Comparative Analysis of CNN Architectures
No ratings yet
Comparative Analysis of CNN Architectures
41 pages
Basics of Supervised Deep Learning
No ratings yet
Basics of Supervised Deep Learning
12 pages
Res Net 4
No ratings yet
Res Net 4
23 pages
Overview of VGG-16 and CNN Architectures
No ratings yet
Overview of VGG-16 and CNN Architectures
14 pages
Image Classification Based On RESNET
No ratings yet
Image Classification Based On RESNET
7 pages
Deep Learning & CNNs Overview 2025-26
No ratings yet
Deep Learning & CNNs Overview 2025-26
27 pages
ResNet and VGGNet Architectures Overview
100% (1)
ResNet and VGGNet Architectures Overview
44 pages
Understanding GRU and ResNet-50 Architecture
No ratings yet
Understanding GRU and ResNet-50 Architecture
1 page
Key Insights from CNN Case Studies
No ratings yet
Key Insights from CNN Case Studies
94 pages
ResNet and VGGNet Architectures Explained
No ratings yet
ResNet and VGGNet Architectures Explained
44 pages
Image Feature Extraction in CNNs
No ratings yet
Image Feature Extraction in CNNs
3 pages
Convolutional Neural Network Algorithm
No ratings yet
Convolutional Neural Network Algorithm
14 pages
RESNET for Image Classification Insights
No ratings yet
RESNET for Image Classification Insights
5 pages
ResNet50 Architecture Overview
No ratings yet
ResNet50 Architecture Overview
4 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
73 pages
LAB 4 AlexNetandResNet
No ratings yet
LAB 4 AlexNetandResNet
17 pages
Deep Learning in Image Processing
No ratings yet
Deep Learning in Image Processing
63 pages
Deep Learning Architectures Overview
No ratings yet
Deep Learning Architectures Overview
17 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
211 pages
AI in Image Processing: ML & DL Insights
No ratings yet
AI in Image Processing: ML & DL Insights
14 pages
Comparing CNN Architectures: AlexNet, VGG, ResNet
No ratings yet
Comparing CNN Architectures: AlexNet, VGG, ResNet
25 pages
Understanding CNN Architectures
No ratings yet
Understanding CNN Architectures
14 pages
TresNet-L Parameter Overview
No ratings yet
TresNet-L Parameter Overview
37 pages
CNN Architectures: AlexNet, VGGNet, ResNet, Inception
No ratings yet
CNN Architectures: AlexNet, VGGNet, ResNet, Inception
14 pages
Unit 4
No ratings yet
Unit 4
14 pages
Understanding ConvNet Architectures
No ratings yet
Understanding ConvNet Architectures
38 pages
ConvNet Architectures Overview
No ratings yet
ConvNet Architectures Overview
37 pages
Lightweight ResNet-9 for Image Classification
No ratings yet
Lightweight ResNet-9 for Image Classification
4 pages
ResNet Architecture and Implementation Guide
No ratings yet
ResNet Architecture and Implementation Guide
18 pages
CNN Architectures in Deep Learning
No ratings yet
CNN Architectures in Deep Learning
4 pages
Overview of CNN Models: AlexNet to DenseNet
No ratings yet
Overview of CNN Models: AlexNet to DenseNet
7 pages
Understanding Residual Learning in ResNet
No ratings yet
Understanding Residual Learning in ResNet
3 pages
Supervised Deep Learning Basics
No ratings yet
Supervised Deep Learning Basics
41 pages
088706115X PhysicsTime
No ratings yet
088706115X PhysicsTime
612 pages
SITC of Main Panel at Sinchai Bhawan
No ratings yet
SITC of Main Panel at Sinchai Bhawan
13 pages
Minimum Spanning Tree Analysis
No ratings yet
Minimum Spanning Tree Analysis
1 page
2010 Front Discharge Mixer Truck Parts Manual PN - 30947 - FDPB - Rev - 5
No ratings yet
2010 Front Discharge Mixer Truck Parts Manual PN - 30947 - FDPB - Rev - 5
292 pages
Overview of OM 607 Engine Specifications
100% (2)
Overview of OM 607 Engine Specifications
20 pages
Mathematics Practice Test Questions
No ratings yet
Mathematics Practice Test Questions
14 pages
Post-Harvest Agriculture Overview
No ratings yet
Post-Harvest Agriculture Overview
8 pages
15
No ratings yet
15
468 pages
Chlorine Content in Illinois Coal
No ratings yet
Chlorine Content in Illinois Coal
25 pages
Zir't Xikm't'g Qupum Qdjuch Guide
No ratings yet
Zir't Xikm't'g Qupum Qdjuch Guide
3 pages
AI Introduction Exam Paper Summer 2023
No ratings yet
AI Introduction Exam Paper Summer 2023
1 page
Analyze Scatter Plots & Sequences
No ratings yet
Analyze Scatter Plots & Sequences
6 pages
Limestone Ammonium Nitrate Effects on Maize
100% (1)
Limestone Ammonium Nitrate Effects on Maize
5 pages
Syncopation Exercises in 4/4 Time
100% (4)
Syncopation Exercises in 4/4 Time
106 pages
Java Object Oriented Programming Guide
No ratings yet
Java Object Oriented Programming Guide
172 pages
Electromagnetic Theory Question Bank
No ratings yet
Electromagnetic Theory Question Bank
18 pages
Car Voltage Stabilizer Circuit Design
No ratings yet
Car Voltage Stabilizer Circuit Design
9 pages
OLED Display Datasheet LY112WG22-128128
100% (1)
OLED Display Datasheet LY112WG22-128128
22 pages
Manifesting Your Desires Using Reiki
No ratings yet
Manifesting Your Desires Using Reiki
8 pages
Types of Vertical Evaporators Explained
No ratings yet
Types of Vertical Evaporators Explained
9 pages
Prob & Stat PDF
No ratings yet
Prob & Stat PDF
173 pages
Pronest Software
No ratings yet
Pronest Software
2 pages
Cond Bench F30 - FP30 EN Operating Instructions
No ratings yet
Cond Bench F30 - FP30 EN Operating Instructions
32 pages
Probability and Rational Choice Course Guide
No ratings yet
Probability and Rational Choice Course Guide
16 pages
Network Coding Theory Overview
No ratings yet
Network Coding Theory Overview
34 pages
Python OOP Exercises and Exception Handling
No ratings yet
Python OOP Exercises and Exception Handling
7 pages
Understanding Solution Conductivity
No ratings yet
Understanding Solution Conductivity
1 page
Duct Design Tables and Charts
No ratings yet
Duct Design Tables and Charts
45 pages
Biophysical Basis For Meridian and Acupoint Functions: Evidence - Based Complementary and Alternative Medicine
No ratings yet
Biophysical Basis For Meridian and Acupoint Functions: Evidence - Based Complementary and Alternative Medicine
321 pages
Gullfaks Oil Field Reservoir Analysis
No ratings yet
Gullfaks Oil Field Reservoir Analysis
8 pages