0% found this document useful (0 votes)

3 views4 pages

Convolution Types and Activation Functions

The document outlines various convolution types used in CNNs, including Standard, Dilated, Transposed, Separable, Grouped, Pointwise, Causal, and Deformable convolutions, each with distinct characteristics and applications. It also discusses the importance of nonlinearity in CNNs, detailing different activation functions like Sigmoid, Tanh, ReLU, Leaky ReLU, ELU, and Softmax, along with their advantages and disadvantages. The overall focus is on enhancing feature extraction and improving model performance in tasks such as image generation and object detection.

Uploaded by

successtrbtet

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views4 pages

Convolution Types and Activation Functions

Uploaded by

successtrbtet

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

1.

Standard Convolution

 The basic sliding filter operation across the input.

 Each filter computes a weighted sum over the receptive field

2. Dilated (Atrous) Convolution

 Introduces gaps ("dilations") between kernel elements.

 Increases receptive field without increasing parameters.
 Useful in segmentation tasks (e.g., DeepLab).

3. Transposed Convolution (Deconvolution / Fractionally Strided)

 Upsampling variant of convolution.

 Used for generating larger feature maps (e.g., in decoders, GANs).

4. Separable Convolutions

 Spatially Separable: Breaks a 2D kernel into two 1D kernels (e.g., 3×3 → 3×1 + 1×3).
 Depthwise Separable: Splits convolution into two steps:
1. Depthwise convolution (per-channel).
2. Pointwise convolution (1×1 across channels).
 Used in MobileNet for efficiency.

5. Grouped Convolution

 Input channels are split into groups, and each group is convolved separately.
 Reduces computation.
 Used in ResNeXt and AlexNet.

6. Pointwise Convolution (1×1 Conv)

 A convolution with kernel size = 1.

 Used for channel mixing and dimensionality reduction.
 Core part of Inception modules.

7. Causal Convolution

 Ensures output at time t depends only on input at time ≤ t.

 Used in temporal models like WaveNet.

8. Deformable Convolution

 Learns offsets for sampling positions instead of fixed grid.

 Improves handling of geometric transformations (e.g., object detection).
In short:

 Standard = basic
 Dilated = bigger receptive field
 Transposed = upsampling
 Separable (depthwise/pointwise) = efficiency
 Grouped = channel grouping
 Causal = time-series
 Deformable = adaptive receptive field

Variant Operation / Formula Key Idea Use Case

Standard Sliding kernel over input, Feature extraction in
y=∑w⋅xy = \sum w \cdot x
Convolution weighted sum CNNs
Semantic
y=∑w⋅xd⋅iy = \sum w \cdot Inserts gaps (dilation rate)
Dilated (Atrous) segmentation
x_{d \cdot i} in kernel
(DeepLab)
Transposed Spreads input over output Image generation,
Reverse of standard conv
(Deconv) grid, learns upsampling decoders, GANs
Spatially 2D kernel → two 1D Factorizes kernel (e.g., 3×3 Reduces
Separable kernels → 3×1 + 1×3) computation
Depthwise Depthwise conv + Convolution per channel + MobileNet, efficient
Separable Pointwise (1×1) mixing channels CNNs
Grouped Split input channels into Convolve each group
AlexNet, ResNeXt
Convolution groups separately
Mixes channels,
Pointwise (1×1) Kernel size = 1×1 Inception modules
dimensionality reduction
Causal yt=∑w⋅x≤ty_t = \sum w Only depends on current & Time-series,
Convolution \cdot x_{\leq t} past inputs WaveNet
Deformable y=∑w⋅x(p+Δp)y = \sum w Learns offsets for sampling Object detection,
Convolution \cdot x(p + \Delta p) positions dense prediction

CNN LEARNING NONLINEARITY FUNCTION IN CNN:

 After convolution and pooling layers extract features, the activation function introduces non-
linearity so that the CNN can approximate nonlinear decision boundaries.

 Without nonlinearity, multiple convolution layers would collapse into a single linear
transformation → CNN would behave like a single linear classifier.
low of Nonlinearity in CNN

1. Input image → Convolution layer (linear feature extraction)

2. Activation (ReLU, etc.) → Nonlinearity
3. Pooling → Downsampling
4. Stack multiple layers (conv + activation)
5. Fully connected + Softmax for final prediction

Activation
Advantages Disadvantages
Function
– Smooth output between 0 and 1 – Vanishing gradient (small updates
Sigmoid (probability-like) – Historically well in deep layers) – Not zero-centered →
understood slower convergence
– Output between -1 and 1 (zero-
– Still suffers vanishing gradient –
Tanh centered) – Stronger gradients than
Slower than ReLU
sigmoid
– Very fast to compute – Reduces
– Dead neuron problem (neurons
ReLU vanishing gradient problem – Sparse
stuck at 0 forever) – Not smooth at 0
activation (only positive neurons fire)
– Fixes dead neuron issue (small slope for
– Extra parameter α to tune – Slightly
Leaky ReLU negatives) – Works better than ReLU in
more compute
some tasks
ELU – Smooth curve for negative values – – More computationally expensive –
Activation
Advantages Disadvantages
Function
Faster convergence than ReLU – Mean Slower than ReLU
activations closer to 0 → helps training
– Not used in hidden layers – Can be
Softmax – Converts raw scores into probability
unstable with very large inputs (needs
(output layer) distribution – Good for classification
normalization)

Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
26 pages
Deep Learning Fundamentals and CNNs
No ratings yet
Deep Learning Fundamentals and CNNs
61 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
10 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
47 pages
One Sheet To Rule Them All
No ratings yet
One Sheet To Rule Them All
3 pages
Image Classification with ANN and CNN
No ratings yet
Image Classification with ANN and CNN
8 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
49 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
35 pages
CS Net
No ratings yet
CS Net
40 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
82 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
7 pages
Deep Neural Networks Overview
No ratings yet
Deep Neural Networks Overview
51 pages
Image Classification with ANN & CNN
No ratings yet
Image Classification with ANN & CNN
8 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
70 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
22 pages
DL Unit-IV
No ratings yet
DL Unit-IV
61 pages
Variants of Convolution Functions in CNN
No ratings yet
Variants of Convolution Functions in CNN
23 pages
Unit - 4
No ratings yet
Unit - 4
13 pages
Introduction to Convolutional Neural Networks
No ratings yet
Introduction to Convolutional Neural Networks
35 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
10 pages
CNN Equation and Architecture Overview
No ratings yet
CNN Equation and Architecture Overview
8 pages
Deep Neural Networks Explained
No ratings yet
Deep Neural Networks Explained
99 pages
DL Unit-3
No ratings yet
DL Unit-3
12 pages
Mathematical Foundations of CNNs
No ratings yet
Mathematical Foundations of CNNs
6 pages
CNN Mathematical Operations Explained
No ratings yet
CNN Mathematical Operations Explained
4 pages
CNN Basics for Image Classification
No ratings yet
CNN Basics for Image Classification
7 pages
Fundamental of Deep Learning: DR - Mona Hussein Alnaggar
No ratings yet
Fundamental of Deep Learning: DR - Mona Hussein Alnaggar
46 pages
CNN and RNN Architectures Explained
No ratings yet
CNN and RNN Architectures Explained
36 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
15 pages
Module2 Notes
No ratings yet
Module2 Notes
12 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
14 pages
Introduction to AI and Neural Networks
No ratings yet
Introduction to AI and Neural Networks
71 pages
Introduction to Convolutional Neural Networks
No ratings yet
Introduction to Convolutional Neural Networks
29 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
30 pages
Introduction to Convolutional Neural Networks
No ratings yet
Introduction to Convolutional Neural Networks
31 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
14 pages
Deep Learning Concepts and CNNs
No ratings yet
Deep Learning Concepts and CNNs
9 pages
CNN Basics: Convolution & Layers
No ratings yet
CNN Basics: Convolution & Layers
18 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
16 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
23 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
31 pages
Simple CNN Implementation Lab Report
No ratings yet
Simple CNN Implementation Lab Report
9 pages
Deep Learning: Convolutional Neural Networks
No ratings yet
Deep Learning: Convolutional Neural Networks
71 pages
Unit 2 Notes
No ratings yet
Unit 2 Notes
62 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
21 pages
CNN
No ratings yet
CNN
51 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
9 pages
Overview of Convolutional Neural Networks
No ratings yet
Overview of Convolutional Neural Networks
11 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
22 pages
Aiml Ece Unit-5+
No ratings yet
Aiml Ece Unit-5+
49 pages
Deep Neural Networks Overview
No ratings yet
Deep Neural Networks Overview
39 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
48 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
30 pages
Unit 21
No ratings yet
Unit 21
24 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
33 pages
Internet History for Students
No ratings yet
Internet History for Students
1 page
Sliding Window Flow Control & ARQ Mechanisms
No ratings yet
Sliding Window Flow Control & ARQ Mechanisms
10 pages
Certified Associate Project Management Exam Outline
No ratings yet
Certified Associate Project Management Exam Outline
9 pages
Premier Race India: Game Development Report
No ratings yet
Premier Race India: Game Development Report
45 pages
PPA Guidelines for Suppliers at SEBN
No ratings yet
PPA Guidelines for Suppliers at SEBN
40 pages
C++ Pattern Printing Assignment Solutions
No ratings yet
C++ Pattern Printing Assignment Solutions
11 pages
iDirect X3 Modem Setup Guide
No ratings yet
iDirect X3 Modem Setup Guide
35 pages
IoT Home Automation and Security Insights
No ratings yet
IoT Home Automation and Security Insights
8 pages
HI VAC II Autoclave Specifications
No ratings yet
HI VAC II Autoclave Specifications
6 pages
Commcrete Stardust Tactical Radio
No ratings yet
Commcrete Stardust Tactical Radio
5 pages
Cloning R12 EBS Environment Guide
No ratings yet
Cloning R12 EBS Environment Guide
24 pages
Sign Language Recognition System Report
No ratings yet
Sign Language Recognition System Report
29 pages
BCCL Internship Report on Telecom Systems
No ratings yet
BCCL Internship Report on Telecom Systems
43 pages
Product of Mts
No ratings yet
Product of Mts
17 pages
Etisalat Telecom Infrastructure Guidelines
No ratings yet
Etisalat Telecom Infrastructure Guidelines
12 pages
Ranking Industrial HMI Panel Suppliers
No ratings yet
Ranking Industrial HMI Panel Suppliers
194 pages
DC DCconverters 2016
No ratings yet
DC DCconverters 2016
9 pages
Bubble Sort Overview and Analysis
No ratings yet
Bubble Sort Overview and Analysis
10 pages
3/2/1-Phase Synchronous-Rectified Buck Controller For Mobile GPU Power
No ratings yet
3/2/1-Phase Synchronous-Rectified Buck Controller For Mobile GPU Power
12 pages
Window Manager ANR State Report
No ratings yet
Window Manager ANR State Report
7,510 pages
KNX Communication Basics and Protocols
No ratings yet
KNX Communication Basics and Protocols
25 pages
Machine Learning Project Report Overview
No ratings yet
Machine Learning Project Report Overview
112 pages
BEEE Syllabus Overview for RGPV
No ratings yet
BEEE Syllabus Overview for RGPV
1 page
QGIS Training Guide for Myanmar Users
No ratings yet
QGIS Training Guide for Myanmar Users
232 pages
Internet Risks and Safety Tips for Youth
No ratings yet
Internet Risks and Safety Tips for Youth
3 pages
Supplier Invoice Workflow Management
No ratings yet
Supplier Invoice Workflow Management
25 pages
LTE 4G Wireless Communication Features
No ratings yet
LTE 4G Wireless Communication Features
26 pages
WESTRACE MKII Installation Checklist
No ratings yet
WESTRACE MKII Installation Checklist
25 pages
Business Acceptance Testing Overview
No ratings yet
Business Acceptance Testing Overview
13 pages
ASP.NET Practical Lab Manual
100% (2)
ASP.NET Practical Lab Manual
25 pages

Convolution Types and Activation Functions

Uploaded by

Convolution Types and Activation Functions

Uploaded by

1.

 The basic sliding filter operation across the input.

2. Dilated (Atrous) Convolution

 Introduces gaps ("dilations") between kernel elements.

3. Transposed Convolution (Deconvolution / Fractionally Strided)

 Upsampling variant of convolution.

6. Pointwise Convolution (1×1 Conv)

 A convolution with kernel size = 1.

 Ensures output at time t depends only on input at time ≤ t.

 Learns offsets for sampling positions instead of fixed grid.

Variant Operation / Formula Key Idea Use Case

CNN LEARNING NONLINEARITY FUNCTION IN CNN:

1. Input image → Convolution layer (linear feature extraction)

You might also like