Overview of One-Stage Detectors

Uploaded by

botov73940

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views35 pages

Overview of One-Stage Detectors

Uploaded by

botov73940

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Object detection

One-stage detectors
Object detection approaches
• Pass (process) the image through a Neural
Network.
• On the final feature map(s) of the network:
• Slide a window
• For each window location, predict an
object class and a bounding box for
each “anchor” (also called “default
Stage 1 − box” or “prior”), i.e., adjust the anchor.
• Generate multiple candidates (bounding boxes) all
over the image. It’s typical to make use of
“anchors”
• Selective search, Region Proposal Network
(RPN)
• Throw away the candidates without an object.
Stage 2 −
• Process each candidate independently
• Assign a category (class) to each candidate and One stage anchor-free detectors: Same as one stage
adjust its bounding box. detectors but instead of adjusting anchor, directly
predict: Top-left and bottom-right corners And / Or the
centre of the object
Object detection approaches
Two-stage detectors One-stage detectors
• R-CNN (2013 – 2014) • YOLO (2015 – 2016) − Latest: YOLOv8
(2023)
• SPP Net (2014 – 2015)
• SSD (2016)
• Fast R-CNN (2015)
• RetinaNet (2017)
• Faster R-CNN (2015) • CenterNet (2019)
• R-FCN (2016) • EfficientDet (2019 – 2020)
• Feature Pyramid Network • Swin Transformer (2021)
(2017)
Object detection techniques: Comparisons
YOLO- You Only Look Once

• The YOLO model was first described by Joseph Redmon, et al. in the 2015
paper titled “You Only Look Once: Unified, Real-Time Object Detection.”
Ross Girshick, developer of R-CNN, was also an author and contributor to
this work.
• The approach involves a single neural network trained end to end that
takes a photograph as input and predicts bounding boxes and class labels
for each bounding box directly.
• The R-CNN models may be generally more accurate, yet the YOLO family
of models are fast, much faster than R-CNN, achieving object detection in
real-time.

You Only Look Once: Unified, Real-Time Object Detection, Joseph Redmon, Santosh Divvala,
Ross Girshick, Ali Farhadi
YOLO- You Only Look Once: Concepts
• Detection as Single Regression Problem
• No bounding box proposal.
• A single regression problem, straight from Unified Detection
image pixels to bounding box coordinates and
class probabilities
• Developed as Single Convolutional Network
• Reason Globally on the Entire Image
• Learns Generalizable Representations

Easy and Fast

[Link]
Redmon et al. CVPR 2016.
YOLO: Step 1
• Divide the image into a grid of cells.
• Ex. SxS grid, , typically 7x7 or 13x13.
• If the center of an object fall into a grid cell, it will be the responsible for the object.
• Each cell is responsible for predicting a set of bounding boxes and class probabilities.
• A bounding box involving the x, y coordinate and the width and height and the
confidence.
• A class prediction is also based on each cell. For example, an image may be divided
into a 7×7 grid and each cell in the grid may predict 2 bounding boxes, resulting in
94 proposed bounding box predictions.
• The class probabilities map and the bounding boxes with confidences are then
combined into a final set of bounding boxes and class labels.
• Hence, Each grid cell predict:
• B bounding boxes;
• B confidence scores as C=Pr(Obj)*IOU;
• C cond. Class prob. as P=Pr(𝑪𝒍𝒂𝒔𝒔𝒊|Object);
• Confidence Prediction is obtained as IOU of predicted box and any ground truth box.
YOLO: Step 2
• Predict bounding boxes and class probabilities for each cell.
• For each cell, the YOLO algorithm predicts a set of bounding boxes and
class probabilities.
• The bounding boxes are represented as four coordinates: the top left
corner, the bottom right corner, and the width and height of the box.
• The class probabilities represent the probability that the object in the
box belongs to a particular class.
YOLO: Step 3
• Apply non-max suppression.
• The bounding boxes predicted by the YOLO algorithm may overlap.
• To remove overlapping boxes, the YOLO algorithm applies a non-max
suppression algorithm.
• This algorithm keeps the box with the highest confidence score, and it
removes all other boxes that have a high overlap with the selected box.
YOLO: Step 4
• Draw the bounding boxes and class labels on the image.
• The final step is to draw the bounding boxes and class labels on the
image.
• The bounding boxes are drawn in a different color for each class, and
the class labels are displayed next to the bounding boxes.
Loss-Function
Pros
• Trained on a loss function that directly corresponds to
detection performance.
• The entire model is trained jointly.
• The fastest general-purpose object detector in the literature.
• At least detection at 45fps.
Limitations
• Struggle with Small Object.
• Struggle with Different aspects and ratios of objects
• Loss function is an approximation.
• Loss function threats errors in different boxes ratio at the
same.
SSD: Single Shot MultiBox Detector
• Don’t generate object proposals!
• Consider a tiny subset of the output space by design; directly
classify this small set of boxes

Image credit:
[Link]
SSD: Design of Small set of boxes
SSD Network Structure

SSD architecture taken from the original paper

● 3*3 conv kernel

●2*2 pooling with stride = 2 VGG-16 network: by Oxford's Visual
Geometry Group (VGG): Simonyan,
Karen, and Andrew Zisserman. "Very
deep convolutional networks for large-
scale image recognition." arXiv preprint
arXiv:1409.1556 (2014).
SSD: Multi-scale Feature Map
SSD: Multi-scale Feature Map

Source:[Link]
SSD: Multi-scale Feature Map

Source: [Link]
Default Bounding Boxes - scale and shape
Default Bounding Boxes - scale and shape
Default Bounding Boxes
Why small boxes in large feature maps?

Source: [Link]
real-time-object-detection-in-deep-learning-495ef744fab
Default Bounding Boxes
Why small boxes in large feature maps?
• large feature map - small receptive field - small object
• small feature map - large receptive field - large object
Convolutional predictors for detection
Bounding Box Matching Strategy
Training Objective
• After pairing groundtruth and default boxes, we can write the objective
function:

Xijp ={1,0}: matching the i-th default box to the j-th ground truth box of
category p.
N: matched default boxes.
c: class confidence.
l: predicted bounding box
g: ground truth bounding box
Training Objective
SSD Network Structure vs YOLO

Similar to YOLO, but denser grid map, multiscale grid maps. + Data augmentation + Hard negative mining +
Other design choices in the network.
Design Improvement over YOLO
Hard Negative Mining
• Instead of using all the negative examples, we sort them using the highest
confidence loss for each default box and pick the top ones.
• The ratio between negative examples and positive examples is 3:1.
• This method leads to faster optimization and a more stable training.
Data Augmentation
• Making the model more robust to various input object sizes and outputs:
[Link] Images.
2. Sample patch with minimal jaccard scores as 0.1, 0.3, 0.5, 0.7 or 0.9.
3. Randomly sample a patch.
Experiments: Effects of various design choices and
components on SSD performance.
Experiments: Effects of using multiple output layers.
Detection Results
Strength and Drawbacks
• Strength Drawbacks
• High Speed • The classification task for
• High Accuracy small objects is relatively
• Simple Training(single shot) hard for SSD.
One-stage detection
• What could be the problems?
• The extreme foreground-background class imbalance -> we have a lot
more negative examples.
• Even though they have small loss values, the gradients overwhelm the
model
• Solution: Focal Loss for Dense Object Detection (Lin et al. ICCV 2017)
• For easy examples, we down-weight it loss, so that the gradients from
these example have smaller impact to the model

[Link]
Resources
• YOLO
• Original (Darknet) ([Link]
• Tensorflow ([Link]
• Keras ([Link]

• SSD (Caffe) ([Link]

YOLO: Real-Time Object Detection 2016
100% (1)
YOLO: Real-Time Object Detection 2016
10 pages
YOLO: Real-Time Object Detection System
No ratings yet
YOLO: Real-Time Object Detection System
10 pages
YOLO Architecture for Object Detection
100% (1)
YOLO Architecture for Object Detection
30 pages
YOLO: Real-Time Object Detection System
No ratings yet
YOLO: Real-Time Object Detection System
10 pages
YOLO v7: Object Detection Explained
100% (1)
YOLO v7: Object Detection Explained
32 pages
Yolopdf
No ratings yet
Yolopdf
10 pages
Deep Learning for Object Detection
No ratings yet
Deep Learning for Object Detection
37 pages
YOLO: Efficient Object Detection Method
No ratings yet
YOLO: Efficient Object Detection Method
14 pages
Understanding the YOLO Algorithm
No ratings yet
Understanding the YOLO Algorithm
21 pages
YOLO: Object Detection Overview
No ratings yet
YOLO: Object Detection Overview
20 pages
YOLO Architecture in Object Detection
No ratings yet
YOLO Architecture in Object Detection
13 pages
YOLO vs SSD: Object Detection Basics
No ratings yet
YOLO vs SSD: Object Detection Basics
4 pages
YOLO Object Detection Overview
No ratings yet
YOLO Object Detection Overview
14 pages
YOLO: Real-Time Object Detection Overview
100% (1)
YOLO: Real-Time Object Detection Overview
36 pages
L8 - Object Detection With Single Stage Methods Notes
No ratings yet
L8 - Object Detection With Single Stage Methods Notes
4 pages
Object Detection Techniques Overview
No ratings yet
Object Detection Techniques Overview
31 pages
Object Detection Techniques in ML
No ratings yet
Object Detection Techniques in ML
36 pages
Object Detection Techniques and Algorithms
No ratings yet
Object Detection Techniques and Algorithms
66 pages
Understanding YOLO for Object Detection
No ratings yet
Understanding YOLO for Object Detection
32 pages
Understanding Anchor Boxes in Object Detection
No ratings yet
Understanding Anchor Boxes in Object Detection
12 pages
YOLO Architecture and Performance Review
No ratings yet
YOLO Architecture and Performance Review
3 pages
YOLO-LITE: Efficient Object Detection
No ratings yet
YOLO-LITE: Efficient Object Detection
8 pages
Chapter 5 - Image - Recognition
No ratings yet
Chapter 5 - Image - Recognition
27 pages
YOLO Report
No ratings yet
YOLO Report
9 pages
YOLO Object Detection Overview
100% (1)
YOLO Object Detection Overview
19 pages
SSD: Fast Object Detection Framework
No ratings yet
SSD: Fast Object Detection Framework
17 pages
YOLO Object Detection Overview
No ratings yet
YOLO Object Detection Overview
43 pages
YOLO: Object Detection Overview
No ratings yet
YOLO: Object Detection Overview
31 pages
Introduction to Object Classification
No ratings yet
Introduction to Object Classification
24 pages
Understanding YOLO Object Detection
No ratings yet
Understanding YOLO Object Detection
46 pages
YOLO: Real-Time Object Detection System
No ratings yet
YOLO: Real-Time Object Detection System
78 pages
Object Classification and Detection Methods
No ratings yet
Object Classification and Detection Methods
27 pages
SSD: Fast Object Detection Framework
No ratings yet
SSD: Fast Object Detection Framework
17 pages
The YOLO
No ratings yet
The YOLO
6 pages
YOLO Object Detection Implementation Guide
No ratings yet
YOLO Object Detection Implementation Guide
4 pages
CV Presentation
No ratings yet
CV Presentation
14 pages
Deep Learning in Object Detection
No ratings yet
Deep Learning in Object Detection
35 pages
YOLO: Real-Time Object Detection Explained
No ratings yet
YOLO: Real-Time Object Detection Explained
5 pages
Enhancing Real-Time Object Detection With YOLO Alg
No ratings yet
Enhancing Real-Time Object Detection With YOLO Alg
9 pages
YOLO Algorithm for Real-Time Object Detection
No ratings yet
YOLO Algorithm for Real-Time Object Detection
9 pages
YOLO for Real-Time Object Detection
No ratings yet
YOLO for Real-Time Object Detection
17 pages
YOLO: Real-Time Object Detection Guide
No ratings yet
YOLO: Real-Time Object Detection Guide
10 pages
YOLO: Unified Real-Time Object Detection
100% (1)
YOLO: Unified Real-Time Object Detection
21 pages
YOLO Object Detection System Overview
No ratings yet
YOLO Object Detection System Overview
37 pages
YOLO Model for Real-Time Object Recognition
No ratings yet
YOLO Model for Real-Time Object Recognition
7 pages
YOLO vs SSD vs Faster R-CNN Comparison
No ratings yet
YOLO vs SSD vs Faster R-CNN Comparison
5 pages
YOLO Models for Document Object Detection
No ratings yet
YOLO Models for Document Object Detection
4 pages
YOLO Object Detection Architecture Explained
No ratings yet
YOLO Object Detection Architecture Explained
7 pages
YOLO-Based Real-Time Face Detection
No ratings yet
YOLO-Based Real-Time Face Detection
4 pages
YOLO9000: Real-Time Object Detection
No ratings yet
YOLO9000: Real-Time Object Detection
9 pages
YOLO: Advancements in Object Detection
No ratings yet
YOLO: Advancements in Object Detection
5 pages
YOLO Algorithm for Object Detection Insights
No ratings yet
YOLO Algorithm for Object Detection Insights
17 pages
YOLO Algorithm for Fast Object Detection
No ratings yet
YOLO Algorithm for Fast Object Detection
13 pages
YOLOv3: Object Detection Architecture
No ratings yet
YOLOv3: Object Detection Architecture
6 pages
YOLO Object Detection Overview
No ratings yet
YOLO Object Detection Overview
18 pages
Improved YOLOv3 for Object Detection
No ratings yet
Improved YOLOv3 for Object Detection
12 pages
Object Detection Methods in ML
No ratings yet
Object Detection Methods in ML
32 pages
YOLO Object Detection and Retrieval System
No ratings yet
YOLO Object Detection and Retrieval System
8 pages
YOLO Algorithm and Its Evolution
100% (1)
YOLO Algorithm and Its Evolution
264 pages
Operations Research Course Overview
No ratings yet
Operations Research Course Overview
1 page
Ai Unit 2 Notes
No ratings yet
Ai Unit 2 Notes
56 pages
Machine Learning Techniques Exam Paper
No ratings yet
Machine Learning Techniques Exam Paper
5 pages
Fundamentals of Computer Vision Workshop
100% (1)
Fundamentals of Computer Vision Workshop
21 pages
Debugging Strategies for Deep Learning
No ratings yet
Debugging Strategies for Deep Learning
5 pages
Non-Relativistic Dirac Equation Analysis
No ratings yet
Non-Relativistic Dirac Equation Analysis
3 pages
LCS and Greedy Algorithms Explained
No ratings yet
LCS and Greedy Algorithms Explained
3 pages
UML 2.0 for Discrete Event Simulation
No ratings yet
UML 2.0 for Discrete Event Simulation
11 pages
Data Analytics Techniques Overview
No ratings yet
Data Analytics Techniques Overview
16 pages
Forecasting Questions and Answers Guide
No ratings yet
Forecasting Questions and Answers Guide
3 pages
Pengendalian Posisi Motor DC dengan QNET
No ratings yet
Pengendalian Posisi Motor DC dengan QNET
7 pages
Heterotic Resolved Conifolds With Torsio
No ratings yet
Heterotic Resolved Conifolds With Torsio
53 pages
Fast Greedy K-Means Algorithm
No ratings yet
Fast Greedy K-Means Algorithm
62 pages
LCG Randomness Tests Explained
No ratings yet
LCG Randomness Tests Explained
6 pages
Euler's Method for Solving ODEs
No ratings yet
Euler's Method for Solving ODEs
17 pages
Cryptanalysis of Vigen Re Cipher Method Implementation
No ratings yet
Cryptanalysis of Vigen Re Cipher Method Implementation
5 pages
Table 1410you Worked As An Intern at We Always Win Car Insurance Company Last Summer You Notice That Individual Car Insurance Premiums Depen
No ratings yet
Table 1410you Worked As An Intern at We Always Win Car Insurance Company Last Summer You Notice That Individual Car Insurance Premiums Depen
8 pages
Examination Schedule 2022-23
No ratings yet
Examination Schedule 2022-23
19 pages
Dynamic Programming for Matrix Chain Multiplication
No ratings yet
Dynamic Programming for Matrix Chain Multiplication
6 pages
C Programming Lab Exercises at SNU
No ratings yet
C Programming Lab Exercises at SNU
2 pages
JPEG Compression Overview Diagram
No ratings yet
JPEG Compression Overview Diagram
21 pages
Advanced Regression Analysis Questions
No ratings yet
Advanced Regression Analysis Questions
42 pages
Machine-Learning Attacks on Optical Encryption
No ratings yet
Machine-Learning Attacks on Optical Encryption
12 pages
Statistical Techniques in CBIR for Leaves
No ratings yet
Statistical Techniques in CBIR for Leaves
4 pages
Noise-Induced Transitions in SIS Model
No ratings yet
Noise-Induced Transitions in SIS Model
19 pages
Relativistic Many-Electron Atom Theory
No ratings yet
Relativistic Many-Electron Atom Theory
15 pages
PageRank Algorithm Implementation in Python
No ratings yet
PageRank Algorithm Implementation in Python
3 pages
Z Cryptogrphic Algorithms
No ratings yet
Z Cryptogrphic Algorithms
71 pages
R's Impact on Insurance Data Analytics
No ratings yet
R's Impact on Insurance Data Analytics
8 pages
Memory Management Techniques Overview
No ratings yet
Memory Management Techniques Overview
2 pages