0% found this document useful (0 votes)

8 views181 pages

Intro To Computer Vision: By: Assistant Professor Dr. Ali R Hasoon CSD-CCS&IT-UOK 2025-2026

The document provides an overview of an introductory course on Computer Vision, highlighting its theoretical and practical aspects, including image formation, analysis, and recent technologies. It discusses the significance of computer vision in various applications such as OCR, medical imaging, and self-driving cars, while also addressing the challenges faced in the field. The course aims to enhance understanding of both computer and human vision, emphasizing the advancements made through deep learning.

Uploaded by

jowoti3401

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views181 pages

Intro To Computer Vision: By: Assistant Professor Dr. Ali R Hasoon CSD-CCS&IT-UOK 2025-2026

Uploaded by

jowoti3401

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Intro to Computer Vision

By: Assistant Professor Dr. Ali R Hasoon

CSD-CCS&IT-UOK 2025-2026
Semester Description
• Recognize and describe both the theoretical and practical
aspects of computing with images. Connect issues from
Computer Vision to Human Vision.
• Describe the fundamentals of image formation and image
analysis. Understand the basics of 2D and 3D Computer
Vision.
• Become familiar with the recent and common technical
approaches involved in computer vision.
References:
Main Textbooks:
• Rick Szeliski, Computer Vision: Algorithms and Applications 2nd edition.

• Scott Krig, Computer Vision Metrices: Survey, Taxonomy, and Analysis.

• th edition.
Why study Computer Vision?
• One can the (and avoid bad things...)!
• Images and movies are everywhere; fast-growing collection of
useful applications building representations of the 3D world
from pictures automated surveillance doing what)
movie post-processing face finding
Greater understanding of human vision.
Various deep and attractive scientific mysteries how does
object recognition work.
Some of the latest CV technology
(a) optical character recognition (OCR),
(b) mechanical inspection,
(c) warehouse picking,
(d) medical imaging,
(e) self-driving cars,
(f) drone-based photogrammetry.
History of Computer Vision
Some early examples of computer vision algorithms
Examples of computer vision algorithms from 1980s
Examples of computer vision algorithms from 1990s
Examples of computer vision algorithms 2000s
Examples of computer vision algorithms 2010s
Every image tells a story
• Goal of computer vision:

the picture
• Compute properties of the
world
– 3D shape
– Names of people or objects
– What happened?
The goal of computer vision
Can computers match human perception?
• Yes and no (mainly no)
– computers can be better at

–
things

• But huge progress

– Accelerating in the last five
years due to deep learning
–
changing
Current models still make very silly mistakes

[Tomer Ullmann, The Illusion-Illusion: Vision Language Models See Illusions Where There are None, arXiv 2024]
Human perception has its shortcomings

[Link]
Humans can tell a lot about a scene from a little
The goal of computer vision
The goal of computer vision
• Compute the 3D shape of the world

ZED 2i Camera
The goal of computer vision
• Recognize objects and people

Terminator 2, 1991
slide credit: Fei-Fei, Fergus & Torralba
sky
building

flag

face
banner
wall
street lamp
bus bus

cars slide credit: Fei-Fei, Fergus & Torralba

The goal of computer vision
•
The goal of computer vision
• Forensics

Source: Nayar and Nishino, “Eyes for Relighting”

Source: Nayar and Nishino, “Eyes for Relighting”
Source: Nayar and Nishino, “Eyes for Relighting”
April 10, 2019
The goal of computer vision
•

Super-resolution (source: 2d3)

Low-light photography
(credit: Hasinoff et al., SIGGRAPH ASIA 2016)

Inpainting / image completion

Depth of field on cell phone camera (image credit: Hays and Efros)
(source: Google Research Blog)
Why study computer vision?
• Billions of images/videos captured per day

• Huge number of potential applications

• The next slides show the current state of the art
Optical character recognition (OCR)
If you have a scanner, it probably came with OCR software

License plate readers

[Link]
[Link]

Sudoku grabber
[Link]

Automatic check processing

Face detection

• Nearly all cameras detect faces in real time

– (Why?)
Face analysis and recognition
Vision-based biometrics

Source: S. Seitz
Who is she?
Vision-based biometrics

How the Afghan Girl was Identified by Her Iris Patterns story

Source: S. Seitz
Login without a password

Fingerprint scanners on
most of the new Face unlock
smartphones
New York Times, Jan. 18, 2020
by Kashmir Hill
Bird identification

Merlin Bird ID (based on Cornell Tech technology!)

Special effects: shape capture

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Source: S. Seitz
Special effects: motion capture

Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz

3D face tracking w/ consumer cameras

Snapchat Lenses

Face2Face system (Thies et al.)

Image synthesis

Karras, et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR 2018
Which face is real?

[Link]
Image synthesis

Zhu, et al., Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, ICCV 2017
Sports

Sportvision first down line

Explanation on [Link]
Smart cars

• Tesla Autopilot - How It Works?

• [Link]
Self-driving cars

Waymo
Robotics

Amazon Picking Challenge

[Link] [Link]

Amazon Prime Air Amazon Scout

Medical imaging

Skin cancer classification with deep learning

[Link]
3D imaging
(MRI, CT)
Virtual & Augmented Reality

6DoF head tracking Hand & body tracking

3D scene understanding 3D-360 video capture

Current state of the art
• You just saw many examples of current systems.
– Many of these are less than 5 years old

• Computer vision is an active research area, and rapidly changing

– Many new apps in the next 5 years
– Deep learning powering many modern applications

• Many startups across a dizzying array of areas

– Deep learning, robotics, autonomous vehicles, medical imaging,
Why is computer vision difficult?

Viewpoint variation

Credit: Flickr user michaelpaul

Scale
Illumination
Why is computer vision difficult?

Motion (Source: S. Lazebnik)

Intra-class variation

Background clutter Occlusion

Challenges: local ambiguity

slide credit: Fei-Fei, Fergus & Torralba

Source: S. Lazebnik
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a given 2D image

Artist Julian Beever with his anamorphic Coke bottle

–
Image source: F. Durand
1. Low-level vision
• Basic image processing and image formation

* =
Filtering, edge detection

Feature extraction Image formation

Project: Hybrid images from image pyramids

G 1/8

G 1/4

Gaussian 1/2
Project: Feature detection and matching
2. Geometry

Image credit: IDS Imaging

Projective geometry Stereo vision

Multi-view stereo Structure from motion

Project: Creating panoramas
Project: 3D reconstruction
3. Recognition

Image classification

Object detection

Convolutional Neural Networks

Convolutional Neural Networks
Geometric primitives form the basic building blocks
used to describe three-dimensional shapes
Image Coordinate Systems
• Spatial Coordinates: Cartesian x and y coordinates, In a spatial
coordinate system, locations in an image are positions on a
continuous plane.

• Polar coordinate is a two-dimensional coordinate system in

which each point on a plane is determined by a distance from a
reference point and an angle from a reference direction.
De-Aging Harrison Ford

• [Link]
Thank you

Computer Vision Course Overview at Cornell
No ratings yet
Computer Vision Course Overview at Cornell
72 pages
Intro to Computer Vision Course Overview
No ratings yet
Intro to Computer Vision Course Overview
76 pages
Lec00 Intro For Web
No ratings yet
Lec00 Intro For Web
86 pages
Cornell CS5670: Intro to Computer Vision
No ratings yet
Cornell CS5670: Intro to Computer Vision
81 pages
EE5811: Computer Vision Overview
No ratings yet
EE5811: Computer Vision Overview
80 pages
Computer Vision Course Overview
No ratings yet
Computer Vision Course Overview
61 pages
Computer Vision and Image Analysis Guide
No ratings yet
Computer Vision and Image Analysis Guide
88 pages
CV Unit-1
No ratings yet
CV Unit-1
284 pages
CV UNIT-1 Part-1
No ratings yet
CV UNIT-1 Part-1
123 pages
Image Processing and Vision Fundamentals
No ratings yet
Image Processing and Vision Fundamentals
55 pages
Understanding Computer Vision Goals
No ratings yet
Understanding Computer Vision Goals
58 pages
Lec01 Intro
No ratings yet
Lec01 Intro
61 pages
Lec01 CT Intro
No ratings yet
Lec01 CT Intro
61 pages
CV Chapter1 ExamGuide
No ratings yet
CV Chapter1 ExamGuide
14 pages
Introduction to Computer Vision Concepts
No ratings yet
Introduction to Computer Vision Concepts
83 pages
Computer Vision
No ratings yet
Computer Vision
52 pages
CV 3
No ratings yet
CV 3
48 pages
CS2802: Computer Vision: Instructor: Dr. Nitin Kumar State-Of-The-Art in Computer Vision
No ratings yet
CS2802: Computer Vision: Instructor: Dr. Nitin Kumar State-Of-The-Art in Computer Vision
53 pages
CS2802: Computer Vision: Instructor: Dr. Nitin Kumar State-Of-The-Art in Computer Vision
No ratings yet
CS2802: Computer Vision: Instructor: Dr. Nitin Kumar State-Of-The-Art in Computer Vision
53 pages
Introduction to Computer Vision
No ratings yet
Introduction to Computer Vision
103 pages
Introduction to Computer Vision Course
No ratings yet
Introduction to Computer Vision Course
72 pages
Computer Vision Fundamentals and Applications
No ratings yet
Computer Vision Fundamentals and Applications
61 pages
Understanding Computer Vision Basics
No ratings yet
Understanding Computer Vision Basics
18 pages
CV-1 1
No ratings yet
CV-1 1
18 pages
Lect1 PDF
100% (1)
Lect1 PDF
45 pages
Lec 1
No ratings yet
Lec 1
65 pages
CV Unit1
No ratings yet
CV Unit1
88 pages
UCL Machine Vision Course Overview
No ratings yet
UCL Machine Vision Course Overview
53 pages
CS7.505: Computer Vision: Spring 2022
No ratings yet
CS7.505: Computer Vision: Spring 2022
46 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
CV Lect1
No ratings yet
CV Lect1
39 pages
Image Processing and Computer Vision Overview
No ratings yet
Image Processing and Computer Vision Overview
42 pages
Understanding Computer Vision Techniques
No ratings yet
Understanding Computer Vision Techniques
29 pages
Understanding Computer Vision Basics
100% (1)
Understanding Computer Vision Basics
16 pages
Understanding Computer Vision Basics
No ratings yet
Understanding Computer Vision Basics
1,100 pages
Lec 2
No ratings yet
Lec 2
67 pages
Introduction to Computer Vision Basics
No ratings yet
Introduction to Computer Vision Basics
37 pages
Module 1
No ratings yet
Module 1
68 pages
Lecture 1 - Introduction, and Image Formation and Acquisition
No ratings yet
Lecture 1 - Introduction, and Image Formation and Acquisition
59 pages
Introduction to Computer Vision
89% (9)
Introduction to Computer Vision
16 pages
Introduction to Computer Vision Concepts
No ratings yet
Introduction to Computer Vision Concepts
18 pages
Computer Vision Course Overview and Goals
No ratings yet
Computer Vision Course Overview and Goals
186 pages
Chapter - 1 Introduction To Computer Vision
No ratings yet
Chapter - 1 Introduction To Computer Vision
85 pages
Introduction to Computer Vision Basics
No ratings yet
Introduction to Computer Vision Basics
21 pages
Disadvantages of Computer Vision AI
No ratings yet
Disadvantages of Computer Vision AI
15 pages
CV Module 1
100% (1)
CV Module 1
166 pages
Computer Vision Course Prerequisites
No ratings yet
Computer Vision Course Prerequisites
8 pages
Understanding Computer Vision Basics
No ratings yet
Understanding Computer Vision Basics
5 pages
Computer Vision Fundamentals and Techniques
No ratings yet
Computer Vision Fundamentals and Techniques
200 pages
1 Vision Lec 1
No ratings yet
1 Vision Lec 1
49 pages
Understanding Computer Vision Basics
No ratings yet
Understanding Computer Vision Basics
24 pages
Unit - 1 CVIP TE AIML Updated 22 March 2024
No ratings yet
Unit - 1 CVIP TE AIML Updated 22 March 2024
78 pages
Computer Vision
100% (1)
Computer Vision
48 pages
Lecture 01 Introduction
No ratings yet
Lecture 01 Introduction
62 pages
Overview of Computer Vision Systems
No ratings yet
Overview of Computer Vision Systems
38 pages
Foundations of Computer Vision BCS613B
No ratings yet
Foundations of Computer Vision BCS613B
26 pages
Computer Vision Course Syllabus
No ratings yet
Computer Vision Course Syllabus
46 pages
Computer Vision Overview and Applications
No ratings yet
Computer Vision Overview and Applications
51 pages
Visual Computing: Vision & Graphics Course
No ratings yet
Visual Computing: Vision & Graphics Course
78 pages
Data Literacy Essentials for AI
No ratings yet
Data Literacy Essentials for AI
39 pages
808-288 (CCNWeb)
No ratings yet
808-288 (CCNWeb)
134 pages
C8051F340 Block Diagram Overview
No ratings yet
C8051F340 Block Diagram Overview
3 pages
Introduction to ASP.NET Framework
No ratings yet
Introduction to ASP.NET Framework
55 pages
Financial Data Analyst Resume Summary
No ratings yet
Financial Data Analyst Resume Summary
1 page
SQL Bootcamp 2020: From Zero to Hero
100% (1)
SQL Bootcamp 2020: From Zero to Hero
101 pages
Python NumPy Arrays and Operations
No ratings yet
Python NumPy Arrays and Operations
21 pages
SQL Database Schema and Triggers
No ratings yet
SQL Database Schema and Triggers
25 pages
Penetration Testing Activity Guide
No ratings yet
Penetration Testing Activity Guide
4 pages
Flyer Opc Ua and Profinet en
No ratings yet
Flyer Opc Ua and Profinet en
2 pages
Key Features of CSS Explained
No ratings yet
Key Features of CSS Explained
5 pages
MySQL Subquery Overview and Examples
No ratings yet
MySQL Subquery Overview and Examples
19 pages
ECC-NDCS Project Workflow Overview
No ratings yet
ECC-NDCS Project Workflow Overview
11 pages
SAP BW Hierarchy Tables Overview
No ratings yet
SAP BW Hierarchy Tables Overview
16 pages
Rtu Front Page
No ratings yet
Rtu Front Page
3 pages
Feasibility of LoRaWAN for Underground Monitoring
No ratings yet
Feasibility of LoRaWAN for Underground Monitoring
13 pages
TDS Survey Pro With TSX v4.6.0 Reference Manual - Recon PDF
No ratings yet
TDS Survey Pro With TSX v4.6.0 Reference Manual - Recon PDF
481 pages
CRM and ERP Systems Overview
No ratings yet
CRM and ERP Systems Overview
30 pages
HR Trainer Resume of N. Karthi Keyan
No ratings yet
HR Trainer Resume of N. Karthi Keyan
3 pages
Ezyops Development and Data Migration
No ratings yet
Ezyops Development and Data Migration
1 page
PCDC1
No ratings yet
PCDC1
26 pages
IoT Milk ATM for Rural Areas
No ratings yet
IoT Milk ATM for Rural Areas
3 pages
Window Manager ANR Report 2023-03-06
No ratings yet
Window Manager ANR Report 2023-03-06
1,473 pages
Data Structures Course Overview
No ratings yet
Data Structures Course Overview
223 pages
PenMount 6000 Controller Installation Guide V1.6
No ratings yet
PenMount 6000 Controller Installation Guide V1.6
61 pages
I/O Polling and Pipeline Analysis
No ratings yet
I/O Polling and Pipeline Analysis
4 pages
Ieee Vlsi Sata 2026
No ratings yet
Ieee Vlsi Sata 2026
1 page
Android Agent Setup for OCS Inventory
No ratings yet
Android Agent Setup for OCS Inventory
8 pages
Enum in Dart Dart Tutorial Learn Dart Programming Dart
No ratings yet
Enum in Dart Dart Tutorial Learn Dart Programming Dart
9 pages
iSeeU Eyecare Compliance Case Study
No ratings yet
iSeeU Eyecare Compliance Case Study
4 pages

Intro To Computer Vision: By: Assistant Professor Dr. Ali R Hasoon CSD-CCS&IT-UOK 2025-2026

Uploaded by

Intro To Computer Vision: By: Assistant Professor Dr. Ali R Hasoon CSD-CCS&IT-UOK 2025-2026

Uploaded by

Intro to Computer Vision

By: Assistant Professor Dr. Ali R Hasoon

• Scott Krig, Computer Vision Metrices: Survey, Taxonomy, and Analysis.

• But huge progress

cars slide credit: Fei-Fei, Fergus & Torralba

Source: Nayar and Nishino, “Eyes for Relighting”

Super-resolution (source: 2d3)

Inpainting / image completion

• Huge number of potential applications

License plate readers

Automatic check processing

• Nearly all cameras detect faces in real time

Merlin Bird ID (based on Cornell Tech technology!)

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz

Face2Face system (Thies et al.)

Sportvision first down line

• Tesla Autopilot - How It Works?

Amazon Picking Challenge

Amazon Prime Air Amazon Scout

Skin cancer classification with deep learning

6DoF head tracking Hand & body tracking

3D scene understanding 3D-360 video capture

• Computer vision is an active research area, and rapidly changing

• Many startups across a dizzying array of areas

Credit: Flickr user michaelpaul

Motion (Source: S. Lazebnik)

Background clutter Occlusion

slide credit: Fei-Fei, Fergus & Torralba

Artist Julian Beever with his anamorphic Coke bottle

Feature extraction Image formation

Image credit: IDS Imaging

Projective geometry Stereo vision

Multi-view stereo Structure from motion

Convolutional Neural Networks

• Polar coordinate is a two-dimensional coordinate system in

You might also like