0% found this document useful (0 votes)
8 views181 pages

Intro To Computer Vision: By: Assistant Professor Dr. Ali R Hasoon CSD-CCS&IT-UOK 2025-2026

The document provides an overview of an introductory course on Computer Vision, highlighting its theoretical and practical aspects, including image formation, analysis, and recent technologies. It discusses the significance of computer vision in various applications such as OCR, medical imaging, and self-driving cars, while also addressing the challenges faced in the field. The course aims to enhance understanding of both computer and human vision, emphasizing the advancements made through deep learning.

Uploaded by

jowoti3401
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views181 pages

Intro To Computer Vision: By: Assistant Professor Dr. Ali R Hasoon CSD-CCS&IT-UOK 2025-2026

The document provides an overview of an introductory course on Computer Vision, highlighting its theoretical and practical aspects, including image formation, analysis, and recent technologies. It discusses the significance of computer vision in various applications such as OCR, medical imaging, and self-driving cars, while also addressing the challenges faced in the field. The course aims to enhance understanding of both computer and human vision, emphasizing the advancements made through deep learning.

Uploaded by

jowoti3401
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Intro to Computer Vision

By: Assistant Professor Dr. Ali R Hasoon


CSD-CCS&IT-UOK 2025-2026
Semester Description
• Recognize and describe both the theoretical and practical
aspects of computing with images. Connect issues from
Computer Vision to Human Vision.
• Describe the fundamentals of image formation and image
analysis. Understand the basics of 2D and 3D Computer
Vision.
• Become familiar with the recent and common technical
approaches involved in computer vision.
References:
Main Textbooks:
• Rick Szeliski, Computer Vision: Algorithms and Applications 2nd edition.

• Scott Krig, Computer Vision Metrices: Survey, Taxonomy, and Analysis.

• th edition.
Why study Computer Vision?
• One can the (and avoid bad things...)!
• Images and movies are everywhere; fast-growing collection of
useful applications building representations of the 3D world
from pictures automated surveillance doing what)
movie post-processing face finding
Greater understanding of human vision.
Various deep and attractive scientific mysteries how does
object recognition work.
Some of the latest CV technology
(a) optical character recognition (OCR),
(b) mechanical inspection,
(c) warehouse picking,
(d) medical imaging,
(e) self-driving cars,
(f) drone-based photogrammetry.
History of Computer Vision
Some early examples of computer vision algorithms
Examples of computer vision algorithms from 1980s
Examples of computer vision algorithms from 1990s
Examples of computer vision algorithms 2000s
Examples of computer vision algorithms 2010s
Every image tells a story
• Goal of computer vision:

the picture
• Compute properties of the
world
– 3D shape
– Names of people or objects
– What happened?
The goal of computer vision
Can computers match human perception?
• Yes and no (mainly no)
– computers can be better at


things

• But huge progress


– Accelerating in the last five
years due to deep learning

changing
Current models still make very silly mistakes

[Tomer Ullmann, The Illusion-Illusion: Vision Language Models See Illusions Where There are None, arXiv 2024]
Human perception has its shortcomings

[Link]
Humans can tell a lot about a scene from a little
The goal of computer vision
The goal of computer vision
• Compute the 3D shape of the world

ZED 2i Camera
The goal of computer vision
• Recognize objects and people

Terminator 2, 1991
slide credit: Fei-Fei, Fergus & Torralba
sky
building

flag

face
banner
wall
street lamp
bus bus

cars slide credit: Fei-Fei, Fergus & Torralba


The goal of computer vision

The goal of computer vision
• Forensics

Source: Nayar and Nishino, “Eyes for Relighting”


Source: Nayar and Nishino, “Eyes for Relighting”
Source: Nayar and Nishino, “Eyes for Relighting”
April 10, 2019
The goal of computer vision

Super-resolution (source: 2d3)


Low-light photography
(credit: Hasinoff et al., SIGGRAPH ASIA 2016)

Inpainting / image completion


Depth of field on cell phone camera (image credit: Hays and Efros)
(source: Google Research Blog)
Why study computer vision?
• Billions of images/videos captured per day

• Huge number of potential applications


• The next slides show the current state of the art
Optical character recognition (OCR)
If you have a scanner, it probably came with OCR software

License plate readers


[Link]
[Link]

Sudoku grabber
[Link]

Automatic check processing


Face detection

• Nearly all cameras detect faces in real time


– (Why?)
Face analysis and recognition
Vision-based biometrics

Source: S. Seitz
Who is she?
Vision-based biometrics

How the Afghan Girl was Identified by Her Iris Patterns story

Source: S. Seitz
Login without a password

Fingerprint scanners on
most of the new Face unlock
smartphones
New York Times, Jan. 18, 2020
by Kashmir Hill
Bird identification

Merlin Bird ID (based on Cornell Tech technology!)


Special effects: shape capture

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Source: S. Seitz
Special effects: motion capture

Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz


3D face tracking w/ consumer cameras

Snapchat Lenses

Face2Face system (Thies et al.)


Image synthesis

Karras, et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR 2018
Which face is real?

[Link]
Image synthesis

Zhu, et al., Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, ICCV 2017
Sports

Sportvision first down line


Explanation on [Link]
Smart cars

• Tesla Autopilot - How It Works?


• [Link]
Self-driving cars

Waymo
Robotics

Amazon Picking Challenge


[Link] [Link]

Amazon Prime Air Amazon Scout


Medical imaging

Skin cancer classification with deep learning


[Link]
3D imaging
(MRI, CT)
Virtual & Augmented Reality

6DoF head tracking Hand & body tracking

3D scene understanding 3D-360 video capture


Current state of the art
• You just saw many examples of current systems.
– Many of these are less than 5 years old

• Computer vision is an active research area, and rapidly changing


– Many new apps in the next 5 years
– Deep learning powering many modern applications

• Many startups across a dizzying array of areas


– Deep learning, robotics, autonomous vehicles, medical imaging,
Why is computer vision difficult?

Viewpoint variation

Credit: Flickr user michaelpaul

Scale
Illumination
Why is computer vision difficult?

Motion (Source: S. Lazebnik)


Intra-class variation

Background clutter Occlusion


Challenges: local ambiguity

slide credit: Fei-Fei, Fergus & Torralba


Source: S. Lazebnik
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a given 2D image

Artist Julian Beever with his anamorphic Coke bottle


Image source: F. Durand
1. Low-level vision
• Basic image processing and image formation

* =
Filtering, edge detection

Feature extraction Image formation


Project: Hybrid images from image pyramids

G 1/8

G 1/4

Gaussian 1/2
Project: Feature detection and matching
2. Geometry

Image credit: IDS Imaging

Projective geometry Stereo vision

Multi-view stereo Structure from motion


Project: Creating panoramas
Project: 3D reconstruction
3. Recognition

Image classification

Object detection

Convolutional Neural Networks


Convolutional Neural Networks
Geometric primitives form the basic building blocks
used to describe three-dimensional shapes
Image Coordinate Systems
• Spatial Coordinates: Cartesian x and y coordinates, In a spatial
coordinate system, locations in an image are positions on a
continuous plane.

• Polar coordinate is a two-dimensional coordinate system in


which each point on a plane is determined by a distance from a
reference point and an angle from a reference direction.
De-Aging Harrison Ford

• [Link]
Thank you

You might also like