DAV Centenary Public School
Vaishali Nagar, Jaipur-21
Artificial Intelligence (AI-843)
Class XII
Part-B
UNIT 3: Making Machines See
Making Machines See…
• Computer vision has become a cornerstone
technology in today's digital era, enabling
machines to "see" and interpret visual data
much like humans. This lesson delves into the
fascinating world of computer vision,
exploring its fundamental principles, key
processes, real-world applications, and future
potential.
Learning Objectives:
1. Understand the fundamentals of computer vision and
its role in processing and analysing digital images and
videos.
2. Explore the various stages involved in the computer
vision process.
3. Gain insight into the applications of computer vision
across different industries.
4. Identify the challenges and ethical considerations
associated with computer vision technology, including
privacy concerns, data security, and misinformation.
5. Recognize the future potential of computer vision
technology and its impact on society.
Key Concepts:
1. Introduction to Computer Vision
2. Working of Computer Vision
3. Applications of Computer Vision
4. Challenges of Computer Vision
5. The Future of Computer Vision
Learning Outcomes:
Students will be able to –
1. Explain the concept of computer vision and its
significance in analysing visual data.
2. Demonstrate an understanding of the key stages
involved in computer vision process and their
respective roles in interpreting images and videos.
3. Identify real-world applications of computer vision
technology in various industries and understand
how it enhances efficiency and productivity.
4. Evaluate the ethical implications and challenges
associated with computer vision, including privacy
concerns and the spread of misinformation.
5. Envision the future possibilities of computer
vision technology.
HOW MACHINES SEE?
• Computer Vision is a field of artificial
intelligence (AI) that uses Sensing devices and
deep learning models to help systems
understand and interpret the visual world.
WORKING OF COMPUTER VISION
• Basics of digital images
• Interpretation of Image in digital form
COMPUTER VISION – PROCESS:
• The Computer Vision process often involves
five stages.
– Image Acquisition:
– Preprocessing:
– Feature Extraction:
– Detection/Segmentation:
– High-Level Processing: I
APPLICATIONS OF COMPUTER VISION
Computer vision is one of the areas in Machine Learning whose
principle is already integrated into major products that we use every day.
Some of the applications are listed below which you might have already
learned in lower classes.
• Facial recognition: Popular social media platforms like Facebook uses
facial recognition to detect and tag users.
• Healthcare: Helps in evaluating cancerous tumours, identifying diseases or
abnormalities. Object detection & tracking in medical imaging.
• Self-driving vehicles: Makes sense of the surroundings by capturing video
from different angles around the car. Detect other cars and objects, read
traffic signals, pedestrian paths, etc.
APPLICATIONS OF COMPUTER VISION
• Optical character recognition (OCR): Extract printed or handwritten text
from visual data such as images or documents like invoices, bills, articles,
etc.
• Machine inspection: Detects a machine's defects, features, and functional
flaws, determines inspection goals, chooses lighting and material-handling
techniques, and other irregularities in manufactured products.
• 3D model building: Constructing 3D computer models from existing
objects which has a variety of applications in various places, such as
Robotics, Autonomous driving, 3D tracking, 3D scene reconstruction, and
AR/VR.
• Surveillance: Live footage from CCTV cameras in public places helps to
identify suspicious behaviour, identify dangerous objects, and prevent
crimes by maintaining law and order.
• Fingerprint recognition and biometrics: Detects fingerprints and
biometrics to validate a user's identity.
CHALLENGES OF COMPUTER VISION
• Computer vision, a vital part of artificial intelligence, faces several
hurdles as it strives to make sense of the visual world around us.
These challenges include:
1. Reasoning and Analytical Issues: Computer vision relies on more
than just image identification; it requires accurate interpretation.
Robust reasoning and analytical skills are essential for defining
attributes within visual content. Without such capabilities,
extracting meaningful insights from images becomes challenging,
limiting the effectiveness of computer vision systems.
2. Difficulty in Image Acquisition: Image acquisition in computer vision
is hindered by various factors like lighting variations, perspectives,
and scales. Understanding complex scenes with multiple objects
and handling occlusions adds to the complexity. Obtaining high-
quality image data amidst these challenges is crucial for accurate
analysis and interpretation.
CHALLENGES OF COMPUTER VISION
3. Privacy and Security Concerns: Vision-powered surveillance
systems raise serious privacy concerns, potentially infringing
upon individuals' privacy rights. Technologies like facial
recognition and detection prompt ethical dilemmas regarding
privacy and security. Regulatory scrutiny and public debate
surround the use of such technologies, necessitating careful
consideration of privacy implications.
4. Duplicate and False Content: Computer vision introduces
challenges related to the proliferation of duplicate and false
content. Malicious actors can exploit vulnerabilities in image
and video processing algorithms to create misleading or
fraudulent content. Data breaches pose a significant threat,
leading to the dissemination of duplicate images and videos,
fostering misinformation and reputational damage.
THE FUTURE OF COMPUTER VISION
• Over the years, computer vision has evolved from basic image
processing tasks to complex systems capable of understanding and
interpreting visual data with human-like precision. Breakthroughs in
deep learning algorithms, coupled with the availability of vast
amounts of labelled training data, have propelled the field forward,
enabling machines to perceive and analyse images and videos in
ways previously thought impossible.
• As we look to the future, the possibilities by computer vision are
awe-inspiring. From personalized healthcare diagnostics to
immersive AR experiences, the impact of computer vision on
society is set to be profound and far-reaching. By embracing
innovation, fostering collaboration, and prioritizing ethical
considerations, we can unlock the full potential of computer vision
and harness its transformative power for the benefit of humanity.
Introduction to OpenCV
• OpenCV or Open-Source Computer Vision Library
is a cross-platform library using which we can
develop real-time computer vision applications.
• It mainly focuses on image processing, video
capture, and analysis including features like face
detection and object detection.
• It is also capable of identifying objects, faces, and
even handwriting. To use OpenCV in Python, you
need to install the library.
• Use the following command in your terminal or
command prompt: pip install opencv-python
Loading and Displaying an Image:
• Let us understand the loading and displaying
using a scenario followed by a question.
• Scenario- You are working on a computer
vision project where you need to load and
display an image. You decide to use OpenCV
for this purpose.
Question: What are the necessary steps to load and display an image using
OpenCV? Write a Python code snippet to demonstrate this.
• sol - Here's a simple Python script to load and
display an image using OpenCV:
import cv2
image = [Link]('[Link]')
# Replace '[Link]' with the path to your image
[Link]('original image', image)
[Link](0)
[Link]()
● [Link]('[Link]') loads the image into a variable.
Replace [Link] with your image's file name or path.
● [Link]() opens a new window to display the image.
● [Link](0) waits indefinitely for a key press to proceed
● [Link]() closes any OpenCV windows.
DAV Centenary Public School
Vaishali Nagar, Jaipur-21
Artificial Intelligence (AI-843)
Class XII
Part-B
UNIT 3: Making Machines See