0% found this document useful (0 votes)
344 views10 pages

Class 10 Computer Vision Question Bank

The document is a question bank for Class 10 on the topic of Computer Vision in Artificial Intelligence, covering fill-in-the-blanks, true/false questions, multiple-choice questions, and descriptive questions. It discusses key concepts such as image processing, feature extraction, and various applications of computer vision, including facial recognition and autonomous vehicles. Additionally, it differentiates between grayscale and RGB images, explains pixel values, and outlines the role of computer vision in sports.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
344 views10 pages

Class 10 Computer Vision Question Bank

The document is a question bank for Class 10 on the topic of Computer Vision in Artificial Intelligence, covering fill-in-the-blanks, true/false questions, multiple-choice questions, and descriptive questions. It discusses key concepts such as image processing, feature extraction, and various applications of computer vision, including facial recognition and autonomous vehicles. Additionally, it differentiates between grayscale and RGB images, explains pixel values, and outlines the role of computer vision in sports.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

CBSE | DEPARTMENT OF SKILL EDUCATION

ARTIFICIAL INTELLIGENCE
QUESTION BANK – CLASS 10
CHAPTER – 5 CMPUTER VISION

Fill in the Blanks.

1. Visuals search uses CV to help users search for different objects using real-world
images.
2. Facial recognition powered by CV is used to verify the identity of visitors and guests
in smart homes.
3. The Google Translate app is a free tool that can do real-time visual translations of up
to 88 languages with the help of CV.
4. Computers see images as a collection of Pixels.
5. The number of pixels in an image is called resolution.
6. In a grayscale image, each pixel is stored as one byte, forming a simple plane of a 2D
array of pixels.
7. When the value of all three primary colours is the same, the image form is a
grayscale image.
8. Features of an image are details that tell us what is in the image.
9. In image processing, convolution is performed by sliding a kernel over different
portions of the picture
10. The ReLU layer in the CNN gets rid of all the negative numbers in the feature map
11. The pooling layer in a CNN is responsible for reducing the spatial size of the
convolved feature
12. Open CV is an open source computer vision and image processing library
13. Each channel consist a number of pixels with the value of each pixel, ranging from 0
to 255.
14. A bounding box is a rectangular frame drawn by the algorithm around the object,
indicating its spatial location within the image.
15. A coloured image is made up of pixels that have three different values, i.e., red, blue
and green.

Write T for True and F for False.

1. OpenCV can be used To process images and videos to identify objects, faces or even
handwriting.T
2. In image processing features like corners are difficult to locate F
3. The first correlation layer in a CNN is responsible for extracting high level features.F
4. When we change the pixel values of an image, the image changes T
5. In a byte image, the pixel value is stored as an 8 bit integer with a range of values
from 0 to 200F
6. In a field of Agriculture, CV can process image data captured by drone cameras to
help in remotely monitoring crops.T
7. Google translate is capable of identifying and translating text that is part of an
image.T
8. In an angle, the smaller pixel values closer to zero represent lighter shades, while the
larger number closer to 255 represent darker shades. T
9. In an image processing for each position of the kernel, the corresponding pixel values
are multiplied and added together to replace the original centre pixel.T
10. In a grayscale image, the darkest shade is represented by a pixel value of 255 F

Select the Correct Option.

1. The objective of computer vision is _______________


a) To replace human vision
b) To understand human language
c) To plot graphs
d) To solve mathematical equations
2. Identify the use of computer vision in the retail business
a) Count footfalls
b) Autonomous checkouts
c) Detect suspicious activity
d) All of these
3. What is the smallest unit of information that makes up a text image or video on a
computer?
a) Pixel
b) Element
c) Dot
d) Unit
4. What is the term used to describe the number of pixels in an image?
a) Pixel size
b) Image quality
c) Resolution
d) Dimension
5. In computer vision what task are associated with single object in an image?
a) Image classification
b) Image classification plus localization
c) Object detection
d) Instance segmentation
6. What is the purpose of convolutional image processing?
a) To extract features from images
b) To resize the images
c) To apply filters directly on the image
d) To rotate the images
7. Which of the following CNN layers is responsible for classifying the image into a
label?
a) Convolution
b) Fully connected layer
c) Pooling
d) ReLU
8. What does the imshow() function do in open CV?
a) It displays images on the screen
b) It saves images as files
c) It applies filters to images
d) It converts images to grayscale
9. What is the purpose of the imread() function in openCV?
a) To read and load images from files
b) To display images on the screen
c) To apply filters to images
d) To resize images
10. Which of the following is true for feature maps in the context of CNNs?
a) They help extract and highlight important features
b) They reduce the size of the image
c) They allow us to focus on important features
d) All of these
11. Image convolution is a process that is used _______________.
a) Apply different effects to an image
b) Increase the size of an image
c) Crop an image
d) Copy an image
12. Where is the result of convolutions stored?
a) Output mask
b) Feature detector
c) Output matrix
d) Pixel array
13. Convolution performs the _________Operation on to arrays
a) Division
b) Multiplication
c) Subtraction
d) Exponential
14. When a coloured image is split, we get_________ of pixels
a) 0 channel
b) One channel
c) Three channels
d) 4 channels
15. Which of the following is the task of instance segmentation?
a) Identify an object within an image
b) Finding out the location of the object
c) Labelling individual instances of objects
d) Identify the object category

1. What is Computer Vision? Explain with example.

Computer vision is a field of computer science and artificial intelligence that focuses on
enabling computers and systems to interpret and understand visual information from the
world, such as images and videos. The goal of computer vision is to develop algorithms and
models that allow machines to process visual data in a way that mimics human vision,
enabling them to recognize, analyze, and make decisions based on visual input.

Key components of computer vision include:

1. Image Processing: Techniques used to enhance or manipulate images to make them


suitable for analysis.
2. Feature Extraction: Identifying and extracting important attributes or characteristics
from images.
3. Object Detection: Identifying and locating objects within images or videos.
4. Image Classification: Categorizing images into predefined classes or labels.
5. Image Segmentation: Dividing an image into meaningful regions or segments for
more detailed analysis.
6. Pattern Recognition: Identifying patterns and regularities in visual data.

Computer vision combines methods from various fields, including machine learning,
statistics, and computer science, to enable machines to understand and interact with the visual
world. Applications of computer vision are vast and include areas such as autonomous
vehicles, facial recognition, medical imaging, and industrial inspection.

2. Write any five applications of Computer Vision.


Facial Recognition:
 Application: Used for security and identification purposes, such as unlocking
smartphones, verifying identities in security systems, or tagging people in social
media photos.
 Example: Facial recognition systems in airports for passenger identification and
border control.
Autonomous Vehicles:
 Application: Enables self-driving cars to interpret and navigate their surroundings by
recognizing objects, road signs, lane markings, pedestrians, and other vehicles.
 Example: Tesla's Autopilot system uses computer vision to assist with lane-keeping,
adaptive cruise control, and automatic emergency braking.
Medical Imaging:
 Application: Assists in analyzing medical images (such as X-rays, MRIs, and CT
scans) to diagnose diseases, detect abnormalities, and plan treatments.
 Example: AI algorithms for detecting tumors or fractures in medical images,
improving diagnostic accuracy and efficiency.
Retail and Inventory Management:
 Application: Automates tasks such as checking stock levels, managing inventory, and
enhancing the shopping experience through features like checkout-free stores.
 Example: Amazon Go stores use computer vision to track items that customers pick
up and automatically charge them upon exit.
Agriculture and Farming:
 Application: Used for monitoring crops, detecting diseases or pests, and optimizing
harvests through image analysis of fields and plants.
 Example: Drones equipped with computer vision systems to assess crop health and
guide precision farming techniques.

3. Explain following tasks used in computer vision with the help of


example.
Classification
Classification + Localisation
Object Detection
Instance Segmentation

The various applications of Computer Vision are based on a certain number of tasks
which are performed to get certain information from the input image which can be
directly used for prediction or forms the base for further analysis. The tasks used in a
computer vision application are:

Classification Image Classification problem is the task of assigning an input image


one label from a fixed set of categories. This is one of the core problems in CV that,
despite its simplicity, has a large variety of practical applications.
Classification + Localisation This is the task which involves both processes of
identifying what object is present in the image and at the same time identifying at
what location that object is present in that image. It is used only for single objects.

Object Detection Object detection is the process of finding instances of real-world


objects such as faces, bicycles, and buildings in images or videos. Object detection
algorithms typically use extracted features and learning algorithms to recognize
instances of an object category. It is commonly used in applications such as image
retrieval and automated vehicle parking systems.

Instance Segmentation Instance Segmentation is the process of detecting instances


of the objects, giving them a category and then giving each pixel a label on the basis
of that. A segmentation algorithm takes an image as input and outputs a collection of
regions (or segments).

4. Explain grayscale image and list its applications.

Grayscale and RGB images are two fundamental types of image representations used
in computer vision and image processing, each with distinct characteristics and
applications.

Grayscale Images

Definition:
 Grayscale images contain shades of gray, ranging from black to white, without any
color information.
 Each pixel in a grayscale image represents intensity, with values typically ranging
from 0 (black) to 255 (white) in an 8-bit image.

Characteristics:
 Single Channel: Grayscale images have only one channel representing intensity.
 Reduced Complexity: They contain less data compared to RGB images because they
only represent intensity, not color.
Applications:

1. Medical Imaging: Often used in X-rays, MRI, and CT scans where color information
is less important than detail and contrast.
2. Document Scanning: Useful for scanning and processing text documents, where
color is not necessary.
3. Image Compression: Often used in scenarios where storage space is limited and
color information is not crucial.
4. Edge Detection: Techniques like Sobel or Canny edge detection are commonly
applied to grayscale images to identify features and boundaries.
5. Computer Vision Algorithms: Many vision algorithms, such as those for pattern
recognition and object detection, use grayscale images for simplicity and speed.

5. Explain RGB image and its applications.

RGB Images

Definition:
 RGB (Red, Green, Blue) images contain color information by combining three color
channels: red, green, and blue.
 Each pixel in an RGB image has three values, corresponding to the intensity of red,
green, and blue, which combine to produce a wide spectrum of colors.

Characteristics:
 Three Channels: RGB images have three channels, one for each color component.
 Color Information: They represent the full color spectrum, allowing for detailed and
accurate color reproduction.

Applications:
1. Photography: Standard for digital cameras and smartphones to capture and display
vibrant, full-color images.
2. Video: Used in video recording and streaming for rich visual content.
3. Computer Graphics: Essential for rendering graphics in gaming, virtual reality, and
graphical user interfaces (GUIs).
4. Image Editing: Tools like Adobe Photoshop use RGB images for detailed color
manipulation and enhancement.
5. Color-based Object Detection: Applications that rely on distinguishing objects
based on color, such as in autonomous vehicles or augmented reality.

6. Differentiate between Human Vision and Computer Vision.


As humans, we visually perceive the world around us not just in static images but
with a combination of colors, textures, motion, depth, and context. On the other
hand, computer vision typically focuses on specific visual features without
considering anything else.
7. What is meant by image features and what is their use in CV?
In computer vision and image processing, a feature is a piece of information which is
relevant for solving the computational task related to a certain application. Features
may be specific structures in the image such as points, edges or objects.
For example: Imagine that your security camera is capturing an image. At the top of
the image we are given six small patches of images. Our task is to find the exact
location of those image patches in the image. Take a pencil and mark the exact
location of those patches in the image.
8. What is the difference between Pixel and Resolution? Which of these
defines the quality of the image?

Pixel

 Definition: A pixel, short for "picture element," is the smallest unit of a digital image
or display. It represents a single point in a raster image or display screen.
 Characteristics: Each pixel has its own color and brightness value. In a color display,
a pixel is often composed of sub-pixels with different colors (usually red, green, and
blue) that combine to create a full range of colors.
 Role: Pixels are the building blocks of digital images. The more pixels an image or
screen has, the finer the detail it can display.

Resolution

 Definition: Resolution refers to the total number of pixels in an image or display,


typically expressed as a width by height measurement. It indicates the amount of
detail an image or display can show.

Characteristics: Higher resolution means more pixels, which generally leads to a


clearer and sharper image. Common resolutions include 1920x1080 (Full HD),
2560x1440 (Quad HD), and 3840x2160 (4K Ultra HD).

 Role: Resolution impacts the quality and clarity of the visual content. For displays, it
determines how sharp and detailed the images and texts appear. For images, it affects
how large you can print or display the image before it starts to look blurry.

Pixel Resolution
The basic unit of digital images and displays The measure of how many pixels make up
the image or display, typically given as width
x height (e.g., 1920x1080)
Resolution typically defines the quality of an image. Higher resolution means more pixels, which
generally leads to greater detail and clarity. However, other factors like pixel density (for displays)
and image processing also influence overall image quality

9. What do you understand by pixel value?


Each of the pixels that represent an image stored inside a computer has a pixel value which
describes how bright that pixel is, and/or what colour it should be. The most common pixel
format is the byte image, where this number is stored as an 8-bit integer giving a range of
possible values from 0 to 255.

Typically, zero is to be taken as no colour or black and 255 is taken to be full colour or white.
Why do we have a value of 255? In the computer systems, computer data is in the form of
ones and zeros, which we call the binary system. Each bit in a computer system can have
either a zero or a one. Since each pixel uses 1 byte of an image, which is equivalent to 8 bits
of data? Since each bit can have two possible values which tells us that the 8 bit can have 255
possibilities of values which starts from 0 and ends at 255.

[Link] a short note on the applications of computer vision in sports.

In sports, computer vision is used for:

 Player Performance Analysis: Tracking and analyzing player movements and stats.
 Tactical Analysis: Evaluating game strategies and opponents.
 Injury Prevention: Monitoring movements to prevent injuries.
 Refereeing: Assisting with decisions and automated calls.
 Fan Engagement: Enhancing broadcasts and AR experiences.
 Training: Providing feedback and improving skills.
 Scouting: Identifying and evaluating talent.
 Broadcast Automation: Automatically tracking action for broadcasts.

Common questions

Powered by AI

CNN architecture contributes to image processing by extracting hierarchical features through multiple layers. The convolution layer uses kernels to compute feature maps by sliding over the image, highlighting important attributes . The ReLU layer applies an activation function to introduce non-linearity by zeroing negative values . Pooling layers reduce spatial dimensions, thereby decreasing computation and enhancing feature invariance to position . The fully connected layer at the end of the network combines extracted features to classify images into specific labels . These layers work together to improve the machine's ability to recognize and process images similarly to human vision .

Grayscale images consist of shades of gray without any color, with each pixel value indicating intensity between 0 and 255. They are single-channel images with reduced data complexity, making them suitable for applications like medical imaging, document scanning, and edge detection . In contrast, RGB images use three color channels—red, green, and blue—to represent a wide spectrum of colors. They are employed in applications where color differentiation is critical, such as photography, video content, and computer graphics .

Computer vision in retail and inventory management provides significant advantages by automating and optimizing various business functions. It allows real-time inventory tracking, which reduces human error and enhances stock management efficiency . Automated checkout processes, as seen in Amazon Go stores, streamline customer experiences, reduce queue times, and improve customer satisfaction . Additionally, computer vision detects human patterns for safety and marketing insights, enabling businesses to understand consumer behavior accurately and adjust strategies accordingly . These benefits lead to cost savings, increased operational efficiency, and improved customer engagement .

Image features are pivotal elements or characteristics within an image, such as points, edges, textures, or specific patterns, that are significant for various tasks in computer vision. They enable algorithms to identify and differentiate between objects or scenes. For instance, in object detection, features help in recognizing and locating objects within an image . For classification, features allow categorizing an image into predefined labels based on shared attributes. Practical applications of feature analysis include facial recognition systems, where identifying unique facial features supports accurate identification, and in autonomous vehicles, where features from road signs and objects aid in navigation .

In medical imaging, computer vision tasks such as image classification, segmentation, and pattern recognition play critical roles. Algorithms classify medical images into categories for disease diagnosis, such as identifying tumors or fractures in X-rays . Segmentation techniques divide images into meaningful sections, like isolating an organ or highlighting abnormal tissues, facilitating targeted analyses. Pattern recognition aids in detecting irregularities that may indicate health issues. These tasks contribute to healthcare by improving diagnostic accuracy, speeding up treatment planning, and enabling early disease detection, ultimately enhancing patient outcomes and efficiency in healthcare services .

The core objectives of computer vision include the interpretation and understanding of visual data to enable machines to mimic human perception. Applications like facial recognition systems in security exploit these capabilities to identify individuals . In medicine, computer vision aids in diagnosing diseases through analysis of medical images . Autonomous vehicles use computer vision to navigate safely by recognizing and reacting to their environment . Furthermore, in retail, computer vision streamlines processes like inventory management and enhances customer experiences through features like checkout-free shopping . These applications demonstrate the technology's goal of enhancing machine interaction with the visual world across various sectors .

Computer vision enhances autonomous vehicles by allowing them to interpret their surroundings accurately. Key technologies include object detection, which identifies other vehicles, pedestrians, and road signs . Image classification categorizes visual data for path planning and decision-making processes. Image segmentation further refines the vehicle's comprehension of the environment by distinguishing lanes and boundaries. These tasks combine to enable functions such as adaptive cruise control, lane-keeping assistance, and emergency braking, as implemented in systems like Tesla's Autopilot .

Image convolution is a process where a kernel or filter slides over an image to apply linear operations, enhancing certain features such as edges or textures . This operation involves multiplying the kernel's values with the underlying pixel values and summing them, resulting in a feature map that represents specific features of the input . Convolution is crucial in computer vision as it automates feature extraction by focusing on local patterns that are invariant to object position. This capability is foundational in convolutional neural networks (CNNs), where multiple convolutional layers progressively extract high-level features, facilitating tasks like object detection and image classification .

Real-time visual translation in apps like Google Translate relies on computer vision technologies such as text detection and optical character recognition (OCR). The app uses a smartphone camera to capture text in an image, detects the text region through image processing, and then applies OCR to interpret and convert the visual text into digital text, allowing it to be translated in real-time . The capability to process images rapidly is supported by convolutional neural networks (CNNs) that identify and highlight text features for accurate translation into other languages .

Pixels are the smallest units of a digital image, each representing a specific color or intensity. Resolution, measured by the total number of pixels in an image or display, defines the image's detail level as width by height. High resolution implies more pixels, leading to clearer, more detailed images . Together, pixels and resolution determine an image's quality—higher resolutions with more pixels provide sharper and more detailed visuals. However, the benefit of resolution is contingent upon pixel density and image processing quality, which influence how cleanly an image is rendered at different sizes .

You might also like