0% found this document useful (0 votes)
12 views3 pages

Understanding Computer Vision Basics

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views3 pages

Understanding Computer Vision Basics

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

Topic: Computer Vision

Class:X

Q1. What is Computer Vision?

Ans. The Computer Vision domain of Artificial Intelligence, enables machines to see through images
or visual data, process and analyse them on the basis of algorithms and methods in order to analyse
actual phenomena with images.

Q2. What are the applications of Computer Vision?

Ans. 1)Facial Recognition: With the advent of smart cities and smart homes, Computer Vision plays a
vital role in making the home smarter. Security being the most important application involves use of
Computer Vision for facial recognition. It can be either guest recognition or log maintenance of the
visitors. It also finds its application in schools for an attendance system based on facial recognition of
students.

2) Face Filters: The modern-day apps like Instagram and snapchat have a lot of features based on the
usage of computer vision. The application of face filters is one among them. Through the camera the
machine or the algorithm is able to identify the facial dynamics of the person and applies the facial
filter selected

3) Google’s Search by Image: The maximum amount of searching for data on Google’s search engine
comes from textual data, but at the same time it has an interesting feature of getting search results
through an image. This uses Computer Vision as it compares different features of the input image to
the database of images and give us the search result while at the same time analysing various
features of the image.

4) Computer Vision in Retail: The retail field has been one of the fastest growing field and at the
same time is using Computer Vision for making the user experience more fruitful. Retailers can use
Computer Vision techniques to track customers’ movements through stores, analyse navigational
routes and detect walking patterns. Inventory Management is another such application. Through
security camera image analysis, a Computer Vision algorithm can generate a very accurate estimate
of the items available in the store. Also, it can analyse the use of shelf space to identify suboptimal
configurations and suggest better item placement.

5) Self-Driving Cars: Computer Vision is the fundamental technology behind developing autonomous
vehicles. Most leading car manufacturers in the world are reaping the benefits of investing in
artificial intelligence for developing on-road versions of hands-free technology. This involves the
process of identifying the objects, getting navigational routes and also at the same time environment
monitoring.

6) Medical Imaging: For the last decades, computersupported medical imaging application has been
a trustworthy help for physicians. It doesn’t only create and analyse images, but also becomes an
assistant and helps doctors with their interpretation. The application is used to read and convert 2D
scan images into interactive 3D models that enable medical professionals to gain a detailed
understanding of a patient’s health condition.

7) Google Translate App: All you need to do to read signs in a foreign language is to point your
phone’s camera at the words and let the Google Translate app tell you what it means in your
preferred language almost instantly. By using optical character recognition to see the image and
augmented reality to overlay an accurate translation, this is a convenient tool that uses Computer
Vision.

Q3. Define pixel and resolution.

Ans The word “pixel” means a picture element. Every photograph, in digital form, is made up of
pixels. They are the smallest unit of information that make up a picture. Usually round or square,
they are typically arranged in a 2-dimensional grid.

Resolution The number of pixels in an image is sometimes called the resolution. When the term is
used to describe pixel count, one convention is to express resolution as the width by the height, for
example a monitor resolution of 1280×1024. This means there are 1280 pixels from one side to the
other, and 1024 from top to bottom.

Q4. Differentiate between grayscale and RGB images.

Ans Gray Scale Images

1)Grayscale images are images which have a range of shades of gray without apparent colour.

2)The darkest possible shade is black, which is the total absence of colour or zero value of pixel. The
lightest possible shade is white, which is the total presence of colour or 255 value of a pixel .
Intermediate shades of gray are represented by equal brightness levels of the three primary colours.
3)A grayscale has each pixel of size 1 byte having a single plane of 2d array of pixels. The size of a
grayscale image is defined as the Height x Width of that image.

RGB Images

1)All the images that we see around are coloured images.

2) These images are made up of three primary colours Red, Green and Blue. All the colours that are
present can be made by combining different intensities of red, green and blue

Q5. How can the resolution of a monitor with 1280×1024 pixels be expressed in megapixels?

Ans Another convention is to express the number of pixels as a single number, like a 5 mega pixel
camera (a megapixel is a million pixels). This means the pixels along the width multiplied by the
pixels along the height of the image taken by the camera equals 5 million pixels. In the case of our
1280×1024 monitors, it could also be expressed as 1280 x 1024 = 1,310,720, or 1.31 megapixels.

Q6. Define

1)Classification-Image Classification problem is the task of assigning an input image one label from a
fixed set of categories.

2) Classification + Localisation This is the task which involves both processes of identifying what
object is present in the image and at the same time identifying at what location that object is present
in that image. It is used only for single objects.

3) Object Detection-Object detection is the process of finding instances of real-world objects such as
faces, bicycles, and buildings in images or videos. Object detection algorithms typically use extracted
features and learning algorithms to recognize instances of an object category. It is commonly used in
applications such as image retrieval and automated vehicle parking systems.

4) Instance Segmentation-Instance Segmentation is the process of detecting instances of the objects,


giving them a category and then giving each pixel a label on the basis of that. A segmentation
algorithm takes an image as input and outputs a collection of regions (or segments).

Common questions

Powered by AI

Instance segmentation involves detecting and categorizing object instances within an image while assigning a label to each pixel. This technique is used to distinguish multiple objects of the same category and is valuable in applications requiring detailed scene understanding, such as autonomous driving, where precise object boundaries and relations are crucial for decision-making .

Computer Vision improves customer experience in retail by tracking customer movements, analyzing their navigational patterns, and optimizing inventory management. By studying customer paths, retailers can rearrange store layouts effectively, enhancing shopper convenience. The technology also provides accurate estimates of in-store items, assisting both customer inquiries and inventory replenishment .

Computer Vision enhances self-driving cars by enabling them to identify objects, navigate roads, and monitor the environment. Using image processing algorithms, self-driving cars recognize and differentiate between various obstacles and traffic signs, which is crucial for safe navigation. This technology allows cars to make real-time decisions based on their surroundings, such as adjusting speed or lane position .

Computer Vision in medical imaging provides several advantages, such as creating detailed 3D models from 2D scans and assisting in interpretation. This technology enhances diagnostic accuracy by offering more comprehensive views of patient conditions and supports healthcare professionals in planning treatment paths. It also streamlines the analysis process, allowing practitioners to focus on more critical tasks .

Grayscale images are represented by pixels with values ranging from 0 (black) to 255 (white), allowing only shades of gray. In contrast, RGB images use three values per pixel corresponding to red, green, and blue intensities, allowing the representation of a full spectrum of colors. These differences affect image data processing as grayscale images are simpler and require less storage and processing power, while RGB images, being more complex, provide richer detail and color information .

Pixel resolution has significant implications on image quality and usability. Higher resolutions provide finer detail, crucial for applications like medical imaging or scientific research where small details are vital. In retail and automotive applications, optimal resolution balances detail with processing capacity for real-time analysis. Thus, understanding the trade-offs in resolution helps in designing systems that meet specific application demands effectively .

Computer Vision contributes to facial recognition systems by enabling accurate and efficient identification of individuals based on facial features. Security applications benefit from this technology by maintaining logs of visitors, recognizing persons of interest in real-time, and thus increasing safety in smart homes and cities. The high accuracy in dynamic environments is made possible by advanced image processing algorithms developed within Computer Vision .

Face filters in mobile apps utilize Computer Vision by detecting and analyzing facial dynamics through the device camera. The technology recognizes facial features and applies digital enhancements or decorations accordingly, allowing users to interact with augmented reality elements seamlessly. This interaction is driven by efficient facial landmark detection algorithms that enable real-time rendering of filters .

Google's Search by Image feature utilizes Computer Vision to compare the input image against a vast database, extracting relevant features for the comparison. This process allows users to receive search results based on image content, rather than text alone, enhancing the capability to locate information about unknown objects or locations directly through images .

Object Detection plays a crucial role in automated vehicle parking systems by identifying and locating real-world objects such as cars, pedestrians, and parking spaces. By recognizing these objects through image features and learning algorithms, the system can make informed decisions about parking maneuvers, ultimately improving the efficiency and safety of automated parking .

You might also like