Optical Character Recognition Overview

This document discusses optical character recognition (OCR) and a proposed solution for an OCR system. It provides background on OCR, including that it can recognize both handwritten and printed characters and convert them to a digital format. The proposed solution involves preprocessing the image through noise removal, segmentation of text into lines, words and characters, and using a neural network for character recognition trained on generated character samples. It discusses performing image acquisition, noise removal, normalization, tilt detection, line detection, word detection, and character detection. Limitations include issues with text separation and requiring high contrast between text and background.

Uploaded by

nancy Poonia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views7 pages

Optical Character Recognition Overview

Uploaded by

nancy Poonia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Optical Character Recognizer

(A broader aspect of handwritten digit recognizer)

CS-16
Project Supervisor : Dr. Krishna K. Mishra
Team Member:
Payal Gupta (20188062)
Prerna Agarwal (20184066)
Nancy (20184191)
Harshit Meena (20164065)
Intoduction
Text is everywhere! It is present in PDFs, docs as well as
images. There are lots of applications where text data is useful
for doing analytics like include receipts recognition, number
plate detection, extracting the latex formulas from the images
etc. By using the computer’s voice-operated program, blind
people can scan books, magazines, and incoming faxes into
word processing programs with ease.
As OCR stands for optical character recognition, OCR
technology deals with the problem of recognizing all kinds of
different characters. Both handwritten and printed characters
can be recognized and converted into a machine-readable,
digital data format.
Think of any kind of serial number or code consisting of
numbers and letters that you need digitized. By using OCR you
can transform these codes into a digital output. The technology
makes use of many different techniques. Put simply, the image
taken is processed, the characters extracted, and are then
recognized.
What OCR does not do is consider the actual nature of the
object that you want to scan. It simply “takes a look” at the
characters that you aim to transform into a digital format. For
example, if you scan a word it will learn and recognize the
letters, but not the meaning of the word.
LITERATURE REVIEW
Character recognition is not a new problem but its roots can
be traced back to systems before the inventions of computers.
The earliest OCR systems were not computers but mechanical
devices that were able to recognize characters, but very slow
speed and low accuracy. The early OCR systems were
criticized due to errors and slow recognition speed. Hence, not
much research efforts were put on the topic during 60’s and
70’s. The only developments were done on government
agencies and large corporations like banks, newspapers and
airlines etc. OCR text works efficiently with the printed text
only and not with handwritten text.
Documents generated on a high quality paper with modern
printing technologies allow the systems to exceed 99%
recognition accuracy. However, the recognition rate of the
commercially available products depends on the age of the
documents, quality of the paper and ink, which may result in
significant data acquisitions noise. Documents with coloured
or patterned backgrounds, marked with pens, crooked when
scanned, can yield poor OCR results. Some improvement can
be done by either adjusting the scanner settings and
rescanning the document or manually correcting the electronic
data.
PROPOSED SOLUTION
The OCR is performed in the following phases:
• Image is retrieved The image should be cropped in such
a way that only text is present. Also, the background
should be very lighter than the text. Ideal image would
be black text on a white paper background.
• Preprocessing Noises are removed by blurring. The it is
converted to binary image along with invert. For this
we've used OpenCV methods such as gaussian blur and
threshold.
• Segmentation Segmentation is divided into three parts.
First we segment the image based on lines. Then the
lines are separated into words. Lastly, the words are
separated into characters. OpenCV methods such as
projections and contour detections are used. The
characters are then fed into the neural network.
• Neural Network There are two parts to neural network.
First is Training Neural Network. For training the neural
network, we will first generate our own samples for each
characters. So we will then converte those images into
numpy array and combine all samples with
corresponding labels required by the neural network.
Second is Recognizing characters.
• Along with that, we also checked each words in the
english dictionary to fix the spelling errors.
SIMULATIONS and CONCLUSION
Image Processing-
[Link] Acquisition- Retrieve image saved in a remote
location in the computer.
[Link] Removal- Use blur/smoothen.
[Link]- Convert image pixels to one of two pixels –
either black or white.
[Link] Detection- Detect text pixels
[Link] Detection- Calculate horizontal projections of the
image.
[Link] Detection- Calculate vertical projections of the image.
[Link] Detection- Create separate character images based
on contours.
Limitations-
Text separation.
•The system cannot work with an input image consisting of
only a small amount of text and a large amount of scenery.
•The text should be darker and the background should be
brighter.

REFERENCES
[Link]
[Link]
[Link]

Common questions

In current OCR processes, the arrangement of images affects system effectiveness. Ideally, the text should contrast sharply against a brighter or plain white background to enhance clarity. Poorly arranged texts, such as those intertwined with heavy scenery or on dark backgrounds, can yield poor recognition results. Crooked scans or those with visible markings can further degrade OCR performance, necessitating additional preprocessing or manual corrections to improve accuracy .

Noise significantly affects the accuracy of OCR systems as it can distort text characters during scanning, leading to misrecognition. Effective noise removal is essential for enhancing recognition accuracy; this is typically achieved via noise reduction techniques such as blurring. Converting the image into a binary format further aids in clarity, facilitating better recognition by neural networks. Proper preprocessing mitigates noise-induced errors, elevating OCR's text recognition efficacy .

Image normalization in OCR is crucial as it standardizes different image attributes, converting pixels to a binary (black and white) format. This transformation simplifies the image, focusing computational resources on extracting text rather than processing various color distinctions, thus making the characters easily recognizable by subsequent OCR processes like segmentation and neural network recognition .

The OCR system addresses preprocessing issues by removing noise using blurring techniques and converting the image to binary form. OpenCV methods like Gaussian blur and thresholding are employed to ensure that characters are clearly distinguishable. Optimal results are achieved with an ideal image being black text on a white background .

OCR technology historically faced challenges like slow speed and low accuracy, particularly pertaining to handwritten text recognition. Earliest systems, being mechanical, were criticized for these limitations, which led to limited research in the 60s and 70s, with advancements being confined primarily to high-quality printed text used by banks and airlines. Hence, commercial OCR systems work efficiently with printed text on high-quality paper and modern printing technologies but struggle with aged documents, paper quality, and backgrounds, leading to significant data noise .

OCR systems can be hindered by limitations such as the inability to effectively process images with minimal text against a complex background. The text needs to be significantly darker than the background, which should be bright. Other challenges include dealing with images where the text is crooked, marked by pens, or affected by colored/patterned backgrounds, which increases the probability of OCR errors .

Advancements in OCR technology have allowed visually impaired individuals to scan textual content from books, magazines, and other documents using voice-operated programs. This technology facilitates conversion of scanned text into audible output or digital text, thereby improving access to written information and enhancing their ability to independently process written materials .

Neural networks in OCR systems are crucial for recognizing characters. The process involves training a network where samples of characters are converted into numpy arrays and labeled appropriately. Once trained, this network is used to recognize scanned characters from segmented images. This involves feeding characters obtained from segmentation phases into the trained model for identification. Additionally, words are checked against an English dictionary to correct potential spelling errors stemming from misrecognition .

Segmentation is essential in OCR as it facilitates breaking down an image into manageable parts, enabling detailed analysis. It is implemented in three stages: firstly, segmenting the image based on lines, secondly splitting these into words, and finally separating words into characters. Techniques such as projections and contour detection using OpenCV methods are employed here to ensure precise segmentation, which is a foundational step before character recognition takes place .

Initially, OCR development lagged due to limited interest and technology. However, government agencies, banks, and large organizations such as airlines played a pivotal role by investing in high-quality OCR technologies suited for their specific needs—processing of bank checks, printed tickets, and newspapers. This necessity for accurate document processing pushed forward technological advancements leading to systems capable of achieving over 99% accuracy under suitable conditions .

Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
13 pages
OCR Techniques for English Text
No ratings yet
OCR Techniques for English Text
6 pages
OCR Techniques for Image Text Extraction
No ratings yet
OCR Techniques for Image Text Extraction
8 pages
Optical Character Recognition System Overview
No ratings yet
Optical Character Recognition System Overview
5 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
24 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
11 pages
Survey on Intelligent Form Reader
No ratings yet
Survey on Intelligent Form Reader
5 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
14 pages
OCR Solutions for Image Text Extraction
No ratings yet
OCR Solutions for Image Text Extraction
9 pages
OCR Systems: Historical Overview and Techniques
No ratings yet
OCR Systems: Historical Overview and Techniques
37 pages
Urdu Optical Character Recognition OCR Thesis Zaheer Ahmad Peshawar Its Soruce Code Is Available On MATLAB Site 21-01-09
100% (1)
Urdu Optical Character Recognition OCR Thesis Zaheer Ahmad Peshawar Its Soruce Code Is Available On MATLAB Site 21-01-09
61 pages
Handwritten Character Recognition Project
No ratings yet
Handwritten Character Recognition Project
46 pages
OCR Techniques for Image Processing
No ratings yet
OCR Techniques for Image Processing
8 pages
Optical Character Recognition with ANN
No ratings yet
Optical Character Recognition with ANN
3 pages
Embedded OCR Techniques Overview
No ratings yet
Embedded OCR Techniques Overview
27 pages
Overview of Optical Character Recognition
No ratings yet
Overview of Optical Character Recognition
16 pages
OCR Text Extraction: A Systematic Review
No ratings yet
OCR Text Extraction: A Systematic Review
6 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
7 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
12 pages
Neural Network for Character Recognition
No ratings yet
Neural Network for Character Recognition
3 pages
OCR Technology Using Python Libraries
No ratings yet
OCR Technology Using Python Libraries
24 pages
Text Retrieval From Scanned Forms Using Optical Character Recognition Springerlink
No ratings yet
Text Retrieval From Scanned Forms Using Optical Character Recognition Springerlink
10 pages
Overview of Optical Character Recognition
No ratings yet
Overview of Optical Character Recognition
6 pages
Optical Character Recognition Overview
No ratings yet
Optical Character Recognition Overview
71 pages
OCR Technology: Transforming Text Extraction
No ratings yet
OCR Technology: Transforming Text Extraction
4 pages
OCR Text Detection Overview
No ratings yet
OCR Text Detection Overview
12 pages
Optical Character Recognition System Design
No ratings yet
Optical Character Recognition System Design
41 pages
Optical Character Recognition:: An Illustrated Guide To The Frontier
No ratings yet
Optical Character Recognition:: An Illustrated Guide To The Frontier
197 pages
Design of An OCR System and Its Hardware Implementation
No ratings yet
Design of An OCR System and Its Hardware Implementation
18 pages
Neural Networks for Optical Character Recognition
100% (1)
Neural Networks for Optical Character Recognition
4 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
7 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
4 pages
Text Recognition via Optical Character Recognition
No ratings yet
Text Recognition via Optical Character Recognition
4 pages
Seminar Report on Optical Character Recognition
50% (2)
Seminar Report on Optical Character Recognition
33 pages
Handwritten Text Recognition Project
No ratings yet
Handwritten Text Recognition Project
6 pages
EasyOCR: Multilingual Text Recognition
No ratings yet
EasyOCR: Multilingual Text Recognition
11 pages
AI-Powered OCR for Handwritten Text
No ratings yet
AI-Powered OCR for Handwritten Text
20 pages
Optical Character Recognition Using MATLAB: Sandeep Tiwari, Shivangi Mishra, Priyank Bhatia, Praveen Km. Yadav
No ratings yet
Optical Character Recognition Using MATLAB: Sandeep Tiwari, Shivangi Mishra, Priyank Bhatia, Praveen Km. Yadav
4 pages
Efficient OCR System with ASVM Classifier
No ratings yet
Efficient OCR System with ASVM Classifier
7 pages
A Review On OCR Technology
No ratings yet
A Review On OCR Technology
5 pages
Text Recognition Algorithm for OCR
No ratings yet
Text Recognition Algorithm for OCR
4 pages
Optical Character Recognition Overview
No ratings yet
Optical Character Recognition Overview
5 pages
Advances in Optical Character Recognition
No ratings yet
Advances in Optical Character Recognition
5 pages
OCR System Overview and Benefits
No ratings yet
OCR System Overview and Benefits
28 pages
Advances in Optical Character Recognition
No ratings yet
Advances in Optical Character Recognition
9 pages
OCR System Overview and Benefits
No ratings yet
OCR System Overview and Benefits
15 pages
Devanagari OCR Techniques and Methods
No ratings yet
Devanagari OCR Techniques and Methods
5 pages
Optical Character Recognition Overview
No ratings yet
Optical Character Recognition Overview
5 pages
Handwriting Recognition with OCR & Neural Networks
No ratings yet
Handwriting Recognition with OCR & Neural Networks
6 pages
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
No ratings yet
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
14 pages
A Survey of Modern Optical Character Rec PDF
No ratings yet
A Survey of Modern Optical Character Rec PDF
37 pages
Text Recognition
No ratings yet
Text Recognition
11 pages
OCR Technology in Various Industries
No ratings yet
OCR Technology in Various Industries
7 pages
OCR with CNN: Image to Text Conversion
No ratings yet
OCR with CNN: Image to Text Conversion
5 pages
Tesseract OCR: Open Source Case Study
No ratings yet
Tesseract OCR: Open Source Case Study
7 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
32 pages
Text Extraction Using OCR Techniques
No ratings yet
Text Extraction Using OCR Techniques
5 pages
Understanding Optical Character Recognition
100% (1)
Understanding Optical Character Recognition
17 pages
Struggling Readers: Prevention & Intervention
No ratings yet
Struggling Readers: Prevention & Intervention
33 pages
Outbound Delivery Process Guide
No ratings yet
Outbound Delivery Process Guide
35 pages
Mumbai University Engineering Mechanics Exam 2017
No ratings yet
Mumbai University Engineering Mechanics Exam 2017
42 pages
Majlis Madani Muzakrah Guidelines
No ratings yet
Majlis Madani Muzakrah Guidelines
20 pages
Understanding System Calls in OS
No ratings yet
Understanding System Calls in OS
5 pages
Save Your People: Psalm 27 Song
No ratings yet
Save Your People: Psalm 27 Song
1 page
Quick Study Guide for Accounting Skills
No ratings yet
Quick Study Guide for Accounting Skills
14 pages
Writing Effective Critiques: Guidelines
No ratings yet
Writing Effective Critiques: Guidelines
27 pages
15 Essential Italian Verbs for Beginners
No ratings yet
15 Essential Italian Verbs for Beginners
9 pages
Effective Class Management Strategies
No ratings yet
Effective Class Management Strategies
31 pages
EE229 Third Problem Assignment
No ratings yet
EE229 Third Problem Assignment
5 pages
Samsung A04 User Manual Overview
No ratings yet
Samsung A04 User Manual Overview
112 pages
Graham Lambkin Solos Book
No ratings yet
Graham Lambkin Solos Book
3 pages
Efficient Railway Reservation System Design
No ratings yet
Efficient Railway Reservation System Design
11 pages
Java 8 Stream vs Parallel Stream Guide
No ratings yet
Java 8 Stream vs Parallel Stream Guide
11 pages
Set Representation Methods Explained
No ratings yet
Set Representation Methods Explained
2 pages
Dawoodi Collection of Rare Manuscripts
No ratings yet
Dawoodi Collection of Rare Manuscripts
54 pages
Introduction to HTML for Class 7
No ratings yet
Introduction to HTML for Class 7
5 pages
FastDriveVLA: Efficient Token Pruning for VLA
No ratings yet
FastDriveVLA: Efficient Token Pruning for VLA
9 pages
PHP Basics and Coding Questions
100% (1)
PHP Basics and Coding Questions
179 pages
Essential Microsoft Office Shortcuts
No ratings yet
Essential Microsoft Office Shortcuts
9 pages
Data Structures Question Bank - B.Tech II Sem
No ratings yet
Data Structures Question Bank - B.Tech II Sem
2 pages
Week 6 Compiler Design MCQs
100% (1)
Week 6 Compiler Design MCQs
3 pages
Sport
No ratings yet
Sport
1 page
Train to Pakistan: A Partition Novel Analysis
No ratings yet
Train to Pakistan: A Partition Novel Analysis
4 pages
Deductive Reasoning Explained with Examples
No ratings yet
Deductive Reasoning Explained with Examples
2 pages
Shapin Steven The Scientific Revolution PDF
No ratings yet
Shapin Steven The Scientific Revolution PDF
236 pages
EEE 321 Signals and Systems Lab Guide
No ratings yet
EEE 321 Signals and Systems Lab Guide
7 pages
Chakravarti Uma Social Dimensions of Early Buddhism 249p PDF
100% (5)
Chakravarti Uma Social Dimensions of Early Buddhism 249p PDF
251 pages
Prakrit Loanwords in Kannada Hymns
No ratings yet
Prakrit Loanwords in Kannada Hymns
61 pages