Speech and Optical Character Recognition

The document discusses applications of pattern recognition, focusing on speech recognition and optical character recognition (OCR). It outlines key features, algorithms, and use cases for speech recognition, as well as the importance and functioning of OCR technology in converting images of text into machine-readable formats. Additionally, it touches on scene analysis in film, emphasizing the need for thorough observation and note-taking for effective analysis.

Uploaded by

Charan Chowdary

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views13 pages

Speech and Optical Character Recognition

Uploaded by

Charan Chowdary

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

Pattern Recognition

CHAPTER 8: Applications of Pattern Recognition

Prepared By: Prof. Manisha C. Chandramaully
What is speech recognition?
• Speech recognition, also known as automatic speech
recognition (ASR), computer speech recognition, or
speech-to-text, is a capability which enables a program
to process human speech into a written format.
• While it’s commonly confused with voice recognition,
speech recognition focuses on the translation of speech
from a verbal format to a text one whereas voice
recognition just seeks to identify an individual user’s
voice.
Key features of effective speech
recognition:
• Language weighting: Improve precision by weighting specific
words that are spoken frequently (such as product names or
industry jargon), beyond terms already in the base vocabulary.
• Speaker labeling: Output a transcription that cites or tags
each speaker’s contributions to a multi-participant
conversation.
• Acoustics training: Attend to the acoustical side of the
business. Train the system to adapt to an acoustic environment
(like the ambient noise in a call center) and speaker styles (like
voice pitch, volume and pace).
• Profanity filtering: Use filters to identify certain words or
phrases and sanitize speech output.
Speech recognition algorithms:
• Natural language processing (NLP)
• Hidden markov models (HMM)
• N-grams
• Neural networks
• Speaker Diarization (SD)
Speech recognition use cases:
• Automotive
• Technology
• Healthcare
• Security
Character recognition:
• Characters are then identified using one of two
algorithms: pattern recognition or feature recognition.
Pattern recognition is used when the OCR program is
fed examples of text in various fonts and formats to
compare and recognize characters in the scanned
document or image file.
Optical Character Recognition
• Optical Character Recognition (OCR) is the process that
converts an image of text into a machine-readable text
format.
• For example, if you scan a form or a receipt, your
computer saves the scan as an image file. You cannot
use a text editor to edit, search, or count the words in
the image file.
Why is OCR important?
• Most business workflows involve receiving information from
print media.
• Paper forms, invoices, scanned legal documents, and printed
contracts are all part of business processes.
• These large volumes of paperwork take a lot of time and
space to store and manage.
• The process requires manual intervention and can be
tedious and slow.
• OCR technology solves the problem by converting text
images into text data that can be analyzed by other
business software.
How does OCR work?
• Image acquisition: A scanner reads documents and converts
them to binary data. The OCR software analyzes the scanned
image and classifies the light areas as background and the dark
areas as text.
• Preprocessing: The OCR software first cleans the image and
removes errors to prepare it for reading. Cleaning techniques are:
1. Deskewing or tilting the scanned document slightly to fix
alignment issues during the scan.
2. Despeckling or removing any digital image spots or smoothing
the edges of text images.
3. Cleaning up boxes and lines in the image.
4. Script recognition for multi-language OCR technology
Cont..
• Text recognition: The two main types of OCR algorithms or software
processes that an OCR software uses for text recognition are called pattern
matching and feature extraction.
• Pattern matching: Pattern matching works by isolating a character image,
called a glyph, and comparing it with a similarly stored glyph. Pattern
recognition works only if the stored glyph has a similar font and scale to the
input glyph. This method works well with scanned images of documents that
have been typed in a known font.
• Feature extraction: Feature extraction breaks down or decomposes the
glyphs into features such as lines, closed loops, line direction, and line
intersections. It then uses these features to find the best match or the nearest
neighbor among its various stored glyphs.
• Postprocessing: After analysis, the system converts the extracted text data
into a computerized file. Some OCR systems can create annotated PDF files
that include both the before and after versions of the scanned document.
Types of OCR
• Simple optical character recognition software
• Intelligent character recognition software
• Intelligent word recognition
• Optical mark recognition
OCR Aplications:
• Banking
• Healthcare
• Logistics
Scene Analysis
• How to Analyze a Scene ?
• While you can analyze an entire film, you can also
choose a scene from the movie and break it down even
further. Before you choose a scene you want to analyze,
watch the entire film first so you can understand what’s
happening. Go over the scene you want to analyze
multiple times so you can pick out the details and take
notes on it. Once you have your notes, you can write a
formal analysis essay about the scene.

Optical Character Recognition Overview
No ratings yet
Optical Character Recognition Overview
7 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
8 pages
Optical Character Recognition with ANN
No ratings yet
Optical Character Recognition with ANN
3 pages
Optical Character Recognition Overview
No ratings yet
Optical Character Recognition Overview
71 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
24 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
16 pages
OCR Technology: Transforming Text Extraction
No ratings yet
OCR Technology: Transforming Text Extraction
4 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
7 pages
Overview of Optical Character Recognition
No ratings yet
Overview of Optical Character Recognition
16 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
13 pages
OCR System Overview and Benefits
No ratings yet
OCR System Overview and Benefits
15 pages
Overview of Optical Character Recognition
No ratings yet
Overview of Optical Character Recognition
6 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
14 pages
Image to Text Conversion with OCR
No ratings yet
Image to Text Conversion with OCR
21 pages
OCR Software: Features & Implementation Guide
No ratings yet
OCR Software: Features & Implementation Guide
16 pages
Embedded OCR Techniques Overview
No ratings yet
Embedded OCR Techniques Overview
27 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
7 pages
Lec 2 Pattern Recognition
No ratings yet
Lec 2 Pattern Recognition
52 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
12 pages
Understanding Optical Character Recognition
100% (1)
Understanding Optical Character Recognition
17 pages
Seminar Report on Optical Character Recognition
50% (2)
Seminar Report on Optical Character Recognition
33 pages
OCR Systems: Historical Overview and Techniques
No ratings yet
OCR Systems: Historical Overview and Techniques
37 pages
OCR Text Detection Overview
No ratings yet
OCR Text Detection Overview
12 pages
OCR Implementation in MATLAB
No ratings yet
OCR Implementation in MATLAB
10 pages
OCR vs ICR: Key Differences Explained
No ratings yet
OCR vs ICR: Key Differences Explained
7 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
17 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
11 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
5 pages
OCR System Overview and Benefits
No ratings yet
OCR System Overview and Benefits
28 pages
Matlab OCR Project Overview
No ratings yet
Matlab OCR Project Overview
6 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
32 pages
Lesson 2
No ratings yet
Lesson 2
3 pages
Understanding Optical Character Recognition
100% (1)
Understanding Optical Character Recognition
36 pages
Optical Character Recognition:: An Illustrated Guide To The Frontier
No ratings yet
Optical Character Recognition:: An Illustrated Guide To The Frontier
197 pages
Devanagari OCR Techniques and Methods
No ratings yet
Devanagari OCR Techniques and Methods
5 pages
A Survey of Modern Optical Character Rec PDF
No ratings yet
A Survey of Modern Optical Character Rec PDF
37 pages
OCR and gTTS Implementation Overview
No ratings yet
OCR and gTTS Implementation Overview
49 pages
Evaluation of OCR Technologies
No ratings yet
Evaluation of OCR Technologies
41 pages
Understanding Optical Character Recognition
No ratings yet
Understanding Optical Character Recognition
22 pages
Matlab OCR Project Overview
No ratings yet
Matlab OCR Project Overview
7 pages
Overview of Optical Character Recognition
No ratings yet
Overview of Optical Character Recognition
7 pages
Introduction to Optical Character Recognition
No ratings yet
Introduction to Optical Character Recognition
29 pages
OCR Text Extraction: A Systematic Review
No ratings yet
OCR Text Extraction: A Systematic Review
6 pages
Presentation 1
No ratings yet
Presentation 1
15 pages
OCR Techniques for Image Text Extraction
No ratings yet
OCR Techniques for Image Text Extraction
8 pages
MATLAB OCR Implementation Guide
No ratings yet
MATLAB OCR Implementation Guide
27 pages
Optical Character Recognition Using MATLAB: Sandeep Tiwari, Shivangi Mishra, Priyank Bhatia, Praveen Km. Yadav
No ratings yet
Optical Character Recognition Using MATLAB: Sandeep Tiwari, Shivangi Mishra, Priyank Bhatia, Praveen Km. Yadav
4 pages
Urdu Optical Character Recognition OCR Thesis Zaheer Ahmad Peshawar Its Soruce Code Is Available On MATLAB Site 21-01-09
100% (1)
Urdu Optical Character Recognition OCR Thesis Zaheer Ahmad Peshawar Its Soruce Code Is Available On MATLAB Site 21-01-09
61 pages
Neural Network for Character Recognition
No ratings yet
Neural Network for Character Recognition
3 pages
Optical Character Recognition System Overview
No ratings yet
Optical Character Recognition System Overview
5 pages
Enhancing Listening Skills with OCR and TTS
No ratings yet
Enhancing Listening Skills with OCR and TTS
42 pages
AI-Powered OCR for Handwritten Text
No ratings yet
AI-Powered OCR for Handwritten Text
20 pages
Intelligent Process Automation Ocr Whitepaper PDF
100% (1)
Intelligent Process Automation Ocr Whitepaper PDF
16 pages
Survey on Intelligent Form Reader
No ratings yet
Survey on Intelligent Form Reader
5 pages
Best Practices for OCR Accuracy
No ratings yet
Best Practices for OCR Accuracy
4 pages
NLP Applications Overview
No ratings yet
NLP Applications Overview
26 pages
Must-Have AI Skills for Engineers
No ratings yet
Must-Have AI Skills for Engineers
4 pages
Informed Search Algorithms in AI
No ratings yet
Informed Search Algorithms in AI
4 pages
Introduction to AI for Class 9
No ratings yet
Introduction to AI for Class 9
21 pages
AI Class 10: Key Concepts & Q&A Guide
No ratings yet
AI Class 10: Key Concepts & Q&A Guide
3 pages
Artificial Intelligence Fundamentals With Capstone - Orientation Deck Q4
No ratings yet
Artificial Intelligence Fundamentals With Capstone - Orientation Deck Q4
19 pages
AI Expert Systems in Education: Enhancing HOTS
No ratings yet
AI Expert Systems in Education: Enhancing HOTS
5 pages
AI in Library and Information Science
No ratings yet
AI in Library and Information Science
13 pages
History and Concepts of AI
No ratings yet
History and Concepts of AI
17 pages
Understanding AI: A Comprehensive Primer
No ratings yet
Understanding AI: A Comprehensive Primer
17 pages
AI Exam Questions for IT Students
No ratings yet
AI Exam Questions for IT Students
2 pages
Understanding Artificial Intelligence Concepts
No ratings yet
Understanding Artificial Intelligence Concepts
15 pages
Machine Learning in Digital Forensics
No ratings yet
Machine Learning in Digital Forensics
7 pages
Four Steps to Robust AI Development
No ratings yet
Four Steps to Robust AI Development
59 pages
ISC Class XI - Notes
100% (1)
ISC Class XI - Notes
12 pages
Data Scientist Resume - Visalakshi Iyer
No ratings yet
Data Scientist Resume - Visalakshi Iyer
1 page
History of Artificial Intelligence Summary
No ratings yet
History of Artificial Intelligence Summary
5 pages
B.Sc English & Hindi Exam Pattern 2025
No ratings yet
B.Sc English & Hindi Exam Pattern 2025
43 pages
Technology Without Ethics Is A Ship Without A Rudd 250731 155126
No ratings yet
Technology Without Ethics Is A Ship Without A Rudd 250731 155126
12 pages
AI's Impact on Daily Life and Challenges
No ratings yet
AI's Impact on Daily Life and Challenges
2 pages
Bostrom & Yudkowsky (2014) - The Ethics of Artificial Intelligence (Cambridge Handbook of AI) .2up2
No ratings yet
Bostrom & Yudkowsky (2014) - The Ethics of Artificial Intelligence (Cambridge Handbook of AI) .2up2
11 pages
Computer Vision & Deep Learning Overview
No ratings yet
Computer Vision & Deep Learning Overview
40 pages
POS Tagging and Sequence Labeling in NLP
No ratings yet
POS Tagging and Sequence Labeling in NLP
69 pages
CNN and GAN for Plant Disease Detection
No ratings yet
CNN and GAN for Plant Disease Detection
10 pages
BFS Pathfinding in 2D Grid Lab Report
No ratings yet
BFS Pathfinding in 2D Grid Lab Report
5 pages
AI's Role in P vs NP Challenges
No ratings yet
AI's Role in P vs NP Challenges
4 pages
AI Planning and Learning Techniques
No ratings yet
AI Planning and Learning Techniques
141 pages
AI & Machine Learning Lecture Notes
No ratings yet
AI & Machine Learning Lecture Notes
62 pages
Class X AI Practical Portfolio 2025-26
No ratings yet
Class X AI Practical Portfolio 2025-26
9 pages
UNIT 1 Natural Language Processing by Codes With Duo
No ratings yet
UNIT 1 Natural Language Processing by Codes With Duo
24 pages
Multi-Robot Coordination in Soccer AI
No ratings yet
Multi-Robot Coordination in Soccer AI
54 pages

Speech and Optical Character Recognition

Uploaded by

Speech and Optical Character Recognition

Uploaded by

Pattern Recognition

CHAPTER 8: Applications of Pattern Recognition

You might also like