Precision and Recall in Information Retrieval

The document outlines an assignment focused on implementing a program to calculate precision and recall in information retrieval systems, emphasizing the understanding of these metrics and indexing structures. It explains the definitions of precision and recall, their trade-offs, and the challenges in accurately measuring them. Additionally, it includes sample code in C++ and concludes with questions for further understanding of the concepts discussed.

Uploaded by

02 - Prathmesh Khandare

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views6 pages

Precision and Recall in Information Retrieval

Uploaded by

02 - Prathmesh Khandare

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Assignment No.

ProblemStatement:
Implement a program to calculate precision and recall for sample input. (Answer set A, Query q1,
Relevant documents to query q1- Rq1 )

Objectives:
1. To understand precision and recall in information retrieval
2. To study indexing structures for information retrieval.

Outcomes:
At the end of the assignment the students should have:
1. Understood precision and recall in information retrieval.
2. Understood use of indexing in fast retrieval.

Theory:
Precision and Recall in Information Retrieval
Information Systems can be measured with two metrics: precision and recall. When a user decides to
search for information on a topic, the total database and the results to be obtained can be divided into 4
categories:
1. Relevant and Retrieved
2. Relevant and Not Retrieved
3. Non-Relevant and Retrieved
4. Non-Relevant and Not Retrieved
Relevant items are those documents that help the user in answering his question. Non-Relevant items
are items that don’t provide actually useful information. For each item there are two possibilities it can
be retrieved or not retrieved by the user’s query. Precision is defined as the ratio of the number of
relevant and retrieved documents(number of items retrieved that are actually useful to the user and
match his search need) to the number of total retrieved documents from the query. Recall is defined as
ratio of the number of retrieved and relevant documents(the number of items retrieved that are relevant
to the user and match his needs) to the number of possible relevant documents(number of relevant
documents in the database).Precision measures one aspect of information retrieval overhead for a user
associated with a particular search. If a search has 85 percent precision then 15(100-85) percent of user
effort is overhead reviewing non-relevant items. Recall measures to what extent a system processing a
particular query is able to retrieve the relevant items the user is interested in seeing. Recall is a very
useful concept but due to the denominator is non-calculable in operational systems. If the system is
made known the total set of relevant items in the database, recall can be made calculable.
Precision/recall trade-off
You can increase recall by returning more docs. Recall is a non-decreasing function of the number of
docs retrieved. A system that returns all docs has 100% recall! The converse is also true (usually): It’s
easy to get high precision for very low recall.
Consider an Information retrieval (IR) system returning relevant documents
Fig 1: IR system returning relevant documents

Precision and Recall explanation:

Consider,
I: an information request
R: the set of relevant documents for I
A: the answer set for I, generated by an IR system
R ∩ A: the intersection of the sets R and A
|A|-number of documents in the set A
|Ra |-number of documents in the intersection of sets R and A

The goal is to achieve high precision and high recall. The definition of precision and recall assumes
that all docs in the set A have been examined However, the user is not usually presented with all docs in
the answer set A at once User sees a ranked set of documents and examines them starting from the top
Thus, precision and recall vary as the user proceeds with their examination of the set A. Most
appropriate then is to plot a curve of precision versus recall.
If we proceed with our examination of the ranking generated, we can plot a curve of precision versus
recall as follows:

Thus, Precision and recall have been extensively used to evaluate the retrieval performance of IR
systems or algorithms. However, a more careful reflection reveals problems with these two measures:
First, the proper estimation of maximum recall for a query requires detailed knowledge of all the
documents in the collection Second, in many situations the use of a single measure could b e more
appropriate Third, recall and precision measure the effectiveness over a set of queries processed in batch
mode Fourth, for systems which require a weak ordering though, recall and precision might be
inadequate.
Sample code in C++
• Code
• Output

Conclusion: Implementation is concluded by executing a program to calculate precision and recall for
sample input with relevant documents Rq1 for query q1.

A. Write short answer of following questions:

1. What is precision and recall in IR systems?
2. How recall and precision measures are are defined?

B. Viva Questions:

1. What is relevance of document?

2. What are the metrics to measure information systems?
3. How are precision and recall calculated for information systems?.
4. What is the problem with these two measures?

Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
36 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
36 pages
IR Evaluation Methods and Metrics
No ratings yet
IR Evaluation Methods and Metrics
28 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
108 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
80 pages
Retrieval Performance Evaluation Metrics
No ratings yet
Retrieval Performance Evaluation Metrics
31 pages
Measuring Information Retrieval Effectiveness
No ratings yet
Measuring Information Retrieval Effectiveness
24 pages
IR Chapt 5
No ratings yet
IR Chapt 5
55 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
76 pages
Multi-Hop Retrieval Evaluation Insights
No ratings yet
Multi-Hop Retrieval Evaluation Insights
28 pages
IR System Evaluation Metrics Explained
No ratings yet
IR System Evaluation Metrics Explained
13 pages
Retrieval Evaluation Metrics in IR
No ratings yet
Retrieval Evaluation Metrics in IR
54 pages
Evaluating Information Retrieval Performance
No ratings yet
Evaluating Information Retrieval Performance
52 pages
IR Performance Evaluation Study Guide
No ratings yet
IR Performance Evaluation Study Guide
64 pages
Search Engine Evaluation Metrics Guide
No ratings yet
Search Engine Evaluation Metrics Guide
49 pages
Measuring Information Retrieval Effectiveness
No ratings yet
Measuring Information Retrieval Effectiveness
41 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
26 pages
Evaluating Information Retrieval Effectiveness
No ratings yet
Evaluating Information Retrieval Effectiveness
20 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
31 pages
Retrieval Evaluation Metrics Explained
No ratings yet
Retrieval Evaluation Metrics Explained
34 pages
Unit 3 (Isr)
No ratings yet
Unit 3 (Isr)
9 pages
Understanding Retrieval Model Performance
No ratings yet
Understanding Retrieval Model Performance
23 pages
IR System Evaluation Metrics
No ratings yet
IR System Evaluation Metrics
25 pages
Retrieval Evaluation Techniques
No ratings yet
Retrieval Evaluation Techniques
7 pages
Evaluating Modern Information Retrieval
No ratings yet
Evaluating Modern Information Retrieval
58 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
9 pages
User-Oriented Measures in IR System Evaluation
No ratings yet
User-Oriented Measures in IR System Evaluation
12 pages
Retrieval Evaluation in Information Systems
No ratings yet
Retrieval Evaluation in Information Systems
14 pages
IR Chapter V
No ratings yet
IR Chapter V
44 pages
Precision and Recall in IR Evaluation
No ratings yet
Precision and Recall in IR Evaluation
20 pages
Unit 3
No ratings yet
Unit 3
16 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
18 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
32 pages
Unit3 ISR
No ratings yet
Unit3 ISR
15 pages
Precision and Recall in Classification
No ratings yet
Precision and Recall in Classification
20 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
20 pages
Chapter Five
No ratings yet
Chapter Five
10 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
45 pages
Relevance Metrics in Software Engineering
No ratings yet
Relevance Metrics in Software Engineering
13 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
7 pages
Evaluation Metrics for Information Retrieval
No ratings yet
Evaluation Metrics for Information Retrieval
6 pages
Information Retrieval Evaluation Metrics
No ratings yet
Information Retrieval Evaluation Metrics
24 pages
Information Retrieval Evaluation Methods
No ratings yet
Information Retrieval Evaluation Methods
50 pages
Retrieval Evaluation Metrics Explained
No ratings yet
Retrieval Evaluation Metrics Explained
15 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
57 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
41 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
46 pages
Performance Evaluation in Information Retrieval
No ratings yet
Performance Evaluation in Information Retrieval
9 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
4 pages
Isr Q&a
No ratings yet
Isr Q&a
51 pages
Information Retrieval Evaluation Metrics
No ratings yet
Information Retrieval Evaluation Metrics
63 pages
8-Evaluation Measures
No ratings yet
8-Evaluation Measures
34 pages
Unit III Notes
No ratings yet
Unit III Notes
4 pages
9-Rank-Based Evaluation Measures and Result Summaries
No ratings yet
9-Rank-Based Evaluation Measures and Result Summaries
49 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
36 pages
Information Retrieval Models & Evaluation
No ratings yet
Information Retrieval Models & Evaluation
58 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
54 pages
Information Retrieval Evaluation
No ratings yet
Information Retrieval Evaluation
5 pages
Evaluating Information Retrieval Systems
No ratings yet
Evaluating Information Retrieval Systems
37 pages
Feature Extraction in 2D Color Images
No ratings yet
Feature Extraction in 2D Color Images
8 pages
Deep Learning Frameworks Overview and Implementation
No ratings yet
Deep Learning Frameworks Overview and Implementation
12 pages
Web Crawler Implementation in Python
No ratings yet
Web Crawler Implementation in Python
4 pages
ER Diagram for Exam System Analysis
No ratings yet
ER Diagram for Exam System Analysis
2 pages
SQL Triggers for Data Management
No ratings yet
SQL Triggers for Data Management
2 pages
Computer Network & Security Questions 2024
No ratings yet
Computer Network & Security Questions 2024
9 pages
Information Theory for Cyber Security
No ratings yet
Information Theory for Cyber Security
2 pages
LARK: Enhanced KG Reasoning with LLMs
No ratings yet
LARK: Enhanced KG Reasoning with LLMs
18 pages
Evolution of Programming Languages
No ratings yet
Evolution of Programming Languages
8 pages
EEG Channel Attention with Swin Transformer
No ratings yet
EEG Channel Attention with Swin Transformer
10 pages
Data Science Course Syllabus Overview
No ratings yet
Data Science Course Syllabus Overview
5 pages
PySpark ML Pipeline for Titanic Survival
No ratings yet
PySpark ML Pipeline for Titanic Survival
10 pages
Introduction to Machine Learning Concepts
No ratings yet
Introduction to Machine Learning Concepts
11 pages
Big Data Analysis with Python Libraries
No ratings yet
Big Data Analysis with Python Libraries
3 pages
Understanding Knowledge Graphs Basics
No ratings yet
Understanding Knowledge Graphs Basics
440 pages
Sales Data Overview and Analysis
No ratings yet
Sales Data Overview and Analysis
20 pages
Understanding Quantum Ledger Basics
No ratings yet
Understanding Quantum Ledger Basics
55 pages
DBMS Assignment Solutions for GTU 2025
No ratings yet
DBMS Assignment Solutions for GTU 2025
3 pages
Data Scientist Role in NLP - Abu Dhabi
No ratings yet
Data Scientist Role in NLP - Abu Dhabi
2 pages
Drone Survey for Land Encroachment Detection
No ratings yet
Drone Survey for Land Encroachment Detection
7 pages
Guidelines for VR/AR Education Systems
No ratings yet
Guidelines for VR/AR Education Systems
38 pages
AI-Driven Investment Insights at Goldman Sachs
No ratings yet
AI-Driven Investment Insights at Goldman Sachs
2 pages
Hadoop Basics: Overview and Features
No ratings yet
Hadoop Basics: Overview and Features
35 pages
Normalization and Functional Dependency in DBMS
No ratings yet
Normalization and Functional Dependency in DBMS
12 pages
AI Knowledge Representation Assignment
No ratings yet
AI Knowledge Representation Assignment
3 pages
Professional Slides and Infographics
100% (1)
Professional Slides and Infographics
20 pages
Enigma Cipher in Crime Detection System
No ratings yet
Enigma Cipher in Crime Detection System
4 pages
AI Development Course Overview
No ratings yet
AI Development Course Overview
52 pages
Introduction to Big Data Concepts
No ratings yet
Introduction to Big Data Concepts
28 pages
Deep Learning in Quantum Physics
No ratings yet
Deep Learning in Quantum Physics
205 pages
Technical Document Analyzer Overview
No ratings yet
Technical Document Analyzer Overview
49 pages
Database Modeling and SQL Techniques
No ratings yet
Database Modeling and SQL Techniques
5 pages
Research Findings and Project Highlights
No ratings yet
Research Findings and Project Highlights
2 pages
Kubernetes Overview and Key Concepts
No ratings yet
Kubernetes Overview and Key Concepts
76 pages
Understanding XML Basics and Uses
No ratings yet
Understanding XML Basics and Uses
38 pages
Understanding Sling Models in AEM
No ratings yet
Understanding Sling Models in AEM
2 pages

Precision and Recall in Information Retrieval

Uploaded by

Precision and Recall in Information Retrieval

Uploaded by

Assignment No.

Precision and Recall explanation:

A. Write short answer of following questions:

1. What is relevance of document?

You might also like