0% found this document useful (0 votes)

6 views6 pages

Query Equivalence

The document discusses two methods for evaluating relational algebra expressions: materialized evaluation, which stores intermediate results in temporary files, and pipelined evaluation, which processes operations simultaneously without storing intermediate results. It also covers query optimization, emphasizing the importance of reducing resource usage and improving execution speed through equivalent relational expressions and various optimization techniques. Additionally, it outlines several equivalence rules for transforming relational expressions to enhance query performance.

Uploaded by

kanhiyasingh2003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views6 pages

Query Equivalence

Uploaded by

kanhiyasingh2003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Evaluation of relational algebra expressions

Materialized evaluation − Evaluate one operation at a time. Evaluate

the expression in a bottom-up manner and stores intermediate results to
temporary files.

Store the result of A ⋈ B in a temporary file.

Store the result of C ⋈ D in a temporary file.
Finally, join the results stored in temporary files.
The overall cost=sum of costs of individual operations + cost of writing
intermediate results to disk, cost of writing results to results to temporary
files and reading them back is quite high.
Pipelined evaluation − Evaluate several operations simultaneously.
Result of one operation is passed to the next operation. Evaluate the
expression in a bottom-up manner and don’t store intermediate results to
temporary files.

Don’t store the result of A ⋈ B in a temporary file. Instead the result is

passed directly for projection with C and so on.
Query Equivalence

Query: A query is a request for information from a database.

Query Plans: A query plan (or query execution plan) is an ordered set of
steps used to access data in a SQL relational database management
system.
Query Optimization: A single query can be executed through different
algorithms or re-written in different forms and structures. Hence, the
question of query optimization comes into the picture – Which of these
forms or pathways is the most optimal? The query optimizer attempts to
determine the most efficient way to execute a given query by
considering the possible query plans.
Importance: The goal of query optimization is to reduce the
system resources required to fulfill a query, and ultimately provide
the user with the correct result set faster.
 First, it provides the user with faster results, which makes the
application seem faster to the user.
 Secondly, it allows the system to service more queries in
the same amount of time, because each request takes less
time than unoptimized queries.
 Thirdly, query optimization ultimately reduces the amount of
wear on the hardware (e.g. disk drives), and allows the
server to run more efficiently (e.g. lower power
consumption, less memory usage).

There are broadly two ways a query can be optimized:

1. Analyze and transform equivalent relational expressions:

Try to minimize the tuple and column counts of the intermediate
and final query processes (discussed here).
2. Using different algorithms for each operation: These
underlying algorithms determine how tuples are accessed from
the data structures they are stored in, indexing, hashing, data
retrieval and hence influence the number of disk and block
accesses (discussed in query processing).

Analyze and transform equivalent relational expressions.

Here, we shall talk about generating minimal equivalent expressions. To
analyze equivalent expression, listed are a set of equivalence rules.
These generate equivalent expressions for a query written in relational
algebra. To optimize a query, we must convert the query into its
equivalent form as long as an equivalence rule is satisfied.

Transformation of Relational Expressions

• Two relational algebra expressions are said to be equivalent if the two
expressions generate the same set of tuples on every legal database
instance
Note: order of tuples is irrelevant
we don’t care if they generate different results on databases that violate
integrity constraints
• In SQL, inputs and outputs are multisets of tuples
Two expressions in the multiset version of the relational algebra are said
to be equivalent if the two expressions generate the same multiset of
tuples on every legal database instance.
• An equivalence rule says that expressions of two forms are equivalent
Can replace expression of first form by second, or vice versa

1. Conjunctive selection operations can be written as a

sequence of individual selections. This is called a sigma-
cascade.

σθ 1∧θ 2 ( E)=σ θ 1 (σθ 2 (E))

Explanation: Applying condition intersection is expensive.

Instead, filter out tuples satisfying condition (inner selection)
and then apply condition (outer selection) to the then
resulting fewer tuples. This leaves us with less tuples to process
the second time. This can be extended for two or more
intersecting selections. Since we are breaking a single condition
into a series of selections or cascades, it is called a “cascade”.

2. Selection is commutative.

σθ 1 (σθ 2 ( E))=σθ 2 (σθ 1 ( E))

Explanation: condition is commutative in nature. This means,

it does not matter whether we apply first or first. In
practice, it is better and more optimal to apply that selection
first which yields a fewer number of tuples. This saves time on
our outer selection.

3. All following projections can be omitted, only the first

projection is required. This is called a pi-cascade.

Π L1 ( Π L 2 (…( Π Ln( E))…))=Π L 1 (E)

Explanation: A cascade or a series of projections is

meaningless. This is because in the end, we are only selecting
those columns which are specified in the last, or the outermost
projection. Hence, it is better to collapse all the projections into
just one i.e. the outermost projection.

4. Selections on Cartesian Products can be re-written as

Theta Joins.
 Equivalence 1

σθ (E1X E2 ) = E1 θ E2

Explanation: The cross product operation is known to

be very expensive. This is because it matches each
tuple of E1 (total m tuples) with each tuple of E2 (total
n tuples). This yields m*n entries. If we apply a
selection operation after that, we would have to scan
through m*n entries to find the suitable tuples which
satisfy the condition . Instead of doing all of this, it is
more optimal to use the Theta Join, a join specifically
designed to select only those entries in the cross
product which satisfy the Theta condition, without
evaluating the entire cross product first.

 Equivalence 2

σθ1 (E1 θ2 E2 ) = E1 θ1∧ θ2E2

Explanation: Theta Join radically decreases the

number of resulting tuples, so if we apply an
intersection of both the join conditions i.e. and
into the Theta Join itself, we get fewer scans to do. On
the other hand, a condition outside unnecessarily
increases the tuples to scan.

5. Theta Joins are commutative.

E1 θ E2 = E2 θ E1

Explanation: Theta Joins are commutative, and the query

processing time depends to some extent which table is used as
the outer loop and which one is used as the inner loop during
the join process (based on the indexing structures and blocks).

6. Join operations are associative.

 Natural Join

Explanation: Joins are all commutative as well as

associative, so one must join those two tables first
which yield less number of entries, and then apply the
other join.

 Theta Join

Explanation: Theta Joins are associative in the above

manner, where involves attributes from only E2 and
E3.

7. Selection operation can be distributed.

 Equivalence 1

Explanation: Applying a selection after doing the

Theta Join causes all the tuples returned by the Theta
Join to be monitored after the join. If this selection
contains attributes from only E1, it is better to apply
this selection to E1 (hence resulting in a fewer number
of tuples) and then join it with E2.

 Equivalence 2

Explanation: This can be extended to two selection

conditions, and , where Theta1 contains the
attributes of only E1 and contains attributes of only
E2. Hence, we can individually apply the selection
criteria before joining, to drastically reduce the number
of tuples joined.

8. Projection distributes over the Theta Join.

 Equivalence 1

Explanation: The idea discussed for selection can be

used for projection as well. Here, if L1 is a projection
that involves columns of only E1, and L2 another
projection that involves the columns of only E2, then it
is better to individually apply the projections on both
the tables before joining. This leaves us with a fewer
number of columns on either side, hence contributing
to an easier join.
 Equivalence 2

Explanation: Here, when applying projections L1 and

L2 on the join, where L1 contains columns of only E1
and L2 contains columns of only E2, we can introduce
another column E3 (which is common between both the
tables). Then, we can apply projections L1 and L2 on E1
and E2 respectively, along with the added column L3.
L3 enables us to do the join.

9. Union and Intersection are commutative.

Explanation: Union and intersection are both distributive; we

can enclose any tables in parentheses according to requirement
and ease of access.

10. Union and Intersection are associative.

Explanation: Union and intersection are both distributive; we

can enclose any tables in parentheses according to requirement
and ease of access.

11. Selection operation distributes over the union,

intersection, and difference operations.

Explanation: In set difference, we know that only those tuples

are shown which belong to table E1 and do not belong to table
E2. So, applying a selection condition on the entire set
difference is equivalent to applying the selection condition on
the individual tables and then applying set difference. This will
reduce the number of comparisons in the set difference step.

Query Optimization Techniques Explained
No ratings yet
Query Optimization Techniques Explained
8 pages
Query Optimization Techniques Explained
No ratings yet
Query Optimization Techniques Explained
17 pages
Query Optimization Techniques Explained
No ratings yet
Query Optimization Techniques Explained
22 pages
Equivalence Rules and Parsing in DBMS
No ratings yet
Equivalence Rules and Parsing in DBMS
34 pages
Optimizing Distributed Query Processing
No ratings yet
Optimizing Distributed Query Processing
41 pages
Equivalence Rules and Parsing in DBMS
No ratings yet
Equivalence Rules and Parsing in DBMS
33 pages
Query Optimization Techniques Explained
No ratings yet
Query Optimization Techniques Explained
53 pages
CHAPTER 5 Chat GPT
No ratings yet
CHAPTER 5 Chat GPT
33 pages
Relational Algebra Query Optimization
No ratings yet
Relational Algebra Query Optimization
24 pages
Query Processing and Optimization in DBMS
No ratings yet
Query Processing and Optimization in DBMS
17 pages
Query Processing and Optimization Steps
No ratings yet
Query Processing and Optimization Steps
34 pages
RDBMS Query Optimization Techniques
No ratings yet
RDBMS Query Optimization Techniques
11 pages
Query Optimization in DBMS
No ratings yet
Query Optimization in DBMS
53 pages
Query Processing and Optimization in DBMS
No ratings yet
Query Processing and Optimization in DBMS
41 pages
Query Processing in Relational Algebra
No ratings yet
Query Processing in Relational Algebra
22 pages
Query Processing and Cost Estimation
No ratings yet
Query Processing and Cost Estimation
25 pages
SQL Server Query Processing Overview
No ratings yet
SQL Server Query Processing Overview
10 pages
Query Processing and Optimization in DBMS
No ratings yet
Query Processing and Optimization in DBMS
47 pages
Query Optimization Techniques
No ratings yet
Query Optimization Techniques
28 pages
Query Processing: Parsing & Optimization
No ratings yet
Query Processing: Parsing & Optimization
38 pages
Query Processing and Optimization in DBMS
No ratings yet
Query Processing and Optimization in DBMS
21 pages
Query Processing and Optimization in DBMS
No ratings yet
Query Processing and Optimization in DBMS
47 pages
DBMS Chapter 6 - Query Processing and Optimization
No ratings yet
DBMS Chapter 6 - Query Processing and Optimization
34 pages
Query Processing and Optimization Guide
No ratings yet
Query Processing and Optimization Guide
24 pages
Query Processing and Optimization Guide
No ratings yet
Query Processing and Optimization Guide
42 pages
Hash Functions and Query Optimization
No ratings yet
Hash Functions and Query Optimization
29 pages
Query Evaluation Plan Transformation
No ratings yet
Query Evaluation Plan Transformation
30 pages
Query Optimization and Processing Steps
No ratings yet
Query Optimization and Processing Steps
5 pages
Query Optimization Techniques Explained
No ratings yet
Query Optimization Techniques Explained
63 pages
Supplementary Material - Query Processing and Optimization (Short)
No ratings yet
Supplementary Material - Query Processing and Optimization (Short)
49 pages
Query Optimization in SQL Explained
No ratings yet
Query Optimization in SQL Explained
58 pages
Understanding Relational Algebra Operations
No ratings yet
Understanding Relational Algebra Operations
18 pages
Heuristic Query Optimization in DBMS
No ratings yet
Heuristic Query Optimization in DBMS
29 pages
Query Optimization Techniques Explained
No ratings yet
Query Optimization Techniques Explained
19 pages
Ch13-Query Optimization
No ratings yet
Ch13-Query Optimization
42 pages
Relational Algebra Operations Explained
No ratings yet
Relational Algebra Operations Explained
23 pages
Evaluation of Relational Algebra in DBMS
No ratings yet
Evaluation of Relational Algebra in DBMS
5 pages
Query Processing and Optimization Techniques
No ratings yet
Query Processing and Optimization Techniques
37 pages
Query Processing and Optimization Techniques
No ratings yet
Query Processing and Optimization Techniques
10 pages
SQL Query Optimization Techniques
No ratings yet
SQL Query Optimization Techniques
20 pages
Chapter 5 Chatgpt2
No ratings yet
Chapter 5 Chatgpt2
22 pages
Query Processing and Optimization Guide
No ratings yet
Query Processing and Optimization Guide
39 pages
Query Optimization in Database Systems
No ratings yet
Query Optimization in Database Systems
57 pages
Query Optimization Techniques Explained
No ratings yet
Query Optimization Techniques Explained
37 pages
Parsing and Translation in Query Processing
No ratings yet
Parsing and Translation in Query Processing
63 pages
Relational Query Optimization Overview
No ratings yet
Relational Query Optimization Overview
72 pages
Union Operation in Relational Algebra
No ratings yet
Union Operation in Relational Algebra
11 pages
Relational Algebra and SQL Overview
No ratings yet
Relational Algebra and SQL Overview
7 pages
Query Processing and Optimization Guide
No ratings yet
Query Processing and Optimization Guide
23 pages
Distinct and Grouping in Relational Algebra
No ratings yet
Distinct and Grouping in Relational Algebra
4 pages
Query Processing and Optimization in DBMS
No ratings yet
Query Processing and Optimization in DBMS
31 pages
Query Processing and Optimization in DBMS
No ratings yet
Query Processing and Optimization in DBMS
36 pages
Relational Algebra Operations in DBMS
No ratings yet
Relational Algebra Operations in DBMS
22 pages
Outer Joins in Relational Algebra
No ratings yet
Outer Joins in Relational Algebra
64 pages
Left Outer Join in Relational Algebra
No ratings yet
Left Outer Join in Relational Algebra
5 pages
Relational Algebra Join Operations
No ratings yet
Relational Algebra Join Operations
29 pages
Understanding Relational Algebra Operations
No ratings yet
Understanding Relational Algebra Operations
10 pages
Relational Algebra for Query Optimization
No ratings yet
Relational Algebra for Query Optimization
10 pages
Heuristic Query Optimization Techniques
No ratings yet
Heuristic Query Optimization Techniques
6 pages
OOP Through Java Question Bank with Bloom's Taxonomy
No ratings yet
OOP Through Java Question Bank with Bloom's Taxonomy
4 pages
Java Inheritance and Final Keyword Guide
No ratings yet
Java Inheritance and Final Keyword Guide
32 pages
Vision-Feature Extraction Topics: Pattern Recognition For Vision Fall 2004
No ratings yet
Vision-Feature Extraction Topics: Pattern Recognition For Vision Fall 2004
25 pages
Prodapt Recruitment Drive for B.Tech Students
No ratings yet
Prodapt Recruitment Drive for B.Tech Students
3 pages
Understanding Cyber Crime Factors
No ratings yet
Understanding Cyber Crime Factors
10 pages
Introduction to Fintech Overview
No ratings yet
Introduction to Fintech Overview
37 pages
A Petri Net Approach Based Elementary Siphons Supervisor For Flexible Manufacturing Systems
No ratings yet
A Petri Net Approach Based Elementary Siphons Supervisor For Flexible Manufacturing Systems
8 pages
Trusted Authentication Setup for BI 4.3
No ratings yet
Trusted Authentication Setup for BI 4.3
6 pages
Mobile Developer CV - Gabriel Rosół
No ratings yet
Mobile Developer CV - Gabriel Rosół
1 page
Cybersovereignty: Global Internet Control
No ratings yet
Cybersovereignty: Global Internet Control
43 pages
Operator Precedence Parsing Explained
No ratings yet
Operator Precedence Parsing Explained
10 pages
Multi-Level Dermoscopy Image Enhancement
No ratings yet
Multi-Level Dermoscopy Image Enhancement
19 pages
Haicom HI-604 GPS Tracker Overview
No ratings yet
Haicom HI-604 GPS Tracker Overview
19 pages
Celonis Knowledge Model & Views Guide
100% (4)
Celonis Knowledge Model & Views Guide
24 pages
Online Bus Ticket Booking System SRS
No ratings yet
Online Bus Ticket Booking System SRS
18 pages
ELEC4601: Sample Questions Overview
No ratings yet
ELEC4601: Sample Questions Overview
12 pages
Operating System Overview and Evolution
No ratings yet
Operating System Overview and Evolution
52 pages
Oracle Database 19c Workshop Guide
100% (1)
Oracle Database 19c Workshop Guide
248 pages
C Functions: Scope, Arguments, and Recursion
No ratings yet
C Functions: Scope, Arguments, and Recursion
49 pages
GCP Data Engineer Resume Summary
No ratings yet
GCP Data Engineer Resume Summary
2 pages
IEC 61850 Project Specification Guidelines
No ratings yet
IEC 61850 Project Specification Guidelines
5 pages
Bambi Doe Nude Content Removal Notice
No ratings yet
Bambi Doe Nude Content Removal Notice
1 page
Complete LMS Admin Panel in Node.js
No ratings yet
Complete LMS Admin Panel in Node.js
3 pages
Profile - 2026-03-08T010655.907
No ratings yet
Profile - 2026-03-08T010655.907
12 pages
Configure Central Payments in SAP S/4HANA
No ratings yet
Configure Central Payments in SAP S/4HANA
13 pages
UI Developer Profile: Ashish Chakravarti
No ratings yet
UI Developer Profile: Ashish Chakravarti
1 page
Matrix Determinants and Their Applications
No ratings yet
Matrix Determinants and Their Applications
23 pages
Verilog Hardware Modeling Assignment 1
No ratings yet
Verilog Hardware Modeling Assignment 1
6 pages
ANSYS Mechanical APDL Tutorials 16.2
No ratings yet
ANSYS Mechanical APDL Tutorials 16.2
140 pages
CPU and Memory Essentials Guide
No ratings yet
CPU and Memory Essentials Guide
26 pages

Query Equivalence

Uploaded by

Query Equivalence

Uploaded by

Evaluation of relational algebra expressions

Materialized evaluation − Evaluate one operation at a time. Evaluate

Store the result of A ⋈ B in a temporary file.

Don’t store the result of A ⋈ B in a temporary file. Instead the result is

Query: A query is a request for information from a database.

There are broadly two ways a query can be optimized:

1. Analyze and transform equivalent relational expressions:

Analyze and transform equivalent relational expressions.

Transformation of Relational Expressions

1. Conjunctive selection operations can be written as a

σθ 1∧θ 2 ( E)=σ θ 1 (σθ 2 (E))

Explanation: Applying condition intersection is expensive.

σθ 1 (σθ 2 ( E))=σθ 2 (σθ 1 ( E))

Explanation: condition is commutative in nature. This means,

3. All following projections can be omitted, only the first

Π L1 ( Π L 2 (…( Π Ln( E))…))=Π L 1 (E)

Explanation: A cascade or a series of projections is

4. Selections on Cartesian Products can be re-written as

Explanation: The cross product operation is known to

σθ1 (E1 θ2 E2 ) = E1 θ1∧ θ2E2

Explanation: Theta Join radically decreases the

5. Theta Joins are commutative.

Explanation: Theta Joins are commutative, and the query

6. Join operations are associative.

Explanation: Joins are all commutative as well as

Explanation: Theta Joins are associative in the above

7. Selection operation can be distributed.

Explanation: Applying a selection after doing the

Explanation: This can be extended to two selection

8. Projection distributes over the Theta Join.

Explanation: The idea discussed for selection can be

Explanation: Here, when applying projections L1 and

9. Union and Intersection are commutative.

Explanation: Union and intersection are both distributive; we

10. Union and Intersection are associative.

Explanation: Union and intersection are both distributive; we

11. Selection operation distributes over the union,

Explanation: In set difference, we know that only those tuples

You might also like