Database Normalization Techniques Explained

The document discusses normalization in database management, detailing the different normal forms (1NF, 2NF, 3NF) and their significance in reducing data redundancy and anomalies. It explains the processes for achieving each normal form, including eliminating repeating groups, identifying primary keys, and addressing dependencies. The document emphasizes the importance of normalization for maintaining data integrity and optimizing database structure.

Uploaded by

clafedersin

CENG 3005

Database Management Systems

Week 9

• Normal Forms (continued)

Normalization
▪ 1NF (atomic values, primary key defined):
  {Order, Product, Customer, Address, Quantity, UnitPrice}
▪ 2NF (no partial dependency on the key):
  {Product, UnitPrice}
  {Order, Product, Quantity}
  {Order, Customer, Address}
▪ 3NF (no transitive dependency on the key):
  {Product, UnitPrice}
  {Order, Product, Quantity}
  {Order, Customer}
  {Customer, Address}
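
The 3NF schemas above can be read as projections of the original 1NF relation. The following is a minimal sketch in Python, assuming a few made-up sample rows (the order numbers, product names, customers and prices are hypothetical, not from the slides) and a small hypothetical helper named project.

```python
# Hypothetical sample rows for the 1NF relation
# {Order, Product, Customer, Address, Quantity, UnitPrice}.
ORDERS_1NF = [
    {"Order": 1, "Product": "pen", "Customer": "Ann", "Address": "1 Elm St",
     "Quantity": 10, "UnitPrice": 2},
    {"Order": 1, "Product": "ink", "Customer": "Ann", "Address": "1 Elm St",
     "Quantity": 1, "UnitPrice": 5},
    {"Order": 2, "Product": "pen", "Customer": "Bob", "Address": "9 Oak Ave",
     "Quantity": 3, "UnitPrice": 2},
]


def project(rows, attrs):
    """Relational projection: keep only attrs and drop duplicate tuples."""
    result = []
    for row in rows:
        t = tuple(row[a] for a in attrs)
        if t not in result:
            result.append(t)
    return result


# The four 3NF schemas listed on the slide.
product_price = project(ORDERS_1NF, ["Product", "UnitPrice"])
order_line = project(ORDERS_1NF, ["Order", "Product", "Quantity"])
order_customer = project(ORDERS_1NF, ["Order", "Customer"])
customer_address = project(ORDERS_1NF, ["Customer", "Address"])
```

In this sample, the unit price of "pen" and the address of "Ann" are each stored once after the projection, instead of once per order line.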
Normalization of DB Tables
▪ Normalization
– A process for evaluating and correcting table structures
• determines the optimal assignment of attributes to entities
– Normalization provides a micro view of entities
• focuses on the characteristics of specific entities
• may yield additional entities
– Works through a series of stages called normal forms
• 1NF → 2NF → 3NF → 4NF (optional)
– The higher the normal form, the slower the database response tends to be
• more joins are required to answer end-user queries

▪ So if the response is slower, why do database people normalize?
1. To reduce uncontrolled data redundancies
• this helps eliminate data anomalies
2. To produce controlled redundancies that link tables
Redundancy
▪ Dependencies between attributes cause redundancy
– E.g., all addresses in the same town have the same zip code

SSN   Name  Town         Zip
1234  Joe   Stony Brook  11790
4321  Mary  Stony Brook  11790
5454  Tom   Stony Brook  11790
...

Redundancy: the (Town, Zip) pair is repeated in every row.
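
A dependency such as Town → Zip can be tested against a table instance. The sketch below is one possible check, using the three rows from the slide; the function name fd_holds is an assumption for illustration. A table instance can only refute a dependency, never prove it for all future data.

```python
def fd_holds(rows, lhs, rhs):
    """Return True if no two rows agree on lhs but differ on rhs."""
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        value = tuple(row[a] for a in rhs)
        # setdefault stores the first rhs value seen for this lhs value.
        if seen.setdefault(key, value) != value:
            return False
    return True


people = [
    {"SSN": 1234, "Name": "Joe", "Town": "Stony Brook", "Zip": "11790"},
    {"SSN": 4321, "Name": "Mary", "Town": "Stony Brook", "Zip": "11790"},
    {"SSN": 5454, "Name": "Tom", "Town": "Stony Brook", "Zip": "11790"},
]

town_determines_zip = fd_holds(people, ["Town"], ["Zip"])
town_determines_name = fd_holds(people, ["Town"], ["Name"])
```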

Example

ER Model
SSN   Name  Address   Hobby
1111  Joe   123 Main  {biking, hiking}

Relational Model
SSN   Name  Address   Hobby
1111  Joe   123 Main  biking
1111  Joe   123 Main  hiking
...

Redundancy: Name and Address are repeated for every hobby.
Anomalies
Redundancy leads to anomalies:
– Update anomaly: if you need to change an Address, you must change it in multiple places (every row that repeats it)
– Deletion anomaly: suppose a person gives up all hobbies. Do we:
• Set the Hobby attribute to null? No, we cannot, because Hobby is part of the key
• Delete the entire row? No, since we would lose the other information in the row
– Insertion anomaly: a Hobby value must be supplied for any inserted row, since Hobby is part of the key
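
The update and insertion anomalies can be reproduced on a small table. This is a sketch using Python's sqlite3 module, assuming a single Person table keyed on (SSN, Hobby); the new address and the second person are hypothetical values. NOT NULL is declared explicitly because SQLite does not enforce it on ordinary primary key columns by default.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE Person (
        SSN     INTEGER NOT NULL,
        Name    TEXT,
        Address TEXT,
        Hobby   TEXT NOT NULL,      -- part of the key, so it cannot be null
        PRIMARY KEY (SSN, Hobby)
    )
""")
conn.executemany(
    "INSERT INTO Person VALUES (?, ?, ?, ?)",
    [(1111, "Joe", "123 Main", "biking"), (1111, "Joe", "123 Main", "hiking")],
)

# Update anomaly: one change of address has to touch every row for Joe.
rows_changed = conn.execute(
    "UPDATE Person SET Address = '456 Elm' WHERE SSN = 1111"
).rowcount

# Insertion anomaly: a person without a hobby cannot be stored.
try:
    conn.execute("INSERT INTO Person VALUES (2222, 'Mary', '7 Oak', NULL)")
    insert_rejected = False
except sqlite3.IntegrityError:
    insert_rejected = True
```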
Decomposition of Tables
▪ Solution: use two relations to store Person information
– Person1 (SSN, Name, Address)
– Hobbies (SSN, Hobby)
▪ The decomposition is more general: people without hobbies can now be described
▪ No update anomalies:
– Name and address are stored once
– A hobby can be supplied or deleted separately
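
The two-relation design can be sketched with Python's sqlite3 module as follows. Table and column names follow the slide; the second person and the new address are hypothetical values added for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Person1 (SSN INTEGER PRIMARY KEY, Name TEXT, Address TEXT);
    CREATE TABLE Hobbies (
        SSN   INTEGER NOT NULL REFERENCES Person1(SSN),
        Hobby TEXT NOT NULL,
        PRIMARY KEY (SSN, Hobby)
    );
    INSERT INTO Person1 VALUES (1111, 'Joe', '123 Main');
    INSERT INTO Hobbies VALUES (1111, 'biking');
    INSERT INTO Hobbies VALUES (1111, 'hiking');
    -- A person with no hobbies can now be stored (hypothetical row).
    INSERT INTO Person1 VALUES (2222, 'Mary', '7 Oak');
""")

# The address is stored once, so a change touches exactly one row.
rows_changed = conn.execute(
    "UPDATE Person1 SET Address = '456 Elm' WHERE SSN = 1111"
).rowcount

# Giving up all hobbies no longer deletes the person.
conn.execute("DELETE FROM Hobbies WHERE SSN = 1111")
people = conn.execute(
    "SELECT SSN, Name, Address FROM Person1 ORDER BY SSN"
).fetchall()
hobbies_left = conn.execute("SELECT * FROM Hobbies").fetchall()
```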
What if you need to combine tables?
▪ Suppose we combine borrower and loan to get
  bor_loan = (customer_id, loan_number, amount)
▪ The result may repeat information (loan L-100 in the slide's example)
A Combined Schema Without Repetition
▪ Consider combining loan_branch and loan
  loan_amt_br = (loan_number, amount, branch_name)
▪ No repetition (as suggested by the slide's example)
How to decide whether to split into smaller tables?
▪ Suppose we had started with bor_loan. How would we know to split it up (decompose it) into borrower and loan?
▪ Write a rule: “if there were a schema (loan_number, amount), then loan_number would be a candidate key”
▪ Denote it as a functional dependency:
  loan_number → amount
▪ In bor_loan, because loan_number is not a candidate key, the amount of a loan may have to be repeated. This indicates the need to decompose bor_loan.
▪ Not all decompositions are good. Suppose we decompose employee into
  employee1 = (employee_id, employee_name)
  employee2 = (employee_name, telephone_number, start_date)
▪ The next slide shows how we lose information: we cannot reconstruct the original employee relation, and so this is a lossy decomposition.
A Lossy Decomposition
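
A small sketch of why the employee1/employee2 split is lossy, assuming two hypothetical employees who share a name (all values below are made up for illustration). Joining the two projections on employee_name produces tuples that were never in the original relation.

```python
# Hypothetical employee relation:
# (employee_id, employee_name, telephone_number, start_date)
employee = [
    (1, "Kim", "555-0101", "1984-03-29"),
    (2, "Kim", "555-0199", "1981-01-16"),
]

# The two projections from the decomposition.
employee1 = {(eid, name) for eid, name, _, _ in employee}
employee2 = {(name, tel, start) for _, name, tel, start in employee}

# Natural join of the projections on employee_name.
joined = {
    (eid, name, tel, start)
    for eid, name in employee1
    for name2, tel, start in employee2
    if name == name2
}

# Tuples in the join that were not in the original relation.
spurious = joined - set(employee)
```

The join returns every original tuple plus the spurious ones, so we can no longer tell which telephone number and start date belong to which employee_id.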

First Normal Form
▪ A domain is atomic if its elements are considered to be indivisible units
– Examples of non-atomic domains:
• Sets of names, composite attributes
• Identification numbers like CS101 that can be broken up into parts
▪ A relational schema R is in first normal form if the domains of all attributes of R are atomic
▪ Non-atomic values complicate storage and encourage redundant (repeated) storage of data
– Example: a set of accounts stored with each customer, and a set of owners stored with each account
– We assume all relations are in first normal form
First Normal Form (Cont’d)
▪ Atomicity is actually a property of how the elements of the domain are used.
– Example: strings would normally be considered indivisible
– Suppose that students are given roll numbers which are strings of the form CS0012 or EE1127
– If the first two characters are extracted to find the department, the domain of roll numbers is not atomic.
– This leads to information being encoded in the application program rather than in the database.
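
The roll-number case can be illustrated with a short sketch. The function name department_of and the dept/serial column names are assumptions for illustration only.

```python
def department_of(roll_number: str) -> str:
    """Application-side decoding: the first two characters name the department.

    This rule lives in the program, not in the database, which is exactly
    the problem described above.
    """
    return roll_number[:2]


# Alternative: store the parts as separate atomic attributes
# (hypothetical column names), so no decoding rule is needed.
student = {"dept": "CS", "serial": "0012"}
roll_number = student["dept"] + student["serial"]
```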

Goal: Devise a Theory for the Following
▪ Decide whether a particular relation R is in “good” form.
▪ If a relation R is not in “good” form, decompose it into a set of relations {R1, R2, ..., Rn} such that
– each relation is in good form
– the decomposition is a lossless-join decomposition
▪ Our theory is based on:
– functional dependencies
– multivalued dependencies
How to determine Primary Key using Functional Dependencies
▪ Watch the video: [Link]
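
One common way to find keys from functional dependencies is to compute attribute closures: a set of attributes is a candidate key if its closure is the whole schema and no proper subset has that property. The sketch below is not taken from the video; it assumes the dependencies suggested by the Order example earlier in these slides (Order → Customer, Customer → Address, Product → UnitPrice, {Order, Product} → Quantity), and the function names are hypothetical.

```python
from itertools import combinations


def closure(attrs, fds):
    """All attributes determined by attrs under the given FDs."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result


def candidate_keys(attributes, fds):
    """Minimal attribute sets whose closure is the whole schema."""
    attributes = set(attributes)
    keys = []
    # Try small sets first so that supersets of a found key are skipped.
    for size in range(1, len(attributes) + 1):
        for combo in combinations(sorted(attributes), size):
            if any(set(key) <= set(combo) for key in keys):
                continue
            if closure(combo, fds) == attributes:
                keys.append(combo)
    return keys


ATTRS = ["Order", "Product", "Customer", "Address", "Quantity", "UnitPrice"]
FDS = [
    (["Order"], ["Customer"]),
    (["Customer"], ["Address"]),
    (["Product"], ["UnitPrice"]),
    (["Order", "Product"], ["Quantity"]),
]

keys = candidate_keys(ATTRS, FDS)
```

The brute-force search is exponential in the number of attributes, which is acceptable for classroom-sized schemas.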

Normalization Forms
(simple simple simple)

“Data depends on the key [1NF],
the whole key [2NF],
and nothing but the key [3NF]”

“All the arrows in the FDs go out of a candidate key” [BCNF]
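
The BCNF slogan can be turned into a check: every non-trivial dependency must have a left side whose closure is the whole relation. The sketch below assumes the relation {Order, Customer, Address} with the dependencies Order → Customer and Customer → Address; the function names are hypothetical.

```python
def closure(attrs, fds):
    """All attributes determined by attrs under fds (pairs of sets)."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return result


def bcnf_violations(attributes, fds):
    """Non-trivial FDs whose left side does not determine every attribute."""
    return [
        (lhs, rhs)
        for lhs, rhs in fds
        if not rhs <= lhs and closure(lhs, fds) != attributes
    ]


order_info = {"Order", "Customer", "Address"}
fds = [({"Order"}, {"Customer"}), ({"Customer"}, {"Address"})]

# Customer -> Address is an arrow that does not start at a key.
violations = bcnf_violations(order_info, fds)

# After splitting off {Customer, Address}, that relation has no violation.
split_violations = bcnf_violations(
    {"Customer", "Address"}, [({"Customer"}, {"Address"})]
)
```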
First normal form (1NF)
▪ First normal form (1NF)
– All data values are atomic
– Each row is unique (has a primary key)
▪ Step by step 1NF:
1. Eliminate repeating groups; eliminate all-null columns
2. Identify the primary key
– The primary key must uniquely identify every attribute value in a row
– A new key may have to be composed from several attributes
3. Identify all dependencies
– Dependencies can be depicted with the help of a dependency diagram

Example: Need for Normalization
▪ PROJ_NUM is intended to be the primary key but contains nulls
▪ Table entries invite data inconsistencies
– e.g. “Elect. Engineer”, “[Link].”, “EE”
▪ The table displays data redundancies that can cause data anomalies
– Update anomalies
• Modifying JOB_CLASS could require many alterations (all the rows for the same EMP_NUM)
– Insertion anomalies
• A new employee must be assigned to a project
– Deletion anomalies
• If an employee quits and the row is deleted, other vital data may be lost
Normalization: First Normal Form
▪ First Normal Form (1NF)
– All the primary key attributes are defined
– There are no repeating groups
– All attributes are dependent on the primary key

▪ Conversion to 1NF
– Objective
• Develop a proper primary key
– Steps
1. Eliminate repeating groups
– fill in the null cells with the appropriate data values
2. Identify the primary key
– identify the attribute(s) that uniquely identify each row
3. Identify all dependencies
– make sure all attributes are dependent on the primary key
Normalization: 1NF example
1. Eliminate repeating groups - Fill in the null cells to make each row define a single entity
2. Identify the primary key - Make sure all attributes are dependent on the primary key

Normalization: 1NF example
3. Identify all dependencies (in a dependency diagram)
– Desirable dependencies (arrows above)
• based on the primary key (functional dependency)
– Less desirable dependencies (arrows below)
• Partial dependency
– based on only part of a composite primary key
• Transitive dependency
– one nonprime attribute depends on another nonprime attribute
• Both are subject to data redundancies and anomalies
Normalization: Second Normal Form
▪ Second Normal Form (2NF)
– It is in 1NF
– There are no partial dependencies

▪ Conversion to 2NF
– Objective
• Eliminate partial dependencies
– Steps
1. Start with the 1NF format
2. Write each key component (with a partial dependency) on a separate line
3. Write the original (composite) key on the last line
4. Each component becomes the key of a new table
5. Write the dependent attributes after each key

1NF (PROJ_NUM, EMP_NUM, PROJ_NAME, EMP_NAME, JOB_CLASS, CHG_HOUR, HOURS)

PROJECT (PROJ_NUM, PROJ_NAME)
EMPLOYEE (EMP_NUM, EMP_NAME, JOB_CLASS, CHG_HOUR)
ASSIGNMENT (PROJ_NUM, EMP_NUM, HOURS)   (composite-key table from step 3; table name assumed)
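
The conversion steps can be sketched as a small routine that splits off every dependency whose left side is only part of the composite key. The dependency list below is inferred from the resulting tables and is an assumption, as is the premise that HOURS depends on the whole key; the function name to_2nf is hypothetical.

```python
KEY = {"PROJ_NUM", "EMP_NUM"}

# Assumed functional dependencies for the 1NF table.
FDS = [
    ({"PROJ_NUM"}, {"PROJ_NAME"}),
    ({"EMP_NUM"}, {"EMP_NAME", "JOB_CLASS", "CHG_HOUR"}),
    ({"PROJ_NUM", "EMP_NUM"}, {"HOURS"}),
]


def to_2nf(key, fds):
    """Return (key attributes, dependent attributes) for each 2NF table."""
    tables = []
    for lhs, rhs in fds:
        if lhs < key:  # proper subset of the key: a partial dependency
            tables.append((sorted(lhs), sorted(rhs)))
    # Last line: the original composite key with what depends on all of it.
    full = sorted(a for lhs, rhs in fds if lhs == key for a in rhs)
    tables.append((sorted(key), full))
    return tables


tables_2nf = to_2nf(KEY, FDS)
```

Each returned pair corresponds to one line written in steps 2 to 5: a key, followed by the attributes that depend on it.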
Normalization: 2NF example

Normalization: Third Normal Form
▪ Third Normal Form (3NF)
– It is in 2NF
– There are no transitive dependencies

▪ Conversion to 3NF
– Objective
• Eliminate transitive dependencies (TD)
– Steps
1. Start with the 2NF format
2. Break off the TD pieces and create separate tables

2NF: EMPLOYEE (EMP_NUM, EMP_NAME, JOB_CLASS, CHG_HOUR)

3NF: EMPLOYEE (EMP_NUM, EMP_NAME, JOB_CLASS)
     JOB (JOB_CLASS, CHG_HOUR)
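
A minimal sketch of breaking off the transitive dependency JOB_CLASS → CHG_HOUR, assuming three hypothetical employee rows (names, job classes and rates are made up). After the split, an hourly rate is changed in one place and every employee in that job class sees the new value.

```python
# Hypothetical 2NF rows: (EMP_NUM, EMP_NAME, JOB_CLASS, CHG_HOUR)
employee_2nf = [
    (101, "Ann", "Programmer", 35.0),
    (102, "Bob", "Programmer", 35.0),
    (103, "Cy", "Analyst", 48.0),
]

# EMPLOYEE (EMP_NUM, EMP_NAME, JOB_CLASS)
employee = sorted({(num, name, job) for num, name, job, _ in employee_2nf})

# JOB (JOB_CLASS, CHG_HOUR): one entry per job class.
job = dict(sorted({(job_class, rate) for _, _, job_class, rate in employee_2nf}))

# The rate is updated once instead of once per employee.
job["Programmer"] = 37.5

# Joining back on JOB_CLASS recovers each employee's current rate.
rates = [(name, job[job_class]) for _, name, job_class in employee]
```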

Normalization: 3NF example
