0% found this document useful (0 votes)

4 views16 pages

Data Store Design (Chapter 11)

The document is a comprehensive guide on data storage design, outlining its definition, objectives, and various storage formats including file-based and database systems. It details the characteristics of different types of databases, such as relational, object, multidimensional, and NoSQL databases, along with techniques for optimizing data storage efficiency and access speed. Additionally, it provides exam tips, potential questions, and structured long answers for key topics related to data storage design.

Uploaded by

Vercity Notes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views16 pages

Data Store Design (Chapter 11)

Uploaded by

Vercity Notes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

🔥 FINAL EXAM MASTER GUIDE — DATA

STORE DESIGN (Chapter 11)

1️⃣ WHAT IS DATA STORAGE DESIGN? ⭐⭐⭐

Definition (MEMORIZE)

Data storage design is the process of deciding how data will be stored and
handled by programs in the system to ensure efficiency, accuracy, and
performance.

Main Objectives:

● Select data storage format

● Convert logical model → physical model

● Ensure DFDs and ERDs balance

● Optimize storage efficiency & access speed

💡 Exam Tip:
If asked “Purpose of data storage design” → list these four points.

2️⃣ DATA STORAGE FORMATS ⭐⭐⭐⭐

Two Main Types:

1️⃣ Files
2️⃣ Databases

3️⃣ FILE-BASED DATA STORAGE ⭐⭐⭐

Definition:

A file is an electronic list of data formatted for a specific transaction.

Characteristics:

● Sequential organization

● Uses pointers

● Also called linked lists

● Fast for single-purpose tasks

📁 TYPES OF FILES (VERY IMPORTANT)

File Type Purpose

Master File Core data (customers,

products)

Look-up File Static values (country codes)

Transaction File Updates master file

Audit File Before & after images

History (Archive) File Past transactions

💡 Exam Favorite:
Differentiate master file and transaction file

4️⃣ DATABASES ⭐⭐⭐⭐

Definition:

A database is a collection of related data stored together and managed by a

DBMS.
DBMS:

● Creates databases

● Manipulates data

● Ensures security & integrity

5️⃣ TYPES OF DATABASES ⭐⭐⭐⭐⭐ (HIGH

PROBABILITY)

1️⃣ RELATIONAL DATABASE ⭐⭐⭐⭐⭐

Definition:

A relational database stores data in tables connected using primary and foreign
keys.

Key Concepts:

● Tables (relations)

● Primary Key

● Foreign Key

● Referential Integrity

● SQL

💡 Exam Gold Line:

Referential integrity ensures that relationships between tables remain valid.

2️⃣ OBJECT DATABASE ⭐⭐⭐

● Based on object-oriented concepts

● Data + behavior together

● Uses encapsulation

● Highly reusable

● Complex data support

💡 Edge Case:
Why object databases allow reuse?
✔ Encapsulation

3️⃣ MULTIDIMENSIONAL DATABASE ⭐⭐⭐⭐

● Used in data warehousing

● Supports DSS

● Uses:

○ Data warehouses

○ Data marts

💡 Exam Trick:
Used for business intelligence?
✔ Multidimensional DB

4️⃣ NoSQL DATABASE ⭐⭐⭐⭐

Characteristics:

● Not relational
● No SQL

● Designed for:

○ Big data

○ Cloud systems

○ Fast access

Types:

● Document-oriented (MongoDB)

● Wide-column (Cassandra)

● Graph databases

💡 Edge Question:
Which DB handles unstructured data?
✔ NoSQL

6️⃣ COMPARING STORAGE FORMATS ⭐⭐⭐⭐⭐

Common Exam Question:

Compare files and databases

Files:

❌ Redundancy
❌ Poor scalability
✔ Fast for specific tasks

Databases:

✔ Less redundancy
✔ Data sharing
✔ Future-proof
💡 Never recommend files for new systems

7️⃣ MOVING FROM LOGICAL TO PHYSICAL ⭐⭐⭐⭐

Three Data Models (VERY IMPORTANT)

1️⃣ CONCEPTUAL MODEL

● Business view

● Entities + relationships

● No DB concern

2️⃣ LOGICAL MODEL

● More detailed

● Attributes included

● Still DB independent

3️⃣ PHYSICAL MODEL ⭐⭐⭐⭐

● Actual DB blueprint

● DBMS-specific

● Includes:

○ Data types

○ PK & FK
○ Constraints

💡 Exam Line:
Physical ERD is used for database construction

8️⃣ METADATA ⭐⭐⭐

Definition:

Metadata is data about data.

Includes:

● Table definitions

● Column types

● Constraints

💡 Short Question Favorite:

What is metadata?

9️⃣ OPTIMIZING DATA STORAGE ⭐⭐⭐⭐⭐

Two Dimensions:

1️⃣ Storage Efficiency

2️⃣ Access Speed

🔹 STORAGE EFFICIENCY ⭐⭐⭐⭐

Best Technique:
Normalization

● Removes redundancy

● Reduces null values

● Improves integrity

💡 Exam Trap:
Normalization improves storage but may reduce speed

🔹 ACCESS SPEED OPTIMIZATION ⭐⭐⭐⭐⭐

Techniques (MEMORIZE):

1️⃣ Denormalization

● Adds redundancy intentionally

● Improves read speed

4 Reasons for Denormalization:

● Frequent joins

● Performance issues

● Read-heavy system

● Reporting needs

2️⃣ Clustering

● Store related records together

Types:
● Intrafile clustering → same table

● Interfile clustering → multiple tables

3️⃣ Indexing ⭐⭐⭐⭐

● Mini-table

● Faster search

● Uses extra storage

📌 Index Tradeoff:
❌ Storage overhead
✔ Speed

4️⃣ Volumetrics (Estimating Storage Size) ⭐⭐⭐

Steps:

1. Calculate raw data

2. Add DBMS overhead

3. Estimate growth

💡 Exam Scenario:
How to plan hardware requirements?
✔ Use volumetrics

🔟 MOST LIKELY FINAL EXAM QUESTIONS 🔥

Short:
● Define data storage design

● What is DBMS?

● Define metadata

● What is indexing?

Medium:

● Types of files

● Compare relational & object DB

● Explain clustering

Long:

● Explain data storage formats

● Logical vs physical ERD

● Techniques for optimizing database performance

⏱️ LAST 20-MINUTE REVISION CHECKLIST ✅

✔ File types
✔ Database types
✔ Relational DB concepts
✔ Logical vs Physical ERD
✔ Normalization vs Denormalization
✔ Indexing & clustering

⚠️ IMPORTANT ADVICE FOR YOUR EXAM

● Always define first, then explain
● Use headings & bullet points

● Draw small ERD/diagram if allowed

If you want:

● 📄 One-page cheat sheet

● ✍️ Perfect long answers
● 🎯 Predicted questions
Just say what you want next.

perfect long answers

Perfect — below are FULL-MARK, UNIVERSITY-STYLE LONG ANSWERS from Lecture 08:
Data Store Design (Chapter 11).
You can directly write these in the exam with minor wording changes.

👉
I’ve written them exactly how examiners expect:
Definition → Explanation → Examples → Conclusion

✅ LONG QUESTION 1
Explain Data Storage Design and its Objectives
Answer:

Data storage design is the process of deciding how data will be stored, organized, and
managed by the programs that run an information system. It ensures that data is stored
efficiently, securely, and in a way that supports system performance and future growth.

The main objectives of data storage design are:

1. Selecting an appropriate data storage format, such as files or databases, based on
system requirements.

2. Converting the logical data model into a physical data model that reflects
implementation decisions.

3. Ensuring balance between DFDs and ERDs, so that all data used by processes are
properly stored.

4. Optimizing the data storage format to improve processing efficiency, storage
utilization, and access speed.

In summary, data storage design bridges the gap between analysis and implementation by
transforming abstract data models into a concrete structure ready for system development.

✅ LONG QUESTION 2
Discuss Different Data Storage Formats
Answer:

There are two major types of data storage formats used in information systems: file-based
storage and database storage.

1. File-Based Data Storage

A file is an electronic list of data that is formatted for a specific transaction. Files are typically
organized sequentially, and records are linked using pointers. Because of this linking
mechanism, files are sometimes referred to as linked lists.

Types of files include:

● Master files: Store core business data such as customer or product information.

● Look-up files: Contain static data such as country codes or department names.

● Transaction files: Store transactions used to update master files.

● Audit files: Record before and after images of data changes.

● History (archive) files: Store old or past transaction data.

Although files can be fast for specific tasks, they suffer from data redundancy and are not
recommended for new systems.

2. Database Storage

A database is a collection of related data stored together and managed by a Database

Management System (DBMS). Databases reduce redundancy, allow data sharing, and support
data integrity.

Thus, databases are preferred over file-based systems for modern applications.

✅ LONG QUESTION 3
Explain Relational Databases and Their Key Features
Answer:

A relational database is the most widely used database model in application development
today. It organizes data into tables (relations), where each table consists of rows and columns.

Each table has a primary key, which uniquely identifies each record. Relationships between
tables are created by placing the primary key of one table into another table as a foreign key.

An important feature of relational databases is referential integrity, which ensures that

relationships between tables remain valid and synchronized. For example, a foreign key value
must match an existing primary key value.

Relational databases use Structured Query Language (SQL) as the standard language for
data retrieval and manipulation.

Due to their flexibility, reliability, and strong data integrity support, relational databases are ideal
for transaction processing and decision-making systems.
✅ LONG QUESTION 4
Differentiate Between Conceptual, Logical, and Physical
Data Models
Answer:

Conceptual, logical, and physical models represent data at different levels of abstraction.

Conceptual Data Model

The conceptual model represents data from a business perspective. It identifies entities and
relationships based on business requirements without considering database constraints or
implementation issues.

Logical Data Model

The logical model is more detailed than the conceptual model. It includes attributes for each
entity and may specify data types, but it remains independent of any specific DBMS.

Physical Data Model

The physical model is the actual blueprint of the database. It defines how data will be stored
in a specific DBMS and includes:

● Table structures

● Data types

● Primary and foreign keys

● Constraints

In conclusion, conceptual and logical models are used during analysis, while the physical model
is used for database construction.

✅ LONG QUESTION 5
Explain Techniques for Optimizing Data Storage
Answer:

Optimizing data storage is essential to ensure system efficiency and performance. There are
two primary dimensions of optimization: storage efficiency and access speed.

1. Optimizing Storage Efficiency

The goal is to minimize storage space and eliminate redundancy. The most effective technique
is normalization, which organizes data into multiple related tables to remove duplicate data and
reduce null values.

2. Optimizing Access Speed

After normalization, data may be spread across many tables, which can slow down data
retrieval. Several techniques are used to improve access speed:

● Denormalization: Intentionally introduces redundancy to reduce the need for joins.

● Clustering: Stores related records physically close together.

○ Intrafile clustering: Records in the same table.

○ Interfile clustering: Records across multiple tables.

● Indexing: Uses mini-tables to speed up data search operations.

● Volumetrics: Estimates storage size for hardware planning.

Thus, optimization balances storage efficiency and system performance.

✅ LONG QUESTION 6
Explain Indexing and Its Importance
Answer:
Indexing is a technique used to improve the speed of data retrieval in a database. An index is a
mini-table that contains values from one or more columns and the physical location of the
corresponding records.

Indexes function similarly to the index of a book, allowing the DBMS to locate data quickly
without scanning the entire table.

Although indexing significantly improves access speed, it requires additional storage space and
increases overhead during insert and update operations. Therefore, indexes should be created
carefully based on usage patterns.

In conclusion, indexing is a critical performance optimization technique in large databases.

Understanding Databases and DBMS
100% (1)
Understanding Databases and DBMS
99 pages
DB Chapter1 Cheatsheet
No ratings yet
DB Chapter1 Cheatsheet
6 pages
DBMS UNIT 1 Answers
No ratings yet
DBMS UNIT 1 Answers
23 pages
CS3492 Database Management System Overview
No ratings yet
CS3492 Database Management System Overview
43 pages
Ensuring Data Integrity in DBMS
No ratings yet
Ensuring Data Integrity in DBMS
18 pages
Database Full
No ratings yet
Database Full
26 pages
Database Systems Overview and Components
No ratings yet
Database Systems Overview and Components
11 pages
69bbd57b17e55CSE TH-2
No ratings yet
69bbd57b17e55CSE TH-2
86 pages
DBMS Notes For Sem
No ratings yet
DBMS Notes For Sem
14 pages
2 Min Analysis of Whole
No ratings yet
2 Min Analysis of Whole
10 pages
Database vs File Processing Systems Explained
No ratings yet
Database vs File Processing Systems Explained
8 pages
Rdbms em 30 March 12023
No ratings yet
Rdbms em 30 March 12023
45 pages
Dbms With Oracle Material
No ratings yet
Dbms With Oracle Material
73 pages
Limitations of Traditional File Systems
No ratings yet
Limitations of Traditional File Systems
3 pages
Relational Database Management Systems Guide
No ratings yet
Relational Database Management Systems Guide
7 pages
Database Management System Overview
No ratings yet
Database Management System Overview
24 pages
Untitled Document 2
No ratings yet
Untitled Document 2
28 pages
Essential Database Management Guide
No ratings yet
Essential Database Management Guide
7 pages
Understanding Database Systems
No ratings yet
Understanding Database Systems
31 pages
Key DBMS Questions and Answers
No ratings yet
Key DBMS Questions and Answers
34 pages
DBMS Fundamentals and Query Processing
No ratings yet
DBMS Fundamentals and Query Processing
31 pages
Understanding Databases and DBMS
No ratings yet
Understanding Databases and DBMS
19 pages
DBMS Interview Questions and Answers
No ratings yet
DBMS Interview Questions and Answers
24 pages
Understanding Data, Information, and DBMS
No ratings yet
Understanding Data, Information, and DBMS
37 pages
Understanding Databases and DBMS Concepts
No ratings yet
Understanding Databases and DBMS Concepts
21 pages
Understanding Data Independence and DBMS
No ratings yet
Understanding Data Independence and DBMS
69 pages
DBMS - Unit 1 (Introduction To Database Management Systems and ER Model)
No ratings yet
DBMS - Unit 1 (Introduction To Database Management Systems and ER Model)
73 pages
Comprehensive DBMS Question Bank Guide
No ratings yet
Comprehensive DBMS Question Bank Guide
17 pages
Introduction to Database Management Systems
No ratings yet
Introduction to Database Management Systems
34 pages
Imp Ques Wit Sol
No ratings yet
Imp Ques Wit Sol
20 pages
Database Design and Concepts Overview
No ratings yet
Database Design and Concepts Overview
6 pages
CS3492 Database Management Syllabus
No ratings yet
CS3492 Database Management Syllabus
43 pages
Data Hierarchy in Computer Systems
No ratings yet
Data Hierarchy in Computer Systems
24 pages
DBMS Concepts: Data Independence & Architecture
No ratings yet
DBMS Concepts: Data Independence & Architecture
102 pages
Understanding DBMT and Its Functions
No ratings yet
Understanding DBMT and Its Functions
9 pages
Database Management Systems Overview
100% (1)
Database Management Systems Overview
56 pages
Lecture 1.1.1 Intro To Database
No ratings yet
Lecture 1.1.1 Intro To Database
32 pages
Database Schemas and Transactions Explained
No ratings yet
Database Schemas and Transactions Explained
5 pages
Database Systems Overview and Concepts
No ratings yet
Database Systems Overview and Concepts
29 pages
Understanding Database Management Systems
No ratings yet
Understanding Database Management Systems
10 pages
Senior 5 Database Exam 2014
No ratings yet
Senior 5 Database Exam 2014
7 pages
Winter 2023 DBMS Model Answer Paper
80% (5)
Winter 2023 DBMS Model Answer Paper
20 pages
Essential DBMS Interview Questions
No ratings yet
Essential DBMS Interview Questions
46 pages
Unit 1st Dbms
No ratings yet
Unit 1st Dbms
12 pages
Revision DB
No ratings yet
Revision DB
20 pages
Key DBMS Concepts and Design Steps
No ratings yet
Key DBMS Concepts and Design Steps
31 pages
Database Management Systems Overview
No ratings yet
Database Management Systems Overview
18 pages
Top DBMS Interview Questions 2025
No ratings yet
Top DBMS Interview Questions 2025
2 pages
Database Management System QB & Answers
No ratings yet
Database Management System QB & Answers
124 pages
Database Systems Overview
No ratings yet
Database Systems Overview
4 pages
DBMS Fundamentals and Concepts Guide
No ratings yet
DBMS Fundamentals and Concepts Guide
16 pages
CS3492 Database Management Systems Guide
No ratings yet
CS3492 Database Management Systems Guide
44 pages
CS3492 DBMS Question Bank
No ratings yet
CS3492 DBMS Question Bank
45 pages
Understanding RDBMS Concepts and Models
No ratings yet
Understanding RDBMS Concepts and Models
16 pages
520L0848
No ratings yet
520L0848
64 pages
Science 8 Second Periodic Test Specs
No ratings yet
Science 8 Second Periodic Test Specs
2 pages
Nonlinear Control in Electric Machines
No ratings yet
Nonlinear Control in Electric Machines
11 pages
Vedic Astrology: Panchanga Chart Method
No ratings yet
Vedic Astrology: Panchanga Chart Method
6 pages
Carbon Black Manufacturing Process Overview
0% (1)
Carbon Black Manufacturing Process Overview
37 pages
Cross Section of Leaf Diagram for Class 10
100% (1)
Cross Section of Leaf Diagram for Class 10
7 pages
Liquid Cooling Solution - ODCC2021
No ratings yet
Liquid Cooling Solution - ODCC2021
9 pages
Ionization and pH in Pharmaceuticals
No ratings yet
Ionization and pH in Pharmaceuticals
8 pages
Army Body Fat Standards Chart
No ratings yet
Army Body Fat Standards Chart
15 pages
Haloalkanes and Haloarenes MCQs Guide
No ratings yet
Haloalkanes and Haloarenes MCQs Guide
5 pages
Bird Strike Simulation Methods Review
No ratings yet
Bird Strike Simulation Methods Review
20 pages
Railway Track Crack Detection System
No ratings yet
Railway Track Crack Detection System
27 pages
Fehr-Schmidt Model of Fairness
No ratings yet
Fehr-Schmidt Model of Fairness
4 pages
Python Programming Exam Paper 2025
No ratings yet
Python Programming Exam Paper 2025
9 pages
Diode and Circuit Analysis Lab Report
No ratings yet
Diode and Circuit Analysis Lab Report
4 pages
An Analytical Solution To Transient Heat Conduction in A Composite Region With A Cylindrical Heat Source
No ratings yet
An Analytical Solution To Transient Heat Conduction in A Composite Region With A Cylindrical Heat Source
7 pages
Universal Seat Heater Installation Guide
No ratings yet
Universal Seat Heater Installation Guide
8 pages
Environmental Systems and Modeling Overview
No ratings yet
Environmental Systems and Modeling Overview
38 pages
Understanding Thermodynamics to String Theory
No ratings yet
Understanding Thermodynamics to String Theory
1 page
MOS Capacitor C-V Profiling Guide
No ratings yet
MOS Capacitor C-V Profiling Guide
7 pages
HXT500 5g Servo Specifications
No ratings yet
HXT500 5g Servo Specifications
3 pages
Compressed Air Treatment Systems - CS Industrial Services - New York
No ratings yet
Compressed Air Treatment Systems - CS Industrial Services - New York
9 pages
GSEB Std 12 Maths Question Bank
No ratings yet
GSEB Std 12 Maths Question Bank
68 pages
Uncertainty Effects on Coupled Structures
No ratings yet
Uncertainty Effects on Coupled Structures
4 pages
Speed Control Methods for DC Shunt Motor
No ratings yet
Speed Control Methods for DC Shunt Motor
2 pages
Introduction to Microwave Engineering
No ratings yet
Introduction to Microwave Engineering
33 pages
Non-Linear Systems Exam Paper 2024
No ratings yet
Non-Linear Systems Exam Paper 2024
2 pages
Cost of Capital Analysis in Finance
No ratings yet
Cost of Capital Analysis in Finance
22 pages
Gleason's Theorem in Quantum Mechanics
No ratings yet
Gleason's Theorem in Quantum Mechanics
55 pages
JEE Physics Questions & Solutions PDF
No ratings yet
JEE Physics Questions & Solutions PDF
3 pages

Data Store Design (Chapter 11)

Uploaded by

Data Store Design (Chapter 11)

Uploaded by

🔥 FINAL EXAM MASTER GUIDE — DATA

STORE DESIGN (Chapter 11)

1️⃣ WHAT IS DATA STORAGE DESIGN? ⭐⭐⭐

●​ Select data storage format​

●​ Convert logical model → physical model​

●​ Ensure DFDs and ERDs balance​

●​ Optimize storage efficiency & access speed​

2️⃣ DATA STORAGE FORMATS ⭐⭐⭐⭐

3️⃣ FILE-BASED DATA STORAGE ⭐⭐⭐

A file is an electronic list of data formatted for a specific transaction.

●​ Also called linked lists​

●​ Fast for single-purpose tasks​

📁 TYPES OF FILES (VERY IMPORTANT)

Master File Core data (customers,

Look-up File Static values (country codes)

Transaction File Updates master file

Audit File Before & after images

History (Archive) File Past transactions

4️⃣ DATABASES ⭐⭐⭐⭐

A database is a collection of related data stored together and managed by a

●​ Ensures security & integrity​

5️⃣ TYPES OF DATABASES ⭐⭐⭐⭐⭐ (HIGH

1️⃣ RELATIONAL DATABASE ⭐⭐⭐⭐⭐

💡 Exam Gold Line:

2️⃣ OBJECT DATABASE ⭐⭐⭐

●​ Data + behavior together​

●​ Complex data support​

3️⃣ MULTIDIMENSIONAL DATABASE ⭐⭐⭐⭐

4️⃣ NoSQL DATABASE ⭐⭐⭐⭐

6️⃣ COMPARING STORAGE FORMATS ⭐⭐⭐⭐⭐

Compare files and databases

7️⃣ MOVING FROM LOGICAL TO PHYSICAL ⭐⭐⭐⭐

1️⃣ CONCEPTUAL MODEL

2️⃣ LOGICAL MODEL

3️⃣ PHYSICAL MODEL ⭐⭐⭐⭐

8️⃣ METADATA ⭐⭐⭐

Metadata is data about data.

💡 Short Question Favorite:

9️⃣ OPTIMIZING DATA STORAGE ⭐⭐⭐⭐⭐

1️⃣ Storage Efficiency​

🔹 STORAGE EFFICIENCY ⭐⭐⭐⭐

●​ Reduces null values​

🔹 ACCESS SPEED OPTIMIZATION ⭐⭐⭐⭐⭐

●​ Adds redundancy intentionally​

●​ Improves read speed​

4 Reasons for Denormalization:

●​ Store related records together​

●​ Interfile clustering → multiple tables​

3️⃣ Indexing ⭐⭐⭐⭐

●​ Uses extra storage​

4️⃣ Volumetrics (Estimating Storage Size) ⭐⭐⭐

1.​ Calculate raw data​

2.​ Add DBMS overhead​

3.​ Estimate growth​

🔟 MOST LIKELY FINAL EXAM QUESTIONS 🔥

●​ Compare relational & object DB​

●​ Explain data storage formats​

●​ Logical vs physical ERD​

●​ Techniques for optimizing database performance​

⏱️ LAST 20-MINUTE REVISION CHECKLIST ✅

⚠️ IMPORTANT ADVICE FOR YOUR EXAM

●​ Draw small ERD/diagram if allowed​

●​ 📄 One-page cheat sheet​

perfect long answers

The main objectives of data storage design are:

1. File-Based Data Storage

Types of files include:

●​ Transaction files: Store transactions used to update master files.​

●​ History (archive) files: Store old or past transaction data.​

A database is a collection of related data stored together and managed by a Database

An important feature of relational databases is referential integrity, which ensures that

Conceptual Data Model

Logical Data Model

Physical Data Model

●​ Primary and foreign keys​

1. Optimizing Storage Efficiency

● Select data storage format

● Convert logical model → physical model

● Ensure DFDs and ERDs balance

● Optimize storage efficiency & access speed

● Also called linked lists

● Fast for single-purpose tasks

● Ensures security & integrity

● Data + behavior together

● Complex data support

1️⃣ Storage Efficiency

● Reduces null values

● Adds redundancy intentionally

● Improves read speed

● Store related records together

● Interfile clustering → multiple tables

● Uses extra storage

1. Calculate raw data

2. Add DBMS overhead

3. Estimate growth

● Compare relational & object DB

● Explain data storage formats

● Logical vs physical ERD

● Techniques for optimizing database performance

● Draw small ERD/diagram if allowed

● 📄 One-page cheat sheet

● Transaction files: Store transactions used to update master files.

● History (archive) files: Store old or past transaction data.

● Primary and foreign keys

● Denormalization: Intentionally introduces redundancy to reduce the need for joins.

● Clustering: Stores related records physically close together.

○ Intrafile clustering: Records in the same table.

○ Interfile clustering: Records across multiple tables.

● Indexing: Uses mini-tables to speed up data search operations.

● Volumetrics: Estimates storage size for hardware planning.