KTU Data Compression Techniques Notes

Data compression reduces the number of bits needed to represent information, improving storage efficiency and transmission speed. It is categorized into lossless and lossy techniques, with lossless allowing exact data reconstruction and lossy removing less important information for higher compression. Key compression methods include Huffman coding, JPEG for images, MPEG for videos, and MP3 for audio, with applications in multimedia streaming, image storage, and mobile communication.

Uploaded by

aniabc2004

KTU S8 CSE – Data Compression Techniques

(CST446)

Quick Revision Notes – All Modules

Data compression is the process of reducing the number of bits required to represent information. It helps
reduce storage space and transmission bandwidth. Compression techniques are classified into lossless
and lossy compression.

Module 1 – Fundamentals of Data Compression
• Data compression reduces redundancy in data representation.

• Compression improves storage efficiency and transmission speed.

• Two major categories: Lossless compression and Lossy compression.

• Lossless compression allows exact reconstruction of the original data.

• Lossy compression removes less important information for higher compression.

• Entropy is the theoretical lower bound on the average number of bits per symbol achievable by lossless compression.

• Information theory forms the basis of many compression algorithms.

Entropy Formula:
H(X) = − Σ P(x) log2 P(x)

Where P(x) is the probability of occurrence of symbol x, and the sum runs over all symbols of the source.
Entropy measures the average information content per symbol, in bits.
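The entropy formula can be tried out on a short message. The following is a minimal sketch, assuming that symbol probabilities are estimated from the symbol counts of the message itself; the function name `shannon_entropy` is chosen here for illustration and does not come from the notes.

```python
import math
from collections import Counter

def shannon_entropy(message):
    """Return the empirical entropy of `message` in bits per symbol."""
    counts = Counter(message)
    total = len(message)
    # H(X) = sum over x of P(x) * log2(1 / P(x)), which equals -sum P(x) log2 P(x).
    # P(x) is estimated as count / total (an assumption of this sketch).
    return sum((c / total) * math.log2(total / c) for c in counts.values())

print(shannon_entropy("aaaa"))  # 0.0 -> one symbol only, fully predictable
print(shannon_entropy("aabb"))  # 1.0 -> two equally likely symbols
print(shannon_entropy("abcd"))  # 2.0 -> four equally likely symbols
```

A message with four equally likely symbols needs 2 bits per symbol on average, while a message made of a single repeated symbol carries no information per symbol.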

Module 2 – Lossless Compression Techniques
• Huffman Coding: Variable-length prefix coding based on symbol frequencies; frequent symbols get shorter codes.

• Arithmetic Coding: Encodes the entire message as a single fractional number in the interval [0, 1).

• LZ77 Algorithm: Uses a sliding window over recently seen data and replaces repeated patterns with references to earlier occurrences.

• LZ78 Algorithm: Builds an explicit dictionary of phrases dynamically during compression.

• LZW Algorithm: A refinement of LZ78 that starts from a dictionary of all single symbols; used in GIF images.

• Run Length Encoding (RLE): Replaces each run of a repeated symbol with a count and the symbol.
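Of the techniques listed above, Run Length Encoding is the simplest to show in code. The following is a minimal sketch, assuming the input is a string and the output is a list of (count, symbol) pairs; the function names are illustrative, and real formats pack the pairs into bytes rather than Python tuples.

```python
from itertools import groupby

def rle_encode(text):
    """Replace each run of identical symbols with a (count, symbol) pair."""
    return [(len(list(run)), symbol) for symbol, run in groupby(text)]

def rle_decode(pairs):
    """Expand (count, symbol) pairs back into the original string."""
    return "".join(symbol * count for count, symbol in pairs)

encoded = rle_encode("AAAABBBCCD")
print(encoded)              # [(4, 'A'), (3, 'B'), (2, 'C'), (1, 'D')]
print(rle_decode(encoded))  # AAAABBBCCD
```

RLE only helps when the data contains long runs; on text without repeated neighbours, each symbol becomes a pair and the output grows.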

Advantages of Lossless Compression:

• No loss of information.

• Used for text, executable files, and medical images.

• Allows exact reconstruction of original data.

Module 3 – Image Compression
• Images contain spatial redundancy which can be compressed.

• JPEG is the most widely used standard for lossy compression of photographic images.

• JPEG uses Discrete Cosine Transform (DCT).

• Compression steps: Color conversion → DCT → Quantization → Entropy coding.

• JPEG-LS provides lossless or near-lossless compression.

• PNG uses lossless compression.

JPEG Compression Steps:

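• Divide the image into 8×8 blocks.

• Apply the DCT to each block to transform it from the spatial domain into the frequency domain.

• Quantize the DCT coefficients to reduce their precision; this is the step where information is discarded.

• Apply entropy coding, such as Huffman coding, to the quantized coefficients.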
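The DCT and quantization steps above can be sketched for a single 8×8 block. This is an illustration under stated assumptions, not the full JPEG pipeline: it uses one uniform quantization step (`step=16`, a hypothetical value) instead of a JPEG quantization table, a made-up brightness ramp as pixel data, and it omits colour conversion and entropy coding.

```python
import math

N = 8  # JPEG block size

def dct_2d(block):
    """Forward 2-D DCT-II of an N x N block with orthonormal scaling."""
    def alpha(k):
        return math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N)
    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            total = 0.0
            for x in range(N):
                for y in range(N):
                    total += (block[x][y]
                              * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                              * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[u][v] = alpha(u) * alpha(v) * total
    return out

def quantize(coeffs, step):
    """Uniform quantization; `step` is a single hypothetical step size."""
    return [[round(c / step) for c in row] for row in coeffs]

# Hypothetical block: brightness rises smoothly from left to right.
pixels = [[100 + 4 * col for col in range(N)] for _ in range(N)]
shifted = [[p - 128 for p in row] for row in pixels]  # centre values around zero
quantized = quantize(dct_2d(shifted), step=16)

nonzero = sum(1 for row in quantized for q in row if q != 0)
print(nonzero, "of", N * N, "coefficients are nonzero after quantization")
```

For smooth blocks like this one, most quantized coefficients are zero, and it is these long runs of zeros that the entropy coder then represents compactly.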

Module 4 – Video Compression
• Video compression removes spatial and temporal redundancy.

• MPEG standards are widely used for video compression.

• Types include MPEG-1, MPEG-2, MPEG-4 and H.264.

• Motion estimation finds similar blocks between frames.

• Motion compensation predicts the current frame from previously coded reference frames using the estimated motion.

• Frame types: I-frame (intra-coded), P-frame (predicted), B-frame (bidirectionally predicted).

Motion Compensation Concept:

Instead of transmitting full frames, the encoder sends motion vectors and residual errors between frames
to reduce redundancy.
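The idea of sending a motion vector plus a residual can be sketched with exhaustive block matching. The example below is a simplification under stated assumptions: frames are small lists of grey values, the matching cost is the sum of absolute differences (SAD), and the frame contents, block size, and search range are invented for illustration. Real encoders use faster search strategies and sub-pixel accuracy.

```python
def sad(frame_a, ax, ay, frame_b, bx, by, size):
    """Sum of absolute differences between two size x size blocks."""
    return sum(abs(frame_a[ay + j][ax + i] - frame_b[by + j][bx + i])
               for j in range(size) for i in range(size))

def find_motion_vector(prev, curr, x, y, size, search):
    """Find (dx, dy) such that the block at (x + dx, y + dy) in `prev`
    best matches the block at (x, y) in `curr`. Returns (dx, dy, cost)."""
    height, width = len(prev), len(prev[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            px, py = x + dx, y + dy
            if 0 <= px <= width - size and 0 <= py <= height - size:
                cost = sad(curr, x, y, prev, px, py, size)
                if best is None or cost < best[0]:
                    best = (cost, dx, dy)
    return best[1], best[2], best[0]

def make_frame(square_x, square_y, size=8):
    """Hypothetical test frame: black background with a bright 2x2 square."""
    frame = [[0] * size for _ in range(size)]
    for j in range(2):
        for i in range(2):
            frame[square_y + j][square_x + i] = 255
    return frame

prev_frame = make_frame(2, 2)   # square at x=2, y=2
curr_frame = make_frame(3, 2)   # square has moved one pixel to the right

dx, dy, cost = find_motion_vector(prev_frame, curr_frame, x=3, y=2, size=2, search=2)
print(dx, dy, cost)  # -1 0 0
```

Here the block is found one pixel to the left in the previous frame with zero cost, so the encoder would only need to send the vector (-1, 0) and an all-zero residual instead of the block's pixel values.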

Module 5 – Audio Compression
• Audio compression reduces data required for storing sound.

• Human hearing characteristics are used to remove inaudible sounds.

• MP3 is one of the most popular audio compression standards.

• Audio compression uses psychoacoustic models.

• Lossless audio compression formats include FLAC and ALAC.

Applications of Data Compression:

• Multimedia streaming (YouTube, Netflix).

• Image storage and transmission.

• Video conferencing.

• Mobile communication systems.

• Cloud storage optimization.

Common questions

Evaluate how the use of MPEG standards in video compression caters to the needs of modern digital broadcasting and online streaming services.

MPEG standards are integral to digital broadcasting and online streaming because they compress video efficiently while preserving quality across a wide range of resolutions and bitrates. MPEG-1, MPEG-2, MPEG-4, and H.264 (also standardized as MPEG-4 Part 10, AVC) remove spatial and temporal redundancy using techniques such as transform coding and motion compensation, producing streams small enough to transmit over bandwidth-constrained networks. This versatility lets the same family of standards serve demands ranging from high-definition television broadcasting to streaming on mobile devices for platforms like YouTube and Netflix.

How does motion compensation in video compression improve efficiency compared to simple frame-by-frame encoding?

Motion compensation reduces the amount of data that must be encoded and transmitted. Encoding every frame independently wastes bits, because consecutive frames usually differ only slightly. With motion compensation, the encoder predicts the current frame from previously coded frames by estimating how blocks have moved, and encodes only the motion vectors and the residual (the difference between the actual frame and its prediction). This removes most of the temporal redundancy, giving smaller files and lower bitrates than independent frame-by-frame encoding at comparable perceptual quality.

Explain how Huffman Coding optimizes data compression and why it is classified under lossless compression techniques.

Huffman coding assigns variable-length, prefix-free codes to input symbols: frequently occurring symbols receive shorter codes and rare symbols receive longer ones. The resulting average code length is within one bit per symbol of the source entropy, which is the theoretical limit. Because no codeword is a prefix of another, the decoder can recover the original data exactly from the compressed bit stream, so Huffman coding is classified as a lossless technique.
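The code assignment can be sketched with a priority queue. This is a minimal illustration, assuming symbol frequencies are counted from the message itself and codes are kept as strings of "0" and "1" for readability; the function names are illustrative, and a practical coder would pack bits and also transmit the code table.

```python
import heapq
from collections import Counter

def huffman_codes(text):
    """Build a prefix-free code table {symbol: bitstring} for `text`."""
    freq = Counter(text)
    # Heap entries: (weight, unique tie-breaker, {symbol: code so far}).
    heap = [(w, i, {sym: ""}) for i, (sym, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    if len(heap) == 1:                     # single distinct symbol
        return {sym: "0" for sym in heap[0][2]}
    next_id = len(heap)
    while len(heap) > 1:
        w1, _, left = heapq.heappop(heap)   # two least frequent subtrees
        w2, _, right = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in left.items()}
        merged.update({s: "1" + c for s, c in right.items()})
        heapq.heappush(heap, (w1 + w2, next_id, merged))
        next_id += 1
    return heap[0][2]

def encode(text, codes):
    return "".join(codes[ch] for ch in text)

def decode(bits, codes):
    """Decode by matching prefixes; works because the code is prefix-free."""
    inverse = {code: sym for sym, code in codes.items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        if current in inverse:
            out.append(inverse[current])
            current = ""
    return "".join(out)

message = "abracadabra"
codes = huffman_codes(message)
bits = encode(message, codes)
print(len(bits), "bits instead of", 8 * len(message))  # 23 bits instead of 88
print(decode(bits, codes) == message)                  # True
```

The most frequent symbol, "a", receives a one-bit code, and decoding returns the exact original string, which is what makes the method lossless.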

Explain the significance of psychoacoustic models in audio compression and how they enhance file size reduction.

Psychoacoustic models exploit the fact that human hearing is not uniformly sensitive across frequencies and amplitude levels. They identify sounds that are masked by louder tones, and components outside the audible frequency range, so the encoder can discard them or represent them with fewer bits. Removing data that contributes little to perceived sound quality greatly reduces file size with little audible effect, which is the technique used in popular lossy audio formats such as MP3.

How does the concept of entropy relate to the practical limits of data compression, according to the principles of information theory?

Entropy defines the theoretical limit of how far data can be compressed without losing information. In information theory, entropy is the average amount of information produced by a stochastic data source, and it is a lower bound on the average number of bits per symbol that any lossless code can achieve. The more random or unpredictable the data, the higher its entropy and the less it can be compressed; more predictable data has lower entropy and compresses further.
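The link between predictability and compressibility is easy to observe with a general-purpose lossless compressor. The sketch below uses Python's standard `zlib` module (DEFLATE, which combines LZ77 and Huffman coding); the two inputs and their size of 10,000 bytes are arbitrary choices for illustration.

```python
import os
import zlib

def compressed_size(data):
    """Size in bytes of `data` after lossless DEFLATE compression."""
    return len(zlib.compress(data))

predictable = b"A" * 10_000          # low entropy: one symbol repeated
unpredictable = os.urandom(10_000)   # high entropy: random bytes

print("predictable:  ", compressed_size(predictable), "bytes")
print("unpredictable:", compressed_size(unpredictable), "bytes")
```

The repeated input shrinks to a few dozen bytes, while the random input stays at roughly its original size, because it contains almost no redundancy for the compressor to remove.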

How does the application of data compression techniques benefit cloud storage optimization?

Compression benefits cloud storage by reducing the amount of data that must be stored and transferred across networks, which lowers cost. Compressing data before upload decreases storage requirements and the bandwidth needed for migration and access, and it can shorten transfer times, at the price of some CPU time for compression and decompression. This makes compression an important part of scalable and economical cloud storage strategies.

Discuss the role of the Discrete Cosine Transform (DCT) in JPEG image compression and how it contributes to reducing storage space.

The Discrete Cosine Transform (DCT) converts each 8×8 block of pixels from the spatial domain to the frequency domain, concentrating most of the visually significant information in a few low-frequency coefficients. The DCT itself discards nothing; it makes the subsequent quantization step effective, because the precision of the less important high-frequency coefficients can be reduced sharply, often to zero, without greatly affecting perceived image quality. The many zero and small coefficients then entropy-code compactly, which is where the reduction in storage space comes from.

How does the LZ77 algorithm differ from the LZW algorithm in terms of dictionary usage during the compression process?

LZ77 uses a sliding window: it replaces repeated patterns with references to earlier occurrences within a fixed-size window, so its "dictionary" is implicit and consists of the most recently seen data. LZW instead builds an explicit dictionary during compression, starting with all individual symbols and adding a new entry each time it meets a symbol sequence not yet in the dictionary. Both reduce redundancy, but LZ77 relies on a temporary memory that slides forward with the input, whereas LZW maintains a growing dictionary that the decoder rebuilds in step with the encoder.
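The difference shows up clearly when both schemes encode the same input. The following is a simplified sketch under stated assumptions: the LZ77 variant emits (offset, length, next symbol) triples with a hypothetical 16-symbol window and a naive search, and the LZW variant starts from a 256-entry dictionary of single characters. Neither matches the bit-level format of any real file type.

```python
def lz77_compress(data, window=16):
    """Emit (offset, length, next_symbol) triples; offset 0 means no match."""
    i, out = 0, []
    while i < len(data):
        best_off, best_len = 0, 0
        for start in range(max(0, i - window), i):
            length = 0
            # Keep one symbol in reserve to serve as next_symbol.
            while (i + length < len(data) - 1
                   and data[start + length] == data[i + length]):
                length += 1
            if length > best_len:
                best_off, best_len = i - start, length
        out.append((best_off, best_len, data[i + best_len]))
        i += best_len + 1
    return out

def lz77_decompress(triples):
    out = []
    for offset, length, symbol in triples:
        for _ in range(length):
            out.append(out[-offset])   # copy from the already decoded window
        out.append(symbol)
    return "".join(out)

def lzw_compress(data):
    """Emit dictionary codes; the dictionary grows as new phrases appear."""
    dictionary = {chr(i): i for i in range(256)}
    current, out = "", []
    for ch in data:
        if current + ch in dictionary:
            current += ch
        else:
            out.append(dictionary[current])
            dictionary[current + ch] = len(dictionary)
            current = ch
    if current:
        out.append(dictionary[current])
    return out

def lzw_decompress(codes):
    if not codes:
        return ""
    dictionary = {i: chr(i) for i in range(256)}
    previous = dictionary[codes[0]]
    out = [previous]
    for code in codes[1:]:
        # A code not yet in the dictionary is the phrase being defined now.
        entry = dictionary.get(code, previous + previous[0])
        out.append(entry)
        dictionary[len(dictionary)] = previous + entry[0]
        previous = entry
    return "".join(out)

text = "ABABABA"
print(lz77_compress(text))  # [(0, 0, 'A'), (0, 0, 'B'), (2, 4, 'A')]
print(lzw_compress(text))   # [65, 66, 256, 258]
```

The LZ77 output points back into the text itself (offset 2, length 4), while the LZW output consists of codes such as 256 and 258 that name dictionary entries created during encoding.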

Analyze the advantages of using lossless compression for medical images and other data-sensitive applications.

Lossless compression suits medical images and other data-sensitive applications because it guarantees exact reconstruction of the original file. In medical imaging, any lost detail could affect interpretation and clinical decisions, so undistorted data matters for diagnosis. The same property helps wherever accuracy, legal compliance, or data authenticity is required. Lossless compression therefore provides storage savings while preserving the data precision these environments need.

In what ways do lossless compression techniques differ from lossy compression techniques, and what are the implications of these differences for data integrity and storage?

Lossless compression allows exact reconstruction of the original data, which makes it suitable where data integrity is critical, such as text files, executables, and medical images. It works by removing redundancy in the representation without discarding any detail. Lossy compression achieves higher compression ratios by discarding less important information, at some cost in fidelity. It suits audio, images, and video, where perfect fidelity is not required, but not contexts that need exact data. In short, lossless compression preserves integrity, while lossy compression trades some quality for smaller size.
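The contrast can be made concrete on a handful of sample values. The sketch below is an illustration only: it uses `zlib` as a stand-in for lossless coding and plain uniform quantization as a stand-in for the lossy step, with sample values and a step size of 8 invented for the example.

```python
import zlib

samples = [0, 3, 7, 12, 18, 25, 33, 42]   # hypothetical 8-bit sample values

# Lossless: the round trip returns exactly the original bytes.
raw = bytes(samples)
restored = zlib.decompress(zlib.compress(raw))
print(list(restored) == samples)  # True

# Lossy: uniform quantization keeps only a coarse index per sample.
def quantize(values, step):
    return [round(v / step) for v in values]

def dequantize(indices, step):
    return [i * step for i in indices]

step = 8
indices = quantize(samples, step)
approx = dequantize(indices, step)
print(indices)  # [0, 0, 1, 2, 2, 3, 4, 5]
print(approx)   # [0, 0, 8, 16, 16, 24, 32, 40]
print(max(abs(a - s) for a, s in zip(approx, samples)))  # 4
```

The quantized indices span a much smaller range than the samples and so need fewer bits, but the reconstruction differs from the original by up to half a step, and that lost detail cannot be recovered.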

