0% found this document useful (0 votes)

19 views25 pages

Understanding Data Compression Techniques

The document discusses data compression, highlighting its importance for optimizing storage space and resource usage. It covers various methods of compression, including lossless techniques like Run-length encoding and Huffman coding, as well as lossy methods such as JPEG and MPEG for images and videos. Additionally, it explains concepts like entropy, predictive encoding, and the process of quantization in compression.

Uploaded by

samriddhi623

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views25 pages

Understanding Data Compression Techniques

Uploaded by

samriddhi623

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

Data Compression

CS 147
Minh Nguyen
Why Data Compression?
Make optimal use of limited
storage space

Save time and help to optimize

resources
 If compression and decompression are done in I/O processor,
less time is required to move data to or from storage
subsystem, freeing I/O bus for other work

 In sending data over communication line: less time to transmit

and less storage to host
Data Compression-
Entropy
Entropy is the measure of information
content in a message.
 Messages with higher entropy carry more
information than messages with lower entropy.
How to determine the entropy
 Find the probability p(x) of symbol x in the
message
 The entropy H(x) of the symbol x is:
H(x) = - p(x) • log2p(x)
The average entropy over the entire
message is the sum of the entropy of
all n symbols in the message
Data Compression
Methods
Data compression is about storing and
sending a smaller number of bits.
There’re two major categories for
methods to compress data: lossless
and lossy methods
Lossless Compression
Methods
Inlossless methods, original data and
the data after compression and
decompression are exactly the same.

Redundant data is removed in

compression and added during
decompression.

Lossless methods are used when we

can’t afford to lose any data: legal and
medical documents, computer programs.
Run-length encoding
 Simplest method of compression.
 How: replace consecutive repeating occurrences of a
symbol by 1 occurrence of the symbol itself, then
followed by the number of occurrences.

 The method can be more efficient if the data uses

only 2 symbols (0s and 1s) in bit patterns and 1
symbol is more frequent than another.
Huffman Coding
 Assign fewer bits to symbols that occur more
frequently and more bits to symbols appear less
often.
 There’s no unique Huffman code and every
Huffman code has the same average code length.
 Algorithm:
① Make a leaf node for each code symbol
Add the generation probability of each symbol to the leaf
node
② Take the two leaf nodes with the smallest probability and
connect them into a new node
Add 1 or 0 to each of the two branches
The probability of the new node is the sum of the
probabilities of the two connecting nodes
③ If there is only one node left, the code construction is
completed. If not, go back to (2)
Huffman Coding
 Example
Huffman Coding
 Encoding

 Decoding
Lempel Ziv Encoding
It is dictionary-based encoding

Basic idea:
 Create a dictionary(a table) of strings
used during communication.

 If both sender and receiver have a copy

of the dictionary, then previously-
encountered strings can be substituted
by their index in the dictionary.
Lempel Ziv Compression
Have 2 phases:
 Building an indexed dictionary
 Compressing a string of symbols
• Algorithm:
 Extract the smallest substring that cannot
be found in the remaining uncompressed
string.
 Store that substring in the dictionary as a
new entry and assign it an index value
 Substring is replaced with the index found
in the dictionary
 Insert the index and the last character of
the substring into the compressed string
Lempel Ziv Compression
 Compression
example:
Audio Encoding
Predictive encoding
 Only the differences
Lempel Ziv
Decompression
 It’s just the inverse
of compression process
Lossy Compression
Methods
Used for compressing images and
video files (our eyes cannot
distinguish subtle changes, so lossy
data is acceptable).
These methods are cheaper, less time
and space.
Several methods:
 JPEG: compress pictures and graphics
 MPEG: compress video
 MP3: compress audio
JPEG Encoding
Used to compress pictures and
graphics.
In JPEG, a grayscale picture is divided
into 8x8 pixel blocks to decrease the
number of calculations.
Basic idea:
 Change the picture into a linear (vector) sets of
numbers that reveals the redundancies.
 The redundancies is then removed by one of
lossless compression methods.
JPEG Encoding- DCT
 DCT: Discrete Concise Transform
 DCT transforms the 64 values in 8x8 pixel
block in a way that the relative relationships
between pixels are kept but the
redundancies are revealed.
 Example:

A gradient grayscale
Quantization &
Compression
 Quantization:
 After T table is created, the values are quantized
to reduce the number of bits needed for
encoding.
 Quantization divides the number of bits by a
constant, then drops the fraction. This is done to
optimize the number of bits and the number of 0s
for each particular application.

• Compression:
 Quantized values are read from the table and
redundant 0s are removed.
 To cluster the 0s together, the table is read
diagonally in an zigzag fashion. The reason is if
the table doesn’t have fine changes, the bottom
right corner of the table is all 0s.
 JPEG usually uses lossless run-length encoding at
the compression phase.
JPEG Encoding
MPEG Encoding
Used to compress video.
Basic idea:
 Each video is a rapid sequence of a set of
frames. Each frame is a spatial
combination of pixels, or a picture.
 Compressing video =
spatially compressing each frame
+
temporally compressing a set of
frames.
MPEG Encoding
Spatial Compression
 Each frame is spatially compressed by JPEG.
• Temporal Compression
 Redundant frames are removed.
 For example, in a static scene in which someone
is talking, most frames are the same except for
the segment around the speaker’s lips, which
changes from one frame to the next.
Audio Compression
Used for speech or music
 Speech: compress a 64 kHz digitized
signal
 Music: compress a 1.411 MHz signal

• Two categories of techniques:

 Predictive encoding
 Perceptual encoding
Audio Encoding
Predictive Encoding
 Only the differences between samples are
encoded, not the whole sample values.
 Several standards: GSM (13 kbps), G.729 (8
kbps), and G.723.3 (6.4 or 5.3 kbps)

• Perceptual Encoding: MP3

 CD-quality audio needs at least 1.411
Mbps and cannot be sent over the Internet
without compression.
 MP3 (MPEG audio layer 3) uses perceptual
encoding technique to compress audio.
References
[Link]
/english/[Link]

CS157B-Lecture 19 by Professor
Lee

[Link]
html

“The essentials of computer

organization and architecture” by
Linda Null and Julia Nobur.
Data Compression

QUESTION?

Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
25 pages
Understanding Data Compression Techniques
No ratings yet
Understanding Data Compression Techniques
25 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
37 pages
PDF Data Compression Techniques
No ratings yet
PDF Data Compression Techniques
22 pages
Understanding Data Compression Techniques
No ratings yet
Understanding Data Compression Techniques
19 pages
Unit 3.1 Data Compression
No ratings yet
Unit 3.1 Data Compression
38 pages
Data Compression Techniques Overview
No ratings yet
Data Compression Techniques Overview
21 pages
Overview of Data Compression Techniques
No ratings yet
Overview of Data Compression Techniques
24 pages
Overview of Data Compression Techniques
No ratings yet
Overview of Data Compression Techniques
31 pages
Data Compression Techniques Overview
No ratings yet
Data Compression Techniques Overview
21 pages
Understanding Data Compression Techniques
No ratings yet
Understanding Data Compression Techniques
23 pages
LEC08
No ratings yet
LEC08
41 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
29 pages
Data Compression: Methods Explained
No ratings yet
Data Compression: Methods Explained
21 pages
Overview of Data Compression Techniques
No ratings yet
Overview of Data Compression Techniques
46 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
41 pages
Text and Image Compression Techniques
No ratings yet
Text and Image Compression Techniques
69 pages
Lossless PDF Compression Techniques
No ratings yet
Lossless PDF Compression Techniques
36 pages
CST446 Module Notes
No ratings yet
CST446 Module Notes
13 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
36 pages
Understanding Data Compression Techniques
No ratings yet
Understanding Data Compression Techniques
19 pages
Chapter 3 Data Compression
No ratings yet
Chapter 3 Data Compression
8 pages
Text and Image Compression Techniques
No ratings yet
Text and Image Compression Techniques
40 pages
Efficient Source Encoding Techniques
No ratings yet
Efficient Source Encoding Techniques
30 pages
Multimedia Data Compression Techniques
100% (2)
Multimedia Data Compression Techniques
23 pages
Multimedia Compression Techniques Overview
No ratings yet
Multimedia Compression Techniques Overview
55 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
34 pages
Multimedia Compression Techniques Overview
No ratings yet
Multimedia Compression Techniques Overview
23 pages
Multimedia Data Compression Techniques
No ratings yet
Multimedia Data Compression Techniques
42 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
64 pages
Data Compression Techniques Overview
No ratings yet
Data Compression Techniques Overview
32 pages
Multimedia Data Compression Overview
No ratings yet
Multimedia Data Compression Overview
9 pages
Importance of Data Compression in Multimedia
No ratings yet
Importance of Data Compression in Multimedia
60 pages
Data Compression Techniques Overview
No ratings yet
Data Compression Techniques Overview
57 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
38 pages
Understanding Compression Techniques
No ratings yet
Understanding Compression Techniques
15 pages
Data Compression in Presentation Layer
No ratings yet
Data Compression in Presentation Layer
8 pages
Understanding Data Compression Techniques
No ratings yet
Understanding Data Compression Techniques
12 pages
Image and Video Compression Techniques
No ratings yet
Image and Video Compression Techniques
76 pages
Compression Techniques Overview
No ratings yet
Compression Techniques Overview
33 pages
Compression Techniques Overview
No ratings yet
Compression Techniques Overview
26 pages
Data Compression Techniques Overview
No ratings yet
Data Compression Techniques Overview
33 pages
Data Compression Techniques Overview
100% (1)
Data Compression Techniques Overview
18 pages
Multimedia Data Compression Techniques
No ratings yet
Multimedia Data Compression Techniques
87 pages
Multimedia Compression Techniques Explained
No ratings yet
Multimedia Compression Techniques Explained
68 pages
Data Compression
No ratings yet
Data Compression
6 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
70 pages
Text and Image Compression Techniques
No ratings yet
Text and Image Compression Techniques
32 pages
Text and Image Compression Techniques
No ratings yet
Text and Image Compression Techniques
96 pages
Tally Data Compression Techniques
100% (1)
Tally Data Compression Techniques
35 pages
Text and Image Compression Techniques
No ratings yet
Text and Image Compression Techniques
16 pages
Psychovisual Redundancy in Image Compression
No ratings yet
Psychovisual Redundancy in Image Compression
50 pages
Lecture 17
No ratings yet
Lecture 17
51 pages
Video Compression Techniques Overview
No ratings yet
Video Compression Techniques Overview
71 pages
Unit-5 (CSC330-MC) (Elective)
No ratings yet
Unit-5 (CSC330-MC) (Elective)
59 pages
Data Compression Techniques Overview
No ratings yet
Data Compression Techniques Overview
22 pages
Data Compression Techniques Overview
No ratings yet
Data Compression Techniques Overview
12 pages
Data Compression Chapter 7
No ratings yet
Data Compression Chapter 7
40 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
51 pages
VSX 417 817 Manual
No ratings yet
VSX 417 817 Manual
106 pages
Sharp Lc32wd1e LCD
No ratings yet
Sharp Lc32wd1e LCD
202 pages
LG 32LF580V.50LF580V
No ratings yet
LG 32LF580V.50LF580V
260 pages
RTSP Protocols for Ezviz and Dahua Models
No ratings yet
RTSP Protocols for Ezviz and Dahua Models
2 pages
MP4 Video and Audio Info Extraction
No ratings yet
MP4 Video and Audio Info Extraction
76 pages
Accessibility for Deaf Audiences Post-DTV
No ratings yet
Accessibility for Deaf Audiences Post-DTV
35 pages
Mobile Phone Installment Plans 2023
No ratings yet
Mobile Phone Installment Plans 2023
17 pages
Samsung 2025 4K Smart TVs for Sale
No ratings yet
Samsung 2025 4K Smart TVs for Sale
1 page
Ex 1
No ratings yet
Ex 1
2 pages
Understanding Blu-ray Authoring
No ratings yet
Understanding Blu-ray Authoring
19 pages
Sound and Video in Multimedia
No ratings yet
Sound and Video in Multimedia
55 pages
Samsung LED TV Specifications Overview
No ratings yet
Samsung LED TV Specifications Overview
27 pages
LG 40UF671V Owner's Manual
No ratings yet
LG 40UF671V Owner's Manual
44 pages
BN68 02808J 00L05 - 0329 PDF
No ratings yet
BN68 02808J 00L05 - 0329 PDF
306 pages
VSX-D810S Operating Instructions
No ratings yet
VSX-D810S Operating Instructions
52 pages
JPEG Compression and Audio Basics
No ratings yet
JPEG Compression and Audio Basics
20 pages
EB510 Bro en
No ratings yet
EB510 Bro en
20 pages
Manual Samsung Ue40es6200
No ratings yet
Manual Samsung Ue40es6200
808 pages
TITAN FILEMS 2 8 0-SupportedFormats
No ratings yet
TITAN FILEMS 2 8 0-SupportedFormats
14 pages
Hyundai ImageQuest LCD TV Manual
100% (1)
Hyundai ImageQuest LCD TV Manual
85 pages
Communication Engineering Overview
No ratings yet
Communication Engineering Overview
16 pages
Evolution of TVRI and Its Oversight
No ratings yet
Evolution of TVRI and Its Oversight
8 pages
DVB-C Signal Meter HD-CM+ Review
No ratings yet
DVB-C Signal Meter HD-CM+ Review
5 pages
Joey Martinez Samson: Philippines 2013: 60 Years of Philippine Television
No ratings yet
Joey Martinez Samson: Philippines 2013: 60 Years of Philippine Television
167 pages
UR82 Universal Remote Control Guide
No ratings yet
UR82 Universal Remote Control Guide
35 pages
Image Compression Techniques in Python
No ratings yet
Image Compression Techniques in Python
17 pages
Media Aptitude
No ratings yet
Media Aptitude
36 pages
LG DVD Players Training Manual
No ratings yet
LG DVD Players Training Manual
76 pages
History of TV Broadcasting in the Philippines
100% (1)
History of TV Broadcasting in the Philippines
38 pages
FT003 5.1 DTS/DOLBY Mini Decoder Guide
No ratings yet
FT003 5.1 DTS/DOLBY Mini Decoder Guide
1 page

Understanding Data Compression Techniques

Uploaded by

Understanding Data Compression Techniques

Uploaded by

Data Compression

Save time and help to optimize

 In sending data over communication line: less time to transmit

Redundant data is removed in

Lossless methods are used when we

 The method can be more efficient if the data uses

 If both sender and receiver have a copy

• Two categories of techniques:

• Perceptual Encoding: MP3

“The essentials of computer

You might also like