0% found this document useful (0 votes)

5 views8 pages

Arithmetic Coding

Arithmetic coding is an entropy encoding method used in lossless data compression that compresses an input stream into a single floating-point number between 0 and 1. It was introduced by Peter Elias in 1963 and later improved by Jorma Rissanen and Richard Pasco in 1976 to address practical limitations. Unlike Huffman coding, arithmetic coding assigns fractional ranges to symbols based on their probabilities, allowing for more efficient encoding and decoding processes.

Uploaded by

SACHIN VERMA

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views8 pages

Arithmetic Coding

Uploaded by

SACHIN VERMA

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Arithmetic Coding

Background of the Algorithm

What is it? A form of entropy encoding used in lossless data compression where the input stream

is compressed into a single floating-point number between 0 and 1.

Who invented it? This fundamental concept has been introduced a while ago in an unpublished

work around 1963 by Peter Elias. His initial proposal, though optimal, was impractical as it

presumes infinite-precision, and consequently an infinite buffer size! His idea was later on been

improved on using practical schemes by Jorma Rissanen and Richard Pasco which is known

as finite-precision in 1976.

What does it solve? The predecessor of arithmetic coding is the Huffman Coding which is a

fixed-length coding method. This means Huffman always assigns at least 1 bit for a given

probability.

This becomes a problem for when the probability of a character is very high. For example, when

encoding “aaaaaaaaab” with probabilities 0.9 for a and 0.1 for b:

 Huffman coding: assigns 0 to a, 1 to b, resulting in 0000000001

 Arithmetic coding: assigns the interval 0.1–1.0 to a, 0.0–0.1 to b, resulting in .301272

or .1010000

By representing fractions to the input, arithmetic coding bypasses the idea of replacing an input

symbol with a specific code.

What is the running time? For both encoding and decoding, typically the runtime complexity is

linear, O(n), though implementation details may complicate this, such as the precision. Though

compared to Huffman coding, the running time is slower due to the computational overhead.

What is the space requirements? The algorithm, given a finite-precision implementation, requires

buffers for the high and low values relative to the selected buffer size (e.g 16-bit integer). This

doesn’t include the requirements for the chosen probability model (which varies depending on

implementation).
To demonstrate the fundamental aspects of arithmetic coding, the provided examples will be

using infinite-precision.
Infinite-Precision Demo: Encoding

How does it work? To construct the floating-point number output:

1. Take the input stream and the probabilities of each symbol as inputs to the encoder.

2. Assign a portion of the probability line [0, 1) for each symbol which corresponds to its

probability.

3. Encode each symbol restricted along its range. The sum of this computation will result to the

final output.

As an example, our input string will be “ab$”. ‘$’ is the EOF symbol.

Suppose the probability model has generated:

 a: 0.4

 b: 0.4

 $: 0.2

And the upper and lower bounds assigned to each symbol are:

 $: [0.0–0.2)

 a: [0.2–0.6)

 b: [0.6–1.0)
We first start with the high and low values as 0.0 and 1.0 respectively. To encode ‘a’, get the

range it can fall into. In this case it will be [0.2–0.6). The final output will be a number between

this range.

Next, assign 0.2 as the new lower bound, and 0.6 as the new upper bound. Take the next symbol,

‘b’ and get its range. In this case it is assigned [0.6–1.0). This range is now relative to the

subdivided portion after encoding a [0.2–0.6).

Lastly, encode ‘$’ using the same procedure as before.

The resulting range, [0.440–0.472), means we can pick any number within this range and it will

uniquely encode ‘ab$’. For simplicity, let’s choose the lower bound 0.440.

The pseudocode for encoding is:

low = 0.0
high = 1.0
for each symbol in input
{
range = high - low
high = low + range * high_range(symbol)
low = low + range * low_range(symbol)
}
output(low)

Infinite-Precision Demo: Decoding

How does it work? Knowing the encoding process, we simply reverse the computations by

reducing the resulting output 0.440.

First we find the range for which the first symbol falls in. In 0.440, since it falls between 0.4 and

0.6, it must be the character ‘b’ according to the model. We then subtract the lower bound of ‘b’

to the output, then divide by the width of the range of ‘b’ (which is .2). Using this procedure, we

further reduce the output as we recognize the next symbol.

Press enter or click to view image in full size

The pseudocode for decoding is:

number = input_code()
while (symbol != EOF)
{
symbol = find_symbol_within_this_range(number)
output(symbol)
range = high_range(symbol) - low_range(symbol)
number = number - low_range(symbol)
number = number / range
}

Another way is to make use of the intervals where we continuously build up on the lower and

upper bound that will contain the input value.

low = 0.0
high = 1.0
number = input_code()
while (true)
{
for each symbol in possible symbols
{
range = high - low
low_temp = low + low_range(symbol)
high_temp = low + high_range(symbol)

if (low_temp <= number && number < high_temp)

{
output(symbol)
if (symbol == EOF)
quit
low = low_temp
high = high_temp
}
}
}

Float-value into binary format

So far, we only know that the final output was 0.440, but to actually encode this, we need to treat

it as binary fraction (i.e. .11111…).

To do this, we apply the same subdivision principle or rescaling technique when encoding 0 or 1.

The range of [0, 0.1) may be represented such that the lower half is 0, and the upper half is 1 in

binary.

The range of the final output was [0.440, 0.472). This will dictate the binary sequence to encode,

since this range will either fall on the lower half (0), or upper half (1).
Press enter or click to view image in full size
At the end, we have found a range that is contained within the initial range that was computed

[0.440, 0.472). Therefore, the resulting binary sequence will be .011101.

As the range becomes smaller during the encoding, you will find that it may not entirely lie on the

entirety of the upper/lower half of the width. Sometimes the range may be on the middle, such

that the lower bound is on the lower half (0) and the upper bound is on the upper half (1). In such

cases, the rescaling factors in the quarter portion, or the three-fourths portion of the range.

The same scaling principle is also applied during decoding.

The pseudocode for the rescaling is:

remaining_bits = 0

while (high < HALF or low > HALF)

{
if (high < HALF)
{
output(0)
low = 2 * low
high = 2 * high
}
else if (low > HALF)
{
output(1)
low = 2 * (low - HALF)
high = 2 * (high - HALF)
}
}

while (low > QUARTER and high < THREE-FOURTHS)

{
remaining_bits++
low = 2 * (low - QUARTER)
high = 2 * (high - QUARTER)
}

remaining_bits++
if (low <= QUARTER)
{
output(0)
for 0 to remaining_bits
output(1)
}
else
{
output(1)
for 0 to remaining_bits
output(0)
}

Arithmetic Coding Explained
No ratings yet
Arithmetic Coding Explained
6 pages
Arithmetic Coding Implementation Guide
No ratings yet
Arithmetic Coding Implementation Guide
11 pages
Arithmetic Coding: Implementation Guide
No ratings yet
Arithmetic Coding: Implementation Guide
7 pages
Arithmetic Coding Techniques
No ratings yet
Arithmetic Coding Techniques
36 pages
Arithmetic Coding Explained
No ratings yet
Arithmetic Coding Explained
15 pages
Arithmetic Coding in Image Compression
No ratings yet
Arithmetic Coding in Image Compression
34 pages
Arithmetic Coding: A Comprehensive Guide
No ratings yet
Arithmetic Coding: A Comprehensive Guide
48 pages
Entropy Coding Techniques Overview
No ratings yet
Entropy Coding Techniques Overview
45 pages
Arithmetic Coding Report
No ratings yet
Arithmetic Coding Report
9 pages
Understanding Arithmetic Coding
No ratings yet
Understanding Arithmetic Coding
12 pages
Arithmetic Coding in Data Compression
No ratings yet
Arithmetic Coding in Data Compression
26 pages
Arithmetic Coding for String Compression
No ratings yet
Arithmetic Coding for String Compression
8 pages
Understanding Arithmetic Coding Techniques
No ratings yet
Understanding Arithmetic Coding Techniques
20 pages
Arithmetic Encoding Algorithm Project
No ratings yet
Arithmetic Encoding Algorithm Project
19 pages
Understanding Arithmetic Coding Techniques
No ratings yet
Understanding Arithmetic Coding Techniques
2 pages
Arithmetic Coding Explained: Principles & Algorithms
No ratings yet
Arithmetic Coding Explained: Principles & Algorithms
23 pages
Arithmetic Coding Principles Explained
No ratings yet
Arithmetic Coding Principles Explained
22 pages
Arithmetic Coding in Data Compression
No ratings yet
Arithmetic Coding in Data Compression
18 pages
Advantages of Arithmetic Coding
No ratings yet
Advantages of Arithmetic Coding
33 pages
Lossless Data Compression with Verilog
No ratings yet
Lossless Data Compression with Verilog
6 pages
Adaptive Huffman Coding Techniques
No ratings yet
Adaptive Huffman Coding Techniques
39 pages
An Introduction To Arithmetic Coding: Glen G. Langdon, JR
No ratings yet
An Introduction To Arithmetic Coding: Glen G. Langdon, JR
15 pages
Arithmetic Coding for Barcodes
No ratings yet
Arithmetic Coding for Barcodes
66 pages
Arithmetic Coding Explained with Examples
No ratings yet
Arithmetic Coding Explained with Examples
11 pages
Arithmetic Coding Explained
No ratings yet
Arithmetic Coding Explained
4 pages
Byte-wise Normalization in Arithmetic Coding
No ratings yet
Byte-wise Normalization in Arithmetic Coding
11 pages
Predictive Coding in Multimedia Systems
No ratings yet
Predictive Coding in Multimedia Systems
25 pages
Arithmetic Coding: I J I J J
No ratings yet
Arithmetic Coding: I J I J J
12 pages
Arithmetic Coding and Elias Coding
No ratings yet
Arithmetic Coding and Elias Coding
38 pages
Arithmetic Coding in Digital Communications
No ratings yet
Arithmetic Coding in Digital Communications
54 pages
20.5 Arithmetic Coding
No ratings yet
20.5 Arithmetic Coding
6 pages
Understanding Arithmetic Coding Techniques
No ratings yet
Understanding Arithmetic Coding Techniques
30 pages
Arithmetic Coding in Multimedia
No ratings yet
Arithmetic Coding in Multimedia
44 pages
Huffman and Compression Techniques Guide
No ratings yet
Huffman and Compression Techniques Guide
10 pages
Understanding Arithmetic Coding Techniques
No ratings yet
Understanding Arithmetic Coding Techniques
26 pages
Arithmetic and Lempel-Ziv Coding
No ratings yet
Arithmetic and Lempel-Ziv Coding
25 pages
Arithmetic Coding Techniques Explained
No ratings yet
Arithmetic Coding Techniques Explained
24 pages
Understanding Range Coding Explained
No ratings yet
Understanding Range Coding Explained
6 pages
Arithmetic Coding and Decoding Techniques
No ratings yet
Arithmetic Coding and Decoding Techniques
17 pages
Efficient Multiplication-Free Binary Coder
No ratings yet
Efficient Multiplication-Free Binary Coder
4 pages
Adaptive Arithmetic Coding in Multimedia
No ratings yet
Adaptive Arithmetic Coding in Multimedia
23 pages
Arithmetic Coding Techniques Explained
No ratings yet
Arithmetic Coding Techniques Explained
22 pages
Understanding Arithmetic Coding Basics
No ratings yet
Understanding Arithmetic Coding Basics
18 pages
Arithmetic Coding Overview and Techniques
No ratings yet
Arithmetic Coding Overview and Techniques
9 pages
ZSTD Compression Level Explained
No ratings yet
ZSTD Compression Level Explained
17 pages
Image Compression Techniques Overview
No ratings yet
Image Compression Techniques Overview
29 pages
Golomb-Rice Coding Explained
No ratings yet
Golomb-Rice Coding Explained
7 pages
Arithmetic Coding Revisited
No ratings yet
Arithmetic Coding Revisited
39 pages
Introduction to Arithmetic Coding
No ratings yet
Introduction to Arithmetic Coding
31 pages
LEC2 - Source Coding Systems
No ratings yet
LEC2 - Source Coding Systems
34 pages
Lempel-Ziv Coding Overview
No ratings yet
Lempel-Ziv Coding Overview
26 pages
Huffman Coding Algorithm Explained
No ratings yet
Huffman Coding Algorithm Explained
13 pages
Image Compression Techniques Explained
No ratings yet
Image Compression Techniques Explained
37 pages
Arithmetic Coding for Image Compression
No ratings yet
Arithmetic Coding for Image Compression
12 pages
Lossless Compression Techniques Explained
No ratings yet
Lossless Compression Techniques Explained
20 pages
Data Compression Techniques Overview
No ratings yet
Data Compression Techniques Overview
48 pages
Arithmetic Coding Explained for Compression
No ratings yet
Arithmetic Coding Explained for Compression
9 pages
Agenda
No ratings yet
Agenda
3 pages
What Is Plaintext
No ratings yet
What Is Plaintext
2 pages
Web Technology Course Manual KCS 602
No ratings yet
Web Technology Course Manual KCS 602
47 pages
OOPs in Java: Comprehensive Tutorials
No ratings yet
OOPs in Java: Comprehensive Tutorials
5 pages
Substitution Techniques
No ratings yet
Substitution Techniques
12 pages
DBMS Concepts and SQL Queries Explained
No ratings yet
DBMS Concepts and SQL Queries Explained
1 page
Data Analytics: Definition and Examples
No ratings yet
Data Analytics: Definition and Examples
3 pages
BCS-451 Operating System Lab Manual
No ratings yet
BCS-451 Operating System Lab Manual
40 pages
BCS 251 Lab Solutions Overview
No ratings yet
BCS 251 Lab Solutions Overview
34 pages
BCS-452 Java OOP Lab Manual
No ratings yet
BCS-452 Java OOP Lab Manual
17 pages
Aptitude and Technical Interview Questions
No ratings yet
Aptitude and Technical Interview Questions
9 pages
Video Compression Techniques Explained
No ratings yet
Video Compression Techniques Explained
15 pages
Text and Image Compression Techniques
No ratings yet
Text and Image Compression Techniques
55 pages
Multimedia Basics and Compression Techniques
No ratings yet
Multimedia Basics and Compression Techniques
60 pages
Dynamic Markov Compression Method
No ratings yet
Dynamic Markov Compression Method
10 pages
Applications of Image Transform
No ratings yet
Applications of Image Transform
16 pages
Data Compression: Modeling & Coding Explained
No ratings yet
Data Compression: Modeling & Coding Explained
24 pages
Multimedia Communication Course Plan 2024
No ratings yet
Multimedia Communication Course Plan 2024
26 pages
Compression Theory
No ratings yet
Compression Theory
7 pages
Huffman Coding and Data Compression
No ratings yet
Huffman Coding and Data Compression
32 pages
Unit 5-Dec
No ratings yet
Unit 5-Dec
9 pages
Understanding Arithmetic Coding Basics
No ratings yet
Understanding Arithmetic Coding Basics
15 pages
Neural Linguistic Steganography Techniques
No ratings yet
Neural Linguistic Steganography Techniques
9 pages
Jpeg Ls Loco
No ratings yet
Jpeg Ls Loco
16 pages
Understanding Image Processing Types
No ratings yet
Understanding Image Processing Types
147 pages
Deneliwovilopo
No ratings yet
Deneliwovilopo
3 pages
Image Compression Techniques Explained
No ratings yet
Image Compression Techniques Explained
32 pages
Lossless Data Compression Efficiency
No ratings yet
Lossless Data Compression Efficiency
6 pages
On2 VP6 Codec: Features and Benefits
No ratings yet
On2 VP6 Codec: Features and Benefits
7 pages
Kap 5
No ratings yet
Kap 5
29 pages
Multimedia Compression Question Bank
No ratings yet
Multimedia Compression Question Bank
10 pages
Data Compression: Rate, Distortion, Coding
No ratings yet
Data Compression: Rate, Distortion, Coding
36 pages
LLMZip: Advanced Lossless Text Compression
No ratings yet
LLMZip: Advanced Lossless Text Compression
8 pages
Information Theory Overview
No ratings yet
Information Theory Overview
94 pages
Programming Assignment: Encoding & DCT
No ratings yet
Programming Assignment: Encoding & DCT
3 pages
Lossless Compression Algorithms Overview
No ratings yet
Lossless Compression Algorithms Overview
51 pages
Image Compression and Encoding Techniques
No ratings yet
Image Compression and Encoding Techniques
32 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
19 pages

Arithmetic Coding

Uploaded by

Arithmetic Coding

Uploaded by

Arithmetic Coding

Background of the Algorithm

is compressed into a single floating-point number between 0 and 1.

encoding “aaaaaaaaab” with probabilities 0.9 for a and 0.1 for b:

 Huffman coding: assigns 0 to a, 1 to b, resulting in 0000000001

 Arithmetic coding: assigns the interval 0.1–1.0 to a, 0.0–0.1 to b, resulting in .301272

symbol with a specific code.

How does it work? To construct the floating-point number output:

Suppose the probability model has generated:

subdivided portion after encoding a [0.2–0.6).

Lastly, encode ‘$’ using the same procedure as before.

The pseudocode for encoding is:

Infinite-Precision Demo: Decoding

reducing the resulting output 0.440.

further reduce the output as we recognize the next symbol.

The pseudocode for decoding is:

upper bound that will contain the input value.

if (low_temp <= number && number < high_temp)

Float-value into binary format

it as binary fraction (i.e. .11111…).

[0.440, 0.472). Therefore, the resulting binary sequence will be .011101.

The same scaling principle is also applied during decoding.

The pseudocode for the rescaling is:

while (high < HALF or low > HALF)

while (low > QUARTER and high < THREE-FOURTHS)

You might also like