0% found this document useful (0 votes)

18 views17 pages

Overview of Information Theory Concepts

Unit 2 Notes on IICT based on IPU 6th sem syllabus

Uploaded by

priyanshunailwal19022003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views17 pages

Overview of Information Theory Concepts

Unit 2 Notes on IICT based on IPU 6th sem syllabus

Uploaded by

priyanshunailwal19022003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Introduction to Information Theory

Information theory is a mathematical framework developed to quantify information, uncertainty,

and the efficiency of data transmission. It was pioneered by Claude Shannon in 1948, laying the
foundation for modern digital communication, data compression, and error correction
techniques. The theory is widely used in telecommunications, cryptography, machine learning,
and artificial intelligence.
At its core, information theory deals with the measurement of uncertainty in a system and how
efficiently information can be transmitted through a noisy channel.
1. Uncertainty and Information
In everyday communication, uncertainty refers to the unpredictability of an event. If an event is
certain, it carries no new information. Conversely, if an event is highly uncertain, it provides
more information when it occurs.
For example:

● If you check the weather forecast in a desert region and see "Sunny", it provides little to no new
information (low uncertainty).
● If the forecast predicts "Heavy Snowfall", it is surprising and conveys more information (high
uncertainty).

Mathematical Representation: The amount of information received from an event X = x can

be measured using the formula:

Interpretation:
 If P(x) is high, I(x) is low (less information).
 If P(x) is low, I(x) is high (more information).

Example:

● If a coin flip results in Heads or Tails, each with probability P(H)=P(T)=0.5, the
information content is:

● If an event is very rare, say P(x)=0.01, it carries more information:

2. Entropy (Shannon Entropy)

Entropy measures the average uncertainty in a random variable. It quantifies how much
information is required to describe an outcome.

For a discrete random variable X with probability distribution P(x), Shannon entropy is
defined as:

where:

3. Relative Entropy (Kullback-Leibler Divergence):

Problem 1: Compute the Entropy of a Fair Die
A fair six-sided die has equal probability for each face. Find the entropy of this system.
Solution:
Problem 2: Compute the Entropy of a Biased Coin
A coin is biased such that P(H)=0.7 and P(T)=0.3. Find the entropy.
Solution:

Problem 3: Calculate the Entropy of a 4-Sided Biased Dice

A four-sided die has probabilities:

P(1)=0.1, P(2)=0.2, P(3)=0.3, P(4)=0.4

Find the entropy H(X).

Solution:

Joint Entropy in Information Theory

Definition:
The joint entropy of two discrete random variables X and Y, denoted as H(X,Y), measures the
total amount of uncertainty (or information) contained in the pair of variables.
Mathematically, it is defined as:

Interpretation of Joint Entropy:

● Joint entropy measures the total information of the combined system (X,Y).
Example Calculation:
Suppose X and Y take the following values with the given joint probabilities:

Relationship to Other Entropy Measures

Joint entropy is connected to conditional entropy and mutual information:

1. Conditional Entropy:

H(X∣Y) = H(X,Y)−H(Y)

where H(X∣Y) is the uncertainty of X given Y.

2. Mutual Information:

I(X;Y)=H(X)+H(Y)−H(X,Y)

which measures the reduction in uncertainty of one variable given the others.

Marginal Entropy in Information Theory

Definition:
The marginal entropy of a single random variable X, denoted as H(X), measures the uncertainty
(or randomness) associated with X alone, ignoring any additional information about other
variables.
Mathematically, it is given by:

Interpretation of Marginal Entropy

● Marginal entropy measures the uncertainty of a single variable, ignoring other

variables.
● If X and Y are independent, then:

H(X,Y)=H(X)+H(Y)

● If X and Y are not independent, then:

H(X)≤H(X,Y)H(X)

because knowing Y reduces the uncertainty in X.

Problem : Marginal Entropy for Joint Distribution

Solution:
First, find the marginal probability distribution of X:

4. Mutual Information

Mutual information measures how much knowing one variable reduces uncertainty about
another.

For two random variables X and Y, mutual information is given by:

5. Average Mutual Information

Average mutual information measures the expected reduction in uncertainty of one variable due to
another.

It tells us how much on average we learn about X when observing Y.

Example:

Problem 4: Compute Mutual Information

Consider a communication system where a sender transmits bits X that can be received as Y. The
joint probability distribution is:
Solution:
Introduction to Lossless Coding
Lossless coding is a data compression technique that ensures the original data can be perfectly
reconstructed from the compressed data. Unlike lossy compression, which sacrifices some data
quality for higher compression rates, lossless coding retains all information, making it ideal for
applications such as text compression, medical imaging, and file storage.
Key Concepts in Lossless Coding
1. Redundancy Removal:
o Lossless compression reduces redundancy in data by identifying patterns and
encoding them efficiently.
o It relies on statistical methods to assign shorter codes to frequently occurring
symbols.
2. Entropy and Source Coding:
o According to Shannon's Information Theory, the entropy (H) of a source
represents the minimum average number of bits required to encode its symbols.
o Lossless coding techniques aim to achieve compression rates close to this
theoretical limit.
3. Prefix Codes:
o Lossless coding often uses prefix codes, where no codeword is a prefix of another.
o This ensures unique decodability without requiring delimiters between encoded
symbols.
Types of Lossless Coding Techniques
1. Huffman Coding:
o A widely used variable-length coding algorithm that assigns shorter codes to more
frequent symbols.
o Constructs a binary tree based on symbol frequencies.
o Used in applications like ZIP file compression and PNG images.
2. Arithmetic Coding:
o Assigns a single number between 0 and 1 to an entire message rather than
encoding symbols individually.
o Achieves higher compression efficiency compared to Huffman coding, especially
for sources with skewed probability distributions.
3. Run-Length Encoding (RLE):
o Efficient for data with repeated sequences, like simple images or text files with
repeated spaces.
o Example: "AAAAABBBCC" → "5A3B2C".
4. Lempel-Ziv (LZ) Compression:
o Dictionary-based coding method used in ZIP, GIF, and PNG formats.
o Examples include LZ77 and LZ78, which replace repeated sequences with
references.
5. Burrows-Wheeler Transform (BWT):
o A preprocessing step for compression that rearranges data to make it more
suitable for dictionary-based methods like LZ77.
o Used in Bzip2 compression.
Source Coding Theorem
The Source Coding Theorem is a fundamental result in Information Theory, introduced by
Claude Shannon in 1948. It establishes the theoretical limit for lossless data compression and
provides a foundation for efficient coding techniques.
Statement of the Theorem
The Source Coding Theorem states that:
"For a discrete memoryless source (DMS) with entropy H(X)), the minimum average length L of
any lossless encoding satisfies:"

where:
● L is the average code length per symbol.
● H(X) is the entropy of the source, which represents the average amount of uncertainty or
information in each symbol.
● The lower bound is asymptotically achievable using optimal coding schemes, such as
Huffman coding or Arithmetic coding.

Key Insights from the Theorem

1. Entropy as a Bound:
o The entropy H(X) sets the fundamental limit on how much a source can be compressed.
o If a source is compressed below H(X), some information will be lost, violating the
lossless condition.
2. Achievability:
o If L≈H(X), the encoding scheme is optimal.
o The closer L is to H(X), the more efficient the compression.
3. Uniqueness:
o No lossless compression scheme can achieve an average code length below H(X), making
entropy the absolute lower limit.

Example Calculation
Consider a source that emits three symbols A,B,C with probabilities:
Block Codes and Their Properties

A block code is a type of error-detecting and error-correcting code in which messages are
divided into fixed-length blocks of symbols and encoded into a longer sequence of symbols for
transmission. Block codes are used in digital communication systems to ensure reliable data
transmission over noisy channels.

Properties of Block Codes

1. Fixed-Length Encoding: Each input message of k bits is encoded into a codeword of n
bits (denoted as an (n,k) code).
2. Error Detection and Correction: Block codes can detect and correct errors based on the
redundancy added to the code.
3. Code Rate: The ratio of information bits to total transmitted bits is given by:

4. A higher rate means less redundancy and lower error correction capability.
5. Minimum Distance: The minimum Hamming distance between any two codewords
determines the error detection and correction ability.
6. Systematic and Non-Systematic Codes:
o Systematic Codes: The original message appears in the codeword, with
additional redundant bits.
o Non-Systematic Codes: The message is completely transformed into a different
codeword.
7. Linear Block Codes: A block code is linear if the sum of any two codewords is also a
valid codeword.
8. Hamming Weight and Hamming Distance:
o The Hamming weight of a codeword is the number of nonzero bits in it.
o The Hamming distance between two codewords is the number of positions at
which they differ.

Kraft-McMillan Inequality

The Kraft-McMillan inequality provides a necessary and sufficient condition for the existence
of a prefix-free or uniquely decodable code.

Statement of Kraft's Inequality

Implications of Kraft-McMillan Inequality

1. Prefix-Free Code: If a code satisfies Kraft’s inequality, then there exists a prefix-free
code with those codeword lengths.
2. Uniquely Decodable Code: For uniquely decodable codes, the inequality must also hold,
but not necessarily with equality.
3. Code Construction: Given a set of lengths satisfying Kraft’s inequality, we can construct
a prefix-free code.

Example of Kraft's Inequality

Example 1: Block Code Construction

Problem: Code Rate Calculation
A block code has code words of length n=7 and each message consists of k=4 information
bits. Find the code rate.
Solution:

So, the code rate is 0.571 or 57.1% efficiency.

Overview of Information Theory Concepts
No ratings yet
Overview of Information Theory Concepts
26 pages
Information Theory Basics and Applications
No ratings yet
Information Theory Basics and Applications
37 pages
Uncertainty and Information in Computing
No ratings yet
Uncertainty and Information in Computing
108 pages
Introduction to Information Theory Concepts
No ratings yet
Introduction to Information Theory Concepts
2 pages
ICT Module: Mathematical Communication Theory
No ratings yet
ICT Module: Mathematical Communication Theory
34 pages
Shannon's Information Theory Overview
No ratings yet
Shannon's Information Theory Overview
43 pages
Information Theory and Source Coding Basics
No ratings yet
Information Theory and Source Coding Basics
20 pages
Introduction to Information Theory
No ratings yet
Introduction to Information Theory
45 pages
Information Coding Techniques Overview
No ratings yet
Information Coding Techniques Overview
42 pages
Data Compression in Information Theory
No ratings yet
Data Compression in Information Theory
38 pages
Information Theory and Compression
No ratings yet
Information Theory and Compression
29 pages
Shannon-Fano and Huffman Coding Overview
No ratings yet
Shannon-Fano and Huffman Coding Overview
36 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
113 pages
Shannon's Channel Coding Theorem
No ratings yet
Shannon's Channel Coding Theorem
23 pages
Overview of Information Theory Concepts
No ratings yet
Overview of Information Theory Concepts
1 page
Information Theory: Entropy & Coding
No ratings yet
Information Theory: Entropy & Coding
44 pages
MCSE202 Unit1
No ratings yet
MCSE202 Unit1
35 pages
Module 4
No ratings yet
Module 4
42 pages
Information Theory and Coding Basics
No ratings yet
Information Theory and Coding Basics
56 pages
Shannon's 1948 Information Theory Overview
No ratings yet
Shannon's 1948 Information Theory Overview
32 pages
Quantum Data Compression Overview
No ratings yet
Quantum Data Compression Overview
7 pages
Communication System Performance Analysis
No ratings yet
Communication System Performance Analysis
68 pages
Information Theory
No ratings yet
Information Theory
22 pages
Information Theory and Source Coding Guide
No ratings yet
Information Theory and Source Coding Guide
45 pages
Information Theory and Coding Summary
No ratings yet
Information Theory and Coding Summary
3 pages
Information Theory: Limits and Concepts
No ratings yet
Information Theory: Limits and Concepts
108 pages
Information Rate in Communication Theory
No ratings yet
Information Rate in Communication Theory
2 pages
Huffman Coding and Information Theory
No ratings yet
Huffman Coding and Information Theory
34 pages
Data Compression Fundamentals Explained
No ratings yet
Data Compression Fundamentals Explained
34 pages
Effective Noise Temperature and Data Compression
No ratings yet
Effective Noise Temperature and Data Compression
86 pages
Source Compression Techniques Overview
No ratings yet
Source Compression Techniques Overview
34 pages
ICE513 Module 3 - Shannon Information Measures
No ratings yet
ICE513 Module 3 - Shannon Information Measures
33 pages
Understanding Information Theory Concepts
No ratings yet
Understanding Information Theory Concepts
19 pages
Information Theory and Coding Notes
No ratings yet
Information Theory and Coding Notes
156 pages
Information Coding Techniques Overview
No ratings yet
Information Coding Techniques Overview
39 pages
Spread Spectrum & Information Theory Concepts
No ratings yet
Spread Spectrum & Information Theory Concepts
41 pages
Limits of Information Theory Explained
No ratings yet
Limits of Information Theory Explained
21 pages
Information Theory and Coding Syllabus
100% (2)
Information Theory and Coding Syllabus
45 pages
Entropy (Information Theory) - Wikipedia
No ratings yet
Entropy (Information Theory) - Wikipedia
25 pages
Digital Communications Study Guide
No ratings yet
Digital Communications Study Guide
22 pages
Entropy and Noise in Communication Systems
No ratings yet
Entropy and Noise in Communication Systems
44 pages
Information Theory and Coding Concepts
No ratings yet
Information Theory and Coding Concepts
34 pages
Information Theory and Source Coding Overview
No ratings yet
Information Theory and Source Coding Overview
206 pages
Information Entropy and Coding Techniques
No ratings yet
Information Entropy and Coding Techniques
13 pages
Data Compression Seminar Certificate
No ratings yet
Data Compression Seminar Certificate
49 pages
Lossless Compression Techniques Overview
No ratings yet
Lossless Compression Techniques Overview
10 pages
Dias STP 04 10
No ratings yet
Dias STP 04 10
15 pages
Information Theory
No ratings yet
Information Theory
114 pages
Information Theory Overview
No ratings yet
Information Theory Overview
114 pages
Info Theory
No ratings yet
Info Theory
59 pages
Introduction to Information Theory
No ratings yet
Introduction to Information Theory
28 pages
Understanding Information Entropy
No ratings yet
Understanding Information Entropy
3 pages
Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
29 pages
Information Theory and Coding Concepts
No ratings yet
Information Theory and Coding Concepts
21 pages
Reinforcement Learning & Deep Learning Guide
No ratings yet
Reinforcement Learning & Deep Learning Guide
17 pages
Evaluate 102³ Using Identities
No ratings yet
Evaluate 102³ Using Identities
2 pages
Class 8 Quadrilaterals Worksheet
No ratings yet
Class 8 Quadrilaterals Worksheet
1 page
Direct Variations Test for 8th Grade
No ratings yet
Direct Variations Test for 8th Grade
1 page
Feedback for Improved Teaching Methods
No ratings yet
Feedback for Improved Teaching Methods
2 pages
Probability of Coin Selection
No ratings yet
Probability of Coin Selection
1 page
Statistics and Data Analytics Overview
No ratings yet
Statistics and Data Analytics Overview
142 pages
Chapter 2 Assignment 2 Class 10
No ratings yet
Chapter 2 Assignment 2 Class 10
3 pages
SSMDA Akash
No ratings yet
SSMDA Akash
5 pages
Class 10 Linear Equations Notes
No ratings yet
Class 10 Linear Equations Notes
1 page
Probability and Random Variables Explained
No ratings yet
Probability and Random Variables Explained
34 pages
Understanding TELNET Operations
No ratings yet
Understanding TELNET Operations
4 pages
Subnetting Implementation in Cisco Packet Tracer
No ratings yet
Subnetting Implementation in Cisco Packet Tracer
5 pages
Management Principles for Engineers
No ratings yet
Management Principles for Engineers
8 pages
Database Programming Concepts Explained
No ratings yet
Database Programming Concepts Explained
8 pages
Lambda Iteration Method in Economics
100% (1)
Lambda Iteration Method in Economics
76 pages
Gait Analysis
No ratings yet
Gait Analysis
3 pages
Forecasting Questions and Answers Guide
No ratings yet
Forecasting Questions and Answers Guide
3 pages
HCIA-AI Exam Questions and Answers
No ratings yet
HCIA-AI Exam Questions and Answers
62 pages
Internship at NetLeap IT Solutions
No ratings yet
Internship at NetLeap IT Solutions
19 pages
Quine-McClusky Algorithm Overview
No ratings yet
Quine-McClusky Algorithm Overview
14 pages
Python Implementation of BAM
No ratings yet
Python Implementation of BAM
5 pages
Graph Theory Concepts and Algorithms
No ratings yet
Graph Theory Concepts and Algorithms
13 pages
Statistical Methods for QC in Chemistry
No ratings yet
Statistical Methods for QC in Chemistry
4 pages
Mathematics Periodic Test Paper - XII
No ratings yet
Mathematics Periodic Test Paper - XII
10 pages
Understanding Neural Networks Basics
No ratings yet
Understanding Neural Networks Basics
8 pages
System Modeling & Simulation Overview
No ratings yet
System Modeling & Simulation Overview
28 pages
Modular Inverses in Cryptography
No ratings yet
Modular Inverses in Cryptography
14 pages
Avalanche Effect in Blowfish Algorithm
No ratings yet
Avalanche Effect in Blowfish Algorithm
5 pages
Engineering Mathematics III Overview
No ratings yet
Engineering Mathematics III Overview
31 pages
B.Tech CSE AI & ML Course Structure
No ratings yet
B.Tech CSE AI & ML Course Structure
34 pages
Signal Processing Concepts and Techniques
No ratings yet
Signal Processing Concepts and Techniques
9 pages
Efficient Multilevel Graph Partitioning
No ratings yet
Efficient Multilevel Graph Partitioning
12 pages
Control System Design Overview
No ratings yet
Control System Design Overview
14 pages
Few-Shot Spike Sorting with Adversarial Learning
No ratings yet
Few-Shot Spike Sorting with Adversarial Learning
4 pages
AI Algorithms in Banking Data Mining
No ratings yet
AI Algorithms in Banking Data Mining
3 pages
Dual Multiplex Method (Duplex)
No ratings yet
Dual Multiplex Method (Duplex)
5 pages
DataMites AI Expert Program Overview
No ratings yet
DataMites AI Expert Program Overview
10 pages
Dynamic Plantar Pressure for Human ID
No ratings yet
Dynamic Plantar Pressure for Human ID
4 pages
Data Science Projects by Sai Sreehas
No ratings yet
Data Science Projects by Sai Sreehas
1 page
Control Lab Manual
No ratings yet
Control Lab Manual
147 pages
Learning Unit 18 MAC2601 - LU18 - 8october2025
No ratings yet
Learning Unit 18 MAC2601 - LU18 - 8october2025
18 pages
Meta-Learning for Customer Preferences
No ratings yet
Meta-Learning for Customer Preferences
51 pages
Binomial Tree Option Valuation
No ratings yet
Binomial Tree Option Valuation
25 pages
Types of Simulation in Industrial Systems
No ratings yet
Types of Simulation in Industrial Systems
23 pages

Overview of Information Theory Concepts

Uploaded by

Overview of Information Theory Concepts

Uploaded by

Introduction to Information Theory

Information theory is a mathematical framework developed to quantify information, uncertainty,

Mathematical Representation: The amount of information received from an event X = x can

●​ If an event is very rare, say P(x)=0.01, it carries more information:

3. Relative Entropy (Kullback-Leibler Divergence):

Problem 3: Calculate the Entropy of a 4-Sided Biased Dice

A four-sided die has probabilities:

Find the entropy H(X).

Joint Entropy in Information Theory

Interpretation of Joint Entropy:

Relationship to Other Entropy Measures

Joint entropy is connected to conditional entropy and mutual information:

1.​ Conditional Entropy:

where H(X∣Y) is the uncertainty of X given Y.

Marginal Entropy in Information Theory

Interpretation of Marginal Entropy

●​ Marginal entropy measures the uncertainty of a single variable, ignoring other

●​ If X and Y are not independent, then:

because knowing Y reduces the uncertainty in X.

For two random variables X and Y, mutual information is given by:

It tells us how much on average we learn about X when observing Y.

Problem 4: Compute Mutual Information

Key Insights from the Theorem

Properties of Block Codes

Statement of Kraft's Inequality

Example of Kraft's Inequality

Example 1: Block Code Construction

So, the code rate is 0.571 or 57.1% efficiency.

You might also like

● If an event is very rare, say P(x)=0.01, it carries more information:

1. Conditional Entropy:

● Marginal entropy measures the uncertainty of a single variable, ignoring other

● If X and Y are not independent, then: