ENCODING
SCHEMS
Udit
XI-B Adhikary
Roll no:06
ACKNOWLEDGEMENT
I would like to express my sincere gratitude to
my Computer Science teacher for their
valuable guidance, support, and
encouragement throughout the completion of
this Class 11 Computer Science project. Their
suggestions and constructive feedback helped
me understand the concepts clearly and
execute the project successfully. I am also
thankful to my school for providing the
necessary resources . I extend my heartfelt
thanks to my parents and friends for their
constant motivation and cooperation. This
project has enhanced my practical knowledge,
programming skills, and overall understanding
of the subject.
.
.
.
Certificate
This is to certify that the Computer Science
project submitted by UDIT ADHIKARY,
Class XI B, Roll No. 06, has been completed
by them during the academic session
2025-26 under my guidance.
I further certify that this project is the original
work of the student and has been carried out
independently as per the requirements of the
prescribed syllabus. The student has shown
sincere effort, dedication, and keen interest in
the completion of this project.
Teacher’s Signature:
WHY DO WE NEED IT?
We need encoding because computers cannot
understand human languages directly. They
can only process data in binary form, that is,
0s and 1s. Character encoding provides a
standard method to convert letters, numbers,
and symbols into binary codes that computers
can store, process, and transmit .Encoding
ensures that text written on one computer
appears the same on another, enabling
accurate data exchange through networks and
the internet. It also allows different languages
and scripts to be represented [Link]
encoding systems, text data would become
inconsistent and unreadable across devices.
Therefore, encoding is essential for text
processing, communication, programming,
and digital storage in modern computing.
ASCII(AMERICAN STANDARD CODE FOR INFORMATION INTERCHANGE)
ASCII stands for American Standard Code
for Information Interchange. It is a
character encoding system developed in the
1960s to help computers represent and exchange
text in a standard way.
ASCII uses 7 bits to represent each character,
allowing a total of 128 characters numbered
from 0 to 127. These characters include letters,
digits, punctuation marks, and special control
characters.
The ASCII table is divided into two main parts.
The first part contains control characters
such as Line Feed (LF) and Carriage Return
(CR).
For example, the ASCII value of capital A is 65,
lowercase a is 97, and the digit 0 is 48. Each
character has a unique numeric code.
INDIAN STANDARD CODE FOR INFORMATION
INTERCHANGE (ISCII)
It is a character encoding system developed in India to
represent Indian languages in computers in a
standardized way. It was introduced in 1988 by the
Bureau of Indian Standards.
ISCII uses 8 bits to encode characters, allowing 256
possible codes. It is designed to handle multiple Indian
scripts such as Devanagari, Bengali, Tamil, Telugu,
Kannada, Malayalam, Gujarati, Punjabi, and Oriya.
One important feature of ISCII is that it uses a common
code structure for different scripts.
ISCII also includes special control codes to switch
between scripts within the same text. This allows
multilingual text processing in Indian languages.
Although ISCII was widely used earlier, it has largely
been replaced today by Unicode, which provides a
more universal system for representing characters from
all world languages.
UNICODE
Unicode is a universal character encoding standard
designed to represent text from all languages and
writing systems in the world. It was developed to
overcome the limitations of earlier encoding systems
like ASCII and ISCII, which could represent only a
limited number of characters.
Unicode assigns a unique numeric value, called a code
point, to every character, regardless of language or
platform. This ensures that text appears consistently
across different devices, software, and operating
systems.
Unicode uses multiple encoding formats, the most
common being UTF-8, UTF-16, and UTF-32. Among
these, UTF-8 is the most widely used because it is
memory efficient and compatible with ASCII.
The system supports characters from languages such as
English, Hindi, Bengali, Chinese, Arabic, and many
more. It also includes symbols, mathematical signs,
emojis, and special characters.
Because Unicode can represent over a million possible
characters, it has become the global standard for text
representation in modern computing, the internet, and
digital communication.
UTF-8
Most widely used. Variable-length encoding (1 to 4
bytes). Backward compatible with ASCII. Efficient for
English-heavy text.
UTF-16
Uses 2 or 4 bytes. Common in many operating systems
and programming environments.
COMPARISON
CONCLUSION
In conclusion, character encoding systems like
ASCII and ISCII played an important role in the
development of text processing in computers.
ASCII provided the first standard method to
represent English characters digitally, making data
exchange between early computer systems
possible. However, it was limited in supporting
other languages. ISCII was later developed to
address this limitation by enabling the
representation of multiple Indian scripts using a
common encoding structure. While both systems
were significant in their time, they have now been
largely replaced by Unicode, which offers a
universal solution for representing text from all
languages and symbols in modern computing.
Together, these encoding standards mark the
evolution of how computers understand and
process human language.
===================================
THANK YOU