0% found this document useful (0 votes)
9 views9 pages

IGCSE Computer Science Text Representation

The document explains text representation in computers, focusing on character sets that convert characters into binary data. It highlights ASCII as an early standard with limitations in character range and Unicode as a modern solution supporting a vast array of languages and symbols. Key takeaways include the binary foundation of text, the efficiency of ASCII for English, and the global capabilities of Unicode.

Uploaded by

tizhappening
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views9 pages

IGCSE Computer Science Text Representation

The document explains text representation in computers, focusing on character sets that convert characters into binary data. It highlights ASCII as an early standard with limitations in character range and Unicode as a modern solution supporting a vast array of languages and symbols. Key takeaways include the binary foundation of text, the efficiency of ASCII for English, and the global capabilities of Unicode.

Uploaded by

tizhappening
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

IGCSE COMPUTER

SCIENCE 0478
Text Representation
Computers only understand binary data (0s and 1s). Every
character, symbol, and command must be converted into a
numerical format through character sets.
UNDERSTANDING CHARACTER SETS
1 Logical Ordering: Characters are assigned codes in sequence
What is it? (e.g., A=65, B=66), allowing for easy sorting and manipulation.

A character set is a defined list of characters and 2 Standardization: Essential for global communication, ensuring
their corresponding binary codes that a computer documents look the same on all devices.
system can recognize and display.
3 The "Dictionary": Acts as a common language between sender
and receiver to interpret binary strings correctly.

Activity: Brainstorming
Think of all the different types of characters a computer
needs to represent (letters, numbers, punctuation, emojis,
mathematical symbols).
ASCII: THE
FOUNDATION
The first major standard for
representing text in computers,
developed in the 1960s.

7-BIT STRUCTURE CHARACTER RANGE


Uses 7 bits per character, providing 128 unique Includes A-Z, a-z, 0-9, punctuation, and non-printing
combinations (27 = 128). control codes like "Enter".
EXTENDED ASCII
Utilizing the 8th Bit for 256 Characters

THE GLOBAL PROBLEM


× Insufficient Capacity: 256 characters cannot
represent non-Latin scripts like Chinese, Arabic, or
Hindi.

× Regional Incompatibility: Different regions used


different versions of Extended ASCII, leading to
garbled text (Mojibake).

× Modern Needs: No support for the vast array of


modern symbols and emojis used today.
ACTIVITY: DECODING ASCII R EFER EN CE: A S CI I TA B L E

Task: Decode the word "CAB"


Step 1: Use the logic that 'A' = 65 (Denary).
What are 'B' and 'C'?
Step 2: Convert the word "CAB" into its Denary
codes.
Step 3: Convert the Denary code for 'A' (65)
into an 8-bit binary number.

Discussion: Why must the computer eventually see


these as binary rather than denary?
UNICODE: THE UNIVERSAL
SOLUTION
GLOBAL STANDARD
Created to provide a unique number for every character,
regardless of platform, program, or language.

16-BIT+ ENCODING
Typically uses 16 bits or more, allowing for over 65,536
unique characters (216).

UNIVERSAL SUPPORT
Represents almost every written language, ancient scripts, and
modern emojis.

BACKWARD COMPATIBLE
The first 128 characters are identical to ASCII, ensuring
seamless integration with older systems.
ASCII VS UNICODE

FEATURE ASCII UNICODE


Bits 7 or 8 Bits 16 or 32 Bits
Characters 128 or 256 65,536+
Languages English/Latin Global Support
Storage 1 Byte 2-4 Bytes

Unicode is the modern standard for global communication,


while ASCII remains efficient for simple English text.
QUIZ: TEST YOUR KNOWLEDGE

01 02
How many bits does Standard ASCII use to Why would a developer choose Unicode over
represent a character? ASCII for a messaging app?
Answer: 7 bits (providing 128 unique codes). Answer: To support global languages and modern symbols
like emojis.

03 04
If a character set uses 10 bits, how many unique True or False: The binary code for 'b' is smaller
characters can it represent? than the binary code for 'a'.
Answer: 1,024 characters (210 = 1024). Answer: False. They are ordered logically (a=97, b=98).
SUMMARY & KEY
TAKEAWAYS
■ Binary All text must be converted to binary using a
Foundation: character set dictionary.

■ ASCII:A legacy 7/8-bit standard efficient for English but


limited to 128/256 characters.

■ Unicode:The modern 16/32-bit standard enabling global


communication and emojis.

IGCSE EXAM TIP


When comparing ASCII and Unicode, always mention the
number of bits used and the range of characters supported.
Remember that Unicode is backward compatible with ASCII!

You might also like