0% found this document useful (0 votes)

4 views13 pages

Data Processing Notes

This document provides comprehensive notes on data processing, covering key terms, the data processing cycle, methods of data collection, types of errors, data integrity, and various data processing methods. It details the stages involved in transforming raw data into meaningful information and discusses the importance of accuracy and validation in data entry. Additionally, it outlines different file organization methods and electronic data processing modes, highlighting factors to consider when selecting a data processing approach.

Uploaded by

mitengvirginia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views13 pages

Data Processing Notes

Uploaded by

mitengvirginia

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

COMPUTER STUDIES

Form Three Notes

CHAPTER 2
DATA PROCESSING
Comprehensive Study Notes
1. Definition of Key Terms

Data
Data is a collection of raw facts — figures, letters, characters, symbols — that convey little or no
meaning on their own without processing.

Information
Information is data that has been processed and is meaningful to the user. It must be available in the
form the user needs it, when they need it.

Data Processing
Data processing refers to the process of transforming raw facts (data) into meaningful output — i.e.,
information.

Data Processing Cycle

The data processing cycle refers to the stages of Input → Process → Output that data goes through
to be transformed into information.

■ Note: Remember: GIGO — Garbage In, Garbage Out. The accuracy of output depends entirely on
the accuracy of input data.

2. Data Processing Cycle

The data processing cycle has four primary stages:

Stage Description
1. Data Collection Gathering raw data from its point of origin for processing purposes.
2. Data Input Converting collected data from human-readable form to machine-readable form.
3. Processing Transformation of input data by the CPU into a more meaningful output.
4. Output The final activity — producing desired information and distributing it to target groups.
Fig 1: Electronic Data Processing — Source data is entered into a computer, processed by the CPU, and printed as
output.

3. Data Collection

Methods of Data Collection

• Interview — direct questioning of respondents
• Questionnaire — written set of questions distributed to respondents
• Observation — watching and recording events as they happen
• Record Inspection — examining existing documents and records

Stages of Data Collection

Depending on the method used, data collection may involve these stages:

• Data Creation — Putting together facts in an organised format (manually prepared documents
or captured using scanners, digital cameras, etc.)
• Data Transmission — Transferring data from the point of collection to the processing point
(electronically via computer-to-computer, or physically by post).
• Data Preparation — Converting data from source document to machine-readable form.
• Media Conversion — Converting data from one medium to another (e.g. CD to hard disk for
faster input).
• Input Validation — Subjecting entered data to validity and verification checks before
processing.
• Sorting — Arranging source documents in a particular order for easy and faster data entry.
Verification vs. Validation
Verification Validation
Checking that what is on the input document is exactly
Identification
the same as and
what
removal
is entered
of errors
into the
by the
computer.
computer through the co

4. Errors in Data Processing

The accuracy of data entered in the computer determines the accuracy of the information produced.
There are three main types of errors:

(a) Transcription Errors

These errors occur during data entry and include:
• Misreading errors — Incorrect reading of a source document leading to wrong values being
entered (e.g. reading '5' as 'S', or letter 'O' as zero '0'). Usually caused by bad handwriting.
• Transposition errors — Incorrect arrangement of characters, i.e. putting characters in the
wrong order (e.g. entering 524 instead of 542).
■ Note: Transcription errors can be eliminated by using data capture devices such as scanners and
barcode readers.

(b) Computational Errors

These occur when an arithmetic operation does not produce the expected result:
• Overflow errors — The result of a calculation is too large to be stored in the allocated memory
space (e.g. storing a 9-bit result in an 8-bit memory location).
• Truncation errors — Real numbers with long fractional parts are cut off to fit in allocated
memory (e.g. 0.854692 truncated to 0.854).
• Rounding errors — A digit is raised or lowered to the required rounded number (e.g. 3.59
rounded to 3.6).

(c) Algorithm / Logical Errors

These errors occur as a result of wrong algorithm design — the program logic is incorrect even
though the syntax may be fine.

5. Data Integrity

Definition
Data integrity refers to the accuracy and completeness of data entered in a computer or received
from an information system.

Factors that Determine Data Integrity

• Accuracy — Whether the data/information is true or correct. Computers produce accurate
results as long as correct instructions and data are entered.
• Timeliness — Whether information is available when needed. Outdated information has little
or no value in decision-making.
• Relevance — Data entered must be pertinent to the processing needs at hand.
• Audibility (Verifiability) — The ability of users to check the accuracy and completeness of
information.

Ways to Minimise Threats to Data Integrity

• Use error detection and correction software when transmitting data.
• Design user interfaces that minimise chances of invalid data entry.
• Use devices that capture data directly from the source (e.g. scanners).
• Control access to data by enforcing security measures.
• Back up data, preferably on external storage media.

6. Data Processing Methods

Data can be processed using one of three methods:

(a) Manual Data Processing

Staff use laid-down procedures with pen and paper. No machines are used — only simple tools like
tables and rulers. Tasks include collecting, processing, and distributing information.

Fig 2: Manual Data Processing — Human brain processes data from in-tray to out-tray.
(b) Mechanical Data Processing
Staff use mechanical machines such as calculators, typewriters, cash registers, and duplicating
machines to perform operations.

Fig 3: A Manual Typewriter — an example of a mechanical data processing tool.

(c) Electronic Data Processing

Data is manipulated using electronic machines (computers, mobile phones, washing machines,
digital TVs) to produce information. This method is faster and more accurate, especially for large
volumes of data.

■ Note: The first large-scale electronic general-purpose computer was the ENIAC (Electronic
Numeric Integrator and Calculator).
Fig 4: ENIAC — Electronic Numeric Integrator and Calculator, one of the earliest computers.

Factors Determining Choice of Data Processing Method

• Size and type of business
• Timing aspects (how urgently information is needed)
• Link between applications

7. Computer Files

Definition
A file is a collection of related records that give a complete set of information about a certain item or
entity. Files can be stored manually (in a file cabinet) or electronically (in a computer storage device).

Elements of a Computer File

• Characters — The smallest element. A single letter, number, or symbol that can be entered,
stored, and output.
• Field — A single character or collection of characters representing one piece of data (e.g. an
employee's name is a field).
• Record — A collection of related fields representing a single entity (e.g. Name, ID No., Sex,
Department = one employee record).

Logical vs. Physical File

Logical File Physical File
The way the user views the file — its contents and the
The
processing
actual arrangement
to be done of
onfile
them.
contents on the storage media surfa

Advantages of Computerised Filing

• Information takes up less physical space than manual systems.
• Enhances data integrity and reduces duplication.
• Offers faster access and retrieval of data.
• Much easier to update or modify information.

8. Types of Computer Processing Files

File Type Description
Master File The main file containing permanent records. Has both static fields (rarely change, e.g. name,
Transaction File Holds temporary incoming or outgoing data about an organisation's activities over a period of
(Movement File)
Reference File Permanent or semi-permanent file used for reference/look-up purposes (e.g. price lists, PAYE
Sort File Created from existing transaction or master files. Records are sorted in ascending or descend
Back-up File Duplicate copies of existing files. Created whenever an update is carried out. Used in case of
Report File Contains sets of records extracted from master files, used to prepare reports for later printing

9. File Processing Activities

• Updating — Changing data in a master file to reflect the current status.
• Referencing — Accessing a record to see its contents without altering it.
• Sorting — Arranging file contents into a predetermined sequence of the key field.
• Merging — Combining the contents of two or more input files into one output file.
• Matching — Comparing input file records to ensure the same records exist in both files.
• Summarising — Accumulating records of interest from a file to form a single record in an
output file.
• Searching — Looking for a specific record of interest within a file.

File Updating Terms

• Hit Rate — The proportion of a master file's records that are active/processed. Formula:
(Transactions ÷ Total Records) × 100. E.g. 600 ÷ 12,000 × 100 = 5%.
• Volatility — The frequency with which records are added or deleted. High frequency = 'volatile'
file; low frequency = 'static' file.
• Size — The total number of records stored in the file.
• Growth — Files grow as new records are added.
10. File Organisation Methods
File organisation is the arrangement of records within a particular file. There are four main methods:

(a) Sequential File Organisation

Records are stored and accessed in a sorted order using a key field. Searching starts at the
beginning and proceeds to the end until the record is found. Mainly used with magnetic tapes.

Advantages Disadvantages
Simple to understand and organise. Entire file must be read even with very low activity rate.
Easy to maintain. Random enquiries are impossible.
Inexpensive storage media. Data redundancy is typically high.

(b) Serial File Organisation

Records are laid out contiguously one after another in no particular sequence — stored in the same
order they arrive. No relationship exists between contiguous records. Used with magnetic tapes.

(c) Random (Direct) File Organisation

Records are stored randomly but accessed directly. A record key determines where a record is stored
on the media. Used with magnetic and optical disks.

Advantages Disadvantages
Records are quickly accessed. Data may be accidentally erased or overwritten.
File update is easily achieved. Expensive hardware and software required.
No indexes required. Complex and costly system design.

(d) Indexed Sequential File Organisation

Similar to sequential organisation, but an index is used to help the computer locate individual records
on the storage media. Used with magnetic disks.

Advantages Disadvantages
Records can be accessed sequentially or randomly. Storage medium is relatively expensive.
Records are not duplicated. Sequential access is time-consuming.
Fast random access. Sequential processing may introduce redundancy.
Fig 5: Records on a Magnetic Tape — showing unblocked (single) and blocked (multiple) records with Inter-Record
Gaps (IRG).

11. Electronic Data Processing Modes

There are eight main modes of electronic data processing:

(a) On-line Processing

Results are available immediately. All peripherals are under direct control of the CPU. Users can
interact with the system at any time using input/output facilities.
• Applications: Banking, Stock Exchange, Stock Control, Water/Electricity Billing.
• Advantages: Files kept up to date; information readily available; file enquiries possible via
terminals.
• Disadvantages: Complex to develop; costly hardware, software, and storage media.

(b) Time Sharing Processing

The CPU serves two or more users with different processing requirements. Processor time is divided
into time slices allocated equally to all jobs in a queue. Incomplete jobs return to the tail of the queue.
• Applications: Bureaus, learning institutions, companies.
• Advantages: Fast information output; file enquiries possible; user interaction supported.
• Disadvantages: User has no control over the central computer; poor data security; slow
response with many tasks.

(c) Real Time Processing

The computer processes incoming data immediately as it occurs, updates the transaction file, and
gives an immediate response that affects events as they happen.
• Applications: Airline reservation, hotel reservation, chemical plant processing.
• Advantages: Information instantly available; immediate control; fast and reliable.
• Disadvantages: Requires complex and expensive OS; not easy to develop; requires Front
End Processors (FEPs).

(d) Multi-programming / Multi-tasking

More than one program is executed apparently at the same time by a single CPU. The OS allocates
each program a time slice and determines execution order.
• Advantages: Increases CPU productivity; reduces peripheral-bound operations.
• Disadvantages: Requires more expensive CPUs; complex operating system.

(e) Distributed Processing

Processing tasks are divided and assigned to two or more computers at physically separate sites,
connected by data transmission media. Different database tables can reside on separate computers.
• Application: Banks — customers served from branches while data is updated at the head
branch.
• Advantages: Less risk of total system breakdown; reduced data loss; reduced load on host
computer.
• Disadvantages: Expensive communication costs; sophisticated software required.

(f) Batch Processing

Transactions are accumulated over a period of time (daily, weekly, monthly) and processed all at
once at a pre-specified time.
• Application: Payroll processing.
• Advantages: Simple to develop; timing of reports not critical; low unit processing cost.
• Disadvantages: Time lag between transaction origin and information availability; not suitable
for instant decisions; difficult priority scheduling.

(g) Multi-processing
More than one task is processed simultaneously on different processors within the same computer.
The computer contains more than one independent CPU working in a coordinated way.

(h) Interactive Processing

There is continuous dialogue between the user and the computer. The program keeps prompting the
user to provide input or respond to prompts displayed on screen.

12. Factors to Consider When Selecting a Data Processing Mode

• The need for direct information retrieval and/or file interrogation.
• Control over resources (files, input/output devices).
• Cost of acquiring relevant hardware, software, and media.
• Optimisation of processing time.
• Time factor of information needed for managerial decision-making.

Review Questions
1. Define: (a) Data Processing, (b) Data Processing Cycle.
2. Using an illustration, describe the four primary stages of the data processing cycle.
3. Outline the stages of data collection.
4. What is the relevance of GIGO (Garbage In Garbage Out) to errors in data processing?
5. Explain the two types of transcription errors.
6. State three types of computational errors.
7. Define the term data integrity.
8. Give three factors that determine the integrity of data.
9. State at least five ways of minimising threats to data integrity.
10. Distinguish between data and information.
11. Describe the types of data processing methods.
12. Distinguish between manual, mechanical, and electronic data processing.

Model Answers (Selected)

Answer 1:
(a) Data processing refers to the transformation of raw data into meaningful output (information). (b)
The data processing cycle refers to the stages (Data Collection → Data Input → Processing →
Output) that data goes through during its transformation into information.

Answer 3 — Stages of Data Collection:

• Data Creation
• Data Transmission
• Data Preparation
• Media Conversion
• Input Validation
• Sorting

Answer 6 — Computational Errors:

• Overflow errors
• Truncation errors
• Rounding errors

Data Processing Concepts and Methods
No ratings yet
Data Processing Concepts and Methods
21 pages
GIGO and Data Processing Errors
No ratings yet
GIGO and Data Processing Errors
12 pages
Understanding Data Processing Methods
No ratings yet
Understanding Data Processing Methods
4 pages
Data Processing and File Organization Course
No ratings yet
Data Processing and File Organization Course
12 pages
Understanding Data and Information Systems
No ratings yet
Understanding Data and Information Systems
208 pages
Data Processing Cycle Explained
No ratings yet
Data Processing Cycle Explained
7 pages
Data Processing Fundamentals Explained
No ratings yet
Data Processing Fundamentals Explained
11 pages
Data Processing Overview and Methods
No ratings yet
Data Processing Overview and Methods
31 pages
Understanding Data and Information Processing
No ratings yet
Understanding Data and Information Processing
5 pages
Computer Systems and Data Processing
No ratings yet
Computer Systems and Data Processing
35 pages
CSEC Information Technology Overview
No ratings yet
CSEC Information Technology Overview
6 pages
Data Processing Cycle
No ratings yet
Data Processing Cycle
8 pages
Understanding Information Processing Systems
No ratings yet
Understanding Information Processing Systems
15 pages
Data Processing and Verification Techniques
No ratings yet
Data Processing and Verification Techniques
8 pages
Overview of Data Processing Systems
No ratings yet
Overview of Data Processing Systems
2 pages
Data Processing System Overview
No ratings yet
Data Processing System Overview
2 pages
Business Intelligence Learning Outcomes
No ratings yet
Business Intelligence Learning Outcomes
65 pages
Computer Studies Notes Form 3
No ratings yet
Computer Studies Notes Form 3
54 pages
Lecture 10
No ratings yet
Lecture 10
6 pages
Data Processing
No ratings yet
Data Processing
28 pages
Data Processing Methods Explained
No ratings yet
Data Processing Methods Explained
7 pages
Computer File Structures Explained
No ratings yet
Computer File Structures Explained
13 pages
Computer Data Conversion Basics
No ratings yet
Computer Data Conversion Basics
32 pages
Data Processing Cycle Explained
No ratings yet
Data Processing Cycle Explained
11 pages
Understanding Information Processing Basics
No ratings yet
Understanding Information Processing Basics
44 pages
Data Processing Methods in Computers
No ratings yet
Data Processing Methods in Computers
1 page
Business Data Processing Overview
0% (1)
Business Data Processing Overview
101 pages
Expanded Data Processing Cycle Overview
No ratings yet
Expanded Data Processing Cycle Overview
7 pages
Understanding Data Processing Cycle
No ratings yet
Understanding Data Processing Cycle
8 pages
Data Processing and Information Flow
No ratings yet
Data Processing and Information Flow
10 pages
Data Processing Concepts Explained
No ratings yet
Data Processing Concepts Explained
13 pages
Data Processing Concepts and Cycle
No ratings yet
Data Processing Concepts and Cycle
53 pages
Data Processing Techniques Overview
No ratings yet
Data Processing Techniques Overview
43 pages
Understanding Data Processing Concepts
100% (1)
Understanding Data Processing Concepts
6 pages
Data Processing: Sources, Types, and Quality
No ratings yet
Data Processing: Sources, Types, and Quality
11 pages
CHAPTER 5 Data Processing
No ratings yet
CHAPTER 5 Data Processing
4 pages
Understanding Data and Processing Types
100% (2)
Understanding Data and Processing Types
18 pages
Understanding Data Processing Basics
No ratings yet
Understanding Data Processing Basics
13 pages
Form 3 Data Processing Overview
No ratings yet
Form 3 Data Processing Overview
1 page
JSS1 Data Processing
No ratings yet
JSS1 Data Processing
3 pages
Data Processing Methods and Steps
No ratings yet
Data Processing Methods and Steps
10 pages
Overview of Data Processing Types
50% (2)
Overview of Data Processing Types
21 pages
Data Basics and Processing Overview
No ratings yet
Data Basics and Processing Overview
62 pages
Data Processing Cycle Explained
No ratings yet
Data Processing Cycle Explained
8 pages
File Processing Operations Overview
No ratings yet
File Processing Operations Overview
7 pages
Data Processing Techniques Explained
No ratings yet
Data Processing Techniques Explained
29 pages
Understanding Data and Information
No ratings yet
Understanding Data and Information
11 pages
Data Processing Cycle and Techniques
No ratings yet
Data Processing Cycle and Techniques
7 pages
SS2 Data Processing Lesson Note
No ratings yet
SS2 Data Processing Lesson Note
29 pages
Data Processing Cycle Explained
No ratings yet
Data Processing Cycle Explained
26 pages
Data Processing Techniques Overview
No ratings yet
Data Processing Techniques Overview
29 pages
Data Processing Cycles Explained
No ratings yet
Data Processing Cycles Explained
24 pages
Data Processing
No ratings yet
Data Processing
29 pages
Data Processing Concepts and Techniques
No ratings yet
Data Processing Concepts and Techniques
69 pages
Validation and Verification
No ratings yet
Validation and Verification
7 pages
Left
No ratings yet
Left
18 pages
Introduction to Data Processing Basics
No ratings yet
Introduction to Data Processing Basics
23 pages
Hydrological Cycle and Rivers Notes-1
No ratings yet
Hydrological Cycle and Rivers Notes-1
36 pages
Soils Notes 1
No ratings yet
Soils Notes 1
17 pages
Form 2 End Term 3 Exam Marking Scheme
No ratings yet
Form 2 End Term 3 Exam Marking Scheme
113 pages
Elementary Programming Principles Notes
No ratings yet
Elementary Programming Principles Notes
19 pages
Desktop Publishing & Internet Basics 2025
No ratings yet
Desktop Publishing & Internet Basics 2025
5 pages
CREATION, COVENANTS, AND PROPHECIES IN C.R.E.
No ratings yet
CREATION, COVENANTS, AND PROPHECIES IN C.R.E.
4 pages
2024 Physics Exam Paper for Form II
No ratings yet
2024 Physics Exam Paper for Form II
9 pages
Murang’a South Form Two History Exam
No ratings yet
Murang’a South Form Two History Exam
9 pages
Form Two CRE Mid Term Exam 2021
No ratings yet
Form Two CRE Mid Term Exam 2021
3 pages
Chemistry Form 2 Mid Term Exam 2021
No ratings yet
Chemistry Form 2 Mid Term Exam 2021
4 pages
2021 History Form 2 Midterm Exam Questions
No ratings yet
2021 History Form 2 Midterm Exam Questions
2 pages
Types of Software Explained for Class 5
No ratings yet
Types of Software Explained for Class 5
6 pages
Form 1 Term 2 Exam Papers
No ratings yet
Form 1 Term 2 Exam Papers
70 pages
Overview of Secondary Storage Devices
No ratings yet
Overview of Secondary Storage Devices
16 pages
Unit4 Search Sort Hash
No ratings yet
Unit4 Search Sort Hash
20 pages
Class 8 AI Educational Resource PDF
No ratings yet
Class 8 AI Educational Resource PDF
1 page
SQL Query Optimization Techniques
No ratings yet
SQL Query Optimization Techniques
72 pages
Crowdsourced Civic Issue Reporting Survey
No ratings yet
Crowdsourced Civic Issue Reporting Survey
4 pages
Top 10 Computer Science Articles August 2025
No ratings yet
Top 10 Computer Science Articles August 2025
11 pages
Nvidia Learning Training Course Catalog
No ratings yet
Nvidia Learning Training Course Catalog
33 pages
LocalizeJs Integration in Software Projects
No ratings yet
LocalizeJs Integration in Software Projects
1 page
Cambridge IT Theory Exam Paper 2017
No ratings yet
Cambridge IT Theory Exam Paper 2017
692 pages
CS3352 Data Science Course Overview
No ratings yet
CS3352 Data Science Course Overview
3 pages
Data Engineering Roadmap Overview
No ratings yet
Data Engineering Roadmap Overview
3 pages
Automated Question Generator Using NLP
No ratings yet
Automated Question Generator Using NLP
8 pages
Kademlia Protocol in Gossip Systems
No ratings yet
Kademlia Protocol in Gossip Systems
4 pages
UX Designer Portfolio of Taruna Raina
No ratings yet
UX Designer Portfolio of Taruna Raina
1 page
Fuzzy Logic-Based Cryptography Algorithm
No ratings yet
Fuzzy Logic-Based Cryptography Algorithm
6 pages
Coating File
No ratings yet
Coating File
1 page
Class X AI Common Exam 2025-26
No ratings yet
Class X AI Common Exam 2025-26
5 pages
Node.js Coding & Database Standards SOP
No ratings yet
Node.js Coding & Database Standards SOP
2 pages
Mongo DB
No ratings yet
Mongo DB
17 pages
Database Management Systems Exam Guide
No ratings yet
Database Management Systems Exam Guide
2 pages
Tomcat Documentation V1.0
No ratings yet
Tomcat Documentation V1.0
7 pages
Beginner's Guide to Artificial Intelligence
No ratings yet
Beginner's Guide to Artificial Intelligence
45 pages
Final Exam: Data Structures Overview
No ratings yet
Final Exam: Data Structures Overview
2 pages
Product Cipher Design and Implementation
No ratings yet
Product Cipher Design and Implementation
7 pages
Mdpi 2018-n2 042-063 Authsub
No ratings yet
Mdpi 2018-n2 042-063 Authsub
22 pages
Video Frame Detection Using SIFT and AI
No ratings yet
Video Frame Detection Using SIFT and AI
5 pages
ICCAIS 2024 Conference Details
No ratings yet
ICCAIS 2024 Conference Details
1 page
Understanding Data Processing in CSC 227
No ratings yet
Understanding Data Processing in CSC 227
17 pages
AI Tools for Pharmacovigilance Insights
No ratings yet
AI Tools for Pharmacovigilance Insights
1 page
OPP360 Presentation 3
No ratings yet
OPP360 Presentation 3
31 pages
LangChain and RAG Overview
No ratings yet
LangChain and RAG Overview
32 pages

Data Processing Notes

Uploaded by

Data Processing Notes

Uploaded by

COMPUTER STUDIES

Form Three Notes

Data Processing Cycle

2. Data Processing Cycle

Methods of Data Collection

Stages of Data Collection

4. Errors in Data Processing

(a) Transcription Errors

(b) Computational Errors

(c) Algorithm / Logical Errors

Factors that Determine Data Integrity

Ways to Minimise Threats to Data Integrity

6. Data Processing Methods

(a) Manual Data Processing

Fig 3: A Manual Typewriter — an example of a mechanical data processing tool.

(c) Electronic Data Processing

Factors Determining Choice of Data Processing Method

Elements of a Computer File

Logical vs. Physical File

Advantages of Computerised Filing

8. Types of Computer Processing Files

9. File Processing Activities

File Updating Terms

(a) Sequential File Organisation

(b) Serial File Organisation

(c) Random (Direct) File Organisation

(d) Indexed Sequential File Organisation

11. Electronic Data Processing Modes

(a) On-line Processing

(b) Time Sharing Processing

(c) Real Time Processing

(d) Multi-programming / Multi-tasking

(e) Distributed Processing

(f) Batch Processing

(h) Interactive Processing

12. Factors to Consider When Selecting a Data Processing Mode

Model Answers (Selected)

Answer 3 — Stages of Data Collection:

Answer 6 — Computational Errors:

You might also like