Data Organization and Analysis Techniques
Data Organization and Analysis Techniques
SECTION
8 DATA ORGANISATION,
ANALYSIS AND
PRESENTATION
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
INTRODUCTION
Data is important in today’s world and it comes in different types like numbers,
categories and rankings. We can collect and organise data in many ways, then
illustrate it using graphs, charts or tables to make it easier to understand. To
summarise data we use the mean, median and mode, which give us one number
that shows the “average” of the data. These numbers help us compare and make
conclusions. Doing real-life projects with data helps us learn how to collect,
analyse and present information. These projects can be connected to different
subjects, like science, history, geography and economics. Learning these skills
helps us make smart decisions and prepares us for more advanced studies in areas
like statistics and data science. In this section, we will explore the collection and
study of data in order to get facts or information. This process is termed ‘Data
collection’ or ‘collection of data’. In JHS, you have learnt some data types and
how to analyse them, so let us do a quick recall of some of the things you have
learnt on data and data types.
2
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Key Ideas
3
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
WHAT IS DATA?
Data refers to the collection/gathering of pieces of information or facts in order
to analyse them, make decisions or gain insights. Data can be in various forms,
such as numbers, text, images or audio. In a more technical sense, data is often
raw and unprocessed (ie, it has yet to be organised or analysed) to reveal patterns
or meaning. In short, data is information gathered from various sources through
observation or from results of an experiment.
For example:
• Numbers like 24, 55 and 12 are data points that can represent ages,
temperatures, or other quantities.
• A list of customer reviews and their ratings is data that can be analysed to
understand customer satisfaction.
Data becomes more useful when it is processed and interpreted to generate
information and knowledge. In fields like statistics, data science and information
technology, data plays a crucial role in drawing conclusions and making informed
decisions.
Let us now consider some types of data
Types of Data
Primary and secondary data are two fundamental types of data used in research
and analysis. They serve different purposes and have distinct characteristics, each
contributing to the understanding of a given subject.
Primary data
Primary data refers to information collected first-hand, specifically for a particular
research study or investigation. This data is original and directly obtained from the
source. Primary data collection methods include surveys, interviews, experiments,
observations or direct measurements.
Secondary data
Secondary data refers to existing data that has been collected by someone else for
a different purpose but can be repurposed for new research or analysis. Secondary
data sources include published literature, reports, databases, government records
and data collected by other researchers.
4
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Example 1
Height (in inches) – the raw data: 62, 65, 67, 62, 70, 65, 66, 64, 67, 71
Table 1: Ungrouped Frequency Distribution
In the above frequency table, each data point is unique and stands alone, this
makes it ungrouped.
In the table above, the heights have been grouped into three categories: 60-64
inches, 65-69 inches and 70-74 inches. The frequency column represents the
5
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
number of observations falling within each category. This means the data is
grouped.
Let us now look at and draw out the differences, with examples, between discrete
and continuous data.
Continuous data
This is the unspecified number of possible measurements between two presumed
points. Continuous data is a type of data that can take any value within a certain
range or interval, including fractions and decimals. It represents measurable
values that can be divided into smaller parts and can be expressed on a continuous
scale. For example, time can be measured in hours, minutes, seconds and fractions
of seconds, such as [Link].123. Also, a person’s height can be measured in
6
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
centimetres or inches and can include fractional values, like 175.5 cm or 68.3
inches etc.
Other examples are;
• The weather temperature
• The wind speed
• The weight of the children
With these definitions and examples, I hope you can state some differences
between the various types of data. Discuss them with your classmates.
Activity 1
Suppose these are the exam scores of some of your classmates in Mathematics
in the last semester.
70, 60, 87, 90, 65, 90, 65, 76, 87, 95, 71, 85, 90, 62, 90
Below are the steps to help organise these exams scores in an ungrouped
frequency distribution table.
Step 1: Identify all the unique values in the dataset (in this case, exam scores)
from the smallest to the largest. These values will form the basis of your
table’s categories. These are 60, 63, 65, …, 98
Step 2: For each unique value, count how many times it appears in the dataset.
Step 3: Draw a frequency distribution table with two columns: one for the
unique values (exam scores) and one for their frequencies.
Step 4: Organise this information into the table.
Step 5: Ensure that the total of the frequencies equals the total number of
data points. This step verifies that all data points have been accounted for.
Total Frequency: 2 + 1 + 2 + 1 + 1 + 1 + 1 + 2 + 3 + 1 = 15
7
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Step 6: Review the data entry by double-checking for any errors in the
frequency counts.
Table 3: Frequency Distribution
Let us proceed to that of a grouped frequency distribution table using the same
exam scores.
Activity 2
Raw Data of the exam scores: 70, 60, 87, 90, 65, 90, 65, 76, 87, 95, 71, 85,
90, 62, 98
Step 1: Determine the Number of Groups (categories or intervals) you want
to create. Typically, 5-7 groups are sufficient.
Step 2: Determine the Group Size:
• Calculate the range of the data (maximum value - minimum value).
8
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Step 5:Review the data entry by double-checking for any errors in the
frequency counts.
Table 4: Grouped Frequency Distribution
9
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Qualitative data
Qualitative data is descriptive, referring to things that can be observed but not
measured, such as colours or emotions. Data collected about a categorical variable
will always be qualitative.
Characteristics of qualitative data:
• It is non-numerical: Qualitative data is descriptive, not numerical.
• Qualitative data is based on personal opinions and experiences.
• Qualitative data is contextual, meaning it is influenced by the research
context
• It is rich and detailed: Qualitative data provides in-depth insights.
• It is flexible: Qualitative data collection methods can be adapted during
research.
Examples of data that can be considered as qualitative;
• Maxjane has curly brown hair.
• They have brown eyes.
• Yamusah is funny, loud and a good listener.
• They can also be quite impatient and impulsive at times.
• Tinyor Raymond drives a red car
• They have a very friendly face and a contagious laugh.
10
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
11
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Take note of the differences between all these different types of data collection
and discuss them with your classmates. Then have a look at the following samples
of survey questionnaires, observation checklist and interview guide in the activity
below.
Activity 3
Identify some features of each of the sample data collection tools below. Share
these with your classmates.
Disagree
Disagree
Strongly
Strongly
Neutral
Agree
Agree
12
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Q9: How would you rate your overall experience at the QRZ Family
Restaurant?
Highly satisfactory
Satisfactory
Neutral
Unsatisfactory
Highly Unsatisfactory
Q10: What could we do to make your restaurant dining experience better?
_______________________________________________________
_______________________________________________________
_______________________________________________________
Q11. Demographic Data
Name (optional):_________________________________
Age: ____
Gender: ________________________________________
Number of Family Members:
1-2
3-5
6-10
more than 10
Email Address (optional): _______________________________
Notes: This section is optional. The questions asking for demographic data
should be relevant to the survey goal and must point to the characteristics of
the target population.
IV. Thank you for sharing your thoughts with us. We hope you enjoy
dining at QRZ Family Restaurant and look forward to seeing you
back with us soon.
Note: This section may also include further information regarding on how to
claim the incentive that you wish to provide to the respondent.
13
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
14
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Look for other samples and discuss the features of each of them with your
classmates.
Activity 4
15
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Frequency Tables
Ungrouped data
Example 2
A jar contains beads of different colours. Here is the raw data for this: red, green,
blue, black, red, green, blue, yellow, red, red, green, green, green, yellow, red,
green, yellow.
Organise the data in a frequency distribution table.
Solution
16
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Grouped data
Example 3
The raw data below represents the number of trees that survived in a survey by
100 schools who decided to plant 100 tree saplings in their gardens on world
environment day.
95, 67, 28, 32, 65, 65, 69, 33, 98, 96, 76, 42, 32, 38, 42, 40, 40, 69, 95, 92, 75, 83,
76, 83, 85, 62, 37, 65, 63, 42, 89, 65, 73, 81, 49, 52, 64, 76, 83, 92, 93, 68, 52, 79,
81, 83, 59, 82, 75, 82, 86, 90, 44, 62, 31, 36, 38, 42, 39, 83, 87, 56, 58, 23, 35, 76,
83, 85, 30, 68, 69, 83, 86, 43, 45, 39, 83, 75, 66, 83, 92, 75, 89, 66, 91, 27, 88, 89,
93, 42, 53, 69, 90, 55, 66, 49, 52, 83, 34, 36
Represent the given data in the form of a grouped frequency distribution table.
Solution
Line Graphs
A line graph is a type of chart used to display data points connected by straight
line segments
17
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Example 4
The temperature recorded in a city from 5 a.m. to 8 p.m. on a day was recorded
in the form of a line graph as shown below. Study the graph and answer the
following questions.
Solution
18
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Bar Graphs
A bar graph (or bar chart) is a visual representation of data using rectangular
bars, where the length or height of each bar corresponds to the value it represents.
Steps in drawing a bar graph
1. Choose a suitable scale for the xand y axes.
2. Draw two perpendicular lines and label them:
x-axis (horizontal): categories
y-axis (vertical): numerical values
3. Decide on a uniform bar width.
4. Leave equal spaces between bars (at least 1/2 bar width).
5. Measure and draw each bar’s height according to its value.
6. Ensure bars are aligned with corresponding x-axis labels
7. Label each bar with its corresponding value (optional).
8. Add a title to the graph.
Example 5
The following table shows the percentage of monthly salary saved by each
employee in a 100-person company. Create a vertical bar graph to represent it
19
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Solution
Example 6
The bar chart below is the summary of cases and recoveries of corona virus
pandemic recorded in various regions in Ghana from March 2020 to August 2022.
20
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Solution
Pie Charts
Example 7
The data below shows the amount of money that Akosua spent on buying some
items:
• Pepper - GH4
• Onions - GH4
• Salt - GH2
• Fish - GH10
Draw a pie chart for the data.
Solution
21
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Fish 10 10 × 18 = 180° 10
__
20
× 360° =180°
Total 20 360° 360°
Step 3. Use a pair of compasses to draw a circle with a radius of at least 5cm.
Step 4. Use a protractor to draw (measure) the angles for each sector in the circle.
Step 5. Write and label each sector.
Step 6. It is usually appropriate to write the angles (magnitudes) inside the sectors.
22
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
The marketing department of a company has collected the following sales figures
(in thousands of Ghana cedis) for its top 10 sales representatives for the previous
quarter:
11, 18, 15, 22, 19, 14, 17, 20, 16, 22
Calculate the following measures of central tendency for the given sales figures:
1. Mode
2. Mean
3. Median
Solution
1. Calculating the Mode: The mode is the value that appears most frequently
in the data set. In the given data, 22 appears twice, everything else only
appears once, so the mode in this data set is 22.
2. Calculating the Mean: The mean is the sum of all the values divided by
the total number of values.
Sum of all values = 11 + 18 + 15 + 22 + 19 + 14 + 17 + 20 + 16 + 22
= 174
Total number of values = 10
174
Mean = ___
10
= 17.4
23
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
3. Calculating the Median: The median is the middle value when the data is
arranged in ascending or descending order. Arranging the data in ascending
order:
11, 14, 15, 16, 17, 18, 19, 20, 22, 22
Since there are 10 values, the median is the average of the 5th and 6th
values.
(17 + 18)
Median = _______
2
= 17.5
Therefore, the measures of central tendency for the given sales figures are:
• Mode: 22
• Mean: 17.4
• Median: 17.5
Now that you know how to calculate measures of central tendencies with a raw
data set, let us now look at how to find mean, median and mode from a frequency
distribution table.
The data below shows the marks obtained in a short science test.
Organise the data in a frequency distribution table and find the mode and median.
2 2 4 3 1 4 1 0
1 2 3 1 2 1 3 0
4 2 3 3 3 4 3 1
1 0 1 4 0 0 1 4
24
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Solution
To obtain the mode, look for the mark with the highest frequency. From our
frequency table “mark 1” had the highest frequency of 9, hence it is the mode of
the data set.
To obtain the median from the frequency table, follow these steps:
1. Find the total frequency. To do this, sum up all the frequencies.
2. Divide the total frequency by 2.
3. Find the item that correspond to the frequency from your answer in point
two above. In the example, total frequency = 32
32
∴ __
2
=16
From the frequency table the 16th and 17th frequencies fall within 2. Therefore, the
median is 2.
25
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
N.B.: When the total frequency is an even number, it means there are two numbers
in the middle. It means the median will halfway between the 16th and the 17th
number.
Example 10
The data below are marks obtained in a social studies test in JHS 1. Calculate the
mean for this data.
3 2 2 3 4
2 5 4 2 2
2 2 1 4 2
4 5 3 2 7
2 3 2 6 3
26
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Solution
∑f = 25, ∑fx = 77
_ ∑ fx
Mean ( x ) = _ 77
= _25
= 3.08
∑f
∴ the mean score is 3.08
27
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
You grew fifty carrots using special soil. You dig them up and measure their
lengths (to the nearest mm) and group the results.
Calculate the mean, median and mode of the lengths of carrots.
Solution
Step 1: Calculate the mean from the table below. In this case x is the mid-
point of the group.
28
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
8530
∑fxf = ____
_
Estimated Mean = ∑ 50
= 170.6 mm
Note, it is only an ‘estimated’ mean as we have not used each exact
carrot length, but only estimated by taking the mid-point of the group.
Step 2: Find the median group.
The median is the mean of the 25th and the 26th length, so it occurs in the
170 – 174 group.
Step 3: Find the mode, or, modal group
The modal group is the one with the highest frequency, which is
175 – 179:
Mean Median
The mean, also known as the The median represents the middle
average, is calculated by summing value in a dataset when arranged in
all the values in a dataset and ascending or descending order.
dividing by the total number of
observations.
It is influenced by extreme values or It is not affected by extreme values
outliers in the dataset. or outliers in the dataset.
The mean represents the arithmetic The median is particularly useful
centre or the typical value of the when dealing with skewed
dataset. distributions or datasets with
outliers.
It is widely used and provides a It provides a measure of the central
measure of the central tendency value that is robust to extreme
that takes into account all values in values.
the dataset.
29
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Solution
30
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
So, the mean can be used when we do not have extreme values in the data but the
median is better if we have extreme values.
To review and reinforce the concepts of mean, mode, and median from raw
data sets and frequency distribution tables follow these instructions.
Instructions:
1. Watch the Videos: Click on the following links to watch videos that
demonstrate how to find the mean, median and mode from both raw data
sets and frequency distribution tables
• Video on Finding Mean, Median and Mode from Raw Data Set
([Link]
31
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
2. Take Notes: While watching the videos, take notes on the key steps
and methods used to calculate the mean, median and mode. Pay special
attention to any examples provided in the videos.
3. Practice Problems: Think about what raw data/frequency distribution
tables you could use to make your own problems on which to practise.
Use these to challenge your classmates. Ensure all steps are shown, with
calculations clearly written.
4. Review and Discuss: After completing the practice problems, review
your answers with a classmate or in a group. Discuss any difficulties or
questions you might have. If needed, refer back to the videos, or ask your
teacher for clarification.
The table below is the summary of cases and recoveries of coronavirus recorded
in various regions in Ghana from March 2020 to 11th August 2022. Use the
information from the table to make recommendations to the government on
resource allocation to the various regions.
Table 15: Cases and recoveries of coronavirus in Ghana from March 2020 to 11th
August 2022
32
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
33
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Project 2
• Design Student Course Evaluation Questionnaires and use them to collect
data from students in your school.
• Analyse the data and make conclusions and recommendations based of the
results.
34
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
REVIEW QUESTIONS
35
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
Age Height(ft)
3 1.9
6 2.4
9 2.7
12 3.0
15 4.6
18 5.5
21 6.1
24 6.2
Fruit People
Apple 35
Orange 30
Banana 10
Kiwifruit 25
Blueberry 40
Grapes 5
8. The table below shows the number of boys and girls in all JHS classes at
Winneba Don Bosco B JHS.
Construct a bar chart for the data.
9. Find the mode of the data {14, 16, 16, 16, 17, 16, 18}.
36
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
10. The ages of the members of a community have been listed below:
{42, 38, 29, 37, 40, 33, 41}
Calculate the median of the given data.
11. Find the mean, median and mode for the following list of values:
13, 18, 13, 14, 13, 16, 14, 21, 13
12. Estimate the mean, median and mode for the data set.
Seconds Frequency
51 - 55 2
56 - 60 7
61 - 65 8
66- 70 4
13. The ages of the 112 people who live on a tropical island are grouped as
follows:
Age Number
0-9 20
10 - 19 21
20 - 29 23
30 - 39 16
40 - 49 11
50 - 59 10
60 - 69 7
70 - 79 3
80 - 89 1
(a) Estimate the mean, median and mode for the data.
(b) Analyse the three central scores (mean, median and mode) calculated
and justify why a particular one best represents the data
37
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
14. Estimate the mean, median and mode for the data set.
Analyse the three central scores (mean, median and mode) calculated and
justify why a particular one best represents the data.
15. State and explain, with practical examples two differences between mean
and median.
16. Explain, with practical examples the effect of an extreme value on the
mean
38
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
GLOSSARY
• Data: Raw facts or information.
• Data Organisation: Arranging data systematically.
• Data Analysis: Examining and interpreting data.
• Data Presentation: Showing data visually (charts, graphs).
• Qualitative Data: Descriptive, non-numerical data.
• Quantitative Data: Numerical, measurable data.
• Frequency Distribution: How often values occur in data.
• Mean: Average of data values, found by the sum of data pieces divided by
the total number of pieces of data.
• Median: Middle value in ordered data.
• Mode: Most frequent value in data.
• Range: Difference between highest and lowest values.
• Pie Chart: Circular chart showing parts of a whole.
• Bar Chart: Bars representing different data values.
• Line Graph: Graph showing changes over time.
• Outliers: Data points far from others.
• Trend: General direction of data over time.
• Survey: Method to collect data via questions.
• Census: Data collected from an entire population.
• Sample: Subset of a population for analysis.
EXTENDED READING
• Akrong Series: Core mathematics for Senior High Schools New
International Edition (Pages 645 – 698)
• Aki – Ola series : Core Mathematics for Senior High Schools in West
Africa, Millennium edition 5 (Pages 267 – 294)
• Baffour Asamoah, B. A. (2015). Baffour BA series: Core mathematics.
Accra: Mega Heights, (Pages 259 - 298)
39
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
REFERENCES
• Akrong Series: Core mathematics for Senior High Schools New
International Edition (Pages 645 – 698)
• Aki – Ola series: Core Mathematics for Senior High Schools in West
Africa, Millennium edition 5 (Pages 267 – 294)
• Baffour Asamoah, B. A. (2015). Baffour BA series: Core mathematics.
Accra: Mega Heights, (Pages 259 - 298)
• Basic mathematics for TTC volume one by [Link] Appau, October 2001
40
SECTION 8 DATA ORGANISATION, ANALYSIS AND PRESENTATION
ACKNOWLEDGEMENTS
List of Contributors
NAME INSTITUTION
Janet Waponade OLA SHS
Yamusah Issahaku Kumbungu SHS
Joseph Bakpil Nagbija Nchumuruman SHS, Chinderi
Modzaka Godfred Keta SHTS, Keta
41