Understanding Statistics: Measures & Examples

Chapter 13 of the statistics document covers the collection, organization, and interpretation of numerical data, focusing on measures of central tendency such as mean, median, and mode. It also explains concepts related to grouped data, measures of dispersion, and visual representations like histograms and box plots. Additionally, it includes examples and exercises to illustrate these statistical concepts.

Uploaded by

ngozikannadi7


STATISTICS – Chapter 13

LESSON 1
Statistics is a branch of mathematics that deals with the collection, organisation and interpretation of
numerical data.

Data can be continuous or discrete.

Continuous – information collected by measurement

Discrete – information collected by counting

Measures of central tendency from a list
Measures of central tendency – values that tell you where the middle (centre) of the data lies.

Mean – the average of the data set

$\bar{x} = \dfrac{\sum x}{n}$, where $\sum x$ is the sum of all the scores and $n$ is the number of values

Median – the middle value of the data set after the data has been arranged in ascending order.

Mode – the value that occurs most often.

Mean, median & mode of ungrouped data
Example 1:
The following shoe sizes were collected from 13 learners:
7 8 8 12 7 9 8 10 9 10 11 5 8

Determine:
a) Mean: $\bar{x} = \dfrac{\sum x}{n} = \dfrac{112}{13} = 8{,}62$

b) Median: 5 7 7 8 8 8 8 9 9 10 10 11 12 ∴ the median is the 7th value, 8

c) Mode: 8
Example 2:
The following test marks out of 20 were collected from 10 learners:
18 20 15 15 8 10 12 13 9 17

Determine:
a) Mean: $\bar{x} = \dfrac{\sum x}{n} = \dfrac{137}{10} = 13{,}7$

b) Median: 8 9 10 12 13 15 15 17 18 20 ∴ $\dfrac{13 + 15}{2} = 14$

c) Mode: 15
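Examples 1 and 2 can be checked with a short Python sketch. It assumes only the standard `statistics` module; the helper name `summarise` is made up for illustration and is not part of the notes.

```python
from statistics import mean, median, multimode

shoe_sizes = [7, 8, 8, 12, 7, 9, 8, 10, 9, 10, 11, 5, 8]   # Example 1
test_marks = [18, 20, 15, 15, 8, 10, 12, 13, 9, 17]        # Example 2

def summarise(values):
    """Return (mean rounded to 2 decimals, median, list of modes)."""
    return round(mean(values), 2), median(values), multimode(values)

print(summarise(shoe_sizes))
print(summarise(test_marks))
```

`multimode` returns a list because a data set can have more than one mode; for these two examples the list holds a single value.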
Mean, median, mode & range from grouped data

Example 3:
The frequency table below shows the marks of 125 Grade 12 Mathematics learners. Marks are
recorded as whole percentages.

Mark            Frequency (f)    Midpoint (x_i)        f·x_i
0 ≤ x < 20      6                (0 + 20)/2 = 10       6 × 10 = 60
20 ≤ x < 40     14               30                    420
40 ≤ x < 60     38               50                    1900
60 ≤ x < 80     45               70                    3150
80 ≤ x < 100    22               90                    1980
TOTAL:          125                                    7510
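Determine:
a) The estimated mean

$\bar{x} = \dfrac{\sum f \cdot x_i}{n} = \dfrac{7510}{125} = 60{,}08$

b) The modal class

60 ≤ x < 80

c) The class interval where the median is found

Position of the median $= \dfrac{n}{2} = \dfrac{125}{2} = 62{,}5$ ∴ 63rd value
The cumulative frequencies are 6; 20; 58; 103; 125, so the 63rd value falls in the fourth class.
∴ 60 ≤ x < 80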
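The three answers in Example 3 can be reproduced with the sketch below. The tuple layout `(lower, upper, frequency)` and the function name `grouped_summary` are assumptions chosen for illustration.

```python
from itertools import accumulate

# (lower bound, upper bound, frequency) for each class lower <= x < upper
classes = [(0, 20, 6), (20, 40, 14), (40, 60, 38), (60, 80, 45), (80, 100, 22)]

def grouped_summary(classes):
    n = sum(f for _, _, f in classes)
    # Estimated mean: every value in a class is represented by the class midpoint.
    est_mean = sum(f * (lo + hi) / 2 for lo, hi, f in classes) / n
    # Modal class: the class with the highest frequency.
    modal = max(classes, key=lambda c: c[2])[:2]
    # Median class: the first class whose cumulative frequency reaches n/2.
    cumulative = accumulate(f for _, _, f in classes)
    median_class = next(c[:2] for c, cf in zip(classes, cumulative) if cf >= n / 2)
    return est_mean, modal, median_class

print(grouped_summary(classes))
```

The mean is only an estimate because the individual marks inside each class are not known.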
5 number summary
Example (21 elements in the list):
34 35 35 38 41 41 44 45 46 47 47 51 51 54 55 56 57 62 74 75 82

• The median Q2 → the value that divides the data into two halves
  Q2 = 47
• The lower quartile Q1 → the median of the lower half of the data
  Q1 = 41
• The upper quartile Q3 → the median of the upper half of the data
  Q3 = 56,5
• The minimum value → the smallest value of the set
  min = 34
• The maximum value → the biggest value of the set
  max = 82
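A minimal sketch of the five number summary, assuming the "median of each half" method used in these notes (the middle value is left out of both halves when the number of values is odd). The function name `five_number_summary` is illustrative only.

```python
from statistics import median

def five_number_summary(values):
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]            # values below the median position
    upper = data[half + n % 2:]    # values above it (skips the middle value when n is odd)
    return min(data), median(lower), median(data), median(upper), max(data)

data = [34, 35, 35, 38, 41, 41, 44, 45, 46, 47, 47,
        51, 51, 54, 55, 56, 57, 62, 74, 75, 82]
print(five_number_summary(data))
```

Calculators and software packages sometimes use other quartile conventions, so their Q1 and Q3 can differ slightly from the values found by hand.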
Box and whiskers diagram
Example continued:

min = 34   Q1 = 41   Q2 = 47   Q3 = 56,5   max = 82

[Figure: box and whisker diagram of the data on a number line from 30 to 80]
Skewed and symmetrical data
Compare the mean and the median: mean – median, i.e. $\bar{x} - Q_2$

Symmetrical
Data is balanced, with the same number of scores on each side
of the median
mean – median = 0 ⟺ mean = median

Skewed to the right

Data is spread more to the right of the median,
positively skewed
mean – median > 0 ⟺ mean > median

Skewed to the left

Data is spread more to the left of the median,
negatively skewed
mean – median < 0 ⟺ mean < median
Histograms and frequency polygons
Marks out of 50    Frequency    Midpoint
0 ≤ x < 10         0            5
10 ≤ x < 20        25           15
20 ≤ x < 30        74           25
30 ≤ x < 40        66           35
40 ≤ x < 50        35           45
50 ≤ x < 60        0            55

[Figure: histogram showing learners' marks; x-axis: Marks (10 to 60), y-axis: Frequency (0 to 80)]
Measures of dispersion
Measures of dispersion – values that tell you how spread out or how closely grouped the data values are.
Range – the difference between the highest and the lowest score
$R = \max - \min$
Interquartile range – the difference between the 3rd and 1st quartiles
$IQR = Q_3 - Q_1$

Semi-interquartile range – half the difference between the 3rd and 1st quartiles
$SIQR = \dfrac{Q_3 - Q_1}{2}$

Quartiles – divide ordered data into 4 equal parts (into quarters)

Percentiles – divide ordered data into 100 equal parts
Example 5: (Do the questions for Grade 11A only!!)
The 32 learners from each of the Grade 11 classes wrote a maths test out of 60. Their
mathematics teachers recorded their marks on stem and leaf diagrams.

Stem    Grade 11 A    Grade 11 B      Grade 11 C
1       2345667       222335799999    4
2       0012          001223356789    07
3       0112579       1579            13
4       46899         0002            00111122669
5       01266788                      0112344667899
6       0                             000

Key: stem 1 with leaf 2 represents a mark of 12.

Determine for each class:

a) The mean and the five number summary
b) The difference between the mean and the median
c) Draw a box and whiskers diagram
d) Whether the data is skewed or symmetrical
e) The interquartile range
Example continued:
a)
             Grade 11 A          Grade 11 B         Grade 11 C
Mean (x̄)    1155/32 = 36,094    779/32 = 24,344    1484/32 = 46,375
Min value    12                  12                 14
Q1           20                  19                 41
Q2           36                  22                 49,5
Q3           50,5                30                 56
Max value    60                  42                 60

b) x̄ − median
Grade 11 A: 36,094 − 36 = 0,094
Grade 11 B: 24,344 − 22 = 2,344
Grade 11 C: 46,375 − 49,5 = −3,125
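Example continued:
c) [Figure: box and whisker diagrams for Grade 11 A, 11 B and 11 C on a common number line from 10 to 60]

d) Grade 11 A: x̄ − median = 36,094 − 36 = 0,094 ≈ 0 ∴ symmetrical
   Grade 11 B: x̄ − median = 24,344 − 22 = 2,344 > 0 ∴ skewed right
   Grade 11 C: x̄ − median = 46,375 − 49,5 = −3,125 < 0 ∴ skewed left

e) Grade 11 A: IQR = 50,5 − 20 = 30,5
   Grade 11 B: IQR = 30 − 19 = 11
   Grade 11 C: IQR = 56 − 41 = 15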
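The answers for all three classes in Example 5 can be checked with a sketch like the one below. The dictionaries simply transcribe the stem and leaf diagrams. The function names and the cut-off of 1 mark for calling a class "symmetrical" are assumptions made for this illustration, not rules from the notes.

```python
from statistics import mean, median

# Stem and leaf rows from Example 5: stem -> string of leaf digits
grade_11a = {1: "2345667", 2: "0012", 3: "0112579", 4: "46899", 5: "01266788", 6: "0"}
grade_11b = {1: "222335799999", 2: "001223356789", 3: "1579", 4: "0002"}
grade_11c = {1: "4", 2: "07", 3: "13", 4: "00111122669", 5: "0112344667899", 6: "000"}

def expand(stem_leaf):
    """Turn a stem and leaf mapping into a sorted list of marks."""
    return sorted(10 * stem + int(leaf)
                  for stem, leaves in stem_leaf.items() for leaf in leaves)

def describe(stem_leaf):
    data = expand(stem_leaf)
    half = len(data) // 2
    q1 = median(data[:half])
    q2 = median(data)
    q3 = median(data[half + len(data) % 2:])
    diff = mean(data) - q2   # mean - median
    # Assumed cut-off: |mean - median| below 1 mark counts as symmetrical.
    if abs(diff) < 1:
        shape = "symmetrical"
    else:
        shape = "skewed right" if diff > 0 else "skewed left"
    return {"mean": round(mean(data), 3), "Q1": q1, "Q2": q2, "Q3": q3,
            "IQR": q3 - q1, "shape": shape}

for name, diagram in [("11 A", grade_11a), ("11 B", grade_11b), ("11 C", grade_11c)]:
    print(name, describe(diagram))
```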
Exercise 11.1 pg. 331 no. b
LESSON 2
Ogive (Cumulative frequency)
Example 6:
Mark    Tally    Frequency (f)    Cumulative frequency
1       I        1                1
2       III      3                1 + 3 = 4
3       II       2                4 + 2 = 6
4       III      3                9
5       II       2                11
6       IIII     4                15
7       I        1                16
8       II       2                18

Frequency → tells us how many learners got that mark
Cumulative frequency → tells us how many learners got that mark or less

11 learners got 5 marks or less
16 learners got 7 marks or less
9 learners got more than 4 marks and at most 8 marks (18 – 9)
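The cumulative frequency column is a running total of the frequency column. A small sketch, using `itertools.accumulate` from the standard library (the variable names are illustrative):

```python
from itertools import accumulate

marks = [1, 2, 3, 4, 5, 6, 7, 8]
frequency = [1, 3, 2, 3, 2, 4, 1, 2]

cumulative = list(accumulate(frequency))   # running total of the frequencies
at_most = dict(zip(marks, cumulative))     # mark -> learners with that mark or less

print(cumulative)
print(at_most[5])                  # learners with 5 marks or less
print(at_most[8] - at_most[4])     # learners with more than 4 and at most 8 marks
```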
Example continued:
a) Complete the table

Marks       Frequency    Cumulative frequency    Points to plot
1 – 10      0            0                       (10;0)
11 – 20     2            2                       (20;2)
21 – 30     6            8                       (30;8)
31 – 40     7            15                      (40;15)
41 – 50     14           29                      (50;29)
51 – 60     20           49                      (60;49)
61 – 70     35           84                      (70;84)
71 – 80     29           113                     (80;113)
81 – 90     6            119                     (90;119)
91 – 100    1            120                     (100;120)
Example continued:
b) Draw an ogive to represent the data

[Figure: ogive (cumulative frequency graph) titled "Marks obtained in an exam"; x-axis: Marks, y-axis: Cumulative frequency]
Example continued:
c) Find an estimate of the median, lower quartile and upper quartile.

Position of the quartiles ($n$ = number of values):

$Q_1$: $\tfrac{1}{4}(n) = \tfrac{1}{4}(120) = 30$ ∴ 30th position

$Q_2$: $\tfrac{1}{2}(n) = \tfrac{1}{2}(120) = 60$ ∴ 60th position

$Q_3$: $\tfrac{3}{4}(n) = \tfrac{3}{4}(120) = 90$ ∴ 90th position

d) 80th percentile

Position $= n \times \text{percentile}\,\%$
$= 120 \times 80\%$
= 96th position
Example continued:
[Figure: ogive "Marks obtained in an exam" (x-axis: Marks, y-axis: Cumulative frequency) with the quartile and percentile positions read off on the Marks axis]
c) Q1 ≈ 52 ; Q2 ≈ 63 ; Q3 ≈ 72
d) 80th percentile ≈ 75
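Reading values off an ogive can be imitated by joining the plotted points with straight lines and interpolating. The sketch below does this for the 120-learner table. It is an approximation under that straight-line assumption, so its answers can differ by a mark or two from readings taken off a hand-drawn curve. The function name `mark_at_position` is hypothetical.

```python
# Ogive points (upper class boundary, cumulative frequency) from the table
points = [(10, 0), (20, 2), (30, 8), (40, 15), (50, 29),
          (60, 49), (70, 84), (80, 113), (90, 119), (100, 120)]

def mark_at_position(position, points):
    """Linearly interpolate the mark at a given cumulative frequency position."""
    for (x0, c0), (x1, c1) in zip(points, points[1:]):
        if c0 <= position <= c1 and c1 > c0:
            return x0 + (position - c0) / (c1 - c0) * (x1 - x0)
    raise ValueError("position lies outside the ogive")

n = points[-1][1]   # total number of learners
for label, fraction in [("Q1", 0.25), ("Q2", 0.50), ("Q3", 0.75), ("P80", 0.80)]:
    print(label, round(mark_at_position(fraction * n, points), 1))
```

With straight-line segments the estimates come out near 50,5; 63,1; 72,1 and 74,1, which is close to the graph readings of 52; 63; 72 and 75.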
Example continued:
e) Draw the box and whisker plot representing the marks obtained in the exam.

[Figure: box and whisker plot drawn against the Marks axis of the ogive, using Q1 ≈ 52, Q2 ≈ 63 and Q3 ≈ 72]
Exercise 11.2 pg. 340 no. b,c,d,e
LESSON 3
Variance and standard deviation
Variance ($var$) and standard deviation ($\sigma$) measure the dispersion around the mean.

• How spread out a set of data values is.

• Does most of the data lie close to the mean? Or is it widely spread out?

[Figure: two frequency curves of Marks (%) with the same mean of 62%; the narrower curve has $\sigma = 8$ and the wider curve has $\sigma = 16$]
Variance (𝑣𝑎𝑟) – the average of the squared differences from the mean

Standard deviation (𝜎) – the square root of the variance

Steps to follow:
• Find the mean $\bar{x}$
• Find the deviations from the mean, $x - \bar{x}$
• Square the deviations, $(x - \bar{x})^2$
• Add up the squared deviations and divide by the number of data values – this gives the variance ($var$)
• Take the square root of the variance – this is the standard deviation $\sigma$

$\sigma = \sqrt{\dfrac{\sum (x - \bar{x})^2}{n}} = \sqrt{\text{variance}}$
Example 7:
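A girl measured the heights of her friends and family. She decided to compare her findings.

Heights of friends (cm): 162 173 160 175 175 180 160
Heights of family (cm): 179 184 156 174 220 135 135

a) Determine the mean of her friend group.

Friends: $\bar{x} = \dfrac{1185}{7} = 169{,}3$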
b) Work out the variance for her friend group.

$\text{Variance} = \dfrac{\sum (x - \bar{x})^2}{n}$

Heights of friends (x)    Deviation from the mean (x − x̄)    (Deviation)² = (x − x̄)²
162                       162 − 169,3 = −7,3                  (−7,3)² = 53,29
173                       3,7                                 13,69
160                       −9,3                                86,49
175                       5,7                                 32,49
175                       5,7                                 32,49
180                       10,7                                114,49
160                       −9,3                                86,49
Sum                                                           419,43

The total of the last column is the sum of the squared deviations, $\sum (x - \bar{x})^2$.

Friends: $Var = \dfrac{419{,}43}{7} = 59{,}9\ \text{cm}^2$
c) Work out the standard deviation for her friends.

$\sigma = \sqrt{\dfrac{\sum (x - \bar{x})^2}{n}} = \sqrt{\text{variance}}$

Friends: $\sigma = \sqrt{59{,}9} = 7{,}7$ cm

Calculator change to: frequency table off
d) Work out the mean, variance and standard deviation of her family.
Use the calculator.

Family: $\bar{x} = 169$ cm ; $\sigma = 27{,}86$ cm ; $Var = \sigma^2 = 776\ \text{cm}^2$

e) What do the two standard deviations tell us about the spread of the data
around the mean?

Her family's heights are more spread out, as their standard deviation is higher than the friends' standard deviation. The friends' heights are closer together than her family's heights.
f) How many people in her family lie within one standard deviation of the mean?

$(169 - 27{,}86\ ;\ 169 + 27{,}86)$
∴ $(141{,}14\ ;\ 196{,}86)$
The family heights 179; 184; 156 and 174 lie in this interval, while 220; 135 and 135 do not.
∴ 4 people
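The calculator answers in Example 7 can be reproduced with the population versions of variance and standard deviation, which divide by $n$ as in the formula above. This is a sketch using the standard `statistics` module; the helper names `spread` and `within_one_sd` are illustrative.

```python
from statistics import mean, pvariance, pstdev

friends = [162, 173, 160, 175, 175, 180, 160]   # heights in cm
family = [179, 184, 156, 174, 220, 135, 135]

def spread(heights):
    """Population mean, variance and standard deviation (dividing by n)."""
    return mean(heights), pvariance(heights), pstdev(heights)

def within_one_sd(heights):
    """Values that lie within one standard deviation of the mean."""
    m, _, sd = spread(heights)
    return [h for h in heights if m - sd <= h <= m + sd]

print(spread(friends))
print(spread(family))
print(within_one_sd(family))
```

The friends' variance comes out marginally different from a hand calculation that uses the rounded mean 169,3, but both round to 59,9.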
Example 8:
Five numbers, 4; 8; 10; $x$ and $y$, have a mean of 10 and a standard deviation of 4. Find the values of $x$ and $y$.

Given: $\bar{x} = 10$ and $\sigma = 4$

Using the mean:
$\bar{x} = \dfrac{\text{sum of scores}}{n}$
$10 = \dfrac{4 + 8 + 10 + x + y}{5}$
$50 = 22 + x + y$
$28 = x + y$ ... (1)

Value    Deviation from the mean    (Deviation)²
4        4 − 10 = −6                36
8        8 − 10 = −2                4
10       10 − 10 = 0                0
x        x − 10                     x² − 20x + 100
y        y − 10                     y² − 20y + 100

$\sum (x - \bar{x})^2 = 36 + 4 + 0 + x^2 - 20x + 100 + y^2 - 20y + 100$
$= x^2 - 20x + y^2 - 20y + 240$

Using the standard deviation:
$\sigma = \sqrt{\dfrac{\sum (x - \bar{x})^2}{n}}$
$4 = \sqrt{\dfrac{x^2 - 20x + y^2 - 20y + 240}{5}}$
$16 = \dfrac{x^2 - 20x + y^2 - 20y + 240}{5}$
$80 = x^2 - 20x + y^2 - 20y + 240$
$0 = x^2 - 20x + y^2 - 20y + 160$ ... (2)

From (1): $x = 28 - y$ ... (3)

Substitute (3) into (2):
$0 = (28 - y)^2 - 20(28 - y) + y^2 - 20y + 160$
$0 = 784 - 56y + y^2 - 560 + 20y + y^2 - 20y + 160$
$0 = 2y^2 - 56y + 384$
$0 = y^2 - 28y + 192$
$0 = (y - 16)(y - 12)$
$y = 16$ or $y = 12$

Substitute $y$ into (3):
$x = 28 - 16$ or $x = 28 - 12$
$x = 12$ or $x = 16$

∴ the two numbers are 12 and 16
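The answer to Example 8 can be double-checked numerically. The sketch below is not part of the algebraic method: it assumes whole-number candidates only and simply tries every pair that satisfies $x + y = 28$, keeping those with a population standard deviation of 4.

```python
from statistics import mean, pstdev

# Try every whole-number split of 28 (an assumption made for this check only).
solutions = [(x, 28 - x) for x in range(29)
             if abs(pstdev([4, 8, 10, x, 28 - x]) - 4) < 1e-9]
print(solutions)

numbers = [4, 8, 10, 12, 16]
print(mean(numbers), pstdev(numbers))   # mean and population standard deviation
```

Both orderings of the pair appear in the list because $x$ and $y$ are interchangeable.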

Exercise 11.3 pg. 347 no. b,d,f,h
LESSON 4

Calculator change to:

frequency table off
Calculator change to:
frequency table on
Distribution of data
NORMALLY DISTRIBUTED OR SYMMETRICAL DATA
Mean = Median, so $\bar{x} - Q_2 = 0$

If the data to the left of the median balances the data to the right, then the mean, median
and mode will be the same. The data is said to be normally distributed or symmetrical.

DISTRIBUTIONS THAT ARE NOT NORMALLY DISTRIBUTED
Skewed to the right / positively skewed
Mean > Median, so $\bar{x} - Q_2 > 0$
Skewed right implies that there is a mass of data values on the left side of the
distribution, with fewer, higher values on the right. The data is spread more to the
right of the median and is said to be positively skewed or skewed to the right.

Skewed to the left / negatively skewed
Mean < Median, so $\bar{x} - Q_2 < 0$
Skewed left implies that there is a mass of data values on the right side of the
distribution, with fewer, lower values on the left. The data is spread more to the left
of the median, and the mode will be closer to the right of the distribution. The data is
said to be negatively skewed or skewed to the left.
Distribution of data (graphical)
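Bell curve

[Figure: bell curve centred on $\bar{x}$, with the horizontal axis marked at $\pm\sigma$, $\pm 2\sigma$ and $\pm 3\sigma$]
• About 68% of the data lies within one standard deviation of the mean ($\bar{x} \pm \sigma$)
• About 95% of the data lies within two standard deviations of the mean ($\bar{x} \pm 2\sigma$)
• Nearly all of the data (about 99,7%, often quoted as 99% or 100%) lies within three standard deviations of the mean ($\bar{x} \pm 3\sigma$)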
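The percentages on the bell curve can be confirmed for an exactly normal distribution with `statistics.NormalDist` (Python 3.8 or later). The helper name `share_within` is illustrative. Real data sets only follow these percentages approximately.

```python
from statistics import NormalDist

standard = NormalDist(mu=0, sigma=1)   # standard normal curve

def share_within(k):
    """Proportion of a normal distribution within k standard deviations of the mean."""
    return standard.cdf(k) - standard.cdf(-k)

for k in (1, 2, 3):
    print(k, round(100 * share_within(k), 1))
```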
Exercise 11.4 pg. 352 no. a,c,e
LESSON 5
Calculator change to:
frequency table off
Calculator change to:
frequency table on
Outlier
Exercise 11.5 pg. 357 no. a,b,e
LESSON 5

Calculator change to:

frequency table off
Bivariate data: scatter plots, least squares regression line
(line of best fit) and correlation coefficient.
Bivariate data – relationship between two types of data (two variables)

• We display bivariate data graphically in a scatter plot

• $x$-axis → independent variable
• $y$-axis → dependent variable

The questions normally consist of three parts:

• Draw a scatter plot.
• Determine the equation of the least squares regression line and draw the line of best fit.
• Determine the correlation coefficient and comment on it.
Bivariate data: Scatter plot
The shape of the graph tells us about the relationship between the two variables.
Ask yourself these questions when looking at the scatter plot:

• Is there a positive or negative association?
• Is the relationship strong or weak?
• Is the relationship linear or non-linear?
• Are there outliers?
• Are there groupings?
Bivariate data: scatter plots
[Figure: four scatter plots, described below]
Plot 1: positive association, strong relationship, linear, no outliers, no groupings
Plot 2: negative association, strong relationship, linear, no outliers, no groupings
Plot 3: negative association, weak relationship, linear, no outliers, no groupings
Plot 4: positive association, weak relationship, linear, no outliers, no groupings
Bivariate data: scatter plots
Positive association Positive association
 Strong relship  Strong relship
 Exponential  Parabolic
 No outliers  No outliers
 No groupings  No groupings

Positive association Positive association

 Strong relship  Strong relship
 Linear  Linear
 One outliers  No outliers
 No groupings  Two groupings
Bivariate data: correlation coefficient - r
Correlation coefficient – measures the strength of the relationship between the two variables; its sign gives the direction.
The value can vary between −1 (perfect negative) and 1 (perfect positive).

Value of r               Description
r = 1                    Perfect positive correlation
between 0,8 and 1        Strong positive correlation
between 0,5 and 0,8      Moderately strong positive correlation
between 0 and 0,5        Weak positive correlation
r = 0                    No correlation
between −0,5 and 0       Weak negative correlation
between −0,8 and −0,5    Moderate negative correlation
between −1 and −0,8      Strong negative correlation
r = −1                   Perfect negative correlation
Bivariate data: correlation coefficient - r

[Figure: four scatter plots with their correlation coefficients]
r = 1 ∴ perfect positive correlation
r = −0,9999 ∴ strong negative correlation
r = −0,0009 ∴ almost no correlation
r = −0,68 ∴ moderate negative correlation
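The scale can be turned into a small lookup. The sketch below uses the cut-offs 0,5 and 0,8 from the scale; which label a value exactly on a cut-off receives is an assumption made here, and the function name `describe_correlation` is illustrative.

```python
def describe_correlation(r):
    """Name the correlation using the cut-offs 0.5 and 0.8."""
    if not -1 <= r <= 1:
        raise ValueError("r must lie between -1 and 1")
    if r == 0:
        return "no correlation"
    direction = "positive" if r > 0 else "negative"
    size = abs(r)
    if size == 1:
        strength = "perfect"
    elif size >= 0.8:      # assumption: a value of exactly 0.8 counts as strong
        strength = "strong"
    elif size >= 0.5:      # assumption: a value of exactly 0.5 counts as moderate
        strength = "moderate"
    else:
        strength = "weak"
    return f"{strength} {direction} correlation"

for r in (1, -0.9999, -0.68, 0.7135, 0.9756):
    print(r, describe_correlation(r))
```

A value such as −0,0009 is labelled "weak negative" by this rule, although in practice it is described as almost no correlation.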
Bivariate data: regression line
Regression line – the equation for the line of best fit.

$\hat{y} = a + bx$

• $a$ – the constant value (the $y$-cut, i.e. the $y$-intercept)
• $b$ – the gradient of the regression line

To plot the line of best fit, use the $y$-cut and the point $(\bar{x}\,;\,\bar{y})$.

Interpolation – using the regression line to predict values within the data range

Extrapolation – using the regression line to predict values outside the data range
Bivariate data
The table represents the distance in metres required by a car to apply brakes and reach a
standstill when it is travelling at a given speed.

Speed in km/h            20    40    60    80    100    120    140
Braking distance in m    6     16    3     48    70     80     110

Enter the data in your calculator in order to find the values of $r$, $a$, $b$, $\bar{x}$ and $\bar{y}$.

$a = -24{,}9$
$b = 0{,}9$
$r = 0{,}95$
$\bar{x} = 80$
$\bar{y} = 47{,}6$
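For readers without the calculator at hand, the values of $a$, $b$ and $r$ can be computed from the standard least squares formulas. This is a sketch using the data exactly as listed in the table; the function name `least_squares` is illustrative.

```python
from math import sqrt

speed = [20, 40, 60, 80, 100, 120, 140]     # km/h
distance = [6, 16, 3, 48, 70, 80, 110]      # m, as listed in the table

def least_squares(xs, ys):
    """Return (a, b, r) for the line y-hat = a + b*x and the correlation coefficient r."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = sxy / sxx              # gradient
    a = y_bar - b * x_bar      # y-cut: the line passes through (x_bar; y_bar)
    r = sxy / sqrt(sxx * syy)
    return a, b, r

a, b, r = least_squares(speed, distance)
print(round(a, 1), round(b, 1), round(r, 2))
```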
Bivariate data
Mrs du Trevou owns an ice-cream shop. She analyses her ice-cream sales over 14 randomly selected days and
compares her sales with the temperature on each of those days. The results of her survey are shown in the
table below:

Temp in °C         20    13    18    23    19    28    21    38    26    22    17    14    15    16
Ice creams sold    116   85    107   139   123   172   128   127   148   124   112   89    101   96

a) Draw a scatter plot representing this data
b) Discuss the relationship between the data
c) Describe the correlation:
1) including the outlier
2) excluding the outlier
d) Determine the line of best fit
1) including the outlier
2) excluding the outlier
e) Using the regression line that was calculated without the outlier, predict how many ice creams Mrs du Trevou
will sell if the temperature is:
1) 5 °C
2) 24 °C
Bivariate data
a) Draw a scatter plot representing this data

[Figure: scatter plot; x-axis: Temp degrees Celsius (0 to 40), y-axis: No of ice creams sold (0 to 200)]

b) Discuss the relationship between the data

Positive association, strong relationship, Linear, 1 outlier, no groupings

Bivariate data
c) Determine the correlation coefficient and describe the correlation:
$r = 0{,}7135$ ∴ Moderately strong positive correlation

d) Determine the equation for the line of best fit.

$a = 66{,}0467$ ; $b = 2{,}5598$
∴ $\hat{y} = 2{,}6x + 66{,}0$

e) Draw the line of best fit by finding the $y$-cut and the point $(\bar{x}\,;\,\bar{y})$.
$\hat{y} = 2{,}6x + 66{,}0$
$y$-cut $= 66$
$(\bar{x}\,;\,\bar{y}) = (20{,}7\ ;\ 119{,}1)$

[Figure: scatter plot with the line of best fit; x-axis: Temp degrees Celsius (0 to 40), y-axis: No of ice creams sold (0 to 200)]
Bivariate data
f) Determine the correlation coefficient as well as the line of best fit if the outlier is excluded.
$r = 0{,}9756$ ∴ Strong positive correlation
$a = 16{,}22235948$ ; $b = 5{,}27424336$
∴ $\hat{y} = 5{,}3x + 16{,}2$
g) Draw the new line of best fit.
$y$-cut $= 16{,}2$
$(\bar{x}\,;\,\bar{y}) = (19{,}4\ ;\ 118{,}5)$

[Figure: scatter plot with the new line of best fit; x-axis: Temp degrees Celsius (0 to 40), y-axis: No of ice creams sold (0 to 200)]

h) How does the outlier affect the data?
Bivariate data
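i) Using the regression line that was calculated without the outlier, predict how many ice creams
Mrs du Trevou will sell at each temperature below. Explain whether each prediction is valid.
1) 5 °C
$\hat{y} = 5{,}3x + 16{,}2$
$\hat{y} = 5{,}3(5) + 16{,}2$
$\hat{y} = 42{,}7$
∴ approximately 43 ice creams
Not valid, as it is extrapolation: 5 °C lies outside the range of the recorded temperatures.

2) 24 °C
$\hat{y} = 5{,}3x + 16{,}2$
$\hat{y} = 5{,}3(24) + 16{,}2$
$\hat{y} = 143{,}4$
∴ approximately 143 ice creams
Valid, as it is interpolation: 24 °C lies within the range of the recorded temperatures.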
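The effect of the outlier, and the difference between interpolation and extrapolation, can be sketched as follows. The code refits the line without the point (38 ; 127) and flags a prediction as interpolation only when the temperature lies inside the range of the data used for the fit. The function names `fit` and `predict` are illustrative, and the predictions use the unrounded coefficients, so they differ slightly from the hand calculation with 5,3 and 16,2.

```python
from math import sqrt

temps = [20, 13, 18, 23, 19, 28, 21, 38, 26, 22, 17, 14, 15, 16]             # °C
sold = [116, 85, 107, 139, 123, 172, 128, 127, 148, 124, 112, 89, 101, 96]   # ice creams

def fit(xs, ys):
    """Least squares fit y-hat = a + b*x; returns (a, b, r)."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b = sxy / sxx
    return y_bar - b * x_bar, b, sxy / sqrt(sxx * syy)

# Drop the outlier (38 °C ; 127 ice creams) before refitting.
kept = [(t, s) for t, s in zip(temps, sold) if (t, s) != (38, 127)]
a, b, r = fit([t for t, _ in kept], [s for _, s in kept])

def predict(temp):
    """Predicted sales, and whether temp lies inside the fitted data range (interpolation)."""
    lo, hi = min(t for t, _ in kept), max(t for t, _ in kept)
    return a + b * temp, lo <= temp <= hi

print(fit(temps, sold))   # with the outlier
print((a, b, r))          # without the outlier
print(predict(5), predict(24))
```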
Worksheet no 1 – 5
In class and for homework
LESSON 6
