STATISTICS
STATISTICS
Concept - 01 – Average (Mean) – Ungrouped, Grouped &
Weighted
The arithmetic mean is a measure of central tendency. It tells us the
average value from a dataset.
In ungrouped data, each value is treated once.
In grouped or discrete data, values repeat as per their frequency.
In weighted mean, some values have more influence due to given
weights.
1. Mean of Ungrouped Data:
𝑥! + 𝑥" + 𝑥# + ⋯ + 𝑥$
𝑥̅ =
𝑛
2. Mean of Discrete/Grouped Data (with Frequency):
∑ 𝑓% 𝑥%
𝑥̅ =
∑ 𝑓%
∑ "& #&
3. Weighted Mean: 𝑥̅ = ∑ "&
Where: 𝑥% = observation, 𝑓% = frequency, 𝑤% = weight
Always multiply observations with their frequency or weight
Divide total sum by total frequency or weight. Watch for
hidden frequency values in grouped or weighted questions.
Solved Example (From Source)
The marks obtained by 30 students of class X in a mathematics
paper out of 100 are given below. Find the mean.
Marks (xᵢ) Frequency (fᵢ) fᵢ × xᵢ
10 1 10
20 1 20
36 3 108
40 4 160
50 3 150
56 2 112
60 4 240
70 4 280
72 1 72
80 1 80
88 2 176
92 3 276
95 1 95
∑ (! )!
Step 1: Use the mean 𝑥̅ = ∑ (!
!**+
Step 2: Substitute values: 𝑥̅ = #,
= 59.3
1. Arithmetic mean of the following data is 23, 17, 20, 19, 21
130 Powered by 1EQ
STATISTICS
(a) 20 (b) 19 (c) 23 (d) 21
Concept - 02 – Assumed Mean Method (Shortcut for Large
Values)
When the values of 𝑥% are large or awkward to compute directly,
we use an assumed mean 𝐴 near the center of the data to simplify
calculations.
This method is especially useful in grouped/discrete frequency
tables, where it reduces calculation errors.
If 𝐴 is the assumed mean and
∑ 𝑓% 𝑑%
𝑑% = 𝑥% − 𝐴; 𝑥̅ = 𝐴 +
∑ 𝑓%
Use this method when class marks or values are large or
awkward.
Choose 𝐴 close to the middle value to simplify.
Example: 𝐴 = 60 as the assumed mean:
xᵢ (Marks) fᵢ (Frequency) dᵢ = xᵢ – 60 fᵢdᵢ
10 1 −50 −50
20 1 −40 −40
36 3 −24 −72
40 4 −20 −80
50 3 −10 −30
56 2 −4 −8
60 4 0 0
70 4 10 40
72 1 12 12
80 1 20 20
88 2 28 56
92 3 32 96
95 1 35 35
Total 30 −21
∑ (! -!
Step 1: Use the formula: 𝑥̅ = 𝐴 + ∑ (!
."!
Step 2: Substitute values: 𝑥̅ = 60 + = 60 − 0.7 = 59.3
#,
Concept - 03 – Median – Ungrouped & Grouped Data
Median is the value dividing the data into two equal halves.
It is position-based, not value-based — unaffected by
extreme values.
For grouped data, cumulative frequencies help locate the
median class.
1. Median of Ungrouped Data (for odd/even number of
values):
$/! th
- Middle term = 6 7 item, if 𝑛 is odd
"
131 Powered by 1EQ
STATISTICS
$ th $ th
- Mean of 6"7 and 6" + 17 items, if 𝑛 is even
2. Median of Grouped Data (Continuous Frequency Table):
𝑁
− 𝐶𝐹
Median = 𝐿 + : 2 ?×ℎ
𝑓
Where: 𝐿 = Lower boundary of the median class
𝑁 = Total frequency; 𝐶𝐹 = Cumulative frequency before
median class; 𝑓 = Frequency of median class
ℎ = Class width
Median = middle value (for ungrouped) or estimated using
cumulative frequency (for grouped).
Solved Example (Grouped Data from Source)
Find the median for the following distribution:
Marks (Class Interval) No. of Students
0–10 2
10–20 3
20–30 8
30–40 10
40–50 5
50–60 2
Step 1: Create cumulative frequency table:
CI f CF
0–10 2 2
10–20 3 5
20–30 8 13
30–40 10 23 ← Median class (since N/2 = 15)
40–50 5 28
50–60 2 30
0
Step 2: Identify: 𝐿 = 30, 𝑁 = 30, = 15,
"
𝐶𝐹 = 13, 𝑓 = 10, ℎ = 10
!1.!# ",
Step 3: Apply Median = 30 + 6 7 × 10 = 30 + !, = 32
!,
2. The median of following ____ 32, 25, 33, 27, 35, 29 and 30
(a) 32 (b) 27 (c) 30 (d) 29
Concept - 04 – Mode – Ungrouped, Discrete, Grouped &
Empirical
Mode is the value or class interval that occurs most
frequently. In ungrouped or discrete data, find the value
with highest count.
In grouped data, use formula based on modal class (highest
frequency).
132 Powered by 1EQ
STATISTICS
When mode is missing, apply empirical formula if mean and
median are given.
Formula
1. Mode (Ungrouped/Discrete Data):
Mode = Value with highest frequency
2. Mode (Grouped Frequency Data):
𝑓! − 𝑓,
Mode = 𝑙 + D E×ℎ
2𝑓! − 𝑓, − 𝑓"
Where: 𝑙 = Lower limit of modal class; 𝑓! = Frequency of modal
class; 𝑓, = Frequency before modal class; 𝑓" = Frequency after
modal class; ℎ = Class width
3. Empirical Mode (Approximate):
Mode = 3 × Median − 2 × Mean
Mode is the most frequent value or class.
In grouped data, use the formula, not midpoint.
Example 1 – Ungrouped Data
Find the mode of the dataset:
13, 15, 31, 12, 27, 13, 27, 30, 27, 28, 16
Step 1: Count frequencies: 27 appears 3 times (most)
Step 2: Mode = 27
Example 2 – Discrete Frequency Data
Find the mode of the following data.
𝒙𝒊 𝑭𝒊
5 8
6 7
3 9
11 19
Step 1: Highest frequency = 19
Step 2: Mode = 11
Example 3 – Grouped Frequency Data
A Survey was conducted on 20 Houses. In a colony, an NGO of
students resulted in the following frequency table for the number
of Family members in a Household.
Family Size Number of families
1-3 7
3-5 8
5-7 2
7-9 2
9-11 1
Step 1: Modal class = 3–5 (highest frequency = 8)
𝑙 = 3, ℎ = 2, 𝑓! = 8, 𝑓, = 7, 𝑓" = 2
133 Powered by 1EQ
STATISTICS
3.* !
Step 2: Apply Mode = 3 + 6"⋅3.*."7 × 2 = 3 + 6*7 × 2 = 3.286
Example 4 – Empirical Formula
For sample data, mean = 60, and median = 48. For this
distribution, the mode is:
Step 1: Mode = 3 ⋅ 48 − 2 ⋅ 60 = 144 − 120 = 24
3. Find the mode of the given data. 21, 15, 16, 48, 6, 15, 44, 16, 6,
5, 15, 16, 21, 6, 15 and 43
(a) 21 (b) 16 (c) 15 (d) 6
4. Calculate the mode from the following data.
Days of confinement 6 7 8 9
No. of patients 4 6 7 5
(a) 8 (b) 7 (c) 5 (d) 6
5. A set of data presented in the form of a frequency
distribution table with class intervals and their respective
frequencies had a mode of 48.5. The lower boundary of the
modal class was 46.5; the frequency of the modal class was
34; the frequency of the class interval just preceding the
modal class was 32 and the frequency of the class interval
just succeeding the modal class was 25. What was the width
of the modal class?
(a) 10.5 (b) 12 (c) 10 (d) 11
6. The mean and median of a normal distribution is 22.7 and
23. Find its mode.
(a) 45.7 (b) 0.3 (c) 23.6 (d) 20
Concept - 05 – Ogive Method – Graphical Median
- Ogives are useful for visualizing distribution of data and
estimating median without calculations.
The two types of ogives: i) Less than type;
ii) More than type
Ogive method is a visual tool to estimate the median.
Helps in understanding distribution shape and cumulative
spread.
Example: Find median using ogive from the following data
Marks No. of Cumulative
(Class) Students Frequency
20-25 6 6
25-29 22 28
29-38 50 78
38-42 15 93
42-43 2 95
43-45 5 100
0
Sol: 𝑁 = 100, so = 50
"
134 Powered by 1EQ
STATISTICS
Median is the mark corresponding to 50th student.
From the cumulative table, 50th observation lies in 29–38
Concept - 06 – Standard Deviation & Variance – Measure
of Spread
Variance measures the average of squared deviations from
the mean.
Standard Deviation (SD) is the square root of variance.
SD tells us how spread out the values are around the mean.
Used to compare consistency or variation in datasets.
1. For Discrete/Grouped Frequency Data:
∑ 𝑓 (𝑥 − 𝑥̅ )"
Variance (σ" ) =
∑𝑓
Standard Deviation (σ) = √Variance
Alternate form (using deviation 𝑑% = 𝑥% − 𝐴, assumed mean):
∑ %& 5 ∑ %& $
σ$ = ∑%
− ' ∑% (
SD shows data consistency – low SD means values are
close to mean.
Example: Find the standard deviation of the following data:
𝑥% 𝑓% 𝑓% 𝑥%
88 2 176
92 3 276
95 1 95
16*
∑ 𝑓% = 6, ∑ 𝑓% 𝑥% = 547 ; ⇒ 𝑥̅ = = 91.17
7
Step 1: Compute deviations from mean and their squares:
(use table or calculator, or use shortcut method with assumed
mean)
∑ (().)̅ )"
Step 2: Apply 𝜎 = Q ∑(
or shortcut
Final Answer : 𝑥̅ = 59.3, Variance = 132; SD = √132 ≈ 11.49
7. If standard deviation & mean of a data set is 6 and 18 find the
coefficient of variance
(a) 50% (b) 25% (c) 33.33% (d) 100%
Concept - 07 – Coefficient of Variation – Compare Data
Consistency
CV is a percentage measure of how much the data varies
relative to its mean.
It is used to compare the consistency of two or more datasets.
Lower CV means more consistent data; higher CV means
more variability.
135 Powered by 1EQ
STATISTICS
Coefficient of Variation (CV)
Standard Deviation (σ)
= × 10
Mean (𝑥̅ )
Use CV to compare data spread when means are different.
Lower CV = more consistency, better performance.
Example: Two machines A and B produce the same parts.
Machine A has mean 60, SD = 5. Machine B has mean 48, SD = 4.
Which machine is more consistent?
1
CV; = 7, × 100 = 8.33%
6
CV< = 63 × 100 = 8.33%
Concept - 08 – Range – Spread Between Maximum and
Minimum
Range is the simplest measure of dispersion.
It tells us how far the data is spread between its smallest and
largest values.
Although it doesn’t consider every data point, it gives a quick idea
of variability.
Range = Maximum Value − Minimum Value
Range is a quick way to assess the spread of a dataset.
Example : Find the range of the dataset: 4, 5, 6, 14, 1, 0, 9, 48
Step 1: Identify the largest and smallest values:
Max = 48, Min = 0
Step 2: Apply Range = 48 − 0 = 48
Concept - 09 – Mean Deviation – From Mean & Median
Mean deviation is a measure of spread based on absolute
differences from a central point (mean or median).
It’s more robust than range and easier than standard deviation.
(
1. Mean Deviation from Mean: MD#̅ = ) ∑|𝑥* − 𝑥̅ |
(
2. Mean Deviation from Median: MD+ = ∑|𝑥* − 𝑚|
)
Mean deviation uses absolute distance, not squared hence
easy and intuitive.
Solved Example 1 – From Mean
Find the mean deviation from mean for 3, 5, 7, 4, 6.
#/1/*/6/7
Step 1: Mean = 1
=5
𝒙𝒊 U
𝒙 - 𝒙𝒊
3 2 = |3-5|
5 0 = |5-5|
136 Powered by 1EQ
STATISTICS
7 2 = |5-7|
4 1 = |5-4|
6 1 = |5-6|
7
MD)̅ = 1 = 1.2
Solved Example 2 – From Median
Find mean deviation about median of the dataset 2, 3, 4, 5, 6
Step 1: Median = 4
𝒙𝒊 𝒙𝒊 − 𝒎𝒆𝒅𝒊𝒂𝒏
2 2 = |4-2|
3 1 = |4-3|
4 0 = |4-4|
5 1 = |4-5|
6 2 = |4-6|
7
MD= = 1 = 1.2
8. If each value in a dataset is decreased by 𝒌, what happens to
mean, median, and mode?
(a) a, b, c (b) a − k, b − k, c – k
(c) a + k, b + k, c + k (d) none of these
ANSWER KEY
1 2 3 4 5 6 7 8
A C C A D C C A
137 Powered by 1EQ