Descriptive Statistics Overview

The document provides an overview of descriptive statistics, focusing on measures of central tendency (mean, median, mode) and measures of spread (range, quartiles, variance, standard deviation). It explains how to calculate these measures and their appropriate applications depending on the type of data. The importance of understanding both central tendency and dispersion is emphasized for accurately interpreting data sets.

Uploaded by

Gidy Kiprop

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views30 pages

Descriptive Statistics Overview

Uploaded by

Gidy Kiprop

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

DESCRIPTIVE STATISTICS

• EXAMPLES OF DESCRIPTIVE STATISTICS

INCLUDE:
– RATIOS E.G MEASURES OF MORBIDITY,
MORTALITY AND NATALITY.
– MEASURES OF CENTRAL TENDENCY E.G. MEAN,
MODE AND MEDIAN.
– MEASURES OF DISPERSION E.G. RANGE,
INTERQUATILE RANGE AND STANDARD
DEVIATION
MEASURES OF CENTRAL
TENDENCY
•MEAN
•MEDIAN
•MODE
• For qualitative variables, two summary
measures are commonly used:
– Measures of central tendency
– Measures of dispersion
• Three measures of central tendency
exist:
1. Arithmetic Mean
2. Median
3. Mode
INTRODUCTION
• Measure of central tendency: single number
that is most representative of the entire data.
1. MEAN: is the sum of the numbers divided by
n
2. MEDIAN: the middle number when the
numbers are ordered. If set is even, the
median is the average of the two middle
numbers.
3. MODE: most frequent number. Can be
bimodal or trimodal.
• Appropriate MoCT depends on the data
itself.
• Continuous data e.g. ht, use mean. = mean
height is 32.5 cm'. The mode is not a good
measure here because, it may not exist
• Discrete data e.g. number of children, use
mode or median, this avoids mean is 2.3
children‘!
• Categorical data e.g. Colour of houses sold
use mode, for example, ‘White” is the most
common house colour'.
MEAN
• The arithmetic mean is the most common
measure of central tendency.
• The symbol "μ" is used for the mean of a
population. The symbol "M" is used for
the mean of a sample. The formula for μ
is shown below:
• μ = ΣX/N (population mean)
• M = ΣX/n (sample mean)
• The mean presented along with the variance
and the standard deviation is the "best"
measure of central tendency for continuous
data.
• In some situations the mean is not the "best"
measure of central tendency. The median is
the preferred measure. E.G:
• When data distribution is skewed
• When you believe that a distribution might
be skewed
• When you have a small number of subjects
MEDIAN
• The median is also a frequently used
measure of central tendency. The median
is the midpoint of a distribution: the
same number of scores is above the
median as below it if the distribution of
data is odd. When the data set is even,
the median is the mean of the two
middle numbers.
• The median can also be thought of as the
50th percentile.
MODE
• The mode is the most frequently
occurring value
• With continuous data measured to
many decimals, the frequency of each
value is one since no two scores will be
exactly the same.
• Therefore the mode of continuous data
is normally computed from a grouped
frequency distribution.
Mode of continuous data
Grouped frequency distribution.
Wt Frequency
500-600 2
600-700 3
700-800 6
800-900 5
900-1000 4 1000-1100 0
700-800 is most frequent group, the MODE is
the midpoint of the scale range i.e. 750 kg
• The mode is not usually used because
the largest frequency of scores might
not be at the center. The only situation
in which the mode may be preferred
over the other two measures of central
tendency is when describing discrete
categorical data. The mode is preferred
in this situation because the greatest
frequency of responses is important for
describing categorical data
MEASURES OF SPREAD
Introduction
• A measure of spread (dispersion), is
used to describe the variability in a
sample or population. It is usually used
in conjunction with a measure of central
tendency, such as the mean or median,
to provide an overall description of a set
of data.
• Why is it important to measure the
spread of data?
• It gives us an idea how well a mean
representative a data set. Mean is not
good with data set with large spread but
is appropriate if the spread of data is
small. Large spread indicates high
variability between individual scores,
such does not auger well in research.
Types of measures of spread
• Range
• Quartiles
• Variance
• Absolute deviation and
• Standard deviation.
Range
• The range is the difference between the
highest and lowest scores in a data set
and is the simplest measure of spread.
• Range = maximum value - minimum
value
• NB, unlike with median, data must not
be ordered, however, an ordered data
makes it easier to quickly see the
minimum and maximum values
• The range delineates the boundaries of
data sets. The importance of this is seen
if you are measuring a variable that has
a high and low values that should not be
crossed. The range can be used to detect
any errors when entering data. E.g., if
you are recording the age of school
children, you quickly note a mistake if
your range is 7 to 118yrs!
Quartiles and Interquartile Range

• Quartiles measure spread of a data set

by breaking the data set into quarters.
There are four quartiles in a percentile-
1st quartile is in 25th percentile, 2nd
quartile= 50th percentile, 3rd quartile=
75th percentile, and 4th quartile is in 100th
percentile.
• When an ordered data set is even, the
quartiles will be calculated as follows:
• First quartile (Q1) = 25th+ 26th value of
data set/2 i.e. x+y/2 = Q1
• Second quartile (Q2) = 50th + 51st/2 = Q2
• Third quartile (Q3) = 75th + 76th ÷ 2 = Q3
• If data set is odd, Q1, 2 and 3 will be
data on 25th, 50th, and 75th position of an
ordered set
• Quartiles are much less affected by
outliers or a skewed data set than the
equivalent measures of mean and
standard deviation. For this reason,
quartiles are often reported along with
the median as the best choice of
measure of spread and central
tendency, respectively, when dealing
with skewed and/or data with outliers.
• A common way of expressing quartiles is
as an interquartile range. The
interquartile range describes the
difference between the third quartile
(Q3) and the first quartile (Q1). It tells of
the range of the middle half of the
scores in the distribution. i.e.
• Formula: IQR = Q3 - Q1
Absolute Deviation, Variance and standard deviation (Variations)

• Quartiles do not take into account every

score in our group of data. To take into
account the actual values of each score
in a data set and get the spread we use
the ABSOLUTE DEVIATION,
VARIANCE & STANDARD DEVIATION.
• Either of the three variations can be
used in research.
Variance
• Another method for calculating the deviation
of a group of scores from the mean, is to use
the variance. Unlike the absolute deviation,
which substitutes negative results with the
absolute value of the deviation, the variance
achieves positive values by squaring each of
the deviations . Adding up these squared
deviations gives us the sum of squares, which
we can then divide by the total number of
scores in a group of data
• As a measure of variability, the variance is
useful. If the scores in a data set are
spread out, the variance will be a large
number. Conversely, if the scores are
spread closely around the mean, the
variance will be a smaller number.
However, there are two potential
problems with the variance. First, because
the deviations of scores from the mean are
'squared', this gives more weight to
extreme scores.
• If our data contains outliers this can give
undue weight to these scores. Secondly,
the variance is not in the same units as the
scores in our data set: variance is
measured in the units squared. This means
it cannot be placed on a frequency
distribution and cannot directly relate its
value to the values in our data set.
Calculating the standard deviation rather
than the variance rectifies this problem.
Standard Deviation
Introduction
• The standard deviation is a measure of
the spread of scores within a set of data.
Usually, the standard deviation of a
population is preferred. However, as
researchers often deal with data from a
sample only, the population standard
deviation can be derived from a sample
standard deviation.
When to use the sample or population standard deviation

• Knowing the population standard deviation is

more important because the population
contains all the researchers are interested in.
Therefore, population standard deviation will
be preferred if: (1) the entire population is
available or (2) a sample of a larger
population is available but the interest is in
the sample only and the researchers do not
wish to generalize their findings to the
population.
What type of data should you use when you
calculate a standard deviation?
• The standard deviation is used in
conjunction with the mean to
summarise continuous data, NOT
categorical data. In addition, the
standard deviation, like the mean, is
normally only appropriate when the
continuous data is not significantly
skewed or has outliers.
What are the formulas for the population and standard deviation?

• The sample standard deviation formula is:

• The population standard deviation formula
is:

Central Tendency and Variability Measures
100% (15)
Central Tendency and Variability Measures
15 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
67 pages
CH - 2 Stastical Data Analysis
No ratings yet
CH - 2 Stastical Data Analysis
46 pages
Lesson3 Descriptive Statistics Reviewer
No ratings yet
Lesson3 Descriptive Statistics Reviewer
12 pages
Central Tendency & Dispersion Explained
No ratings yet
Central Tendency & Dispersion Explained
24 pages
Understanding Measures of Central Tendency
No ratings yet
Understanding Measures of Central Tendency
102 pages
Understanding Descriptive Statistics
No ratings yet
Understanding Descriptive Statistics
63 pages
Understanding Central Tendency Measures
No ratings yet
Understanding Central Tendency Measures
5 pages
Understanding Descriptive Statistics
No ratings yet
Understanding Descriptive Statistics
42 pages
Understanding Descriptive Statistics
No ratings yet
Understanding Descriptive Statistics
74 pages
Unit 3 - Descriptive Statistics
No ratings yet
Unit 3 - Descriptive Statistics
44 pages
Statistics in Data Science Overview
No ratings yet
Statistics in Data Science Overview
155 pages
Understanding Statistics: Key Concepts
No ratings yet
Understanding Statistics: Key Concepts
46 pages
Central Tendency and Variability Explained
No ratings yet
Central Tendency and Variability Explained
28 pages
Data Presentation Techniques Explained
No ratings yet
Data Presentation Techniques Explained
104 pages
Understanding Central Tendency Measures
No ratings yet
Understanding Central Tendency Measures
15 pages
Understanding Frequency Distribution in Statistics
No ratings yet
Understanding Frequency Distribution in Statistics
13 pages
Central Tendency and Dispersion Measures
No ratings yet
Central Tendency and Dispersion Measures
35 pages
Understanding Variables and Data Analysis
No ratings yet
Understanding Variables and Data Analysis
4 pages
Data Analysis and Research Report Guide
No ratings yet
Data Analysis and Research Report Guide
40 pages
Central Tendency and Data Dispersion
No ratings yet
Central Tendency and Data Dispersion
63 pages
Descriptive Statistics and Standard Deviation
No ratings yet
Descriptive Statistics and Standard Deviation
38 pages
Chapter 2
No ratings yet
Chapter 2
19 pages
Understanding Statistics Basics
No ratings yet
Understanding Statistics Basics
4 pages
Central Tendency and Dispersion Measures
No ratings yet
Central Tendency and Dispersion Measures
4 pages
Central Tendency and Dispersion Explained
No ratings yet
Central Tendency and Dispersion Explained
34 pages
Numerical Measures in Data Analysis
No ratings yet
Numerical Measures in Data Analysis
46 pages
Data Management: Types and Techniques
No ratings yet
Data Management: Types and Techniques
43 pages
Understanding Measures of Variability
No ratings yet
Understanding Measures of Variability
3 pages
Central Tendency Measures Explained
No ratings yet
Central Tendency Measures Explained
37 pages
Descriptive Statistics Overview
No ratings yet
Descriptive Statistics Overview
38 pages
Chapter 3 Formulas
No ratings yet
Chapter 3 Formulas
3 pages
Unit 1 Stats
No ratings yet
Unit 1 Stats
32 pages
Statistics for Transport Engineers
No ratings yet
Statistics for Transport Engineers
68 pages
Understanding Descriptive Statistics
No ratings yet
Understanding Descriptive Statistics
6 pages
Central Tendency and Variation Explained
No ratings yet
Central Tendency and Variation Explained
36 pages
Understanding Data Averages and Variability
No ratings yet
Understanding Data Averages and Variability
38 pages
Understanding Descriptive Statistics
No ratings yet
Understanding Descriptive Statistics
23 pages
Descriptive Statistics Overview Guide
No ratings yet
Descriptive Statistics Overview Guide
48 pages
Data Management and Statistical Concepts
No ratings yet
Data Management and Statistical Concepts
21 pages
Biostatistics: Central Tendency & Dispersion
No ratings yet
Biostatistics: Central Tendency & Dispersion
28 pages
Understanding Central Tendency & Variability
No ratings yet
Understanding Central Tendency & Variability
61 pages
Understanding Central Tendency Measures
No ratings yet
Understanding Central Tendency Measures
5 pages
Statistical Methods in Social Sciences
No ratings yet
Statistical Methods in Social Sciences
69 pages
Key Statistics in Economics and Business
No ratings yet
Key Statistics in Economics and Business
37 pages
Best Measure of Central Tendency
No ratings yet
Best Measure of Central Tendency
68 pages
Statistics for Research: Central Tendency & Dispersion
No ratings yet
Statistics for Research: Central Tendency & Dispersion
33 pages
Descriptive Statistics Overview
No ratings yet
Descriptive Statistics Overview
24 pages
Numerical Measures for Data Analysis
No ratings yet
Numerical Measures for Data Analysis
48 pages
B. Stat Measures of Variability or Dispersion.
No ratings yet
B. Stat Measures of Variability or Dispersion.
13 pages
Descriptive Statistics Phs St.
No ratings yet
Descriptive Statistics Phs St.
54 pages
Understanding Statistics: Key Concepts
No ratings yet
Understanding Statistics: Key Concepts
27 pages
Descriptive and Inferential Statistics Guide
No ratings yet
Descriptive and Inferential Statistics Guide
41 pages
Descriptive Statistics in Economics
No ratings yet
Descriptive Statistics in Economics
45 pages
Descriptive Statistics Overview Guide
No ratings yet
Descriptive Statistics Overview Guide
10 pages
Central Tendency and Dispersion Measures
100% (1)
Central Tendency and Dispersion Measures
8 pages
Descriptive Statistics Overview
No ratings yet
Descriptive Statistics Overview
64 pages
Statistical Measures and Definitions
No ratings yet
Statistical Measures and Definitions
8 pages
Interfractile Range in Descriptive Stats
No ratings yet
Interfractile Range in Descriptive Stats
42 pages
Class 10 Statistics Test Paper
No ratings yet
Class 10 Statistics Test Paper
2 pages
Classroom Testing Evaluation Principles
No ratings yet
Classroom Testing Evaluation Principles
7 pages
Study Hours vs. Points Analysis
No ratings yet
Study Hours vs. Points Analysis
10 pages
Elementary Statistics Exam Solutions
No ratings yet
Elementary Statistics Exam Solutions
5 pages
Online Tax System Impact on Compliance
No ratings yet
Online Tax System Impact on Compliance
11 pages
Inbound 5818984559829786067
No ratings yet
Inbound 5818984559829786067
22 pages
Correlation and Regression Questions Guide
No ratings yet
Correlation and Regression Questions Guide
20 pages
Bubble Point Calculation for Benzene-Toluene
No ratings yet
Bubble Point Calculation for Benzene-Toluene
37 pages
1342 Test1 Review 25FA
No ratings yet
1342 Test1 Review 25FA
7 pages
Arithmetic Mean and Standard Deviation Calculations
No ratings yet
Arithmetic Mean and Standard Deviation Calculations
5 pages
Importance and Scope of Statistics
No ratings yet
Importance and Scope of Statistics
20 pages
Data Types and Analysis Methods Explained
No ratings yet
Data Types and Analysis Methods Explained
87 pages
MBA503A: Statistical Techniques Course
No ratings yet
MBA503A: Statistical Techniques Course
5 pages
PCA Analysis of Treasury Yield Changes
No ratings yet
PCA Analysis of Treasury Yield Changes
4 pages
ZemZem Digital Download
100% (1)
ZemZem Digital Download
94 pages
Areas Under the Normal Curve Explained
No ratings yet
Areas Under the Normal Curve Explained
13 pages
Mathematics Test 1 Marking Guidelines
No ratings yet
Mathematics Test 1 Marking Guidelines
4 pages
Statistical Functions in R Explained
No ratings yet
Statistical Functions in R Explained
2 pages
Uncertainty Analysis of Beam Loads and Variables
No ratings yet
Uncertainty Analysis of Beam Loads and Variables
3 pages
Statistics Exam Paper May 2024
No ratings yet
Statistics Exam Paper May 2024
2 pages
Skewness MCQ Quiz - Free PDF Download
No ratings yet
Skewness MCQ Quiz - Free PDF Download
29 pages
Deciles and Percentiles in Grouped Data
No ratings yet
Deciles and Percentiles in Grouped Data
25 pages
Confidence Intervals & Hypothesis Testing Guide
No ratings yet
Confidence Intervals & Hypothesis Testing Guide
5 pages
Small Sample Tests in Statistics
No ratings yet
Small Sample Tests in Statistics
27 pages
IB AA HL Probability & Statistics Test
No ratings yet
IB AA HL Probability & Statistics Test
3 pages
C/a/d Expressing Dollars and Employees in Thousands, The Weighted Mean Expenditure Per Employee Is
No ratings yet
C/a/d Expressing Dollars and Employees in Thousands, The Weighted Mean Expenditure Per Employee Is
22 pages
Understanding Basic Statistical Concepts
No ratings yet
Understanding Basic Statistical Concepts
42 pages
Data Preprocessing Techniques Explained
No ratings yet
Data Preprocessing Techniques Explained
27 pages
Statistics and Probability Concepts Guide
No ratings yet
Statistics and Probability Concepts Guide
1 page
Central Tendency & Dispersion Measures Guide
No ratings yet
Central Tendency & Dispersion Measures Guide
16 pages

Descriptive Statistics Overview

Uploaded by

Descriptive Statistics Overview

Uploaded by

DESCRIPTIVE STATISTICS

• EXAMPLES OF DESCRIPTIVE STATISTICS

• Quartiles measure spread of a data set

• Quartiles do not take into account every

• Knowing the population standard deviation is

• The sample standard deviation formula is:

You might also like