Statistics Probability
Statistics Probability
Learning Module
in
Statistics & Probability
Name:________________________________________
Section/Strand:_________________________________
MODULE COVERAGE
CONTENT STANDARD
The learner demonstrates understanding of key concepts of random variables and probability distributions.
The learner demonstrates understanding of key concepts of sampling and sampling distributions of the sample mean
and estimation of population mean and population proportion.
PERFORMANCE STANDARD
The learner is able to accurately formulate and solve real-life problems in different disciplines involving normal
distribution.
The learner is able to apply suitable sampling and sampling distributions of the sample mean to solve real-life
problems in different disciplines; estimate the population mean and population proportion to solve and make
sound inferences in real life problems in different disciplines.
LEARNING COMPETECIES
Lesson Learning Competencies Code
Random illustrates a random variable (discrete and continuous). M11/12SP-IIIc-1
variables and distinguishes between a discrete and a continuous random variable. M11/12SP-IIIc-2
probability finds the possible values of a random variable M11/12SP-IIIa-3
distribution illustrates a probability distribution for a discrete random variable and its properties M11/12SP-IIIa-4
illustrates the mean and variance of a discrete random variable. M11/12SP-IIIb-1
computes probabilities corresponding to a given random variable. M11/12SP-IIIa-6
calculates the mean and the variance of a discrete random variable M11/12SP-IIIb-2
interprets the mean and the variance of a discrete random variable. M11/12SP-IIIb-3
solves problems involving mean and variance of probability distributions. M11/12SP-IIIb-4
illustrates a normal random variable and its characteristics M11/12SP-IIIc-1
identifies regions under the normal curve corresponding to different standard normal values. M11/12SP-IIIc-4
converts a normal random variable to a standard normal variable and vice versa. M11/12SP-IIIc-4
computes probabilities and percentiles using the standard normal table. M11/12SP-IIIc-d-
1
illustrates random sampling. M11/12SP-IIId-2
distinguishes between parameter and statistic M11/12SP-IIId-3
identifies sampling distributions of statistics (sample mean). M11/12SP-IIId-4
finds the mean and variance of the sampling distribution of the sample mean. M11/12SP-IIId-5
defines the sampling distribution of the sample mean for normal population when the variance is: (a) M11/12SP-IIIe-1
known; (b) unknown
illustrates the Central Limit Theorem M11/12SP-IIIe-2
defines the sampling distribution of the sample mean using the Central Limit Theorem. M11/12SP-III-3
solves problems involving sampling distributions of the sample mean. M11SP-IIIe-f-1
illustrates the t-distribution. M11/12SP-IIIg-2
identifies percentiles using the t-table. M11/12SP-IIIg-5
identifies the length of a confidence interval M11/12SP-IIIj-1
computes for the length of the confidence interval. M11/12SP-IIIj-2
computes for an appropriate sample size using the length of the interval. M11/12SP-IIIj-3
solves problems involving sample size determination M11/12SP-IIIj-4
Concept map
Expected Skills
To do well in this module, you need to remember the following:
1. Answer all the exercises and process questions completely.
2
Page
2. Study the explanation well.
3. Supplement yourself with other learning materials when available and necessary.
4. Write down your thoughts to help you in processing information.
5. Keep in mind that your success in this module depends on how much effort you will put.
EXPLORE
Are you familiar with card games? In this activity you are going to think of the probability that you will get a
certain card.
FIRM-UP
In this section of your module will introduce you to probability and probability distribution.
Activity 3 Identify
A. Start Up!
Write “start” if each of the following situations is a discrete random variable, otherwise write “up”.
B. Discrete or Continuous
Classify as Discrete or Continuous random variable.
1. Three coins are tossed. Let W be the random variable representing the number of tails that occur. Find the
values of the random variable W. Complete the table below.
Possible outcomes Value of the Random Variable W (number of tails)
Ex. HHH O
HHT 1
4
Page
2. Let V be a random variable giving the number of heads plus the number of tails in three tosses of a coin. List the
elements of the sample space D for the three tosses of the coin and assign a value to each sample point.
HHT 3
A probability distribution consists of the values of a random variable can assume and the corresponding
probabilities of the values. The probabilities can be determined theoretically or by observation. For example, if two coins
are tossed, the possible outcomes are HH, HT, TH, or TT. If X is the random variable for the number of heads, then X
assumes the values 0, 1, or 2. Probabilities for the values of X can be determined as follows:
Determine whether the distribution represents a probability distribution. Explain your answer.
5
Page
__________________________________________________________________________________________________
__________________________________________________________________________________________________
__________________________________________________________________________________________________
So, the variance of the number of cars sold per day is 1.56 and the standard
deviation is 1.25.
A. Read the following statements carefully. Write ND if the statement describes a characteristic of a normal
distribution, and NND if it does not describe a characteristic of a normal distribution.
B. Make a sketch for each of the 3 areas under the normal curve as stated in the empirical rule. Use the space
provided below.
1. Mrs. Cruz wants to do an analysis on students' final examination scores in her 40 Grade 11-Baguio Mathematics
class for the semester. Should she consider her all the students’ scores or only some of the students’ scores?
2. A group of researchers want to determine the average Mathematics grade for the semester of Grade 11
students in the Philippines. Should the group consider conducting a survey to all the Grade 11 students in the
Philippines or random students of selected places?
Analysis: For case number 1, this task is only dealing with the students' final examination scores in one specific class
with 40 students. The intent is not to the students' final examination scores of students in a much bigger population.
Since the teacher has the entire population available for this situation, she should use the all of them. In case number 2,
the population is extremely large. There is actually an impractical and an inconvenient way of obtaining all of the data in
the population. You simply will not have all of the data available for your use especially if you only have a limited time.
You will need to use a sample of the population.
Population is the group you want to generalize. It consists of all the members of the group you are interested in.
Sample is the subset from the population you want to examine. A population commonly contains too many individuals
to study conveniently and practically, so an investigation is often restricted to one or more samples drawn from it. A
well-chosen sample will contain most of the information about a particular population parameter but the relation
between the sample and the population must be such as to allow true inferences to be made about a population from
that sample.
Sampling is a process used in statistical analysis in which a predetermined number of observations are taken
from a larger population. There are various sampling methods that allow all the units in the population to have an equal
chance of being selected. These sampling methods are discussed below.
1. Simple Random Sampling
Each element of the population has an equal chance of being selected. There are no rules that dictate where and
how you will start the selection process, as long as you do not intentionally look for a specific number. In this
method, the samples can be selected through:
a. Lottery Method
Every member is assigned a unique number. These numbers are put in a jar and thoroughly mixed. After that,
the researcher picks some numbers without looking at it and those people are included in the study.
Example:
N
If Population (N) = 2000, sample size (n) = 500, K = , so k = 2000/500 = 4th. Use a table of random numbers to
n
determine the starting point for selecting every 4th subject. With list of the 2000 subjects in the sampling frame, go to
the starting point, and select every 4th name on the list until the sample size is reached. Probably will have to return to
the beginning of the list to complete the selection of the sample.
3. Stratified Random Sampling
This can be done by first dividing the elements in the population into strata and then samples
are randomly selected from each stratum ensuring that each selected element is proportionately
represented in the total population. Sampling fraction: n/N (desired sample size divided by the
population size)
Example: Assume you have a population of 1000 students with 500 from grade school, 300 from high
school, and 200 from senior high school. Determine the how many samples you need or you can use the
Slovin’s Formula or any other formula for computing the sample size. In this example, Slovin’s Formula is
11
used and a sample size of 400 is computed. To get the samples from each stratum, divide 400 by 1000
Page
and the answer is 0.4. Multiply 0.4 to each of the number of students per stratum (e.g. 0.4 x 500 grade
school is 200).
4. Clustered Sampling
A multistage sampling method adopted when it is either impossible or impractical to compile an
exhaustive list of elements found in the target population. The whole population is subdivided into
clusters, or groups, and random samples are then collected from each group.
Example: A researcher wants to survey about academic performance of high school students in the
municipality of Alubijid. He can divide the entire population into different clusters (barangays). Then,
the researcher selects a number of barangays depending on his research through simple or
systematic random sampling. The researcher could draw random samples from the selected
barangay through simple random sampling or take them all.
Identify the type of sampling method used by the researcher in each situation: Simple Random Sampling,
Systematic Sampling, Stratified Sampling, or Cluster Sampling.
_______________1. A researcher chose the participants of his study by selecting every 8th member of the population.
_______________2. A researcher interviewed all the teachers in each of 15 randomly selected private schools in
Cagayan de Oro City.
_______________3. A researcher interviewed people from each barangay in the municipality of Alubijid for his
research on population.
_______________4. A researcher is doing a research work on the students’ reaction to the newly implemented
curriculum in mathematics and interviewed every 5th student entering the gate of the school.
_______________5. A researcher randomly selected 15 barangays in a town for her study. She did this by writing
the names of each barangay on a piece of paper which she folded and put in a bowl then she
draw 15 pieces of paper from the bowl.
_______________6. A researcher selected a sample of n=300 from a population of 900 by using the Table of
Random Numbers.
_______________7. A researcher interviewed all the nurses in each of the 5 randomly selected private hospitals in
Northern Mindanao.
_______________8. A statistician selected a sample of n=500 high school students from a private school with
2,500 students. He randomly selected the students from each grade level.
_______________9. A Statistics student did a research on the time spent by Grade 11 and 12 students in playing
mobile legends. He randomly selected his subjects by using the Table of Random Numbers.
_______________10. A teacher conducted a study in her school to determine who were better in mathematics: the
males or the females.
Analysis: The first case contains a measure for a sample. It indicated that 40% of the 1600 samples said yes. On the
other hand, the second case contains a measure from a population because an average of 2 hours per day spend in
training is from all of the people in the Mathematics department.
A measure that describes a population is called parameter. A measure that describes a sample is called statistic.
In population parameter, mean is represented by μ, σ 2 represents variance, and σ represents standard deviation. In
12
Page
sample statistics, mean is represented by x , s2 represents variance, and srepresents standard deviation. Data from the
sample are used to calculate statistics. Statistics are estimates of the corresponding parameters.
Examples:
Parameter: 50% of the 24 Philippine senators agreed to support a certain measure.
Statistic: A researcher found out that 25% of the students in the Philippines reported to have internet connection at
home.
Explanation:
1. The example indicates a parameter since there are only 24 senators in the Philippines.
2. Rresearchers cannot ask millions of students if they have internet connection at home, so they take samples
from the target population and calculate.
Let’s Summarize!
Parameter is a measure that describes a population. Parameter includes population mean μ,
population variance σ 2, and population standard deviation σ .
Statistic is a measure that describes a sample. Statistics include sample mean x , sample variance s2
and sample standard deviation s .
________________________________________________________________________________________
________________________________________________________________________________________
________________________________________________________________________________________
__________________________________________________________________
__________1.The researcher found out that the 29 senior high school teachers of a certain school spend an
average of 2 hours preparing their lessons.
__________2.Based on a sample of 900 elementary students, it was found out 30% of them do not know
multiplication facts.
__________3.Based on sample of 1,200 surveyed students, it was found out that 20% of them needed
financial assistance.
__________4.A teacher surveyed all 50 students under his advisory class about their learning styles and
found out that most them are visual learners.
__________5.The Statistics teacher wants to know the average score of the students in the final exam. He
randomly selected 35 students and obtained an average score of 43.
C. This time, let us make a probability distribution of the sample means. This probability distribution is called the
sampling distribution of the sample means.
Observe that all sample means appeared only one; thus, their probability is P(x)= 1/10 or 0.1
A sampling distribution of sample mean is a frequency distribution using the means computed from all possible
random samples of a specific size taken from a population.
End Firm-up
In this section of your module you have learned about the basic concepts of probability distribution.
Deepen
Now let’s proceed to the next section of your module and gain more knowledge and understanding about the lesson.
Activity 19 Solve
Using what you have learned on activity 6 answer the following.
A. Determine the mean or expected value of each Random Variable. Write your answer at the back of your
module.
17
Page
B. Determine the Variance and Standard Deviation of each random variable. Write your answer at the back of your
module.
19
Page
2.
Page 20
The Percentile
A percentile is a measure used in statistics indicating the value below which
a given percentage of observations in a group of observations fall.
Imagine you took a standardized test and you scored 91 at the 89th percentile. This means that 89% of the
examiners scored lower than 91 and 11% scored higher than 91. This explains that 89th percentile is located where 89%
of the total population lies below and 11% lies above that point. To illustrate the 89th percentile of the normal curve
here are the steps:
1. Express the given percentage as probability, remember 89% is the same as 0.8900.
2. Using the z-table (Cumulative Distribution Function (CDF) of the Standard Normal Curve), locate the area of 0.8900.
3. There is no area corresponding exactly to 0.8900. It is between of 0.8888 with a corresponding z - score of 1.22 and
0.8907 with a corresponding z - score of 1.23. The nearest value to 0.8900 is 0.8888 and therefore, the distribution
lies below z = 1.22.
4. Construct a normal curve and shade the region to the left of 1.22.
21
Page
Activity 21
Read the instructions given before answering the following activity.
A. Complete the table by converting the given values into z-scores. Then find the corresponding area using the z-
table.
B. Compute the following probabilities using the standard normal curve. Construct a curve then shade the region
corresponding to the area. Use the space at the back of your module.
Scaffold 3 Activity 23
Complete the following table and solve for the unknowns. A population of a certain ethnic group is consisting of
three numbers (3, 5, 7). If samples of size 2 will be taken with replacements, complete the following table, and answer
the questions that follow.
Below is the t-table which you can use to find the t-value
23
Page
Illustrative example:
3. Identify the t-value of the following percentiles given the number of samples.
a. 95th percentile, N= 30
b. 90th percentile, N=20
Solution:
1. To identify the t-value, identify first the degree of freedom using the formula
df= n-1 where n is the sample size.
df = n-1
df = 7-1
df = 6
Locate the t-value on the t-table using the degree of freedom and the area (𝛼)
24
Page
Therefore, the t-value with sample size of 7 and has 𝛼 = 0.05 is 1.943.
3. a. The 95th percentile is the number where 95% of the values lie below it and 5% lie above it, so you want the right-
tail area to be 0.05. Move across the row, find the column for 0.05, and then locate the t-value using n=30 or df = 29.
So therefore, the t-value of P95 with n= 30 is 1.699.
b. The 90th percentile is the number where 90% of the values lie below it and 10% lie above it, so you want the right-tail
area to be 0.01. Move across the row, find the column for 0.05, and then locate the t-value using n=20 or df = 19.
try it!
Complete the table below by identifying the degree of freedom, right-tailed area and the t-value for each percentiles
and the sample size.
Percentile n df Right tailed area t
1. 60th percentile
25
Page
2. 75th percentile
3. 25th percentile
4. 90th percentile
5. 95th percentile
END OF DEEPEN
After answering all the activities in this section, you are now ready to do the performance task for this lesson and apply the
knowledge you have learned before.
Transfer
ACTIVITY 26 THE T-VALUE
A researcher conducts a diagnostic examination to 20 students. The following are the interpretation of the
diagnostic examination.
1. James got a score which is 60% higher than those who took the diagnostic test.
2. Rowena has a percentile rank of 90.
3. Jimuel got a score which is only 5% higher than those who took the diagnostic test.
4. Joanna has a percentile rank of 75.
5. Lester is 90% lower than those who took the diagnostic test.15
Identify the t-value of the percentiles of the following by filling up the table and answer the following questions:
Student T-Value
1. Joanna
2. James
3. Jimuel
4. Lester
5. Rowena
while the word “thesis” means “placing”. The plural of “hypothesis” is “hypotheses”
Page
As you go through this lesson keep thinking about What is the importance of hypothesis testing and correlation
analysis in real life problems?
CONTENT STANDARD
The learner demonstrates understanding of key concepts of tests of hypotheses on the population mean and
population proportion.
The learner demonstrates understanding of key concepts of correlation and regression analyses.
PERFORMANCE STANDARD
The learner is able to perform appropriate tests of hypotheses involving the population mean and population
proportion to make inferences in real-life problems in different disciplines.
concepts of correlation and regression analyses. regression analyses on real-life problems in different disciplines.
identifies the appropriate rejection region for a given level of significance when: (a) the population M11/12SP-IVc-1
variance is assumed to be known; (b) the population variance is assumed to be unknown; and (c)
the Central Limit Theorem is to be used
identifies the appropriate form of the test-statistic when the Central Limit Theorem is to be used. M11/12SP-IVe-3
identifies the appropriate rejection region for a given level of significance when the Central Limit M11/12SP-IVe-4
Theorem is to be used
formulates the appropriate null and alternative hypotheses on a population mean. M11/12SP-IVb-1
computes for the test-statistic value (population mean). M11/12SP-IVd-1
draws conclusion about the population mean based on the test-statistic value and the rejection M11/12SP-IVd-2
region.
solves problems involving test of hypothesis on the population mean. M11/12SP-IVe-1
formulates the appropriate null and alternative hypotheses on a population proportion. M11/12SP-IVe-2
computes for the test-statistic value (population proportion). M11/12SP-IVf-1
draws conclusion about the population proportion based on the test-statistic value and the M11/12SP-IVf-2
rejection region.
solves problems involving test of hypothesis on the population proportion M11/12SP-IVf-g-1
illustrates the nature of bivariate data. M11/12SP-IVf-g-1
constructs a scatter plot. M11/12SP-IVg-3
describes shape (form), trend (direction), and variation (strength) based on a scatter plot. M11/12SP-IVg-4
calculates the Pearson’s sample correlation coefficient. M11/12SP-IVh-2
identifies the independent and dependent variables. M11/12SP-IVi-1
solves problems involving correlation analysis. M11/12SP-IVh-3
calculates the slope and y-intercept of the regression line. M11/12SP-IVi-3
interprets the calculated slope and y-intercept of the regression line. M11/12SP-IVi-4
predicts the value of the dependent variable given the value of the independent variable. M11/12SP-IVj-1
solves problems involving regression analysis. M11/12SP-IVj-2
EXPLORE
In the following activity you will be introduce to the lesson by watching the video on the link below.
End of Explore
Now you already have an idea what is the lesson all about proceed to the next part of your module.
Firm up
In this section of your module you are going to learn new things such as hypothesis testing, t-value, z-score and many other
things related to this lesson.
REMINDER:
TO HELP YOU IN ANSWERING THE ACTIVITIES THROUGH OUT THIS LESSON
MAKE SURE THAT YOU HAVE READ THE GIVEN NOTES FOR THIS LESSON AND
WATCH THE VIDEOS ON THE PLAYLIST BELOW:
Statistics and Probability||Fourth Quarter by WOW MATH
[Link]
𝐻𝑜:
1. A medical trial is conducted to test whether or not a new medicine reduces uric acid by 50%.
𝐻𝑎:
_______________________________________________________________________________________________
_______________________________________________________________________________________________
𝐻𝑜:
2. We want to test whether the general average of students in Math is different from 80%.
𝐻𝑎:
_______________________________________________________________________________________________
_______________________________________________________________________________________________
𝐻𝑜:
3. We want to test whether the mean height of Grade 8 students is 58 inches.
𝐻𝑎:
_______________________________________________________________________________________________
_______________________________________________________________________________________________
𝐻𝑜:
4. We want to test if LPIHS students take more than four years to graduate from high school, on the average.
𝐻𝑎:
_______________________________________________________________________________________________
_______________________________________________________________________________________________
𝐻𝑜: ______________________________________________________________________________________________
5. We want to test if it takes less than 60 minutes to answer the quarterly test in Calculus.
𝐻𝑎: ______________________________________________________________________________________________
6. A medical test is conducted to determine whether or not a new vaccine
𝐻𝑜: ______________________________________________________________________________________________
reduces the complications of dengue fever.
𝐻𝑎: ______________________________________________________________________________________________
End of firm up
You have gained the basic concepts of the lesson in this part of the module. Now proceed to the next part.
Deepen
In this section you will help you to have a better understanding towards this lesson by answering the following activities.
1. A school principal claims that the Grade 11 students in her high school have a mean score of 92.
2. A medicine company has manufactured and claimed that their medicine pill contains an average of 14mg of active
ingredient.
Parameter: average Null Hypothesis: ___________
Claim: average of 14mg Alternative Hypothesis: ___________
B. Write the null hypothesis and alternative hypothesis in notations for each given situation.
1. Mrs. Dela Cruz claims that her students scored an average of 91 in their Mathematics quiz. The master teacher wants
𝐻𝑜: _________________________________________________
to know whether the teacher’s claim is acceptable or not.
𝐻𝑎: _________________________________________________
2. A car manufacturer claims that the mean selling price of all cars manufactured is only ₱150,000. A consumer agency
𝐻𝑜: _________________________________________________
wants to test whether the mean selling price of all the cars manufactured exceeds₱150, 000.
𝐻𝑎: _________________________________________________
3. A manufacturer of soft drinks claims that all labeled 1.5-liter bottles contain an average of 1.49 liters of soft drinks. A
𝐻𝑜: _________________________________________________
retailer wishes to test whether the mean amount of soft drinks in labeled 1.5-liter bottle is less than 1.49 liters.
𝐻𝑎: _________________________________________________
End of deepen
After answering the different activity’s in this part of the module you can now apply what you have learned by doing the
performance task below.
Transfer
Activity 13 Statistical Analysis
You are a Senior High School student and you want to conduct a research to determine whether there is a
correlation between the age of tricycles (in years) in a certain city and the mileage it runs (in kilometers). Using the data
below calculate the Pearson’s sample correlation coefficient then analyze and interpret the result.
*Show your solution at the back of your module.
kilometers).
Key answers
Post test
1. d 5. d 9. d
2. b 6. c 10. b
3. a 7. b
4. c 8. a
GLOSSARY
Alternative hypothesis is a statement denoted by H 1, is a statement that states that there is a difference
Critical region, also known as rejection region, is a range of values that corresponds to the rejection of the null
hypothesis
Critical values of the tests of hypotheses are the number or numbers that determine the rejection region
Hypothesis is an assumption or conjecture about a population parameter which may or may
Null hypothesis is a statement denoted by H 0, that states that there is no difference, no changes, nothing happened, no
relationship between a parameter and a specific value, or the independent variable has no effect on the dependent
variable.
Null was defined as having no value or amounting to nothing
Sample Space is the set of all possible outcomes of an experiment.
Significance is defined as the quality of being statistically significant
significance level, refers to a criterion of judgment upon which a decision is made regarding the value stated in a null
hypothesis
Statistical Experiment has been used to describe any process by which one or more chance measurements are
obtained.
REFERENCE
DEPED Modules
Belecina, R., Baccay, E., & Mateo, E. (2016). Statistics and Probability. Manila: Rex Book Store, Inc.
Mercado, J. & Orines, F. (2016). Next Century Mathematics: Statistics and Probability. Manila: Phoenix Publishing House,
Inc.
[Link]
[Link]
32
Page
POST TEST
NAME: ___________________________________________________________________________________
Section/Strand: ____________________________________________________________________________
4. Find the mean of the probability distribution involving the random variable X that gives the
number of heads that appear after tossing four coins once.
A. 4 B. 3 C. 2 D. 1
5. Samples of size 25 are selected from a population with a mean of 40 and a standard deviation
of 7.5. What is the mean of the sampling distribution of sample means?
A. 7.5 B. 8 C. 25 D. 40
According to a study done last year the average monthly expenses for cell phone loads of
high school students in Manila was P350.00. A Statistics student believes that this amount has
increased since January of this year. Is there a reason to believe that this amount has really
increased if a random sample of 60students has an average monthly expenses for cell phone
loads of P380.00? Use a 0.05 level of significance. Assume that the population standard deviation
is P77.00.
10. This is the measure of degree of the relationship between two variables.
a. Johnson’s correlation coefficient
b. Pearson’s correlation coefficient
c. standard correlation coefficient
d. Spearman Rho coefficient
33
Page