Discrete Distributions
Discrete Distributions
SIVAKUMARAN
ASSISTANT PROFESSOR OF STATISTICS
PROBABILITY AND STATISTICS(20MCA25C)
DISCRETE DISTRIBUTIONS
BINOMIAL DISTRIBUTION
Introduction:
In this chapter we will discuss the theoretical discrete distributions in which variables
are distributed according to some definite probability law, which can be expressed
mathematically. The Binomial distribution is a discrete distribution expressing the probability
of a set of dichotomous alternative i.e., success or failure. This distribution has been used to
describe a wide variety of process in business and social sciences as well as other areas.
Bernoulli Distribution:
A random variable X which takes two values 0 and 1 with probabilities q and p i.e.,
P (x = 1) = p and P (x = 0) = q, q = 1 – p, is called a Bernoulli variate and is said to be
a Bernoulli Distribution, where p and q takes the probabilities for success and failure
respectively. It is discovered by Swiss Mathematician James Bernoulli (1654-1705).
Examples of Bernoulli’ s Trails are:
Binomial Distribution:
The problems relating to tossing of coins or throwing of dice or drawing cards from a
pack of cards with replacement lead to binomial probability distribution.
1.1.2 Characteristics of Binomial Distribution:
3. The mode of the binomial distribution is that value of the variable which occurs with the
largest probability. It may have either one or two modes.
4. If two independent random variables X and Y follow binomial distribution with parameter
(n1, p) and (n2, p) respectively, then their sum (X + Y) also follows Binomial distribution
with parameter (n1 + n2, p).
5. If n independent trials are repeated N times, N sets of n trials are obtained and the expected
frequency of x success is N(nCx px qn-x). The expected frequencies of 0, 1, 2… n success
are the successive terms of the binomial distribution of N(p + q)n.
Example 1:
Comment on the following: “ The mean of a binomial distribution is 5 and its variance
is 9”
Solution:
The parameters of the binomial distribution are n and p
We have mean ⇒ np = 5
Variance ⇒ npq = 9
npq 9
∴ q= =
np 5
9
q = >1
5
Which is not admissible since q cannot exceed unity. Hence the given statement is
wrong.
Example :
Eight coins are tossed simultaneously. Find the probability of getting atleast six heads.
Solution:
If the random variable X denotes the number of heads, then the probability of a
success in n trials is given by
1 x 1 8−x 18
= 8C x = 8C x
2 2 2
1
= 8Cx
28
1
= [8C + 8C 7 + 8C ]
6 8
28
1
= [28 + 8 + 1] = 37
8 256
2
x = 0, 1, 2, .... , n
Example 3:
p = Probability of
getting a head
1
=
2
q = Probability of not
1
getting a head =
2
The probability of
getting x heads
throwing 10 coins
simultaneously is given
by
P(X = x) = nC x p x q n−x ,
1 x 1 10−x 1
= 10C x = 10 10Cx
2 2 2
7
210 =
120
1024
210
210
1
= 1− [45 + 10 + 1]
210
= 1 − 56
1024
968
=
1024
Example :
20 wrist watches in a box of 100 are defective. If 10 watches are selected at random,
find the probability that (i) 10 are defective (ii) 10 are good (iii) at least one watch is defective
(iv) at most 3 are defective.
Solution:
1 x 4 10−x
= 10C x
5 5
1 1
= 1. 10
.1 =
5 510
5 5
Example :
With the usual notation find p for binomial random variable X if n = 6 and
9P(X = 4) = P(X = 2)
Solution:
The probability mass function of binomial random variable X is given by
P(X = x) = nCx px qn-x , x = 0 , 1, 2, ...,n
X f fx
0 36 0
1 40 40
2 22 44
3 2 6
∑f = 100 ∑fx = 90
3 − 0 p
f(1) = × 34.3
0 + 1 q
= 3 × (0.43) × 34.3 = 44.247
3 −1 p
f(2) = f(1)
1 + 1 q
2
= (0.43) × 44.247
2
= 19.03
3 − 2 p
f(3) = f(2)
2 + 1 q
1
= (0.43) × 19.03
3
= 2.727
Observed
36 40 22 2 100
frquencies
Expected
34 44 19 3 100
frequencies
Example 7:
4 coins are tossed and number of heads noted. The experiment is repeated 200 times
and the following distribution is obtained .
x: Number of heads 0 1 2 3 4
f: frequencies 62 85 40 11 2
Solution :
X 0 1 2 3 4 Total
f 62 85 40 11 2 200
fx 0 85 80 33 8 206
Total
Observed
62 85 40 11 2 200
frquencies
Expected
61 84 44 10 1 200
frequencies
POISSON DISTRIBUTION:
Introduction:
Since number of trials is very large and the probability of success p is very small, it is
clear that the event is a rare event. Therefore Poisson distribution relates to rare events.
Note:
1 1 1
1) e is given by e = 1 + + + + .......= 2.71828
1! 2! 3!
e−m m 0
2) P (X = 0) = , 0! = 1 and 1! = 1
0!
e−m m1
3) P (X = 1) =
1!
Some examples of Poisson variates are :
Conditions:
Poisson distribution is the limiting case of binomial distribution under the following
conditions:
1. The number of trials n is indefinitely large i.e., n → ∞
i) The occurrence or non- occurrence of an event does not influence the occurrence
or non-occurrence of any other event.
ii) The probability of success for a short time interval or a small region of space is
proportional to the length of the time interval or space as the case may be.
iii) The probability of the happening of more than one event is a very small interval
is negligible.
Example :
Suppose on an average 1 house in 1000 in a certain district has a fire during a year. If
there are 2000 houses in that district, what is the probability that exactly 5 houses will have a
fire during the year? [given that e-2 = 0.13534]
1
Mean, x = np, n = 2000 and p =
1000
1
= 2000 ×
1000
m=2
e−2 25
The Poisson distribution is
5!
−m
ex
m
P(X = x) =
x!
∴P(X = 5) =
(0.13534) × 32
=
120
= 0.036
Example :
Solution:
e−m m x
Poisson distribution is given by P(X = x) =
x!
Given that 3P(x = 2) = P(x = 4)
e−m m 2 e−m m 4
3. =
2! 4!
3 × 4!
m2 =
2!
∴m = ± 6
Example :
e−m . m x
iii) By using the formula P(X = x) =
x!
Substituting x = 0, P(0) = e-m
Then f(0) = N×P(0)
The other expected frequencies will be obtained by using the recurrence formula
m
f(x + 1) = f(x); x = 0,1, 2,.......
x +1
Example :
xi fi fixi
0 211 0
1 90 90
2 19 38
3 5 15
4 0 0
N = 325 ∑fx = 143
∑ fx
Mean = x =
N
143
= = 0.44 = m
325
Then e−m ⇒ e−0.44 = 0.6440
The other expected frequencies will be obtained by using the recurrence formula
m
f(x + 1) = f(x) . By putting x = 0,1,2,3 we get the expected frequencies and are
x +1
calculated as follows.
f(1) = 0.44 × 209.43 = 92.15
0.44
f(2) = × 92.15 = 20.27
2
0.44
f(3) = × 20.27 = 2.97
3
0.44
f(4) = × 2.97 = 0.33
4
Total
Observed
211 90 19 5 0 325
frquencies
Expected
210 92 20 3 0 325
frequencies
Example :
Find mean and variance to the following data which gives the frequency of the number
of deaths due to horse kick in 10 corps per army per annum over twenty years.
X 0 1 2 3 4 Total
F 109 65 22 3 1 200
Solution :
xi fi fixi fixi2
0 109 0 0
1 65 65 65
2 22 44 88
3 3 9 27
4 1 4 16
Total N = 200 ∑fx = 122 ∑fx2 = 196
∑ fixi
Mean = x =
N
122
=
200
= 0.61
∑ fixi2
Variance = σ = 2 − (x 2
N
196
= − (0.61)2
200
= 0.61
No. of sets 79 18 2 1 0
x f fx
0 79 0
1 18 18
2 2 4
3 1 3
4 0 0
N = 100 ∑fx = 25
∑ fx
Mean = x =
N
25
=
100
∴m = 0.25
By putting x = 0,1,2,3, we get the expected frequencies and are calculated as follows.
m
f(1) = f(0 + 1) = f(0)
0 +1
0.25
f(1) = (77.9)
1
= 19.46
0.25
f(2) = (19.46)
2
= 2.43
0.25
f(3) = (2.43)
3
= 0.203
0.25
f(4) = (0.203)
4
= 0.013
Example:
Solution:
∴q = 1 − p = 1 − 0.0125
= 0.9875
n = 30
P (x ≥ 2) = 1 – P(x < 2)
= 1 – { 0.6839 + 0.2597}
= 1 – 0.9436
P( x ≥ 2) = 0.0564
(ii) By using Poisson distribution:
The probability mass function of Poisson distribution is given by
e−m m x
P(x) =
x!
Mean = m = np
= 30 (0.0125) = 0.375
P(x ≥ 2) = 1 − P(x < 2)
= 1 − e−0.375(1 + 0.375)
= 1 − (0.6873) (1.375) = 1 − 0.945 = 0.055
CONTINUOUS DISTRIBUTIONS:
NORMAL DISTRIBUTION:
Introduction:
In the preceding sections we have discussed the discrete distributions, the Binomial and
Poisson distribution.
In this section we deal with the most important continuous distribution, known as
normal probability distribution or simply normal distribution. It is important for the reason that
it plays a vital role in the theoretical and applied statistics.
A continuous random variable X is said to follow normal distribution with mean μ and
standard deviation σ, if its probability density function
Note:
The mean μ and standard deviation σ are called the parameters of Normal distribution.
The normal distribution is expressed by X ~ N (μ, σ2)
Condition of Normal Distribution:
i) Normal distribution is a limiting form of the binomial distribution under the following
conditions.
ii) Normal distribution can also be obtained as a limiting form of Poisson distribution with
parameter m → ∞
iii) Constants of normal distribution are mean = μ, variation = σ2, Standard deviation = σ.
The curve representing the normal distribution is called the normal probability curve.
The curve is symmetrical about the mean (μ), bell-shaped and the two tails on the right and
left sides of the mean extends to the infinity. The shape of the curve is shown in the following
figure.
-∞ x=µ
11. If X and Y are independent normal variates with mean μ1 and μ2, and variance σ1 2
and σ 22 respectively then their sum (X + Y) is also a normal variate with mean
(μ1 + μ2) and variance (σ 12 + σ 22)
12. Area Property P (μ - σ < × < μ + σ) = 0.6826
distribution with mean 0 and standard deviation 1 i.e., Z ~ N(0,1). The standard normal
−1 2
1 z
distribution is given by φ (z) = e2 ; – ∞ < z < ∞ The advantage of the above function
2π
is that it doesn’ t contain any parameter. This enable us to compute the area under the normal
probability curve.
Area properties of Normal curve:
The total area under the normal probability curve is 1. The curve is also called standard
probability curve. The area under the curve between the ordinates at x = a and x = b where a <
b, represents the probabilities that x lies between x = a and x = b i.e., P(a ≤ x ≤ b)
X −µ
To find any probability value of x, we first standardize it by using Z = , and use
σ
the area probability normal table. (given in the Appendix).
For Example: The probability that the normal random variable x to lie in the interval
(μ – σ , μ + σ) is given by
P (μ – σ < x < μ + σ) = P (–1 ≤ z ≤ 1 )
= 0.6826
= 2 (0.49865) = 0.9973
The probability that a normal variate x lies outside the range µ ± 3σ is given by
P(|x – μ| > 3σ) = P(|z| >3)
= 1 – P(-3 ≤ z ≤ 3)
= 1 – 0.9773 = 0.0027
Thus we expect that the values in a normal probability curve will lie between the range
μ ± 3σ, though theoretically it range from – ∞ to ∞.
Example:
Find the probability that the standard normal variate lies between 0 and 1.56
Solution:
P (–1.96 < z < 1.5) = P (–1.96 < z < 0) + P(0 < z < 1.5)
= P (0 < z < 1.96) + P(0 < z < 1.5)
= 0.4750 + 0.4332 (from the table)
= 0.9082
Example :
Given a normal distribution with μ = 50 and σ = 8, find the probability that x assumes
a value between 42 and 64.
Solution:
2
8 8
= 0 .8012
Example:
Students of a class were given an aptitude test. Their marks were found to be normally
distributed with mean 60 and standard deviation 5. What percentage of students scored.
i) More than 60 marks (ii) Less than 56 marks (iii) Between 45 and 65 marks
Solution:
= 0.2119
Hence the percentage of students score less than 56 marks is 0.2119(100) = 21.19 %
45 − 60 −15
iii) If X = 45, Z = = = −3
5 5
65 − 60 5
X = 65 then z = = =1
5 5
Example:
In a normal distribution 31 % of the items are under 45 and 8 % are over 64. Find the
mean and variance of the distribution.
Solution:
Let x denotes the items are given and it follows the normal distribution with mean μ
and standard deviation σ
The points x = 45 and x = 64 are located as shown in the figure.
i) Since 31 % of items are under x = 45, position of x into the left of the ordinate x = μ
ii) Since 8 % of items are above x = 64 , position of this x is to the right of ordinate x = μ
0.19
0.42
0.31 0.08
-∞ z = -z1 z = 0 z = z2 +∞
x = 45 x = µ x = 64
x−µ 45 − µ
When x = 45, z = = = −z (say)
1
σ σ
Exercise
I. Choose the best answer:
1
(a) 5 2π (b) (c) (d) 5
5 2π
27. Normal distribution has
(a) no mode (b) only one mode (c) two modes (d) many mode
28. For the normal distribution
(a) mean = median = mode (b) mean < median < mode
(c) mean > median > mode (d) mean > median < mode
1 (x−30)2
1 −
29. Probability density function of normal variable P(X = x) = e 2 25 ;-α<x<α
then mean and variance are 5 2π
2 1
9
36. + refers the binomial distribution and its standard deviation is .
3 3
37. In a binomial distribution if number of trials to be large and probability of success
bezero, then the distribution becomes .
38. The mean and variance are in Poisson distribution.
39. The mean of Poisson distribution is 0.49 and its standard deviation is .
40. In Poisson distribution, the recurrence formula to calculate expected frequencies is
.
∑ fx2
41. The formula −x( )2 is used to find .
N
42. In a normal distribution, mean takes the values from to .
43. When μ = 0 and σ = 1 the normal distribution is called .
44. P(– ∞ < z < 0) covers the area .
45. If μ = 1200 and σ = 400 then the standard normal variate z for x = 800 is .
46. At x = μ ± σ are called as in a normal distribution.
47. P(-3 < z < 3) takes the value
48. X axis be the to the normal curve.
51. In a binomial distribution the mean and standard deviation are 12 and 2 respectively.
Findn and p.
52. A pair of dice is thrown 4 times. If getting a doublet is considered a success, find the
probability of 2 success.
57. For the binomial distribution (0.68 + 0.32)10 find the probability of 2 success.
58. Find the mean of binomial distribution of the probability of occurrence of an event is
1/5and the total number of trials is 100.
59. If on an average 8 ships out of 10 arrive safely at a port, find the mean and standard
deviation of the number of ships arriving safely out of total of 1600 ships.
60. The probability of the evening college student will be a graduate is 0.4. Determine
theprobability that out of 5 students (i) none (ii) one (iii) atleast one will be a graduate.
61. Four coins are tossed simultaneously. What is the probability of getting i) 2 heads and
2tails ii) atleast 2 heads iii) atleast one head.
63. 5 dice are thrown together 96 times. The numbers of getting 4, 5 or 6 in the experiment
is given below. Calculate the expected frequencies and compare the standard deviation of
the expected frequencies and observed frequencies.
Getting 4 ,5 or 6 : 0 1 2 3 4 5
Frequency : 1 10 24 35 18 8
64. Fit a binomial distribution for the following data and find the expected frequencies.
X: 0 1 2 3 4
f 18 35 30 13 4
65. Eight coins are tossed together 256 times. Number of heads observed at each toss is
recorded and the results are given below. Find the expected frequencies. What are the
theoretical value of mean and standard deviation? Calculate also mean and standard
deviation of the observed frequencies.
Number of heads: 0 1 2 3 4 5 6 7 8
Frequencies : 2 6 39 52 67 56 32 10 1
70. A variable x follows a Poisson distribution with mean 6 calculate i) P(x = 0) ii) P(x = 2)
71. The variance of a Poisson Distribution is 0.5. Find P(x = 3). [e-0.5 = 0.6065]
72. If a random variable X follows Poisson distribution such that P(x =1) = P(x = 2) find (a)
the mean of the distribution and P(x = 0). [e-2 = 0.1353]
73. If 3% of bulbs manufactured by a company are defective then find the probability in a
sample of 100 bulbs exactly five bulbs are defective.
74. It is known from the past experience that in a certain plant there are on the average 4
industrial accidents per month. Find the probability that in a given year there will be less
than 3 accidents. Assume Poisson distribution.[e-4 = 0.0183]
75. A manufacturer of television sets known that of an average 5% of this product is defective.
He sells television sets in consignment of 100 and guarantees that not more than 4 sets will
be defective. What is the probability that a television set will fail to meet the guaranteed
quality? [e-5 = 0.0067]
76. One fifth percent of the blades produced by a blade manufacturing factory turns out to
be a defective. The blades are supplied in pockets of 10. Use Poisson distribution to
calculate the approximate number of pockets containing i) no defective (ii) all defective
(iii) two defective blades respectively in a consignment of 1,00,000 pockets.
77. A factory employing a huge number of workers find that over a period of time, average
absentee rate is three workers per shift. Calculate the probability that in a given shift
i) exactly 2 workers (ii) more than 4 workers will be absent.
78. A manufacturer who produces medicine bottles finds that 0.1 % of the bottles are
defective. They are packed in boxes containing 500 bottles. A drag manufactures buy 100
boxes from the producer of bottles. Using Poisson distribution find how many boxes will
contain (i) no defective ii) exactly 2 (iii) atleast 2 defective.
79. The distribution of typing mistakes committed by a typist is given below:
81. The following tables given that number of days in a 50, days period during which
automatically accidents occurred in city. Fit a Poisson distribution to the data
No of accidents : 0 1 2 3 4
No of days : 21 18 7 3 1
82. Find the probability that standard normal variate lies between 0.78 and 2.75.
83. Find the area under the normal curve between z = 0 and z = 1.75.
84. Find the area under the normal curve between z = -1.5 and z = 2.6.
86. Find the area under the normal curve which lies to the right of z = 2.70.
87. A normal distribution has mean = 50 and standard deviation is 8. Find the probability that
x assumes a value between 34 and 62.
88. A normal distribution has mean = 20 and S.D = 10. Find area between x =15 and x = 40.
89. Given a normal curve with mean 30 and standard deviation 5. Find the area under the
curve between 26 and 40
90. The customer accounts of a certain departmental store have an average balance of Rs.1200
and a standard deviation of Rs.400. Assuming that the account balances are normally
distributed. (i) what percentage of the accounts is over Rs.1500? (ii) What percentage of
the accounts is between Rs.1000 and Rs.1500? iii) What percentage of the accounts is
below Rs.1500?
91. The weekly remuneration paid to 100 lecturers coaching for professional entrance
examinations are normally distributed with mean Rs.700 and standard deviation Rs.50.
Estimate the number of lecturers whose remuneration will be i) between Rs.700
andRs.720 ii) more than Rs.750 iii) less than Rs.630.
92. x is normally distributed with mean 12 and standard deviation 4. Find the probability of
the following i) x ≥ 20 ii) x ≤ 20 iii) 0 < x < 12
93. A sample of 100 dry cells tested to find the length of life produced the following results
μ =12 hrs, σ = 3 hrs. Assuming the data, to be normally distributed. What percentage of
battery cells are expressed to have a life. i) more than 15 hrs ii) between 10 and 14 hrs as
iii) less than 6 hrs?.
94. Find the mean and standard deviation of marks in an examination where 44 % of
thecandidates obtained marks below 55 and 6 % got above 80 marks.
95. In a normal distribution 7 % of the items are under 35 and 89 % of the items are under
63. Find its mean as standard deviation.
Note: For fitting a binomial distribution in the problem itself, if it is given that the coin is
unbiased, male and female births are equally probable, then we consider p = q = ½ . All other
cases we have to find the value of p from the mean value of the given data.
9-3. RECTANGULAR (OR UNIFORM) DISTRIBUTION
Definition. A random variable X is said to have a continuous rectangular (uniform)
distribution over an interval (a, b), i.e., (- o<a<b< o), ifits [Link] given by :
= - a i f a <x<b . (9.19)
fx;a,b)
0, otherwise
Remarks 1. a and b, (a < b) are the two parameters of the distribution. The distributin is
called uniform distribution on (a, b) since it assumes a constant (uniform) value for all x in (a, b).
2. The distribution is also known as rectangular distribution, since the curve y f(x)
describes a rectangle o v e r the x-axis and between the ordinates at x =a and x b. =
XR [a b).
4. The cumulative distribution function F(x) is given by:
0 xSa
F )=-- a<x<b . (9.19a)
1 x2b
f() Fl
1
1
b-a
b
o
6. For a rectangular or uniform variate X in (-a, a), the [Link]. is given by:
f e r ) - 2 - a < x <a
0, 0, otherwise ..9-196)
9-3-1. Moments of Rectangular Distribution. Let X - U [a, b].
9-20a)
and
) - ( + ab +a-)
Variance
H-H2=(6+ ab +)-{ %b +a= b-a (9-206
9.3-2. M.G.F. of Rectangular
Distribution is given by:
M)
-"R) da =|-a da= - eat
+0 (9-20c)
9.3-3. Characteristic Function of
Rectangular Distribution is given
ox)= ei dx =eibt-eiat by:
t(6t#0 .. (9.20d)
9-3-4. Mean Deviati
about Mean, n of
n =Elx- Mean Rectangular Distributionisis given
given by:
- Mean|f(x) dx
(9.20)
ECIAL CONTINUOUS PROBABILITY DISTRIBUTIONS
9.31
Example 9.21. f X is uniformly distributed with mean 1 and
variancefind
P(X<0).
0
P (X 0)= dr=/|«[,-i
Example 9-22. Subway trains ona certain line run every half hour between mid-night
and six in the morning. What is the probability that a man entering the station at a random
time during this period will have to wait at least twenty minutes?
Solution. Let the r.v. X denote the waiting time (in minutes) for the next train.
X is distributed
Under the assumption that a man arrives at the station at random,
uniformly on (0, 30), with [Link].,
fa) =00<x<30
0, otherwise
The probability that he has to wait at
least 20 minutes is given by:
30 30
P(X220) = dr= 1. dx= 30-20)=
20 20
P
(-2 log XSy)
=
oy )= P (YSy) = P e w 2
e/2
s)dr=1- 1. dx =
- ev2
=1-P(XSe?) =1
.()
8 )- )=ze2,0<y<o
from 0 to ]
as X ranges in (0, 1), Y= -2 log Xranges
-2 log X, has an exponential
[0, 1], then Y
=
u
illustrates that if X -
distribution with
Hemark. This example or Y = -2 log X, has chi-square
lstribution with parameter 0 [c.f. 5 9-8]
=
n2 degrees offreedom [c.f. Chapter 15].
distribution :J() 2-a <x<a, =