0% found this document useful (0 votes)
9 views44 pages

Discrete Distributions

This document discusses discrete distributions, focusing on the Binomial distribution, which models the probability of success or failure in a series of independent trials. It covers the Bernoulli distribution, conditions for Binomial distribution, its characteristics, and provides examples of calculating probabilities for various scenarios. Additionally, it outlines the fitting of a Binomial distribution to observed data.

Uploaded by

Jemal
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views44 pages

Discrete Distributions

This document discusses discrete distributions, focusing on the Binomial distribution, which models the probability of success or failure in a series of independent trials. It covers the Bernoulli distribution, conditions for Binomial distribution, its characteristics, and provides examples of calculating probabilities for various scenarios. Additionally, it outlines the fitting of a Binomial distribution to observed data.

Uploaded by

Jemal
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Dr.P.K.

SIVAKUMARAN
ASSISTANT PROFESSOR OF STATISTICS
PROBABILITY AND STATISTICS(20MCA25C)

DISCRETE DISTRIBUTIONS

BINOMIAL DISTRIBUTION
Introduction:

In this chapter we will discuss the theoretical discrete distributions in which variables
are distributed according to some definite probability law, which can be expressed
mathematically. The Binomial distribution is a discrete distribution expressing the probability
of a set of dichotomous alternative i.e., success or failure. This distribution has been used to
describe a wide variety of process in business and social sciences as well as other areas.
Bernoulli Distribution:

A random variable X which takes two values 0 and 1 with probabilities q and p i.e.,
P (x = 1) = p and P (x = 0) = q, q = 1 – p, is called a Bernoulli variate and is said to be
a Bernoulli Distribution, where p and q takes the probabilities for success and failure
respectively. It is discovered by Swiss Mathematician James Bernoulli (1654-1705).
Examples of Bernoulli’ s Trails are:

1) Toss of a coin (head or tail)

2) Throw of a die (even or odd number)


3) Performance of a student in an examination (pass or fail)

Binomial Distribution:

A random variable X is said to follow binomial distribution, if its probability mass


function is given by
nC x p x q n−x ;
P (X = x) = P(x) = x = 0,1, 2, ...... , n

0 ; otherwise
Here, the two independent constants n and p are known as the ‘ parameters’ of the
distribution. The distribution is completely determined if n and p are known. x refers the number
of successes. If we consider N sets of n independent trials, then the number of
times we get x success is N(nCx px qn-x). It follows that the terms in the expansion of
N (q + p)n gives the frequencies of the occurrences of 0, 1, 2,..., x, ..., n success in the N sets
of independent trials.

1.1.1 Condition for Binomial Distribution:

We get the Binomial distribution under the following experimental conditions.


1) The number of trials ‘ n’ is finite.
2) The trials are independent of each other.
3) The probability of success ‘ p’ is constant for each trial.
4) Each trial must result in a success or a failure.

The problems relating to tossing of coins or throwing of dice or drawing cards from a
pack of cards with replacement lead to binomial probability distribution.
1.1.2 Characteristics of Binomial Distribution:

1. Binomial distribution is a discrete distribution in which the random variable X (the


number of success) assumes the values 0,1, 2, ….n, where n is finite.

2. Mean = np, variance = npq and

Standard deviation σ = npq


q−p
Coefficient of skewness = ,
npq

Coefficient of kurtosis = , clearly each of the probabilities is non-negative and


sum of all probabilities is 1 ( p < 1 , q < 1 and p + q =1, q = 1- p ).

3. The mode of the binomial distribution is that value of the variable which occurs with the
largest probability. It may have either one or two modes.

4. If two independent random variables X and Y follow binomial distribution with parameter
(n1, p) and (n2, p) respectively, then their sum (X + Y) also follows Binomial distribution
with parameter (n1 + n2, p).
5. If n independent trials are repeated N times, N sets of n trials are obtained and the expected
frequency of x success is N(nCx px qn-x). The expected frequencies of 0, 1, 2… n success
are the successive terms of the binomial distribution of N(p + q)n.
Example 1:
Comment on the following: “ The mean of a binomial distribution is 5 and its variance
is 9”
Solution:
The parameters of the binomial distribution are n and p
We have mean ⇒ np = 5
Variance ⇒ npq = 9
npq 9
∴ q= =
np 5
9
q = >1
5
Which is not admissible since q cannot exceed unity. Hence the given statement is
wrong.
Example :

Eight coins are tossed simultaneously. Find the probability of getting atleast six heads.

Solution:

Here number of trials, n = 8, p denotes the probability of getting a head.


1 1
∴ p = and q =
2 2

If the random variable X denotes the number of heads, then the probability of a
success in n trials is given by

P(X = x) = nC x p x q n−x , x = 0, 1, 2, .... , n

 1  x  1  8−x  18
= 8C x     = 8C x  
 2  2  2

1
= 8Cx
28

Probability of getting atleast six heads is given by


P(x ≥ 6) = P(x = 6) + p(x = 7) + P(x = 8)
1 1 1
= 8C + 8C + 8C
8 6 8 7 8 8
2 2 2

1
= [8C + 8C 7 + 8C ]
6 8
28
1
= [28 + 8 + 1] = 37
8 256
2
x = 0, 1, 2, .... , n
Example 3:

Ten coins are tossed


simultaneously. Find the
probability of getting (i)
atleast sevenheads (ii) exactly
seven heads (iii) atmost seven
heads
Solution:

p = Probability of
getting a head

1
=
2
q = Probability of not
1
getting a head =
2
The probability of
getting x heads
throwing 10 coins
simultaneously is given
by

P(X = x) = nC x p x q n−x ,

 1  x  1 10−x 1
= 10C x     = 10 10Cx
 2  2 2

i) Probability of getting atleast seven heads


P(x ≥ 7) = P(x = 7) + P(x = 8) + P(x = 9) + P(x = 10)
1
= [10C + 10C + 10C + 10C ]
10 7 8 9 10
2
1 176
= [120 + 45 + 10 + 1] =
1024 1024

ii) Probability of getting exactly 7 heads


1 1
P(x = 7) = 10C = (120)

7
210 =
120
1024
210

ii) Probability of getting atmost 7 heads


P(x ≤ 7) = 1 − P(x > 7)
= 1 −{P(x = 8) + P(x = 9) + P(x = 10)}
1
= 1− {10C8 + 10C9 + 10C10}

210
1
= 1− [45 + 10 + 1]
210
= 1 − 56

1024
968
=
1024
Example :

20 wrist watches in a box of 100 are defective. If 10 watches are selected at random,
find the probability that (i) 10 are defective (ii) 10 are good (iii) at least one watch is defective
(iv) at most 3 are defective.

Solution:

20 out of 100 wrist watches are defective


20 1
Probability of defective wrist watch , p = =
100 5
4
∴q = 1 − p =
5
Since 10 watches are selected at random, n =10

P(X = x) = nC x p x q n−x , x = 0, 1, 2, .... , n

 1 x  4  10−x
= 10C x    
 5  5 

i) Probability of selecting 10 defective watches


 1  10  4  0
P (x = 10) = 10C10    
 5  5 

1 1
= 1. 10
.1 =
5 510

ii) Probability of selecting 10 good watches (i.e. no defective)


 1 0  4 10
P (x = 0) = 10C0    
 5  5 
 4  10  4  10
= 1.1  = 

 5  5

iii) Probability of selecting at least one defective watch


P(x ≥ 1) = 1 − P(x < 1)
= 1 − P(x = 0)
 1 0  4  10
= 1 − 10C0    
 5  5 
 4  10
= 1−  
 5
iv) Probability of selecting at most 3 defective watches

Example :
With the usual notation find p for binomial random variable X if n = 6 and
9P(X = 4) = P(X = 2)
Solution:
The probability mass function of binomial random variable X is given by
P(X = x) = nCx px qn-x , x = 0 , 1, 2, ...,n

Here n = 6 ∴ P (X = x) = 6Cx px q6-x


P (x = 4) = 6C4 p4 q2
P (x = 2) = 6C2 p2 q4
Given that,
9. P (x = 4) = P (x = 2)
9. 6C4 p4 q2 = 6C2 p2 q4
⇒ 9 × 15p2 = 15q2
9p2 = q2
Taking positive square root on both sides we get,
3p = q
= 1− p
4p = 1
1
∴p = = 0.25
4

Fitting of Binomial Distribution:


When a binomial distribution is to be fitted to an observed data, the following
procedure is adopted.
∑ fx
1. Find Mean = x = = np
∑f
x
⇒ p = where n is number of trials.
n

2. Determine the value, q = 1– p.


3. The probability function is P(x) = nCx px qn-x put x = 0, we set P(0) = qn and
f(0) = N × P(0)
4. The other expected frequencies are obtained by using the recurrence formula is given by
n−x p
f(x + 1) = f(x)
x +1 q
Example :
A set of three similar coins are tossed 100 times with the following results
Number of heads : 0 1 2 3
Frequency : 36 40 22 2
Fit a Binomial distribution.
Solution :

X f fx
0 36 0
1 40 40
2 22 44
3 2 6
∑f = 100 ∑fx = 90

The probability function is P(x) = nCx px qn-x


Here n = 3, p = 0.3 q = 0.7
∴ P(x) = 3Cx (0.3)x (0.7)3-x
P(0) = 3C0 (0.3)0 (0.7)3
= (0.7)3 = 0.343
∴ f(0) = N × P(0) = 0.343 × 100 = 34.3
n −x p
The other frequencies are obtained by using the recurrence formula f(x + 1) = f(x).
x +1 q
By putting x = 0, 1, 2 the expected frequencies are calculated as follows.

3 − 0  p
f(1) = × 34.3
0 + 1  q
= 3 × (0.43) × 34.3 = 44.247
3 −1  p
f(2) = f(1)
1 + 1  q

2
= (0.43) × 44.247
2
= 19.03
3 − 2  p
f(3) = f(2)

2 + 1  q
1
= (0.43) × 19.03
3
= 2.727

The observed and theoretical (expected) frequencies are tabulated below:


Total

Observed
36 40 22 2 100
frquencies
Expected
34 44 19 3 100
frequencies

Example 7:
4 coins are tossed and number of heads noted. The experiment is repeated 200 times
and the following distribution is obtained .

x: Number of heads 0 1 2 3 4

f: frequencies 62 85 40 11 2

Solution :

X 0 1 2 3 4 Total
f 62 85 40 11 2 200
fx 0 85 80 33 8 206

Here n = 4 , p = 0.2575 ; q = 0.7425


The probability function of binomial distribution is P(x) = nCxpx qn-x
The binomial probability function is
P(x) = 4Cx (0.2575)x (0.7425)4-x
P(0) = (0.7425)4
= 0.3039
∴ f(0) = NP(0)
= 200 × 0.3039
= 60.78
p
The other frequencies are calculated using the recurrence formula f(x + 1) = n − x f(x).
x +1 q
By putting x = 0,1, 2, 3 then the expected frequencies are calculated as follows:
Put x = 0, we get
4 −0
f(1) = (0.3468)(60.78)
0 +1
= 84.3140
4 −1
f(2) = (0.3468)(84.3140)
1+1
= 43.8601
4 −2
f(3) = (0.3468)(43.8601)
2 +1
= 10.1394
4 −3
f(4) = (0.3468)(10.1394)
3+1
= 0.8791
The theoretical and expected frequencies are tabulated below:

Total

Observed
62 85 40 11 2 200
frquencies
Expected
61 84 44 10 1 200
frequencies

POISSON DISTRIBUTION:
Introduction:

Poisson distribution was discovered by a French Mathematician-cum-Physicist Simeon


Denis Poisson in 1837. Poisson distribution is also a discrete distribution. He derived it as a
limiting case of Binomial distribution. For n-trials the binomial distribution is (q + p)n ; the
probability of x successes is given by P(X = x) = nCx px qn-x . If the number of trials n is
very large and the probability of success ‘ p’ is very small so that the product np = m is non –
negative and finite.
The probability of x success is given by
 e −m m x
 for x = 0,1, 2 ....
P(X = x) =  x!
 0 ; otherwise

Here m is known as parameter of the distribution so that m >0

Since number of trials is very large and the probability of success p is very small, it is
clear that the event is a rare event. Therefore Poisson distribution relates to rare events.
Note:
1 1 1
1) e is given by e = 1 + + + + .......= 2.71828
1! 2! 3!
e−m m 0
2) P (X = 0) = , 0! = 1 and 1! = 1
0!
e−m m1
3) P (X = 1) =
1!
Some examples of Poisson variates are :

1. The number of blinds born in a town in a particular year.

2. Number of mistakes committed in a typed page.

3. The number of students scoring very high marks in all subjects

4. The number of plane accidents in a particular week.

5. The number of defective screws in a box of 100, manufactured by a reputed


company.
6. Number of suicides reported in a particular day.

Conditions:

Poisson distribution is the limiting case of binomial distribution under the following
conditions:
1. The number of trials n is indefinitely large i.e., n → ∞

2. The probability of success ‘ p’ for each trial is very small; i.e., p → 0

3. np = m (say) is finite , m > 0

Characteristics of Poisson Distribution:

The following are the characteristics of Poisson distribution


1. Discrete distribution: Poisson distribution is a discrete distribution like Binomial
distribution, where the random variable assume as a countably infinite number of
values 0,1,2 ….
2. The values of p and q: It is applied in situation where the probability of success p
of an event is very small and that of failure q is very high almost equal to 1 and n is
very large.

3. The parameter: The parameter of the Poisson distribution is m. If the value of m is


known, all the probabilities of the Poisson distribution can be ascertained.
4. Values of Constant: Mean = m = variance; so that standard deviation = m
Poisson distribution may have either one or two modes.

5. Additive Property: If X and Y are two independent Poisson distribution with


parameter m1 and m2 respectively. Then (X+Y) also follows the Poisson distribution
with parameter (m1 + m2).
6. As an approximation to binomial distribution: Poisson distribution can be taken as a
limiting form of Binomial distribution when n is large and p is very small in such a
way that product np = m remains constant.
7. Assumptions: The Poisson distribution is based on the following assumptions.

i) The occurrence or non- occurrence of an event does not influence the occurrence
or non-occurrence of any other event.

ii) The probability of success for a short time interval or a small region of space is
proportional to the length of the time interval or space as the case may be.

iii) The probability of the happening of more than one event is a very small interval
is negligible.

Example :

Suppose on an average 1 house in 1000 in a certain district has a fire during a year. If
there are 2000 houses in that district, what is the probability that exactly 5 houses will have a
fire during the year? [given that e-2 = 0.13534]
1
Mean, x = np, n = 2000 and p =
1000
1
= 2000 ×
1000
m=2
e−2 25
The Poisson distribution is
5!
−m
ex
m
P(X = x) =
x!

∴P(X = 5) =

(0.13534) × 32
=
120
= 0.036

(Note: The values of e-m are given in Appendix )

Example :

In a Poisson distribution 3P(X = 2) = P(X = 4) Find the parameter ‘m’ .

Solution:

e−m m x
Poisson distribution is given by P(X = x) =
x!
Given that 3P(x = 2) = P(x = 4)

e−m m 2 e−m m 4
3. =
2! 4!
3 × 4!
m2 =
2!
∴m = ± 6

Since mean is always positive ∴ m = 6

Example :

If 2% of electric bulbs manufactured by a certain company are defective. Find the


probability that in a sample of 200 bulbs i) less than 2 bulbs ii) more than 3 bulbs are
defective.[e-4 = 0.0183]
Solution:
2
The probability of a defective bulb = p = = 0.02
100

Given that n = 200 since p is small and n is large


We use the Poisson distribution
mean, m = np = 200 X 0.02 = 4
e−m m x
Now, Poisson Probability function, P(X = x) = x!

i) Probability of less than 2 bulbs are defective


ii) Probability of getting more than 3 defective bulbs
P(x > 3) = 1 − P(x ≤ 3)
= 1 −{P(x = 0) + P(x = 1) + P(x = 2) + P(x = 3)}
2
− 44 43
= 1− e {1 + 4 + + }
2! 3!
= 1 − {0.0183 × (1 + 4 + 8 + 10.67)}
= 0.567

Fitting of Poisson Distribution:

The process of fitting of Poisson distribution for the probabilities of x = 0, 1,2,...


success are given below :
∑ fx
i) First we have to calculate the mean = x = =m
∑f
ii) The value of e-m is obtained from the table (see Appendix )

e−m . m x
iii) By using the formula P(X = x) =
x!
Substituting x = 0, P(0) = e-m
Then f(0) = N×P(0)
The other expected frequencies will be obtained by using the recurrence formula
m
f(x + 1) = f(x); x = 0,1, 2,.......
x +1

Example :

The following mistakes per page were observed in a book.

Number of mistakes (per page) 0 1 2 3 4

Number of pages 211 90 19 5 0


Fit a Poisson distribution to the above data.
Solution:

xi fi fixi
0 211 0
1 90 90
2 19 38
3 5 15
4 0 0
N = 325 ∑fx = 143
∑ fx
Mean = x =
N
143
= = 0.44 = m
325
Then e−m ⇒ e−0.44 = 0.6440

Probability mass function of Poisson distribution is


x
−m m
P(x) = e
x!
440
Put x = 0, P(0) = e −0.44
0!
= e−0.44
= 0.6440
∴f(0) = N P(0)
= 325 × 0.6440
= 209.43

The other expected frequencies will be obtained by using the recurrence formula
m
f(x + 1) = f(x) . By putting x = 0,1,2,3 we get the expected frequencies and are
x +1

calculated as follows.
f(1) = 0.44 × 209.43 = 92.15
0.44
f(2) = × 92.15 = 20.27
2
0.44
f(3) = × 20.27 = 2.97
3
0.44
f(4) = × 2.97 = 0.33
4
Total
Observed
211 90 19 5 0 325
frquencies
Expected
210 92 20 3 0 325
frequencies

Example :

Find mean and variance to the following data which gives the frequency of the number
of deaths due to horse kick in 10 corps per army per annum over twenty years.

X 0 1 2 3 4 Total
F 109 65 22 3 1 200

Solution :

Let us calculate the mean and variance of the given data

xi fi fixi fixi2
0 109 0 0
1 65 65 65
2 22 44 88
3 3 9 27
4 1 4 16
Total N = 200 ∑fx = 122 ∑fx2 = 196
∑ fixi
Mean = x =
N
122
=
200
= 0.61

∑ fixi2
Variance = σ = 2 − (x 2
N
196
= − (0.61)2
200
= 0.61

Hence, mean = variance = 0.61


Example :
100 car radios are inspected as they come off the production line and number of
defects per set is recorded below
No. of defects 0 1 2 3 4

No. of sets 79 18 2 1 0

Fit a Poisson distribution and find expected frequencies


Solution :

x f fx
0 79 0
1 18 18
2 2 4
3 1 3
4 0 0
N = 100 ∑fx = 25
∑ fx
Mean = x =
N
25
=
100
∴m = 0.25

Then e-m = e-0.25 = 0.7788 = 0.779


Poisson probability function is given by
e−m m x
P(x) =
x!
e−0.25(0.25)0
P(0) = = (0.779)
0!

∴ f(0) = N.P(0) = 100 × (0.779) = 77.9


Other frequencies are calculated using the recurrence formula
m
f(x + 1) = f(x).
x +1

By putting x = 0,1,2,3, we get the expected frequencies and are calculated as follows.
m
f(1) = f(0 + 1) = f(0)
0 +1
0.25
f(1) = (77.9)
1
= 19.46
0.25
f(2) = (19.46)
2
= 2.43
0.25
f(3) = (2.43)
3
= 0.203

0.25
f(4) = (0.203)
4
= 0.013

Observed frequencies 79 18 2 1 0 100


Expected frequencies 78 20 2 0 0 100

Example:

Assuming that one in 80 births in a case of twins, calculate the probability of 2 or


more sets of twins on a day when 30 births occurs. Compare the results obtained by using (i)
the binomial and (ii) Poisson distribution.

Solution:

(i) Using Binomial distribution


1
Probability of twins birth = p = = 0.0125
80

∴q = 1 − p = 1 − 0.0125
= 0.9875
n = 30

Binomial distribution is given by

P (x) = nCx px qn-x

P (x ≥ 2) = 1 – P(x < 2)

= 1 – {P(x = 0) + P(x =1)}

= 1 – {30C0(0.0125)0 (0.9875)30 + 30C1(0.0125)1(0.9875)29}

= 1– {1.1(0.9875)30 + 3 (0.125) (0.9875)29}

= 1 – { 0.6839 + 0.2597}

= 1 – 0.9436

P( x ≥ 2) = 0.0564
(ii) By using Poisson distribution:
The probability mass function of Poisson distribution is given by

e−m m x
P(x) =
x!
Mean = m = np
= 30 (0.0125) = 0.375
P(x ≥ 2) = 1 − P(x < 2)

= 1 −{P(x = 0) + P(x = 1)}


 e −0.375 (0.375)0 e −0.375 (0.375)1 
=
1−  + 
 0! 1! 

= 1 − e−0.375(1 + 0.375)
= 1 − (0.6873) (1.375) = 1 − 0.945 = 0.055

CONTINUOUS DISTRIBUTIONS:

NORMAL DISTRIBUTION:
Introduction:

In the preceding sections we have discussed the discrete distributions, the Binomial and
Poisson distribution.
In this section we deal with the most important continuous distribution, known as
normal probability distribution or simply normal distribution. It is important for the reason that
it plays a vital role in the theoretical and applied statistics.

The normal distribution was first discovered by DeMoivre (English Mathematician)


in 1733 as limiting case of binomial distribution. Later it was applied in natural and social
science by Laplace (French Mathematician) in 1777. The normal distribution is also known as
Gaussian distribution in honour of Karl Friedrich Gauss(1809).
Definition:

A continuous random variable X is said to follow normal distribution with mean μ and
standard deviation σ, if its probability density function

Note:
The mean μ and standard deviation σ are called the parameters of Normal distribution.
The normal distribution is expressed by X ~ N (μ, σ2)
Condition of Normal Distribution:

i) Normal distribution is a limiting form of the binomial distribution under the following
conditions.

a) n, the number of trials is indefinitely large ie., n → ∞ and

b) Neither p nor q is very small.

ii) Normal distribution can also be obtained as a limiting form of Poisson distribution with
parameter m → ∞

iii) Constants of normal distribution are mean = μ, variation = σ2, Standard deviation = σ.

Normal probability curve:

The curve representing the normal distribution is called the normal probability curve.
The curve is symmetrical about the mean (μ), bell-shaped and the two tails on the right and
left sides of the mean extends to the infinity. The shape of the curve is shown in the following
figure.

-∞ x=µ

Properties of normal distribution:


1. The normal curve is bell shaped and is symmetric at x = μ.

2. Mean, median, and mode of the distribution are coincide

i.e., Mean = Median = Mode = μ

3. It has only one mode at x = μ (i.e., unimodal)

4. Since the curve is symmetrical, Skewness = β1 = 0 and Kurtosis = β2 = 3.

5. The points of inflection are at x = μ ± σ


1
6. The maximum ordinate occurs at x = μ and its value is =
σ 2π
7. The x axis is an asymptote to the curve (i.e. the curve continues to approach but
never touches the x axis)
8. The first and third quartiles are equidistant from median.
9. The mean deviation about mean is 0.8 σ

10. Quartile deviation = 0.6745 σ

11. If X and Y are independent normal variates with mean μ1 and μ2, and variance σ1 2
and σ 22 respectively then their sum (X + Y) is also a normal variate with mean
(μ1 + μ2) and variance (σ 12 + σ 22)
12. Area Property P (μ - σ < × < μ + σ) = 0.6826

P (μ - 2σ < × < μ + 2σ) = 0.9544


P (μ - 3σ < × < μ + 3σ) = 0.9973
Standard Normal distribution:
Let X be random variable which follows normal distribution with mean µ and variance
X −µ
σ2 .The standard normal variate is defined as Z = which follows standard normal
σ

distribution with mean 0 and standard deviation 1 i.e., Z ~ N(0,1). The standard normal
−1 2
1 z
distribution is given by φ (z) = e2 ; – ∞ < z < ∞ The advantage of the above function

is that it doesn’ t contain any parameter. This enable us to compute the area under the normal
probability curve.
Area properties of Normal curve:

The total area under the normal probability curve is 1. The curve is also called standard
probability curve. The area under the curve between the ordinates at x = a and x = b where a <
b, represents the probabilities that x lies between x = a and x = b i.e., P(a ≤ x ≤ b)

X −µ
To find any probability value of x, we first standardize it by using Z = , and use
σ
the area probability normal table. (given in the Appendix).

For Example: The probability that the normal random variable x to lie in the interval
(μ – σ , μ + σ) is given by
P (μ – σ < x < μ + σ) = P (–1 ≤ z ≤ 1 )

= 2P(0 < z < 1)

= 2 (0.3413) (from the area table)

= 0.6826

P(μ – 2σ < x < μ + 2σ) = P(– 2 < z < 2 )

= 2P(0 < z < 2)


= 2 (0.4772) = 0.9544

P(μ – 3σ < x < μ + 3σ) = P(– 3 < z < 3 )

= 2P(0 < z < 3)

= 2 (0.49865) = 0.9973
The probability that a normal variate x lies outside the range µ ± 3σ is given by
P(|x – μ| > 3σ) = P(|z| >3)
= 1 – P(-3 ≤ z ≤ 3)
= 1 – 0.9773 = 0.0027
Thus we expect that the values in a normal probability curve will lie between the range
μ ± 3σ, though theoretically it range from – ∞ to ∞.
Example:
Find the probability that the standard normal variate lies between 0 and 1.56
Solution:

P(0 < z < 1.56) = Area between z = 0 and z = 1.56


= 0.4406 (from table)
Example:
Find the area of the standard normal variate from –1.96 to 0.
Solution:

Area between z = 0 & z =1.96 is same as the area z = – 1.96 to z = 0


P (–1.96 < z < 0) = P (0 < z < 1.96) (by symmetry)
= 0.4750 (from the table)
Example:
Find the area to the right of z = 0.25
Solution:

P (z > 0.25) = P (0 < z < ∞) – P (0 < z < 0.25)


= 0.5000 – 0.0987 (from the table) = 0.4013
Example:
Find the area to the left of z = 1.5
Solution:

P (z < 1.5) = P(– ∞ < z < 0 ) + P( 0 < z < 1.5 )


= 0.5 + 0.4332 (from the table)
= 0.9332
Example:
Find the area of the standard normal variate between –1.96 and 1.5
Solution:

P (–1.96 < z < 1.5) = P (–1.96 < z < 0) + P(0 < z < 1.5)
= P (0 < z < 1.96) + P(0 < z < 1.5)
= 0.4750 + 0.4332 (from the table)
= 0.9082
Example :
Given a normal distribution with μ = 50 and σ = 8, find the probability that x assumes
a value between 42 and 64.
Solution:

Given that μ = 50 and σ = 8


X −µ
The standard normal variate z =
σ
If X = 42, Z = 42 − 50 = −8 = −1
1
8 8
64 − 50 14
If X = 64, Z = = = 1.75

2
8 8

∴ P(42 < x < 64) = P(–1 < z < 1.75)

= P(–1 < z < 0) + P (0 < z < 1.95)

= P (0 < z < 1) + P (0 < z < 1.75) (by symmetry)

= 0.3413 +0 .4599 (from the table)

= 0 .8012

Example:

Students of a class were given an aptitude test. Their marks were found to be normally
distributed with mean 60 and standard deviation 5. What percentage of students scored.

i) More than 60 marks (ii) Less than 56 marks (iii) Between 45 and 65 marks

Solution:

Given that mean = μ = 60 and standard deviation = σ = 5


X −µ
i) The standard normal varaiate Z =
σ
x−µ 60 − 60
If X = 60, Z = = =0
σ 5

∴ P(x > 60) = P (z > 0)

= P(0 < z < ∞ ) = 0.5000

Hence the percentage of students scored more than 60 marks is 0.5000(100) = 50 %

P(x < 56) = P(z < -0.8)

= P(– ∞ < z < 0) – P (– 0.8 < z < 0) (by symmetry)

= P(0 < z < ∞) – P (0 < z < 0.8)

= 0.5 – 0.2881 (from the table)

= 0.2119

Hence the percentage of students score less than 56 marks is 0.2119(100) = 21.19 %
45 − 60 −15
iii) If X = 45, Z = = = −3
5 5
65 − 60 5
X = 65 then z = = =1
5 5

P (45 < x < 65) = P (– 3 < z < 1)


= P (– 3 < z < 0 ) + P (0 < z < 1)
= P (0 < z < 3) + P(0 < z < 1) (by symmetry)
= 0.4986 + 0.3413 (from the table)
= 0.8399

Hence the percentage of students scored between 45 and 65 marks is


0.8399 (100) = 83.99 %
Example:
X is normal distribution with mean 2 and standard deviation 3. Find the value of the
variable x such that the probability of the interval from mean to that value is 0.4115
Solution:
Given μ = 2, σ = 3
Suppose z1 is required standard value,
Thus P (0 < z < z1) = 0.4115
From the table the value corresponding to the area 0.4115 is 1.35 that is z1 = 1.35
x−µ
Here z1 =
σ
x−2
1.35 =
3
x = 3(1.35) + 2
= 4.05 + 2 = 6.05

Example:

In a normal distribution 31 % of the items are under 45 and 8 % are over 64. Find the
mean and variance of the distribution.
Solution:

Let x denotes the items are given and it follows the normal distribution with mean μ
and standard deviation σ
The points x = 45 and x = 64 are located as shown in the figure.

i) Since 31 % of items are under x = 45, position of x into the left of the ordinate x = μ

ii) Since 8 % of items are above x = 64 , position of this x is to the right of ordinate x = μ

0.19

0.42
0.31 0.08

-∞ z = -z1 z = 0 z = z2 +∞
x = 45 x = µ x = 64

x−µ 45 − µ
When x = 45, z = = = −z (say)
1
σ σ

Since x is left of x = µ , z1 is taken as negative


64 − µ
When x = 64, z = = z (say)
2
σ

From the diagram P(x < 45) = 0.31


P(z < – z1) = 0.31
P (– z1 < z < 0) = P (– ∞ < z < 0) – p (– ∞ < z < z1)
= 0.5 – 0.31 = 0.19
P (0 < z < z1) = 0.19 (by symmetry)
z1 = 0.50 (from the table)
Also from the diagram p (x > 64) = 0.08
P(0 < z < z2) = P(0 < z < ∞) – P(z2 < z < ∞)
= 0.5 - 0.08 = 0.42
z2 = 1.40 (from the table)
Substituting the values of z1 and z2 we get
64 − µ
45 − µ = 1.40
= −0.50 and σ
σ

Solving μ – 0.50 σ = 45 ------(1)


μ + 1.40 σ = 64 ----- (2)
(2) – (1) ⇒ 1.90 σ = 19 ⇒ σ = 10
Substituting σ = 10 in (1) μ = 45 + 0.50 (10)
= 45 + 5.0 = 50.0
Hence mean = 50 and variance = σ2 = 100

Exercise
I. Choose the best answer:

1. Binomial distribution applies to


(a) rare events (b) repeated alternatives (c) three events (d) impossible events
2. For Bernoulli distribution with probability p of a success and q of a failure, the relation
between mean and variance that hold is
(a) mean < variance (b) mean > variance
(c) mean = variance (d) mean ≤ variance
3. The variance of a binomial distribution is

(a) npq (b) np (c) npq (d) 0


 2 x  115−x 2

4. The mean of the binomial distribution 15Cx     in which p = is


 3  3 3

(a) 5 (b) 10 (c) 15 (d) 3


5. The mean and variance of a binomial distribution are 8 and 4 respectively. Then P(x = 1)
is equal to
1 1 1 1
(a) (b) (c) (d)
212 24 26 28
6. The mean of a binomial distribution is 10 and the number of trials is 30 then probability
of failure of an event is
(a) 0.25 (b) 0.333 (c) 0.666 (d) 0.9
7. The variance of a binomial distribution is 2. Its standard deviation is
(a) 2 (b) 4 (c) 1/2 (d)
8. In a binomial distribution if the numbers of independent trials is n, then the probability of
n success is
(a) nCxpxqn-x (b) 1 (c) pn (d) qn
9. The binomial distribution is completely determined if it is known
(a) p only (b) q only (c) p and q (d) p and n
10. The trials in a binomial distribution are
(a) mutually exclusive (b) non-mutually exclusive
(c) independent (d) non-independent
11. If two independent variables x and y follow binomial distribution with parameters, (n1, p)
and (n2, p) respectively, their sum (x + y) follows binomial distribution with parameters
(a) (n1 + n2, 2p) (b) (n, p) (c) (n1 + n2, p) (d) (n1 + n2, p + q)
12. For a Poisson distribution
(a) mean > variance (b) mean = variance
(c) mean < variance (d) mean ≤ variance
13. Poisson distribution correspondents to
(a) rare events (b) certain event
(c) impossible event (d) almost sure event
14. If the Poisson variables X and Y have parameters m1 and m2 then X + Y is a Poisson
variable with parameter.
(a) m1m2 (b) m1 + m2 (c) m1– m2 (d) m1/m2
15. Poisson distribution is a
(a) Continuous distribution (b) discrete distribution
(c) either continuous or discrete (d) neither continue nor discrete
16. Poisson distribution is a limiting case of Binomial distribution when

(a) n → ∞ ; p → 0 and np = m (b) n → 0 ; p → ∞ and p=1/m


(c) n → ∞ ; p → ∞ and np = m (d) n → ∞ ; p → 0 ,np = m
17. If the expectation of a Poisson variable (mean) is 1 then P(x < 1) is
(a) e-1 (b) 1-2e-1 (c) 1- 5/2e-1 (d) none of these
18. The normal distribution is a limiting form of Binomial distribution if
(a) n → ∞ p → 0 (b) n → 0 , p → q
(c) n → ∞ , p → n (d) n → ∞ and neither p nor q is small.
19. In normal distribution, skewness is
(a) one (b) zero (c) greater than one (d) less than one
20. Mode of the normal distribution is
1
(a) σ (b) (c) μ (d) 0

21. The standard normal distribution is represented by


(a) N (0,0) (b) N(1,1) (c) N(1,0) (d) N(0,1)
22. Total area under the normal probability curve is
(a) less than one (b) unity (c) greater than one (d) zero
23. The probability that a random variable x lies in the interval (μ – 2σ , μ + 2σ) is
(a) 0.9544 (b) 0.6826 (c) 0.9973 (d) 0.0027
24. The area P(- ∞ < z < 0) is equal to
(a) 1 (b) 0.1 (c) 0.5 (d) 0
25. The standard normal distribution has
(a) μ =1, σ = 0 (b) μ = 0, σ = 1 (c) μ = 0, σ = 0 (d) μ =1, σ = 1
1 (x−100)2

26. The random variable x follows the normal distribution f(x) = C.e 2 25 then the
value of C is

1
(a) 5 2π (b) (c) (d) 5
5 2π
27. Normal distribution has
(a) no mode (b) only one mode (c) two modes (d) many mode
28. For the normal distribution
(a) mean = median = mode (b) mean < median < mode
(c) mean > median > mode (d) mean > median < mode
1 (x−30)2
1 −
29. Probability density function of normal variable P(X = x) = e 2 25 ;-α<x<α
then mean and variance are 5 2π

(a) mean = 30 variance = 5 (b) mean = 0, variance = 25


(c) mean = 30 variance = 25 (d) mean = 30, variance = 10
30. The mean of a Normal distribution is 60, its mode will be
(a) 60 (b) 40 (c) 50 (d) 30
31. If x is a normal variable with μ =100 and σ2 = 25 then P(90 < x < 120) is same as
(a) P (– 1 < z < 1) (b) P(-2 < z < 4)
(c) P(4 < z < 4.1) (d) P(-2 < z < 3)
33. If x is N(6, 1.2) and P(0 ≤ z ≤1) = 0.3413 then P(4.8 ≤ x ≤ 7.2) is
(a) 0.3413 (b) 0.6587 (c) 0.6826 (d) 0.3174

II. fill in the blanks:


34. The probability of getting a head in successive throws of a coin is .
35. If the mean of a binomial distribution is 4 and the variance is 2 then the parameter is
.

 2 1
9
36. + refers the binomial distribution and its standard deviation is .
 3 3 
37. In a binomial distribution if number of trials to be large and probability of success
bezero, then the distribution becomes .
38. The mean and variance are in Poisson distribution.
39. The mean of Poisson distribution is 0.49 and its standard deviation is .
40. In Poisson distribution, the recurrence formula to calculate expected frequencies is
.
∑ fx2
41. The formula −x( )2 is used to find .
N
42. In a normal distribution, mean takes the values from to .
43. When μ = 0 and σ = 1 the normal distribution is called .
44. P(– ∞ < z < 0) covers the area .
45. If μ = 1200 and σ = 400 then the standard normal variate z for x = 800 is .
46. At x = μ ± σ are called as in a normal distribution.
47. P(-3 < z < 3) takes the value
48. X axis be the to the normal curve.

III. Answer the following

49. Comment the following :

“ For a binomial distribution mean = 7 and variance = 16

50. Find the binomial distribution whose mean is 3 and variance 2.

51. In a binomial distribution the mean and standard deviation are 12 and 2 respectively.
Findn and p.
52. A pair of dice is thrown 4 times. If getting a doublet is considered a success, find the
probability of 2 success.

53. Explain a binomial distribution.

54. State the characteristics of a binomial distribution.

55. State the conditions for a binomial variate.

56. Explain the fitting of a binomial distribution.

57. For the binomial distribution (0.68 + 0.32)10 find the probability of 2 success.

58. Find the mean of binomial distribution of the probability of occurrence of an event is
1/5and the total number of trials is 100.
59. If on an average 8 ships out of 10 arrive safely at a port, find the mean and standard
deviation of the number of ships arriving safely out of total of 1600 ships.

60. The probability of the evening college student will be a graduate is 0.4. Determine
theprobability that out of 5 students (i) none (ii) one (iii) atleast one will be a graduate.
61. Four coins are tossed simultaneously. What is the probability of getting i) 2 heads and
2tails ii) atleast 2 heads iii) atleast one head.

62. 10% of the screws manufactured by an automatic machine are found to be


defective.20 screws are selected at random. Find the probability that i) exactly 2 are
defective
ii) atmost 3 are defective iii) atleast 2 are defective.

63. 5 dice are thrown together 96 times. The numbers of getting 4, 5 or 6 in the experiment
is given below. Calculate the expected frequencies and compare the standard deviation of
the expected frequencies and observed frequencies.
Getting 4 ,5 or 6 : 0 1 2 3 4 5

Frequency : 1 10 24 35 18 8
64. Fit a binomial distribution for the following data and find the expected frequencies.
X: 0 1 2 3 4
f 18 35 30 13 4

65. Eight coins are tossed together 256 times. Number of heads observed at each toss is
recorded and the results are given below. Find the expected frequencies. What are the
theoretical value of mean and standard deviation? Calculate also mean and standard
deviation of the observed frequencies.
Number of heads: 0 1 2 3 4 5 6 7 8
Frequencies : 2 6 39 52 67 56 32 10 1

66. Explain Poisson distribution.

67. Give any two examples of Poisson distribution.

68. State the characteristics of Poisson distribution.

69. Explain the fitting of a Poisson distribution

70. A variable x follows a Poisson distribution with mean 6 calculate i) P(x = 0) ii) P(x = 2)

71. The variance of a Poisson Distribution is 0.5. Find P(x = 3). [e-0.5 = 0.6065]
72. If a random variable X follows Poisson distribution such that P(x =1) = P(x = 2) find (a)
the mean of the distribution and P(x = 0). [e-2 = 0.1353]

73. If 3% of bulbs manufactured by a company are defective then find the probability in a
sample of 100 bulbs exactly five bulbs are defective.

74. It is known from the past experience that in a certain plant there are on the average 4
industrial accidents per month. Find the probability that in a given year there will be less
than 3 accidents. Assume Poisson distribution.[e-4 = 0.0183]
75. A manufacturer of television sets known that of an average 5% of this product is defective.
He sells television sets in consignment of 100 and guarantees that not more than 4 sets will
be defective. What is the probability that a television set will fail to meet the guaranteed
quality? [e-5 = 0.0067]

76. One fifth percent of the blades produced by a blade manufacturing factory turns out to
be a defective. The blades are supplied in pockets of 10. Use Poisson distribution to
calculate the approximate number of pockets containing i) no defective (ii) all defective
(iii) two defective blades respectively in a consignment of 1,00,000 pockets.

77. A factory employing a huge number of workers find that over a period of time, average
absentee rate is three workers per shift. Calculate the probability that in a given shift
i) exactly 2 workers (ii) more than 4 workers will be absent.
78. A manufacturer who produces medicine bottles finds that 0.1 % of the bottles are
defective. They are packed in boxes containing 500 bottles. A drag manufactures buy 100
boxes from the producer of bottles. Using Poisson distribution find how many boxes will
contain (i) no defective ii) exactly 2 (iii) atleast 2 defective.
79. The distribution of typing mistakes committed by a typist is given below:

Mistakes per page : 0 1 2 3 45


No. of pages : 142 156 69 57 5 1

Fit a Poisson distribution.

80. Fit a Poisson distribution to the following data:


x: 0 1 2 3 4 5 6 7 8 Total
f : 229 325 257 119 50 17 2 1 0 1000

81. The following tables given that number of days in a 50, days period during which
automatically accidents occurred in city. Fit a Poisson distribution to the data
No of accidents : 0 1 2 3 4
No of days : 21 18 7 3 1

82. Find the probability that standard normal variate lies between 0.78 and 2.75.

83. Find the area under the normal curve between z = 0 and z = 1.75.

84. Find the area under the normal curve between z = -1.5 and z = 2.6.

85. Find the area to the left side of z = 1.96

86. Find the area under the normal curve which lies to the right of z = 2.70.

87. A normal distribution has mean = 50 and standard deviation is 8. Find the probability that
x assumes a value between 34 and 62.
88. A normal distribution has mean = 20 and S.D = 10. Find area between x =15 and x = 40.

89. Given a normal curve with mean 30 and standard deviation 5. Find the area under the
curve between 26 and 40

90. The customer accounts of a certain departmental store have an average balance of Rs.1200
and a standard deviation of Rs.400. Assuming that the account balances are normally
distributed. (i) what percentage of the accounts is over Rs.1500? (ii) What percentage of
the accounts is between Rs.1000 and Rs.1500? iii) What percentage of the accounts is
below Rs.1500?
91. The weekly remuneration paid to 100 lecturers coaching for professional entrance
examinations are normally distributed with mean Rs.700 and standard deviation Rs.50.
Estimate the number of lecturers whose remuneration will be i) between Rs.700
andRs.720 ii) more than Rs.750 iii) less than Rs.630.
92. x is normally distributed with mean 12 and standard deviation 4. Find the probability of
the following i) x ≥ 20 ii) x ≤ 20 iii) 0 < x < 12

93. A sample of 100 dry cells tested to find the length of life produced the following results
μ =12 hrs, σ = 3 hrs. Assuming the data, to be normally distributed. What percentage of
battery cells are expressed to have a life. i) more than 15 hrs ii) between 10 and 14 hrs as
iii) less than 6 hrs?.

94. Find the mean and standard deviation of marks in an examination where 44 % of
thecandidates obtained marks below 55 and 6 % got above 80 marks.
95. In a normal distribution 7 % of the items are under 35 and 89 % of the items are under
63. Find its mean as standard deviation.
Note: For fitting a binomial distribution in the problem itself, if it is given that the coin is
unbiased, male and female births are equally probable, then we consider p = q = ½ . All other
cases we have to find the value of p from the mean value of the given data.
9-3. RECTANGULAR (OR UNIFORM) DISTRIBUTION
Definition. A random variable X is said to have a continuous rectangular (uniform)
distribution over an interval (a, b), i.e., (- o<a<b< o), ifits [Link] given by :

= - a i f a <x<b . (9.19)
fx;a,b)
0, otherwise

Remarks 1. a and b, (a < b) are the two parameters of the distribution. The distributin is
called uniform distribution on (a, b) since it assumes a constant (uniform) value for all x in (a, b).
2. The distribution is also known as rectangular distribution, since the curve y f(x)
describes a rectangle o v e r the x-axis and between the ordinates at x =a and x b. =

3. A uniform or rectangular variate X on the interval (a, b) is written as : X U la, b] or


-

XR [a b).
4. The cumulative distribution function F(x) is given by:

0 xSa
F )=-- a<x<b . (9.19a)

1 x2b

Since F(x) is not continuous at x =a and x = b, it is not differentiable at these points.


and b and
ThusFx) =flx) = 0, exists everywhere except at the points x = a x =

consequently [Link]. fr) is given by (919).


FUNDAMENTALS OF MATHEMATICAL
9-30
uniform [Link]. fx)
and the corresponding distribution funos
STATISTI
ction F) are
CS
5. The graphs of
given below.

f() Fl

1
1
b-a

b
o
6. For a rectangular or uniform variate X in (-a, a), the [Link]. is given by:

f e r ) - 2 - a < x <a

0, 0, otherwise ..9-196)
9-3-1. Moments of Rectangular Distribution. Let X - U [a, b].

-)d- da= ) .9-20)


In particular

9-20a)
and
) - ( + ab +a-)
Variance
H-H2=(6+ ab +)-{ %b +a= b-a (9-206
9.3-2. M.G.F. of Rectangular
Distribution is given by:
M)
-"R) da =|-a da= - eat
+0 (9-20c)
9.3-3. Characteristic Function of
Rectangular Distribution is given
ox)= ei dx =eibt-eiat by:
t(6t#0 .. (9.20d)
9-3-4. Mean Deviati
about Mean, n of
n =Elx- Mean Rectangular Distributionisis given
given by:
- Mean|f(x) dx

-(b -a)/2 It dt, where t


- b-2 t dt
="
x-

(9.20)
ECIAL CONTINUOUS PROBABILITY DISTRIBUTIONS
9.31
Example 9.21. f X is uniformly distributed with mean 1 and
variancefind
P(X<0).

Solution. Let X~ U [a, b], so that p()= a<x<b. We are given:


Mean = +a) = 1 b+a =2 and Var (X) =( b - a y = b-a =t4.

Solving, we get a =-1 and b=3;(a<), plx)=|-1<x<3

0
P (X 0)= dr=/|«[,-i
Example 9-22. Subway trains ona certain line run every half hour between mid-night
and six in the morning. What is the probability that a man entering the station at a random
time during this period will have to wait at least twenty minutes?
Solution. Let the r.v. X denote the waiting time (in minutes) for the next train.
X is distributed
Under the assumption that a man arrives at the station at random,
uniformly on (0, 30), with [Link].,

fa) =00<x<30
0, otherwise
The probability that he has to wait at
least 20 minutes is given by:
30 30
P(X220) = dr= 1. dx= 30-20)=
20 20

distribution in [0, 1), find the distribution ([Link].) of


Example 9-23. 1f X has a uniform
-2 log X. Identify the distribution also.
X. Then the
distribution function G of Yis given by:
Solution. Let Y=-2 log
P (X2ey?)
(log X2-y/2)
=

P
(-2 log XSy)
=

oy )= P (YSy) = P e w 2

e/2

s)dr=1- 1. dx =
- ev2
=1-P(XSe?) =1
.()
8 )- )=ze2,0<y<o
from 0 to ]
as X ranges in (0, 1), Y= -2 log Xranges
-2 log X, has an exponential
[0, 1], then Y
=
u
illustrates that if X -

distribution with
Hemark. This example or Y = -2 log X, has chi-square
lstribution with parameter 0 [c.f. 5 9-8]
=
n2 degrees offreedom [c.f. Chapter 15].
distribution :J() 2-a <x<a, =

EXample 9-24. Showthat forrectangular


Also show that
moments
of even orderare given by:
8 J . about origin is (sinh at).
a2n
H2(2n+1)

You might also like