Introduction to Differential Equations
Introduction to Differential Equations
Differential Equations
General Notions
Definition. A differential equation is an equation that relates some function of
one or more variables with its derivatives.
The problem of forming and solving these equations is widely encountered in
physics and engineering. In biology and economics, differential equations are used
to model the behavior of complex systems.
The subject differential equations was motivated by problems in mechanics,
elasticity, astronomy and geometry during the latter part of the 17 th century.
Inventions (or discoveries) in theory, methods, and notation evolved concurrently
with innovations in calculus. Since their early historical origins, the number and
variety of problems to which differential equations are applied have grown
substantially. Today, scientists and engineers use differential
equations to study problems such as
• airflow around airplanes and design of airplanes,
• ship design,
• controlling the flight of airplanes and rockets,
• designing medical imaging technologies,
• designing electric circuits,
• earthquake detection and prediction,
• wave propagation,
• heat transfer,
• weather forecasting,
• designing optimal vaccination policies to prevent the spread of disease,
• forecasting and managing the harvesting of fish populations, and
• determining the price of financial derivatives.
All fields of science and engineering, as well as several of the business and
social sciences,use differential equations.
Basic definitions
Let us now introduce some definitions important for further understanding. A
differential equation is usually taken in a form which connects an argument (or
several arguments) and an unknown function (or several unknown functions) with
its derivatives. If the unknown function depends on one variable the differential
equation is called ordinary.
Otherwise the equation is called a partial differential equation. The highest
order of the derivative of the unknown function entering into an equation is called
the order of the differential equation.
The general form of a differential equation of the nth order is
F ( x, y , y ¿ , y // , .. . , y( n) )=0 , (1)
where y= y (x ) is the sought-for function. A function is called a solution of a
differential equation if it reduces the equation to an identity when substituted into
the equation.
Even the simplest examples indicate that a differential equation has infinitely
many solutions. For instance, taking a simple equation of the form
¿
y =x 2 , y= y (x ) (2)
we immediately find, by integrating, that
x3
y= +C
3 (3)
This is the general solution of equation (2).
It contains an arbitrary constant C and is the set of solutions containing all
solutions of the equation. Making the arbitrary constant assume concrete numerical
values we obtain particular solutions of equation (2):
x3 x3 x3 √ 2
y= ; y= +6 , y = −
3 3 3 3
Similarly, the general solution of an equation of the form (1) also contains n
arbitrary constants, i.e. it has the form
y= y(x , c 1 , c 2 ,. . ., c n ) (4)
we often obtain the general solution in an implicit form
Φ( x, y 1 , c 1 , c 2 ,. . ., c n )=0 (5)
Relations (4) and (5) are also called general integrals of equation (1).
Particular solution can be obtained from (4) or (5) if we make each of the arbitrary
constants c 1 ,c 2 ,...,c n take on a certain concrete numerical value. The graph of every
particular solution is called an integral curve of the differential equation.
To isolate a unique particular solution from the general solution we must set
some additional conditions. Such conditions are often taken as so-called initial
conditions. In the general case of an equation of form (1) the initial conditions are
y= y 0 , y ¿ =( y ¿ )0 , . .. y( n−1)=( y (n−1) )0 for x=x 0 (6)
Condition (6) for an equation of the first order of from (2) means that for a
certain value x=x 0 we must assign a value y= y 0 . For instance, let it be necessary
to isolate a solution for which y (1)=2 . Then (3) implies
1 5
2 = +C C=
3 i.e. 3.
Hence, the sought – for particular solution has the form
x 2 +5
y=
3
Definition. The problem of finding a particular solution of a differential
equation when certain initial conditions are given is called the Cauchy problem
(initial-value problem)
Separable equations
dy
=f (x , y )
1) The differential equation of the form dx is called separable, if
f ( x , y )=h( x ) g( y ) that is
dy
=h( x ) g( y )
dx (2)
Let us rewrite the equation (2) as
dy
=h( x )dx
g( y)
and then, integrating,
1
∫ g( y ) dy=∫ h( x )dx+C
we obtain
G( y )=H ( x )+C
dy y 2−1
=
Example. Solve dx x
Separating the variables we obtain
dy dx
=
y −1 x
2
1 y−1
ln| |=ln|x|+C
2 y+1
2) The equation
M 1 ( x ) N 1 ( y )dx+M 2 ( x )N 2 ( y )dy=0
(3)
is also an equation with variables separable. If we divide both side of equation
(3) by N 1 ( y )M 2 ( x ) , we get
M 1 ( x )N 1 ( y ) M 2 ( x )N 2 ( y )
dx + dy=0
N 1( y ) M 2( x ) N 1 ( y ) M 2( x )
or
M1 ( x ) N 2( y )
dx + dy=0
M 2( x ) N 1( y )
We can construct the general solution of this equation:
M ( x) N ( y)
∫ M 1( x ) dx+∫ N 2 ( y ) dy =C
2 1
Example: ( x −1 ) y +2 xy =0
2 ' 2
y ( 0 )=1
dy 2x
=− dx
y2 x 2 −1
1
=ln|x 2−1|+c
y
1=ln 1+ c
c=1
1
=ln|x 2−1|+1
y
'
Example: ( x +1 ) y +xy =0
dy x
=− dx
y x +1
x +1−1
ln y =−∫ dx+ln c
x+1
ln y =−∫ 1− ( 1
x +1 )
dx +ln c
ln y=−x+ln|x+1|+ln c
ln y−ln c|x+1|=−x
y
ln =−x
c ( x +1 )
y
=e− x
c ( x +1 )
y=c ( x +1 ) e−x
Homogeneous Equations
dy
=f (x , y )
dx (1)
is homogeneous if the function f ( x , y ) is homogeneous, that is f (tx , ty)=f ( x , y ) for
1
t=
any number t. Let us substitute x in this identity, then we have
f ( x , y )=f 1; ( )
x
y
xdu du dx
=f (1, u )−u =
dx or f (1 , u)−u x
Integrating, we find
du dx
∫ f (1 , u)−u =∫ x
+C .
Once solved, go back to the old variable y via the equation y=ux .
A first – order ordinary differential in the form
M ( x , y )dx +N ( x , y )dy=0
is a homogeneous type if both functions M ( x , y ) and N ( x , y ) are homogeneous
functions of the same degree n. That is, multiplying each variable by a parameter λ
, we find
dy −2 x +5 y
=
dx 2 x+ y
Solution: it is easy to check, that
−2 x +5 y
f ( x , y )=
2 x + y is homogeneous.
y
=u
we substitute x , then
¿ ¿
y=ux , y =u+xu
−2 x+5 ux
u+ xu ¿=
and hence 2 x +ux or
−2+5 u
u+ xu ¿=
2+u
which can be rewritten as
u=
¿
(
1 −2+5 u
x 2+u
−u )
This is a separable equation
= −
dx x u+2 (
du 1 u2 −3 u+2
−
u+2
or (u−2)(u−1)
)dx
du= .
x
Integrating, the latter we find
4 du 3 du dx
−∫ +∫ =∫ +lnC
u−2 u−1 x
−4ln|u−2|+3ln|u−1|=ln|x|+ln C .
|u−1|3 (u−1)3
ln =ln|x|C =xC
|u−2|4 or (u−2)4
( ) ( )
3 4
y y
−1 =xC −2
x x ,
( y−x )3=C ( y−2 x ) 4
This is the general solution of the original differential equation. Example:
Solution: We substitute y= xu ,
dy y
= +2
Solve the homogeneous equation dx x
y
x. √
dy dy
=u+x
then dx dx .
du du
u+ x =u+2 √u , x =2 √ u
and dx dx
this is a separable equation.
Integrating, the latter we find
dx du
= , ln x=ln c + √ u , √u=ln cx
x 2 √u
We get
√ y
x
=ln cx 2
, or y=x ln cx
general solution of the original differential equation.
LECTURE 2
Linear Equations
¿ −∫ P( x)dx
z ( x) e =Q( x )
or
z ¿ ( x) =Q( x)e∫
P (x)dx
Integrating, find
z( x) =∫ Q (x )e∫
P( x)dx
dx+C1
Substituting this expression in formula (3), we get general solution of linear
equation (1):
(
y = ∫ Q (x )e∫
P( x)dx
dx+C1 e ∫ )
− P( x)dx
x¿
Examples. Solve y − y=e
Let us consider associated homogeneous equation
¿
y − y=0
It is separable equation. Thus
dy dy dy
=y ⇒ =dx ⇒∫ =∫ dx+ln C
dx y y
hence
ln y =x+lnC
or
y
ln =x
y=Ce x
C
Let us substitute C(x) into C
x
Then y=C ( x )e
Differentiating this expression, we find
¿ ¿
y =C ( x )e x +C ( x )e x
¿
Substituting y and y into given equation, we get
¿
C ( x )e x +C ( x )e x −C (x )e x=e x
¿
C ¿ ( x )e x =e x or C ( x)=1
Integrating, we find
C ( x )=∫ dx + C1 ⇒C ( x )=x +C 1
So
y=( x+ C1 )e x
It is general solution of given equation.
¿ 4
Examples. Solve xy −2 y=2 x
Let us consider associated homogeneous equation
xy ¿ −2 y=0
It is separable equation. Thus
dy 2 y dy 2 dx dy 1
= ⇒ = ⇒∫ =2∫ dx+ln C
dx x y x y x
hence
ln y =ln x 2 +ln C
or
y
ln =ln x 2
C y=Cx 2
Let us substitute C(x) into C
2
Then y=C ( x) x
Differentiating this expression, we find
y ¿ =C¿ ( x ) x2 +2C ( x ) x
Substituting y ¿ and y into given equation, we get
C ¿ ( x ) x 3 +2 C( x ) x 2 −2C ( x ) x2 =2 x 4
¿
C ¿ ( x ) x 3=2 x 4 or C ( x)=2x
Integrating, we find
Bernoulli Equation
z= y −1 z ¿ =− y −2 y ¿
With this substitution the differential equation becomes,
4
−z ' + z=x 3
x
So, as noted this is linear differential equation that we know how to solve. We
have
4
z − z=−x 3
'
x (6)
Let us consider the corresponding homogeneous equation
' 4
z − z=0
x
which is separable equation. Thus
dz 4 dz dx dz dx
= z ⇒ =4 ⇒∫ =4 ∫ +ln C
dx x z x z x
4
Hence ln z=4 ln x +ln C ⇒ ln z=lnCx ⇒ z =Cx
4
¿ 4
C ( x ) x 4 +4 C ( x ) x3 − C ( x )⋅x 4 =−x 3
x
¿ ¿ 1
C ( x ) x 4 =−x 3 ⇒ C ( x )=− ⇒ C( x )=−ln x+C1
x
so
z=(−ln x +C 1 )x 4
To get the solution in terms of y all we need to do is plug the substitution back in.
Doing this gives
y−1 =x 4 (C1 −ln x )
Example 1. Solve
( x +1 ) ( y ' + y 2 ) =− y (7)
Solution. The first thing that we need to do is get this into the “proper” form
and that means dividing everything by y 2 . Doing this gives,
1
y−2 y +1=−
¿
y ( x +1 )
The substitution and derivative that we will need here is:
z= y −1 z =− y −2 y
¿ ¿
' z
−z +1=−
x+1
So, as noted this is linear differential equation that we know how to solve. We
have
z '
z −1=
x+1 (8)
Let us consider the corresponding homogeneous equation
dz dx
=
z x +1
which is separable equation. Thus
ln z=ln|x+1|+ln c
Hence z=c ( x+ 1 )
Let us substitute C(x) instead of C
Then z=C ( x ) ( x +1 )
¿
Differentiating this expression, and substituting z and z into (8), we get
C ( x ) ( x +1 )
C ¿ ( x ) ( x+1 )+C ( x )−1=
( x+1 )
C ( x )=ln ( x +1 ) +C 1
z=( x +1 ) ( ln ( x +1 ) +C1 )
y ( 1+ x ) ( ln ( x+1 ) +C 1 ) =1
.
Lecture 3
∂ M ∂2 f ∂2 f ∂ N
= = =
∂ y ∂ x∂ y ∂ y ∂ x ∂ x
This suggests the following test for exactness
∂M ∂N
= ,
But, since ∂ y dx then
x
or
¿
N ( x , y )−N ( x 0 , y )+ϕ ( y )=N ( x , y )
¿
Hence ϕ ( y )=N ( x 0 , y ) , and it follows that
y
ϕ ( y )=∫ N ( x 0 , y )dy +C 1
y0
Therefore,
x y
f ( x , y )=∫ M (x , y )dx +∫ N ( x 0 , y )dy + C1
x0 y0
∫ M ( x , y )dx+∫ N ( x 0 , y )dy= C
x0 y0
Otherwise, IF the equation (4) is not equal. What we will do for solving these
equations?
So why aren't all equations exact? Consider the following example.
( x 2 + y )+( x−sin y ) dy =0
dx is exact
( x 2 + y ) ( x−sin y ) dy
+ =0
x x dx is no exact
There is no real difference between the two equations, we have just divided
through by x to obtain the second equation from the first. The equations have the
same solutions. But when we test for exactness, we find that only the first equation
is exact. The reason most first order equations aren't exact is that some key factor
has been divided out. One way to solve the equation would be to find the factor,
called an integrating factor, and put it back in. Unfortunately, finding integrating
factors can be extremely difficult. There is one situation where it is fairly
straightforward, when the equation is linear. A first order differential equation is
said to be linear if it can be written in the form
'
y + p ( x ) y=q ( x )
For a linear equation, we find an integrating factor as follows
∫ p( x )dx
μ( x )=e
If we multiply through the linear equation by μ( x ) we obtain
'
μ( x ) y ( x )+μ ( x ) p (x ) y ( x )=μ( x )q( x )
'
Now μ ( x )=μ( x ) p( x ) from the way we defined μ( x ) so we can now write our
equation as
' '
μ( x ) y ( x )+μ (x ) y ( x )=μ( x )q( x )
d
( μ( x ) y ( x ))=μ( x )q (x )
dx
Now we integrate both sides to obtain
μ( x ) y ( x )=∫ μ (x )q ( x )dx
1
y( x)=
μ( x)
∫ μ ( x)q( x)dx
So
WARNING: You can't cancel the inside the integral by the outside the integral.
This formula is confusing because we are using x as both the dummy variable of
integration and as the independent variable of the resulting function.
Lecture 4
Lagrange and Clairaut Equations*
Alexis Claude Clairaut (1713-1765) solved the differential equation
y=x y + g ( y )
' '
x=F ( p , C ) ,
y=F ( p ,C ) f ( p ) + g ( p )
(4)
We had also assumed the p−f ( p )≠0 . However, there might also be solutions of
Lagrange’s equation for which p−f ( p )=0 . Such solutions are called singular
solutions.
Example 1.
' ¿
Solve the Lagrange equation y=2 x y − y . We will start with Equation (3). Noting
that
f ( p )=2 p , g ( p )=− p2 , we have
' '
' f ( p) g ( p)
x− x=
p−f ( p ) p−f ( p )
' 2 −2 p
x− x=
p−2 p p−2 p
' 2
x + x=2.
p
(5)
This first order linear differential equation can be solved using an integrating
factor. Namely,
μ ( p )=exp ( p )
∫ 2
dp =e2 ln p
.
= p 2
solution.
' ¿
Example 2. Find the solutions of y=x y − y . As noted, there is a family of
straight line solutions
y=Cx−C 2 , since g ( p )=− p2 . There might also by a parametric solution not
contained n this family. It would be given by the set of equations
'
x=−g ( p )=2 p
y=− p g ( p ) +g ( p )=2 p 2− p 2= p2
'
(8)
x2
y=
Eliminating p , we have the parabolic curve 4 . In Figure 1.8 we plot these
solutions. The family of straight line solutions are shown in blue. The limiting
curve traced out, much like string figures one might create, is the parametric curve.
1.4.3 Riccati Equation*
Jacopo Francesco Riccati (1676-1754) studied curves with some specified
curvature. He proposed an equation of the form
y +a ( x ) y 2 +b ( x ) y + c ( x )=0
'
Example 1.
2 x 2x x '
Find the general solution of the Riccati equation, y − y +2 e y−e −e =0 using
x
the particular solution y 1 ( x )=e . We let the sought solution take the form
y ( x )=z ( x ) +e x Then, the equation for z(x) is found as
dz
=z 2
. dx
1
z=
This equation is simple enough to integrate directly to obtain C−x . Then, the
solution to the problem becomes
1
y ( x )= +e x
C−x
Lecture 5
Applications
In this section we will look at some simple applications which are modeled with
first order differential equations. We will begin with simple exponential models of
growth and decay
1.3.1 Growth and Decay
Some of the simplest models are those involving growth or decay. For example, a
population model can be obtained under simple assumptions. Let P ( t ) be the
population at time t . We want to find an expression for the rate of change of the
dp
population, dt . Assuming that there is no migration of population, the only way
the population can change is by adding or subtracting individuals in the population.
The equation would take the form
dP
=Rate In−Rate Out
dt
The Rate In could be due to the number of births per unit time and the Rate Out by
the number of deaths per unit time. The simplest forms for these rates would be
given by
Rate In=bP and the Rate Out=mp
Here we have denoted the birth rate as b and the mortality rate as m . This gives
the total rate of change of population as
dp
=bP−mP=kP
dt
(1.27)
Equation (1.27) is a separable equation. The separation follows as we have seen
earlier in the chapter. Rearranging the equation, its differential form is
dP
=kdt
P
Integrating, we have
dP
∫ P =∫ kdt
ln|P|=kt +C
(1.28)
Next, we solve for P ( t ) through exponentiation, Integrating, we have
|P ( t )|=e kt+C
P ( t ) =±e kt +C
(1.29)
kt
C
=±e e
= Ae kt
C
Here we renamed the arbitrary constant, ±e as A
If the population at t =0 is P0 , i.e., P ( 0 )=P 0 , then the solution gives
P ( 0 )= Ae 0 =A=P 0 . So, the solution of the initial value problem is
P ( t ) =P0 e kt
Equation (1.27) the familiar exponential model of population growth: Malthusian
population growth.
dP
=kP
dt
This is easily solved and one obtains exponential growth (k > 0) or decay (k < 0).
This Malthusian growth model has been named after Thomas Robert Malthus
(1766-1834), a clergyman who used this model to warn of the impending doom of
the human race if its reproductive practices continued.
Example 1.8.
If 150.0 g of Thorium-234 decays to 137.6 g of Thorium234 in three days, what is
its half-life?
This is another simple decay process. If Q ( t ) represents the quantity of unstable
material, then Q ( t ) satisfies the rate equation
dQ
=kQ
dt
with k < 0.
kt
The solution of the initial value problem, as we have seen, is Q ( t ) =Q0 e . Now, let
1
Q ( τ )= Q 0
the half-life be given by τ. Then, 2 . Inserting this fact into the solution,
we have
Q ( τ )=Q0 e kt
1
Q0 =Q0 e kt
2
1 kt
=e
2
t
Noting that Q ( t ) =Q0 ( e ) , we solve Equation (1.31) for
k
−1
e k =2 . τ
(2
pounds
gal )( )
5
gal
min
=10
pound
min
Similarly, one can determine the rate out as
(x pounds
50 gal )( )
5
gal
=
x pounds
min 10 min
Thus, we have
dx x
=10−
dt 10
This equation is solved using the methods for linear first order equations. The
x
10
integrating factor is μ= e , leading to the general solution
−t
10
x (t )=100+ Ae
Using the initial condition, one
finds the particular solution
( )
−t
10
x (t )=100 1−e
Often one is interested in the long time behavior of a system. In this case we have
that lim t →∞ x ( t )=100 lb . This makes sense because 2 pounds per galloon enter during
this time to eventually leave the entire 50 gallons with this concentration. Thus,
lb
50 gal×2 =100 lb
50 gal
Lecture 6
Differential equations of second – order.
Reduction of order
Some second-order equation can be reduced to first-order equation, rendering
then susceptible to the simple methods of solving equations of the first order. The
following are three types of such second-order equations:
Type 1: Let
y //=f ( x ) (1)
Integrating this equation (1), we obtain
y =∫ f (x )dx +C 1
¿
x3
−sin x +C 1 x+C 2
6
Type 2: Second-order equations with the dependent variable mussing.
This is an equation of the type
¿
F ( x , y , y // )=0 (2)
The dependent variable y does not explicitly appear in the equation. This type
of second-order equation is easily reduced to a first-order equation by the
transformation
¿
y = p.
// '
This substitution obviously implies y = p , and the original equation
becomes a first-order equation for p
¿
F ( x, p , p )=0
Suppose we have managed to integrate this equation and have obtained its
general solution p=ϕ (x , C1 ) . Then we have
¿
y =ϕ ( x ,C 1 )
and therefore the general solution of equation (2) is thus obtained:
y=∫ ϕ(x , C1 )dx +C2
Example 2. Solve the differential equation
xy // = y ¿
¿ ¿
Solution. Since the dependent variable y is missing, let y = p and y = p .
//
These substitutions transform the given second-order equation into the first-
order equation
¿
xp = p
It is a separable equation. Separating the variables, we obtain
dp dx
=
p x
Integrating, we have
dP dx
∫ P =∫ x
Then ln p=ln x+ ln C1 or ln p=ln xC 1
and hence
p=xC 1
x2
y =C1 x ⇒ y=∫ C 1 x +C 2=C 1
¿
+C 2
So 2
Type 3. Second-order nonlinear equations with the independent variable
missing.
Let it be of the form
F ( y , y ¿ , y // )=0 (3)
The independent variable x does not explicitly appear in the equation.
The method for reducing the order of these second-order equations begins
with the same substitution as for Type 2 equations, namely, replacing y/ as p. But
instead of simple writing y// as p/, the trick here is to express y// in terms of a first
derivative with respect to y.
Therefore we write
y //= ( )
d dy dp dp dy
= =
dx dx dx dy dx
=p
dp
dy
Now we deduce from equation (3) the equation
(
F y , p, p
dp
dy
=0)
which is a first-order equation.
If we manage to integrate it and to find its general solution p=ϕ ( y , C 1 ) then
we have
dy
=ϕ ( y ,C 1 )
dx
and therefore the general solution of equation (3) can be directly written in the
form
dy
∫ ϕ( y , C ) =x+C2
1 .
Example 3. Solve the differential equation
¿
1+( y )2 =2 yy //
dp
¿ y //= p
Solution. The substitutions y = p and dy transform this second-order
equation for y into the following first-order equation for p:
dp
1+ p2 =2 yp
dy
Separating the variables, we obtain
2 pdp dy
=
1+ p2 y
Integrating, we have
2 pdp dy
∫ 1+ p 2 =∫ y +ln C 1
It follows that
2
ln (1+ p 2 )=ln y+ ln C1 or 1+ p =C1 y
⇒ p 2=C 1 y−1 ⇒ p= C1 y −1 √
¿
Now, since p= y , this last result becomes
dy
=√ C1 y−1
dx
Separating the variables, we get
dy
∫ =∫ dx+C2
√ 1
C y −1
2
√C y−1=x+C2
C1 1
So
4 ( C1 y −1 )
=( x +C 2 )2
or C21
.
Second – Order Linear Differential Equations
d2 y dy
2
+ p (x ) + q( x ) y=0
dx dx (2)
¿ ¿
q ( x )(C1 y 1 +C2 y 2 )=(C 1 y //1 + C2 y 2// )+ p ( x )(C1 y 1 + C2 y 2 )+
+q ( x )( C 1 y 1 +C 2 y 2 )=C1 [ y //1 + p ( x ) y ¿1 +q ( x ) y 1 ] +
+C 2 [ y 2// + p( x ) y ¿2 + q( x ) y 2 ] =C 1⋅0+C 2⋅0=0
Thus, y=C 1 y 1 +C 2 y 2 is a solution of equation (2). The other fact we need is given by
the following theorem. It says that the general solution is a linear combination of
two linearly independent solutions y 1 and y 2 .
This means that neither y 1 and y 2 is a constant multiple of the other. For
2 2 x
instance, the functions f ( x )=x and g( x )=5 x are linearly dependent, but f ( x )=e
x
and g( x )=xe are linearly independent.
Theorem 2. If y 1 and y 2 are linearly independent solutions of equation (2),
then the general solution is given by
y ( x )=C1 y 1 (x )+C2 y 2 ( x )
where C and C 2 are arbitrary constants.
1
( )
x
Integrating, we have
x
ln W =−∫ p( x ) dx +ln C
x0
or
x
W
ln =−∫ p( x ) dx
C x 0
So
( )
x
W ( y 1 , y 2 ) ( x )=C exp −∫ p( x ) dx
x0
where C=W ( y 1 , y 2 )( x 0 )
Lecture 7
Second-order linear homogeneous differential equation with constant
coefficients
following fact:
2
If the roots k1 and k2 of the auxiliary equation ak +bk +c=0 are real and
unequal, then the general solution of
¿
ay // +by +cy=0 is
k1x k2 x
y=C 1 e +C 2 e (4)
// ¿
Example 1. Solve the equation y + y −6 y=0
Solution. The auxiliary equation is
2
k +k −6=(k −2)(k + 3)=0
whose roots are k1=2, k2 =-3. Therefore by (4) the general solution of the given
differential equation is
y=C 1 e 2 x +C 2 e−3 x
2
Case 2. b −4 ac=0
In this case k1=k2; that is, the roots of the auxiliary equation are real and equal. Let
us denote by k the common value of k1 and k2. Then, from formula (3), we have
b
k =−
2 a so 2 ak +b=0 (5)
kx kx
We know that y 1 =e is one solution of equation (1). We now verify that y 2 =xe is
also a solution:
¿
ay //2 + by 2 +cy 2 =a(2 ke kx +k 2 xekx )+b(e kx + kxekx )+cxe kx =
¿(2 ak + b)e kx +( ak 2 +bk + c )xe kx =0⋅( ekx )+ 0⋅( xekx )=0
The first term is 0 by equation (5); the second term is 0 because k is a root of the
kx kx
auxiliary equation. Since y 1 =e and y 2 =xe are linearly independent solutions.
Theorem 2 provides us with the general solution. If the auxiliary equation
ak 2 +bk +c=0 has only one real root k, then the general solution of
¿
ay // +by +cy=0 is
y=C 1 e kx +C 2 xekx (6)
// ¿
Example 2. Solve the equation y −2 y + y=0
2
Solution. The auxiliary equation k −2 k +1=0 can be factored as (k −1) =0
2
[ b
In fact α=− , β=
2a
√ 4 ac −b2
2a ]
Then, using Euler’s equation
e iθ =cos θ+i sin θ
we write the solution of the differential equation as
k1x k2 x
y=C 1 e +C 2 e =C1 e( α +iβ) x +C 2 e(α−iβ )x =
=C 1 e αx (cos βx +isin βx )+C 2 e αx (cos βx−isin βx )=
¿ e αx [(C 1 +C 2 ) cos βx +i(C1 −C 2 )sin βx ]=
~ ~
¿ e αx [ C1 cos βx+ C 2 sin βx ]
~ ~
where C 1 =C 1 +C 2 , C 1=i(C 1−C2 )
This gives all solutions (real or complex) of the differential equation.
~ ~
The solutions are real when the constants C1 and C2 are real. We summarize
the discussion as follows.
2
If the roots of the auxiliary equation ak +bx+ c=0 are the complex
// ¿
numbers k 1=α +iβ , k 2 =α −iβ , then the general solution of ay +by +c=0 is
y=e αx [ C 1 cos βx +C2 sin βx ] (7)
// ¿
Example 3. Solve the equation y +4 y +13 y=0
Solution. The auxiliary equation is
k 2 +4 k +13=0
By the quadratic formula, the roots are
k 1, 2 =−2±3 i
By (11) the general solution of the differential equation is
y=e−2 x (C1 cos 3 x +C 2 sin 3 x )
Initial-Value Problem
An initial-value problem for the second-order equation (1) consists of finding
a solution y of the differential equation that also satisfies initial conditions of the
form
¿
y ( x 0 )= y 0 y ( x 0 )= y 1
where y 0 and y 1 are given constants.
Example 4. Solve the initial-value problem
¿
y // + y ¿ −6 y=0; y ( 0)=1 and y (0)=0
Solution. From Example 1 we know that the general solution of the
differential equation is
y ( x )=C1 e 2 x +C 2 e−3 x
Differentiating this solution, we get
¿
y (x )=2 C1 e 2 x −3 C2 e−3 x .
To satisfy the initial conditions we require that
y (0 )=C 1 +C 2=1
¿
y (0)=2C 1−3C 2 =0 .
2
C 2= C 1
We have from second equation 3 and from the first
2 3 2
C 1 + C 1=1 ⇒C 1 = , C 2=
3 5 5
Thus, the required solution of the initial-value problem if
3 2
y= e2 x + e−3 x
5 5
Example 5. Solve the initial-value problem
y // + y=0 y (0 )=2 y ¿ ( 0)=3
Solution. The auxiliary equation is
k 2 +1=0 or k 2 =−1 , whose roots are ±i . Thus α=0 , β=1 and since e 0 x =1 ,
the general solution is
y ( x )=C1 cos x +C 2 sin x
'
Since y ( x )=−C sin x +C 2 cos x
the initial conditions become
y (0 )=C 1=2 y ¿ (0 )=C 2=3
Therefore, the solution of initial-value problem is
y ( x )=2cos x+3 sin x
// ¿
Summary: Solution of ay +by +c=0
2
Roots of ak +bk +c=0 General solution
k 1 , k 2 - real and distinct y=C 1 e
k 1x
+C 2 e
k2 x
Lecture 8
Nonhomogeneous Linear Differential Equations
Let us consider second-order nonhomogeneous linear differential equations
with constant coefficients, that is, equation of the form
¿
ay // + by +cy =f (x ) (1)
where a,b,c are constants and f(x) is a continuous function. The related
homogeneous equation
¿
ay // +by +cy=0 (2)
is called the complementary equation and plays an important role in the solution
of the original nonhomogeneous equation (1).
Theorem. The general solution of the nonhomogeneous differential equation
(1) can be written as
y ( x )= y p ( x )+ y c ( x ) ,
where y p is a particular solution of equation (1) and y c is the general solution
of the complementary equation (2).
Proof. All we have to do is verify that if y is any solution of equation (1), then
y− y p is a solution of the complementary equation (2).
Indeed
'
a ( y− y p ) // +b ( y− y p ) +c ( y− y p )=
¿ ay // + ay //p +by ¿ −by ¿p +cy−cy p =
¿( ay // +by ¿ + cy )−(ay //p +by ¿p +cy p )=
¿ f ( x )−f ( x )=0
Polynomials are equal when their corresponding coefficients are equal. Thus
{−2A=1¿{2A−2B=0¿¿ ¿
The solution of this system of equation is
1 1 3
A=− , B=− , C=−
2 2 4
A particular solution is therefore
1 1 3
y p ( x )=− x 2 − x−
2 2 4
and, by theorem 1, the general solution is
1 1 3
y= y c + y p =C 1 e x +C 2 e−2 x − x 2 − x−
2 2 4
Example 2. Solve the equation
¿
y // + y =5 x+3
Solution. The auxiliary equation of
// ¿
y + y =0 is
k 2 +k =0
with roots k 1=0 k 2=−1
So the solution of complementary equation is
y c =C1 +C2 e−x
Since f ( x )=5 x +3 is a polynomial of degree 1 and one of the roots of auxiliary
equation is 0 (r=1), then we look for a particular solution of the form
y p ( x )=( Ax +B ) x=Ax 2 +Bx
¿
Then p =2 Ax+B and y p =2 A
//
y
So, substituting into the given equation, we obtain
2 A +2 Ax +B=5 x+ 3
or
2 Ax +(2 A+ B )=5 x+3
Polynomials are equal when their coefficients are equal. Thus
{2 A=5¿¿¿¿
The solution of this system of equations is
5
A= ; B=−2
2
Therefore, a particular solution is
5
y p ( x )= x 2 −2 x
2
and, the general solution is
5
y= y c + y p =C 1 +C 2 e−x + x 2 −2 x
2
αx
II. If the right side of equation (1) f ( x ) is of the form e P n ( x ) , where Pn ( x) is
or y p =( Ax 2 +Bx )e 3 x
Then
¿
y p =(2 Ax+B )e3 x +3( Ax 2 +Bx )e3 x =
=(3 Ax 2 +3 Bx+2 Ax+B )e 3 x
and
y //p =(6 Ax+3 B+2 A )e 3 x +3(3 Ax 2 +3 Bx+2 Ax+B )e 3 x =
=(9 Ax 2 +12 Ax+9 Bx+6 B+2 A )e3 x
Substituting into the differential equation, we have
( 9 Ax 2 +12 Ax+9 Bx +6 B+2 A )e3 x−
−2(3 Ax 2 +3 Bx+2 Ax+B)e 3 x −
−3( Ax 2 +Bx )e 3 x=( x+2)e3 x
or 9 Ax 2 +12 Ax +9 Bx +6 B+2 A−
−6 Ax 2 −6 Bx−4 Ax−2 B−
2
−3 Ax −3 Bx=x +2
Simplifying, we get
8 Ax+(2 A+ 4 B)=x+ 2
whence
{8 A=1¿¿¿¿
The solution of this system of equation is
1 7
A= B=
8 and 16
y p ( x )= ( 18 x +167 x ) e
2 3x
y ( x )=C1 e− x +C 2 e3 x + ( 18 x +167 x) e
2 3x
III. If the right side of equation (1) f ( x ) is of the form M cos βx+ N sin βx , we take
as particular solution a function of the form
y p ( x )=( A cos βx +B sin βx ) x r
where A,B are unknown coefficients and r = number of the roots of the auxiliary
{−3A+B=0¿¿¿¿
The solution of this system is
1 3
A=− B=−
10 and 10
So a particular solution is
1 3
y p ( x )=− cos x− sin x
10 10
In example 1 we determined that the solution of the complementary equation is
y c =C1 e x +C 2 e−2 x
and ay // + by ¿ +cy =f 2 ( x )
yp + yp
respectively, then 1 2 is a solution of
¿
ay // + by +cy =f 1 ( x )+f 2 ( x )
// x
Example 6. Solve y −4 y =xe +cos2 x
2
Solution. The auxiliary equation is k −4=0 with roots ±2 , so the solution of
the complementary equation is
y c (x )=C 1 e2 x +C2 e−2 x
// x
For the equation y −4 y =xe we try
x
y p 1 ( x )=( Ax +B )e
x
y p =( Ax +B+ A )e
¿
Then 1
y //p 1 =( Ax+2 A+ B )e x
Thus
{−3 A=1¿¿¿¿
1 2
A=− , B=−
So 3 9 and
(1 2
y p ( x )= − x− e x
1 3 9 )
//
For the equation y −4 y =cos2 x , we try
y p ( x )=C cos 2 x+ D sin 2 x
2
Substitution gives
−4 C cos2 x−4 D sin 2 x−4(C cos2 x+D sin 2 x )=cos 2 x
or −8 C cos 2 x−8 D sin 2 x=cos2 x
Lecture 9
The Method of Variation of Parameters
Suppose we have already solved the homogeneous equation
ay // +by ¿ +cy =0
and written the solution as
y ( x )=C1 y 1 (x )+C2 y 2 ( x ) (1)
where y 1 and y 2 are linearly independent solutions. Let us replace the constants (or
Since u1 and u2 are arbitrary functions, we can impose two conditions on them.
One condition is that y p is a solution of the differential equation; we can choose the
other condition so as to simplify our calculations. In view of the expression in
equation (3), let us impose the condition that
u¿1 y 1 + u¿2 y 2 =0 (4)
Then
¿ ¿ ¿ ¿
y //p =u 1 y 1 +u2 y 2 +u1 y //1 +u2 y 2//
Substituting in the differential equation, we get
¿ ¿ ¿ ¿
a (u1 y 1 +u2 y 2 +u1 y 1// +u2 y 2// )+
+b (u1 y ¿1 +u2 y ¿2 )+C (u1 y 1 +u 2 y 2 )=f (x )
or
¿ ¿
u1 (ay 1// +by 1 + cy 1 )+u2 ( ay 2// +by 2 + cy 2 )+
+ a(u¿1 y ¿1 + u¿2 y ¿2 )=f ( x ) (5)
So
u2 ( x )=sin x−ln|tan ( x2 + π4 )|
Therefore
[
y p ( x )=−cos x sin x + sin x−ln|tan ( )]
x π
+ | cos x=
2 4
=−cos x ln|tan ( )
x π
+ |
2 4
and the general solution is
y ( x )=C1 sin x +C 2 cos x−cos x ln|tan ( )
x π
+ |
2 4
Lecture 10
Complex Numbers
Definitions
Definition 1.1 Complex numbers are defined as ordered pairs ( x , y )
Points on a complex [Link] axis, imaginary axis, purely imaginary
[Link] and imaginary parts of complex number. Equality of two complex
numbers
Definition 1.2 The sum and product of two complex numbers are defined as
follows:
( x 1 , y 1 ) + ( x 2 , y 2 ) =( x 1 + x 2 , y 1 + y 2 )
( x 1 , y 1 )⋅( x 2 , y 2 ) =( x 1 x 2− y 1 y 2 , x 1 y 2 + x2 y1 )
In the rest of the chapter use z , z 1 , z 2 for complex numbers and x , y for real
[Link] and z=x +iy notation.
Algebraic Properties
1. Commutativity
z 1 + z 2 =z 2 + z 1 z 1 z 2=z 2 z 1
2. Associativity
( z 1 + z 2 ) + z 3 = z1 + ( z 2 + z 3 )( z 1 z 2 ) z 3 =z 1 ( z 2 z 3 )
3. Distributive Law
z ( z 1 + z 2 ) =zz 1 + zz 2
4. Additive and Multiplicative Indentity
z +0= z z⋅1=z
5. Additive and Multiplicative Inverse
−z=(−x ,− y )
x
( −y
z−1 = 2 2 , 2 2 , z≠0 .
x +y x + y )
6. Subtraction and Division
z1
=z 1 z −1
z 1−z 2 =z 1 + ( −z2 ) z2 2
,
7. Modulus or Absolute Value
|z|=√ x 2 + y 2
8. Conjugates and properties
z̄=x−iy=( x ,− y )
z1 ±z 2 =z 1±z 2
()
z1 z1
=
z2 z2
z1 z 2 =z 1 z 2
9.
|z|2 =z z
z+z z−z
Re z= , Im z=
2 2i
[Link] Inequality
|z 1 +z 2|≤|z 1|+|z 2|
3.
i ( θ1 +θ2 )
z 1 z 2=r 1 r 2 e
z1 r 1 i ( θ 1−θ2 )
= e
z2 r2
z n =r n e nθ
[Link]’s Formula
( cos θ+ isin θ )n =cos nθ+i sin nθ
( ) (e )
1 iπ iπ iπ i2 π iπ i4π
1+i 3 + +
3= 4 12 12 3 12 3
are e ,e ,e
Example [Link] three possible roots of √ 2
Regions in Complex Plane
Domain of f is C−{ 0 } .
A multiple-valued function is a rule that assigns more than one value to each point
of domain.
Example 2.2
f ( z )= √ z This function assigns two distinct values to each z ( ¿ 0 ) . One can choose
the function to be single-valued by specifying
√ z=+ √ r e
iθ
2
where θ is the principal value.
Elementary Function
1. Polynomials
P ( z )=a 0 +a 1 z + a2 z 2 +. ..+ an z n
Where the coefficients are real. Rational Functions.
2. Exponential Function
z2 z3
e z=1+ z + + +.. .
2! 3 !
Converges for all . For real the definition coincides with usual exponential
z
sin z
tan z=
cos z
2 2
a. sin z +cos z=1
2 sin z1 cos z 2=sin ( z1 + z 2 ) +sin ( z 1 −z 2 )
b.
2 cos z 1 cos z 2 =cos ( z 1 + z 2 ) +cos ( z1 −z 2 )
c.
2 sin z1 sin z 2 =−cos ( z 1 + z 2 ) +cos ( z 1−z 2 )
d.
e. sin ( z +2 π )=sin z and cos ( z+2 π )=cos z
f. sin z=0 iff z=nπ n=( 0 ,±1 ,. . . )
π
cos z=0 iff z = +nπ n=( 0 ,±1 , .. . )
g. 2
h. These functions are not bounded.
i. A line segment from ( 0 , y ) to ( 2 π , y ) maps to an ellipse with semimajor axis
equal to cash y under sin function.
[Link] Functions
Define
e z +e− z
cosh z=
2
e z −e−z
sinh z=
2
Analytic Functions
Limits
A function f is defined nbd of z 0 .
Definition 3.1 The limit of the function f ( z ) as z → z 0 is number ω 0 if, for any
given ε > 0 there exists a δ >0 such that
|z−z 0|<δ ⇒|f ( z )−ω0|<ε .
lim z→ z f ( z )=5 z 0
Example 3.1 f ( z )=5 z . show that 0 .
2
2 ( )
lim z→ z 0 f z =z 0
Example 3.2 f ( z )=z . Show that .
Example 3.3 f ( z )=z / z̄ show that the limit of f does not exist as z → 0 .
ω 0 =u0 +iv 0 . lim f ( z ) =ω0
Theorem 3.1 Let f ( z )=u ( x , y )+iv ( x , y ) and z→ z 0
if and only
lim
if ( x , y ) →( x , y )
0 0
lim
u=u0 and ( x , y ) →( x , y ) v=v 0 .
0 0
lim
Example 3.4 f ( z )=sin z . show that the z→ z 0 f ( z )=sin z 0
Example 3.5 f ( z )=2 x +iy Show that the lim z→ 2i f ( z )=4 i .
2
lim [ f ( z ) + F ( z ) ] = ω0 +W 0
z→ z0
;
lim f ( z ) F ( z )=ω 0 W 0
z→ z0
;
ω0
lim f ( z ) / F ( z )= W 0¿ 0
z→ z0 W0 .
This theorem immediately makes available the entire machinery and tools used for
real analysis to be applied to complex analysis. The rules for ending limits then can
be listed as follows:
lim c=c .
1. z→ z0
n n
lim z =z 0 .
2. z→ z0
lim P ( z )=P ( z 0 )
3. z→ z0
if P is a polynomial in z .
lim exp=exp ( z 0 )
4. z→ z0
Continuity
Definition 3.2A function f , defined in some nbd of z 0 is continuous at z 0 if
lim f ( z ) =f ( z 0 )
z → z0
.
This definition clearly assumes that the function is defined at z 0 and the limit on
the LHS exists. The function f is continuous in a region if it is continuous at all
points in that region. If functions f and g
Are continuous at z 0 then f +g , fg and f / g ( g ( z 0 ) ≠0 ) are also continuous at z 0 . If a
function f ( z )=u ( x , y )+iv ( x , y ) is continuous at z 0 then the component functions u
Example 3.7 f ( z )=|z|2 . Show that this function is differentiable only at z=0 . In
2
real analysis |x|is not differentiable but |x| is.
If a function is differentiable at z , then it is continuous at z.
The converse in not true. See example 3.7.
Even if component functions of a complex function have all the partial
derivatives , does not imply that the complex function will be differentiable. See
example 3.7.
Some rules for obtaining the derivatives of function are listed here. Let f and g be
differentiable at z .
d
( f ±g ) ( z )=f ' ( z )±g ( z )
1. dz .
d
( fg ) ( z )=f ' ( z ) g ( z ) + f ( z ) g' ( z )
2. dz
d 2
( f /g )( z )=( f ' ( z ) g ( z )−f ( z ) g' ( z ) ) / [ g ( z ) ] if g ( z )≠0
3. dz .
d
( f ∘ g ) ( z )=f ' ( z ) ( g ( z ) ) g' ( z )
4. dz .
d
c=0 .
5. dz
d n
z =nz n−1
6. dz .
Cauchy-Riemann Equations
Theorem 3.3 if f ( z 0 ) exists then all the first order partial derivatives of component
'
z̄ 2
f ( z )=
Example 3.12 z . Show that the CR conditions are satisfied at z=0 but the
function f is not differentiable at 0.
iθ
If we write z=re then we can write Cauchy-Riemann Conditions in polar
coordinates:
1
ur = v θ
r
uθ =−rv r
Analytic Functions
Definition 3.4 A function is analytic in an open set if it has a derivative at each
point in that set.
Definition 3.5 A function is analytic at a point z 0 if it is analytic in some nbd of z 0 .
Definition 3.6 A function is an entire function if it is analytic at all points of C .
Example 3.13 f ( z )=1 /z is analytic at all nonzero points.
Example 3.14 f ( z )=|z|2 is not analytic anywhere.
A function is not analytic at a point z 0 , but is analytic at some point in each nbd of
z 0 then z 0 is called the singular point of the function f .
Harmonic Function
Definition 3.7 Areal valued function H ( x , y ) is said to be harmonic in a domain of
xy plane if it has continuous partial derivatives of the first and second order and
satisfied Laplace equation:
H xx ( x , y )+ H y ( x , y )=0 .
Theorem 3.5 If a function f ( z )=u ( x , y )+iv ( x , y ) is analytic in a domain D then the
functions u and v are harmonic in D .
Definition 3.8 if two given functions u ( x , y ) and v ( x , y ) are harmonic in domain D
and their first order partial derivatives satisfy Cauchy-Riemann Conditions
u x=v y
u y=−v x
Then v is said to be harmonic conjugate of u .
2 2
Example 3.15 Let u ( x , y )= x − y and v ( x , y )=2 xy . Show that v is hc of u and not
vice versa.
3
Example 3.16 u ( x , y )= y −3 xy . find harmonic conjugate of u .
Lecture 11
Integral of a function of a complex variable. The
Cauchy integral theorem. The Cauchy Formula.
Example : Show that a half circle in upper half plane with radius R and centered
at origin can be parameterized in various ways as given below:
1. x (t )=R cos t , y ( t )=R sin t , where t : 0→π .
2. x (t )=t , y ( t ) =√ R −t , where t :−R →R .
2 2
A closed simple contour has only first and last point same and does not cross itself.
Contour Integral
If C is a contour in complex plane defined by z (t )=x ( t )+iy ( t ) and a function
f ( z )=u ( x , y )+iv ( x , y ) is defined on it . The integral of f ( z ) along the contour C is
denoted and defined as follows:
b b b
∫ f ( z ) dz=∫ f ( z ) z ( t ) dt=∫ ( u x −v y ) dt+i∫ ( u y' +v x ' ) dt=
' ' '
C a a a
∫ f ( z ) dz=∫ f ( z ) dz+∫ f ( z ) dz
3. C 1−C 2 C1 C2
|∫ f ( z ) dz≤∫|f ( z (t ) ) z ( t )||dt
'
4. C C
|∫ f ( z ) dz|≤ML,
5. If |f ( z )|≤M for all z ∈ C then C where L is length of the
countour C .
2
Example 4.4 f ( z )=z , Find integral of f from (0,0) to (2,1) along a straight
line and also along st line from (0,0) to (2,0) and from (2,0) to (2,1).
Example 4.5 f ( z )=1 /z . Find the integral from (2,0) to (-2,0) along a
semicircular path in upper plane given by |z|=2 .
π
|∫ f ( z ) dz|≤ for f ( z )=1/ ( z 2 −1 ) and C :|z|=2
Example 4.6 Show that C
3 from 2 to
2i.
Cauchy-Goursat Theorem
Theorem 4.1 (Jordan Curve Theorem)
Every simple and closed contour in complex plane splits the entire plane into two
domains one of which is bounded. The bounded domain is called the interior of the
countour and the other one is called the exterior of the contour.
Define a sense direction for a contour.
Theorem 4.2 Let C be a simple closed contour with positive orientation and let D
be the interior of C ,
If P and Q are continuous and have continuous partial derivatives P x ,P y ,Q y at all
points on C and D , then
∫ ( p ( x , y ) dx+Q ( x , y ) dy ) =∫∫ [ Qx ( x , y )−P y ( x , y ) ] dxdy
C
Theorem 4.3 (Cauchy-Goursat Theorem) Let f be analytic in a simply connected
domain D . If C is any simple closed contour in D , then
∫ f ( z ) dz=0
C
2
Example 4.7 f ( z )=z , exp ( z ) , cos ( z ) etc are entire functions so integral about any
loop is zero.
Theorem 4.4 Let C 1 and C 2 be two simple closed positively oriented contours such
that C 2 lies entirely in the interior of C 1 . If f is an analytic function in a domain
D that contains C 1 and C 2 both and the region between them, then
∫ f ( z ) dz=∫ f ( z ) dz
C1 C2
f ( z ) =1 /z
∫ f ( z ) dz
Example 4.8 Find C if C is any contour containing origin.
Choose a circular contour inside C .
1
∫ z−z dz=2 πi
Example 4.9 C 0 if C contains z 0 .
2 zdz
∫ z 2+2
Example 4.10 Find C where C :|z|=2 . Extend the Cauchy Goursat theorem to
multiply connected domains.
Antiderivative
Theorem 4.5 (Fundamental Theorem of Integration) Let f be defined in a
simply connected domain D and is analytic in D . If z 0 and z are points in D and
C is any contour in D joining z 0 and z then the function
F ( z )=∫ f ( z ) dz
C
'
is analytic in D and F ( z )=f ( z ) .
Definition 4.4 If f is analytic in D and z 1 and z 2 are two points in D then the
definite integral is defined as
z2
∫ f ( z ) dz=F ( z 2) −F ( z1 )
z1
where F is an antiderivative of f .
2−i
2 11
∫ z 2 dz=z 3 /3|2+i
0 = +i
3 3
Example 4.11 0
i
∫ cos zdz=sin z|i1=sin i−sin 1
Example 4.12 1
z2
1
f ( z )=
. ∫ f ( z ) dz C :|z−i|=2
2
Example 4.14 z + 4 Find C if
z
f ( z )= z ∫ f ( z ) dz
Example 4.15 2 +1 Find C if C is square with vertices on (±2 ,±2 ) .
Theorem 4.7 If f is analytic at a point, then all its derivatives exist and are
analytic at that point.
n! f ( z ) dz
( z 0 ) = 2 πi ∫
( n)
f
C ( z −z 0 )n+1 .
Lecture 12
PROBABILITY
Probability is a key part of mathematics. It plays an important role in the money markets,
in weather reporting, and many other areas of human life.
In everyday speech, the word “probable” is used in vague senses. For example, we say “it
will probably rain tomorrow”, “the patient will probably recover (or die)”, “our football team
will probably win”, “I will probably get my salary tomorrow” and so on.
The concept of probability is important and useful because it gives us more than just a
reflection of past experience.
The most commonly used term in Probability theory is “event”. In everyday life we often
say that a certain event is highly probable whereas some other event is improbable. The “event”
can be almost any observable phenomenon.
All events that we may observe or that may occur in our daily life one can subdivide into
three following types: certain, impossible and random events.
A certain event is an event which occurs when a certain set of conditions is realized.
An impossible event is one which cannot occur when a certain set of conditions is
realized.
A random (stochastic) event is one that may occur or may not occur when a certain set of
conditions is realized.
Every realization of the indicated set of conditions is called a trial or an experiment.
Let us explain first what we mean when we say “a trial” or “an event”. For instance, when
tossing a coin we can consider the fact that it shows heads to be an event. In this case tossing the
coin serves as a trial. (an experiment).
Let us make the following conclusions and give definitions.
A trial or an experiment is a process that can be repeated and gives rise to a number of
outcomes.
An event is a collection (or set) of one or more outcomes.
A sample space is the set of all possible outcomes of an experiment. Events are denoted by
capital letters.
The events are called equally likely (or equally probable) if there is no ground to believe
that some one of them is more probable than the others.
Empirical Probability
Theoretical Probability
m
P( A )=
In the summary form: n
where m is the number of outcomes favourable to A, and n is the number of all the possible
outcomes.
Properties of probabilities
Let us now list some properties of probabilities:
1. The probability of a random event is between 0 and 1. The event A is random
⇒ 0< P( A )<1 .
2. The probability of a certain (sure) event is 1.
The event A is certain ⇒ P( A )=1 .
3. The probability of an impossible event is zero.
The event A is impossible ⇒ P( A )=0 .
4. The probability that event A will not occur (is denoted by Ā ) is 1− P( A ) .
5. The more likely that an event will occur, the higher its probability (the closer to 1 it is);
the less likely that an event will occur, the lower its probability (the closer to 0 it is).
0 1
24 6
20 8
17 11
15 14
Find the following probabilities:
(a) P(8)
(b) P(even)
(c) P(odd)
(d) P(prime number)
(e) P(number <10)
(f) P(number >16)
Solution.
Since each region is the same size, it is equally likely that the spinner will stop in any of
the eight regions.
(a) There is 1 chance in 8 that it will stop in the region marked 8. So we say that the probability
1
P(8 )=
of spinning a 8 is one eighth and write 8.
(b) There are 5 chances in 8 that the spinner will land in a region with an even number in it, so
5
P(even)=
8.
(c) There are 3 chances in 8 that the spinner will land in a region with an odd number in it, so
3
P(odd )=
8.
(d) There are 2 chances in 8 that the spinner will land in a region with a prime number in it, so
2 1
= =
P (prime number) 8 4 .
(c) There are 2 chances in 8 that the spinner will stop in a region with a number less than 10 in it,
so
2 1
= =
P(number <10) 8 4.
(f) There are 3 chances in 8 that the spinner will stop in a region with a number greater than 16 in
it, so
3
= .
P(number >16) 8
Example 3.
A box contains 10 balls labeled with the numbers from 1 to 10. We draw one ball. What is
the probability that the number of the drawn ball:
(a) does not exceed 10
(b) prime number
(c) a factor of 10
Solution.
10
= =1
(a) P(¿ 10 ) 10 , the event is certain.
4 2
= =
(b) P(prime number) = P(2,3,5,7) 10 5 .
4 2
= =
(c) P(factor of 10) = P(1,2,5,10) 10 5 .
Example 4.
There are 12 balls in the urn, of which 3 are green, 4 red and 5 blue. What is the probability
of drawing.
(a) a green ball,
(b) a red ball,
(c) a blue ball.
Solution.
3 1
= =
(a) P(green) 12 4
4 1
= =
(b) P(red) 12 3
5
= .
(c) P(blue) 12
Counting Techniques
The probability formula for equally likely outcomes uses the size of the sample space. In
some simple experiments, this may be easily determined, but some experiments require more
sophisticated counting techniques.
The basis of most counting techniques is the Counting Principle.
Counting Principle. If two jobs need to be completed and there are m ways to do the first
job and n ways to do the second job, then there are m⋅n ways to do one job followed by the
other.
This principle can be extended to any number of jobs or experiments.
Counting Principle (General Case)
Consider a set of k experiments. Suppose the first experiment has n1 outcomes, the second
has n2 outcomes, and so on. Then the total number of outcomes is 1 2
n ⋅n ⋅.. .⋅n k for all k
experiments.
Example. How many integers between 100 and 1,000 consist of three different odd digits?
Solution. We are looking for 3-digit numbers, such as 157,359, and 753, all of whose
digits are odd and all of which are different. The best way to answer this question is to use the
counting principle. Think of writing a 3-digit number as three jobs that need to be done. The first
job is to select one of the five odd digits (1,3,5,7,9) and use it as the digit in the hundreds place.
This can be done in 5 ways. The second job is to select one of the four remaining odd digits for
the ten place. This can be done in 4 ways. Finally, the third job is to select one of the three odd
digits not yet used to be the digit in the ones place. This can be done in 3 ways. So, by using the
counting principle, the total number of ways to write a 3-digit number with three different odd
digits is 5⋅4⋅3=60 .
Combinatorics
Many problems in probability theory require that we count the number of ways that
particular event can occur. For this, we need to know some concepts of combinatorics (factorial,
permutations, combinations).
Combinatorics is a branch of mathematics concerning the study of finite or countable
discrete structures. The combinatorics studies the number of selections (or variants, or ways)
subordinated to the certain conditions. Combinatorial problems arise in many areas of
mathematics, notably in probability theory. Combinatorics also has many applications in
computer science and statistical physics.
Factorial.
Definition. Let n be a positive integer. We define n! (read “n factorial”) by
n !=n(n−1)(n−2)⋅. . .⋅3⋅2⋅1 .
By convention, we set 0! =1 .
The factorial n ! gives the number of ways in which n distinct objects can be permuted.
Permutations.
Definition. The different arrangements or selections of a given number of distinct objects
are called permutations.
The number of all the possible permutations of n distinct objects is
Pn =n !
Arrangements (Permutations)
Definition. The different arrangements or selection of a given number of distinct objects
(things) taken k at a time are called arrangements (or permutations).
The number of all the possible arrangements of n things, taken k at a time, is given by:
n!
Ank=P n, k =n( n−1 )( n−2 ). ..(n−k +1 )=
(n−k )!
n
In particular An =Pn , n =n ! , i.e. An =Pn .
n
Combinations.
Definitions. Each of the different groups or selections of a given number of objects, and
which can be formed by taking some or all of a number of objects is called a combination.
The number of all combinations of n things, taken k at a time is given by:
C kn =¿ ( n ¿ ) ¿ ¿ ¿
¿
Note. It is useful to remember:
In permutations the order matters
In combinations the order does not matter
A kn n!
C kn = =
Properties: 1. Pk k !(n−k )!
k k
2. An =C n⋅Pk
0 n
3. C n =Cn =1
k n−k
4. C n =Cn
k k−1 k
5. C n =Cn−1 +C n−1
Example 1.
There are 10 balls in the box, of which 6 are green and 4 blue. We draw two balls. What is
the probability that they are
(a) 2 green balls
(b) 2 blue balls
(c) 1 green, 1 blue
Solution.
C 26 15 1
= =
C 2 45 3
(a) P(2 green) = 10
C 24 6 2
= =
C210 45 15
(b) P(2 blue) =
C16⋅C 14 6⋅4 8
= =
C 210 45 15
(c) P(1 green, 1 blue) =
Example 2.
A bag contains 4 red, 3 blue and 8 green beads.
Five beads are selected at random from the bag without replacement. Find the probability
that they are
(a) 5 green beads,
(b) 2 red and 3 green,
(c) 4 red and 1 blue
(d) 1 red, 2 blue and 2 green
Solution.
C 58 8
=
5 429
(a) P(5 green) = C15
C24⋅C 38 16
=
C 5 143
(b) P(2 red and 3 green) = 15
C 44⋅C 13 1
=
C 515 1001
(c) P(4 red and 1 blue) =
C14⋅C 23⋅C28 16
=
C515 143
(d) P(1 red, 2 blue and 2 green) =
Example 3.
Six students from a group of 30 have passed the test with excellent marks, ten students
with good marks, and nine students have got satisfactory marks. What is the probability that
three students chosen randomly from this group have got failing marks for the test?
Solution. The number of students that have got failing marks is 30−(6+10+ 9)=5 . Then
3
the number of cases favourable to the event is determined by the equation m=C 5 , i.e. m=10.
The number of all events here is
30 !
n=C330 = =28⋅29⋅5
3 ! 27 !
3
m C 5 10 1
P= = 3 = =
n C 28⋅29⋅5 406
Therefore 30 .
Example 4.
There are 1000 tickets in the lottery, of which 500 are winning tickets and 500 are non-
winning. We buy two tickets. What is the probability of both tickets being winning?
Solution. The number of all possible outcomes here is
1000 !
n=C21000 = =999⋅500
2 ! 998 !
The number of outcomes favourable to the event here is
500 !
m=C 2500= =499⋅250
2 ! 498 ! .
Thus, we have
2
m C 500 499 ⋅250 499
P= = 2 = =
n C 999⋅500 1998
1000 .
Now we shall generalize the sample space method to efficiently calculate probabilities of
combined events.
Definition. The union of event A and event B is written A∪B and represents the event
that either A or event B occurs or both occur.
Definition. The intersection of event A and B is written A∩B and represents the event
that both A and B occur.
We can use Venn diagrams to illustrate the concepts of union and intersection. Events can
be represented graphically by a Venn diagram. Venn diagrams are named after English
mathematician John Venn (1834-1923). We use set notation to identify different areas on a Venn
diagram.
A rectangle represents the sample space and it contains
closed curves that represent A B S events. It includes all the
possible outcomes of an experiment so the probability of
the sample space is 1 or P(S)=1.
A∪B A∩B
A B S A B S
Ā B̄
A B S A B S
We are also interested in events not occurring. For example, Ā( or A ¿ ) is the complement
of A and is the event “A does not occur”, P( Ā )=1−P ( A ) .
Venn diagrams can help us solve probability problems.
Theorem (Probability Addition Rule)
P( A∪B )=P( A )+P( B)−P ( A∩B ) .
It is useful to remember that we can rearrange this if we want to find the probability of the
intersection.
P( A∩B )=P( A )+P( B)−P ( A∪B )
Definition. Two events are said to be exclusive (mutually exclusive) if they cannot occur
at the same time.
Obviously, if there is no possibility of A and B occurring at the same time, that is A and B
are mutually exclusive, then P( A∩B )=0 and the above formula reduces to
P( A∪B )=P( A )+P( B) .
Definition. The event consisting in the non-occurrence of the event A is said to be an event
opposite, or contrary, to the event A and is denoted Ā( or A ¿ ) .
The union of the events A and Ā yields a certain event, and hence, P( A∪ Ā )=1 , and
since the events A and Ā are mutually exclusive, we have
P( A )+ P( Ā )=1 or P( Ā )=1−P ( A ) .
Example 1.
A and B are two events and P(A) =0.6, P(B)=0.5 and P( A∪B )=0 . 8 . Find the following:
(a) P( A∩B )
P( Ā ) A B S
(b)
0.3 0.3 0.2
(c) P( Ā∪B )
(d) P( Ā∩B ) 0.2
Solution.
(a) P( A∩B )=P( A )+P( B)−P ( A∪B )
P( A∩B )=0 . 6+0 . 5−0. 8
P( A∩B )=0 . 3
(b) P( Ā )=1−P ( A )
P( Ā )=1−0.6=0.4
(c) P( Ā∪B )=(0.2+0.2)+0.3=0.7
(d) P( Ā∩B )=P( Ā )+P( B)−P ( Ā∪B )
P( Ā∩B )=0.4+0.5−0.7=0.2
Example 2.
A and B are two events and P(A) =0.4, P(B)=0.3 and P( A∩B )=0 . 1 . Find the following:
(a) P( A∪B )
A B S
(b) P( B̄) 0.1 0.2
0.3
(c) P( A∩B̄ )
(d) P( A∪B̄ )
0.4
Solution.
(a) P( A∪B )=P( A )+P( B)−P ( A∩B )
P( A∪B )=0 . 4 +0 . 3−0 . 1=0 . 6
(b) P( B̄)=1−P( B)
P( B̄)=1−0.3=0.7
(c) P( A∩B̄ )=0 . 3
(d) P( A∪B̄ )=P( A )+P( B̄)−P ( A∩ B̄ )
P( A∪B̄ )=0.4+0.7−0.3=0.8
Example 3.
C and D are two events and P(C) =0.4, P(D)=0.5 and P(C∪D )=0 . 6 . Find the following:
(a) P(C∩D )
C D S
(b) P( C̄ )
0.1 0.3 0.2
(c) P(C∪ D̄ )
0.4
(d) P( C̄∪D )
Solution.
(a) P(C∩D )=P(C )+P( D)−P (C∪D)
P(C∩D )=0 . 4+0. 5−0 .6=0 . 3
(b) P( C̄ )=1−P(C )
P( C̄ )=1−0.4=0.6
(c) P(C∪ D̄ )=0 . 8
(d) P( C̄∪D )=P( C̄ )+P( D)−P ( C̄∩D)
P( C̄∪D )=0.6+0.5−0.2=0.9
Example 4.
There are 20 students in a certain tutor group at Baku-Oxford School. There are 12
students in the tutor group studying French, 11 students studying Spanish and five students
studying both Spanish and French.
Find the probability that a randomly chosen student in the tutor group
(a) studies Spanish
(b) studies Spanish and French
(c) studies Spanish but not French
(d) does not study Spanish or French
Solution.
Draw a Venn diagram to represent this information. There are 20 students in the sample
space. A student is chosen at random so all outcomes are equally likely.
11
P( S )= =0 .55
(a) 20
5 F S
P( S∩F )= =0 .25 5 6
(b) 20 7
6
P( S∩ F̄ )= =0 .3 2
(c) 20
2
P( S̄∩ F̄ )= =0 .1
(d) 20
Example 5.
A card is chosen at random from a pack of 52 playing cards. H is the event ‘the card
chosen is a Heart’ and Q is the event ‘the card chosen is a Queen! Find the following:
(a) P(Q ) (d) P( H∪Q )
(b) P( H ) (e) P( H̄ )
(c) P( H∩Q ) (f) P( Q̄∩H )
Solution.
Draw a Venn diagram to represent this information. There are 52 cards in the sample
space. A card is chosen at random so all outcomes are equally likely.
4 1
P(Q )= =
(a) 52 13
H Q
13 1
P( H )= = 12 1 3
(b) 52 4
36
1
P( H∩Q )=
(c) 52
16 4
P( H∪Q )= =
(d) 52 13
1 3
P( H̄ )=1− =
(e) 4 4
12 3
P( Q̄∩H )= =
(f) 52 13
Let us now consider two events A and B. The probability of an event B may be different if
we know that a dependent event A has already occurred.
Consider the Venn diagram. A B The event A has happened, so
we are considering probabi- lities inside this closed curve.
a–i i b–i
We are looking for the probability of B given A has
happened so we divide i by a .
S
Thus
P( B∩ A )
P( B given A )=
P(A)
P( B∩ A ) 0 .18
P( B |A )= = =0. 9
(a) P(A) 0 .2
(b) P( Ā∩B̄ )=0 . 38 A B
0.38
Example 7.
A and B are two events such that P( A )=0 . 6 , P( B)=0. 5 and P( A∩B )=0 . 4 . Find the
following
(a) P( A∪B )
(b) P( B|A ) A B
(c) P( A|B) 0.2 0.4 0.1
Solution. We draw the Venn diagram for better understanding although we can solve the
problem without it.
(a) P( A∪B )=P( A )+P( B)−P ( A∩B )
P( A∪B )=0 . 6+0 . 5−0. 4=0. 7
P( B∩ A ) 0 . 4 2
P( B |A )= = = =0 .(6 )
(b) P(A) 0.6 3
P( A∩B) 0 . 4
P( A |B)= = =0 . 8
(c) P( B) 0 . 5
P( A∩ B̄) 0 .2
P( A | B̄)= = =0. 4
(d) P( B̄) 0 .5
Example 8.
A and B are mutually exclusive and P( A )=0 . 3 and P( B )=0 . 4 . Find
(a) P( A∪B )
(b) P( A ∩ B̄)
(c) P( Ā ∩ B̄)
Solution. It is clear that when events A and B are mutually exclusive they have no
outcomes in common. Therefore remembering it we
draw the corresponding A 0.4 Venn diagram
B
0.3
(a)Since P( A ∩B)=0 we have
0.3
(b) P( A ∩ B̄) 28
(c) P( Ā ∩ B̄)
Solution. A and B are independent, so
1 1 1
P( A∩B )=P( A )⋅P( B)= ⋅ =
(a) 4 7 28 .
If A and B are independent, then so are A and B̄ , and hence
1 6 6 3
P( A∩B̄ )=P( A )⋅P( B̄)= ⋅ = =
(b) 4 7 28 14 .
If A and B are independent, then so are Ā and B̄ , and therefore
3 6 18 9
P( Ā∩B̄ )=P( Ā )⋅P( B̄)= ⋅ = =
(c) 4 7 28 14 .
Example 11.
A box contains 6 white and 8 red balls. We draw two balls (without replacing the drawn
ball into the box). Find the probability of both balls being white.
Solution. Suppose the event A is the drawing of a white ball in the first trial and the event
B is the drawing of a white ball in the second trial. By the probability multiplication rule for the
case of dependent events we have
6 5 15
P( A∩B )=P( A )⋅P( B|A )= ⋅ =
14 13 91
Example 12.
The probability of winning a certain game is 0.2. Suppose the game is played on two
different occasions. What is the probability of
(a) winning both times?
(b) losing both times?
(c) winning once and losing once?
Solution.
(a) The results of the two different trials are independent, so the probability of winning
both times can be found by multiplying the probability of winning each time.
P(winning both games)=0 . 2⋅0 .2=0 . 04
(b) Since losing is the compliment of winning, the probability of losing is 1−0. 2=0. 8 .
The probability of losing both times can be found by multiplying the probability of
losing each time
P(losing both games)=0 . 8⋅0 . 8=0. 64
(c) The complement of the event (winning once and losing once ) is the set of the two
events in parts (a) and (b).
The event in parts (a) and (b) are mutually exclusive, because it is impossible to win
both times and lose both times, so their probabilities may be added.
P(winning once and losing once)=
=1−(0 .04 +0 . 64 )=0 . 32 .
Lecture 13
Bernoulli’s Formula. The Most Probable Number of Occurrences of an Event
If n independent trials are performed in each of which the probability of occurrence of the
event A is the same and is equal to p, then the probability of the event A occurring m times in
these n trials is expressed by Bernoulli’s formula:
Pm, n =C mn pm q n−m
where q=1− p . Thus we have
Po , n =C on po qn−o =q n ,
P1 , n=C 1n p1 q n−1 =npq n−1 ,
n(n−1) 2 n−2
P2 , n=C 2n p2 qn−2= p q ,
1⋅2
. .. .. . .. .. . .. .. .. . .. .. . .. .. . .. .. . .. .. . .. .. . .. .. .
Pn , n =p n
Definition. The number
m0 is called the most probable number of occurrences of the
event A in n trials if at
m=m0 the value of Pm,n is not less than all the other values of Pm,n , i.e.
Pm , n ≥Pm ,n mi≠m0 .
0 i at
m
If p≠0 and p≠1 , the number 0 can be determined from the double inequality
np−q≤m0 ≤np+ p .
The difference between the end point values in this double inequality is 1:
(np+ p )−(np−q )= p+q=1 .
If np+ p is not an integer, then the double inequality specifies only one most probable
value
m 0 . Now if np+ p is an integer, then there are two most probable values,
¿
m0 =np−q and m //0 =np+ p
Theorem. The probability that the event will occur (in n independent trials):
a) less than k times (¿ k ) is found by the formula
Po ,n +P 1,n+...+P k−1 ,n
b) more than k times (¿ k ) is found by the formula
Pk+1 ,n +Pk+2 ,n +...+Pn ,n
c) no less than k times (¿ k ) is found by the formula
Pk ,n +P k+1 ,n +P k+2 ,n +...+P n,n
d) no more than k times (¿ k )is found by the formula
Po ,n +P 1,n+...+P k−1 ,n+Pk ,n .
Let us consider some examples.
Example 1. The probability of occurrence of the event A is 0.4. What is the probability of
the event A occurring not more than three times in 10 trials?
Solution. Here p=0.4, q=0.6. We have:
probability of occurrence of event A 0 times is
P0 , 10=q 10 ;
probability of occurrence of event A 1 time is
P1 ,10 =10 pq 9 ;
probability of occurrence of event A 2 times is
P2 ,10 =45 p2 q 8 ;
probability of occurrence of event A 3 times is
P3 , 10=120 p 3 q7 .
The probability of the event A occurring not more than three times is equal to
P=P 0,10 +P1,10 +P2 ,10 +P3 ,10
that is
P=q10 +10 pq 9 +45 p2 q 8 +120 p3 q7
or
P=q7 (q3 +10 pq 2 +45 p2 q+120 p3 )
Putting p=0.4, q=0.6, we obtain
P=0. 67 (0 . 216+1 . 44+4 .32+7 . 68 )≈0 . 38 .
Example 2. Determine the probability of the fact that in a family having five children there
are three girls and two boys. The probabilities of a girl and a boy being born are assumed to be
equal.
Solution. The probability of a girl being born is p=0.5, then q=1− p=0 .5 (the probability
of a boy being born).
Therefore, the sought-for probability is
5! 5
P3 ,5=C 35 p3 q2 = (0.5)3 (0.5)2=
3! 2! 16 .
Example 3. Under the conditions of the previous problem find the probability of the fact
that there will not be more than three girls among the children born to the family.
Solution. We have
P0 , 5= ()
15 1
=
2 32 ;
P1 ,5 =5⋅ ()
15 5
=
2 32 ;
P2 ,5 =10⋅ =()
15 5 15 5
P =10⋅ =
2 16 ; 3 , 5 2 16 . ()
13
P=P 0, 5 +P 1, 5 +P2 , 5 +P3 , 5 =
16 .
Example 4. Two chess-players of the same level play chess. What is more probable to win:
two games out of four or three games out of six?
Solution. As players of the same level, then the probability of winning is equal to 0.5; then
the probability of loss is 0.5. Let us find the probability of the fact that two games out of four
will be won. Applying the Bernoulli’s formula we get
6
P2 , 4 =C 24 p2 q2 =
16 .
Now we find the probability of the event that three games out of six will be won. We have
5
P3 , 6 =C36 p 3 q3 =
16 .
P >P
Since 2 , 4 3, 6 , then of four games to win two is more likely than three games out of six.
Example 5. We toss a coin five times. Find the probability that the Heads comes up
(a) less than two times.
(b) no less than two times.
Solution.
1 5 3
P0 , 5 +P1 ,5 = + =
(a) 32 32 16
13
P2 ,5 +P3 ,5 +P4 , 5 +P 5, 5 =1−(P 0 ,5 +P1, 5 )=
(b) 16
Example 6. We toss a coin eight times. Find the probability that the Heads comes up no
more than three times.
Solution. We have
P=P 0, 8 +P 1, 8 +P 2, 8 +P 3, 8 =
¿
2()
1 8 0 1 2 3
(C 8+C 8+C8 +C 8)=
=
2()
18
(1+8+28+56 )=
93
256 .
Example 7. There are 10 white and 40 red balls in the box. We successively draw 14 balls,
register the colour of each drawn ball and replace it into the box. Determine the most probable
number of occurrences of a white ball.
10 1 4
n=14 , p= = , q=1− p=
Solution. Here 50 5 5.
Using the double inequality
np−q≤m0 ≤np+ p
at the values of n, p and q indicated, we obtain
14 4 14 1
− ≤m0 ≤ +
5 5 5 5,
i.e. 0 2≤m ≤3
.
Thus, the problem admits of two solutions:
¿
m 0 =2 , m //0 =3
Example 8. The probability of a marksman hitting the target is 0.8. He fires 20 shots.
Determine the most probable number of times he hits the target.
Solution. Here n=20 , p=0 . 8 , q=0 .2 . It follows that
20⋅0 ,8−0 , 2≤m0 ≤20⋅0 , 8+0 , 8
i.e
15 . 8≤m0 ≤16 . 8 .
Since m is an integer, we have 0
m =16
.
Example 9. As a result of long observations it was established that the probability of rain
3
falling on October 14 in Baku is 8 . Determine the most probable number of rainy days on
October 14 in Baku for 10 years.
3 5
n=10 , p= , q=
Solution. Here 8 8.
Thus we have
3 5 3 3
10⋅ − ≤m0 ≤10⋅ +
8 8 8 8
1 1
3 ≤m0 ≤4 ,
or 8 8 i.e. m0 =4 .
Example 10. A balloon manufacturer claims that 95% of his balloons will not burst when
blown up. If you have 20 of these balloons to blow up for a birthday party what is the probability
that none of them burst when blown up?
Solution. To solve this problem we use the Bernoulli’s formula. Here
m=n=20 , p=0 .95 , q=0 . 05 . We have
P20 , 20=C 20 20 0 20
20 p q =0. 95 =0 .3584 .. .≈0. 358 .
Lecture 14
Random Variable and the Law of Its Distribution
Mathematics has many applications in most areas of modern day life. If you construct a
mathematical model of a real world situation, you can learn about the real situation by analyzing
the mathematical model.
A mathematical model is a simplification of a real world situation. It can be used to make
predictions about a real world problem. By analyzing the model an improved understanding of
the situation may be obtained.
Statistics play an important role in this process. In statistics you collect observations or
measurements of some variable. Such observations are known as data.
Definition. A random variable is a variable quantity which randomly assumes a certain
numerical value resulting from the outcome of a trial. This value depends on chance and,
generally speaking, varies as the trials are repeated. Examples of random variables are the
number of girls in a family, the number of students attending a lecture, the duration of life of a
person and so on.
Definition. A variable is said to be discrete if it can take only specific values in a given
range.
Definition. A variable is said to be continuous if it can take any value in a given range.
Example 1. State whether or not each of the following variables is discrete or continuous.
(a) Number of people on a bus
(b) Time required to run 100m
(c) Number of boys in a family
(d) Weigh of watermelon
(e) The number of apples on the trees in an orchard
(f) Shoe size
(g) Length
(h) Time waiting in a queue.
It is clear that (a), (c), (e) and (f) are discrete variables since these variables can’t take any
value from a given range. On the other hand, (b), (d), (g) and (h) are continuous variables
because these variables can assume any value in a given interval.
We can define these concepts more precisely in terms of mathematics.
A random variable is a quantity whose value depends on chance. A random variable can
assume one or another value from a certain number set, but it is not known beforehand which
value it will assume. A random variable is designated by capital letters X, Y,…, and the values
that the random variable can take are represented by the corresponding lower-case letters x,y,…
Definition. If the values that the given random variable X can assume form a discrete
x , x ,..., x ,...,
(finite or infinite) number series 1 2 n then the random variable is called discrete.
Definition. If the values that the given random variable X can assume fill up the whole
finite or infinite interval (a,b) then the random variable is said to be continuous.
x
To each value n of a random variable X of a discrete type there corresponds a definite
probability pn. To specify a discrete random variable completely, you need to know its set of
possible values and the probability with which it takes each one; this information is best
displayed in a table.
X x1 x2 … xn
P( X =x ) p1 p2 … pn (
Definition. The relationship establishing in one way or another the connection between the
possible values of a random variable and their probabilities is called the Law of distribution of
a random variable.
The law of distribution of a random variable you can specify in a table, analytically (i.e. by
means of formulas) and graphically.
Definition. A table representing the correspondence between the possible values of
variable and their probabilities is called the Law of distribution of a discrete random variable.
The table (1) is called the Law of distribution or the probability distribution of a
discrete random variable. The total of all the probabilities in a probability distribution must
p + p +.. .+ p =1
always equal 1. In this case 1 2 n . This fact can be useful if we do not have
complete information about the probabilities.
Example 2. A fair six-sided die is rolled. Write down the probability distribution of the
outcome of rolling a fair die.
Solution.
X 1 2 3 4 5 6
P( X =x ) 1 1 1 1 1 1
6 6 6 6 6 6
Here the sum of all the probabilities is equal to 1.
6
∑ Pi = 16 + 16 + 16 + 16 + 16 + 16 =1
i=1
Example 3. Three fair coins are tossed. The number of heads X, is counted. Write down
the probability distribution of getting heads.
Solution. In this experiment we have eight possible outcomes HHH, HHT, HTH, HTT,
THH, THT, TTH, TTT.
Consequently,
Number of Heads (X) 0 1 2 3
P( X =x ) 1 3 3 1
8 8 8 8
Here the sum of all the probabilities is equal to 1.
4
∑ Pi = 18 + 38 + 38 + 18 =1
i=1
Example 4. A discrete random variable X has the following probability distribution:
X 1 2 3 4
P( X =x ) 1 k 1 3
5 4 10
Find the value of k.
Solution. For a discrete random variable the sum of all the probabilities must always equal
1.
1 1 3 1
+ k + + =1⇒ k=
5 4 10 4
Example 5. Given P( X =x )=kx for x=1,2,3,4. Find k.
Solution. According to the condition we can make a table to show the probability
distribution
X 1 2 3 4
P( X =x ) k 2k 3k 4k
1
k + 2k + 3 k +4 k=1 ⇒10 k =1⇒ k =
We have 10 .
Let us now consider a continuous random variable.
To each interval (a,b) belonging to the range of a random variable of the continuous type
there corresponds a definite probability P( a< X <b ) that the value assumed by the random
variable will fall in that interval.
It is convenient to represent the law of distribution of a continuous random variable with
the aid of the so-called probability density function f ( x ) . The probability P( a< X <b ) of the
fact that the value assumed by the random variable X will fall in the interval (a,b) is defined by
the equality
b
P(a< X <b )=∫ f ( x )dx
a .
Definition. The graph of the probability density function f ( x ) is called a distribution
curve.
In terms of geometry, the probability that the random variable will fall in the interval (a,b)
is equal to the area of the corresponding curvilinear trapezoid bounded by the distribution curve,
the Ox axis and the straight lines x=a, x=b.
f (x)
o a b x
The probability density function f ( x ) possesses the following properties:
1. f ( x )≥0 (i.e f ( x ) is non-negative)
+∞
2. −∞
∫ f ( x)dx=1 .
Moreover, if all the values of the random variable X belong to the interval (a,b), the last
property can be written as
b
∫ f ( x )dx=1
a .
Properties
Definition. The expectation (or mean value) of a discrete random variable is the sum of
the products of the values of the random variables by the probabilities of these values.
The expectation (or mean value) of a random variable X is written E(X). The expectation is
also known as mathematical expectation, average, mean value, or mean. If the random variable X
has the following probability distribution
x x1 x2 … xn
P( X =x ) p1 p2 … pn
x 1 2 3 4
P( X =x ) 0.4 0.3 0.2 0.1
Calculate E(X).
Solution. According to the definition we have to calculate the product xP( X=x ) for all
possible values of x and add up the results:
E( x )=1⋅0 . 4+2⋅0 .3+3⋅0 . 2+4⋅0 .1=2 .
Example 2. A random variable X has probability distribution P( X =x )=k ( x+1 ) for
x=2, 3, 4 , 5, 6. Find E(X).
Solution. According to condition we have the following probability distribution
x 2 3 4 5 6
P( X =x ) 3k 4k 5k 6k 7k
First we find k. Since for a discrete random variable the sum of all the probabilities must
add up to one we get
1
k= =0 , 04
25 k =1 or 25
It follows that the probability distribution will be:
x 2 3 4 5 6
P( X =x ) 0.12 0.16 0.2 0.2 0.28
4
Now we can easily find E(X):
E( x )=2⋅0 . 12+3⋅0 .16 +4⋅0 . 2+5⋅0 . 24+6⋅0 .28=4 . 4
Example 3. A random variable X has the following probability distribution
x 1 2 3 4 5
P( X =x ) 0.1 a b 0.2 0.1
{2 a+3b=1.5 ¿¿¿¿
Solving this system, we find a=0.3 and b=0.3.
Note the following:
1. The expectation of a random variable, intuitively, is the average value of repetitions of the
experiment it represents.
2. The expected value (expectation) of a random variable is greater than the least value and
less than the highest possible value that the variable can take.
3. The expectation of a random variable need not be one of the values that the variable can
take.
Properties of Expectation
Note some important properties of expectation.
1. The expectation of a constant variable is equal to the constant itself:
E(C )=C
Proof. We shall consider a constant C as a discrete random variable which can assume
only one possible value C with the probability p=1. Hence,
E(C )=C⋅1=C
So, if C is a constant, then E(C )=C .
2. A constant factor can be taken out of the sign of the expectation:
E(CX )=CE( X )
Proof. Suppose the probability distribution of the discrete random variable X is:
X x1 x2 … xn
P( X =x ) p1 p2 … pn
X x1 x2 X y1 y2
P( X =x ) p1 p2 P(Y = y ) q1 q2
XY x1 y1 x2 y1 x1 y2 x2 y2
P p1 q 1 p2 q 1 p1 q 2 p2 q 2
X 3 5 8 Y 4 7
P( X =x ) 0.1 0.3 0.6 P(Y = y ) 0.2 0.8
Find E( XY ) .
Solution. We find the expectations of each of the given variables
E( X )=3⋅0 .1+5⋅0 .3+8⋅0 . 6=6 . 6
E(Y )=4⋅0. 2+7⋅0. 8=6 . 4 .
Since the random variables X and Y are independent the expectation E( XY ) can be
calculated as
E( XY )=E ( X )⋅E (Y )=6 .6⋅6 . 4=42 . 24 .
Example 2. Two fair six-sided dice numbered 1 to 6 are thrown once. The random
variables X and Y represent the values on the upper face of each of the dice. Find E( X +Y ) .
Solution. We have
X 1 2 3 4 5 6
P( X =x ) 1 1 1 1 1 1
6 6 6 6 6 6
Y 1 2 3 4 5 6
P(Y = y ) 1 1 1 1 1 1
6 6 6 6 6 6
Suppose two discrete random variables X and Y are given by the following probability
distributions
X x1 x2 … xn
P( X =x ) p1 p2 … pn
Properties of Variance
Example. A fair six-sided die is thrown. What is Var ( X ) if X is outcome of one fair die?
Solution. We have
1 1 1 1 1 1 7
E( X )=1⋅ +2⋅ +3⋅ +4⋅ +5⋅ +6⋅ =
6 6 6 6 6 6 2
1 1 1 1 1 1 91
E( X 2 )=12⋅ +22⋅ +3 2⋅ + 4 2⋅ +52⋅ +6 2⋅ =
6 6 6 6 6 6 6
Var ( X )=E( X 2 )−E2 ( X ) = − ()
91 7 2 35
=
6 2 12 .
The concepts of the expectation and the variance can be extended to a continuous random
variable. Suppose f ( x ) is the probability density function of the random variable X.
Definition. The expectation of the continuous random variable X is specified by the
equality
+∞
E( X)= ∫ xf ( x)dx
−∞
(provided the value of the integral is finite).
Definition. The variance of the continuous random variable X is specified by the formula
+∞
Var( X)= ∫ [ x−E( X)] f ( x)dx .
2
−∞
Standard Deviation
√
Definition. The quantity σ = Var( X ) is called the standard deviation of the random
variable X.
The standard deviation is a measure of the spread of values (scores) within of data. The
standard deviation is a measure that is used to quantify how the values of a set of data are spread.
This measures the average distance of data items from the expectation.
Example. A discrete random variable X has the following law of distribution:
X -5 2 3 4
P 0.4 0.3 0.1 0.2
X2 25 4 9 16
P 0.4 0.3 0.1 0.2
2
We find E( X )
E( X 2 )=25⋅0 . 4 +4⋅0 . 3+9⋅0. 1+16⋅0. 2=15 ,3 .
Now we find the variance:
2 2 2
Var ( X )=E( X )− [ E ( X ) ] =15 .3−(−0 . 3) =15 . 21 .
Therefore
σ =√ Var( X )=√ 15 .21=3.9 .
Uniform Distribution
The distribution of random variables whose all values lie in an interval [a,b] and possess a
constant probability density on that interval is known as uniform distribution.
Let us find the probability density function f ( x ) of the uniform distribution. It is known
that all the possible values of the random variable lie in the interval [a,b] on which the function
f ( x ) has constant value C. In other words, the random variable X does not assumes the values
out of the interval [a,b].
Thus,
o a x
b
Let us now find the constant C. Since all the possible values of the random variable X
belong to the interval [a,b], the following equality holds true
b b
∫ f ( x ) dx=1 ∫ Cdx=1
a or a .
1
C=
Since C (b−a)=1 , we have b−a and, hence
{
1
f(x)=¿ { 0, if x<a¿ , if a≤x≤b ¿ ¿¿¿
b−a
Now we can easily find the mathematical expectation, the variance and the standard
deviation for a random variable with uniform distribution. We have
b b
x 1 b2 −a 2 a+b
E( X )=∫ xf (x )dx=∫ dx= ⋅ =
a a b−a b−a 2 2 .
Thus,
a+ b
E( X )= ,
2
which should be expected because of the symmetry of distribution.
To calculate the variance we use the formula
a+ b
2 2 E( X )=
Var ( X )=E( X )− [ E ( X ) ] taking into account the value 2 that has already
2
been found. Thus, it remains to calculate E( X ) . We have
b b
x2 b2 + ab+ a2
E( X )=∫ x f ( x )dx=∫
2 2
dx=
a a b−a 3 .
It follows that
2 2
b 2 +ab +a2 ( a+b ) ( b−a )
Var ( X )= − =
3 4 12
and, hence
b−a
σ =√ Var ( X )=
2 √3 .
Binomial Distribution
Thus, the first termpn of the expansion defines the probability of the event occurring n
n−1
times in n independent trials; the second term n p q defines the probability of the event
n
occurring n-1 times;…; the last term q defines the probability of the event occurring not once
(i.e. never).
The mathematical expectation and the variance for the binomial distribution of a random
variable X are determined by the following formulas:
E( X )=np and Var ( X )=npq .
Poisson’s Distribution
The distribution of the random variable X which can assume any integer non-negative
values (0,1,2,...,n) described by the formula
am −a
P( X =m)= e
m! ,
is known as Poisson’s distribution.
The mathematical expectation and the variance for the Poisson’s distribution of a random
variable X are determined by the formulas:
E( X )=a and Var ( X )=a .