Week 4

The document discusses Chebyshev's Theorem, which provides a way to estimate the probability of a random variable falling within a certain range around its mean based on its standard deviation. It also covers the Law of Large Numbers and methods for approximating the mean and variance of nonlinear functions using Taylor expansions. Additionally, it includes examples illustrating the application of these concepts in statistical analysis.

Uploaded by

qawsedrf010588

Week 4 : Math 230-02

1. Chebyshev’s Theorem
If a random variable X has a small standard deviation σX, its distribution is
concentrated around its mean value µ. The smaller the standard deviation is,
the more the pdf is concentrated around the mean.
It is often useful to estimate quantitatively the probability that X lies
in a symmetric interval around the mean.
Theorem 1.1 (Chebyshev’s Theorem). Let X be any random variable with finite mean µ
and standard deviation σ, and consider the symmetric interval [µ − kσ, µ + kσ] for
any number k > 0. Then we have
$$P(|X - \mu| < k\sigma) = P(\mu - k\sigma < X < \mu + k\sigma) \ge 1 - \frac{1}{k^2}.$$
The two probabilities agree because |X − µ| < kσ describes the same event as
µ − kσ < X < µ + kσ.
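Here is another form of Chebyshev’s theorem.
Corollary 1.2. Let X be any random variable with finite mean µ and standard
deviation σ. Then for any k > 0 we have
$$P(|X - \mu| \ge k\sigma) \le \frac{1}{k^2}.$$
Proof. Note that
$$1 = P(|X - \mu| < k\sigma) + P(|X - \mu| \ge k\sigma).$$
Therefore, by Theorem 1.1, we have
$$P(|X - \mu| \ge k\sigma) = 1 - P(|X - \mu| < k\sigma) \le 1 - \left(1 - \frac{1}{k^2}\right) = \frac{1}{k^2}.$$
This finishes the proof.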
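
The bound can be checked exactly on small discrete distributions. The following is a minimal sketch, not part of the original notes: the helper name `chebyshev_check` and the two test distributions (a fair die and a three-point distribution) are chosen here for illustration only. Exact fractions are used so that no rounding is involved.

```python
from fractions import Fraction

def chebyshev_check(pmf, k):
    """Return (P(|X - mu| >= k*sigma), 1/k^2) for a finite distribution.

    pmf maps each value of X to its probability; k is a positive Fraction.
    """
    mu = sum(x * p for x, p in pmf.items())
    var = sum((x - mu) ** 2 * p for x, p in pmf.items())
    # |x - mu| >= k*sigma is equivalent to (x - mu)^2 >= k^2 * sigma^2,
    # which avoids a square root and keeps the arithmetic exact.
    tail = sum(p for x, p in pmf.items() if (x - mu) ** 2 >= k ** 2 * var)
    return tail, 1 / k ** 2

# Hypothetical test distributions (not from the notes).
die = {Fraction(x): Fraction(1, 6) for x in range(1, 7)}
three_point = {
    Fraction(-1): Fraction(1, 8),
    Fraction(0): Fraction(3, 4),
    Fraction(1): Fraction(1, 8),
}

die_tail, die_bound = chebyshev_check(die, Fraction(6, 5))
tight_tail, tight_bound = chebyshev_check(three_point, Fraction(2))
print(die_tail, die_bound)      # 1/3 25/36
print(tight_tail, tight_bound)  # 1/4 1/4
```

For the die the bound is far from sharp, while for the three-point distribution the tail probability equals 1/k² exactly, so the inequality cannot be improved without further assumptions on X.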

Example 1.3. Let a random variable X have mean µ = 8 and standard deviation
σ = 3 (or equivalently variance σ² = 9). Give a lower estimate of the probability
P(−4 < X < 20).
Solution: We first express the interval [−4, 20] in the form
[µ − kσ, µ + kσ]
by setting
$$\begin{cases} \mu - k\sigma = -4 \\ \mu + k\sigma = 20. \end{cases}$$
Solving this system (actually one equation is enough to determine k), we get kσ = 12 and hence
$$k = \frac{12}{\sigma} = \frac{12}{3} = 4.$$
Therefore the probability is at least $1 - \frac{1}{4^2} = \frac{15}{16}$.
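
The same computation can be written as a small routine. This is only a sketch: the function name `chebyshev_interval_bound` is made up for this illustration, and it assumes the interval is symmetric about the mean, as in the example.

```python
from fractions import Fraction

def chebyshev_interval_bound(mu, sigma, lo, hi):
    """Return (k, 1 - 1/k^2) for the interval (lo, hi) = (mu - k*sigma, mu + k*sigma)."""
    mu, sigma, lo, hi = (Fraction(v) for v in (mu, sigma, lo, hi))
    if hi - mu != mu - lo:
        raise ValueError("interval must be symmetric about the mean")
    k = (hi - mu) / sigma
    return k, 1 - 1 / k ** 2

k, bound = chebyshev_interval_bound(8, 3, -4, 20)
print(k, bound)  # 4 15/16
```

Note that for k ≤ 1 the returned bound is zero or negative, which reflects the fact that Chebyshev’s theorem gives no information in that range.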

Example 1.4. In a class of 50 students, the midterm exam scores have a mean of
70 points and a standard deviation of 10 points. Your score is 90 points. Assuming
that the scores are symmetrically distributed about the mean, are you among the
top 10 students?

Solution: Let X be the random variable of the midterm exam scores. Then we
have
µ = 70, σ = 10.
To answer the question we would like to bound the probability P(X > 90). We
note 90 = 70 + 2 × 10 = µ + 2σ. Since the two tail events are disjoint, we have
$$P(|X - \mu| > 2\sigma) = P(X > \mu + 2\sigma) + P(X < \mu - 2\sigma).$$
By the symmetry about the mean µ = 70, we also know
$$P(X > \mu + 2\sigma) = P(X < \mu - 2\sigma)$$
and hence, by Corollary 1.2 with k = 2,
$$P(X > 90) = P(X > \mu + 2\sigma) = \frac{1}{2} P(|X - \mu| > 2\sigma) \le \frac{1}{2} \times \frac{1}{2^2} = \frac{1}{8}.$$
We compute 50 × 1/8 = 6.25, so at most 6 students scored above 90, and hence you
are among the top 10 students.
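
The counting step can be sketched as follows. The helper name `max_count_above` is hypothetical, and the one-sided bound 1/(2k²) relies on the symmetry assumption made in the example.

```python
from fractions import Fraction
from math import floor

def max_count_above(n_students, k):
    """Return the one-sided bound 1/(2k^2) and the largest whole number
    of students who can score above mu + k*sigma under that bound."""
    one_sided_bound = Fraction(1, 2) / Fraction(k) ** 2
    return one_sided_bound, floor(n_students * one_sided_bound)

bound, count = max_count_above(50, 2)
print(bound, count)  # 1/8 6
```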

Remark 1.5. Note that Chebyshev’s inequality is universal in that it applies to
any random variable with finite mean and variance.
Here is an important application of Chebyshev’s theorem.
Theorem 1.6 (The Law of Large Numbers). Let X1, · · · , Xn be independent and
identically distributed random variables with mean µ and variance σ². Consider the
average random variable
$$\bar{X} := \frac{X_1 + \cdots + X_n}{n}.$$
Then for any given ε > 0, we have
$$\lim_{n \to \infty} P(|\bar{X} - \mu| \ge \varepsilon) = 0.$$

This theorem explains what the ‘statistical average’ means. A typical
situation covered by the theorem is the following: imagine that you or your
colleagues repeat the same experiment in your laboratory many times. The result
of the i-th run is a random variable Xi, and the runs are independent of one
another.

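Proof. Denote by $\bar{\mu}$ and $\bar{\sigma}^2$ the mean and the variance of the average $\bar{X}$ respectively. We
compute
$$\bar{\mu} = E(\bar{X}) = E\left(\frac{X_1 + \cdots + X_n}{n}\right) = \frac{1}{n}\bigl(E(X_1) + \cdots + E(X_n)\bigr) = \frac{1}{n}\bigl(\underbrace{\mu + \cdots + \mu}_{n\ \text{times}}\bigr) = \frac{1}{n}(n\mu) = \mu.$$
On the other hand, since the $X_i$ are independent, we compute
$$\bar{\sigma}^2 = \operatorname{Var}(\bar{X}) = \operatorname{Var}\left(\frac{X_1 + \cdots + X_n}{n}\right) = \frac{1}{n^2}\bigl(\operatorname{Var}(X_1) + \cdots + \operatorname{Var}(X_n)\bigr) = \frac{1}{n^2}(n\sigma^2) = \frac{\sigma^2}{n},$$
from which we obtain $\sigma^2 = n\bar{\sigma}^2$. Now to apply Chebyshev’s theorem, we put ε = kσ,
so that k = ε/σ does not depend on n. We express ε in terms of $\bar{\sigma}$:
$$\varepsilon = k\sigma = k\sqrt{n\bar{\sigma}^2} = (k\sqrt{n})\,\bar{\sigma}.$$
By Chebyshev’s inequality (Corollary 1.2) applied to $\bar{X}$, we obtain
$$P(|\bar{X} - \mu| \ge \varepsilon) = P\bigl(|\bar{X} - \mu| \ge (k\sqrt{n})\,\bar{\sigma}\bigr) \le \frac{1}{(k\sqrt{n})^2} = \frac{1}{k^2 n}.$$
This proves
$$\lim_{n \to \infty} P(|\bar{X} - \mu| \ge \varepsilon) \le \lim_{n \to \infty} \frac{1}{k^2 n} = 0.$$
This finishes the proof.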
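
A small simulation illustrates how the bound 1/(k²n) = σ²/(nε²) from the proof compares with observed frequencies. It is a sketch under assumptions that are not in the notes: the Xi are taken to be Uniform(0, 1) (mean 1/2, variance 1/12), ε = 0.1, and 500 repetitions are used for each n; the function names are made up for this illustration.

```python
import random

def tail_frequency(n, eps, trials, rng):
    """Fraction of trials in which the average of n Uniform(0,1) draws
    differs from the mean 1/2 by at least eps."""
    misses = 0
    for _ in range(trials):
        avg = sum(rng.random() for _ in range(n)) / n
        if abs(avg - 0.5) >= eps:
            misses += 1
    return misses / trials

def chebyshev_bound(n, eps, var=1 / 12):
    """Bound sigma^2 / (n * eps^2); Uniform(0,1) has variance 1/12."""
    return var / (n * eps ** 2)

rng = random.Random(0)  # fixed seed so that the run is reproducible
eps = 0.1
results = {
    n: (tail_frequency(n, eps, 500, rng), chebyshev_bound(n, eps))
    for n in (10, 100, 1000)
}
for n, (freq, bound) in results.items():
    print(n, freq, round(bound, 4))
```

Both columns shrink as n grows, and the observed frequency stays below the Chebyshev bound, which is typically quite conservative.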

2. Approximating the Mean and Variance of a Nonlinear Function g(X, Y)

First consider the linear function g(x, y) = ax + by + c. Then we have
E(g(X, Y )) = aE(X) + bE(Y ) + c.
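Theorem 2.1. We have
$$\sigma^2_{aX+bY+c} = a^2\sigma_X^2 + b^2\sigma_Y^2 + 2ab\,\sigma_{XY},$$
where $\sigma_{XY}$ denotes the covariance of X and Y.
Corollary 2.2. (1) $\sigma^2_{X+c} = \sigma^2_X$.
(2) If X and Y are independent, then $\sigma^2_{aX+bY} = a^2\sigma_X^2 + b^2\sigma_Y^2$.
(3) If X1, · · · , Xn are independent, then $\sigma^2_{a_1X_1+\cdots+a_nX_n} = a_1^2\sigma^2_{X_1} + \cdots + a_n^2\sigma^2_{X_n}$.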

When g is not a linear function, even one as simple as g(X, Y) = X/Y, it is in
general not easy to compute E(X/Y) or $\sigma^2_{X/Y}$, i.e., there is no simple formula as
above. We need a way of finding a good approximate value. The most common and
easiest way is to use approximations coming from the Taylor expansion of g(x, y)
at the mean (µX, µY).
Theorem 2.3 (Taylor’s formula; one-variable case). Let X be a random
variable and let c = µX. Then for a sufficiently smooth function g we have
$$g(x) = g(c) + g_x(c)(x - c) + \frac{1}{2} g_{xx}(c)(x - c)^2 + \text{“higher order terms”}. \tag{2.1}$$
We commonly write the difference as
∆x = x − c.
Corollary 2.4. Let X be a random variable. Then
(1) We have the approximate mean
$$E(g(X)) \sim g(\mu_X) + \frac{1}{2} g_{xx}(\mu_X)\,\sigma_X^2.$$
(2) We have the approximate variance
$$\operatorname{Var}(g(X)) \sim (g_x(\mu_X))^2 \sigma_X^2.$$
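
The two formulas translate directly into a short routine. The sketch below is not from the notes: the function name `delta_method` is hypothetical, and the derivatives are estimated by central finite differences with a step `h`, which is an assumption of this illustration (exact derivatives can be used instead).

```python
def delta_method(g, mu, var, h=1e-3):
    """Approximate (E[g(X)], Var[g(X)]) from the mean and variance of X.

    The derivatives of g at mu are estimated by central finite differences
    with step h (an assumption of this sketch).
    """
    g1 = (g(mu + h) - g(mu - h)) / (2 * h)             # estimate of g_x(mu)
    g2 = (g(mu + h) - 2 * g(mu) + g(mu - h)) / h ** 2  # estimate of g_xx(mu)
    mean = g(mu) + 0.5 * g2 * var
    variance = g1 ** 2 * var
    return mean, variance

# Hypothetical test case: g(x) = x^2 with mean 3 and variance 4.
mean, variance = delta_method(lambda x: x * x, mu=3.0, var=4.0)
print(mean, variance)
```

For g(x) = x² the approximate mean agrees, up to rounding, with the exact value E(X²) = µ² + σ² = 13, because the Taylor expansion of a quadratic stops at second order. The variance 36 × 4 = 144 is only the first-order approximation.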

Proof. For (1), we put c = µX, apply E to the equation (2.1) and drop the
higher order terms. Then we get
$$\begin{aligned}
E(g(X)) &\sim E\left(g(\mu_X) + g_x(\mu_X)(X - \mu_X) + \frac{1}{2} g_{xx}(\mu_X)(X - \mu_X)^2\right) \\
&= E(g(\mu_X)) + E\bigl(g_x(\mu_X)(X - \mu_X)\bigr) + E\left(\frac{1}{2} g_{xx}(\mu_X)(X - \mu_X)^2\right) \\
&= g(\mu_X) + g_x(\mu_X)\,E(X - \mu_X) + \frac{1}{2} g_{xx}(\mu_X)\,E\bigl((X - \mu_X)^2\bigr).
\end{aligned}$$
We note E(X − µX) = 0 and E((X − µX)²) = σX², which finishes the proof.
For (2), we note
$$\operatorname{Var}(g(X)) = E\bigl((g(X) - \mu_{g(X)})^2\bigr).$$
Then we rewrite (2.1) into
$$g(x) - g(c) = g_x(c)(x - c) + \frac{1}{2} g_{xx}(c)(x - c)^2 + \text{“higher order terms”}.$$
Squaring and dropping the multiples of the “higher order terms”, we obtain
$$(g(x) - g(c))^2 \sim (g_x(c)(x - c))^2 + 2\,(g_x(c)(x - c))\left(\frac{1}{2} g_{xx}(c)(x - c)^2\right) + \left(\frac{1}{2} g_{xx}(c)(x - c)^2\right)^2.$$
Further dropping the terms of order higher than quadratic in (x − c), we get
$$(g(x) - g(c))^2 \sim (g_x(c)(x - c))^2 = (g_x(c))^2 (x - c)^2.$$
To the same order of approximation we may replace $\mu_{g(X)}$ by g(µX), and therefore
$$(g(X) - \mu_{g(X)})^2 \sim (g_x(\mu_X))^2 (X - \mu_X)^2.$$
Combining the above, we get
$$\sigma^2_{g(X)} = E\bigl((g(X) - \mu_{g(X)})^2\bigr) \sim E\bigl((g_x(\mu_X))^2 (X - \mu_X)^2\bigr) = (g_x(\mu_X))^2\,E\bigl((X - \mu_X)^2\bigr) = (g_x(\mu_X))^2 \sigma_X^2.$$
This finishes the proof.

Example 2.5.
Given the random variable X with mean µX = 1 and variance σX² = 1/2, estimate
the mean and the variance of the random variable g(X) = 1/(X + 1).
Solution: We have c = µX = 1. Then we compute
$$g_x(x) = -\frac{1}{(1 + x)^2}, \qquad g_{xx}(x) = \frac{2}{(1 + x)^3}.$$
Therefore
$$g(1) = \frac{1}{2}, \qquad g_x(1) = -\frac{1}{4}, \qquad g_{xx}(1) = \frac{1}{4}.$$
This gives rise to
$$E(g(X)) \sim g(1) + \frac{1}{2} g_{xx}(1)\,\sigma_X^2 = \frac{1}{2} + \frac{1}{2} \times \frac{1}{4} \times \frac{1}{2} = \frac{9}{16},$$
$$\sigma^2_{g(X)} \sim (g_x(\mu_X))^2 \sigma_X^2 = (g_x(1))^2 \sigma_X^2 = \left(-\frac{1}{4}\right)^2 \times \frac{1}{2} = \frac{1}{32}.$$
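
The arithmetic of this example can be verified with exact fractions. The sketch below evaluates the two formulas of Corollary 2.4 for g(x) = 1/(1 + x) with µX = 1 and σX² = 1/2; the variable and function names are chosen here for illustration.

```python
from fractions import Fraction

mu, var = Fraction(1), Fraction(1, 2)

def g(x):
    return 1 / (1 + x)

def g_x(x):
    # first derivative of 1/(1 + x)
    return -1 / (1 + x) ** 2

def g_xx(x):
    # second derivative of 1/(1 + x)
    return 2 / (1 + x) ** 3

approx_mean = g(mu) + Fraction(1, 2) * g_xx(mu) * var
approx_var = g_x(mu) ** 2 * var
print(approx_mean, approx_var)  # 9/16 1/32
```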
For several random variables, we similarly apply Taylor’s formula for
multivariable functions y = h(x1, · · · , xk).
Theorem 2.6. Let X1, · · · , Xk be independent random variables, and consider the
random variable given by
Y = h(X1, · · · , Xk).
Write µXi = µi and σXi = σi. Then we have
$$E(Y) \sim h(\mu_1, \cdots, \mu_k) + \sum_{i=1}^{k} h_{x_i x_i}(\mu_1, \cdots, \mu_k)\,\frac{\sigma_i^2}{2},$$
$$\operatorname{Var} Y \sim \sum_{i=1}^{k} \bigl(h_{x_i}(\mu_1, \cdots, \mu_k)\bigr)^2 \sigma_i^2.$$
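
A minimal sketch of these two sums follows. It is not from the notes: the function name `delta_method_multi` is hypothetical, the partial derivatives are estimated by central finite differences (an assumption of this illustration), and the test function h(x, y) = xy with its means and variances is made up.

```python
def delta_method_multi(h, mus, variances, step=1e-3):
    """Approximate (E[Y], Var[Y]) for Y = h(X1, ..., Xk) with independent Xi.

    Partial derivatives at the means are estimated by central finite
    differences with the given step (an assumption of this sketch).
    """
    center = h(*mus)
    mean, var = center, 0.0
    for i, s2 in enumerate(variances):
        up = list(mus)
        up[i] += step
        down = list(mus)
        down[i] -= step
        h_i = (h(*up) - h(*down)) / (2 * step)               # first partial in x_i
        h_ii = (h(*up) - 2 * center + h(*down)) / step ** 2  # second pure partial in x_i
        mean += 0.5 * h_ii * s2
        var += h_i ** 2 * s2
    return mean, var

# Hypothetical test case: Y = X1 * X2 with means (2, 3) and variances (0.5, 0.25).
mean, var = delta_method_multi(lambda x, y: x * y, mus=[2.0, 3.0], variances=[0.5, 0.25])
print(mean, var)
```

For this product the formulas give mean 6 and variance 3² × 0.5 + 2² × 0.25 = 5.5, up to rounding. The exact variance of a product of independent variables is µ2²σ1² + µ1²σ2² + σ1²σ2² = 5.625, so the first-order formula omits the term σ1²σ2².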

The approximation for dependent random variables becomes more complicated
because of the appearance of the mixed second derivatives, i.e., the
off-diagonal entries of the Hessian matrix.
Example 2.7. In Newton’s theory of gravity, the force between two planets (or
two stars) is given by the formula
$$F = -\frac{G m_1 m_2}{r^2}$$
where G is the gravitational constant, which is universal, i.e., independent of the
planets. The question is how to determine this constant. We
can estimate G by measuring the masses of many pairs of planets and the forces
between them. Regard M1 and M2 as the random variables measuring the
masses, R as the random variable measuring the distance r, and F as the one measuring the force.
Suppose E(Mi) = 100, 500, σi² = 100, for i = 1, 2, and E(F) = 200, σF² = 5, and
E(R) = 1, 0000000, σR² = 20. Estimate the constant G and the standard deviation
of this estimation.
Solution: From the formula, we derive
$$G = -\frac{F r^2}{m_1 m_2}.$$
If we write $h(x_1, x_2, x_3, x_4) = \frac{x_4 x_3^2}{x_1 x_2}$, then we have $G = -h(m_1, m_2, r, F)$, and hence
$$G \sim E(G) = E\left(-\frac{F R^2}{M_1 M_2}\right) = -E\bigl(h(M_1, M_2, R, F)\bigr).$$
m1 m2
