0% found this document useful (0 votes)
52 views5 pages

Statistical Inference Exam Questions

This document outlines the final examination for a Statistical Inference course at Bahir Dar University, detailing the instructions, topics covered, and specific questions related to maximum likelihood estimation, confidence intervals, and hypothesis testing. Students are required to answer all questions using a scientific calculator and show their work for full credit. The exam includes various statistical distributions and concepts, including Bernoulli, normal, Pareto distributions, and the definitions of type I and type II errors.

Uploaded by

Dereje
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
52 views5 pages

Statistical Inference Exam Questions

This document outlines the final examination for a Statistical Inference course at Bahir Dar University, detailing the instructions, topics covered, and specific questions related to maximum likelihood estimation, confidence intervals, and hypothesis testing. Students are required to answer all questions using a scientific calculator and show their work for full credit. The exam includes various statistical distributions and concepts, including Bernoulli, normal, Pareto distributions, and the definitions of type I and type II errors.

Uploaded by

Dereje
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

BAHIR DAR UNIVERSITY

COLLEGE OF SCIENCE, DEPARTMENT STATISTICS


Statistical inference (Stat) Final Examination (50%), July 20, 2024
Name________________________ID.N O__________Dept_____Time Allowed: 1:30 hrs.
INSTRUCTIONS
 Please fill in your Name and Id. No on top of every answer sheet.
 ALL questions should be attempted and use of scientific calculator is allowed.
 Please use a Black or Blue pen to answer the questions
 Show all your works to earn full marks and illegible answers will have no credit.

1. Let X 1 , X 2 ,…, X n be a random sample from geometric distribution with ( 5 pts)

Find the Maximum likelihood


estimator for p.

2. If a random sample of size n is taken from the Bernoulli distr., find MLE of P. ( 5 pts)

Solution: f ( xi ; θ ) = p ( 1−p )
ix 1−x i
, 0≤p≤1 , and
x i=0 or 1 .
n
∑ xi n− ∑ x i x n−x x=∑ x i = ∑ x i
L(θ ) =L(p)= p = p (1− p )
(1− p) , where i=1 .
⇒ lnL(p) = x*lnp + (n-x)*ln (1-p)
∂ ln L( p ) x n−x x
= − =o ^p= = x̄
⇒ ∂p p 1− p ⇒ x – pn =0 ⇒ n .
3. If X 1 , X 2 ,…, X n constitute a random sample of size n from a normal population with mean
μ and variance σ 2 , find the joint MLE of the parameters ( 6 pts)

1
4. Suppose that X is a discrete random variable with the following probability mass function:
where 0 ≤ θ ≤ 1 is a parameter. The following 10 independent observations. ( 6 pts)

x 0 1 2 3

P(X=x) 2θ θ 2(1−θ ) 1−θ


3 3 3 3

were taken from such a distribution: (3,0,2,1,3,2,1,0,2,1). What is the maximum likelihood
estimate of θ.

2
5. The Pareto distribution has a probability density function.

where α and θ are positive parameters of the distribution. Assume that α is known and that
X 1 , X 2 ,…, X n
is a random sample of size n.
A. Find the maximum likelihood estimator for θ. ( 5 pts)
B. Estimate θ based on these data: 3, 5, 2, 3, 4, 1, 4, 3, 3, 3. ( 5 pts)

3
6. Explain the steps and construct the confidence interval for population variance when μ is
unknown (4 pts)
( n−1)S 2
2
Here, we know that σ2 has χ n−1 , so that
2 (n−1 )S 2 2α /2 , n−1
P {χ 1−α /2 , n−1 < <χ }=1−α
σ2 .
( n−1) S 2 ( n−1 ) S2
⇒P{ 2
<σ 2 <
2
}=1−α
α / 2 , n−1
χ χ 1−α / 2, n−1 .
2 2
(n−1 )S (n−1 )S
[ 2 , 2 ]
α /2, n−1 1−α /2, n−1
χ χ is the required C.I.

7. Let Q be a function of X1, …, Xn and θ ; that is, Q= q(X1, …, Xn ;θ ). If Q has a distribution


that does not depend on θ , then Q is called: Pivotal Quantity ( 3 pts)

4
8. Explain the role of the t-statistic in the construction of the confidence interval for the
difference in population means. Why is the t-distribution used instead of the standard normal
distribution? ( 3 pts)

9. Define type I and type II errors ( 4 pts)


 Type I error is the probability that we reject the null hypothesis when it is true; α ≡P
[reject H0 | H0 is true]
 Type II error is the probability that we do not reject the null hypothesis when it is not
true; β
10. Define composite hypotheses and simple hypotheses with examples for each. ( 4 pts)
 If the hypothesis completely specifies the population (or distribution), then it is called a
simple hypothesis; if not, it is called a composite hypothesis.

Common questions

Powered by AI

Type I error occurs when the null hypothesis is rejected despite being true, with probability denoted by α. Type II error occurs when the null hypothesis is not rejected despite being false, with probability denoted by β. These errors impact conclusions by indicating the risk of wrong decisions: a Type I error leads to a false positive, while a Type II error results in a missed detection or false negative. The balance between these errors is managed by adjusting the significance level (α) and power (1-β) of a test, influencing the reliability of statistical inferences .

To estimate the parameter 'θ' from a discrete distribution given data (e.g., x=0,1,2,3 with corresponding probabilities), you calculate the likelihood function based on the observed data. For example, using observations from x={3,0,2,1,3,2,1,0,2,1}, the probabilities P(X=x) might be given as functions of θ, such as 2θ, 3θ/3, 2(1−θ)/3, 1−θ/3. Compute the likelihood for the given observations and differentiate it with respect to θ. Set the derivative to zero to solve for θ, which gives the MLE. Careful computation using observed frequency matches against the probability function helps isolate 'θ' accurately .

To construct a confidence interval for the population variance σ² when the population mean is unknown, use the chi-square distribution. The formula is based on (n−1)S²/σ², which follows a χ² distribution with n−1 degrees of freedom. The confidence interval for σ² is given by [(n−1)S²/χ²_(α/2,n-1), (n−1)S²/χ²_(1-α/2,n-1)], ensuring a (1-α) confidence level. This method is appropriate because it accounts for sample variability by using sample variance (S²) and the chi-square distribution accurately reflects the distribution of the sample variance for normally distributed data .

A simple hypothesis completely specifies the population distribution, such as H0: μ = 5, where all parameters (e.g., mean and variance) are specified. A composite hypothesis provides partial specification, such as H0: μ > 5, where the distribution's full characteristics aren't fully defined. Simple hypotheses allow straightforward calculation of test statistics and p-values, while composite hypotheses require broader testing procedures often invoking broader confidence intervals and complex test statistics as they account for greater parameter uncertainty. This complexity affects both the sensitivity and specificity of tests as applied to data analysis scenarios .

Showing detailed work and using a scientific calculator in statistical examinations ensures clarity, precision, and accuracy in the completion of complex numerical calculations and logical reasoning steps required for statistical inference problems. It allows examiners to follow the student's reasoning process, verifying correctness and understanding, thus facilitating awarding partial credits. This protocol is crucial in examinations such as those at Bahir Dar University, where adherence to systematic calculation and documentation standards ensures fair evaluation of each candidate's actual competency and grasp of statistical concepts .

The t-distribution is used instead of the standard normal distribution when constructing a confidence interval for the difference between two population means because the sample size is typically small and the population standard deviation is unknown. The t-distribution accounts for the additional uncertainty introduced by estimating the population standard deviation from the sample. It has heavier tails than the normal distribution, providing wider confidence intervals that better encompass the true parameter with a given probability, which is particularly critical when sample sizes are not large enough for the Central Limit Theorem to ensure normality .

A pivotal quantity is a function of the sampled data and unknown parameter which has a known probability distribution independent of the parameter. It is critical in constructing confidence intervals because it enables the derivation of intervals without directly knowing the parameter distribution. For instance, (n−1)S²/σ² follows a χ² distribution regardless of σ², allowing calculation of σ²'s confidence interval. This detachment from the parameter distribution simplifies interval estimation and ensures consistent estimators across trials, significantly enhancing the reliability and accuracy of parameter estimates under uncertainty .

Sample variance plays a crucial role in constructing confidence intervals for variance or standard deviation because it acts as an unbiased estimator of the population variance when the mean is unknown. Using the sample variance, (n−1)S², corrects for bias in estimating variability without needing to specify the mean. In constructing intervals for variance, reliance on sample variance allows the chi-square statistic to effectively compensate for missing mean information, thereby maintaining interval accuracy and providing conservative estimates reflective of real-world conditions with unknown population means .

To derive the MLE for 'p' in a Bernoulli distribution, consider a random sample, which follows the probability mass function f(x;θ)=p^x(1−p)^(1−x). The likelihood function is L(p)=p^Σx(1−p)^(n−Σx), where Σx is the sum of all observed data points. Taking the natural logarithm yields the log-likelihood lnL(p) = Σx*ln(p) + (n−Σx)*ln(1−p). Differentiating with respect to 'p' and solving for zero gives x − np = 0, leading to the MLE, ^p = Σx/n .

The Maximum Likelihood Estimator (MLE) is used in statistical inference to find the parameter values that maximize the likelihood function, which represents the probability of obtaining observed data. MLE provides a method to estimate parameters of probability distributions such as the mean, variance, and probability in various statistical models. It is especially significant because MLE estimators have desirable properties, such as being consistent, asymptotically normal, and efficient under certain regularity conditions .

You might also like