0% found this document useful (0 votes)
15 views2 pages

Confidence Intervals in Statistics Analysis

This document provides examples and questions related to constructing confidence intervals from sample data. Confidence intervals provide a range of values that are likely to contain the true population parameter based on a given level of confidence. Some key points covered include: - How to calculate 90%, 95%, and 99% confidence intervals for things like average IQ scores, weight loss amounts, and debit card spending based on sample data and assumptions about the population distribution. - How the required sample size to achieve a given precision depends on the population standard deviation, with larger variations requiring more data. - Comparing 95% and 99% confidence intervals calculated from test score data, noting the 99% CI will have a longer total length.

Uploaded by

Gowshika Sekar
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views2 pages

Confidence Intervals in Statistics Analysis

This document provides examples and questions related to constructing confidence intervals from sample data. Confidence intervals provide a range of values that are likely to contain the true population parameter based on a given level of confidence. Some key points covered include: - How to calculate 90%, 95%, and 99% confidence intervals for things like average IQ scores, weight loss amounts, and debit card spending based on sample data and assumptions about the population distribution. - How the required sample size to achieve a given precision depends on the population standard deviation, with larger variations requiring more data. - Comparing 95% and 99% confidence intervals calculated from test score data, noting the 99% CI will have a longer total length.

Uploaded by

Gowshika Sekar
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Nanyang Business School

AB1202 – STATISTICS AND ANALYSIS

Tutorial :8
Topics : Confidence Interval

1. IQ tests are designed to yield scores that are approximately normally distributed. A
reporter gathers the IQ scores from 22 employees of this firm and records the sample
mean IQ as 106. She assumes that the population standard deviation is 15.
(1) Compute 90% confidence interval for the average IQ in this firm.
(2) Compute 99% confidence interval for the average IQ in this firm.

2. U.S. consumers are increasingly viewing debit cards as a convenient substitute for cash
and checks. The average amount spent annually on a debit card is $7790. Assume that this
average was based on a sample of 100 consumers and that the population standard
deviation is $500. Construct the 99% confidence interval for the population mean amount
spent annually on a debit card.

3. In order to estimate the mean weight loss of all customers in a weight loss program, a
nutritionist takes a sample of 18 dieters and records their weight loss one month after
joining the program. He computes the sample mean and standard deviation of weight loss
as 11.5 pounds and 9.2 pounds, respectively. He believes weight loss is likely to be
normally distributed. Calculate the 95% confidence interval for the population mean.

4. Consider the built-in dataset ChickWeight in R.


(1) Consider the population of 10-day old chicken (Time==10), where we define an RV of
their weights as 𝑋!" . Use the data of 10-day old chicken, build a 90% confidence
interval of the weight of 10-day old chicken 𝜇!"
(2) Consider the population of newborn chicken (Time==0), where we define an RV of
their weight as 𝑋" . Use the data of newborn chicken, build a 95% confidence interval
of the weight of newborn chicken 𝜇"

5. Suppose that 𝑋! , … , 𝑋# form a random sample from the normal distribution with
unknown mean µ and known variance 𝜎 $ . How large must a random sample be in order
that there will be a 95% confidence interval for µ with a total length less than 0.01σ?
(Total length of a CI =2* 𝑍".&'( 𝜎/√𝑛)

6. An analyst would like to construct 95% confidence intervals for the mean stock returns in
two industries. Industry A is a high-risk industry with a known population standard
deviation of 20.6%, whereas industry B is a low-risk industry with a known population
standard deviation of 12.8%.
1
Nanyang Business School

(1) What is the minimum sample size required by the analyst if she wants to restrict the
)
absolute error (i.e., (𝑍 ∗ ) in a CI ) to 4% for Industry A?
√#

(2) What is the minimum sample size required by the analyst if she wants to restrict the
absolute error to 4% for Industry B?
(3) Why do the results differ if they use the same absolute error?

7. Consider the file [Link], which contains the test scores of 149 students.
(1) Load the file into R and calculate 95% and 99% CI for the mean test scores
(2) Compare the 95% CI with the 99% CI. Is the 99% CI’s total length longer or shorter than
the 95% CI?
(3) If we collect data from another 150 students, how would this larger sample affect the
total length of the 95% CI? Suppose the sample standard deviation under the larger
sample is similar to the original one
(4) Based on your answer from (1), can we conclude that there is a 0.95 probability that
the E(X) is in the CI you just calculated? why?

Common questions

Powered by AI

A 99% confidence interval is wider than a 95% confidence interval because it must encompass more of the distribution to increase the probability that the true population parameter lies within the interval. Specifically, higher confidence levels increase the Z value, thereby widening the interval to maintain the desired level of confidence .

The sample size required differs due to variations in population standard deviations. Industry A has a standard deviation of 20.6%, while Industry B's is 12.8%. A higher standard deviation, like Industry A's, indicates greater variability, thus requiring a larger sample to achieve the same absolute error margin. The formula: n = (Z*σ/E)^2 reflects this, where E is the maximum error, further explaining why Industry A needs a larger sample despite identical error requirements .

The length of a confidence interval is directly proportional to the square root of the population variance. For example, in the question about having a 95% confidence interval length less than 0.01σ, it is vital to derive n using the equation CI length = Z*(σ/√n), solving for n provides the necessary sample size to ensure the interval's total length remains under the desired threshold .

The risk level, shown by standard deviation, affects the width of confidence intervals; greater risk (higher standard deviation) results in wider intervals. This necessitates larger sample sizes for maintaining the same level of precision. For Industry A's high risk and 20.6% standard deviation, a wider interval requires a larger sample compared to Industry B's lower risk (12.8%), reflecting trade-offs in confidence interval precision and sample size .

Stating a 0.95 probability that the true mean lies within the confidence interval misinterprets the concept of confidence intervals. A 95% confidence interval means that if we were to take many samples and build a confidence interval from each, 95% of those intervals would contain the true mean. However, for any given interval, the true mean is either within it or not, with no probability aspect to its presence .

Increasing the sample size reduces the length of the confidence interval, as it decreases the standard error (σ/√n). For instance, if we increase the sample size from 149 to 299 students while the sample standard deviation remains constant, the denominator in the formula for standard error grows, thus narrowing the confidence interval .

The assumption of normally distributed data allows the use of the t-distribution for small samples when the population standard deviation is unknown, or the normal distribution when it's known. In the weight loss program example, the calculated 95% confidence interval assumes normal distribution, which justifies using a t-score from the t-distribution for accurate interval estimates for a small sample (n=18).

To construct the 95% confidence interval for debit card expenditure: Use CI = x̄ ± Z*(σ/√n), given x̄ is $7790, σ is $500, and n is 100. With a Z value of 1.96 for 95% confidence, the interval is $7790 ± 1.96*(500/√100), resulting in ($7690, $7890). This interval suggests that if you took many samples, approximately 95% would contain the true average expenditure .

Degrees of freedom are crucial in small sample size contexts, affecting the choice of t-distribution over the normal distribution due to the sample mean's greater variability. With 18 dieters, using t-distribution with 17 degrees of freedom adjusts for estimation error in the sample standard deviation, providing a more accurate confidence interval .

To calculate a 90% confidence interval for the average IQ score with a population standard deviation of 15, one should use the formula: CI = x̄ ± Z*(σ/√n). Here, x̄ is 106, σ is 15, and n is 22. For a 90% confidence level, the Z value is 1.645. Therefore, the confidence interval is 106 ± 1.645*(15/√22), which results in (101.33, 110.67).

You might also like