Normality Tests and T-Tests Analysis

This document contains instructions for completing 4 assignments involving statistical analyses: 1. Checking if a variable is normally distributed using normality tests and confidence intervals. 2. Creating a 95% confidence interval for sleep averages and determining if it includes the recommended average. 3. Conducting a one-sample t-test to determine if baby weights are lower than average. 4. Performing another one-sample t-test to see if height differs from a proposed average. The assignments involve applying statistical tests like the Shapiro-Wilk normality test and t-tests, interpreting p-values and confidence intervals, and making conclusions about null hypotheses.

Uploaded by

Naresh Suwal

Assignment 06

1. Normality Test: To complete this homework, you will need the dataset Kinesiology_1.csv which you
will find on Moodle.

a) Check to see if the variable HR follows a normal distribution. Formulate the appropriate hypothesis
and report the corresponding p-value. What is your verdict?
H0: The variable HR is normally distributed.
Ha: The variable HR is not normally distributed.
α = 0.05
> library(ggplot2)
> Data <- read.csv("Kinesiology_1.csv")
> Data_HR <- Data$HR
> df <- data.frame(y = Data_HR)
> p <- ggplot(df, aes(sample = y)) +
+ stat_qq() +
+ stat_qq_line() +
+ theme_classic()
> p
> shapiro.test(Data_HR)

Shapiro-Wilk normality test

data: Data_HR
W = 0.96431, p-value = 0.07662

From the Shapiro-Wilk test, the test statistic is W = 0.96431 and the p-value is 0.07662.
Since 0.07662 > α = 0.05, we cannot reject the null hypothesis. There is insufficient
evidence to claim that HR is not normally distributed.
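
The decision rule above can be sketched in base R as follows. This is only an illustration: Kinesiology_1.csv is not reproduced here, so `hr_example` is simulated, hypothetical data standing in for Data$HR, and the variable names are assumptions. Only `shapiro.test()`, `qqnorm()` and `qqline()` are standard R functions.

```r
# Hypothetical stand-in for Data$HR (the real CSV is not reproduced here).
set.seed(1)
hr_example <- rnorm(60, mean = 75, sd = 10)

alpha <- 0.05
sw <- shapiro.test(hr_example)

# W is the test statistic (values near 1 are consistent with normality);
# the decision comes from comparing the p-value with alpha.
w_stat <- unname(sw$statistic)
p_val <- sw$p.value
reject_h0 <- p_val < alpha

cat("W =", round(w_stat, 4), " p-value =", round(p_val, 4), "\n")
cat(if (reject_h0) "Reject H0" else "Fail to reject H0", "\n")

# Base-R normal Q-Q plot, an alternative to the ggplot2 version.
qqnorm(hr_example)
qqline(hr_example)
```

Replacing `hr_example` with the real `Data_HR` vector should reproduce the W and p-value reported above.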

b) Repeat (a) only for 5_min and then only for 15_min.
5_min:
H0: HR is normally distributed for the REST group 5_min.
Ha: HR is not normally distributed for the REST group 5_min.
α = 0.05
> shapiro.test(Data_HR_5)

Shapiro-Wilk normality test

data: Data_HR_5
W = 0.91906, p-value = 0.1864

Using the Shapiro-Wilk test, the test statistic is W = 0.91906 and the p-value is 0.1864.
Since 0.1864 > α = 0.05, we cannot reject the null hypothesis. There is insufficient evidence
to claim that HR is not normally distributed for the REST group 5_min.

15_min:
H0: HR is normally distributed for the REST group 15_min.
Ha: HR is not normally distributed for the REST group 15_min.
α = 0.05

> shapiro.test(Data_HR_15)

Shapiro-Wilk normality test

data: Data_HR_15
W = 0.96845, p-value = 0.8345

Using the Shapiro-Wilk test, the test statistic is W = 0.96845 and the p-value is 0.8345.
Since 0.8345 > α = 0.05, we cannot reject the null hypothesis. There is insufficient evidence
to claim that HR is not normally distributed for the REST group 15_min.
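
The transcript does not show how Data_HR_5 and Data_HR_15 were created. One possible way to build such subsets is sketched below. It assumes a grouping column, hypothetically named `REST`, with the values "5_min" and "15_min"; the data frame is simulated, so the column name, group sizes and values are all assumptions (check `names(Data)` for the real column name).

```r
# Hypothetical data frame standing in for Kinesiology_1.csv.
# The column name REST and the group sizes are assumptions for illustration.
set.seed(2)
example_data <- data.frame(
  REST = rep(c("5_min", "15_min"), each = 15),
  HR   = rnorm(30, mean = 75, sd = 10)
)

# Subset HR by group, then test each subset separately.
hr_5  <- example_data$HR[example_data$REST == "5_min"]
hr_15 <- example_data$HR[example_data$REST == "15_min"]

print(shapiro.test(hr_5))
print(shapiro.test(hr_15))

# The same idea in one step: one Shapiro-Wilk p-value per group.
p_by_group <- tapply(example_data$HR, example_data$REST,
                     function(x) shapiro.test(x)$p.value)
print(p_by_group)
```

Testing each group separately matters because a pooled sample can look non-normal simply because the groups have different means.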

2. Confidence interval using the T distribution: It is recommended that the average hours of sleep an
adult should receive daily is 8. As a graduate student, this can be difficult to achieve sometimes. The
following is a set of 10 measurements from my sleep schedule over the past 10 days:

3 6 7 7 6 5 7 3 6 8

a) Create a two-tailed 95% confidence interval with the mean and standard error of the above dataset
using a T-distribution.
> a <- c(3,6,7,7,6,5,7,3,6,8)
> describe (a)
SE = s / √n
Since the sample standard deviation is s = 1.69 and n = 10, we have SE = 1.69 / √10 = 0.5344, and x̅ = 5.8.
[x̅ - t(9, 0.025) × SE, x̅ + t(9, 0.025) × SE]
[5.8 - 2.262(0.5344), 5.8 + 2.262(0.5344)]
[4.5912, 7.0088]
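
As a check on the hand calculation, the same interval can be computed in R. This is a sketch using only base functions; the variable names are illustrative. Because R does not round s to 1.69, the result is about [4.59, 7.01], which differs from the hand-computed interval only in the third decimal place.

```r
a <- c(3, 6, 7, 7, 6, 5, 7, 3, 6, 8)

n      <- length(a)
x_bar  <- mean(a)                 # sample mean
s      <- sd(a)                   # sample standard deviation
se     <- s / sqrt(n)             # standard error of the mean
t_crit <- qt(0.975, df = n - 1)   # two-tailed 95% critical value, 9 df

ci <- c(lower = x_bar - t_crit * se, upper = x_bar + t_crit * se)
print(round(ci, 4))

# Cross-check against the interval reported by t.test().
print(t.test(a)$conf.int)
```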

b) Why is it better to use a T-distribution in this example?

We do not know the population standard deviation and the sample size is less than thirty.
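
To illustrate the point, the short sketch below compares the 95% critical values of the normal and t distributions. With n = 10, the t-based interval is roughly 15% wider than a normal-based one for the same standard error, and the gap shrinks as n grows. The variable names are illustrative.

```r
n <- 10
z_crit <- qnorm(0.975)            # normal critical value, about 1.96
t_crit <- qt(0.975, df = n - 1)   # t critical value with 9 df, about 2.262

# Ratio of interval widths (t-based vs. normal-based) for the same SE.
width_ratio <- t_crit / z_crit
cat("z:", round(z_crit, 3), " t:", round(t_crit, 3),
    " ratio:", round(width_ratio, 3), "\n")

# t critical values for increasing sample sizes approach the normal value.
t_by_n <- sapply(c(5, 10, 30, 100), function(k) qt(0.975, df = k - 1))
print(round(t_by_n, 3))
```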
c) Does the recommended average fall in that confidence interval? What does that imply, or what
would you say based on that?
The recommended average of 8 hours does not fall in the confidence interval. Hence, at the 5%
level we reject the null hypothesis that the mean is 8: the sample average is different from the
recommended average. Because the entire interval lies below 8, we conclude from these data
that the graduate student is not getting enough sleep.
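
The link between the interval and the test can be checked in R: a two-sided one-sample t-test at α = 0.05 rejects H0: µ = 8 exactly when 8 lies outside the 95% confidence interval. The sketch below uses `t.test()` from base R; the names `mu0` and `mu0_in_ci` are illustrative.

```r
a <- c(3, 6, 7, 7, 6, 5, 7, 3, 6, 8)
mu0 <- 8  # recommended average hours of sleep

res <- t.test(a, mu = mu0)   # two-sided by default, with a 95% CI
ci <- res$conf.int

# The CI excludes mu0 exactly when the two-sided p-value is below 0.05.
mu0_in_ci <- ci[1] <= mu0 && mu0 <= ci[2]
cat("8 inside 95% CI:", mu0_in_ci, "\n")
cat("p-value below 0.05:", res$p.value < 0.05, "\n")
```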

(In this question, you can use a software to compute some descriptive statistics, but you should complete
the problem by hand)

3. One-Sample T-test: The US CDC reports that the average weight of healthy 12-hour-old infants is
7.5 lb. A sample of 10 newborn babies from a low-income neighborhood yielded the following weights (in
pounds) at 12 hours after birth:

6.0 8.6 7.5 8.2 8.0 8.1 6.4 6.0 7.2 4.8

The researcher wants to know if we can conclude that babies from this neighborhood are underweight with
α = 0.01.
a) Write the null and alternate hypotheses.
Ho: The babies from low-income neighborhoods weigh the same as the population
infant weight of 7.5 lbs.
Ho: µ = 7.5
H1: The babies from low-income neighborhoods weigh less than 7.5lbs.
H1: µ < 7.5
b) The researcher argues that a one-sided test is needed. Can you support her claim logically? Do you
think a one-sided test could be justified here? Explain.
Here, the research question is only whether babies from low-income neighborhoods weigh less than
7.5 lbs, so the alternative is directional and a one-sided test is justified. If we were instead asking
whether the weights differ from 7.5 lbs in either direction, a two-sided test would be necessary.
c) Run a one-sample t-test using the sample data above. What is your p-value from your results?
> babies = c(6,8.6,7.5, 8.2, 8, 8.1, 6.4, 6, 7.2, 4.8)
> t.test(babies, mu = 7.5)

One Sample t-test

data: babies
t = -1.079, df = 9, p-value = 0.3086
alternative hypothesis: true mean is not equal to 7.5
95 percent confidence interval:
6.199468 7.960532
sample estimates:
mean of x
7.08

The output above is for a two-sided t test. Because the sample mean (7.08) lies below 7.5, in the
direction of H1, the one-sided p-value is half the two-sided value: 0.3086 / 2 = 0.1543.
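
Instead of halving the two-sided p-value by hand, the lower-tailed test can be requested directly with the `alternative = "less"` argument of `t.test()`. The sketch below also recomputes the p-value from the t statistic as a cross-check; the names `res_less`, `t_stat` and `p_manual` are illustrative.

```r
babies <- c(6, 8.6, 7.5, 8.2, 8, 8.1, 6.4, 6, 7.2, 4.8)

# Request the lower-tailed test directly (H1: mu < 7.5).
res_less <- t.test(babies, mu = 7.5, alternative = "less")
print(res_less$p.value)

# Same p-value from the t statistic and the t distribution with n - 1 df.
n <- length(babies)
t_stat <- (mean(babies) - 7.5) / (sd(babies) / sqrt(n))
p_manual <- pt(t_stat, df = n - 1)
print(c(t = t_stat, p = p_manual))
```

Requesting the direction explicitly avoids the mistake of halving the p-value when the sample mean falls on the opposite side of the hypothesized value.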

d) What is your conclusion to our hypotheses?

The p-value is 0.1543, which is greater than 0.01. Therefore, we fail to reject the
null hypothesis. In practical terms, this data set does not provide enough evidence to
support the claim that the average weight of babies from low-income neighborhoods is
lower than 7.5 lbs.

4. One-Sample T-test: To complete this question, use the dataset Kinesiology_1.csv again. We will do a
one-sample t-test on the variable HT, assuming that the test is two-sided with α = 0.05. We are interested
in seeing if the mean height equals 170 cm or not.
a) Write the null and alternate hypotheses.
H0: μHT = 170 (mean height is equal to 170 cm)
HA: μHT ≠ 170 (mean height is not equal to 170 cm)

b) Run a one-sample t-test on the variable HT. What is your p-value from your results?
> Data_HT <- Data$HT
> t.test(Data_HT, mu = 170)

One Sample t-test

data: Data_HT
t = 6.0426, df = 59, p-value = 1.099e-07
alternative hypothesis: true mean is not equal to 170
95 percent confidence interval:
173.4112 176.7888
sample estimates:
mean of x
175.1
c) What is your conclusion to our hypotheses?
Since the p-value (1.099e-07) is less than 0.05, we reject the null hypothesis. The mean height
of our sample (175.1 cm) is significantly different from 170 cm.
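
Because the HT data are not reproduced here, the sketch below only checks that the reported numbers are internally consistent: it backs out the standard error from the reported confidence interval and recomputes the t statistic and two-sided p-value. All inputs are copied from the output above; the variable names are illustrative.

```r
# Values copied from the t.test() output for Data_HT.
x_bar <- 175.1
ci    <- c(173.4112, 176.7888)
df    <- 59
mu0   <- 170

# Back out the standard error from the CI half-width.
se <- (ci[2] - ci[1]) / 2 / qt(0.975, df)

t_stat <- (x_bar - mu0) / se
p_two_sided <- 2 * pt(-abs(t_stat), df)

cat("t =", round(t_stat, 3), " p =", signif(p_two_sided, 3), "\n")
```

The recomputed t is about 6.04 and the p-value is on the order of 1e-07, in line with the reported output up to rounding.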

5. Concept questions: Complete the following concept questions from the book, in Chapter 4:
1, 8, 12, 13

1) True
8) True
12) False, there must be assumptions that are met when the t test is applied.
13) False, the degrees of freedom for the t test do depend on the sample size.

Common questions

What are the implications of the confidence interval for sleep hours not including the recommended average of 8 hours in the analysis conducted with T-distribution on a graduate student’s sleep schedule?

The 95% confidence interval for sleep hours, calculated using a T-distribution, was [4.5912, 7.0088], which does not include the recommended average of 8 hours. At the 5% level, this is evidence against the null hypothesis that the student's average sleep equals the recommended 8 hours. Because the whole interval lies below 8, it suggests that the student gets significantly less sleep than recommended, a potential lifestyle or health concern.

How did the researcher use the Shapiro-Wilk test results to address the normality assumption of the REST group at 5_min and 15_min intervals, and what were the findings?

The Shapiro-Wilk test was used to assess the normality of HR data for the REST group at both the 5_min and 15_min intervals. For the 5_min interval, the test produced a p-value of 0.1864, and for the 15_min interval, a p-value of 0.8345. In both cases the p-value exceeded the alpha level of 0.05, so there was no significant departure from normality and the null hypothesis of a normal distribution could not be rejected for either interval.

Can the p-value obtained from the one-sample T-test on infant weights be interpreted to mean that infants from a particular neighborhood are indeed underweight? Discuss the conclusion within the statistical context given a significance level of 0.01.

No. The one-sample T-test on infant weights yielded a one-sided p-value of 0.1543, which is greater than the significance level of 0.01. We therefore fail to reject the null hypothesis: there is insufficient evidence to claim that infants from the low-income neighborhood weigh less than the population average of 7.5 lbs. With only 10 observations, more data would be needed to establish such a claim with statistical confidence.

What justifies using a one-sample T-test to evaluate whether the average weight of infants in the study differs significantly from the known population mean?

A one-sample T-test is justified in this context because it compares the mean of a single sample (the observed infant weights) against a known reference value (7.5 lbs), using the sample standard deviation and sample size to estimate the uncertainty in the sample mean. The test can be run one-sided or two-sided, so it can address a directional question such as whether the sample mean is lower than the reference value.

Why might a T-distribution be more appropriate than a normal distribution for constructing confidence intervals in certain situations, such as with the sleep data example?

A T-distribution is more appropriate when the sample size is small (typically less than 30) and the population standard deviation is unknown. The T-distribution accounts for the additional uncertainty from estimating the standard deviation, resulting in wider confidence intervals than those produced by the normal distribution. This makes it a better fit for the sleep data example, where only 10 measurements are available and the population standard deviation is not known.

How does the Shapiro-Wilk test help in determining the normal distribution of a dataset, and what were the outcomes for the HR variable in the dataset provided?

The Shapiro-Wilk test assesses normality by comparing the order statistics of the sample to the order statistics expected under a normal distribution. It produces a W statistic, where a value close to 1 indicates a distribution close to normal. For the HR variable, the test gave W = 0.96431 with a p-value of 0.07662, which is greater than the significance level of 0.05. There was therefore insufficient evidence to reject the null hypothesis, so HR can be treated as approximately normally distributed.

In what scenarios does the assumption of normality become crucial for performing a T-test, and how does this relate to the assumptions checked in the normality tests for HR and REST group data?

The assumption of normality matters most for T-tests when sample sizes are small, because the T statistic follows a T-distribution only if the sampling distribution of the mean is approximately normal. For the HR and REST group data, the Shapiro-Wilk test was used to check this assumption and found no significant departure from normality. Failing to reject normality does not prove it, but it supports applying T-tests to these data.

Describe the steps and calculations necessary to construct a 95% confidence interval for the average sleep hours of a graduate student using the T-distribution. What assumptions must be satisfied?

First compute the sample mean (x̅) and sample standard deviation (s). For the n = 10 sleep measurements, x̅ = 5.8 and s = 1.69. Calculate the standard error as SE = s/√n = 0.5344. Using the T-distribution with df = n - 1 = 9 and an upper-tail area of 0.025, the critical T-value is approximately 2.262. The interval is [x̅ - T × SE, x̅ + T × SE] = [4.5912, 7.0088]. The assumptions are that the measurements are a random sample and that they come from an approximately normal population.

What conclusion can be drawn from the one-sample T-test conducted on the variable HT from the Kinesiology_1.csv dataset?

The one-sample T-test on the variable HT resulted in a p-value of 1.099e-07, which is far below the significance level of 0.05. This indicates a statistically significant difference between the mean height and the hypothesized value of 170 cm. Therefore, we reject the null hypothesis and conclude that the mean height is different from 170 cm.

How does the one-sided T-test differ from the two-sided T-test in context of examining infant weight from a low-income neighborhood, and why was a one-sided test deemed appropriate in this study?

A one-sided T-test looks for a difference in only one direction, whereas a two-sided test looks for a difference in either direction. For the infant weights, the research question was specifically whether infants were underweight compared to the population average. Since the concern was only whether the mean weight was less than 7.5 lbs, a one-sided test was appropriate, without considering the alternative that the mean is higher.
