0% found this document useful (0 votes)

24 views19 pages

Mann-Whitney Test for Medians

This document discusses nonparametric statistics (NPS) which are used for data that is not normally distributed. It covers several nonparametric tests including the sign test, Mann-Whitney test, and Kruskal-Wallis test. Examples are provided for both the sign test and Mann-Whitney test. The sign test is used to test if the median of a single sample differs from a hypothesized value, while the Mann-Whitney test compares the medians of two independent samples to determine if they are identical.

Uploaded by

MUHAMMAD IMRAN HAKIM BIN SULAIMAN STUDENT

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views19 pages

Mann-Whitney Test for Medians

Uploaded by

MUHAMMAD IMRAN HAKIM BIN SULAIMAN STUDENT

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

CHAPTER 3:

STATISTICAL
INFERENCES (WEEK 10)
3.1 Introduction
3.2 Sampling distribution WEEK 8

3.3 Inference for single population

3.4 Inference for two populations WEEK 9

3.5 Nonparametric statistics WEEK 10

NONPARAMETRIC STATISTICS (NPS)
INTRODUCTION TO NPS

NPS is the alternative to test a non-normal distribution such as

@ flat distribution
@ peaked distribution
@ skewed distribution
(refer figure 3.9 page 77)
NPS is referred as distribution-free tests.

NPS use median in making inferences about a population (parametric tests use mean).

NPS is also used to infer non-numerical data that require ranking approach such as
1. Nominal data
2. Ordinal data
3. Interval scale or ratio scale data but there is no assumption regarding the probability
distribution of the population where the sample is selected.
Normal distributed data ➔ Parametric test (CI, HT, t-test, F-test,
ANOVA)
Non-normal distributed data ➔ Nonparametric test.

NPS are:
❑ Sign Test
❑ Mann-Whitney Test
❑ Kruskal Wallis Test
❑ Wilcoxon Signed Rank Test
❑ Spearman’s Rank Correlation Test
Sign Test
➔The simplest NPS
➔Test the value of the median from a single sample.
➔Convert the data value into +/- sign.
Sign Test Procedures
1. State the hypotheses (determine the type of test: 2-tailed test,
left/right tailed test).

Note: Mo is the hypothesized median

2. Determine the sign

Put a + sign for the value greater than the hypothesized median value
Put a - sign for the value less than the hypothesized median value
Put a 0 for the value equal to the hypothesized median value

3. Compute test statistic, k

4. Find the critical value from sign test table.
Information needed:
1. significance level, α
2. size sample, n
where n = total number of signs + and - signs

5. Make a decision
Reject Ho if test statistic, k ≤ critical value

6. Conclusion
Example 1:
The following data constitute a random sample of 15 measurement of the octane rating of a
certain kind gasoline:
99.0 102.3 99.8 100.5 99.7 96.2 99.1 102.5
103.3 97.4 100.4 98.9 98.3 98.0 101.6

Test the null hypothesis median = 98 against the alternative hypothesis median > 98 at 0.05
level of significance.
Solution:
1. H0: median = 98
H1: median > 98 (Claim) ➔ Right-tailed test
2. 99.0 102.3 99.8 100.5 99.7 96.2 99.1 102.5
+ + + + + - + + + sign = 12
103.3 97.4 100.4 98.9 98.3 98.0 101.6 -sign = 2
+ - + + + 0 + +/- sign = 14

3. This is right-tailed test, so test statistic, k = number of - signs = 2

cont:

4. significance level, α = 0.05

size sample, n = 14
➔ critical value = 3

5. Since test statistic, k = 2 ≤ critical value = 3 so we reject Ho .

6. There is enough evidence to support the claim that the median of the octane rating of
a certain kind gasoline is greater than 98.
Example 2:
An owner of a souvenir shop hypothesizes that the median number of items sold per day is 40.
A random of 20 days yields the following data for the number of items sold each day. At α =
0.05 test the owner’s hypothesis.
18 43 40 16 22
30 29 32 37 36
39 34 39 45 28
36 40 34 39 52

Solution:
1. H0: median = 40 (Claim)
H1: median ≠ 40 ➔ two-tailed test
2.
- + 0 - -
- - - - -
+ sign = 3
- - - + -
-sign = 15
- 0 - - +
+/- sign = 18
3. This is two-tailed test, so test statistic, k = minimum number between + and - signs = 3
cont:

4. significance level, α = 0.05

size sample, n = 18
➔ critical value = 4

5. Since test statistic, k = 3 ≤ critical value = 4 so we reject Ho .

6. There is enough evidence to reject the claim that the median number of items sold per
day is 40.
Mann-Whitney Test (MWT)
• To determine whether a difference exist between two populations median of non-
normal distribution.
• Sometimes called as Wilcoxon rank sum test.
• Equivalent parametric test to MWT is the t-test for two independent samples.

Mann-Whitney Test Procedures

1. State the hypotheses (determine the type of test)
2. Rank the data values
[Link] all the data from the two samples (regard them as 1 sample).
2. Rank the data from smallest to largest (from 1 and so on, if there is tie
data, each of the data will get the average rank of the data).
3. Calculate the test statistic
1. Label sample 1 and sample 2. Let:
- sample 1 ➔ smaller sample size between the two independent samples.
- if the both samples have same size, either one can be regard as sample 1.
2. List the ranks of data values ranked in step 2 for both sample 1 and sample 2.
3. Calculate the sum of ranks for both samples.
Test statistic, T is based on:
where ƩR1 = sum of ranks from sample 1
n1 = sample size of sample 1
n2 = sample size of sample 2

Summary of test statistic for Mann-Whitney test

4. Find and calculate critical value, Tcv
Tcv = [TL , TU ]
where TL ➔ find from table of MWT for given α, n1 and n2.
TU = n1(n1+ n2+1) - TL

5. Make a decision base on:

Note:
means not included.

6. Conclusion
Example:
Data below show the marks obtained by electrical engineering students in an
examination:
Gender Marks
Male 60
Male 62
Male 78
Male 83
Female 40
Female 65
Female 70
Female 88
Female 92

Can we conclude

= 0.1
the achievements of male and female students identical at significance
level ?
Solution:
1. H0: There is no difference in the achievements of male and female students
H1: There is a difference in the achievements of male and female students ➔ two-tailed test
Gender Marks Rank
2.
Male 60 2
Male 62 3
n=4 Thus n1 is
Male 78 6
Male 83 7 sample from
Female 40 1 male and n2 is
Female 65 4 sample from
Female 70 5 n=5 female
Female 88 8
Female 92 9

n1 = 4this
3. Since
We have 5; T1 = test,
, n2is= two-tailed R1 =thus
2 + 3test
+ 6statistic, T1* = 4 ( 4 +(T51+, T
+ 7 = 18;T = minimum 1)1*−)18 = 22
T = min (T1 ,T1* ) = min (18, 22 ) = 18
4. Critical value, Tcv = [TL , TU ]
α = 0.1 ➔ α/2 = 0.05 ; n1 = 4, n2 = 5, thus from table TL = 13
Calculate TU = n1(n1+ n2+1) - TL = 4(4+5+1) – 13 = 27
➔ Tcv = [13,27]

5. Make a decision
T  TL , TU 
For two-tailed test, we reject H0 when
Since T = 18 ϵ Tcv = [13,27], thus we we fail to reject H0 .

6. Conclusion
There is not enough evidence to support the claim that there is a difference in the
achievement between male and female students.

Common questions

The Spearman's Rank Correlation Test is preferable over Pearson's Correlation when the data have a skewed distribution, contain outliers, or are ordinal in nature. It measures the strength and direction of the monotonic relationship between two variables, using ranks rather than raw data, which makes it less sensitive to anomalies and non-parametric data distributions .

The Mann-Whitney Test, also known as the Wilcoxon rank-sum test, differs from the t-test as it is a nonparametric alternative that does not require normally distributed data. It is used to determine whether there is a difference between the medians of two populations with non-normal distributions. Steps include: 1) Stating the hypotheses, 2) Ranking all combined sample data, 3) Calculating test statistics based on rank sums, 4) Determining critical values from tables, and 5) Making a decision based on comparing test statistics with critical values. This test is equivalent to the t-test for two independent samples when assumptions for the t-test do not hold .

The procedure for conducting a Sign Test involves the following steps: 1) State the hypotheses to determine the type of test (two-tailed or one-tailed), 2) Assign '+' for values greater than the hypothesized median and '-' for values less; '0' for values equal to the median, 3) Compute the test statistic, k, which is the number of minus signs, 4) Find the critical value from the sign test table using the significance level and sample size, and 5) Make a decision by rejecting the null hypothesis if k is less than or equal to the critical value. This test is used to determine if the true population median differs from a specified value .

The test statistic in the Mann-Whitney Test is determined by assigning ranks to the combined data from two samples and calculating the sum of these ranks for each sample. The test statistic, T, is then the smaller or larger rank sum, depending on the sample strategy. Its value, when compared to critical values, indicates whether there is a statistically significant difference between the medians of the two samples, suggesting they do not come from the same distribution if significant .

Nonparametric tests differ from parametric tests as they do not assume a specific probability distribution for the population from which a sample is drawn, making them applicable for non-normal distributions such as flat, peaked, or skewed distributions. They use the median to make inferences about a population rather than the mean, which is used in parametric tests. Nonparametric tests are suitable for non-numerical data that require a ranking approach, including nominal, ordinal, interval scale, or ratio scale data .

The Kruskal Wallis Test is a nonparametric method used for comparing more than two groups when data do not necessarily follow a normal distribution. It extends the Mann-Whitney Test for multiple groups to test the null hypothesis that all groups have the same distribution. The test involves ranking all data together, calculating the sum of ranks for each group, and using these sums to determine the test statistic. A significant test statistic indicates at least one group distribution differs. This test is crucial in scenarios where parametric ANOVA assumptions are violated .

Nonparametric tests, including the Wilcoxon Signed Rank Test, are used instead of parametric tests when the data do not meet the assumptions necessary for parametric tests, such as normal distribution or when dealing with ordinal data. The Wilcoxon Signed Rank Test is particularly suitable for paired samples or repeated measures to test if their population median differences are zero. It ranks the absolute differences of pairs and evaluates signs, providing a robust alternative to the paired t-test for non-normally distributed data .

Using medians rather than means in nonparametric statistical inferences is significant because medians are robust measures not influenced by outliers or skewed data, making them more applicable for non-normally distributed data. This approach allows nonparametric tests to handle a wider range of data types effectively, focusing on the central tendency for ordinal and non-numerical data .

The advantages of the Sign Test include its simplicity and minimal assumptions, making it suitable for small sample sizes and nominal data. It only requires the data to be converted into '+' and '-' signs relative to a median, offering robust alternatives in these scenarios. However, the limitations include its low power compared to other tests, as it ignores the magnitude of differences, potentially overlooking real differences when sample sizes are larger or when more complex data analysis is required .

A researcher might choose the Sign Test over the Wilcoxon Signed Rank Test when the data do not meet the assumptions required for the Wilcoxon Signed Rank Test, such as symmetry of distribution or when the sample size is too small to provide reliable results from more complex tests. The Sign Test is simpler, converting data to '+' and '-' based on a median, and is used when only the signs of differences matter rather than their magnitude, offering a straightforward nonparametric alternative .

Mann-Whitney U Test Overview
No ratings yet
Mann-Whitney U Test Overview
35 pages
Statistical Analysis Course Overview
No ratings yet
Statistical Analysis Course Overview
129 pages
Statistics Support Resource Guide
No ratings yet
Statistics Support Resource Guide
186 pages
Sign Test for Population Median Analysis
No ratings yet
Sign Test for Population Median Analysis
57 pages
Two-Sample Tests of Hypothesis
No ratings yet
Two-Sample Tests of Hypothesis
22 pages
Statistical Significance in Education
No ratings yet
Statistical Significance in Education
65 pages
Understanding Mode, Median, and Mean
No ratings yet
Understanding Mode, Median, and Mean
15 pages
SPSS Data Analysis Assignments Guide
No ratings yet
SPSS Data Analysis Assignments Guide
6 pages
Wilcoxon Rank Sum Test Overview
No ratings yet
Wilcoxon Rank Sum Test Overview
16 pages
Correlation and T-Test Analysis Guide
No ratings yet
Correlation and T-Test Analysis Guide
17 pages
Understanding Measures of Central Tendency
No ratings yet
Understanding Measures of Central Tendency
11 pages
Statistical Treatment of Data Methods
No ratings yet
Statistical Treatment of Data Methods
3 pages
Behavioral Sciences Statistics Guide
No ratings yet
Behavioral Sciences Statistics Guide
5 pages
Hypothesis Testing Overview and Methods
No ratings yet
Hypothesis Testing Overview and Methods
7 pages
Sampling Basics in Statistics
No ratings yet
Sampling Basics in Statistics
16 pages
Hypothesis Testing for Small Samples
No ratings yet
Hypothesis Testing for Small Samples
40 pages
SPSS Analysis: Factor & ANOVA Insights
100% (2)
SPSS Analysis: Factor & ANOVA Insights
15 pages
Chi-Square Test for Association Example
100% (1)
Chi-Square Test for Association Example
8 pages
Objectives of Measures of Dispersion
No ratings yet
Objectives of Measures of Dispersion
71 pages
Chapter 13 Error Analysis Solutions
No ratings yet
Chapter 13 Error Analysis Solutions
17 pages
Understanding the Z-Test in Statistics
No ratings yet
Understanding the Z-Test in Statistics
18 pages
Statistics Practice Test 2023
No ratings yet
Statistics Practice Test 2023
15 pages
Wilcoxon Test for Paired Data Analysis
No ratings yet
Wilcoxon Test for Paired Data Analysis
4 pages
Hypothesis Testing Quiz Questions
No ratings yet
Hypothesis Testing Quiz Questions
4 pages
Independent T-Test Hypothesis Guide
No ratings yet
Independent T-Test Hypothesis Guide
5 pages
Sign Test for Median Hypothesis Testing
No ratings yet
Sign Test for Median Hypothesis Testing
8 pages
Criteria for Effective Sampling Design
100% (1)
Criteria for Effective Sampling Design
13 pages
Sign Test for Paired Sample Analysis
No ratings yet
Sign Test for Paired Sample Analysis
12 pages
Mann-Whitney U Test Explained
No ratings yet
Mann-Whitney U Test Explained
9 pages
Simple Random Sampling Explained
100% (1)
Simple Random Sampling Explained
10 pages
Non-Random Sampling Techniques Explained
94% (17)
Non-Random Sampling Techniques Explained
4 pages
SPSS Analysis on Work-Life Balance and Education
50% (2)
SPSS Analysis on Work-Life Balance and Education
7 pages
Kindergarten Reading Comprehension Analysis
17% (6)
Kindergarten Reading Comprehension Analysis
6 pages
Introduction to Statistics Concepts
No ratings yet
Introduction to Statistics Concepts
161 pages
Z-Test and Hypothesis Testing Guide
No ratings yet
Z-Test and Hypothesis Testing Guide
36 pages
Understanding the Paired Samples T-Test
No ratings yet
Understanding the Paired Samples T-Test
14 pages
Systematic Sampling Method Explained
No ratings yet
Systematic Sampling Method Explained
2 pages
Two-Way Mixed ANOVA Overview
No ratings yet
Two-Way Mixed ANOVA Overview
10 pages
Inferential Statistics Exercises
No ratings yet
Inferential Statistics Exercises
2 pages
Psychology Statistics Assignment Guide
No ratings yet
Psychology Statistics Assignment Guide
18 pages
Hypothesis Testing in SPSS Explained
0% (1)
Hypothesis Testing in SPSS Explained
6 pages
Introduction to Statistics Overview
No ratings yet
Introduction to Statistics Overview
14 pages
Final Exam Review: Statistics Problems
No ratings yet
Final Exam Review: Statistics Problems
5 pages
Hypothesis Testing Example with Solutions
No ratings yet
Hypothesis Testing Example with Solutions
3 pages
Confidence Intervals for Unknown Variance
No ratings yet
Confidence Intervals for Unknown Variance
8 pages
Confidence Intervals and Sample Size
100% (1)
Confidence Intervals and Sample Size
44 pages
Educational Statistics Course Overview
100% (1)
Educational Statistics Course Overview
5 pages
Points and Interval Estimation in Statistics
No ratings yet
Points and Interval Estimation in Statistics
23 pages
Basic Statistics Q&A with Pie Charts
No ratings yet
Basic Statistics Q&A with Pie Charts
9 pages
One-Sample T-Test Explained
100% (1)
One-Sample T-Test Explained
43 pages
Non-Parametric Tests in SPSS
No ratings yet
Non-Parametric Tests in SPSS
37 pages
Advanced Statistical Inference Guide
100% (1)
Advanced Statistical Inference Guide
3 pages
Tests For Two Proportions
No ratings yet
Tests For Two Proportions
29 pages
Understanding One-Way ANOVA
No ratings yet
Understanding One-Way ANOVA
25 pages
Central Tendency and Dispersion Measures
No ratings yet
Central Tendency and Dispersion Measures
31 pages
History and Development of Statistics
No ratings yet
History and Development of Statistics
24 pages
T-Test vs Z-Test: Key Differences
No ratings yet
T-Test vs Z-Test: Key Differences
7 pages
Statistics Assignment 1 Answers
No ratings yet
Statistics Assignment 1 Answers
2 pages
Non-Parametric Statistical Tests Guide
No ratings yet
Non-Parametric Statistical Tests Guide
11 pages
Hypothesis Testing in Statistics
No ratings yet
Hypothesis Testing in Statistics
54 pages
Identifying Independent and Dependent Variables
No ratings yet
Identifying Independent and Dependent Variables
16 pages
Literature Review and Research Frameworks
No ratings yet
Literature Review and Research Frameworks
4 pages
Impact of Manganese on Arabidopsis Roots
No ratings yet
Impact of Manganese on Arabidopsis Roots
2 pages
Machine Learning for Heart Disease Prediction
No ratings yet
Machine Learning for Heart Disease Prediction
8 pages
Statistical Methods in Medical Research
No ratings yet
Statistical Methods in Medical Research
15 pages
MSc in Environmental Contamination
100% (1)
MSc in Environmental Contamination
78 pages
Infection Prevention Practices in Ethiopia
No ratings yet
Infection Prevention Practices in Ethiopia
29 pages
GEE and GLMM in Neuroscience Research
No ratings yet
GEE and GLMM in Neuroscience Research
10 pages
Moneyball: Lessons in Strategy and Data
No ratings yet
Moneyball: Lessons in Strategy and Data
4 pages
GameTruck and GameTrailer Spawns
No ratings yet
GameTruck and GameTrailer Spawns
9 pages
Estimating Multiple Linear Regression
No ratings yet
Estimating Multiple Linear Regression
18 pages
Hypothesis Testing with Ducks
No ratings yet
Hypothesis Testing with Ducks
5 pages
Statistics Applications in Business Analysis
No ratings yet
Statistics Applications in Business Analysis
3 pages
SMS Spam Filter Using Naïve Bayes
No ratings yet
SMS Spam Filter Using Naïve Bayes
5 pages
Education Indicators and Research Outcomes
No ratings yet
Education Indicators and Research Outcomes
13 pages
Data Management: Central Tendency & Dispersion
No ratings yet
Data Management: Central Tendency & Dispersion
18 pages
Practical Research 2: Quantitative Methods
No ratings yet
Practical Research 2: Quantitative Methods
6 pages
CA Zambia Syllabus Overview
100% (4)
CA Zambia Syllabus Overview
75 pages
2011 June Exam
100% (1)
2011 June Exam
19 pages
Writing Effective Research Objectives
No ratings yet
Writing Effective Research Objectives
28 pages
Waste Management Design for AMC
No ratings yet
Waste Management Design for AMC
25 pages
Survival Analysis in Medical Research
No ratings yet
Survival Analysis in Medical Research
16 pages
Understanding Spatial Autocorrelation
No ratings yet
Understanding Spatial Autocorrelation
19 pages
Pengantar Biostatistik dan Statistik Medis
No ratings yet
Pengantar Biostatistik dan Statistik Medis
16 pages
Excel MCQ Questions and Answers
No ratings yet
Excel MCQ Questions and Answers
48 pages
Biostatistics: Key Concepts and Applications
No ratings yet
Biostatistics: Key Concepts and Applications
26 pages
Confidence Intervals for Mean Estimation
No ratings yet
Confidence Intervals for Mean Estimation
29 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
15 pages
NCQC Quality Circle Knowledge Test 2019
No ratings yet
NCQC Quality Circle Knowledge Test 2019
3 pages
Hostel vs. Day Scholar Academic Performance
No ratings yet
Hostel vs. Day Scholar Academic Performance
13 pages

Mann-Whitney Test for Medians

Uploaded by

Mann-Whitney Test for Medians

Uploaded by

CHAPTER 3:

3.3 Inference for single population

3.5 Nonparametric statistics WEEK 10

NPS is the alternative to test a non-normal distribution such as

Note: Mo is the hypothesized median

3. Compute test statistic, k

3. This is right-tailed test, so test statistic, k = number of - signs = 2

4. significance level, α = 0.05

5. Since test statistic, k = 2 ≤ critical value = 3 so we reject Ho .

4. significance level, α = 0.05

5. Since test statistic, k = 3 ≤ critical value = 4 so we reject Ho .

Mann-Whitney Test Procedures

Summary of test statistic for Mann-Whitney test

5. Make a decision base on:

Can we conclude

Common questions

In what scenarios would the Spearman's Rank Correlation Test be preferable to Pearson's Correlation, and what does it measure?

How does the Mann-Whitney Test compare to the t-test for two independent samples, and what are the main steps involved in performing this test?

What is the procedure for conducting a Sign Test and how is it used to determine the median of a sample?

How is the test statistic determined in the Mann-Whitney Test, and what does its value indicate about the sample data?

How do nonparametric tests differ from parametric tests in terms of underlying assumptions and applications?

Explain how the Kruskal Wallis Test is used for multiple groups and its significance in nonparametric statistics.

What are the main reasons for using nonparametric tests like the Wilcoxon Signed Rank Test instead of parametric counterparts?

Discuss the significance of using medians rather than means in nonparametric statistical inferences.

In the context of nonparametric tests, what are the advantages and limitations of using the Sign Test?

Why might a researcher choose the Sign Test over the Wilcoxon Signed Rank Test for a particular study?

You might also like