0% found this document useful (0 votes)

28 views58 pages

Biostatistics: Sampling & Hypothesis Testing

The document discusses sampling, hypothesis testing, and errors in hypothesis testing. It provides examples of hypothesis testing including comparing forced expiratory volume between a treatment and control group in a clinical trial.

Uploaded by

aishp2897

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views58 pages

Biostatistics: Sampling & Hypothesis Testing

Uploaded by

aishp2897

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Basic Biostatistics Part 2

1st March, 2017

Content

• Part 1 Summary
• Sampling
• Statistical Hypothesis Tests
• Errors in Hypothesis Tests
• Power and Sample Size
• Examples
• Correlation and Regression
Part 1 Summary

• What were the key learning points from Part 1?

− In groups, identify 3 key learning points from

the first session
Sampling
Sampling
• An investigation of a population is said to be a survey or study of the
population
• A population is a group of individuals or objects that meets a set of
pre-defined criteria; e.g.
- All people with permanent residence in the UK
- All patient records held in a database
- All patients with schizophrenia
- All staff members of an organisation
- All patients registered to a particular specialist
- All members of the population diagnosed with a particular health
condition
• A survey or study that collects information from every member of a
population is referred to as a census
Sampling

• Not always possible to collect information from every

member of a population due to time and resources
• A ‘good sample’ can be used to reliably estimate
characteristics (e.g. the mean) of the population
• Sample – any subset of a population

Sample

Population
Sampling Error

• Errors in surveys can be divided into two categories

• Sampling error - error due to taking a sample rather
than studying the whole population

- e.g. if a psychiatrist randomly selects a sample of

patients and records the duration of each appointment,
the average treatment time can be calculated
- if the times for all patients were recorded (i.e. the entire
population) then the population average would most
likely differ from the sample average
Non-sampling error

• Non-Sampling error is error due to:

- poor selection of strata or sample (coverage errors)
- poor data entry (processing errors)
- inaccurate responses (measurement errors)
- non-response errors
• In surveys, non-sampling errors can be more of a
problem than sampling errors
Statistical Hypothesis Tests
Hypothesis Testing

• A process called Hypothesis Testing is used

to quantify a belief against a particular
hypothesis about the population
• There are many different types of hypothesis
tests
• Five stages for hypothesis testing can be
defined:
5 Stages

1. Define the Null & Alternative Hypotheses

2. Collect data
3. Calculate the value of the test statistic
4. Compare the value of the test statistic to
values from a known probability
distribution
5. Interpret the P-value and results
The Null Hypothesis

• The Null Hypothesis is tested which assumes

no effect (e.g. the difference in means equals
zero) in the population

• Example: Comparing the rates of

hallucinations in men and woman in the
population
− Null Hypothesis (H0): rates of hallucinations
are the same in men and woman in the
population
The Alternative Hypothesis

• The Alternative Hypothesis is holds if the Null

Hypothesis is not true

• Example
− Alternative Hypothesis (H1): rates of
hallucinations are different in men and
woman in the population
The test statistic

• After data collection, the sample data is used

to calculate a test statistic

• The test statistic is effectively the amount of

evidence in the data against H0

• Generally, the larger the value (irrelevant of

sign), the greater the evidence against H0
The P-value

• The test statistic is compared to values from

the relevant probability distribution to obtain a
P-value
• The P-value is the probability of obtaining
our results, or something more extreme, if
the Null Hypothesis is true
• The smaller the P-value, the greater the
evidence against H0
Rejecting H0

• Conventionally, if the P-value < 0.05, there is

sufficient evidence to reject H0

• There is only a small chance of the results

occurring if H0 is true
– H0 is rejected, the results are statistically
significant at the 5% level
Not rejecting H0

• If the P-value ≥ 0.05, there is insufficient

evidence to reject H0
– H0 is not rejected, the results are not
statistically significant at the 5% level

• NB: This does not mean that the null

hypothesis is true, simply that we do not have
enough evidence to reject it!
Parametric vs. Non-Parametric tests

• Tests which are based on the assumption that the

data follows a known probability distribution (often the
Normal) are known as parametric tests

• Sometimes data does not conform to the assumption

so non-parametric tests can be used

• Non-Parametric tests make no assumption about

the probability distribution
Non-parametric tests
• Useful when:

− sample size is small

− data is measured on a categorical scale (though
they are used for numerical data as well)

• However:

− they have less power of detecting a real difference

than the equivalent parametric tests
− they lead to decisions rather than generating a true
understanding of the data
Statistical tests

• Numerical data (Parametric tests)

– One-sample t-test
– Independent t-test
– Paired t-test
– One-way ANOVA
Statistical tests

• Numerical data, (non-parametric tests)

– Sign test
– Wilcoxon signed rank test
– Wilcoxon rank sum test
– Kruskal-Wallis test
Statistical tests

• Categorical data

– z-test for a proportion

– Sign test
– McNemar’s test
– Chi-squared test
– Chi-squared trend test
– Fisher’s exact test
Choosing a statistical test

• Useful medical statistical books will contain a

flowchart to provide guidance

• Considerations include:

– what is the data type?

– how many groups of data are there?
– can a probability distribution be assumed?
Errors in Hypothesis Testing
Making a wrong decision
• There is the possibility of making a wrong
decision when conducting a Hypothesis test

• A wrong decision may be made when rejecting

or not rejecting the Null Hypothesis

• The possible mistakes that can be made are a:

– Type I error
– Type II error
Type I error
• Rejecting the Null Hypothesis when it is true

• Concluding that there is an effect (difference)

when in reality there is none

• The maximum chance of making a Type I error

is denoted by alpha α

• α is the significance level of the test, we reject

the null hypothesis if the p-value is less than
the significance level
Type II error
• Not rejecting the Null Hypothesis when it is
false

• Concluding that there is no effect (difference)

when one really exists

• The chance of making a Type II error is

denoted by beta β

• Its compliment 1- β, is the Power of the test

Power and Sample Size
Power of the test

• The Power is the probability of rejecting the

Null Hypothesis when it is false

– i.e. the probability of making a correct decision

• The ideal power of the test is 100%

• However there is always a possibility of making

a Type II error
Sample size

• If the number of patients/samples in the study is small,

there may be inadequate power to detect an important
existing effect – wasted resources

• If the sample is too large, the study may be

unnecessarily time – consuming, expensive or
unethical

• Need to choose an optimal sample size that strikes a

balance between the implications of making a Type I or
Type II error
Calculating an optimal sample size for a test

• The following quantities need to be specified at

the design stage of the investigation in order to
calculate an optimal sample size:

– The Power
– Significance Level
– Variability
– Smallest effect of interest
Recall: 5 stages

1. Define the Null & Alternative Hypotheses

2. Collect data
3. Calculate the value of the test statistic
4. Compare the value of the test statistic to
values from a known probability distribution
5. Interpret the P-value and results
Examples
Scenario 1

• A randomised double blind trial to determine

the effect of inhaled corticosteroids on
wheezing episodes in children
• An inhaled beclomethasone dipropionate was
compared to a Placebo
• Response variable was average forced
expiratory volume (FEV) over a 6 month
period
• Sample sizes: Treatment group =50, Placebo
group = 48
Stages 1 and 2
• Stage 1: Define Ho and H1:
Ho: the mean FEV in the population of children is
the same in the two groups
H1: the mean FEV in the population of children is
different in the two groups

• Stage 2: Collect data

Graphical Analysis
Boxplots comparing treated group to control group
2.50

2.25
Forced Expiratory Volume (FEV)

2.00

1.75

1.50

1.25

1.00

Treated Group Control Group

Selecting a test

• What is the data type? Numerical

• How many groups are there? 2
• Are the groups Paired or Independent?
Independent
• Is Normality and equal variances of the data
assumed? Yes

→Unpaired (Independent) t-test

Analysis Output

Stages 3 and 4: Calculate the

Sample N Mean StDev SE Mean test statistic and compare to
1 50 1.640 0.286 0.040 values from a known probability
2 48 1.537 0.246 0.035 distribution

Difference = mu (1) - mu (2)

Estimate for difference: 0.1033
95% CI for difference: (-0.0038, 0.2104)
T-Test of difference = 0 (vs not =): T-Value = 1.91 P-Value = 0.059 DF = 96
Both use Pooled StDev = 0.2670
Stage 5: Interpret the results

• The P-value is 0.059

• There is insufficient evidence (just!) to reject Ho
at the 5% level
• There is insufficient statistical evidence of a
difference between the 2 groups
• The Power of the Test should be checked
• A Type II error may be made when not
rejecting Ho
Scenario 2

• A study was conducted to determine if a heart condition

influences the age at which children start to walk
• Response variable was age the children started to walk
• 30 children with a specific heart condition were analysed in
the study
• Children (in general) are known to start walking at an age
of 11.4 months
• Does the heart condition influence the age at which
children start to walk?
Stages 1 and 2

• Stage 1: Define Ho and H1

Ho: the mean walking age of the children with
the heart condition = 11.4 months
H1: the mean walking age of the children with
the heart condition ≠ 11.4 months
• Stage 2: Collect data
Graphical Analysis
Histogram showing walking age of children

4
Frequency

0
10 12 14 16 18
Months
Selecting a test

• What is the data type? Numerical

• How many groups are there? 1
• Is Normality of the data assumed? Yes

→One-sample t-test
Analysis Output

One-Sample T Stages 3 and 4: Calculate the test

statistic and compare to values from a
known prob distribution
Test of mu = 11.4 vs not = 11.4

N Mean StDev SE Mean 95% CI T P

30 13.158 2.583 0.472 (12.193, 14.123) 3.73 0.001
Stage 5: Interpret the results

• The P-value is 0.001

• There is strong evidence to reject Ho
• There is statistical evidence that the heart
condition influences the age at which children
start to walk
• The Probability that a Type I error has been
made in drawing this conclusion is 0.1%
Correlation and Regression
Correlation and Regression

• Correlation
– measures the strength of association
between two variables

• Regression
– models a relationship between two or
more variables
Correlation
• The degree of association between two variables is
called their correlation

• Positive correlation - when the points appear in a

band running from lower left to upper right (when x
increases, y increases)

• Negative correlation - when the points appear in a

band from upper left to lower right (when x increases,
y decreases)

• No correlation - when the points are randomly

scattered about the graph
Correlation and “Line of best fit”

Here are
some
examples
Be Careful!

"Correlation does not imply causality"

• In other words, the scatter plot may show that

a relationship exists, but it does not and cannot
prove that one factor is causing the other

• The scatter plot can only provide a clue that

two factors may be “cause and effect”
Correlation - example

• Driving test scores – written paper

• Outcome compared by plotting scores against

number of lessons (1-10)

– does score improve as the number of lessons

increases?
Scatter plot for learner drivers
170

160

150

140
marks3

130

120

110

100

90
0 2 4 6 8 10
classes
Linear Regression

• Investigates a straight line (linear) association

between variables

• Straight line fitted to the scatter diagram is

known as the regression equation

• Least squares – the sum of the squared

differences between the observed and
predicted values is minimised
Medical example

• Does increasing hardness improve abrasion resistance

for composites?

• Does increasing etch time improve bond strength to

enamel?

• Both questions require a regression approach

– using just two or three materials of different hardness
is not acceptable

– using just two etch times would not provide answers

Data

Composite Hardness Wear rate

1 120 56
2 168 46
3 290 21
4 42 98
5 78 80
6 90 65
7 130 32
Regression equation 1

A regression equation is: wear = 94.6 - 0.288 hardness

Fitted Line Plot

wear = 94.65 - 0.2882 hardness
100 S 14.5829
R-Sq 75.4%
R-Sq(adj) 70.4%

60
wear

0
50 100 150 200 250 300
hardness
Regression equation 2
• Etch time 5 to 60 s
• Bond strength 15 to 26 MPa
Regression equation: bond strength = 17.3 + 0.110 etch time
Fitted Line Plot
bond strength = 17.31 + 0.1103 etch time
27.5 S 2.51095
R-Sq 35.2%
R-Sq(adj) 32.2%
25.0
bond strength

22.5

20.0

17.5

15.0

0 10 20 30 40 50 60
etch time
Summary

• Part 2 Summary
• Sampling
• Statistical Hypothesis Tests
• Errors in Hypothesis Tests
• Power and Sample Size
• Examples
• Correlation and Regression

Statistical Hypothesis Testing Guide
No ratings yet
Statistical Hypothesis Testing Guide
34 pages
ST Analysis For Beginners
No ratings yet
ST Analysis For Beginners
51 pages
Sampling Distribution & Hypothesis Testing
No ratings yet
Sampling Distribution & Hypothesis Testing
31 pages
Hypothesis Tests
No ratings yet
Hypothesis Tests
19 pages
Hypothesis Testing and Bootstrapping Notes
No ratings yet
Hypothesis Testing and Bootstrapping Notes
9 pages
Understanding Z-Value in Hypothesis Testing
No ratings yet
Understanding Z-Value in Hypothesis Testing
60 pages
Understanding Hypothesis Testing
No ratings yet
Understanding Hypothesis Testing
86 pages
Understanding Null Hypothesis
No ratings yet
Understanding Null Hypothesis
34 pages
Data Analysis and Hypothesis Testing
No ratings yet
Data Analysis and Hypothesis Testing
9 pages
Biostatistics Skills Training with R & Excel
No ratings yet
Biostatistics Skills Training with R & Excel
66 pages
Hypothesis Testing BTE 711 NOTE 2
No ratings yet
Hypothesis Testing BTE 711 NOTE 2
7 pages
Q4 Lesson 4 Hypothesis Testing
No ratings yet
Q4 Lesson 4 Hypothesis Testing
204 pages
Understanding Hypothesis Testing Basics
No ratings yet
Understanding Hypothesis Testing Basics
32 pages
Anatomy of Confidence Intervals Explained
No ratings yet
Anatomy of Confidence Intervals Explained
70 pages
Understanding Inferential Statistics
No ratings yet
Understanding Inferential Statistics
21 pages
Hypothesis Testing Chi Square T Test & Z Test
No ratings yet
Hypothesis Testing Chi Square T Test & Z Test
12 pages
Hypothesis Testing Overview and Examples
No ratings yet
Hypothesis Testing Overview and Examples
44 pages
Inferential Statistics in Social Research
No ratings yet
Inferential Statistics in Social Research
33 pages
Understanding Hypothesis Testing Basics
No ratings yet
Understanding Hypothesis Testing Basics
17 pages
Hypothesis Testing Reviewer
No ratings yet
Hypothesis Testing Reviewer
95 pages
Biostatistic - 8 Hypotesis Tesing
No ratings yet
Biostatistic - 8 Hypotesis Tesing
46 pages
Statistical Inference and Hypothesis Testing
No ratings yet
Statistical Inference and Hypothesis Testing
46 pages
Point and Interval Estimation Explained
No ratings yet
Point and Interval Estimation Explained
58 pages
Understanding Hypothesis Testing Power
No ratings yet
Understanding Hypothesis Testing Power
37 pages
Hypothesis Testing Explained
No ratings yet
Hypothesis Testing Explained
12 pages
Hypothesis Testing Principles Explained
No ratings yet
Hypothesis Testing Principles Explained
2 pages
Estimation and Hypothesis Testing Explained
No ratings yet
Estimation and Hypothesis Testing Explained
5 pages
Statistical Analysis for Medical Residents
No ratings yet
Statistical Analysis for Medical Residents
50 pages
Statistical Inference
No ratings yet
Statistical Inference
23 pages
Hypothesis Testing Fundamentals
No ratings yet
Hypothesis Testing Fundamentals
33 pages
Understanding Hypothesis Testing in Research
No ratings yet
Understanding Hypothesis Testing in Research
72 pages
Parametric vs Non-Parametric Tests Guide
No ratings yet
Parametric vs Non-Parametric Tests Guide
49 pages
Hypothesis Testing Overview and Steps
No ratings yet
Hypothesis Testing Overview and Steps
93 pages
Hypothesis Testing in Statistics
No ratings yet
Hypothesis Testing in Statistics
69 pages
Module 4 - Lesson 2
No ratings yet
Module 4 - Lesson 2
12 pages
Understanding Inferential Statistics Basics
No ratings yet
Understanding Inferential Statistics Basics
37 pages
Hypothesis Testing in Medical Research
No ratings yet
Hypothesis Testing in Medical Research
56 pages
Hypothesis Testing and Significance Tests
No ratings yet
Hypothesis Testing and Significance Tests
37 pages
Understanding Hypothesis Testing Basics
No ratings yet
Understanding Hypothesis Testing Basics
4 pages
Understanding Hypothesis Testing Basics
No ratings yet
Understanding Hypothesis Testing Basics
34 pages
Critical Value for Two Means Test
No ratings yet
Critical Value for Two Means Test
35 pages
Hypothesis Testing in Inferential Statistics
No ratings yet
Hypothesis Testing in Inferential Statistics
60 pages
Understanding Hypothesis Testing in Statistics
No ratings yet
Understanding Hypothesis Testing in Statistics
17 pages
Hypothesis Testing in Statistics
No ratings yet
Hypothesis Testing in Statistics
26 pages
Understanding Hypothesis Testing Basics
No ratings yet
Understanding Hypothesis Testing Basics
15 pages
Understanding Inferential Statistics
No ratings yet
Understanding Inferential Statistics
28 pages
Introduction to Inferential Statistics
100% (6)
Introduction to Inferential Statistics
28 pages
Understanding Hypothesis Testing Basics
No ratings yet
Understanding Hypothesis Testing Basics
27 pages
Hypothesis Testing in Biostatistics
No ratings yet
Hypothesis Testing in Biostatistics
38 pages
T-test vs ANOVA: Statistical Comparison Guide
No ratings yet
T-test vs ANOVA: Statistical Comparison Guide
39 pages
Understanding Inferential Statistics
No ratings yet
Understanding Inferential Statistics
42 pages
Understanding Hypothesis Testing Basics
No ratings yet
Understanding Hypothesis Testing Basics
15 pages
Understanding Inferential Statistics
No ratings yet
Understanding Inferential Statistics
83 pages
Hypothesis Testing in Statistics Basics
No ratings yet
Hypothesis Testing in Statistics Basics
51 pages
Introduction to Inferential Statistics
No ratings yet
Introduction to Inferential Statistics
34 pages
Hypothesis Testing Explained: Steps & Errors
No ratings yet
Hypothesis Testing Explained: Steps & Errors
11 pages
Hypothesis Testing With Errors and Examples
No ratings yet
Hypothesis Testing With Errors and Examples
3 pages
Elbow Carrying Angle: Gender Differences
100% (1)
Elbow Carrying Angle: Gender Differences
1 page
Ankle-Foot Complex Biomechanics Overview
No ratings yet
Ankle-Foot Complex Biomechanics Overview
13 pages
MSK 2010
No ratings yet
MSK 2010
27 pages
Overview of Resistance Exercise Techniques
No ratings yet
Overview of Resistance Exercise Techniques
22 pages
Research Study Methodology Overview
No ratings yet
Research Study Methodology Overview
1 page
Master's Guide to Physiotherapy Research
No ratings yet
Master's Guide to Physiotherapy Research
2 pages
Motor Control and Learning
89% (9)
Motor Control and Learning
592 pages
ANOVA Test: Formula and Calculations
100% (1)
ANOVA Test: Formula and Calculations
8 pages
Survey Research Design and Methods
No ratings yet
Survey Research Design and Methods
18 pages
Amputation and Prosthesis Overview
100% (1)
Amputation and Prosthesis Overview
5 pages
Understanding Measures of Central Tendency
No ratings yet
Understanding Measures of Central Tendency
46 pages
Understanding Physical Fitness Essentials
No ratings yet
Understanding Physical Fitness Essentials
2 pages
Understanding VO2max and Its Impact
100% (1)
Understanding VO2max and Its Impact
7 pages
Kinetic Analysis of Gait Mechanics
No ratings yet
Kinetic Analysis of Gait Mechanics
20 pages
Gait Biomechanics: Phases & Mechanics
No ratings yet
Gait Biomechanics: Phases & Mechanics
55 pages
Hypermobility Exercise Program Guide
No ratings yet
Hypermobility Exercise Program Guide
7 pages
MPT Part-I Exam: Physical Diagnosis Papers
100% (3)
MPT Part-I Exam: Physical Diagnosis Papers
10 pages
Exercise Guide for Kidney Disease
No ratings yet
Exercise Guide for Kidney Disease
16 pages
Managing Rotator Cuff Injuries
No ratings yet
Managing Rotator Cuff Injuries
76 pages
Nanda's Electrotherapy Guide PDF
92% (13)
Nanda's Electrotherapy Guide PDF
535 pages
Understanding Applied Work Physiology
No ratings yet
Understanding Applied Work Physiology
2 pages
Blood Gas Analysis Guidelines
No ratings yet
Blood Gas Analysis Guidelines
1 page
ACL Injury Assessment and Care Plan
100% (1)
ACL Injury Assessment and Care Plan
10 pages
Endotracheal Tube Extubation Procedure
No ratings yet
Endotracheal Tube Extubation Procedure
6 pages
Surface Models: Techniques & Exercises
No ratings yet
Surface Models: Techniques & Exercises
8 pages
Wind Turbine State-Space Modeling Techniques
No ratings yet
Wind Turbine State-Space Modeling Techniques
33 pages
VoLTE Technology Market Analysis
No ratings yet
VoLTE Technology Market Analysis
29 pages
MANET Routing Protocols Overview
No ratings yet
MANET Routing Protocols Overview
74 pages
Rapid Test for Carbapenemase Detection
No ratings yet
Rapid Test for Carbapenemase Detection
2 pages
Thermodynamics Problems and Solutions
No ratings yet
Thermodynamics Problems and Solutions
7 pages
Finding Shippers 101
No ratings yet
Finding Shippers 101
14 pages
Credit Analyst Interview Master Guide 30QA
No ratings yet
Credit Analyst Interview Master Guide 30QA
6 pages
Java Programming Assignment Guide
No ratings yet
Java Programming Assignment Guide
8 pages
High Performance PWM Controller BIT3368O
No ratings yet
High Performance PWM Controller BIT3368O
9 pages
Snowfall Patterns in Southern Appalachians
No ratings yet
Snowfall Patterns in Southern Appalachians
20 pages
Beam
No ratings yet
Beam
29 pages
Types of Concrete Admixtures Explained
No ratings yet
Types of Concrete Admixtures Explained
5 pages
Hough Transform for Edge Detection
No ratings yet
Hough Transform for Edge Detection
16 pages
Annona Squamosa Extract's Effects on A549 Cells
No ratings yet
Annona Squamosa Extract's Effects on A549 Cells
10 pages
Energy Efficiency in Motor Systems Proceedings of The 11th International Conference
100% (1)
Energy Efficiency in Motor Systems Proceedings of The 11th International Conference
748 pages
A Pavement Crack Detection and Evaluation Framework For A
No ratings yet
A Pavement Crack Detection and Evaluation Framework For A
24 pages
Atmospheric Pressure Plasma Jet Powered by Piezoelectric Direct Discharge
No ratings yet
Atmospheric Pressure Plasma Jet Powered by Piezoelectric Direct Discharge
14 pages
Metric Handbook for Building Materials
No ratings yet
Metric Handbook for Building Materials
22 pages
NC2x MIDI Implementation Overview
No ratings yet
NC2x MIDI Implementation Overview
9 pages
Oil Well Cementing Techniques and Types
No ratings yet
Oil Well Cementing Techniques and Types
31 pages
Understanding Numbers in AS3
No ratings yet
Understanding Numbers in AS3
4 pages
Overview of Biomedical Sensors
100% (1)
Overview of Biomedical Sensors
20 pages
Compact Energy-Saving Medium-Voltage Drive
No ratings yet
Compact Energy-Saving Medium-Voltage Drive
24 pages
FOC Control of DFIG with MRAS Observer
No ratings yet
FOC Control of DFIG with MRAS Observer
7 pages
Mycoplasma Detection 5994 2657EN Agilent
No ratings yet
Mycoplasma Detection 5994 2657EN Agilent
3 pages
Standard U-Bolt Specifications and Ratings
100% (1)
Standard U-Bolt Specifications and Ratings
2 pages
PCS 7 - Programming Instructions For Blocks
50% (4)
PCS 7 - Programming Instructions For Blocks
220 pages
Class XII Applied Mathematics Pre-Board 2025-26
No ratings yet
Class XII Applied Mathematics Pre-Board 2025-26
5 pages
2000+ Free Photoshop Patterns Pack
No ratings yet
2000+ Free Photoshop Patterns Pack
40 pages