Module 4
Module 4
1. Correlation
Definition:
Correlation measures the degree and direction of relationship between two variables.
Example:
2. Types of Correlation
1. Positive Correlation
2. Negative Correlation
• High correlation
• Low correlation
• Zero correlation (r = 0)
1. Scatter Diagram
• Graphical method
Formula:
Interpretation of r:
Value of r Meaning
+1 Perfect positive
−1 Perfect negative
0 No correlation
Formula:
6∑𝑑 2
𝑟 =1−
𝑛(𝑛2 − 1)
Where:
• 𝑑= difference in ranks
• 𝑛= number of observations
4. Properties of Correlation
5. Limitations of Correlation
6. Regression Analysis
Definition:
Regression analysis studies the functional relationship between variables and helps in
prediction.
7. Types of Regression
1. Simple Regression
8. Regression Lines
1. Regression of Y on X
2. Regression of X on Y
Equations:
Regression of Y on X
𝑌 − 𝑌ˉ = 𝑏𝑦𝑥 (𝑋 − 𝑋ˉ)
Regression of X on Y
𝑋 − 𝑋ˉ = 𝑏𝑥𝑦 (𝑌 − 𝑌ˉ)
9. Regression Coefficients
Cov(𝑋, 𝑌)
𝑏𝑦𝑥 =
𝜎𝑋2
Cov(𝑋, 𝑌)
𝑏𝑥𝑦 =
𝜎𝑌2
𝑟 = √𝑏𝑥𝑦 ⋅ 𝑏𝑦𝑥
Sign of r depends on regression coefficients
Correlation Regression
Symmetrical Asymmetrical
• Sales forecasting
• Demand estimation
• Cost analysis
• Market research
Interpretation:
• Points close to a straight line → High correlation
Formula:
1 − 𝑟2
𝑃. 𝐸. = 0.6745 ×
√𝑛
Interpretation:
𝑟 2 = (correlation coefficient)2
Meaning:
Example:
1 − 𝑟2
6. Assumptions of Correlation
• Relationship is linear
• Data is homogeneous
• No extreme outliers
8. Spurious Correlation
Example:
• Ice cream sales & drowning cases
(Both increase in summer but not directly related)
Regression shows:
• Two lines:
o Y on X
o X on Y
Key Points:
Concept-based question
13. Difference Between r and r²
r r²
Range: -1 to +1 Range: 0 to 1
• Business forecasting
• Demand estimation
• Price analysis
• Financial planning
• Economic research
High correlation:
• Strong relationship
Low correlation:
• Weak relationship
Zero correlation:
• No relationship
Shortcut Formula
𝑁∑𝑋𝑌 − (∑𝑋)(∑𝑌)
𝑟=
√[𝑁∑𝑋 2 − (∑𝑋)2 ][𝑁∑𝑌 2 − (∑𝑌)2 ]
6∑𝑑 2
𝑟 =1−
𝑛(𝑛2 − 1)
Where:
• 𝑑= difference in ranks
• 𝑛= number of observations
1 − 𝑟2
𝑃. 𝐸. = 0.6745 ×
√𝑛
4. Coefficient of Determination
𝑟2
5. Coefficient of Non-Determination
1 − 𝑟2
6. Regression Equations
Regression of Y on X
𝑌 − 𝑌ˉ = 𝑏𝑦𝑥 (𝑋 − 𝑋ˉ)
Regression of X on Y
𝑋 − 𝑋ˉ = 𝑏𝑥𝑦 (𝑌 − 𝑌ˉ)
7. Regression Coefficients
𝜎𝑌
𝑏𝑦𝑥 = 𝑟 ⋅
𝜎𝑋
𝜎𝑋
𝑏𝑥𝑦 = 𝑟 ⋅
𝜎𝑌
𝑟 = √𝑏𝑥𝑦 ⋅ 𝑏𝑦𝑥
Data:
X12345
Y 2 4 6 8 10
Step 1: Observe relationship
Y = 2X → perfectly linear
So,
𝑟 = +1
Q2. Find r
X123
Y235
∑𝑋 = 6, ∑𝑌 = 10
∑𝑋𝑌 = (1 × 2) + (2 × 3) + (3 × 5) = 2 + 6 + 15 = 23
∑𝑋 2 = 14, ∑𝑌 2 = 38
X 10 20 30 40
Y 15 25 35 45
X Rank Y Rank d
10 1 15 1 0
20 2 25 2 0
30 3 35 3 0
40 4 45 4 0
∑𝑑 2 = 0
Step 2: Formula
6(0)
𝑟 =1− =1
4(16 − 1)
4. Probable Error
Given: 𝑟 = 0.8, 𝑛 = 16
1 − 𝑟2
𝑃. 𝐸. = 0.6745 ×
√𝑛
1 − 0.64 0.36
= 0.6745 × = 0.6745 × = 0.6745 × 0.09 ≈ 0.061
4 4
Answer: 0.061
5. Regression Coefficient
Given:
𝑟 = 0.6, 𝜎𝑋 = 2, 𝜎𝑌 = 3
𝜎𝑌 3
𝑏𝑦𝑥 = 𝑟 × = 0.6 × = 0.6 × 1.5 = 0.9
𝜎𝑋 2
Answer: 0.9
6. Regression Equation
Given:
𝑋ˉ = 10, 𝑌ˉ = 20, 𝑏𝑦𝑥 = 2
Step 1: Formula
𝑌 − 𝑌ˉ = 𝑏𝑦𝑥 (𝑋 − 𝑋ˉ)
Step 2: Substitute
𝑌 − 20 = 2(𝑋 − 10)
𝑌 − 20 = 2𝑋 − 20
𝑌 = 2𝑋
Answer: 𝑌 = 2𝑋
7. Find Correlation from Regression Coefficients
Q7. Given:
Answer: 0.63
8. Coefficient of Determination
Q8. If 𝒓 = 𝟎. 𝟕, find 𝒓𝟐
𝑟 2 = (0.7)2 = 0.49
Answer: 0.49
Basics of Correlation
1. Correlation measures:
A. Average
B. Relationship between variables
C. Frequency
D. Dispersion
Answer: B
3. If r = +1, it indicates:
A. No correlation
B. Perfect positive correlation
C. Perfect negative correlation
D. Weak correlation
Answer: B
4. If r = 0, it means:
A. Perfect correlation
B. No correlation
C. Negative correlation
D. High correlation
Answer: B
Types of Correlation
7. Scatter diagram is a:
A. Mathematical method
B. Graphical method
C. Statistical test
D. Algebraic method
Answer: B
Karl Pearson
Regression
11. Regression is used for:
A. Description
B. Prediction
C. Classification
D. Tabulation
Answer: B
Regression Coefficients
Relationship
Probable Error
Determination
18. r² represents:
A. Mean
B. Variance
C. Explained variation
D. Total frequency
Answer: C
Concepts
Numerical-Type
Application-Based