0% found this document useful (0 votes)

7 views26 pages

Inference in Simple Regression Analysis

Uploaded by

eceozkaya33

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views26 pages

Inference in Simple Regression Analysis

Uploaded by

eceozkaya33

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Inference in Simple Regression

(SW Chapter 5)

1
Overview of where we are heading:
We want to learn about a population relation. We have data from a
sample, so there is sampling uncertainty. There are five steps towards
this goal:
1. State the population object of interest.
2. Provide an estimator of this population object.
3. Derive the sampling distribution of the estimator (this requires
certain assumptions). In large samples this sampling
distribution will be approximately normal (CLT).
4. The square root of the estimated variance of the sampling
distribution is the standard error (SE) of the estimator.
5. Use the SE to construct t-statistics (for hypothesis tests) and
confidence intervals.

2
Object of interest is described by the population regression model:
Yi = 0 + 1Xi + ui, i = 1,…, n

1 = Y/X, for an autonomous change in X.

Estimator: the OLS estimator ˆ1 .

( X i − X )(Yi − Y )
𝑠𝑋𝑌
ˆ1 = i =1
n
= 2 (4.7)
𝑠𝑋
 i
( X
i =1
− X ) 2

The OLS estimator of the intercept,

𝛽̂0 = 𝑌̅ − 𝛽̂1 𝑋̅ (4.8)
3
To derive the statistical properties, we relied on:
The Least Squares Assumptions:
1. E(u|X = x) = 0.
2. (Xi,Yi), i = 1, …, n, are i.i.d.
3. Large outliers are rare.

Under the Least Squares Assumptions, the C.L.T. assures that for n
large, is approximately normally distributed:
𝐴
𝛽𝑗 ~ 𝑁(𝛽𝑗 , 𝑉(𝛽̂𝑗 ))
̂ for j=0,1

Note that: The expression of 𝑉(𝛽̂1 ) depends on V(𝑢𝑖 |𝑋).

Also, to put this into use, 𝑉(𝛽̂𝑗 ) has to be estimated.

4
Remember that SE( ˆ1 ) is the positive square-root of the estimated
variance of ˆ :
1

SE( ˆ1 ) = +√𝑉̂ (𝛽̂1 )

Remark: We use 𝑉̂ (𝛽̂1 ) (i.e., with ^ on top of V) as an estimator of

𝑉(𝛽̂1 ).

The expression of 𝑉(𝛽̂1 ) depends on V(𝑢𝑖 |𝑋).

• So, based on the assumption about V(𝑢𝑖 |𝑋), 𝑉̂ (𝛽̂1 ) differs.
• Therefore, 𝑆𝐸(𝛽̂1 ) differs.

5
Formula for SE( ˆ1 ) – for the general case
(i.e., V(ui | X) = 𝜎𝑖2 , different for each i)
The expression for the variance of ˆ (for large n) is:
1

ˆ 1 var[(𝑋𝑖 −𝜇𝑥 )𝑢𝑖 ]  2

V(  ) = = v
, where vi = (Xi – X)ui.
var[𝑋𝑖 ]2 n( )
1 𝑛 2 2
X

The estimator of V( ˆ1 ) replaces the unknown population values of  2

 2
and X by estimators constructed from the data:
1 n 2
1 estimator of  2
1 
n − 2 i =1
vˆi
̂ ˆ
𝑉( 1 ) = ˆ ˆ = 
2 v
= 
1
n (estimator of  X ) 2 2
n 1 n
2
2

 n ( X i − X ) 
 i =1 
where vˆi = ( X i − X )uˆi . [Do you remember the significance of “hats” (^)?]

6
Formula for SE( ˆ1 ) – for the general case:

SE( ˆ1 ) = + ˆ 2ˆ = the Standard Error of ˆ1 ,

1 ∑𝑛 [(𝑋 𝑖 ̅
−𝑋 ̂
)𝑢 𝑖 ] 2 /(𝑛−2)
ˆ 2ˆ = 𝑖=1
.
1 𝑛 [ ∑𝑛 ̅ 2
𝑖=1(𝑋𝑖 −𝑋 ) /𝑛]
2

This looks complicated, but it is easily calculated:

• The numerator estimates 𝜎𝑣2 , using 𝑣̂𝑖 = (𝑋𝑖 – 𝑋̅)𝑢̂𝑖 .
• Why n – 2? This is the degrees-of-freedom adjustment.
Because 2 coefficients have been estimated (0 & 1), we have n – 2.
• The denominator estimates [𝜎𝑋2 ]2.

In practice SE( ˆ1 ) is computed by regression software.(Stata:robust)

7
Formula for SE( ˆ1 ) – for the special case
(i.e., V(ui | X) = 𝜎𝑢2 , the same for all i, independently of X. )
The expression for the unconditional variance of ˆ1 (for large n) is:
1 𝜎𝑢2
V( ˆ1 ) =
1 var[𝑢𝑖 ]
= ( 2)
𝑛 var[𝑋𝑖 ] 𝑛 𝜎𝑋

The estimator of V( ˆ1 ) replaces the unknown population values of 𝜎𝑢2

and 𝜎𝑋2 by estimators constructed from the data:
1 ∑𝑛 2
𝑉̂( ˆ1 ) =
̂
𝑖=1 𝑖 /(𝑛−2)
𝑢
where 𝑢̂𝑖 refers to residuals.
𝑛 ∑𝑛 ̅ 2
𝑖=1(𝑋𝑖 −𝑋 ) /𝑛

Then, SE( ˆ1 ) = +√𝑉̂ ( ˆ1 ) = the standard error of ˆ1 can be
calculated.

Software packages have this option as well. (Stata: drop the robust)
8
Precision of our estimates
Remark 1: The larger the sample size (i.e., n), the smaller the
variance of 𝛽̂1 .
Remark 2: The smaller the variance of the error term (i.e., 𝜎𝑢2 ), the
smaller the variance of 𝛽̂1 .
Remark 3: The larger the sample variance of X (i.e., 𝑠𝑋2 =
(∑𝑖 𝑥𝑖2 )/(𝑛 − 1) ), which is an estimator of 𝜎𝑥2 , the smaller the
variance of ˆ1 .
Intuition: If there is more variation in X, then there is more
information in the data that you can use to locate the regression line.
This is most easily seen in a figure…

9
The larger the variance of X, the smaller the variance of ˆ1

Question: Which set of dots would yield a more accurate regression

line, blue dots, or black dots?
Hint: Blue dots are more “concentrated,” 𝒔𝟐𝑿 < 𝒔𝟐𝑿 .

10
Let’s compare the following two outputs:

11
Summary: There are two ways to compute standard errors:
• Homoskedasticity-only standard errors – these are valid only if
the errors are homoskedastic. (Strong assumption!)
These can be obtained by omitting the Stata subcommand
“robust.”
• Heteroskedasticity – robust standard errors -- these are always
valid; hence Stock & Watson prefer them. These require the Stata
subcommand “robust.”
The main advantage of the homoskedasticity-only standard errors is
that the formula is simpler. But the disadvantage is that the formula
is correct only if the errors are homoskedastic.
Since Stata calculates standard errors for us, it is better to adopt the
general approach.
12
Conventional way to report regression results concisely:
• Put standard errors in parentheses below the estimated
coefficients to which they apply.
• Write goodness of fit statistics on the same line as the equation.

̂
𝑇𝑒𝑠𝑡𝑆𝑐𝑜𝑟𝑒 = 698.9 – 2.28STR, R2 = .0512, SER = 18.6
(10.4) (0.52)

How do we find these numbers?

The formulas are given in Stock & Watson, also in Lecture Notes.
Stata will find them for us :)
Caution: Stata reports the root mean squared error (RMSE) ≈ SER.

13
Hypothesis Testing using 𝛽̂𝑗 and the Standard Error of 𝛽̂𝑗
The objective is to reach a conclusion regarding the numerical value
of 1, using data (information in a random sample).
General setup
Null hypothesis and two-sided alternative:
H0: 1 = 1,0 vs. H1: 1  1,0
where 1,0 is the hypothesized numerical value under the null.

Null hypothesis and one-sided alternative:

H0: 1 = 1,0 vs. H1: 1 < 1,0
OR
H0: 1 = 1,0 vs. H1: 1 > 1,0

14
General approach: construct t-statistic, and compute p-value (or
obtain the critical value) using the standard normal c.d.f. table.

estimator - hypothesized value

• In general: t=
standard error of the estimator
where the SE of the estimator is the square root of an estimator
of the variance of the estimator.
𝑌̅−𝜇𝑌,0
• Math 201: test on the mean of Y is based on t = ;
𝑆𝐸(𝑌̅)

ˆ1 − 1,0
• Econ 311: test on 1 is based on t0 = ,
ˆ
SE ( 1 )
where SE( ˆ1 ) = the positive square root of a (consistent)
estimator of the variance of the sampling distribution of ˆ1 .
15
Hypothesis testing - mechanics: to test
H0: 1 = 1,0 vs. H1: 1  1,0,
Construct the t-statistic
̂1 −𝜷𝟏,𝟎
𝛽
t0 = ̂1 ) .
𝑆𝐸(𝛽

(i) Reject H0 at 5% significance level if |t0 | > 1.96.

(ii) Calculate the p-value = 2 Pr(Z > | t0 |) for the two-sided
alternative, which is the probability contained in the tails of a
standard normal beyond |t0 |. Reject H0 if the p-value is “small”.
Note the relation between (i) and (ii): You will surely reject at the
5% significance level if the p-value is ≤ 0.05.

16
Remarks:
• By opting for the standard normal, we engage in “practical”
inference.
• This procedure relies on the CLT. Typically n = 30 is large enough
for the approximation to be a good one.
• The language “t-statistic” invokes memories of “Student’s t-
distribution.” However, “t-statistic” will distribute as “Student’s
t-distribution” only under some special conditions (i.e.,
assumptions).

17
Zero slope null: The simplest (and most widely used) hypothesis
test sets the numerical value of 1,0 to ”zero”:
H0: 1 = 0 vs. H1: 1  0
This test is known as ”test of (statistical) significance” of 𝛽̂1 .
When 1,0 = 0, the test statistic
̂1 −𝛽1,0
𝛽
t0 = ̂1 )
𝑆𝐸(𝛽
simplifies to the ”t-ratio”:
̂1
𝛽
t-ratio = ̂1 ) .
𝑆𝐸(𝛽

In applications users typically check the t-ratio first, to decide

whether the estimated slope is (statistically) different from zero.

18
On our choice of notation: Stock and Watson refer to the numerical
value of the test statistic for a particular value of 𝛽̂1 as:
ˆ −
tact = 1 1,0

SE ( ˆ1 )
where superscript “act” stands for “actual.” This notation does not
capture the fact that for a given estimate 𝛽̂1 of the unknown
parameter 1, the value of the test statistic depends on 1,0, the
numerical value stated in the null hypothesis.
We chose our notation to emphasize the link between the value of
the test statistic and the hypothesized value of 1,0:
̂1 −𝛽1,0
𝛽
t0 = ̂1 ) .
𝑆𝐸(𝛽

19
Example: Test Scores and STR, California school data
A convenient method for summarizing the regression results is to
write down the estimated regression equation, where the standard
errors are shown under the estimated coefficients:
̂
𝑇𝑒𝑠𝑡𝑆𝑐𝑜𝑟𝑒 = 698.9 – 2.28 STR
(10.4) (0.52)
That is, SE( ˆ ) = 10.4, SE( ˆ ) = 0.52.
0 1

The t-ratio for the slope is: –2.28/0.52 = –4.38.

(i) At the 1% significance level, the 2-sided critical value from the
standard normal table is z*= 2.58; since |–4.38| > 2.58, we
reject the null at the 1% significance level.

20
Note that with a t-ratio of this magnitude, the evidence against the
null is “very” strong. Even if we were to impose a tougher standard
than 1%, we would still reject the null.

Question: What is the toughest standard that we can apply, and

still reject the null?
Answer: p-value (marginal level of significance).
(ii) We can compute the p-value for the two-sided alternative from
the standard normal table:
With Z ~ N(0, 1),
p-value = 2Pr(Z > |–4.38| ) = 0.00001 (= 10–5).
When the p-value is this small, we often write: “p-value << 0.01”
and say “the evidence against the null is extremely strong.”
21
Geometry:

p-value << 0.01

22
Confidence Intervals for 1

Recall that a 95% confidence interval is, equivalently:

• The set of numerical values 1,0 that cannot be rejected at the 5%
significance level;
• An interval computed as a function of the data that contains the
true parameter value 95% of the time in repeated samples.
CLT: The t-statistic for 1 becomes Z ~ N(0,1) in large samples. Thus
the (approximate) 95% symmetric confidence interval for 1 is:

ˆ1  1.96SE( ˆ1 ).

23
Example: Test Scores and STR, California school data

̂
𝑇𝑒𝑠𝑡𝑆𝑐𝑜𝑟𝑒 = 698.9 – 2.28 STR
(10.4) (0.52)
The parameter of interest is 1. We have 𝛽̂1 = – 2.28 and
SE( ˆ ) = 0.52. The (approximate) 95% confidence interval for 1 is:
1

ˆ1  1.96SE( ˆ1 ) or –2.28  1.960.52

= (–3.30, –1.26).
The following two statements are equivalent
• The 95% confidence interval does not include zero;
• The hypothesis 1 = 0 is rejected at the 5% level.

24
Test Scores and STR, California school data:
regress testscr str, robust
Regression with robust standard errors Number of obs = 420
F( 1, 418) = 19.26
Prob > F = 0.0000
R-squared = 0.0512
Root MSE = 18.581
-------------------------------------------------------------------------
| Robust
testscr | Coef. Std. Err. t P>|t| [95% Conf. Interval]
--------+----------------------------------------------------------------
str | -2.279808 .5194892 -4.38 0.000 -3.300945 -1.258671
_cons | 698.933 10.36436 67.44 0.000 678.5602 719.3057
-------------------------------------------------------------------------
_cons denotes the intercept. Slope is identified with variable name.
̂
𝑇𝑒𝑠𝑡𝑆𝑐𝑜𝑟𝑒 = 698.9 – 2.28STR, R2 = .0512, SER = 18.6.
(10.4) (0.52)
t-ratio for 1 = –4.38; p-value = 0.000 (2-sided)
Approx. 95% confidence interval for 1 is: (–3.30, –1.26).
Statistical inference about the intercept, 0 -- summary
25
Estimation:
• OLS estimator of 0 is ˆ0 .
• ˆ has an approx. normal distribution in large samples.
0

Testing:
• H0: 0 = 0,0 vs. 0  0,0 (0,0 is the value of 0 under H0).
• t0 = ( ˆ0 – 0,0)/SE( ˆ0 ).
• p-value = area under standard normal outside (-|t0|, |t0|).
Confidence Intervals:
• 95% confidence interval for 0 is [ ˆ0  1.96SE( ˆ0 )].
This is the set of 0 that is not rejected at the 5% level.

Hypothesis Testing in Simple Regression
No ratings yet
Hypothesis Testing in Simple Regression
46 pages
Regression Analysis: Hypothesis Tests & CI
No ratings yet
Regression Analysis: Hypothesis Tests & CI
42 pages
3SW3e Ch5 Slides 2026
No ratings yet
3SW3e Ch5 Slides 2026
34 pages
Part 3 4 5 Linear Regression DP 2025
No ratings yet
Part 3 4 5 Linear Regression DP 2025
100 pages
Regression Analysis: Hypothesis Testing & CI
No ratings yet
Regression Analysis: Hypothesis Testing & CI
130 pages
Manzan SW4e Ch05
No ratings yet
Manzan SW4e Ch05
32 pages
Statistical Inference in Linear Regression
No ratings yet
Statistical Inference in Linear Regression
49 pages
Lecture 5 Applied Econometrics For Accounting and Finance
No ratings yet
Lecture 5 Applied Econometrics For Accounting and Finance
53 pages
Topic2 (Partial)
No ratings yet
Topic2 (Partial)
55 pages
SW CH 05 Piskula Mod
No ratings yet
SW CH 05 Piskula Mod
39 pages
Interpreting Excel Regression Output
No ratings yet
Interpreting Excel Regression Output
5 pages
Econometrics Assignment Overview
No ratings yet
Econometrics Assignment Overview
20 pages
Class 3 - Hypothesis Test
No ratings yet
Class 3 - Hypothesis Test
44 pages
Understanding P-Values and Confidence Intervals
No ratings yet
Understanding P-Values and Confidence Intervals
30 pages
Multiple Regression Inference Overview
No ratings yet
Multiple Regression Inference Overview
38 pages
Linear Regression Assumptions Explained
No ratings yet
Linear Regression Assumptions Explained
27 pages
Hypothesis Testing in Regression Analysis
No ratings yet
Hypothesis Testing in Regression Analysis
5 pages
Regression and Correlation Analysis Guide
No ratings yet
Regression and Correlation Analysis Guide
17 pages
OLS Assumptions for Linear Regression
No ratings yet
OLS Assumptions for Linear Regression
6 pages
Test of Significance From SLR
No ratings yet
Test of Significance From SLR
4 pages
Interpreting MSE and SSE in Regression
No ratings yet
Interpreting MSE and SSE in Regression
15 pages
Regression Analysis: Hypothesis Testing & CI
No ratings yet
Regression Analysis: Hypothesis Testing & CI
27 pages
Introduction to Regression Analysis
No ratings yet
Introduction to Regression Analysis
29 pages
Regression Validity Tests Explained
No ratings yet
Regression Validity Tests Explained
21 pages
Multiple Regression Inference Techniques
No ratings yet
Multiple Regression Inference Techniques
52 pages
Interpreting R-Squared in Regression
No ratings yet
Interpreting R-Squared in Regression
51 pages
Understanding Hypothesis Testing Basics
No ratings yet
Understanding Hypothesis Testing Basics
35 pages
Statistical Tests and Analysis Guide
No ratings yet
Statistical Tests and Analysis Guide
11 pages
Regression Analysis and OLS Estimation
No ratings yet
Regression Analysis and OLS Estimation
7 pages
Understanding Simple Linear Regression
No ratings yet
Understanding Simple Linear Regression
7 pages
t-Tests and CLM Assumptions in Regression
No ratings yet
t-Tests and CLM Assumptions in Regression
34 pages
Simple Linear Regression Explained
No ratings yet
Simple Linear Regression Explained
37 pages
Hypothesis Testing in Multiple Regression
No ratings yet
Hypothesis Testing in Multiple Regression
26 pages
Linear Regression Inference Techniques
No ratings yet
Linear Regression Inference Techniques
30 pages
Simple Linear Regression Overview
No ratings yet
Simple Linear Regression Overview
32 pages
Multiple Regression Hypothesis Testing
No ratings yet
Multiple Regression Hypothesis Testing
10 pages
Lecture 3 - 4 - Ordinary Least Squares Formulas
No ratings yet
Lecture 3 - 4 - Ordinary Least Squares Formulas
30 pages
Strongest Linear Regression Analysis
No ratings yet
Strongest Linear Regression Analysis
5 pages
Hypothesis Testing in Linear Regression
No ratings yet
Hypothesis Testing in Linear Regression
43 pages
Econometrics For Financ
No ratings yet
Econometrics For Financ
42 pages
Lecture Note 4
No ratings yet
Lecture Note 4
41 pages
Hypothesis Testing and Regression Analysis
No ratings yet
Hypothesis Testing and Regression Analysis
11 pages
Linear Regression Basics for Analysts
No ratings yet
Linear Regression Basics for Analysts
4 pages
OLS Estimator Study Guide for Midterm
No ratings yet
OLS Estimator Study Guide for Midterm
7 pages
Statistical Tests for Least Squares Estimates
No ratings yet
Statistical Tests for Least Squares Estimates
16 pages
Hypothesis Testing in Econometrics
No ratings yet
Hypothesis Testing in Econometrics
35 pages
Simple Linear Regression Explained
No ratings yet
Simple Linear Regression Explained
64 pages
ANOVA and Regression Significance Testing
No ratings yet
ANOVA and Regression Significance Testing
23 pages
Statistical Inference in OLS Regression
No ratings yet
Statistical Inference in OLS Regression
62 pages
Sta - Session 2
No ratings yet
Sta - Session 2
53 pages
Regression and Correlation Analysis Basics
No ratings yet
Regression and Correlation Analysis Basics
39 pages
Testing Hypotheses in Linear Regression
No ratings yet
Testing Hypotheses in Linear Regression
4 pages
Simple Linear Regression Overview
No ratings yet
Simple Linear Regression Overview
55 pages
Standard Error and Hypothesis Testing in Econometrics
No ratings yet
Standard Error and Hypothesis Testing in Econometrics
28 pages
Simple Linear Regression in Econometrics
No ratings yet
Simple Linear Regression in Econometrics
6 pages
Developing Least Squares Regression Equation
No ratings yet
Developing Least Squares Regression Equation
12 pages
Week 3 Assignment 3: Signal Analysis Insights
100% (2)
Week 3 Assignment 3: Signal Analysis Insights
3 pages
Regression Analysis Summary Report
No ratings yet
Regression Analysis Summary Report
29 pages
Leadership Styles Impact on Employee Performance
No ratings yet
Leadership Styles Impact on Employee Performance
18 pages
Linear Regression Analysis Quiz
No ratings yet
Linear Regression Analysis Quiz
32 pages
PLC Control in Measurement Systems
No ratings yet
PLC Control in Measurement Systems
51 pages
Understanding MANOVA: Concepts & Applications
No ratings yet
Understanding MANOVA: Concepts & Applications
7 pages
Technical Efficiency in Kisii Farmers
No ratings yet
Technical Efficiency in Kisii Farmers
7 pages
Logit Model for Default Prediction
No ratings yet
Logit Model for Default Prediction
23 pages
Parameter Estimation with Examples
No ratings yet
Parameter Estimation with Examples
8 pages
Impact of Maternal Age on Infant Weight
No ratings yet
Impact of Maternal Age on Infant Weight
4 pages
Basic CRM Assumptions Violations Explained
No ratings yet
Basic CRM Assumptions Violations Explained
14 pages
Understanding Sampling in Audits
No ratings yet
Understanding Sampling in Audits
61 pages
Simple Linear Regression Overview
No ratings yet
Simple Linear Regression Overview
41 pages
Coefficient of Determination in Regression
No ratings yet
Coefficient of Determination in Regression
18 pages
Econometrics Exam Guidelines and Content
No ratings yet
Econometrics Exam Guidelines and Content
11 pages
Correlation of Knowledge, Motivation, Experience on Performance
No ratings yet
Correlation of Knowledge, Motivation, Experience on Performance
13 pages
SPSS Data Analysis and Regression Output
No ratings yet
SPSS Data Analysis and Regression Output
4 pages
Brand Image Analysis of Apollo Pharmacy
100% (1)
Brand Image Analysis of Apollo Pharmacy
60 pages
Barbieri Et Al 2023
No ratings yet
Barbieri Et Al 2023
22 pages
GLMM Analysis with R: A Guide by Knudson
No ratings yet
GLMM Analysis with R: A Guide by Knudson
15 pages
Hypothesis Testing Examples and Solutions
No ratings yet
Hypothesis Testing Examples and Solutions
22 pages
Yelp Reviews Impact on Restaurant Revenue
No ratings yet
Yelp Reviews Impact on Restaurant Revenue
40 pages
MATLAB Least Squares Curve Fitting
No ratings yet
MATLAB Least Squares Curve Fitting
18 pages
Windfall Income's Impact on Consumption
No ratings yet
Windfall Income's Impact on Consumption
14 pages
Gaussian Distributions: Overview: This Worksheet Introduces The Properties of Gaussian Distributions, The
100% (1)
Gaussian Distributions: Overview: This Worksheet Introduces The Properties of Gaussian Distributions, The
25 pages
Statistical Analysis Exam Questions
100% (1)
Statistical Analysis Exam Questions
7 pages
Understanding Multiple Correlation Coefficient
No ratings yet
Understanding Multiple Correlation Coefficient
6 pages
Groundwater Hotspot Analysis in Ethiopia
No ratings yet
Groundwater Hotspot Analysis in Ethiopia
14 pages
Polynomial Regression in ML Pipeline
No ratings yet
Polynomial Regression in ML Pipeline
58 pages
Analisis Homogenitas dan Normalitas Data
No ratings yet
Analisis Homogenitas dan Normalitas Data
7 pages

Inference in Simple Regression Analysis

Uploaded by

Inference in Simple Regression Analysis

Uploaded by

Inference in Simple Regression

1 = Y/X, for an autonomous change in X.

Estimator: the OLS estimator ˆ1 .

The OLS estimator of the intercept,

Note that: The expression of 𝑉(𝛽̂1 ) depends on V(𝑢𝑖 |𝑋).

SE( ˆ1 ) = +√𝑉̂ (𝛽̂1 )

Remark: We use 𝑉̂ (𝛽̂1 ) (i.e., with ^ on top of V) as an estimator of

The expression of 𝑉(𝛽̂1 ) depends on V(𝑢𝑖 |𝑋).

ˆ 1 var[(𝑋𝑖 −𝜇𝑥 )𝑢𝑖 ]  2

The estimator of V( ˆ1 ) replaces the unknown population values of  2

SE( ˆ1 ) = + ˆ 2ˆ = the Standard Error of ˆ1 ,

This looks complicated, but it is easily calculated:

In practice SE( ˆ1 ) is computed by regression software.(Stata:robust)

The estimator of V( ˆ1 ) replaces the unknown population values of 𝜎𝑢2

Question: Which set of dots would yield a more accurate regression

How do we find these numbers?

Null hypothesis and one-sided alternative:

estimator - hypothesized value

(i) Reject H0 at 5% significance level if |t0 | > 1.96.

In applications users typically check the t-ratio first, to decide

The t-ratio for the slope is: –2.28/0.52 = –4.38.

Question: What is the toughest standard that we can apply, and

p-value << 0.01

Recall that a 95% confidence interval is, equivalently:

ˆ1  1.96SE( ˆ1 ).

ˆ1  1.96SE( ˆ1 ) or –2.28  1.960.52

You might also like