0% found this document useful (0 votes)

13 views57 pages

Econometrics: MRA and Inference Basics

The document outlines key concepts in econometrics, focusing on multiple regression models, OLS estimation, and the importance of assumptions such as linearity and zero conditional mean. It discusses the interpretation of regression coefficients, goodness-of-fit measures like R-squared, and the implications of omitted variable bias. Additionally, it provides examples to illustrate the effects of including or excluding variables in regression analysis.

Uploaded by

hassan.domiaty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as KEY, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views57 pages

Econometrics: MRA and Inference Basics

Uploaded by

hassan.domiaty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as KEY, PDF, TXT or read online on Scribd

Lecture 2: MRA and inference

Dr. Yundan Gong

Econometrics (6YYD0017)
Test

1. Econometrics is the branch of economics that _____.

a. studies the behaviour of individual economic agents in
making economic decisions
b. develops and uses statistical methods for estimating
economic relationships
c. deals with the performance, structure, behaviour, and
decision-making of an economy as a whole
d. applies mathematical methods to represent economic
theories and solve economic problems
Test

2. The term ‘u’ in an econometric model is usually referred to as

the _____.

a. error term

b. parameter

c. hypothesis

d. dependent variable
Test

3. The parameters of an econometric model _____.

a. include all unobserved factors affecting the variable

being studied

b. describe the strength of the relationship between the

variable under study and the factors affecting it

c. refer to the explanatory variables included in the model

d. refer to the predictions that can be made using the

model
Test

4. _ has a causal effect on _.

a. Income; unemployment

b. Height; health

c. Income; consumption

d. Age; wage
Test

5. If a change in variable x causes a change in variable y, variable

x is called the _____.
1.

a. dependent variable
b. explained variable
c. explanatory variable
d. response variable
Test

6. In the equation is the _____.

a. dependent variable
b. independent variable
c. slope parameter
d. Intercept parameter

7. And what is the estimated value of β0 ?

a.
b.
c.

d.
Test

8. What does the equation denote if the regression

equation is
?
a. The explained sum of squares
b. The total sum of squares
c. The sample regression function
d. The population regression function
Test

9. If the total sum of squares (SST) in a regression equation is 81,

and the residual sum of squares (SSR) is 25, what is the explained
sum of squares (SSE)?
a. 64
b. 56
c. 32
d. 18
Test

10. If the residual sum of squares (SSR) in a regression analysis is

66 and the total sum of squares (SST) is equal to 90, what is the
value of the coefficient of determination?
a. 0.73
b. 0.55
c. 0.27
d. 1.2
Test

11. Which of the following is true of ?

a. is also called the standard error of regression.

b. A low indicates that the Ordinary Least Squares line fits the
data well.
c. usually decreases with an increase in the number of
independent variables in a regression.
d. shows what percentage of the total variation in the dependent
variable, Y, is explained by the explanatory variables.
Test

12. The value of always _____.

a. lies below 0
b. lies above 1
c. lies between 0 and 1
d. lies between 1 and 1.5
Today‘s agenda

Interpretation of OLS

The expected value of the OLS estimation

Efficiency of OLS: THE Gauss-Markov Theorem

Discussion of the normality assumption

Reading List

Wooldridge, J., Introductory Econometrics: A Modern Approach,

EMEA , 2014, South-Western, chapter 3, 4&7 (HB129WOO2014)

Hill, Griffiths and Lim, Principles of Econometrics, 4th ed., 2011,

Wiley, chapter 5&6 (HB139HIL)

Goodness-of-fit

Goodness-of-Fit

“How well does the explanatory variable explain the dependent

variable?”
Measures of Variation

Total sum of squares, Explained sum of Residual sum of

represents total squares, squares,
variation represents variation represents variation
in the dependent explained by not
variable regression explained by
regression
R-squared
The Simple Regression Model

Decomposition of total variation

Total Explained Unexplained

variation part part

Goodness-of-fit measure (R-squared)

R-squared measures the fraction

of the total variation that is
explained by the regression
R_squared examples
The Simple Regression Model

CEO Salary and return on equity

The regression explains only

1.3%
of the total variation in salaries

Voting outcomes and campaign expenditures

The regression explains 85.6%

of the total variation in election
outcomes

Caution: A high R-squared does not necessarily mean that the

regression has a causal interpretation!
Incorporating
The Simple Regression Model
nonlinearities: Semi-
logarithmic
Regression of log wages on form
years of education

Natural logarithm of
This wage
changes the interpretation of the regression coefficient:

Percentage change of
wage

… if years of education
are increased by one
year
Fitted regression
The Simple Regression Model

The wage increases by 8.3% for

every additional year of
education
(= return to another year of
education)
For
example:

Growth rate of wage is 8.3%

per year of education
Incorporating
The Simple Regression Model
nonlinearities: log-log form
CEO salary and firm sales

Natural logarithm of CEO Natural logarithm of his/her

salary firm‘s sales
This changes the interpretation of the regression coefficient:

Percentage change of
salary
… if sales increase by
1%
Logarithmic changes
are
always percentage
changes
Fitted regression
The Simple Regression Model

CEO salary and firm sales: fitted regression

+ 1% sales; + 0.257%
For example: salary

The log-log form postulates a constant elasticity model, whereas the

semi-log form assumes a semi-elasticity model
Definition of the multiple
linear regression model
“Explains variable in terms of variables
”

Interce Slope
pt parameters

Dependent
variable, Error term,
Independent disturbance,
explained variables,
variable, unobservable
explanatory s,…
response variables,
variable,… regressors,…
Motivation for multiple
regression
Motivation:
Incorporate more explanatory factors into the model
Explicitly hold fixed other factors that otherwise would be in
Allow for more flexible functional forms

Example: Wage equation

Now measures effect of education explicitly holding
experience fixed

All other
factors…

Hourly Years of Years of labor market

wage education experience
Example: Average test
scores and per student
spending
Other
factors
Average Per student Average family
standardized spending income
test score of at this school of students at this
school school

Per student spending is likely to be correlated with average family

income at a given high school because of school financing
Omitting average family income in regression would lead to biased
estimate of the effect of spending on average test scores
In a simple regression model, effect of per student spending would
partly include the effect of family income on test scores
Example: Family income
and family consumption

Other
factors
Family Family Family income
consumption income squared
Model has two explanatory variables: income and income squared
Consumption is explained as a quadratic function of income
One has to be very careful when interpreting the coefficients :

By how much does Depends on

consumption how much
increase if income is income is
increased already there
by one unit?
Example: CEO salary, sales,
and CEO tenure

Log of CEO Log Quadratic function of CEO tenure with

. salary sales the firm

Model assumes a constant elasticity relationship between CEO salary

and the sales of his or her firm
Model assumes a quadratic relationship between CEO salary and his or
her tenure with the firm

Meaning of “linear” regression

The model has to be linear in the parameters (not in the variables)

OLS estimation of the
multiple regression model
OLS Estimation of the multiple regression model
Random sample

Regression residuals

Minimize sum of squared residuals

Minimization will be carried out by

computer
Interpretation of the
multiple regression model

By how much does the dependent variable change if

the j-th
independent variable is increased by one unit,
holding all
other independent variables and the error term
The multiple linear constant
regression model manages to hold the values
of other explanatory variables fixed even if, in reality, they are
correlated with the explanatory variable under consideration

“Ceteris paribus”-interpretation

It has still to be assumed that unobserved factors do not change if

the explanatory variables are changed
Example: Determinants of
college GPA

Grade point average at High school grade point Achievement test

college average score

Interpretation
.

Holding ACT fixed, another point on high school grade point

average is associated with another .453 points college grade point
average
Or: If we compare two students with the same ACT, but the hsGPA
of student A is one point higher, we predict student A to have a
colGPA that is .453 higher than that of student B
Holding high school grade point average fixed, another 10 points
on ACT are associated with less than one point on college GPA
Properties of OLS on any
sample of data
Fitted values and residuals

Fitted or predicted Residu

values als
Algebraic properties of OLS regression

Deviations from Covariance between Sample averages of y and

regression line sum deviations and regressors of the regressors lie on
up to zero are zero regression line
Goodness-of-Fit

Decomposition of total variation

SST = SSE + SSR

Notice that R-squared can
R-squared only
increase if another
explanatory
variable is added to the
regression

Alternative expression for R-squared

R-squared is equal to the
squared
correlation coefficient
between the
actual and the predicted
value of
the dependent variable
Example: Explaining arrest
records
Number of Proportion prior Months in prison Quarters employed
times arrests 1986 1986
arrested 1986 that led to
conviction

Interpretation:
.

If the proportion prior arrests increases by 0.5, the predicted fall

in arrests is 7.5 arrests per 100 men
If the months in prison increase from 0 to 12, the predicted fall in
arrests is 0.408 arrests for a particular man
If the quarters employed increase by 1, the predicted fall in
arrests is 10.4 arrests per 100 men
Example: Explaining arrest
records (cont.)
An additional explanatory variable is added:

Average sentence in prior

convictions

R-squared increases only

Interpretation:
slightly

Average prior sentence increases number of arrests (?)

Limited additional explanatory power as R-squared increases by
little

General remark on R-squared

Even if R-squared is small (as in the given example), regression may

still provide good estimates of ceteris paribus effects
Standard assumptions for
the multiple regression
model
Assumption MLR.1 (Linear in parameters)

In the population, the

relation-
ship between y and the
Assumption MLR.2 (Random sampling) expla-
natory variables is linear

The data is a random

sample
drawn from the
population

Each data point therefore follows the population

equation
Standard assumptions for the
multiple regression model
(cont.)
Assumption MLR.3 (No perfect collinearity)
“In the sample (and therefore in the population), none
of the independent variables is constant and there are
no exact linear relationships among the independent
variables.”

Remarks on MLR.3

The assumption only rules out perfect collinearity/correlation

between explanatory variables; imperfect correlation is allowed
If an explanatory variable is a perfect linear combination of other
explanatory variables it is superfluous and may be eliminated
Constant variables are also ruled out (collinear with intercept)
Examples for perfect
collinearity
Example for perfect collinearity: small sample

In a small sample, avginc may accidentally be an exact multiple of

expend; it will not
Example for perfect
be possible collinearity:
to disentangle relationships
their separate between
effects because there isregressors
exact
covariation

Either shareA or shareB will have to be dropped from the regression

because there
is an exact linear relationship between them: shareA + shareB = 1
Standard assumptions for the
multiple regression model
(cont.)
Assumption MLR.4 (Zero conditional mean)

The value of the explanatory

variables
must contain no information about
the mean of the unobserved factors

In a multiple regression model, the zero conditional mean

assumption is much more likely to hold because fewer things end
up in the error

Example: Average test scores

If avginc was not included in the regression, it would end up in the

error term; it would then be hard to defend that expend is
uncorrelated with the error
Zero conditional mean

Discussion of the zero mean conditional assumption

Explanatory variables that are correlated with the error term are
called endogenous; endogeneity is a violation of assumption MLR.4
Explanatory variables that are uncorrelated with the error term are
called exogenous; MLR.4 holds if all explanat. var. are exogenous
Exogeneity is the key assumption for a causal interpretation of the
regression, and for unbiasedness of the OLS estimators

Theorem 3.1 (Unbiasedness of OLS)

Unbiasedness is an average property in repeated samples; in a

given sample, the estimates may still be far away from the true
Including irrelevant
variables/Omitted variable
bias
Including irrelevant variables in a regression model

No problem because = 0 in the population

.
However, including irrevelant variables may increase
sampling variance.
Omitting relevant variables: the simple case

True model (contains x1 and x2)

Estimated model (x2 is

omitted)
Omitted variable bias

Omitted variable bias

If x1 and x2 are correlated, assume a
linear regression relationship
between them

If y is only If y is only error term

regressed regressed
on x1 this will be on x1, this will be
Conclusion: All estimatedthe
the estimated coefficients
estimated will be biased
intercept slope on x1
Omitted variable bias

Example: Omitting ability in a wage equation

Will both be positive

The return to education will be overestimated because . It

will look
as if people with many years of education earn very high wages, but this is
When is there
partly no omitted variable bias?
. due to the fact that people with more education are also more able on
average.
If the omitted variable is irrelevant or uncorrelated
Omitted variable bias

Omitted variable bias: more general cases

True model (contains x1, x2,

and x3)

Estimated model (x3 is

omitted)

No general statements possible about direction of bias

Analysis as in simple case if one regressor uncorrelated with
others

Example: Omitting ability in a wage equation

If exper is approximately uncorrelated with educ and abil, then the

direction
of the omitted variable bias can be as analyzed in the simple two
variable case.
Standard assumptions for the
multiple regression model
(cont.)
Assumption MLR.5 (Homoskedasticity)

The value of the explanatory

variables
must contain no information about
the variance of the unobserved
Example: Wage equation factors
This assumption may also be
hard
to justify in many cases

Short hand notation

All explanatory variables are
collected in a random vector

wit
h
Graphical illustration
The Simple Regression Model of
homoskedasticity

The variability of the

unobserved
influences does not depend on
the value of the explanatory
variable
An
The example for
Simple Regression Model
heteroskedasticity
An example for heteroskedasticity: Wage and education

The variance of the unobserved

determinants of wages
increases
with the level of education
Theorem 3.2 (sampling
variances of the OLS slope
estimators)
Under assumptions MLR.1 –
MLR.5:

Variance of the error

term

Total sample variation in R-squared from a regression of explanatory

explanatory variable xj: variable xj on all other independent
variables
(including a constant)
Components of OLS variances

The error variance

A high error variance increases the sampling variance because

there is more “noise” in the equation
A large error variance necessarily makes estimates imprecise
The error variance does not decrease with sample size

The total sample variation in the explanatory variable

More sample variation leads to more precise estimates

Total sample variation automatically increases with the sample size
Increasing the sample size is thus a way to get more precise
estimates
Components of OLS variances

Linear relationships among the independent variables

Regress on all other independent variables (including a

constant)

The R-squared of this regression will be the

higher
the better xj can be linearly explained by the
other independent variables

Sampling variance of will be the higher the better explanatory

variable can be linearly explained by other independent variables
The problem of almost linearly dependent explanatory variables is
called multicollinearity (i.e. for some )
An example for
multicollinearity

Average Expenditu Expenditures Other

standardized res for in- ex-
test score of for structional penditu
school teachers materials res

The different expenditure categories will be strongly correlated because if a

school has a lot of resources it will spend a lot on everything.
It will be hard to estimate the differential effects of different expenditure
categories because all expenditures are either high or low. For precise estimates
of the differential effects, one would need information about situations where
expenditure categories change differentially.
As a consequence, sampling variance of the estimated effects will be large.
Discussion of the
multicollinearity problem
.

In the above example, it would probably be better to lump all

expenditure categories together because effects cannot be
disentangled

In other cases, dropping some independent variables may reduce

multicollinearity (but this may lead to omitted variable bias)
Discussion of the
multicollinearity problem

Only the sampling variance of the variables involved in

multicollinearity will be inflated; the estimates of other effects may
be very precise
Note that multicollinearity is not a violation of MLR.3 in the strict
sense
Multicollinearity may be detected As through “variance
an (arbitrary) inflation
rule of thumb, the
variance
factors” inflation factor should not be larger
than 10
Variances in misspecified
models
.

The choice of whether to include a particular variable in a

regression can be made by analyzing the tradeoff between bias and
variance True population
model

Estimated model 1

Estimated model 2

It might be the case that the likely omitted variable bias in the
misspecified model 2 is overcompensated by a smaller variance
Variances in misspecified
models (cont.)

Conditional on x1 and x2,

the variance in model 2
is always smaller than
that in model 1

Case 1:
Conclusion: Do not include irrelevant
regressors

Case 2:
Trade off bias and variance; Caution: bias will not vanish even in
large samples
Estimating the error variance

An unbiased estimate of the error variance can be obtained by substracting the

number of estimated regression coefficients from the number of observations.
The number of obser-vations minus the number of estimated parameters is also
called the degrees of freedom. The n estimated squared residuals in the sum
are not completely independent but related
through the k+1 equations that define the first order conditions of the
minimization problem.

Theorem 3.3 (Unbiased estimator of the error variance)

Estimation of the sampling
variances of the OLS
estimators
The true sampling
variation of the
estimated

Plug in for the unknown

The estimated
samp-
ling variation of
the
Note that theseformulas are only valid under assumptions MLR.1-
estimated
MLR.5 (in particular, there has to be homoskedasticity)
Efficiency of OLS: the Gauss-
Markov Theorem
.

Under assumptions MLR.1 - MLR.5, OLS is unbiased

However, under these assumptions there may be many other
estimators that are unbiased
Which one is the unbiased estimator with the smallest variance?
In order to answer this question one usually limits oneself to linear
estimators, i.e. estimators linear in the dependent variable

May be an arbitrary function of the sample

values of all the explanatory variables; the
OLS estimator
can be shown to be of this form
Sampling distribution of
the OLS
Statistical inference in the regression model
.

Hypothesis tests about population parameters

Construction of confidence intervals

Sampling distributions of the OLS estimators

The OLS estimators are random variables

We already know their expected values and their variances
However, for hypothesis tests we need to know their distribution
In order to derive their distribution we need additional
assumptions
Assumption about distribution of errors: normal distribution

Econometrics: MRA and Inference Overview
No ratings yet
Econometrics: MRA and Inference Overview
52 pages
Multiple Regression Model Estimation
No ratings yet
Multiple Regression Model Estimation
40 pages
Week 2 - Module 2 - The Simple Regression Model
No ratings yet
Week 2 - Module 2 - The Simple Regression Model
29 pages
Understanding Multiple Regression Analysis
No ratings yet
Understanding Multiple Regression Analysis
56 pages
Understanding Simple Regression Models
No ratings yet
Understanding Simple Regression Models
26 pages
Econometrics II: Regression Analysis Basics
No ratings yet
Econometrics II: Regression Analysis Basics
55 pages
Econometrics: Simple Regression Model
No ratings yet
Econometrics: Simple Regression Model
49 pages
CH 03 Wooldridge 5e PPT PDF
100% (3)
CH 03 Wooldridge 5e PPT PDF
35 pages
Simple Regression Model Overview
No ratings yet
Simple Regression Model Overview
41 pages
Understanding Multiple Regression Analysis
100% (1)
Understanding Multiple Regression Analysis
35 pages
Multiple Regression Analysis Overview
No ratings yet
Multiple Regression Analysis Overview
35 pages
Multiple Regression Analysis Overview
No ratings yet
Multiple Regression Analysis Overview
99 pages
Econometrics Assignment Overview
No ratings yet
Econometrics Assignment Overview
3 pages
Understanding Multiple Linear Regression
No ratings yet
Understanding Multiple Linear Regression
14 pages
Multiple Regression Analysis Overview
No ratings yet
Multiple Regression Analysis Overview
40 pages
Simple Linear Regression Analysis Guide
No ratings yet
Simple Linear Regression Analysis Guide
43 pages
Simple Linear Regression Overview
No ratings yet
Simple Linear Regression Overview
43 pages
Locating Coefficients in Regression Analysis
No ratings yet
Locating Coefficients in Regression Analysis
104 pages
Multiple Regression Analysis Explained
No ratings yet
Multiple Regression Analysis Explained
36 pages
Understanding Regression and Residuals
No ratings yet
Understanding Regression and Residuals
77 pages
Linear Regression Analysis of Class Size
No ratings yet
Linear Regression Analysis of Class Size
38 pages
Applied Econometrics 2014 1
No ratings yet
Applied Econometrics 2014 1
90 pages
Regression Analysis and Interpretation
No ratings yet
Regression Analysis and Interpretation
11 pages
MLR Estimation Techniques Explained
No ratings yet
MLR Estimation Techniques Explained
52 pages
Understanding Multiple Regression Analysis
No ratings yet
Understanding Multiple Regression Analysis
31 pages
Multiple Regression Analysis Essentials
No ratings yet
Multiple Regression Analysis Essentials
11 pages
Simple Linear Regression Explained
No ratings yet
Simple Linear Regression Explained
39 pages
Multiple Regression Analysis Overview
No ratings yet
Multiple Regression Analysis Overview
45 pages
Multiple Regression Analysis Overview
No ratings yet
Multiple Regression Analysis Overview
27 pages
Simple Linear Regression Overview
No ratings yet
Simple Linear Regression Overview
43 pages
Conditional Mean Independence in Regression
No ratings yet
Conditional Mean Independence in Regression
5 pages
Multiple Regression Analysis Estimation Guide
No ratings yet
Multiple Regression Analysis Estimation Guide
32 pages
Econometrics Exercise Set 1 Solutions
No ratings yet
Econometrics Exercise Set 1 Solutions
7 pages
Understanding Simple Linear Regression
No ratings yet
Understanding Simple Linear Regression
23 pages
Introduction to Linear Regression Model
No ratings yet
Introduction to Linear Regression Model
21 pages
MLR Estimation and Assumptions Explained
No ratings yet
MLR Estimation and Assumptions Explained
52 pages
Multiple Linear Regression Analysis
No ratings yet
Multiple Linear Regression Analysis
77 pages
Regression Analysis for Business Decisions
No ratings yet
Regression Analysis for Business Decisions
18 pages
Regression Analysis in Inferential Stats
No ratings yet
Regression Analysis in Inferential Stats
68 pages
Bivariate Regression Analysis Overview
100% (1)
Bivariate Regression Analysis Overview
54 pages
1.6linear Regression Model
No ratings yet
1.6linear Regression Model
17 pages
Simple Linear Regression Overview
No ratings yet
Simple Linear Regression Overview
9 pages
CH - 03 - Multiple Regression Analysis Estimation
No ratings yet
CH - 03 - Multiple Regression Analysis Estimation
36 pages
Simple Linear Regression Overview
No ratings yet
Simple Linear Regression Overview
16 pages
Understanding Multiple Regression Analysis
No ratings yet
Understanding Multiple Regression Analysis
56 pages
Chapter 3 Multiple Regression
No ratings yet
Chapter 3 Multiple Regression
25 pages
Econometrics Notes: Regression Analysis
No ratings yet
Econometrics Notes: Regression Analysis
15 pages
Understanding Multiple Linear Regression
No ratings yet
Understanding Multiple Linear Regression
37 pages
Ace Classes Ecotrex Problems
No ratings yet
Ace Classes Ecotrex Problems
64 pages
Simple Regression Model Overview
No ratings yet
Simple Regression Model Overview
33 pages
Regression Analysis in STAT 445
No ratings yet
Regression Analysis in STAT 445
49 pages
Worksheet Econometrics
No ratings yet
Worksheet Econometrics
6 pages
Introductory Econometrics Answers
No ratings yet
Introductory Econometrics Answers
6 pages
Crop Price Predictions Using Machine Learning in MP - A Pilot Study
100% (1)
Crop Price Predictions Using Machine Learning in MP - A Pilot Study
95 pages
Geostatistical Mapping of PM2.5 in Canada
No ratings yet
Geostatistical Mapping of PM2.5 in Canada
13 pages
PSMCS Tutorial II: Correlation & Regression
No ratings yet
PSMCS Tutorial II: Correlation & Regression
5 pages
Aerospace Manufacturing Cost Prediction From A Mea
No ratings yet
Aerospace Manufacturing Cost Prediction From A Mea
10 pages
Burnout in Special Needs Teachers
No ratings yet
Burnout in Special Needs Teachers
16 pages
Economic vs Non-Economic Factors in Women's Empowerment
No ratings yet
Economic vs Non-Economic Factors in Women's Empowerment
38 pages
Simple Linear Regression Explained
No ratings yet
Simple Linear Regression Explained
10 pages
Classification Metrics for Mixed Targets
100% (9)
Classification Metrics for Mixed Targets
114 pages
Data Science Overview and Key Concepts
No ratings yet
Data Science Overview and Key Concepts
11 pages
SOLIDWORKS Simulation Premium Training Guide
No ratings yet
SOLIDWORKS Simulation Premium Training Guide
2 pages
Efflux Time Analysis in Chemical Engineering
No ratings yet
Efflux Time Analysis in Chemical Engineering
21 pages
Econometrics Course Exam Questions
No ratings yet
Econometrics Course Exam Questions
3 pages
E-Commerce Sales Forecasting with ML
No ratings yet
E-Commerce Sales Forecasting with ML
20 pages
Wysax Timetable Overview for Bahrain
No ratings yet
Wysax Timetable Overview for Bahrain
43 pages
Audit Committees and Profitability in Nigeria
No ratings yet
Audit Committees and Profitability in Nigeria
33 pages
4.6 Assignment - Culminating Investigation
No ratings yet
4.6 Assignment - Culminating Investigation
9 pages
Introductory Econometrics Guide
No ratings yet
Introductory Econometrics Guide
131 pages
Workplace Nepotism and Employees Job Satifaction
No ratings yet
Workplace Nepotism and Employees Job Satifaction
10 pages
Coastal Erosion Vulnerability in Digha
No ratings yet
Coastal Erosion Vulnerability in Digha
12 pages
Canada GDP Regression Report
No ratings yet
Canada GDP Regression Report
3 pages
Determinants of Dividend Payout in IT Sector
No ratings yet
Determinants of Dividend Payout in IT Sector
9 pages
Machine Learning for Room Occupancy
No ratings yet
Machine Learning for Room Occupancy
54 pages
2021 Exam Information & Learning Objective Statements: CMT Level Ii
100% (1)
2021 Exam Information & Learning Objective Statements: CMT Level Ii
26 pages
Tourist Satisfaction in Langkawi Island
No ratings yet
Tourist Satisfaction in Langkawi Island
20 pages
Linear Regression for Calibration Curves
No ratings yet
Linear Regression for Calibration Curves
14 pages
Curve Fitting Techniques Explained
No ratings yet
Curve Fitting Techniques Explained
48 pages
24 - 1982 August
100% (1)
24 - 1982 August
76 pages
Managerial Economics 12th Edition Christopher Thomas All Chapters Available
100% (3)
Managerial Economics 12th Edition Christopher Thomas All Chapters Available
155 pages
Bivariate Descriptive Statistics Guide
No ratings yet
Bivariate Descriptive Statistics Guide
19 pages
Data Preprocessing Techniques Overview
No ratings yet
Data Preprocessing Techniques Overview
65 pages

Econometrics: MRA and Inference Basics

Uploaded by

Econometrics: MRA and Inference Basics

Uploaded by

Lecture 2: MRA and inference

Dr. Yundan Gong

1. Econometrics is the branch of economics that _____.

2. The term ‘u’ in an econometric model is usually referred to as

3. The parameters of an econometric model _____.

a. include all unobserved factors affecting the variable

b. describe the strength of the relationship between the

variable under study and the factors affecting it

c. refer to the explanatory variables included in the model

4. _____ has a causal effect on _____.

5. If a change in variable x causes a change in variable y, variable

6. In the equation is the _____.

7. And what is the estimated value of β0 ?

8. What does the equation denote if the regression

9. If the total sum of squares (SST) in a regression equation is 81,

10. If the residual sum of squares (SSR) in a regression analysis is

11. Which of the following is true of ￼?

a. ￼ is also called the standard error of regression.

12. The value of ￼ always _____.

The expected value of the OLS estimation

Efficiency of OLS: THE Gauss-Markov Theorem

Discussion of the normality assumption

Wooldridge, J., Introductory Econometrics: A Modern Approach,

EMEA , 2014, South-Western, chapter 3, 4&7 (HB129WOO2014)

Hill, Griffiths and Lim, Principles of Econometrics, 4th ed., 2011,

Wiley, chapter 5&6 (HB139HIL)

“How well does the explanatory variable explain the dependent

Total sum of squares, Explained sum of Residual sum of

Decomposition of total variation

Total Explained Unexplained

Goodness-of-fit measure (R-squared)

R-squared measures the fraction

CEO Salary and return on equity

The regression explains only

Voting outcomes and campaign expenditures

The regression explains 85.6%

Caution: A high R-squared does not necessarily mean that the

The wage increases by 8.3% for

Growth rate of wage is 8.3%

Natural logarithm of CEO Natural logarithm of his/her

CEO salary and firm sales: fitted regression

The log-log form postulates a constant elasticity model, whereas the

Example: Wage equation

Hourly Years of Years of labor market

Per student spending is likely to be correlated with average family

By how much does Depends on

Log of CEO Log Quadratic function of CEO tenure with

Model assumes a constant elasticity relationship between CEO salary

Meaning of “linear” regression

The model has to be linear in the parameters (not in the variables)

Minimize sum of squared residuals

Minimization will be carried out by

By how much does the dependent variable change if

It has still to be assumed that unobserved factors do not change if

Grade point average at High school grade point Achievement test

Holding ACT fixed, another point on high school grade point

Fitted or predicted Residu

Deviations from Covariance between Sample averages of y and

Decomposition of total variation

SST = SSE + SSR

Alternative expression for R-squared

If the proportion prior arrests increases by 0.5, the predicted fall

Average sentence in prior

R-squared increases only

Average prior sentence increases number of arrests (?)

General remark on R-squared

Even if R-squared is small (as in the given example), regression may

In the population, the

The data is a random

Each data point therefore follows the population

The assumption only rules out perfect collinearity/correlation

In a small sample, avginc may accidentally be an exact multiple of

Either shareA or shareB will have to be dropped from the regression

The value of the explanatory

In a multiple regression model, the zero conditional mean

Example: Average test scores

4. _ has a causal effect on _.

11. Which of the following is true of ?

a. is also called the standard error of regression.

12. The value of always _____.