0% found this document useful (0 votes)

7 views31 pages

Chap 2 Panel Data

The document discusses panel data models, focusing on fixed effects and difference-in-differences (DiD) methods for estimating causal effects in economics. It explains how fixed effects control for unobserved, time-invariant characteristics and how DiD combines before-and-after comparisons to account for time trends and treatment effects. Additionally, it briefly introduces random effects models and their key assumptions.

Uploaded by

nguyenthithuthao2547

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views31 pages

Chap 2 Panel Data

Uploaded by

nguyenthithuthao2547

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Lecture 2: Panel Data Models

Dr. Viet Ha Pham

Faculty of Development Economics
University of Economics and Business, Vietnam National University, Hanoi

Semester II, 2025-2026

1 / 30
Fixed Effects Models

2x2 Difference-in-difference

Bonus: Random Effects Models

2 / 30
Fixed Effects Models

3 / 30
Panel Data

▶ Multiple units for multiple periods

▶ For the same unit, you can observe them in many periods of time
▶ Can you give me an example?
▶ Anyone remember dummy variables? (Someone informs me that it is called "biến
giả"in Vietnamese)

4 / 30
Panel Data and Control for the Unobservable
▶ Consider the univariate OLS model:

yit = β0 + β1 xit + εit ,

▶ There are i = 1, 2, ..., N units and t = 1, 2, ..., T time periods.

▶ OLS estimates would be biased if the error term εit is correlated with the
explanatory variable xit
▶ If we have panel data, we can "zoom"into how units change over time.
▶ We are controlling for anything that are fixed over time by doing this
▶ Mechanically, we can include a dummy variables which is equal 1 for the specific
unit.
▶ The name "fixed"effects also suggests that we are controlling for anything that is
fixed over time.

5 / 30
Fixed Effects Model
▶ More formally, consider a panel data model with a single explanatory variable:

yit = β1 xit + ai + uit ,

▶ Can be easily expanded to multiple explanatory variables

▶ εit is broken into two terms
▶ ai captures unobserved, time-invariant characteristics for individual i
▶ Example:
▶ The econometrician observe labor outcome (wages, output, productivity,...) of a
fixed set of workers over 10 years.
▶ ai captures unobserved characteristics of the individuals which does not change over
time (Can you give me an example?)
▶ ai does not capture characteristics which change over time (Can you give me an
example?)

6 / 30
Fixed Effects Estimation

▶ Averaging over time for each individual i, we obtain:

ȳi = β1 x̄i + ai + ūi ,

where bars denote time averages.

▶ Subtracting the time-averaged equation from the original equation:

yit − ȳi = β1 (xit − x̄i ) + (uit − ūi ).

▶ This is known as the within transformation.

▶ The unobserved effect ai is eliminated.

7 / 30
Fixed Effects Estimation

▶ Under a strict exogeneity assumption, the fixed effects estimator is unbiased.

▶ The FE estimator allows arbitrary correlation between ai and the explanatory
variables.
▶ Any explanatory variable that is constant over time is removed by the fixed effects
transformation.

8 / 30
Fixed Effects Estimation

▶ The composite error is:

εit = ai + uit .
▶ Since ai is constant over time, εit is serially correlated.
▶ Correct standard errors must account for the panel (grouped) structure of the
data.
▶ Manually transforming the data (e.g. demeaning for fixed effects) yields correct
coefficient estimates but not correct standard errors.
▶ Standard errors therefore need to be clustered, allowing error terms to be
correlated within each unit over time.
▶ In practice, we use built-in commands and library (e.g., xtreg, fe in Stata,
fixest in R or pyfixest in Python).

9 / 30
Question

▶ In a fixed effects regression, the fixed effect term plays a role similar to a familiar
component in a standard OLS regression.
▶ What is this component, and in what sense are they similar?

10 / 30
Answer

▶ A fixed effects model can be viewed as an OLS regression in which each unit has
its own intercept, capturing all time-invariant differences across units.
▶ Fun fact: Each ai has no meaning in levels, because all fixed effects are only
identified up to an additive constant.

11 / 30
One more Questions :)

How do we interpret the results of FE models once we have estimated it?

12 / 30
Answer :)

How do we interpret the results of FE models once we have estimated it?

▶ Answer: Within the same unit, how is variation in X related to variation in Y.

13 / 30
2x2 Difference-in-difference

14 / 30
Motivation

▶ One of the most widely used empirical methods in applied economics (and other
fields).
▶ Goal: estimate the causal effect of a treatment when randomized experiments are
not available.

15 / 30
Classic Example (Snow (1856))
▶ Prevailing belief at the time: cholera was caused by bad air (the miasma theory).

Hình: An 1831 color lithograph by Robert Seymour depicts cholera as a deadly cloud.

16 / 30
Classic Example (Snow (1856))

▶ John Snow’s hypothesis: cholera is transmitted through contaminated drinking

water.
▶ Setting: Cholera outbreak in London, 1854. London households received water
from different private water companies. One water company changed its water
intake upstream higher up the Thames in 1849.
▶ Control group: Households supplied by a water company drawing water
downstream from the Thames.
▶ Treatment group: Households supplied by companies drawing water upstream.
▶ Key insight: Households were geographically mixed, but water sources differed.
▶ Outcome: Cholera mortality rates. Compare changes in mortality:
▶ Before vs. after 1849,
▶ Between households with contaminated vs. clean water sources.

17 / 30
Classic Example (Snow (1856))

18 / 30
Motivation and Introduction

▶ Many policy questions involve outcomes observed for the same units over time.
▶ Simple before–after comparisons are misleading due to time trends.
▶ Simple treated-control comparisons are misleading due to permanent differences.
▶ Difference-in-Differences combines both comparisons.
▶ DiD is a panel data method that removes:
▶ time-invariant differences across units
▶ common shocks over time

19 / 30
Difference-in-Differences: The 2×2 Panel Setup

▶ Observe an outcome Yit for two groups over two periods.

▶ Groups:
▶ Treated group (Di = 1),
▶ Control group (Di = 0).
▶ Periods:
▶ Pre-treatment (t = 0),
▶ Post-treatment (t = 1).
▶ Parallel trends assumption: Had no treatment occurred, the gap between
treated and untreated groups would have remained constant.
▶ Policy impact is identified from changes in Yit over time between the two groups.

20 / 30
Difference-in-Differences: The 2×2 Panel Setup

▶ Observe an outcome Yit for two groups over two periods.

▶ For the treated group (Di = 1), the change in outcomes is:

∆YT = ȲT ,post − ȲT ,pre .

▶ For the control group (Di = 0), the change in outcomes is:

∆YC = ȲC ,post − ȲC ,pre .

▶ The Difference-in-Differences estimator is:

d = ∆YT − ∆YC .
DiD

21 / 30
DiD Graphics

22 / 30
DiD estimation

▶ The 2×2 Difference-in-Differences regression can be written as:

Yit = β0 + β1 Treati + β2 Postt + β3 (Treati × Postt ) + εit .

▶ Treati : indicator for units that are ever treated.

▶ Postt : indicator for post-treatment periods.
▶ β3 : the DiD estimator, identified from within-unit changes over time.
▶ DiD is a building block for many modern panel data methods.

23 / 30
Understanding 2x2 DiD

Treatment = 0 Treatment = 1 Difference

Post = 0 β0 β0 + β 1 β1
Post = 1 β0 + β2 β0 + β 1 + β 2 + β 3 β1 + β3
Difference β2 β2 + β 3 β3

24 / 30
Questions, comments, and discussions are welcome.
Email: viethap@[Link]

25 / 30
Bonus: Random Effects Models

26 / 30
Random Effects Models

We start from the same panel data model as in Fixed Effects:

yit = βxit + ai + uit ,

where:
▶ ai is an individual-specific effect
▶ uit is an idiosyncratic error term

27 / 30
Key Assumption (Random Effects)

The crucial assumption in the Random Effects (RE) model is:

E[ai | xi1 , xi2 , . . . , xiT ] = 0.

▶ The individual effect ai is uncorrelated with the regressors

▶ This is stronger than the Fixed Effects assumption
▶ We can estimate β using one crosssection of the data, or used pooled OLS
▶ But we could do better

28 / 30
Serial Correlation in Random Effects

Consider the panel model:

yit = βxit + ai + uit .
▶ The composite error is:
vit = ai + uit .
▶ Because ai is common across time for unit i, the composite errors are serially
correlated:
σ2
Corr(vit , vis ) = 2 a 2 , t ̸= s.
σa + σ u
▶ Pooled OLS ignores this correlation:
▶ Coefficients are unbiased
▶ Standard errors and test statistics are incorrect

29 / 30
Quasi-Demeaning Transformation

The GLS-transformed equation is:

yit − λȳi = β0 (1 − λ) + β(xit − λx̄i ) + (vit − λv̄i ),

where bars denote time averages.

▶ This is called quasi-demeaning
▶ Compare:
▶ Fixed Effects: subtract full time average
▶ Random Effects: subtract a fraction of the time average
▶ The fraction λ depends on:
σa2 , σu2 , and T .
▶ GLS = pooled OLS on the transformed equation

30 / 30
Snow, J. (1856). Cholera and the water supply in the south districts of london in 1854.
Journal of Public Health, and Sanitary Review, 2(7):239.

30 / 30

Fixed Effects vs. Random Effects Models
No ratings yet
Fixed Effects vs. Random Effects Models
20 pages
Fixed Effects in Panel Data Analysis
No ratings yet
Fixed Effects in Panel Data Analysis
61 pages
Panel Data Analysis Techniques Explained
No ratings yet
Panel Data Analysis Techniques Explained
61 pages
Panel Data Analysis Techniques Explained
No ratings yet
Panel Data Analysis Techniques Explained
14 pages
Advanced Panel Data Analysis Techniques
No ratings yet
Advanced Panel Data Analysis Techniques
13 pages
Advanced Panel Data Methods Explained
No ratings yet
Advanced Panel Data Methods Explained
22 pages
Panel Data Regression Techniques Explained
No ratings yet
Panel Data Regression Techniques Explained
32 pages
Understanding Fixed Effects in Panel Data
100% (1)
Understanding Fixed Effects in Panel Data
11 pages
Understanding Panel Data Models
No ratings yet
Understanding Panel Data Models
20 pages
Longitudinal Data for Causal Inference
No ratings yet
Longitudinal Data for Causal Inference
17 pages
Panel Data Analysis Techniques Explained
No ratings yet
Panel Data Analysis Techniques Explained
25 pages
Panel Data Regression Explained
No ratings yet
Panel Data Regression Explained
9 pages
06 Panel Data Approaches
No ratings yet
06 Panel Data Approaches
38 pages
Fixed Effects Estimators in Panel Data
No ratings yet
Fixed Effects Estimators in Panel Data
38 pages
Understanding Panel Data Analysis
No ratings yet
Understanding Panel Data Analysis
50 pages
Advanced Panel Data Methods in Econometrics
100% (1)
Advanced Panel Data Methods in Econometrics
38 pages
Notes13 PDF
No ratings yet
Notes13 PDF
9 pages
Difference-in-Differences Evaluation Methods
No ratings yet
Difference-in-Differences Evaluation Methods
14 pages
Understanding Panel Data Models
No ratings yet
Understanding Panel Data Models
39 pages
Dynamic Panel Data Methods Overview
No ratings yet
Dynamic Panel Data Methods Overview
47 pages
Panel Data Analysis in Econometrics
No ratings yet
Panel Data Analysis in Econometrics
56 pages
Understanding Panel Data Models
No ratings yet
Understanding Panel Data Models
20 pages
Panel Data Regression Techniques
No ratings yet
Panel Data Regression Techniques
76 pages
Week 3 Advanced Panel Data Methods
No ratings yet
Week 3 Advanced Panel Data Methods
20 pages
Understanding Panel Data Models
No ratings yet
Understanding Panel Data Models
46 pages
Difference-in-Differences Analysis Explained
No ratings yet
Difference-in-Differences Analysis Explained
14 pages
Panel Data Analysis: Fixed vs Random Effects
No ratings yet
Panel Data Analysis: Fixed vs Random Effects
42 pages
Pooled vs. Panel Data Analysis
No ratings yet
Pooled vs. Panel Data Analysis
9 pages
Understanding Panel Data Models
No ratings yet
Understanding Panel Data Models
46 pages
Panel Data Presentation Slide
No ratings yet
Panel Data Presentation Slide
29 pages
Panel Data Models in Econometrics
No ratings yet
Panel Data Models in Econometrics
18 pages
Understanding Panel Data Models
No ratings yet
Understanding Panel Data Models
19 pages
Panel Data Regression Techniques
100% (2)
Panel Data Regression Techniques
30 pages
Panel Data Analysis: Before and After Method
No ratings yet
Panel Data Analysis: Before and After Method
14 pages
Understanding Panel Data Estimation
No ratings yet
Understanding Panel Data Estimation
21 pages
Panel Fixed Effects in Econometrics
No ratings yet
Panel Fixed Effects in Econometrics
18 pages
Addressing Endogeneity with Panel Data
No ratings yet
Addressing Endogeneity with Panel Data
35 pages
Panel Data Analysis in Empirical Methods
No ratings yet
Panel Data Analysis in Empirical Methods
84 pages
Panel Data Analysis: Fixed vs Random Effects
No ratings yet
Panel Data Analysis: Fixed vs Random Effects
8 pages
Part 8 Panel Regression DP 2025
No ratings yet
Part 8 Panel Regression DP 2025
36 pages
Static Panel Data Analysis and Models
No ratings yet
Static Panel Data Analysis and Models
21 pages
Causal Inference: Event Studies & DiD
No ratings yet
Causal Inference: Event Studies & DiD
50 pages
Panel Data Analysis: Fixed vs Random Effects
No ratings yet
Panel Data Analysis: Fixed vs Random Effects
8 pages
Materi Teknik Data Panel
No ratings yet
Materi Teknik Data Panel
30 pages
Panel Regression in 2b2t Census Analysis
No ratings yet
Panel Regression in 2b2t Census Analysis
18 pages
Slides 6 Man
No ratings yet
Slides 6 Man
46 pages
Econometrics Complete Guide Part2
No ratings yet
Econometrics Complete Guide Part2
26 pages
Understanding Panel Data Regression Techniques
No ratings yet
Understanding Panel Data Regression Techniques
25 pages
Difference-in-Differences Method Explained
No ratings yet
Difference-in-Differences Method Explained
4 pages
Econometric Methods for Panel Data Analysis
No ratings yet
Econometric Methods for Panel Data Analysis
4 pages
Understanding Panel Data Analysis
No ratings yet
Understanding Panel Data Analysis
57 pages
Causal Reasoning in Event Studies
No ratings yet
Causal Reasoning in Event Studies
50 pages
Evaluating Randomized Management Experiments
No ratings yet
Evaluating Randomized Management Experiments
37 pages
Panel Using Stata
No ratings yet
Panel Using Stata
40 pages
Fixed Effects in Panel Data Econometrics
100% (2)
Fixed Effects in Panel Data Econometrics
34 pages
Louie Gascon's Empowered Consumerism Guide
No ratings yet
Louie Gascon's Empowered Consumerism Guide
4 pages
Nursing Performance Evaluation Checklist
No ratings yet
Nursing Performance Evaluation Checklist
2 pages
Contact Information for University of Dhaka
No ratings yet
Contact Information for University of Dhaka
3 pages
Non-Brownian Suspension Dynamics Analysis
No ratings yet
Non-Brownian Suspension Dynamics Analysis
18 pages
Balraj Singh Malik Vs Govt. of NCT of Delhi & Anr On 22 Decembe
No ratings yet
Balraj Singh Malik Vs Govt. of NCT of Delhi & Anr On 22 Decembe
8 pages
Git Basics: Undoing Changes Guide
No ratings yet
Git Basics: Undoing Changes Guide
4 pages
Geometric Transformations for Image Alignment
No ratings yet
Geometric Transformations for Image Alignment
11 pages
Boosting Tobacco Taxes in Timor-Leste
No ratings yet
Boosting Tobacco Taxes in Timor-Leste
11 pages
Empathy in Design Thinking Process
No ratings yet
Empathy in Design Thinking Process
6 pages
Analysis of Diversified Mutual Funds
No ratings yet
Analysis of Diversified Mutual Funds
12 pages
Grace Chisala CV - Mass Communication Graduate
No ratings yet
Grace Chisala CV - Mass Communication Graduate
4 pages
ADC DAC Interfacing With FPGA - ADC DAC VHDL Code
No ratings yet
ADC DAC Interfacing With FPGA - ADC DAC VHDL Code
8 pages
Zasady Korespondencji Formalnej w Angielskim
No ratings yet
Zasady Korespondencji Formalnej w Angielskim
33 pages
Position Sensors: A Beginner's Guide
No ratings yet
Position Sensors: A Beginner's Guide
13 pages
Multi-Org Features in Oracle 11i
No ratings yet
Multi-Org Features in Oracle 11i
13 pages
Fashion Invoice Template Summary
No ratings yet
Fashion Invoice Template Summary
3 pages
Cambridge Exam Timetable May/June 2025
No ratings yet
Cambridge Exam Timetable May/June 2025
2 pages
Owner of Airtel Network Explained
No ratings yet
Owner of Airtel Network Explained
23 pages
June 2024 MIS Report: Production & Sales Analysis
No ratings yet
June 2024 MIS Report: Production & Sales Analysis
10 pages
Karnataka Wind Power Project List
0% (1)
Karnataka Wind Power Project List
42 pages
Java Full Stack Internship Report
No ratings yet
Java Full Stack Internship Report
13 pages
EMC Engineering Basics Overview
No ratings yet
EMC Engineering Basics Overview
16 pages
Career Assessment Analysis Report
No ratings yet
Career Assessment Analysis Report
26 pages
Sieve Analysis and Sampling Techniques
No ratings yet
Sieve Analysis and Sampling Techniques
20 pages
Cyber Insurance Policy Wordings
No ratings yet
Cyber Insurance Policy Wordings
11 pages
Thakur, BaTiO3
No ratings yet
Thakur, BaTiO3
13 pages
FMC Pump Assembly Report 1284778
No ratings yet
FMC Pump Assembly Report 1284778
9 pages
Understanding Vegetables and Starch Types
No ratings yet
Understanding Vegetables and Starch Types
26 pages
Understanding Materials Handling Basics
No ratings yet
Understanding Materials Handling Basics
12 pages
Green Earth Natural Caskets
No ratings yet
Green Earth Natural Caskets
4 pages

Chap 2 Panel Data

Uploaded by

Chap 2 Panel Data

Uploaded by

Lecture 2: Panel Data Models

Dr. Viet Ha Pham

Semester II, 2025-2026

Bonus: Random Effects Models

▶ Multiple units for multiple periods

yit = β0 + β1 xit + εit ,

▶ There are i = 1, 2, ..., N units and t = 1, 2, ..., T time periods.

yit = β1 xit + ai + uit ,

▶ Can be easily expanded to multiple explanatory variables

▶ Averaging over time for each individual i, we obtain:

ȳi = β1 x̄i + ai + ūi ,

where bars denote time averages.

yit − ȳi = β1 (xit − x̄i ) + (uit − ūi ).

▶ This is known as the within transformation.

▶ Under a strict exogeneity assumption, the fixed effects estimator is unbiased.

▶ The composite error is:

How do we interpret the results of FE models once we have estimated it?

How do we interpret the results of FE models once we have estimated it?

▶ John Snow’s hypothesis: cholera is transmitted through contaminated drinking

▶ Observe an outcome Yit for two groups over two periods.

▶ Observe an outcome Yit for two groups over two periods.

∆YT = ȲT ,post − ȲT ,pre .

∆YC = ȲC ,post − ȲC ,pre .

▶ The Difference-in-Differences estimator is:

▶ The 2×2 Difference-in-Differences regression can be written as:

Yit = β0 + β1 Treati + β2 Postt + β3 (Treati × Postt ) + εit .

▶ Treati : indicator for units that are ever treated.

Treatment = 0 Treatment = 1 Difference

We start from the same panel data model as in Fixed Effects:

yit = βxit + ai + uit ,

The crucial assumption in the Random Effects (RE) model is:

E[ai | xi1 , xi2 , . . . , xiT ] = 0.

▶ The individual effect ai is uncorrelated with the regressors

Consider the panel model:

The GLS-transformed equation is:

yit − λȳi = β0 (1 − λ) + β(xit − λx̄i ) + (vit − λv̄i ),

where bars denote time averages.

You might also like