0% found this document useful (0 votes)

9 views50 pages

Understanding Autocorrelation in Data

The document discusses autocorrelation, its nature, and implications in statistical modeling, particularly in time series data. It outlines methods for detecting autocorrelation, such as graphical methods and statistical tests like the Durbin-Watson and Breusch-Godfrey tests, as well as remedies for addressing it, including generalized least squares and the Newey-West method. The importance of understanding autocorrelation is emphasized for accurate hypothesis testing and reliable regression results.

Uploaded by

debasisheco718

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views50 pages

Understanding Autocorrelation in Data

Uploaded by

debasisheco718

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Autocorrelation

Some Questions To Deal With

• What Happens If the Error Terms Are Correlated?

• What is the nature of autocorrelation?
• What are the theoretical and practical consequences of
autocorrelation?
• How does one know that there is autocorrelation in any
given situation?
• How does one remedy the problem of autocorrelation?
Introduction
• In cross-section studies, data are often collected on the basis of a random sample
of cross-sectional units, such as households or firms.
• Therefore, there is no prior reason to believe that the error term pertaining to one
household or firm is correlated with the error term of another household or firm.
• If by chance such a correlation is observed in cross-sectional units, it is called
spatial autocorrelation, that is, correlation in space rather than over time.
• Autocorrelation:
If we are dealing with time series data, for the observations in such data follow a
natural ordering over time so that successive observations are likely to exhibit
intercorrelations, especially if the time interval between successive observations
is short, such as a day, a week, or a month rather than a year.
What is Autocorrelation and Its Nature?

• Put simply, the classical model assumes that the disturbance term relating to any observation is not
influenced by the disturbance term relating to any other observation.
• For example (time series auto correlation), if we are dealing with quarterly time series data
involving the regression of output on labor and capital inputs and if, say, there is a labor strike
affecting output in one quarter, there is no reason to believe that this disruption will be carried over to
the next quarter. That is, if output is lower this quarter, there is no reason to expect it to be lower next
quarter.
• Another Example (Spatial auto correlation): if we are dealing with cross-sectional data involving
the regression of family consumption expenditure on family income, the effect of an increase of one
family’s income on its consumption expenditure is not expected to affect the consumption
expenditure of another family.
Cont…

• In this situation, the disruption caused by a strike this quarter may very well affect
output next quarter, or the increases in the consumption expenditure of one family
may very well prompt another family to increase its consumption expenditure.
Difference Between Autocorrelation and Serial Correlation

• Although it is now a common practice to treat the terms autocorrelation

and serial correlation synonymously, some authors prefer to distinguish
the two terms.
• Tintner (1965) defines autocorrelation as “lag correlation of a given
series with itself, lagged by a number of time units,’’ whereas serial
correlation refers to “lag correlation between two different series.”
• Thus, correlation between two time series such as 𝒖𝟏 , 𝒖𝟐 , ..., 𝒖𝟏𝟎 and
𝒖𝟐 , 𝒖𝟑 , ..., 𝒖𝟏𝟏 , where the former is the latter series lagged by one time
period, is autocorrelation, whereas correlation between time series such
as 𝒖𝟏 , 𝒖𝟐 , ..., 𝒖𝟏𝟎 and 𝒗𝟐 , 𝒗𝟑 , ..., 𝒗𝟏𝟏 , where u and v are two different
time series, is called serial correlation.
Patterns of Auto- and Non-Autocorrelation,
Why Does Serial Correlation Occur?
Why Does Serial Correlation Occur?
Why Does Serial Correlation Occur?

• Lags:
o In a time series regression of consumption expenditure on income, it is not
uncommon to find that the consumption expenditure in the current period depends,
among other things, on the consumption expenditure of the previous period.

o Above regression is known as autoregression because one of the explanatory

variables is the lagged value of the dependent variable.
o The rationale for the above model is that consumers do not change their
consumption habits readily for psychological, technological, or institutional reasons.
o Now if we neglect the lagged term in Eq. (12.1.7), the resulting error term will
reflect a systematic pattern due to the influence of lagged consumption on current
consumption.
Why Does Serial Correlation Occur?

• Manipulation of Data:
o In time series regressions involving quarterly data, such data are usually derived from the monthly
data by simply adding three monthly observations and dividing the sum by 3.
o This averaging introduces smoothness into the data by dampening the fluctuations in the monthly
data.
o Therefore, the graph plotting the quarterly data looks much smoother than the monthly data, and this
smoothness may itself lend to a systematic pattern in the disturbances, thereby introducing
autocorrelation.
o Another source of manipulation is interpolation or extrapolation of data.
o For example, the Census of Population is conducted every 10 years in this country, the last being in
2000 and the one before that in 1990. Now if there is a need to obtain data for some year within the
inter-census period 1990–2000, the common practice is to interpolate on the basis of some ad hoc
assumptions.
o All such data “massaging’’ techniques might impose upon the data a systematic pattern that might not
exist in the original data.
Why Does Serial Correlation Occur?
Why Does Serial Correlation Occur?

Proof of error term 𝒗𝒕 in Eq. (4) is autocorrelated.

Why Does Serial Correlation Occur?
Mean, Variance, and Covariance of error term (𝒖𝒕 ) and OLS estimator (𝜷𝟐 )
Cont..
Consequences of Using OLS in the Presence of Autocorrelation

• As in the case of heteroscedasticity, in the presence of autocorrelation the

OLS estimators are still linear unbiased as well as consistent and
asymptotically normally distributed, but they are no longer efficient (i.e.,
minimum variance).
• What then happens to our usual hypothesis testing procedures if we
continue to use the OLS estimators?
OLS Estimation Allowing for Autocorrelation

The implication of this finding for hypothesis

testing is that we are likely to declare a coefficient
statistically insignificant even though in fact (i.e.,
based on the correct GLS procedure) it may be.

In the fig since b2 lies in the OLS confidence

interval, we could accept the hypothesis that true
β2 is zero with 95 percent confidence.

But if we were to use the (correct) GLS confidence

interval, we could reject the null hypothesis that
true β2 is zero, forb2 lies in the region of rejection.
OLS Estimation Disregarding Autocorrelation
How do we know if our data suffer from autocorrelation??
How do we know if our data suffer from autocorrelation??
Double-Log Model
Linear Model

Qualitatively, both the models give similar results. In both cases the estimated
coefficients are “highly” significant, as indicated by the high t values.

• How reliable are the results given in the above two models if there is autocorrelation?
• As stated previously, if there is autocorrelation, the estimated standard errors are biased, as a result of
which the estimated t ratios are unreliable.
• We obviously need to find out if our data suffer from autocorrelation.
Detecting Autocorrelation

1. Graphical Method
2. Runs Test
3. Durbin–Watson d Test
4. Breusch–Godfrey (BG) Test
Detecting Autocorrelation: Graphical Method

𝒖𝟐𝒕 ) against time can

ෝ 𝒕 or (ෝ
• A visual examination of 𝒖
provide useful information about autocorrelation.
That is called time sequence plot
• Alternatively, we can plot the standardized residuals
against time which is shown in figure.

• Examining the time sequence plot given in Figure, we

ෝ 𝒕 and the standardized 𝒖
observe that both 𝒖 ෝ 𝒕 exhibit a
pattern suggesting that perhaps 𝒖𝒕 are not random.
Detecting Autocorrelation: Graphical Method

• To see this differently, we

can plot 𝒖 ෝ 𝒕 against 𝒖
ෝ 𝒕−𝟏 ,
that is, plot the residuals at
time t against their value at
time (t - 1), a kind of
empirical test of the AR(1)
scheme.
• If the residuals are
nonrandom, we should
obtain pictures similar to
those shown in following
Figure.
Detecting Autocorrelation: Runs Test

• If we carefully examine this Figure, we

notice a peculiar feature: Initially, we have
several residuals that are negative, then
there is a series of positive residuals, and
then there are several residuals that are
negative.
• If these residuals were purely random,
could we observe such a pattern?
• Intuitively, it seems unlikely.
• This intuition can be checked by the so-
called runs test, sometimes also known as
the Geary test, a nonparametric test.
Detecting Autocorrelation: Explaining Runs Test
• To explain the runs test, let us simply note down
the signs (+ or -) of the residuals obtained from
the regression, which are given in the first column
of Table.
• Thus there are 8 negative residuals, followed by
21 positive residuals, followed by 11 negative
residuals, followed by 3 positive residuals,
followed by 3 negative residuals, for a total of 46
observations.
• We now define a run as an uninterrupted
sequence of one symbol or attribute, such as + or
−. We further define the length of a run as the
number of elements in it.
• In the sequence shown above in bracket, there are
5 runs: a run of 8 minuses (i.e., of length 8), a run
of 21 pluses (i.e., of length 21), a run of 11
minuses (i.e., of length 11), a run of 3 pluses (i.e.,
of length 3), and a run of 3 minuses (i.e., of
length 3).
Cont…
Are the 5 runs observed in our example
consisting of 46 observations too many
or too few compared with the number
of runs expected in a strictly random
sequence of 46 observations?

• If there are too many runs, it would

mean that in our example the residuals
change sign frequently, thus indicating
negative serial correlation.

• Similarly, if there are too few runs,

they may suggest positive
autocorrelation. A priori, then, would
indicate positive correlation in the
residuals.
Obviously, this interval does not include 5. Hence, we can reject
the hypothesis that the residuals in our wages–productivity
regression are random with 95% confidence.
Durbin–Watson d Test
Important Assumptions of Durbin–Watson d Test
Durbin–Watson d Test: Decision Rule
Durbin–Watson d Test: Decision Rule
Breusch–Godfrey (BG) Test: A General Test of Autocorrelation
• Breusch and Godfrey have developed a test of autocorrelation that is general in the
sense that it allows for
(1) nonstochastic regressors, such as the lagged values of the regressand;
(2) higher-order autoregressive schemes, such as AR(1), AR(2), etc.; and

(3) simple or higher-order moving averages of white noise error terms, such as ε𝒕
• BG test is also known as the LM test as it is based on the Lagrange multiplier principle.
Steps in Breusch–Godfrey (BG) Test
Remedies
What to Do When You Find Autocorrelation?

We have four options as follows:

1. Try to find out if the autocorrelation is pure autocorrelation and not the result of
mis-specification of the model.
2. If it is pure autocorrelation, one can use appropriate transformation of the
original model so that in the transformed model we do not have the problem of
(pure) autocorrelation. As in the case of heteroscedasticity, we will have to use some
type of generalized least-square (GLS) method.
3. In large samples, we can use the Newey–West method to obtain standard errors of
OLS estimators that are corrected for autocorrelation. This method is actually an
extension of White’s heteroscedasticity-consistent standard errors method.
4. In some situations we can continue to use the OLS method.
Remedies
A. Model Mis-Specification versus Pure Autocorrelation
Cont…
Remedies
B. Correcting for (Pure) Autocorrelation: Generalized Least Squares (GLS) Method
Once we got to know that its pure autocorrelation not specification error, then what to do?

• The remedy depends on the knowledge one has about the nature of interdependence
among the disturbances, that is, knowledge about the structure of autocorrelation.
Remedies
(B.1) When ρ is Known
Cont…
Remedies
(B.2) When ρ Is Not Known

Since the error term (ϵ𝒕 ) in last Equation is free from (first-order) serial correlation, to run the regression all one has to
do is form the first differences of both the regressand and regressor(s) and run the regression on these first differences.
Cont…
Cont…

• Compared with the level form regression (level form), we see that the slope coefficient has not changed much,
but the r 𝟐 value has dropped considerably.
• This is generally the case because by taking the first differences we are essentially studying the behavior of
variables around their (linear) trend values.
• We cannot compare the r 𝟐 of Eq. (first difference) directly with that of the r 𝟐 of Eq. (level form) because the
dependent variables in the two models are different.
• Also, notice that compared with the original regression, the d value has increased dramatically, perhaps
indicating that there is little autocorrelation in the first difference regression.
Cont…

• If the original time series are nonstationary, very often their first differences become stationary.
• Therefore, first-difference transformation serves a dual purpose in that it might get rid of (first-order)
autocorrelation and also render the time series stationary.
Cont…

(B) ρ Estimated from the Residuals:

where 𝒖ෝ 𝒕 are the residuals obtained from the original (level form) regression and where 𝒗𝒕
are the error term of this regression.
Note that there is no need to introduce the intercept term in the above Equation, for
we know the OLS residuals sum to zero.

Use ρො in estimating the following equation

Cont…
(B) ρ Estimated from the Residuals: example

Use ρො in estimating the following equation

Cont…

(C) Iterative Methods of Estimating ρ:

The Newey–West Method of Correcting the OLS Standard Errors

Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
23 pages
Understanding Autocorrelation in Data
No ratings yet
Understanding Autocorrelation in Data
84 pages
Understanding Autocorrelation in Time Series
No ratings yet
Understanding Autocorrelation in Time Series
49 pages
Autocorrelation in Linear Regression
No ratings yet
Autocorrelation in Linear Regression
18 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
8 pages
Understanding Autocorrelation in Econometrics
No ratings yet
Understanding Autocorrelation in Econometrics
17 pages
Understanding Autocorrelation in Errors
No ratings yet
Understanding Autocorrelation in Errors
10 pages
Understanding Autocorrelation in Econometrics
No ratings yet
Understanding Autocorrelation in Econometrics
18 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
22 pages
Lect7 Serial EF3450 20260115a
No ratings yet
Lect7 Serial EF3450 20260115a
21 pages
Understanding Autocorrelation in Econometrics
No ratings yet
Understanding Autocorrelation in Econometrics
28 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
6 pages
Correlated Error Terms in Autocorrelation
No ratings yet
Correlated Error Terms in Autocorrelation
21 pages
Autocorrelation Lecture Notes
No ratings yet
Autocorrelation Lecture Notes
5 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
8 pages
Understanding Autocorrelation in Econometrics
100% (1)
Understanding Autocorrelation in Econometrics
8 pages
Understanding Autocorrelation in Economics
No ratings yet
Understanding Autocorrelation in Economics
45 pages
Autocorrelation in Regression Analysis
No ratings yet
Autocorrelation in Regression Analysis
16 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
14 pages
Understanding Autocorrelation in Data
No ratings yet
Understanding Autocorrelation in Data
33 pages
Understanding Autocorrelation in CLRM
No ratings yet
Understanding Autocorrelation in CLRM
31 pages
Eco 222 - Autocorrelation 23.03.2015
No ratings yet
Eco 222 - Autocorrelation 23.03.2015
5 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
24 pages
Understanding Autocorrelation Effects
No ratings yet
Understanding Autocorrelation Effects
43 pages
Understanding Autocorrelation in Econometrics
No ratings yet
Understanding Autocorrelation in Econometrics
37 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
13 pages
Understanding Serial Correlation in Time Series
No ratings yet
Understanding Serial Correlation in Time Series
6 pages
Understanding Autocorrelation in Econometrics
No ratings yet
Understanding Autocorrelation in Econometrics
23 pages
Understanding Autocorrelation in OLS
No ratings yet
Understanding Autocorrelation in OLS
36 pages
Understanding Autocorrelation in Econometrics
No ratings yet
Understanding Autocorrelation in Econometrics
52 pages
Causes and Measures of Autocorrelation
No ratings yet
Causes and Measures of Autocorrelation
16 pages
Remedial Measures for Autocorrelation
No ratings yet
Remedial Measures for Autocorrelation
11 pages
Autocorrelation and Its Statistical Tests
100% (2)
Autocorrelation and Its Statistical Tests
13 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
54 pages
Autocorrelation in Financial Econometrics
No ratings yet
Autocorrelation in Financial Econometrics
49 pages
AUTOCORELATION
No ratings yet
AUTOCORELATION
28 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
25 pages
Autocorrelation in Regression Analysis
No ratings yet
Autocorrelation in Regression Analysis
34 pages
Chapter - 4b Violation of CLRM Assumptions v2
No ratings yet
Chapter - 4b Violation of CLRM Assumptions v2
80 pages
Understanding Autocorrelation in Time Series
No ratings yet
Understanding Autocorrelation in Time Series
5 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
17 pages
Autocorrelation in Time Series Data
No ratings yet
Autocorrelation in Time Series Data
37 pages
Understanding Autocorrelation in OLS
No ratings yet
Understanding Autocorrelation in OLS
52 pages
Understanding Autocorrelation in Data
No ratings yet
Understanding Autocorrelation in Data
5 pages
Understanding Autocorrelation in Time Series
No ratings yet
Understanding Autocorrelation in Time Series
52 pages
Causes of Autocorrelation in Data
No ratings yet
Causes of Autocorrelation in Data
17 pages
11 Autocorrelation
No ratings yet
11 Autocorrelation
17 pages
Understanding Autocorrelation in OLS
No ratings yet
Understanding Autocorrelation in OLS
16 pages
Analyzing Time Series Data Patterns
No ratings yet
Analyzing Time Series Data Patterns
24 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
6 pages
Understanding Serial Correlation in Regression
No ratings yet
Understanding Serial Correlation in Regression
27 pages
Autocorrelation
No ratings yet
Autocorrelation
65 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
20 pages
Understanding Autocorrelation in Regression
No ratings yet
Understanding Autocorrelation in Regression
15 pages
Understanding Autocorrelation in Time Series
No ratings yet
Understanding Autocorrelation in Time Series
29 pages
Calypso Curve
No ratings yet
Calypso Curve
116 pages
Data Analytics With Python - Unit 10 - Week 7
No ratings yet
Data Analytics With Python - Unit 10 - Week 7
3 pages
Linear and Logistic Regression Overview
No ratings yet
Linear and Logistic Regression Overview
22 pages
Quadratic Curve Fitting Explained
No ratings yet
Quadratic Curve Fitting Explained
28 pages
Modeling Rock Climbing Falls Dynamics
No ratings yet
Modeling Rock Climbing Falls Dynamics
36 pages
Advanced Curve Fitting in Origin
No ratings yet
Advanced Curve Fitting in Origin
2 pages
BCS Impact on Colostrum Production
No ratings yet
BCS Impact on Colostrum Production
6 pages
GRACE TWSA Trends in Nile Basin Analysis
No ratings yet
GRACE TWSA Trends in Nile Basin Analysis
14 pages
Linear Models in Animal Breeding Analysis
No ratings yet
Linear Models in Animal Breeding Analysis
33 pages
Model Evaluation in Prediction Analysis
No ratings yet
Model Evaluation in Prediction Analysis
2 pages
Information Science Engineering Syllabus 2019-23
No ratings yet
Information Science Engineering Syllabus 2019-23
191 pages
Multilevel Binary Logistic Regression SPSS
No ratings yet
Multilevel Binary Logistic Regression SPSS
52 pages
Econ 122b Uc Irvine
No ratings yet
Econ 122b Uc Irvine
5 pages
Data Science Course Syllabus: Unit 2
No ratings yet
Data Science Course Syllabus: Unit 2
15 pages
Testbank for Real Stats in Econometrics
No ratings yet
Testbank for Real Stats in Econometrics
16 pages
Linear Regression in Data Mining
No ratings yet
Linear Regression in Data Mining
27 pages
Understanding Linear Regression Basics
No ratings yet
Understanding Linear Regression Basics
25 pages
Earnings per Share Regression Analysis
No ratings yet
Earnings per Share Regression Analysis
4 pages
Understanding Correlation and Regression
No ratings yet
Understanding Correlation and Regression
3 pages
Understanding Panel Data Regression
No ratings yet
Understanding Panel Data Regression
5 pages
Exponentials and Logarithms Overview
No ratings yet
Exponentials and Logarithms Overview
21 pages
Simple Linear Regression Overview
No ratings yet
Simple Linear Regression Overview
17 pages
Understanding Multivariate Regression
No ratings yet
Understanding Multivariate Regression
61 pages
Least Squares Method in Time Series Analysis
No ratings yet
Least Squares Method in Time Series Analysis
8 pages
ANOVA and Regression Analysis in Excel
No ratings yet
ANOVA and Regression Analysis in Excel
15 pages
Understanding Regression in Machine Learning
No ratings yet
Understanding Regression in Machine Learning
8 pages
Learning Styles Impact on Matric Performance
No ratings yet
Learning Styles Impact on Matric Performance
50 pages
Estimating Binary Models in EViews 6
No ratings yet
Estimating Binary Models in EViews 6
12 pages
Linear Regression Overview and Metrics
No ratings yet
Linear Regression Overview and Metrics
12 pages
Multiple Linear Regression Analysis Assignment
No ratings yet
Multiple Linear Regression Analysis Assignment
4 pages