0% found this document useful (0 votes)

16 views24 pages

Time Series Analysis Techniques

Chapter 6 discusses time series analysis, which involves predicting future values based on past data organized chronologically. It covers the classification of variations in time series, including trends, seasonality, and irregular fluctuations, as well as methods for analyzing and removing these components. The chapter also introduces key concepts such as stationary time series, autocorrelation, and the use of Python for time-series analysis.

Uploaded by

mazaa.xr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views24 pages

Time Series Analysis Techniques

Uploaded by

mazaa.xr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

CHAPTER 6

Time Series
A time series is a series of data points arranged chronologically. Most
commonly, the time points are equally spaced. A few examples are the
passenger loads of an airline recorded each month for the past two years
or the price of an instrument in the share market recorded each day for the
last year. The primary aim of time-series analysis is to predict the future
value of a parameter based on its past data.

Classification of Variation
Traditionally, time-series analysis divides the variation into three major
components, namely, trends, seasonal variations, and other cyclic
changes. The variation that remains is attributed to “irregular” fluctuations
or error term. This approach is particularly valuable when the variation is
mostly comprised of trends and seasonality.

Analyzing a Series Containing a Trend

A trend is a change in the mean level that is long-term in nature. For
example, if you have a series like 2, 4, 6, 8 and someone asks you for the
next value, the obvious answer is 10. You can justify your answer by fitting
a line to the data using the simple least square estimation or any other
regression method. A trend can also be nonlinear. Figure 6-1 shows an
example of a time series with trends.

© Sayan Mukhopadhyay, Pratip Samanta 2023 161

S. Mukhopadhyay and P. Samanta, Advanced Data Analytics Using Python,
[Link]
Chapter 6 Time Series

Figure 6-1. A time series with trends

The simplest type of time series is the familiar “linear trend plus noise”
for which the observation at time t is a random variable Xt, as follows:

Xt t t

Here, α,β are constants, and εt denotes a random error term with
a mean of 0. The average level at time t is given by mt= (α + βt). This is
sometimes called the trend term.

162
Chapter 6 Time Series

C
urve Fitting
Fitting a simple function of time such as a polynomial curve (linear,
quadratic, etc.), a Gompertz curve, or a logistic curve is a well-known
method of dealing with nonseasonal data that contains a trend,
particularly yearly data. The global linear trend is the simplest type of
polynomial curve. The Gompertz curve can be written in the following
format, where α, β, and γ are parameters with 0 < r < 1:

X t exp exp t

This looks quite different but is actually equivalent, provided t > 0.

The logistic curve is as follows:

X t a / 1 be ct

Both these curves are S-shaped and approach an asymptotic value

as t → ∞, with the Gompertz curve generally converging slower than the
logistic one. Fitting the curves to data may lead to nonlinear simultaneous
equations.
For all curves of this nature, the fitted function provides a measure
of the trend, and the residuals provide an estimate of local fluctuations
where the residuals are the differences between the observations and the
corresponding values of the fitted curve.

Removing Trends from a Time Series

Differentiating a given time series until it becomes stationary is a special
type of filtering that is particularly useful for removing a trend. You will
see that this is an integral part of the Box-Jenkins procedure. For data
with a linear trend, a first-order differencing is usually enough to remove
the trend.

163
Chapter 6 Time Series

Mathematically, it looks like this:

yt a t c
y t 1 a t 1 c
z t y t 1 y t a c; no trend present in z t

A trend can be exponential as well. In this case, you will have to do a

logarithmic transformation to convert the trend from exponential to linear.
Mathematically, it looks like this:

y t a exp t
z t log y t t log a ; z t is a linear function of t

Analyzing a Series Containing Seasonality

Many time series, such as airline passenger loads or weather readings,
display variations that repeat after a specific time period. For instance, in
India, there will always be an increase in airline passenger loads during
the holiday of Diwali. This yearly variation is easy to understand and can
be estimated if seasonality is of direct interest. Similarly, like trends, if you
have a series such as 1, 2, 1, 2, 1, 2, your obvious choices for the next values
of the series will be 1 and 2.
The Holt-Winters model is a popular model to realize time series with
seasonality and is also known as exponential smoothing. The Holt-Winters
model has two variations: additive and multiplicative. In the additive model with
a single exponential smoothing time series, seasonality is realized as follows:

X t 1 Xt 1 St 1

In this model, every point is realized as a weighted average of the

previous point and seasonality. So, X(t+1) will be calculated as a function
X(t-1) and S(t-2) and square of α. In this way, the more you go on,

164
Chapter 6 Time Series

the α value increases exponentially. This is why it is known as exponential

smoothing. The starting value of St is crucial in this method. Commonly,
this value starts with a 1 or with an average of the first four observations.
The multiplicative seasonal model time series is as follows:

X t 1 b1 b 2 t St noise,

Here, b1, often referred to as the permanent component, is the initial

weight of the seasonality; b2 represents the trend, which is linear in this case.
However, there is no standard implementation of the Holt-Winters
model in Python. It is available in R (see Chapter 1 for how R’s Holt-
Winters model can be called from Python code).

Removing Seasonality from a Time Series

There are two ways of removing seasonality from a time series.

• By filtering

• By differencing

B
y Filtering
The series {xt} is converted into another called {yt} with the linear operation
shown here, where {ar} is a set of weights:

Yt r q a r X t r
s

To smooth out local fluctuations and estimate the local mean, you
should clearly choose the weights so that ∑ar = 1; then the operation is
often referred to as a moving average. They are often symmetric with
s = q and aj = a-j. The simplest example of a symmetric smoothing filter is
the simple moving average, for which ar = 1 / (2q+1) for r = -q, …, + q.

165
Chapter 6 Time Series

The smoothed value of xt is given by the following:

1
Sm x t
q
Xtr
2q 1 r= q

The simple moving average is useful for removing seasonal variations,

but it is unable to deal well with trends.

B
y Differencing
Differencing is widely used and often works well. Seasonal differencing
removes seasonal variation.
Mathematically, if time series y(t) contains additive seasonality S(t)
with time period T, then:

y t a S t b t c
y t T aS t T b t T c
z t y t T y t b T noise term

Similar to trends, you can convert the multiplicative seasonality to

additive by log transformation.
Now, finding time period T in a time series is the critical part. It can be
done in two ways, either by using an autocorrelation function in the time
domain or by using the Fourier transform in the frequency domain. In both
cases, you will see a spike in the plot. For autocorrelation, the plot spike will
be at lag T, whereas for FT distribution, the spike will be at frequency 1/T.

T ransformation
Up to now I have discussed the various kinds of transformation in a time
series. The three main reasons for making a transformation are covered in
the next sections.

166
Chapter 6 Time Series

To Stabilize the Variance

The standard way to do this is to take a logarithmic transformation of the
series; it brings closer the points in space that are widely scattered.

To Make the Seasonal Effect Additive

If the series has a trend and the volume of the seasonal effect appears to
be on the rise with the mean, then it may be advisable to modify the data
so as to make the seasonal effect constant from year to year. This seasonal
effect is said to be additive. However, if the volume of the seasonal effect
is directly proportional to the mean, then the seasonal effect is said to
be multiplicative, and a logarithmic transformation is needed to make it
additive again.

To Make the Data Distribution Normal

In most probability models, it is assumed that distribution of data is
Gaussian or normal. For example, there can be evidence of skewness in a
trend that causes “spikes” in the time plot that are all in the same direction.
To transform the data in a normal distribution, the most common
transform is to subtract the mean and then divide by the standard
deviation. I gave an example of this transformation in the RNN example in
Chapter 5; I’ll give another in the final example of the current chapter. The
logic behind this transformation is it makes the mean 0 and the standard
deviation 1, which is a characteristic of a normal distribution. Another
popular transformation is to use the logarithm. The major advantage of a
logarithm is it reduces the variation and logarithm of Gaussian distribution
data that is also Gaussian. Transformation may be problem-specific or
domain-specific. For instance, in a time series of an airline’s passenger
load data, the series can be normalized by dividing by the number of days
in the month or by the number of holidays in a month.

167
Chapter 6 Time Series

C
yclic Variation
In some time series, seasonality is not a constant but a stochastic variable.
That is known as cyclic variation. In this case, the periodicity first has to
be predicted and then has to be removed in the same way as done for
seasonal variation.

I rregular Fluctuations
A time series without trends and cyclic variations can be realized as a
weekly stationary time series. In the next section, you will examine various
probabilistic models to realize weekly time series.

Stationary Time Series

Normally, a time series is said to be stationary if there is no systematic
change in mean and variance and if strictly periodic variations have
been done away with. In real life, there are no stationary time series.
Whatever data you receive by using transformations, you may try to make
it somehow nearer to a stationary series.

S
tationary Process
A time series is strictly stationary if the joint distribution of X(t1),...,X(tk) is
the same as the joint distribution of X(t1 + τ),...,X(tk + τ) for all t1,…,tk,τ.
If k=1, strict stationary implies that the distribution of X(t) is the same for
all t, so provided the first two moments are finite, you have the following:

2 t 2

168
Chapter 6 Time Series

They are both constants, which do not depend on the value of t.

A weekly stationary time series is a stochastic process where the mean
is constant and autocovariance is a function of time lag.

A
utocorrelation and the Correlogram
Quantities called sample autocorrelation coefficients act as an important
guide to the properties of a time series. They evaluate the correlation,
if any, between observations at different distances apart and provide
valuable descriptive information. You will see that they are also an
important tool in model building and often provide valuable clues for
a suitable probability model for a given set of data. The quantity lies in
the range [-1,1] and measures the forcefulness of the linear association
between the two variables. It can be easily shown that the value does
not depend on the units in which the two variables are measured; if the
variables are independent, then the ideal correlation is zero.
A helpful supplement in interpreting a set of autocorrelation
coefficients is a graph called a correlogram. The correlogram may be
alternatively called the sample autocorrelation function.
Suppose a stationary stochastic process X(t) has a mean μ, variance
σ , auto covariance function (acv.f.) γ(t), and auto correlation function
2

(ac.f.) ρ(τ).

/ 2
0

169
Chapter 6 Time Series

E stimating Autocovariance and

Autocorrelation Functions
In the stochastic process, the autocovariance is the covariance of the
process with itself at pairs of time points. Autocovariance is calculated as
follows:

n h
h
1
x x
n t 1 t h
x x , n h n
t

Figure 6-2 shows a sample autocorrelation distribution.

Figure 6-2. Sample autocorrelations

170
Chapter 6 Time Series

Time-Series Analysis with Python

A complement to SciPy for statistical computations including descriptive
statistics and estimation of statistical models is provided by Statsmodels,
which is a Python package. Besides the early models, linear regression,
robust linear models, generalized linear models, and models for discrete
data, the latest release of [Link] includes some basic tools and
models for time-series analysis, such as descriptive statistics, statistical
tests, and several linear model classes. The linear model classes include
autoregressive (AR), autoregressive moving-average (ARMA), and vector
autoregressive models (VAR).

U
seful Methods
Let’s start with a moving average.

Moving Average Process

Suppose that {Zt} is a purely random process with mean 0 and variance σz2.
Then a process {Xt} is said to be a moving average process of order q.

X t 0 Z t 1Z t 1 q Z t q

Here, {βi} are constants. The Zs are usually scaled so that β0 = 1.

E Xt 0
q
Var X t Z2 i2
i 0

171
Chapter 6 Time Series

The Zs are independent.

k Cov X t ,X t k
Cov 0 Z t q Z t q , 0 Z t k q Z t k q
0 k q
q k

Z2 i i k k 0,1,, q
i 0
k k 0

2 st
Cov Z s , Z t Z
0 st

As γ(k) is not dependent on t and the mean is constant, the process is

second-order stationary for all values of {βi}.

1 k 0
qk q
/ 2

k i 0
i ik i
i 0
k 1,, q
0 k q

k k 0

Fitting Moving Average Process

The moving-average (MA) model is a well-known approach for realizing a
single-variable weekly stationary time series (see Figure 6-3). The moving-
average model specifies that the output variable is linearly dependent on
its own previous error terms as well as on a stochastic term. The AR model
is called the moving-average model, which is a special case and a key
component of the ARMA and ARIMA models of time series.

172
Chapter 6 Time Series

p q b
X t t i X t i i t i i dt i
i 1 i 1 i 1

Figure 6-3. Example of moving average

Here’s the example code for a moving-average model:

Please install numpy by following command:

pip install numpy

import numpy as np
def moving_average(x, w):
return [Link](x, [Link](w), 'valid') / w

data = [Link]([10,5,8,9,15,22,26,11,15,16,18,7])
print(moving_average(data,4))

A
utoregressive Processes
Suppose {Zt} is a purely random process with mean 0 and variance σz2.
After that, a process {Xt} is said to be of autoregressive process of order p if
you have this:

173
Chapter 6 Time Series

xt 1 xt 1 p xt p zt , or
p
xt i xi 1 zt
i 1

The autocovariance function is given by the following:

k z2
k , k 0,1, 2, hence
1 2

k
k k
0

Figure 6-4 shows a time series and its autocorrelation plot of the
AR model.

Figure 6-4. A time series and AR model

174
Chapter 6 Time Series

Estimating Parameters of an AR Process

A process is called weakly stationary if its mean is constant and the
autocovariance function depends only on time lag. There is no weakly
stationary process, but it is imposed on time-series data to do some
stochastic analysis. Suppose Z(t) is a weak stationary process with mean 0
and constant variance. Then X(t) is an autoregressive process of order p if
you have the following:
X t a1 x X t -1 a 2 x X t-2 ap x X t - p Z t , where a R and p I I

Now, E[X(t)] is the expected value of X(t).

Covariance X t ,X t h E X t - E X t X t h - E X t h

E X t - m X t h - m

If X(t) is a weak stationary process, then:

E X t E X t h m constant
E X t X t h m 2 c h

Here, m is constant, and cov[X(t),X(t+h)] is the function of only h(c(h))

for the weakly stationary process. c(h) is known as autocovariance.
Similarly, the correlation (X(t),X(t+h) = ρ(h) = r(h) = c(h) = c(0) is
known as autocorrelation.
If X(t) is a stationary process that is realized as an autoregressive
model, then:

X t a1 X t 1 a 2 X t 2 ap X t p Z t

175
Chapter 6 Time Series

Correlation(X(t),X(t)) = a1 * correlation (X(t),X(t-1)) + …. + ap *

correlation (X(t),X(t-p))+0
As covariance, (X(t),X(t+h)) is dependent only on h, so:

r 0 a1 r1 a 2 r 2 ap rp

r1 a1 r 0 a 2 r1 . ap r p-1

So, for an n-order model, you can easily generate the n equation and
from there find the n coefficient by solving the n equation system.
In this case, realize the data sets only in the first-order and second-
order autoregressive model and choose the model whose mean of residual
is less. For that, the reduced formulae are as follows:
• First order: a1 = r1

• Second order: a1 r1 1 r2 1 r12 , a 2 r2 r12 1 r12

Here is some example code for an autoregressive model:

Please download the required CSC file from [Link]

pratips/book-Advanced-Data-Analytics-Using-Python and install
the Python libraries using the pip command.

from pandas import read_csv

from matplotlib import pyplot
from [Link].ar_model import AutoReg
from [Link] import mean_squared_error
from math import sqrt

# load dataset
series = read_csv('[Link]', header=0, index_col=0,
parse_dates=True, squeeze=True)

176
Chapter 6 Time Series

# split dataset
X = [Link]
train, test = X[1:len(X)-7], X[len(X)-7:]

# train autoregression
model = AutoReg(train, lags=29)
model_fit = [Link]()
print('Coefficients: %s' % model_fit.params)

# make predictions
predictions = model_fit.predict(start=len(train),
end=len(train)+len(test)-1, dynamic=False)
for i in range(len(predictions)):
print('predicted=%f, expected=%f' % (predictions[i], test[i]))
rmse = sqrt(mean_squared_error(test, predictions))
print('Test RMSE: %.3f' % rmse)

# plot results
[Link](test)
[Link](predictions, color='red')
[Link]()

Mixed ARMA Models

Mixed ARMA models are a combination of MA and AR processes. A mixed
autoregressive/moving average process containing p AR terms and q
MA terms is said to be an ARMA process of order (p,q). It is given by the
following:

X t 1 X t 1 p X t p Z t 1Z t 1 q Z t q

The following example code was taken from the stat model site to
realize time-series data as an ARMA model.

177
Chapter 6 Time Series

Please install matplotlib, statsmodels, and numpy using the pip

command before running the code.

import [Link] as plt

import numpy as np
from [Link].arima_process import ArmaProcess
import [Link] as sm

[Link](1234)
data = [Link]([1, .85, -.43, -.63, .8])
parameter = [Link]([1, .41])
model = ArmaProcess(data, parameter)
fig = [Link](figsize=(12, 8))
ax = fig.add_subplot(111)
[Link](model.generate_sample(nsample=50))
[Link]()

Here is how to estimate parameters of an ARMA model:

1. After specifying the order of a stationary ARMA

process, you need to estimate the parameters.

2. Assume the following:

• The model order (p and q) is known.

• The data has zero mean.

3. If step 2 is not a reasonable assumption, you can

subtract the sample mean Y and fit a 0 mean ARMA
model, as in Ø(B)Xt = θ(B)at where Xt = Yt – Y. Then
use Xt + Y as the model for Yt.

178
Chapter 6 Time Series

Integrated ARMA Models

To fit a stationary model such as the one discussed earlier, it is imperative
to remove nonstationary sources of variation. Differencing is widely used
for econometric data. If Xt is replaced by ∇dXt, then you have a model
capable of describing certain types of nonstationary series.

Yt 1 L X t
d

These are the estimating parameters of an ARIMA model:

• ARIMA models are designated by the level of

autoregression, integration, and moving averages.
• This does not assume any pattern uses an iterative
approach of identifying a model.
• The model “fits” if residuals are generally small,
randomly distributed, and, in general, contain no
useful information.

Here is the example code for an ARIMA model.

Please download the required CSV file from [Link]
pratips/book-Advanced-Data-Analytics-Using-Python and install the
Python libraries using the pip command.

# importing libraries

from pandas import read_csv

from matplotlib import pyplot
from [Link] import ARIMA
from [Link] import mean_squared_error

# readfing csv

179
Chapter 6 Time Series

series = read_csv('[Link]', header=0, index_col=0,

parse_dates=True, squeeze=True)
P = [Link]
size = int(len(P) * 0.9)

#splitting into train test set

train, test = P[0:size], P[size:len(P)]

history = [p for p in train]
predictions = list()
for t in range(len(test)):

# fitting model

model = ARIMA(history, order=(5,1,0))

    model_fit = [Link]()
    output = model_fit.forecast()
    yhat = output[0]
    [Link](yhat)
    obs = test[t]
    [Link](obs)
    print('predicted=%f, expected=%f' % (yhat, obs))
error = mean_squared_error(test, predictions)
print('Test MSE: %.3f' % error)

# plotting results

[Link](test)
[Link](predictions, color='red')
[Link]()

180
Chapter 6 Time Series

The Fourier Transform

The representation of nonperiodic signals by everlasting exponential
signals can be accomplished by a simple limiting process, and I will
illustrate that nonperiodic signals can be expressed as a continuous sum
(integral) of everlasting exponential signals. Say you want to represent the
nonperiodic signal g(t). Realizing any nonperiodic signal as a periodic
signal with an infinite time period, you get the following:

1
g t G e
jt
d
2

T0 / 2

G n lim g p t e jnt dt

T
T0 / 2

Hence:

G g t e- jt dt

G(w) is known as a Fourier transform of g(t).

Here is the relation between autocovariance and the Fourier
transform:

0 X2 dF F
0

181
Chapter 6 Time Series

An Exceptional Scenario

In the airline or hotel domain, the passenger load of month t is less
correlated with data of t-1 or t-2 month, but it is more correlated for t-12
month. For example, the passenger load in the month of Diwali (October)
is more correlated with last year’s Diwali data than with the same year’s
August and September data. Historically, the pick-up model is used to
predict this kind of data. The pick-up model has two variations.
In the additive pick-up model,

X t X t - 1 X t - 12 - X t - 13

In the multiplicative pick-up model,

X t X t - 1 X t - 12 / X t - 13

Studies have shown that for this kind of data the neural network–based
predictor gives more accuracy than the time-series model.
In high-frequency trading in investment banking, time-series models
are too time-consuming to capture the latest pattern of the instrument.
So, they on the fly calculate dX/dt and d2X/dt2, where X is the price of
the instruments. If both are positive, they blindly send an order to buy the
instrument. If both are negative, they blindly sell the instrument if they
have it in their portfolio. But if they have an opposite sign, then they do a
more detailed analysis using the time-series data.
As I stated earlier, there are many scenarios in time-series analysis
where R is a better choice than Python. So, here is an example of time-
series forecasting using R. The beauty of the [Link] model is that it
automatically finds the order, trends, and seasonality of the data and fits
the model. In the forecast, we are printing only the mean value, but the
model provides the upper limit and the lower limit of the prediction in
forecasting.

182
Chapter 6 Time Series

Please install the pmdarima library by using the following command:

pip install pmdarima

Once you install, you can try running the following code based on
pmdarima’s manual. It loads a data set available in the package and then
tries to fit auto arima and predicts as well.

import numpy as np
import pmdarima as pm
from [Link] import load_wineind

# loading dataset
wineind = load_wineind().astype(np.float64)
# fitting a stepwise model:
stepwise_fit = pm.auto_arima(wineind, start_p=1, start_q=1,
max_p=3, max_q=3, m=12, start_P=0, seasonal=True, d=1, D=1,
trace=True, error_action=’ignore’, suppress_warnings=True,
stepwise=True)

stepwise_fit.summary()
predicted_15 = stepwise_fit.predict(n_periods=15)
print(predicted_15)

Missing Data
One important aspect of time series and many other data analysis work is
figuring out how to deal with missing data. In the previous code, you fill in
the missing record with the average value. This is fine when the number of
missing data instances is not very high. But if it is high, then the average of
the highest and lowest values is a better alternative.

183
Chapter 6 Time Series

Summary
Following feature engineering, we’ll go over some basic statistics,
particularly time-series models. One thing to keep in mind is that you may
apply any supervised machine learning model on time-series data if you
transform feature vectors (1xn) to the matrix (kxn), where each k element
of each row is the latest k observation of the first column.

184

Common questions

Autoregressive models estimate parameters by utilizing the data’s autocorrelation structure. An autoregressive process depends on its previous values, and parameter estimation is conducted by solving a set of linear equations that relate current observations to their past values. The parameters are estimated through methods such as the Yule-Walker equations, which involve autocorrelation calculations. The correct lag order is determined by minimizing the residuals' mean, thereby ensuring the model fits the data appropriately .

Differencing a time series is a technique used to transform non-stationary data into stationary data by removing trends and stabilizing seasonality. For data with a linear trend, first-order differencing is typically sufficient to eliminate the trend . Seasonal differencing is used specifically to remove seasonal fluctuations from the time series . Without these transformations, the data might contain trends or seasonal variations that could distort analysis, thus differencing is fundamental in the Box-Jenkins procedure to make the data appropriate for such models .

The Holt-Winters model caters to time series with seasonality, using exponential smoothing to capture patterns. It has two variations: the additive model and the multiplicative model. In the additive model, seasonality is realized by adding seasonal components to the trend and cyclical components, which makes it suitable when seasonal variations remain constant over time . The multiplicative model, on the other hand, scales the seasonal components by the trend level, making it suitable for proportional seasonal variations that change with the size of the series .

Weak stationarity, or second-order stationarity, implies that a time series has a constant mean and a time-dependent autocovariance function but does not demand the joint distribution of data to be invariant over time. In contrast, strict stationarity requires that the entire joint distribution of the data is time-invariant . Weak stationarity is crucial in time series analysis because many statistical methods, such as autoregressive models, rely on it. Although real-world time series rarely exhibit exact weak stationarity, practitioners often impose it on data for analytical purposes due to its practical viability compared to stringent strict stationarity conditions .

To stabilize variance in a time series, a logarithmic transformation is commonly used. This transformation is necessary as it brings scattered data points closer in space, reducing variance and helping to manage skewness in the data, which could otherwise introduce spikes in trends . Stabilizing variance is crucial for ensuring that the resulting data adheres more closely to the assumptions of stationary models, thereby improving the reliability of statistical analysis and forecasting .

The Fourier Transform assists in deciphering nonperiodic signals in a time series by transforming signals into their frequency components. It represents nonperiodic signals as a continuous sum, enabling the identification of underlying frequencies that constitute the signal. This transformation facilitates the analysis of the frequency domain of the data, which is particularly useful in detecting and analyzing cyclic patterns within nonperiodic time series, thus providing deeper insight into the data’s structural properties beyond the time domain .

The autocorrelation function is significant in identifying the periodicity of a time series as it measures how observations at different times relate to each other. By plotting the autocorrelation function, one can observe spikes at certain lags, indicating repeating patterns or periodicity in the time series. This is crucial in identifying the time period T in a seasonal series, essential for further processes like seasonal differencing to be accurately applied . The autocorrelation plot provides a clear visualization of such periodicity or seasonality measurement .

ARIMA models handle nonstationary time series data by incorporating differencing techniques to stabilize the mean, thus achieving stationarity. Differentiation is a pivotal step in converting a nonstationary series into a stationary one, as it removes trends and periodic variations. Within ARIMA, the integration component (the 'I') refers to the level of differencing applied to the data, whereas the AR (autoregressive) and MA (moving average) components model the dependencies among observations . This allows ARIMA models to predict and analyze time series data effectively once stationarity is achieved .

The primary advantages of using a logarithmic transformation in time series analysis include the reduction of variance and stabilization of data scale, making disparate data more uniform. This transformation helps in dampening the effect of outliers and can convert a multiplicative seasonality into additive, making the data compatible with linear analysis techniques. Moreover, it aids in converting non-Gaussian data into a form more suitable for probabilistic models that assume a normal distribution .

Cyclic variation differs from typical seasonality in that it is stochastic and not necessarily tied to fixed periods. While seasonality involves consistent, predictable changes at the same time every cycle (such as annually), cyclic variation can vary in duration and may not be exactly periodic. It is often influenced by broader economic or environmental factors with more complex, less predictable patterns. Accurately predicting the periodicity of cyclic variations is crucial for adequately removing them from time series, akin to seasonal adjustments, but this prediction is inherently more challenging due to its variable nature .

Unit 5 - Time Series Models
No ratings yet
Unit 5 - Time Series Models
30 pages
Time Series Analysis and Forecasting Techniques
No ratings yet
Time Series Analysis and Forecasting Techniques
36 pages
Time Series Analysis and Forecasting
No ratings yet
Time Series Analysis and Forecasting
69 pages
Deep Temporal Models for Time Series Analysis
No ratings yet
Deep Temporal Models for Time Series Analysis
84 pages
Analyzing Time Trends in Data
No ratings yet
Analyzing Time Trends in Data
14 pages
Time Series Analysis Overview
No ratings yet
Time Series Analysis Overview
19 pages
Introduction to Time Series Analysis
No ratings yet
Introduction to Time Series Analysis
26 pages
Introduction to Time Series Analysis
No ratings yet
Introduction to Time Series Analysis
68 pages
MBAS 921-Time Series Analysis (2) - With Trend and Seasonality
No ratings yet
MBAS 921-Time Series Analysis (2) - With Trend and Seasonality
26 pages
Understanding Time Series Models
No ratings yet
Understanding Time Series Models
22 pages
Time Series Analysis and Forecasting Guide
No ratings yet
Time Series Analysis and Forecasting Guide
17 pages
Time Series Analysis and Forecasting Guide
No ratings yet
Time Series Analysis and Forecasting Guide
86 pages
TIMES SERIES DATA - Notes
No ratings yet
TIMES SERIES DATA - Notes
5 pages
Time Series Analysis and Forecasting Guide
No ratings yet
Time Series Analysis and Forecasting Guide
46 pages
Lecture 2 DSEB
No ratings yet
Lecture 2 DSEB
30 pages
Time Series Analysis and Forecasting
No ratings yet
Time Series Analysis and Forecasting
14 pages
Time Series Analysis Techniques
No ratings yet
Time Series Analysis Techniques
5 pages
Seasonal Time Series Analysis Techniques
No ratings yet
Seasonal Time Series Analysis Techniques
8 pages
Time Series Forecasting Overview
100% (1)
Time Series Forecasting Overview
61 pages
Time Series Analysis and Forecasting
No ratings yet
Time Series Analysis and Forecasting
26 pages
Time Series Analysis: Components Explained
No ratings yet
Time Series Analysis: Components Explained
58 pages
Time Series Forecasting Techniques
No ratings yet
Time Series Forecasting Techniques
36 pages
Time Series Modeling and Analysis Guide
No ratings yet
Time Series Modeling and Analysis Guide
98 pages
Time Series Analysis Techniques Explained
No ratings yet
Time Series Analysis Techniques Explained
11 pages
Understanding Cyclical Trends in Time Series
No ratings yet
Understanding Cyclical Trends in Time Series
20 pages
Introduction to Time Series Analysis
No ratings yet
Introduction to Time Series Analysis
53 pages
Time Series Forecasting Techniques
No ratings yet
Time Series Forecasting Techniques
41 pages
Time Series Analysis Techniques Explained
No ratings yet
Time Series Analysis Techniques Explained
45 pages
Time Series Analysis and Forecasting Techniques
No ratings yet
Time Series Analysis and Forecasting Techniques
65 pages
Time Series Analysis: Trends & Models
100% (1)
Time Series Analysis: Trends & Models
48 pages
Time Series Forecasting Tutorial Part-1
No ratings yet
Time Series Forecasting Tutorial Part-1
10 pages
Time Series Analysis Overview
No ratings yet
Time Series Analysis Overview
67 pages
Time Series Analysis Course Overview
No ratings yet
Time Series Analysis Course Overview
146 pages
Time Series Analysis
No ratings yet
Time Series Analysis
11 pages
Time Series Analysis Fundamentals
No ratings yet
Time Series Analysis Fundamentals
189 pages
Introduction to Time Series Analysis
No ratings yet
Introduction to Time Series Analysis
6 pages
Time Series Analysis: Key Concepts & Models
No ratings yet
Time Series Analysis: Key Concepts & Models
39 pages
Time Series Analysis Overview Guide
No ratings yet
Time Series Analysis Overview Guide
65 pages
Understanding Time Series Data Analysis
No ratings yet
Understanding Time Series Data Analysis
81 pages
Time Series Analysis of Air Quality Data
100% (1)
Time Series Analysis of Air Quality Data
7 pages
Time Series Analysis Overview
No ratings yet
Time Series Analysis Overview
11 pages
ARMA Time Series Forecasting Guide
No ratings yet
ARMA Time Series Forecasting Guide
49 pages
Time Series Analysis and Forecasting Techniques
No ratings yet
Time Series Analysis and Forecasting Techniques
57 pages
Week 12
No ratings yet
Week 12
73 pages
Time Series Analysis and Trends
No ratings yet
Time Series Analysis and Trends
82 pages
Time Series Analysis Course Overview
No ratings yet
Time Series Analysis Course Overview
26 pages
ML TimeSeries Complete Notes
No ratings yet
ML TimeSeries Complete Notes
28 pages
TS Week 1
No ratings yet
TS Week 1
25 pages
CH 2 Part 2
No ratings yet
CH 2 Part 2
51 pages
ATSA Script v160527
No ratings yet
ATSA Script v160527
194 pages
Time Series Analysis in Industrial Engineering
No ratings yet
Time Series Analysis in Industrial Engineering
21 pages
Time Series Modeling in Veterinary Science
No ratings yet
Time Series Modeling in Veterinary Science
18 pages
Time Series Analysis and Forecasting Guide
No ratings yet
Time Series Analysis and Forecasting Guide
30 pages
Time Series Analysis Techniques Explained
No ratings yet
Time Series Analysis Techniques Explained
74 pages
Understanding Time Series Analysis
No ratings yet
Understanding Time Series Analysis
28 pages
Rajasthan Budget Analysis 2025-26
No ratings yet
Rajasthan Budget Analysis 2025-26
7 pages
Reinforcement Details for Water Tank
No ratings yet
Reinforcement Details for Water Tank
1 page
Grade 12 Economics: International Trade Notes
No ratings yet
Grade 12 Economics: International Trade Notes
16 pages
Storey Enclosure Estimation Method Explained
No ratings yet
Storey Enclosure Estimation Method Explained
3 pages
Unmanned Vehicle Gasket Specifications
No ratings yet
Unmanned Vehicle Gasket Specifications
2 pages
CMA Inter Group 2 Study Material Guide
No ratings yet
CMA Inter Group 2 Study Material Guide
139 pages
Tax Invoice for Geddit Convenience Pvt. Ltd.
No ratings yet
Tax Invoice for Geddit Convenience Pvt. Ltd.
3 pages
China's Economic Growth Strategies Explained
No ratings yet
China's Economic Growth Strategies Explained
6 pages
Post-Keynesian Economics Overview
100% (10)
Post-Keynesian Economics Overview
14 pages
India Sugar Sector Update 2018
No ratings yet
India Sugar Sector Update 2018
37 pages
Business Rescue Proceedings Overview
No ratings yet
Business Rescue Proceedings Overview
6 pages
UNEB Chemistry Paper 1 2024 Guide
67% (3)
UNEB Chemistry Paper 1 2024 Guide
4 pages
Karnataka Bus Fare Invoice Details
No ratings yet
Karnataka Bus Fare Invoice Details
1 page
International Flow of Funds Overview
No ratings yet
International Flow of Funds Overview
28 pages
Error Identification in English Sentences
No ratings yet
Error Identification in English Sentences
8 pages
Economics Assignment: Statistics Overview
No ratings yet
Economics Assignment: Statistics Overview
38 pages
Grade 8 Mathematics Assignment: Comparing Quantities
No ratings yet
Grade 8 Mathematics Assignment: Comparing Quantities
11 pages
Understanding Economic Growth Basics
No ratings yet
Understanding Economic Growth Basics
23 pages
Insights on Quantitative Trading Strategies
No ratings yet
Insights on Quantitative Trading Strategies
23 pages
LEED v5 Renewable Energy Credit Guide
No ratings yet
LEED v5 Renewable Energy Credit Guide
27 pages
SAP Variance Analysis in Co-Product Costing
No ratings yet
SAP Variance Analysis in Co-Product Costing
12 pages
The Harris - Todaro Model of Labour Migration and Its Implication in Commercial Policy Implication
No ratings yet
The Harris - Todaro Model of Labour Migration and Its Implication in Commercial Policy Implication
101 pages
ACCC 122 2022 Mock Exam Solutions
No ratings yet
ACCC 122 2022 Mock Exam Solutions
8 pages
M Series Cylinders
No ratings yet
M Series Cylinders
61 pages
TOEIC Vocabulary by Category PDF
No ratings yet
TOEIC Vocabulary by Category PDF
12 pages
Sick Sdn. Bhd. Industrial Building Plans
No ratings yet
Sick Sdn. Bhd. Industrial Building Plans
1 page
Rural Tourism Development Bid Evaluation
No ratings yet
Rural Tourism Development Bid Evaluation
7 pages
Sonali Bank RTGS Application Form
76% (17)
Sonali Bank RTGS Application Form
2 pages
Shafira Hotel Yogyakarta Booking Confirmation
No ratings yet
Shafira Hotel Yogyakarta Booking Confirmation
1 page
Pengaruh Iklan pada Pembelian Floridina
No ratings yet
Pengaruh Iklan pada Pembelian Floridina
9 pages

Time Series Analysis Techniques

Uploaded by

Time Series Analysis Techniques

Uploaded by

CHAPTER 6

Analyzing a Series Containing a Trend

© Sayan Mukhopadhyay, Pratip Samanta 2023 161

Figure 6-1. A time series with trends

This looks quite different but is actually equivalent, provided t > 0.

Both these curves are S-shaped and approach an asymptotic value

Removing Trends from a Time Series

Mathematically, it looks like this:

A trend can be exponential as well. In this case, you will have to do a

Analyzing a Series Containing Seasonality

In this model, every point is realized as a weighted average of the

the α value increases exponentially. This is why it is known as exponential

Here, b1, often referred to as the permanent component, is the initial

Removing Seasonality from a Time Series

The smoothed value of xt is given by the following:

The simple moving average is useful for removing seasonal variations,

Similar to trends, you can convert the multiplicative seasonality to

To Stabilize the Variance

To Make the Seasonal Effect Additive

To Make the Data Distribution Normal

Stationary Time Series

They are both constants, which do not depend on the value of t.

E stimating Autocovariance and

Figure 6-2 shows a sample autocorrelation distribution.

Figure 6-2. Sample autocorrelations

Time-Series Analysis with Python

Moving Average Process

Here, {βi} are constants. The Zs are usually scaled so that β0 = 1.

The Zs are independent.

As γ(k) is not dependent on t and the mean is constant, the process is

Fitting Moving Average Process

Figure 6-3. Example of moving average

Here’s the example code for a moving-average model:

Please install numpy by following command:

The autocovariance function is given by the following:

Figure 6-4. A time series and AR model

Estimating Parameters of an AR Process

Now, E[X(t)] is the expected value of X(t).

If X(t) is a weak stationary process, then:

Here, m is constant, and cov[X(t),X(t+h)] is the function of only h(c(h))

Correlation(X(t),X(t)) = a1 * correlation (X(t),X(t-1)) + …. + ap *

• Second order: a1  r1 1  r2   1  r12  , a 2   r2  r12   1  r12 

Here is some example code for an autoregressive model:

Please download the required CSC file from [Link]

from pandas import read_csv

Mixed ARMA Models

Please install matplotlib, statsmodels, and numpy using the pip

import [Link] as plt

Here is how to estimate parameters of an ARMA model:

1. After specifying the order of a stationary ARMA

2. Assume the following:

• The data has zero mean.

3. If step 2 is not a reasonable assumption, you can

Integrated ARMA Models

These are the estimating parameters of an ARIMA model:

• ARIMA models are designated by the level of

Here is the example code for an ARIMA model.

from pandas import read_csv

series = read_csv('[Link]', header=0, index_col=0,

#splitting into train test set

train, test = P[0:size], P[size:len(P)]

# fitting model

model = ARIMA(history, order=(5,1,0))

The Fourier Transform

G  n   lim  g p  t  e  jnt dt

G(w) is known as a Fourier transform of g(t).

An Exceptional Scenario

In the multiplicative pick-up model,

Please install the pmdarima library by using the following command:

pip install pmdarima

Common questions

In what manner do autoregressive models achieve parameter estimation for a given time series?

In what manner do autoregressive models achieve parameter estimation for a given time series?

What is the purpose of differencing a time series in the context of removing trends and stabilizing seasonality?

What is the purpose of differencing a time series in the context of removing trends and stabilizing seasonality?

How does the Holt-Winters model differentiate between additive and multiplicative seasonality?

Analyzing a Series Containing a Trend

Removing Trends from a Time Series

Analyzing a Series Containing Seasonality

Removing Seasonality from a Time Series

To Stabilize the Variance

To Make the Seasonal Effect Additive

To Make the Data Distribution Normal

Stationary Time Series

E stimating Autocovariance and

Time-Series Analysis with Python

Moving Average Process

Fitting Moving Average Process

Estimating Parameters of an AR Process

• Second order: a1 r1 1 r2 1 r12 , a 2 r2 r12 1 r12

Mixed ARMA Models

Integrated ARMA Models

The Fourier Transform

G n lim g p t e jnt dt

An Exceptional Scenario