0% found this document useful (0 votes)

24 views44 pages

ARIMA Model for Time Series Forecasting

The document provides a comprehensive guide on using ARIMA models for time series forecasting in Python, detailing the steps to build an optimal ARIMA model, including the identification of parameters p, d, and q. It covers the concepts of univariate and multivariate forecasting, the importance of making a time series stationary, and techniques for determining the order of differencing, AR terms, and MA terms. Additionally, it discusses the implementation of ARIMA, SARIMA, and SARIMAX models, along with practical exercises and accuracy metrics.

Uploaded by

jigsan5

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views44 pages

ARIMA Model for Time Series Forecasting

Uploaded by

jigsan5

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

ARIMA Model – Complete

Guide to Time Series

Forecasting in Python
by Selva Prabhakaran |Posted on

FacebookTwitterWhatsAppLinkedInRedditGoogle BookmarksShare

Using ARIMA model, you can forecast a time series using the
series past values. In this post, we build an optimal ARIMA
model from scratch and extend it to Seasonal ARIMA (SARIMA)
and SARIMAX models. You will also see how to build autoarima
models in python

ARIMA Model – Time Series Forecasting. Photo by Cerquiera

Contents
1. Introduction to Time Series Forecasting
2. Introduction to ARIMA Models
3. What does the p, d and q in ARIMA model mean?
4. What are AR and MA models
5. How to find the order of differencing (d) in ARIMA model
6. How to find the order of the AR term (p)
7. How to find the order of the MA term (q)
8. How to handle if a time series is slightly under or over
differenced
9. How to build the ARIMA Model
10. How to do find the optimal ARIMA model manually using
Out-of-Time Cross validation
11. Accuracy Metrics for Time Series Forecast
12. How to do Auto Arima Forecast in Python
13. How to interpret the residual plots in ARIMA model
14. How to automatically build SARIMA model in python
15. How to build SARIMAX Model with exogenous variable
16. Practice Exercises
17. Conclusion

1. Introduction to Time Series

Forecasting
A time series is a sequence where a metric is recorded over regular time
intervals.

Depending on the frequency, a time series can be of yearly (ex: annual

budget), quarterly (ex: expenses), monthly (ex: air traffic), weekly (ex:
sales qty), daily (ex: weather), hourly (ex: stocks price), minutes (ex:
inbound calls in a call canter) and even seconds wise (ex: web traffic).

We have already seen the steps involved in a previous post on Time

Series Analysis . If you haven’t read it, I highly encourage you to do so.

Forecasting is the next step where you want to predict the future values
the series is going to take.

But why forecast?

Because, forecasting a time series (like demand and sales) is often of

tremendous commercial value.

In most manufacturing companies, it drives the fundamental business

planning, procurement and production activities. Any errors in the
forecasts will ripple down throughout the supply chain or any business
context for that matter. So it’s important to get the forecasts accurate in
order to save on costs and is critical to success.

Not just in manufacturing, the techniques and concepts behind time

series forecasting are applicable in any business.

Now forecasting a time series can be broadly divided into two types.

If you use only the previous values of the time series to predict its future
values, it is called Univariate Time Series Forecasting .

And if you use predictors other than the series (a.k.a exogenous
variables) to forecast it is called Multi Variate Time Series
Forecasting .

This post focuses on a particular type of forecasting method

called ARIMA modeling.

ARIMA, short for ‘AutoRegressive Integrated Moving Average’, is a

forecasting algorithm based on the idea that the information in the past
values of the time series can alone be used to predict the future values.

2. Introduction to ARIMA Models

So what exactly is an ARIMA model?
ARIMA, short for ‘Auto Regressive Integrated Moving Average’ is actually a
class of models that ‘explains’ a given time series based on its own past
values, that is, its own lags and the lagged forecast errors, so that equation
can be used to forecast future values.

Any ‘non-seasonal’ time series that exhibits patterns and is not a random
white noise can be modeled with ARIMA models.

An ARIMA model is characterized by 3 terms: p, d, q

where,

p is the order of the AR term

q is the order of the MA term

d is the number of differencing required to make the time series

stationary

If a time series, has seasonal patterns, then you need to add seasonal
terms and it becomes SARIMA, short for ‘Seasonal ARIMA’. More on that
once we finish ARIMA.

So, what does the ‘order of AR term’ even mean? Before we go there,
let’s first look at the ‘d’ term.

3. What does the p, d and q in ARIMA

model mean
The first step to build an ARIMA model is to make the time series
stationary .

Why?

Because, term ‘Auto Regressive’ in ARIMA means it is a linear regression

model that uses its own lags as predictors. Linear regression models, as
you know, work best when the predictors are not correlated and are
independent of each other.

So how to make a series stationary?

The most common approach is to difference it. That is, subtract the
previous value from the current value. Sometimes, depending on the
complexity of the series, more than one differencing may be needed.

The value of d, therefore, is the minimum number of differencing needed

to make the series stationary. And if the time series is already stationary,
then d = 0.
Next, what are the ‘p’ and ‘q’ terms?

‘p’ is the order of the ‘Auto Regressive’ (AR) term. It refers to the number
of lags of Y to be used as predictors. And ‘q’ is the order of the ‘Moving
Average’ (MA) term. It refers to the number of lagged forecast errors that
should go into the ARIMA Model.

4. What are AR and MA models

So what are AR and MA models? what is the actual mathematical formula
for the AR and MA models?

A pure Auto Regressive (AR only) model is one where Yt depends

only on its own lags. That is, Yt is a function of the ‘lags of Yt’.

where, $Y {t-1}$ is the lag1 of the series, $\beta 1$ is the

coefficient of lag1 that the model estimates and $\alpha$ is the intercept
term, also estimated by the model.

Likewise a pure Moving Average (MA only) model is one where Yt

depends only on the lagged forecast errors.

where the error terms are the errors of the autoregressive models of the
respective lags. The errors Et and E(t-1) are the errors from the following
equations :
[Error: The beta coefficients in the second equation above is incorrect. ]

That was AR and MA models respectively.

So what does the equation of an ARIMA model look like?

An ARIMA model is one where the time series was differenced at least
once to make it stationary and you combine the AR and the MA terms. So
the equation becomes:

ARIMA model in words:

Predicted Yt = Constant + Linear combination Lags of Y (upto

p lags) + Linear Combination of Lagged forecast errors (upto q
lags)

The objective, therefore, is to identify the values of p, d and q. But how?

Let’s start with finding the ‘d’.

5. How to find the order of

differencing (d) in ARIMA model
The purpose of differencing it to make the time series stationary.

But you need to be careful to not over-difference the series. Because, an

over differenced series may still be stationary, which in turn will affect
the model parameters.

So how to determine the right order of differencing?

The right order of differencing is the minimum differencing required to

get a near-stationary series which roams around a defined mean and the
ACF plot reaches to zero fairly quick.

If the autocorrelations are positive for many number of lags (10 or more),
then the series needs further differencing. On the other hand, if the lag 1
autocorrelation itself is too negative, then the series is probably over-
differenced.

In the event, you can’t really decide between two orders of differencing,
then go with the order that gives the least standard deviation in the
differenced series.
Let’s see how to do it with an example.

First, I am going to check if the series is stationary using the Augmented

Dickey Fuller test ( adfuller() ), from the statsmodels package.

Why?

Because, you need differencing only if the series is non-stationary. Else,

no differencing is needed, that is, d=0.

The null hypothesis of the ADF test is that the time series is non-
stationary. So, if the p-value of the test is less than the significance level
(0.05) then you reject the null hypothesis and infer that the time series is
indeed stationary.

So, in our case, if P Value > 0.05 we go ahead with finding the order of
differencing.

from [Link] import adfuller

from numpy import log

result = adfuller([Link]())

print('ADF Statistic: %f' % result[0])

print('p-value: %f' % result[1])

ADF Statistic: -2.464240

p-value: 0.124419

Since P-value is greater than the significance level, let’s difference the
series and see how the autocorrelation plot looks like.

import numpy as np, pandas as pd

from [Link] import plot_acf, plot_pacf

import [Link] as plt

[Link]({'[Link]':(9,7), '[Link]':120})

# Import data
df =
pd.read_csv('[Link]
er/[Link] ', names=['value'], header=0)

# Original Series
fig, axes = [Link](3, 2, sharex=True)
axes[0, 0].plot([Link]); axes[0, 0].set_title('Original Series')
plot_acf([Link], ax=axes[0, 1])

# 1st Differencing
axes[1, 0].plot([Link]()); axes[1, 0].set_title('1st Order
Differencing')
plot_acf([Link]().dropna(), ax=axes[1, 1])

# 2nd Differencing
axes[2, 0].plot([Link]().diff()); axes[2, 0].set_title('2nd
Order Differencing')
plot_acf([Link]().diff().dropna(), ax=axes[2, 1])

[Link]()

Order of Differencing

For the above series, the time series reaches stationarity with two orders
of differencing. But on looking at the autocorrelation plot for the 2nd
differencing the lag goes into the far negative zone fairly quick, which
indicates, the series might have been over differenced.

So, I am going to tentatively fix the order of differencing as 1 even

though the series is not perfectly stationary (weak stationarity).
from [Link] import ndiffs

df =
pd.read_csv('[Link]
er/[Link] ', names=['value'], header=0)
y = [Link]

## Adf Test
ndiffs(y, test='adf') # 2

# KPSS test
ndiffs(y, test='kpss') # 0

# PP test:
ndiffs(y, test='pp') # 2
2 0 2

6. How to find the order of the AR

term (p)
The next step is to identify if the model needs any AR terms. You can find
out the required number of AR terms by inspecting the Partial
Autocorrelation (PACF) plot.

But what is PACF?

Partial autocorrelation can be imagined as the correlation between the

series and its lag, after excluding the contributions from the intermediate
lags. So, PACF sort of conveys the pure correlation between a lag and the
series. That way, you will know if that lag is needed in the AR term or
not.

So what is the formula for PACF mathematically?

Partial autocorrelation of lag (k) of a series is the coefficient of that lag in

the autoregression equation of Y.

$$Y t = \alpha 0 + \alpha 1 Y {t-1} + \alpha 2 Y {t-2} + \alpha 3 Y {t-3}$

$That is, suppose, if Y_t is the current series and Y_t-1 is the lag 1 of Y ,
then the partial autocorrelation of lag 3 ( Y_t-3 ) is the coefficient $\
alpha_3$ of Y_t-3 in the above equation.

Good. Now, how to find the number of AR terms?

Any autocorrelation in a stationarized series can be rectified by adding

enough AR terms. So, we initially take the order of AR term to be equal
to as many lags that crosses the significance limit in the PACF plot.

# PACF plot of 1st differenced series

[Link]({'[Link]':(9,3), '[Link]':120})

fig, axes = [Link](1, 2, sharex=True)

axes[0].plot([Link]()); axes[0].set_title('1st Differencing')

axes[1].set(ylim=(0,5))

plot_pacf([Link]().dropna(), ax=axes[1])

[Link]()

/Library/Frameworks/[Link]/Versions/3.7/lib/python3.7/site-
packages/statsmodels/regression/linear_model.py:1283: RuntimeWarning:
invalid value encountered in sqrt

return rho, [Link](sigmasq)

Order of AR Term

You can observe that the PACF lag 1 is quite significant since is well
above the significance line. Lag 2 turns out to be significant as well,
slightly managing to cross the significance limit (blue region). But I am
going to be conservative and tentatively fix the p as 1.
7. How to find the order of the MA
term (q)
Just like how we looked at the PACF plot for the number of AR terms, you
can look at the ACF plot for the number of MA terms. An MA term is
technically, the error of the lagged forecast.

The ACF tells how many MA terms are required to remove any
autocorrelation in the stationarized series.

Let’s see the autocorrelation plot of the differenced series.

import pandas as pd

from [Link] import plot_acf, plot_pacf

import [Link] as plt

[Link]({'[Link]':(9,3), '[Link]':120})

# Import data

df =
pd.read_csv('[Link]
er/[Link] ')

fig, axes = [Link](1, 2, sharex=True)

axes[0].plot([Link]()); axes[0].set_title('1st Differencing')
axes[1].set(ylim=(0,1.2))
plot_acf([Link]().dropna(), ax=axes[1])

[Link]()
Order of MA Term

Couple of lags are well above the significance line. So, let’s tentatively
fix q as 2. When in doubt, go with the simpler model that sufficiently
explains the Y.

8. How to handle if a time series is

slightly under or over differenced
It may so happen that your series is slightly under differenced, that
differencing it one more time makes it slightly over-differenced.

How to handle this case?

If your series is slightly under differenced, adding one or more additional

AR terms usually makes it up. Likewise, if it is slightly over-differenced,
try adding an additional MA term.

9. How to build the ARIMA Model

Now that you’ve determined the values of p, d and q, you have
everything needed to fit the ARIMA model. Let’s use
the ARIMA() implementation in statsmodels package.

from [Link].arima_model import ARIMA

# 1,1,2 ARIMA Model

model = ARIMA([Link], order=(1,1,2))

model_fit = [Link](disp=0)

print(model_fit.summary())

ARIMA Model Results

=======================================================================
=======

Dep. Variable: [Link] No. Observations:

99
Model: ARIMA(1, 1, 2) Log Likelihood -
253.790

Method: css-mle S.D. of innovations

3.119

Date: Wed, 06 Feb 2019 AIC

517.579

Time: 23:32:56 BIC

530.555

Sample: 1 HQIC
522.829

=======================================================================
==========

coef std err z P>|z| [0.025

0.975]

-----------------------------------------------------------------------
----------

const 1.1202 1.290 0.868 0.387 -1.409

3.649

[Link] 0.6351 0.257 2.469 0.015 0.131

1.139

[Link] 0.5287 0.355 1.489 0.140 -0.167

1.224

[Link] -0.0010 0.321 -0.003 0.998 -0.631

0.629

Roots

=======================================================================
======

Real Imaginary Modulus

Frequency
-----------------------------------------------------------------------
------

AR.1 1.5746 +0.0000j 1.5746

0.0000

MA.1 -1.8850 +0.0000j 1.8850

0.5000

MA.2 545.3515 +0.0000j 545.3515

0.0000

-----------------------------------------------------------------------
------

The model summary reveals a lot of information. The table in the middle
is the coefficients table where the values under ‘coef’ are the weights of
the respective terms.

Notice here the coefficient of the MA2 term is close to zero and the P-
Value in ‘P>|z|’ column is highly insignificant. It should ideally be less
than 0.05 for the respective X to be significant.

So, let’s rebuild the model without the MA2 term.

# 1,1,1 ARIMA Model

model = ARIMA([Link], order=(1,1,1))

model_fit = [Link](disp=0)

print(model_fit.summary())

ARIMA Model Results

=======================================================================
=======

Dep. Variable: [Link] No. Observations:

Model: ARIMA(1, 1, 1) Log Likelihood -

253.790

Method: css-mle S.D. of innovations

3.119
Date: Sat, 09 Feb 2019 AIC
515.579

Time: 12:16:06 BIC

525.960

Sample: 1 HQIC
519.779

=======================================================================
==========

coef std err z P>|z| [0.025

0.975]

-----------------------------------------------------------------------
----------

const 1.1205 1.286 0.871 0.386 -1.400

3.641

[Link] 0.6344 0.087 7.317 0.000 0.464

0.804

[Link] 0.5297 0.089 5.932 0.000 0.355

0.705

Roots

=======================================================================
======

Real Imaginary Modulus

Frequency

-----------------------------------------------------------------------
------

AR.1 1.5764 +0.0000j 1.5764

0.0000

MA.1 -1.8879 +0.0000j 1.8879

0.5000
-----------------------------------------------------------------------
------

The model AIC has reduced, which is good. The P Values of the AR1 and
MA1 terms have improved and are highly significant (<< 0.05).

Let’s plot the residuals to ensure there are no patterns (that is, look for
constant mean and variance).

# Plot residual errors

residuals = [Link](model_fit.resid)

fig, ax = [Link](1,2)

[Link](title="Residuals", ax=ax[0])

[Link](kind='kde', title='Density', ax=ax[1])

[Link]()

Residuals Density

The residual errors seem fine with near zero mean and uniform variance.
Let’s plot the actuals against the fitted values using plot_predict() .

# Actual vs Fitted

model_fit.plot_predict(dynamic=False)

[Link]()
Actual vs Fitted

When you set dynamic=False the in-sample lagged values are used for
prediction.

That is, the model gets trained up until the previous value to make the
next prediction. This can make the fitted forecast and actuals look
artificially good.

So, we seem to have a decent ARIMA model. But is that the best?

Can’t say that at this point because we haven’t actually forecasted into the
future and compared the forecast with the actual performance.

So, the real validation you need now is the Out-of-Time cross-validation.

10. How to do find the optimal ARIMA

model manually using Out-of-Time
Cross validation
In Out-of-Time cross-validation, you take few steps back in time and
forecast into the future to as many steps you took back. Then you
compare the forecast against the actuals.

To do out-of-time cross-validation, you need to create the training and

testing dataset by splitting the time series into 2 contiguous parts in
approximately 75:25 ratio or a reasonable proportion based on time
frequency of series.

Why am I not sampling the training data randomly you ask?

That’s because the order sequence of the time series should be intact in
order to use it for forecasting.
from [Link] import acf

# Create Training and Test

train = [Link][:85]

test = [Link][85:]

You can now build the ARIMA model on training dataset, forecast and plot
it.

# Build Model

# model = ARIMA(train, order=(3,2,1))

model = ARIMA(train, order=(1, 1, 1))

fitted = [Link](disp=-1)

# Forecast

fc, se, conf = [Link](15, alpha=0.05) # 95% conf

# Make as pandas series

fc_series = [Link](fc, index=[Link])

lower_series = [Link](conf[:, 0], index=[Link])

upper_series = [Link](conf[:, 1], index=[Link])

# Plot

[Link](figsize=(12,5), dpi=100)

[Link](train, label='training')
[Link](test, label='actual')

[Link](fc_series, label='forecast')

plt.fill_between(lower_series.index, lower_series, upper_series,

color='k', alpha=.15)

[Link]('Forecast vs Actuals')

[Link](loc='upper left', fontsize=8)

[Link]()

Forecast vs Actuals

From the chart, the ARIMA(1,1,1) model seems to give a directionally

correct forecast. And the actual observed values lie within the 95%
confidence band. That seems fine.

But each of the predicted forecasts is consistently below the actuals.

That means, by adding a small constant to our forecast, the accuracy will
certainly improve. So, there is definitely scope for improvement.
So, what I am going to do is to increase the order of differencing to two,
that is set d=2 and iteratively increase p to up to 5 and then q up to 5 to
see which model gives least AIC and also look for a chart that gives
closer actuals and forecasts.

While doing this, I keep an eye on the P values of the AR and MA terms in
the model summary. They should be as close to zero, ideally, less than
0.05.

# Build Model

model = ARIMA(train, order=(3, 2, 1))

fitted = [Link](disp=-1)

print([Link]())

# Forecast

fc, se, conf = [Link](15, alpha=0.05) # 95% conf

# Make as pandas series

fc_series = [Link](fc, index=[Link])

lower_series = [Link](conf[:, 0], index=[Link])

upper_series = [Link](conf[:, 1], index=[Link])

# Plot

[Link](figsize=(12,5), dpi=100)

[Link](train, label='training')

[Link](test, label='actual')

[Link](fc_series, label='forecast')
plt.fill_between(lower_series.index, lower_series, upper_series,

color='k', alpha=.15)

[Link]('Forecast vs Actuals')

[Link](loc='upper left', fontsize=8)

[Link]()

ARIMA Model Results

=======================================================================
=======

Dep. Variable: [Link] No. Observations:

Model: ARIMA(3, 2, 1) Log Likelihood -

214.248

Method: css-mle S.D. of innovations

3.153

Date: Sat, 09 Feb 2019 AIC

440.497

Time: 12:49:01 BIC

455.010

Sample: 2 HQIC
446.327

=======================================================================
===========

coef std err z P>|z| [0.025

0.975]

-----------------------------------------------------------------------
-----------

const 0.0483 0.084 0.577 0.565 -0.116

0.212
[Link] 1.1386 0.109 10.399 0.000 0.924
1.353

[Link] -0.5923 0.155 -3.827 0.000 -0.896

-0.289

[Link] 0.3079 0.111 2.778 0.007 0.091

0.525

[Link] -1.0000 0.035 -28.799 0.000 -1.068

-0.932

Roots

=======================================================================
======

Real Imaginary Modulus

Frequency

-----------------------------------------------------------------------
------

AR.1 1.1557 -0.0000j 1.1557 -

0.0000

AR.2 0.3839 -1.6318j 1.6763 -

0.2132

AR.3 0.3839 +1.6318j 1.6763

0.2132

MA.1 1.0000 +0.0000j 1.0000

0.0000

-----------------------------------------------------------------------
------
Revised Forecast vs Actuals

The AIC has reduced to 440 from 515. Good. The P-values of the X terms
are less the < 0.05, which is great.

So overall it’s much better.

Ideally, you should go back multiple points in time, like, go back 1, 2, 3

and 4 quarters and see how your forecasts are performing at various
points in the year.

Here’s a great practice exercise: Try to go back 27, 30, 33, 36 data
points and see how the forcasts performs. The forecast performance can
be judged using various accuracy metrics discussed next.

11. Accuracy Metrics for Time Series

Forecast
The commonly used accuracy metrics to judge forecasts are:

1. Mean Absolute Percentage Error (MAPE)

2. Mean Error (ME)
3. Mean Absolute Error (MAE)
4. Mean Percentage Error (MPE)
5. Root Mean Squared Error (RMSE)
6. Lag 1 Autocorrelation of Error (ACF1)
7. Correlation between the Actual and the Forecast (corr)
8. Min-Max Error (minmax)

Typically, if you are comparing forecasts of two different series, the

MAPE, Correlation and Min-Max Error can be used.

Why not use the other metrics?

Because only the above three are percentage errors that vary between 0
and 1. That way, you can judge how good is the forecast irrespective of
the scale of the series.

The other error metrics are quantities. That implies, an RMSE of 100 for a
series whose mean is in 1000’s is better than an RMSE of 5 for series in 10’s.
So, you can’t really use them to compare the forecasts of two different scaled
time series.
# Accuracy metrics

def forecast_accuracy(forecast, actual):

mape = [Link]([Link](forecast - actual)/[Link](actual)) # MAPE

me = [Link](forecast - actual) # ME

mae = [Link]([Link](forecast - actual)) # MAE

mpe = [Link]((forecast - actual)/actual) # MPE

rmse = [Link]((forecast - actual)2).5 # RMSE

corr = [Link](forecast, actual)[0,1] # corr

mins = [Link]([Link]([forecast[:,None],

actual[:,None]]), axis=1)

maxs = [Link]([Link]([forecast[:,None],

actual[:,None]]), axis=1)

minmax = 1 - [Link](mins/maxs) # minmax

acf1 = acf(fc-test)[1] # ACF1

return({'mape':mape, 'me':me, 'mae': mae,

'mpe': mpe, 'rmse':rmse, 'acf1':acf1,

'corr':corr, 'minmax':minmax})

forecast_accuracy(fc, [Link])

#> {'mape': 0.02250131357314834,

#> 'me': 3.230783108990054,

#> 'mae': 4.548322194530069,

#> 'mpe': 0.016421001932706705,

#> 'rmse': 6.373238534601827,

#> 'acf1': 0.5105506325288692,

#> 'corr': 0.9674576513924394,

#> 'minmax': 0.02163154777672227}

Around 2.2% MAPE implies the model is about 97.8% accurate in

predicting the next 15 observations.

Now you know how to build an ARIMA model manually.

But in industrial situations, you will be given a lot of time series to be

forecasted and the forecasting exercise be repeated regularly.

So we need a way to automate the best model selection process.

12. How to do Auto Arima Forecast in

Python
Like R’s popular [Link]() function, the pmdarima package
provides auto_arima() with similar functionality.

auto_arima() uses a stepwise approach to search multiple combinations of

p,d,q parameters and chooses the best model that has the least AIC.
from [Link].arima_model import ARIMA

import pmdarima as pm

df =
pd.read_csv('[Link]
er/[Link] ', names=['value'], header=0)

print([Link]())

#> Fit ARIMA: order=(1, 2, 1); AIC=525.586, BIC=535.926, Fit time=0.060

seconds
#> Fit ARIMA: order=(0, 2, 0); AIC=533.474, BIC=538.644, Fit time=0.005
seconds
#> Fit ARIMA: order=(1, 2, 0); AIC=532.437, BIC=540.192, Fit time=0.035
seconds
#> Fit ARIMA: order=(0, 2, 1); AIC=525.893, BIC=533.648, Fit time=0.040
seconds
#> Fit ARIMA: order=(2, 2, 1); AIC=515.248, BIC=528.173, Fit time=0.105
seconds
#> Fit ARIMA: order=(2, 2, 0); AIC=513.459, BIC=523.798, Fit time=0.063
seconds
#> Fit ARIMA: order=(3, 2, 1); AIC=512.552, BIC=528.062, Fit time=0.272
seconds
#> Fit ARIMA: order=(3, 2, 0); AIC=515.284, BIC=528.209, Fit time=0.042
seconds
#> Fit ARIMA: order=(3, 2, 2); AIC=514.514, BIC=532.609, Fit time=0.234
seconds
#> Total fit time: 0.865 seconds
#> ARIMA Model Results
#>
=======================================================================
=======
#> Dep. Variable: D2.y No. Observations:
98
#> Model: ARIMA(3, 2, 1) Log Likelihood
-250.276
#> Method: css-mle S.D. of innovations
3.069
#> Date: Sat, 09 Feb 2019 AIC
512.552
#> Time: 12:57:22 BIC
528.062
#> Sample: 2 HQIC
518.825
#>
#>
=======================================================================
=======
#> coef std err z P>|z| [0.025
0.975]
#>
-----------------------------------------------------------------------
-------
#> const 0.0234 0.058 0.404 0.687 -0.090
0.137
#> ar.L1.D2.y 1.1586 0.097 11.965 0.000 0.969
1.348
#> ar.L2.D2.y -0.6640 0.136 -4.890 0.000 -0.930
-0.398
#> ar.L3.D2.y 0.3453 0.096 3.588 0.001 0.157
0.534
#> ma.L1.D2.y -1.0000 0.028 -36.302 0.000 -1.054
-0.946
#> Roots
#>
=======================================================================
======
#> Real Imaginary Modulus
Frequency
#>
-----------------------------------------------------------------------
------
#> AR.1 1.1703 -0.0000j 1.1703
-0.0000
#> AR.2 0.3763 -1.5274j 1.5731
-0.2116
#> AR.3 0.3763 +1.5274j 1.5731
0.2116
#> MA.1 1.0000 +0.0000j 1.0000
0.0000
#>
-----------------------------------------------------------------------
------

13. How to interpret the residual

plots in ARIMA model
Let’s review the residual plots using stepwise_fit.

model.plot_diagnostics(figsize=(7,5))
[Link]()

Residuals Chart

So how to interpret the plot diagnostics?

Top left: The residual errors seem to fluctuate around a mean of zero
and have a uniform variance.

Top Right: The density plot suggest normal distribution with mean zero.

Bottom left: All the dots should fall perfectly in line with the red line.
Any significant deviations would imply the distribution is skewed.

Bottom Right: The Correlogram, aka, ACF plot shows the residual
errors are not autocorrelated. Any autocorrelation would imply that there
is some pattern in the residual errors which are not explained in the
model. So you will need to look for more X’s (predictors) to the model.

Overall, it seems to be a good fit. Let’s forecast.

# Forecast

n_periods = 24

fc, confint = [Link](n_periods=n_periods, return_conf_int=True)

index_of_fc = [Link](len([Link]), len([Link])+n_periods)

# make series for plotting purpose

fc_series = [Link](fc, index=index_of_fc)

lower_series = [Link](confint[:, 0], index=index_of_fc)

upper_series = [Link](confint[:, 1], index=index_of_fc)

# Plot

[Link]([Link])

[Link](fc_series, color='darkgreen')

plt.fill_between(lower_series.index,

lower_series,

upper_series,

color='k', alpha=.15)

[Link]("Final Forecast of WWW Usage")

[Link]()
Final Forecast of WWW Usage

14. How to automatically build

SARIMA model in python
The problem with plain ARIMA model is it does not support seasonality.

If your time series has defined seasonality, then, go for SARIMA which
uses seasonal differencing.

Seasonal differencing is similar to regular differencing, but, instead of

subtracting consecutive terms, you subtract the value from previous season.

So, the model will be represented as SARIMA(p,d,q)x(P,D,Q), where, P, D

and Q are SAR, order of seasonal differencing and SMA terms
respectively and 'x' is the frequency of the time series.

If your model has well defined seasonal patterns, then enforce D=1 for a
given frequency ‘x’.

Here’s some practical advice on building SARIMA model:

As a general rule, set the model parameters such that D never exceeds
one. And the total differencing ‘d + D’ never exceeds 2. Try to keep only
either SAR or SMA terms if your model has seasonal components.

Let’s build an SARIMA model on 'a10' – the drug sales dataset.

# Import
data =
pd.read_csv('[Link]
er/[Link] ', parse_dates=['date'], index_col='date')

# Plot
fig, axes = [Link](2, 1, figsize=(10,5), dpi=100, sharex=True)

# Usual Differencing
axes[0].plot(data[:], label='Original Series')
axes[0].plot(data[:].diff(1), label='Usual Differencing')
axes[0].set_title('Usual Differencing')
axes[0].legend(loc='upper left', fontsize=10)

# Seasinal Dei
axes[1].plot(data[:], label='Original Series')
axes[1].plot(data[:].diff(12), label='Seasonal Differencing',
color='green')
axes[1].set_title('Seasonal Differencing')
[Link](loc='upper left', fontsize=10)
[Link]('a10 - Drug Sales', fontsize=16)
[Link]()

Seasonal Differencing

As you can clearly see, the seasonal spikes is intact after applying usual
differencing (lag 1). Whereas, it is rectified after seasonal differencing.
Let’s build the SARIMA model using pmdarima ‘s auto_arima() . To do that,
you need to set seasonal=True , set the frequency m=12 for month wise
series and enforce D=1 .

# !pip3 install pyramid-arima

import pmdarima as pm

# Seasonal - fit stepwise auto-ARIMA

smodel = pm.auto_arima(data, start_p=1, start_q=1,

test='adf',

max_p=3, max_q=3, m=12,

start_P=0, seasonal=True,

d=None, D=1, trace=True,

error_action='ignore',

suppress_warnings=True,

stepwise=True)

[Link]()

Fit ARIMA: order=(1, 0, 1) seasonal_order=(0, 1, 1, 12); AIC=534.818,

BIC=551.105, Fit time=1.742 seconds

Fit ARIMA: order=(0, 0, 0) seasonal_order=(0, 1, 0, 12); AIC=624.061,

BIC=630.576, Fit time=0.028 seconds

Fit ARIMA: order=(1, 0, 0) seasonal_order=(1, 1, 0, 12); AIC=596.004,

BIC=609.034, Fit time=0.683 seconds

Fit ARIMA: order=(0, 0, 1) seasonal_order=(0, 1, 1, 12); AIC=611.475,

BIC=624.505, Fit time=0.709 seconds

Fit ARIMA: order=(1, 0, 1) seasonal_order=(1, 1, 1, 12); AIC=557.501,

BIC=577.046, Fit time=3.687 seconds
(...TRUNCATED...)

Fit ARIMA: order=(3, 0, 0) seasonal_order=(1, 1, 1, 12); AIC=554.570,

BIC=577.372, Fit time=2.431 seconds

Fit ARIMA: order=(3, 0, 0) seasonal_order=(0, 1, 0, 12); AIC=554.094,

BIC=570.381, Fit time=0.220 seconds

Fit ARIMA: order=(3, 0, 0) seasonal_order=(0, 1, 2, 12); AIC=529.502,

BIC=552.305, Fit time=2.120 seconds

Fit ARIMA: order=(3, 0, 0) seasonal_order=(1, 1, 2, 12); AIC=nan,

BIC=nan, Fit time=nan seconds

Total fit time: 31.613 seconds

The model has estimated the AIC and the P values of the coefficients look
significant. Let’s look at the residual diagnostics plot.

The best model SARIMAX(3, 0, 0)x(0, 1, 1, 12) has an AIC of 528.6 and the P
Values are significant.

Let’s forecast for the next 24 months.

# Forecast

n_periods = 24

fitted, confint = [Link](n_periods=n_periods,

return_conf_int=True)

index_of_fc = pd.date_range([Link][-1], periods = n_periods,

freq='MS')

# make series for plotting purpose

fitted_series = [Link](fitted, index=index_of_fc)

lower_series = [Link](confint[:, 0], index=index_of_fc)

upper_series = [Link](confint[:, 1], index=index_of_fc)

# Plot

[Link](data)

[Link](fitted_series, color='darkgreen')

plt.fill_between(lower_series.index,

lower_series,

upper_series,

color='k', alpha=.15)
[Link]("SARIMA - Final Forecast of a10 - Drug Sales")

[Link]()
SARIMA – Final Forecasts

There you have a nice forecast that captures the expected seasonal
demand pattern.

15. How to build SARIMAX Model with

exogenous variable
The SARIMA model we built is good. I would stop here typically.

But for the sake of completeness, let’s try and force an external
predictor, also called, ‘exogenous variable’ into the model. This model is
called the SARIMAX model.

The only requirement to use an exogenous variable is you need to know

the value of the variable during the forecast period as well.

For the sake of demonstration, I am going to use the seasonal index from
the classical seasonal decomposition on the latest 36 months of data.

Why the seasonal index? Isn’t SARIMA already modeling the seasonality,
you ask?

You are correct.

But also, I want to see how the model looks if we force the recent
seasonality pattern into the training and forecast.

Secondly, this is a good variable for demo purpose. So you can use this
as a template and plug in any of your variables into the code. The
seasonal index is a good exogenous variable because it repeats every
frequency cycle, 12 months in this case.

So, you will always know what values the seasonal index will hold for the
future forecasts.

# Import Data

data =
pd.read_csv('[Link]
er/[Link] ', parse_dates=['date'], index_col='date')

Let’s compute the seasonal index so that it can be forced as a

(exogenous) predictor to the SARIMAX model.

# Compute Seasonal Index

from [Link] import seasonal_decompose

from [Link] import parse

# multiplicative seasonal component

result_mul = seasonal_decompose(data['value'][-36:], # 3 years

model='multiplicative',

extrapolate_trend='freq')

seasonal_index = result_mul.seasonal[-12:].to_frame()

seasonal_index['month'] = pd.to_datetime(seasonal_index.index).month

# merge with the base data

data['month'] = [Link]

df = [Link](data, seasonal_index, how='left', on='month')

[Link] = ['value', 'month', 'seasonal_index']

[Link] = [Link] # reassign the index.

The exogenous variable (seasonal index) is ready. Let’s build the

SARIMAX model.

import pmdarima as pm

# SARIMAX Model

sxmodel = pm.auto_arima(df[['value']],
exogenous=df[['seasonal_index']],

start_p=1, start_q=1,

test='adf',
max_p=3, max_q=3, m=12,

start_P=0, seasonal=True,

d=None, D=1, trace=True,

error_action='ignore',

suppress_warnings=True,

stepwise=True)

[Link]()

Fit ARIMA: order=(1, 0, 1) seasonal_order=(0, 1, 1, 12); AIC=536.818,

BIC=556.362, Fit time=2.083 seconds

Fit ARIMA: order=(0, 0, 0) seasonal_order=(0, 1, 0, 12); AIC=626.061,

BIC=635.834, Fit time=0.033 seconds

Fit ARIMA: order=(1, 0, 0) seasonal_order=(1, 1, 0, 12); AIC=598.004,

BIC=614.292, Fit time=0.682 seconds

Fit ARIMA: order=(0, 0, 1) seasonal_order=(0, 1, 1, 12); AIC=613.475,

BIC=629.762, Fit time=0.510 seconds

Fit ARIMA: order=(1, 0, 1) seasonal_order=(1, 1, 1, 12); AIC=559.530,

BIC=582.332, Fit time=3.129 seconds

(...Truncated...)

Fit ARIMA: order=(3, 0, 0) seasonal_order=(0, 1, 0, 12); AIC=556.094,

BIC=575.639, Fit time=0.260 seconds

Fit ARIMA: order=(3, 0, 0) seasonal_order=(0, 1, 2, 12); AIC=531.502,

BIC=557.562, Fit time=2.375 seconds

Fit ARIMA: order=(3, 0, 0) seasonal_order=(1, 1, 2, 12); AIC=nan,

BIC=nan, Fit time=nan seconds

Total fit time: 30.781 seconds

So, we have the model with the exogenous term. But the coefficient is
very small for x1 , so the contribution from that variable will be negligible.
Let’s forecast it anyway.

We have effectively forced the latest seasonal effect of the latest 3 years
into the model instead of the entire history.

Alright let’s forecast into the next 24 months. For this, you need the
value of the seasonal index for the next 24 months.

# Forecast

n_periods = 24

fitted, confint = [Link](n_periods=n_periods,

exogenous=[Link](seasonal_index.value, 2).reshape(-1,1),

return_conf_int=True)

index_of_fc = pd.date_range([Link][-1], periods = n_periods,

freq='MS')

# make series for plotting purpose

fitted_series = [Link](fitted, index=index_of_fc)

lower_series = [Link](confint[:, 0], index=index_of_fc)

upper_series = [Link](confint[:, 1], index=index_of_fc)

# Plot

[Link](data['value'])

[Link](fitted_series, color='darkgreen')

plt.fill_between(lower_series.index,
lower_series,

upper_series,

color='k', alpha=.15)

[Link]("SARIMAX Forecast of a10 - Drug Sales")

[Link]()
SARIMAX Forecast

16. Practice Exercises

In the AirPassengers dataset, go back 12 months in time and build the
SARIMA forecast for the next 12 months.

1. Is the series stationary? If not what sort of differencing is

required?
2. What is the order of your best model?
3. What is the AIC of your model?
4. What is the MAPE achieved in OOT cross-validation?
5. What is the order of the best model predicted
by auto_arima() method?

17. Conclusion
Congrats if you reached this point. Give yourself a BIG hug if you were
able to solve the practice exercises.

Introduction to ARIMA Models
No ratings yet
Introduction to ARIMA Models
12 pages
Lecture 9
No ratings yet
Lecture 9
35 pages
ARIMA Modeling Tutorial in R
No ratings yet
ARIMA Modeling Tutorial in R
12 pages
B29 Ads Exp 8
No ratings yet
B29 Ads Exp 8
17 pages
Time Series Analysis: ARIMA Models Explained
No ratings yet
Time Series Analysis: ARIMA Models Explained
51 pages
Part Ii - Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
No ratings yet
Part Ii - Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
14 pages
ARMA Model Basics and Forecasting Guide
No ratings yet
ARMA Model Basics and Forecasting Guide
12 pages
ARIMA Model Overview and Differences
No ratings yet
ARIMA Model Overview and Differences
21 pages
ARIMA Model: Pros and Cons Explained
No ratings yet
ARIMA Model: Pros and Cons Explained
6 pages
Understanding ARIMA for Time Series Forecasting
No ratings yet
Understanding ARIMA for Time Series Forecasting
13 pages
Understanding ARIMA Models for Forecasting
No ratings yet
Understanding ARIMA Models for Forecasting
21 pages
ADS Exp 8
No ratings yet
ADS Exp 8
5 pages
ARIMA Model: Pros and Cons Explained
No ratings yet
ARIMA Model: Pros and Cons Explained
3 pages
Understanding the ARIMA Model in Data Science
No ratings yet
Understanding the ARIMA Model in Data Science
52 pages
Understanding ARIMA Models for Time Series
No ratings yet
Understanding ARIMA Models for Time Series
10 pages
ARIMA Model Overview and Applications
No ratings yet
ARIMA Model Overview and Applications
6 pages
Understanding ARIMA and SARIMA Models
100% (1)
Understanding ARIMA and SARIMA Models
6 pages
Auto Regressive Integrated Moving Average - Material
No ratings yet
Auto Regressive Integrated Moving Average - Material
21 pages
ARIMA Time Series Forecasting in R
No ratings yet
ARIMA Time Series Forecasting in R
3 pages
Arima Arma
No ratings yet
Arima Arma
10 pages
Understanding ARIMA Models in Forecasting
No ratings yet
Understanding ARIMA Models in Forecasting
3 pages
ARIMA Time Series Forecasting Guide
No ratings yet
ARIMA Time Series Forecasting Guide
12 pages
Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
No ratings yet
Time Series Analysis: C5 ARIMA (Box-Jenkins) Models
14 pages
Understanding the ARIMA Model for Time Series Forecasting
No ratings yet
Understanding the ARIMA Model for Time Series Forecasting
3 pages
Univariate Time Series Modeling Techniques
No ratings yet
Univariate Time Series Modeling Techniques
12 pages
ARIMA vs Linear Regression in Trading
No ratings yet
ARIMA vs Linear Regression in Trading
35 pages
ARIMA Modelling in R: Overview and Steps
No ratings yet
ARIMA Modelling in R: Overview and Steps
10 pages
Understanding ARIMA(p,d,q) Parameters
No ratings yet
Understanding ARIMA(p,d,q) Parameters
9 pages
ARIMA Model Overview for Business Forecasting
No ratings yet
ARIMA Model Overview for Business Forecasting
8 pages
Univariate Time Series Models Overview
No ratings yet
Univariate Time Series Models Overview
71 pages
ARIMA Components and Forecasting Guide
No ratings yet
ARIMA Components and Forecasting Guide
2 pages
Arima
No ratings yet
Arima
6 pages
Non-seasonal ARIMA Models Overview
No ratings yet
Non-seasonal ARIMA Models Overview
14 pages
ARIMA Forecasting Overview
No ratings yet
ARIMA Forecasting Overview
17 pages
ARIMA Model Basics and Forecasting Techniques
No ratings yet
ARIMA Model Basics and Forecasting Techniques
30 pages
Simulating Time Series with arima.sim
No ratings yet
Simulating Time Series with arima.sim
10 pages
ARIMA & LSTM for Stock Forecasting
No ratings yet
ARIMA & LSTM for Stock Forecasting
33 pages
Understanding Non-Seasonal ARIMA Models
No ratings yet
Understanding Non-Seasonal ARIMA Models
19 pages
Time Series Forecasting Models Explained
No ratings yet
Time Series Forecasting Models Explained
3 pages
Understanding ARIMA Models for Forecasting
No ratings yet
Understanding ARIMA Models for Forecasting
4 pages
Time Series Analysis and ARIMA Model Guide
No ratings yet
Time Series Analysis and ARIMA Model Guide
11 pages
Time Series Analysis and Models Guide
No ratings yet
Time Series Analysis and Models Guide
33 pages
Univariate Time Series Models Guide
No ratings yet
Univariate Time Series Models Guide
9 pages
Time Series Data Analysis with ARIMA
No ratings yet
Time Series Data Analysis with ARIMA
47 pages
Module 5 University Question With Solution
No ratings yet
Module 5 University Question With Solution
16 pages
Understanding ARIMA Models
No ratings yet
Understanding ARIMA Models
5 pages
Understanding ARIMA Models for Time Series
No ratings yet
Understanding ARIMA Models for Time Series
10 pages
Time Series Forecasting with ARIMA Models
No ratings yet
Time Series Forecasting with ARIMA Models
7 pages
Unit-2 Pba
No ratings yet
Unit-2 Pba
113 pages
Time Series Analysis and ARIMA Models
No ratings yet
Time Series Analysis and ARIMA Models
30 pages
ARIMA Time Series Forecasting Project
No ratings yet
ARIMA Time Series Forecasting Project
19 pages
Understanding Strict Stationarity
No ratings yet
Understanding Strict Stationarity
5 pages
Time Series ARIMA Models PDF
No ratings yet
Time Series ARIMA Models PDF
22 pages
7 SEng5305 Chapter 7 Time Series Analysis
No ratings yet
7 SEng5305 Chapter 7 Time Series Analysis
30 pages
ARIMA Model Prediction Techniques
No ratings yet
ARIMA Model Prediction Techniques
18 pages
Understanding ARIMA Models for Forecasting
No ratings yet
Understanding ARIMA Models for Forecasting
2 pages
ARIMA Model Explained with Example
No ratings yet
ARIMA Model Explained with Example
2 pages
Stock Price Forecasting and Risk Analysis
No ratings yet
Stock Price Forecasting and Risk Analysis
2 pages
Credit Risk Management Fundamentals
No ratings yet
Credit Risk Management Fundamentals
111 pages
A Sensible Mutual Fund Selection Model
No ratings yet
A Sensible Mutual Fund Selection Model
14 pages
Data Preprocessing Techniques in Python
No ratings yet
Data Preprocessing Techniques in Python
11 pages
Basel Norms in Banking Regulation Explained
No ratings yet
Basel Norms in Banking Regulation Explained
6 pages
Statsmodel Linear Regression Summary Explained
No ratings yet
Statsmodel Linear Regression Summary Explained
19 pages
NVDA MACD Strategy Backtest Results
No ratings yet
NVDA MACD Strategy Backtest Results
202 pages
Quant Investing with Python Insights
No ratings yet
Quant Investing with Python Insights
21 pages
SPAN Margin System Overview
No ratings yet
SPAN Margin System Overview
18 pages
Python Data Science Roadmap Guide
No ratings yet
Python Data Science Roadmap Guide
12 pages
Local Weather Chart Lesson Plan
No ratings yet
Local Weather Chart Lesson Plan
16 pages
La Crosse 308-805 Weather Station Manual
No ratings yet
La Crosse 308-805 Weather Station Manual
2 pages
Temperature Effects on Aluminum Fatigue
No ratings yet
Temperature Effects on Aluminum Fatigue
6 pages
Introduction to Extreme Value Analysis
No ratings yet
Introduction to Extreme Value Analysis
58 pages
December 20 Storm Impact on Nanaimo
No ratings yet
December 20 Storm Impact on Nanaimo
9 pages
An Exploratory Investigation of New Product Forecasting Practices (Kahn 2001)
No ratings yet
An Exploratory Investigation of New Product Forecasting Practices (Kahn 2001)
11 pages
Aptis ESOL - Advanced - Practice - Book - 2024 - A4
No ratings yet
Aptis ESOL - Advanced - Practice - Book - 2024 - A4
42 pages
Slozka
No ratings yet
Slozka
8 pages
Deep Generative Models for Precipitation Nowcasting
No ratings yet
Deep Generative Models for Precipitation Nowcasting
46 pages
Typhoon Julian Bulletin - September 30, 2024
No ratings yet
Typhoon Julian Bulletin - September 30, 2024
4 pages
Understanding Weather: Key Concepts
No ratings yet
Understanding Weather: Key Concepts
2 pages
Climate Classification in Africa
No ratings yet
Climate Classification in Africa
18 pages
Understanding Typhoons for Grade 8
No ratings yet
Understanding Typhoons for Grade 8
49 pages
Effective Forecasting Techniques in Operations
No ratings yet
Effective Forecasting Techniques in Operations
16 pages
Today's Weather Activities for Kids
No ratings yet
Today's Weather Activities for Kids
3 pages
LSTM-Based Weather Forecasting Model
No ratings yet
LSTM-Based Weather Forecasting Model
2 pages
H2S Oxidation Kinetics and Modelling
No ratings yet
H2S Oxidation Kinetics and Modelling
8 pages
Cross-Impact Method Overview
No ratings yet
Cross-Impact Method Overview
21 pages
Quantitative Forecasting Techniques Explained
No ratings yet
Quantitative Forecasting Techniques Explained
3 pages
Forest Fire Prediction Model Using AI
No ratings yet
Forest Fire Prediction Model Using AI
16 pages
Technological Forecasting Methods 1918-1973
No ratings yet
Technological Forecasting Methods 1918-1973
5 pages
Unit8 PDF
No ratings yet
Unit8 PDF
2 pages
Wild Weather Form 1 Worksheet
No ratings yet
Wild Weather Form 1 Worksheet
45 pages
Sales Forecast Using Chain Ratio Method
No ratings yet
Sales Forecast Using Chain Ratio Method
20 pages
Rainfall Measurement Techniques
No ratings yet
Rainfall Measurement Techniques
14 pages
Indian Climate Patterns and Seasons
No ratings yet
Indian Climate Patterns and Seasons
4 pages
Afrikaans Weather Forecast Insights
No ratings yet
Afrikaans Weather Forecast Insights
6 pages
Business Analytics and Decision-Making Guide
No ratings yet
Business Analytics and Decision-Making Guide
4 pages
El Niño's Delayed Impact on India's Spring SAT
No ratings yet
El Niño's Delayed Impact on India's Spring SAT
14 pages
AI and Math: Patterns and Probability
No ratings yet
AI and Math: Patterns and Probability
7 pages