Implied Correlation in Equity Baskets
Implied Correlation in Equity Baskets
1 Introduction
Equity basket correlation is an important risk factor. It characterizes the
strength of linear dependence between assets and thus measures the degree
of portfolio diversification. It is an input for many pricing models, plays a
key role in portfolio optimization and risk management. The concept of a
time-varying correlation is frequently used in studies that describe the joint
dynamics of assets, Bollerslev et al. (1988), Engle (2002). However, the idea
of considering the correlation as an asset, on its own, is relatively new and has
recently gained popularity together with the emergence of such derivative in-
struments as variance, volatility, correlation swaps and trading strategies with
them, Demeterfi et al. (1999), Bossu et al. (2005). In this context being able
2 Wolfgang Karl Härdle and Elena Silyakova
the future realized correlation. Driessen et al. (2009) investigate the power
of options implied correlation to explain the future realized correlation and
conclude that its predictive power is quite high.
Here we model the implied correlation (IC), which is an object of very high
dimensionality. Similarly to the IV, every day one recovers an IC surface. We
model the IC with a dynamic semiparametric factor model (DSFM), Fengler
et al. (2007), Park et al. (2009) and Song et al. (2014), and find that it yields
a low dimensional representation as a linear combination of a small number
of time-invariant basis functions (surfaces), whose time evolution is driven by
a series of coefficients; technical aspects are also described in Sperlich et al.
(1999). We produce an IC forecast and use it in several hedging schemes
for a dispersion strategy. For the empirical analysis we chose the German
market represented by the DAX portfolio over the 2-years sample period from
20100802 to 20120801 (dates are written as YYYYMMDD). Backtesting shows
that the hedge allows to the reduction of potential losses and increases the
average profitability of the strategy.
The paper is structured as follows. In Section 2 we introduce the notions
of realized, model-implied and model-free implied volatility and correlation
and describe the basic setup of a dispersion strategy with variance swaps.
The DSFM model for IC is introduced in Section 3 starting with general de-
scription in Section 3.1, followed by the description of the functional principal
component analysis (FPCA) approach to find the basis functions in Section
3.2 and the estimation procedure for both factors and factor loadings in Sec-
tion 3.3. Section 4 presents the dataset taken for the empirical study, followed
by a description of the estimation results in Section 5. Here, first, we inter-
pret obtained factors and factor loadings and propose a time series model for
low-dimensional factors in 5.1. Finally in Section 5.2 we propose and compare
alternative dispersion strategy setups: a no hedge, a naïve approach and an
advanced hedge. Section 6 concludes.
4 Wolfgang Karl Härdle and Elena Silyakova
2 Correlation trading
2.1 Average basket correlation
X XX
2
σB = wi2 σi2 + wi wj σi σj ρij , (1)
i i j6=i
where σi2 denotes the variance of the i-th asset return and wi its weight
in the basket. Now, assuming that ρij is constant for every pair (i, j), one can
imply the equicorrelation ρ from (1):
2 − P 2 2
σB i wi σi
ρ= P P . (2)
i j6=i wi wj σi σj
wi wj σi σj
ci,j = P P . (4)
i j6=i wi wj σi σj
2
σB,min is achieved when ρ = 0 that is when the assets in a basket are
fully diversified. In the case of no diversification, one observes the maximal
2
possible basket variance σB,max corresponding to ρ = 1.
Further we can rewrite ρ by substituting (5) and (6) to (2):
2 − σ2
σB B,min
ρ= 2 2 (7)
σB,max − σB,min
and obtain an additional interpretation as a measure for the degree of
diversification, Skintzi and Refenes (2005). In fact (7) shows how far σB2 is from
2 2 2
its minimal value σB,min relative to the possible value range σB,max − σB,min ,
or in other words, how far the basket is from the perfect diversification. High
ρ is the sign of a poorly diversified portfolio, which is typical for the market
downturn, when asset prices simultaneously drop driving σB 2 up. It means
Based on (2) we conclude that the exposure to the basket correlation ρ can be
2 and its constituents, σ 2 .
achieved by exposures to the variances of a basket σB i
Such trades can be realized via a combination of variance swaps. A variance
swap is an over-the-counter contract opened at t, which at t + τ pays the
2
difference between the variance cumulated over the life time of the swap σt+τ
2
and the fixed pre-defined strike σ˜t (τ ):
n o
2
σt+τ − σ̃t2 (τ ) Nvar , (8)
6 Wolfgang Karl Härdle and Elena Silyakova
70 0.7 0.7
60 0.6 0.6
Correlation of DAX
50 0.5 0.5
40 0.4 0.4
30 0.3 0.3
20 0.2 0.2
10 0.1 0.1
0 0 0
Mar10 Sep10 Apr11 Oct11 May12 Nov12 0 10 20 30 40 50 60 70 80
Fig. 1. Left panel: DAX correlation (2) - dashed, DAX volatility (11) - solid black, volatility
of DAX constituents Adidas, BMW, Siemens, Daimler, [Link], Lufthansa volatilities (11)
- color, the stock market fall 2011 - shaded area. Right panel: scatter plot DAX volatility
vs. correlation. Estimation period - from 20100104 to 20121228, estimation window - 3
months.
where Nvar is the notional amount. Here and later t and τ are given in
fractions of a year.
The strike of a variance swap is the risk-neutral expectation at t of the
integrated variance from t to t + τ . It is also known as the model-free implied
variance (MFIV), where “model-free” indicates that the expectation does not
depend on the specification of the underlying price process, Britten-Jones and
Neuberger (2000). MFIV can be approximated by a function of current option
prices, Breeden and Litzenberger (1978), Carr and Madan (1998), Britten-
Jones and Neuberger (2000), which has the following form
t+τ
Z
σ˜t 2 (τ ) = EQ
t σ 2 (s)ds =
t
2erτ ZSt P (K, τ )dK
Z∞
Ct (K, τ )dK
t
+ , (9)
τ
K2 K2
0 St
MFIV can be opposed to the implied variance σ bt2 (κ, τ ), the square of the
implied volatility (IV), which is obtained by solving
Vt (σ
b, κ, τ ) − V̆t (κ, τ ) = 0, (10)
where Vt is the theoretical (model) option price, V̆t option price taken from
K
the market, κ = moneyness of the option. IV, in comparison to MFIV,
St erτ
is a function of both κ and τ , meaning that at every t one recovers a cloud of
points, which can be approximated by a surface, Cont and Da Fonseca (2002),
Fengler et al. (2007).
The floating leg of the variance swap, the realized variance (RV) of an
asset from t to t + τ , can be computed from the time series of daily asset
returns in different ways, depending on the contract specification. Here we
use the most common following form
252(t+τ ) 2
X Si
2
σt+τ = τ −1 log . (11)
i=252t
Si−1
1.4 1.4
1.2 1.2
MFIC (red), RC (blue)
1 1
0.8 0.8
0.6 0.6
0.4 0.4
0.2 0.2
0 0
Mar10 Sep10 Apr11 Oct11 May12 Nov12 0 0.2 0.4 0.6 0.8 1 1.2 1.4
Fig. 2. Left panel: DAX ρt,τ - blue, ρ̃t (τ ) - red, right panel: scatter plot of DAX ρt,τ (hori-
zontal axis) vs ρ̃t (τ ) (vertical axis), for t + 0.25 from 20100802 till 20120801.
MFIV and their differences for all 30 DAX constituents. The latter are found
to be negative for most of the stocks and for the DAX index.
Driessen et al. (2009) interpret their t-test results as indirect evidence that
a negative correlation risk premium (CRP) exists. To identify the existence
of CRP in the DAX dataset we compute the model free implied correlation
(MFIC) ρ̃t (τ ) from the MFIVs of DAX and its constituents and the realized
correlation (RC) ρt+τ from the corresponding RV by applying (2):
2 P 2 2
σ̃t,DAX (τ ) − i wi σ̃t,i (τ )
ρ̃t (τ ) = P P , (12)
i j6=i wi wj σ̃t,i (τ )σ̃t,j (τ )
2 P 2 2
σt+τ,DAX − i wi σt+τ,i
ρt+τ = P P . (13)
i j6=i wi wj σt+τ,i σt+τ,j
Figure 2 plots the MFIC and the RC of DAX computed over the 3-
month window and with 3 month maturity respectively (τ = 0.25). The H0 :
RC=MFIC of the t-test is strongly rejected. Using this finding and taking into
account results in the literature we would expect the ρt+τ − ρ̃t (τ ) (CRP) to
be negative most of the time. One of the ways of exploiting this observation
is to make a bet on the market correlation by entering a dispersion strategy.
Implied Basket Correlation Dynamics 9
n o N
X n o
2 2
Dt+τ = − σt+τ,B − σ̃t,B (τ ) + 2
wi2 σt+τ,i 2
− σ̃t,i (τ ) . (14)
i=1
XX XX
Dt+τ = ρ̃t (τ ) wi wj σ̃t,i (τ )σ̃t,j (τ ) − ρt+τ wi wj σt+τ,i σt+τ,j . (15)
i j6=i i j6=i
which illustrates that by entering the dispersion strategy one obtains ex-
posure to ρt+τ − ρ̃t (τ ), where the floating leg ρt+τ is computed with (11) and
(2) at expiry, and the fixed leg ρ̃t (τ ) is a function of variance swap strikes (9).
Test results described in Section 2.2 suggest that we should, on average, ex-
pect ρt+τ − ρ̃t (τ ) < 0. It also means the dispersion strategy with payoff Dt+τ
on average would have a profit. However, as one can see in Figure 2, there
might be days when ρt+τ − ρ̃t (τ ) ≥ 0. In order to hedge against these potential
losses one needs a forecast of the floating leg of the dispersion strategy.
Another possible modification of the dispersion trading strategy does not
involve trading on the OTC market and can be implemented with standard-
ized market instruments, puts and calls. The strategy consists in selling index
option straddles and purchasing straddles in options on index components.
The forecast of the implied correlation surface can provide the insight into the
relative cost of index options compared to the price of options on individual
stocks that comprise the index. In comparison to the single historical or im-
plied volatility forecast, usually used for this purpose, the correlation surface
can provide information for trading options on the whole maturity spectrum.
10 Wolfgang Karl Härdle and Elena Silyakova
Which means one can buy straddles with different strikes, depending on the
implied correlation forecast.
2 (κ, τ ) − P 2 b 2 (κ, τ )
σ
bt,B i wi σt,i
ρbt (κ, τ ) = P P . (17)
i j6=i wi wj σ
bt,i (κ, τ )σ
bt,j (κ, τ )
Figure 3 displays ρbt (κ, τ ) in different trading days: 20111209, 20120710.
Due to the specific option data structure, every day one observes a “cloud of
strings” that visually resembles a surface and can be recovered by applying
nonparametric smoothing. One can clearly see that surfaces have shape sim-
ilarities, but vary in levels, slopes and curvatures. Thus they may be treated
as daily realizations of a random function. In addition one can observe that
the strings do not have fixed spacial locations. In order to model the dynam-
ics of such a complicated multi-dimensional object we apply the DSFM that
reduces the dimensionality of the problem and allows the ICS to be studied
in a conventional time-series context.
def 1 1+u
T (u) = log (18)
2 1−u
def
with Yt,j = T {ρb(κt,j , τt,j )}.
Our aim is to model the dynamics of {(Yt,j , Xt,j ), 1 ≤ t ≤ T, 1 ≤ j ≤ Jt },
where Xt,j = (κt,j , τt,j ). The technique we employ allows us to reduce the
Implied Basket Correlation Dynamics 11
Fig. 3. ICS implied by prices of DAX options traded on the 20111209, 20120710, surfaces
recovered by the Nadaraya-Watson smoothing
L
X
Yt,j = m0 (Xt,j ) + Zt,l ml (Xt,j ) + εt,j . (19)
l=1
In representation (19) m are chosen data driven and do not have a par-
ticular (parametric) form.
Here two important remarks are appropriate. Firstly, the unknown basis
functions m must be estimated. Fengler et al. (2007) estimate both m and Zt
iteratively using kernel smoothing techniques, Park et al. (2009) approximates
m by tensor B-splines basis functions weighted by a coefficients matrix. Here
we employ a functional principal component analysis (FPCA) approach that
will be described in Section 3.2. The nonparametric estimation procedure that
we use is introduced in Section 3.3; the basics of this technique can be found
in Song et al. (2014).
12 Wolfgang Karl Härdle and Elena Silyakova
The second issue is the estimation of the latent factors Zt . Having the
data-driven basis m
b l in hand, we can estimate daily factors by the ordinary
least squares (OLS) method. Afterwards one fits the econometric model to Zbt ,
as it was done by Cont and Da Fonseca (2002) and Hafner (2004), who fitted
AR(1) to every Zt,l , l ∈ {1, . . . , L}, or in Fengler et al. (2007) who considered
a multivariate VAR(2) process.
2 ) = λ and
called the j -th principal component score with E(ζtj ) = 0, E(ζtj j
E(ζtj ζik ) = 0 for j 6= k, Ramsay and Silverman (2010). Thus one may rewrite
ρt (u) − µ(u) in the Karhunen-Loève form:
∞
X
ρt (u) − µ(u) = ζtj γj (u). (23)
j=1
Here ζtj indicates how strong the influence of the j -th basis function on
the shape of the t-th surface is. The higher the score, the closer the shape of
ρt resembles the shape of the j -th eigenfunction.
In practice one needs to take L eigenfunctions to replace the infinite sum
in (23) by the finite sum of L basis functions, corresponding to the highest
eigenvalues. One calls {γj }Lj=1 the empirical orthonormal basis, Ramsay and
Silverman (2010). In the next Section we discuss the estimation procedure for
{γj }L
j=1 as well as criteria for the L selection.
Jt
T X
X
{Yt,j − aµ − bµ (u − Xt,j )}2 Khµ (Xt,j − u) , (24)
t=1 j=1
T
X X
{Yt,j Yt,k − aφ − bφ,1 (u − Xt,j ) − bφ,2 (v − Xt,k )}2 (25)
t=1 j,k:1≤j6=k≤Jt
×Khφ (Xt,j − u) Khφ Xt,k − v ,
b v) = b
and take φ(u, aφ .
Here Kh denotes the two-dimensional product kernel, Kh (q̄) = kh1 (q̄1 ) ×
kh2 (q̄2 ), h = (h1 , h2 )⊤ , based on one-dimensional kh (q̄) = h−1 k(h−1 q̄). For our
14 Wolfgang Karl Härdle and Elena Silyakova
application we selected the quartic kernel, where k(q̄) = 15/16(1− q̄ 2)2 for |q̄| <
1 and 0 otherwise. For both (24) and (25) kernel bandwidths hµ = (hµ,1 , hµ,2 )⊤
and hφ = (hφ,1 , hφ,2 )⊤ are to be selected. The procedure is described in Ap-
pendix A.5. Figure 4 shows an example of µ b(u) estimated using the dataset
described in Section 4 for a sub-sample from 20100802 to 20110801.
Finally, having estimates µ b v), we compute ψ(u,
b(u) and φ(u, b v) using (20)
and take its L eigenfunctions corresponding to the largest eigenvalues as m b l,
l ∈ {1, . . . , L}. Parameter L is chosen in such a way that the selected eigen-
functions explain the large share of variability in the original data. It is also
necessary to mention that ψ(u, b v) is a matrix of a very large dimensionality.
To obtain its consistent estimator, suitable for further spectral decomposition,
various matrix regularization techniques can be used., e.g. banding as in Bickel
and Levina (2008b), thresholding in Bickel and Levina (2008a), eigenvalues
cleaning as in Laloux et al. (1999) and factor models described in Fan et al.
(2008). We use the latter in this step.
In the second step using m bt = (1, Z
b we obtain the estimates Z bt,1 , . . . , Z
bt,L )⊤
as minimizers of the following least squares criterion:
X Jt n
T X o2
Zbt = arg min Yt,j − Zt⊤ m(X
b t,j ) . (26)
Zt
t=1 j=1
Implied Basket Correlation Dynamics 15
Table 1. Summary statistics: IC data computed from the DAX index and constituents op-
tions over the period from 20090803 to 20120801 including the 1 year estimation period (3
years, 770 trading days, 135 obs./day). MFIC computed from daily variance swaps rates.
RC computed from daily stock returns from 20100802 to 20120801 (2 years, 515 trading
days). The figures are given after filtering and data preparation.
4 Data
We study the dispersion strategy over the two year sample period from
20100802 to 20120801 on the German market represented by the DAX bas-
ket. The basket is composed of 23 stocks, constituents of DAX, with the most
liquidly traded options and weights proportional to the current market capi-
talization. To model the dynamics of the IC and to construct the dispersion
trade we operate with three main variables representing different correlation
estimates. MFIC, RC, and IC. The datasets are described in Table 1.
The MFIC dataset contains daily series of MFICs with maturities 0.083,
0.25, 0.5 and 1 years computed via (12) from variance swap rates given by
Bloomberg as a discrete approximation of (9).
The RC dataset contains daily series of RCs computed with (11) and (13)
from the Bloomberg end-of-day stock prices over estimation windows 0.083,
0.25, 0.5 and 1 years.
The IC dataset is constructed using out-of-the-money (OTM) DAX and
single stock options from the EUREX database. To estimate the DSFM model
and produce forecasts for the sample period the dataset covers one additional
year from 20090803 to 20100730. The dataset is transaction-based, meaning
every trade is registered with the date it occurred, expiry date, underlying
16 Wolfgang Karl Härdle and Elena Silyakova
ticker, exercise price (strike) and settlement price. To obtain IV from option
prices via (10) we distinguish between index and single stock options. For
index options, which have the European type of option payoff, the Black-
Sholes (BS) model is used. To account for dividends and early execution in
options on single stocks (American payoff) we use binomial trees, Cox et al.
(1979), and bisection algorithm. Other necessary model parameters, such as
stock prices, index levels, dividend amounts for constituent stocks, interest
rates and stock market capitalization are taken from the Bloomberg database.
As a risk free rate proxy we take daily values of EURIBOR (Euro Interbank
Offered Rate) with 1 week up to 1 year maturities and use linear interpolation
to obtain values for required option τ . We use the most liquid segment of
data with κ ranging from 0.8 to 1.2 and τ from 10 days to 1 year. Options
outside of this range are excluded from the data set due to the poor data
quality, which does not allow to recover implied volatility surfaces for the
DAX and all constituents and to compute implied correlation on a daily basis.
Figure 3 in Section 3 shows an example of the ICS plotted using the entire
available option data, including options outside of the τ -range from 0.8 to 1.2,
for two selected “rich with data” days (20111209 and 2012071). As one can
see, some correlations observed in Figure 3 are more extreme in comparison
to the values in Table 1. The plots show the nature of the implied correlation
estimate, which is not necessarily observed in a range from 0 to 1. Those days
reveal the possibility of a so-called “volatility arbitrage”. Having in mind the
empirical findings described in Section 2.2, stating that the VRP of an index
is much more pronounced then of constituents, one might take a short position
in a too-expensive delta-hedged index option, when the implied correlation is
considerably higher than 1.
Options from original EUREX dataset are not given on a regular (κ, τ )-
grid, required in (17). In the τ -dimension, maturities are standardized by
market regulation, so every t one can find several τt , similarly for the index
and for all constituents. However, in κ-dimension one needs to interpolate. At
every t we use the original (κt , τt ) grid of the index and linearly interpolate
IVs of all constituents to obtain values corresponding to this grid. To avoid
computational problems with a highly skewed empirical distribution of (κt , τt ),
we transform the initial space [0.8, 1.2] × [0.03, 1] to [0, 1]2 using an empirical
distribution function. Also, we remove options with extremely high IVs (larger
than 50%) considering them the misprints in trade registration and finally use
(17) to obtain IC, which produces, on average, 135 observations per day.
Figure 1 shows there is a linear dependence between basket correlation
and volatility. We check this finding in the RC dataset for different estimation
windows and in IC dataset for different maturities. The RC data allows for
Implied Basket Correlation Dynamics 17
5 Empirical results
5.1 Estimation Results and Factor Modeling
1 1 1
0.5 0.5 0.5
t,1
t,2
t,3
d
d
0 0 0
Z
Z
−0.5 −0.5 −0.5
−1 −1 −1
Aug09 Nov09 Mar10 Jul10 Aug09 Nov09 Mar10 Jul10 Aug09 Nov09 Mar10 Jul10
t t t
1 1 1
0 0 0
bt,1 , Z
Fig. 6. Driving factors of the DAX ICS Z bt,2 , Z
bt,3 and ACF up to the 20th lag from
20090803 to 20100730
Zbt,1 , Zbt,2 , Zbt,3 for stationarity, Appendix A.6. For Zbt,2 in sub-sample 20100802
- 20110729 we cannot reject the hypothesis of a unit root, so we use its first
differences instead. Then we define the appropriate number of lags, or order
p, by computing Akaike’s information criterion (AIC), Schwarz’s Bayesian in-
formation criterion (SBIC), and the Hannan and Quinn information criterion
(HQIC) values, Appendix A.7. An * appearing next to the test statistics in-
dicates the optimal lag. Except for the sub-sample 20110802 - 20120801, the
test statistics suggest p = 2, so we make a choice in favor of this specification.
The estimation results are summarized in Appendix A.8. We also conducted
a portmanteau (Q) test for the null hypothesis that a series of residuals ex-
hibits no autocorrelation. The test does not indicate the presence of a serial
correlation.
Based on the results, we can distinguish the influence of each factor on
the time evolution of the ICS. The first factor can be interpreted as level, the
second as maturity and the third as a moneyness effect. The relative size of
the largest eigenvalues of (20) suggest that m b 1 is capable of capturing the
biggest share of the surface variability. The variation captured by the second
mb 2 has a smaller influence, since it is only responsible for the surface shape
transformation in the τ dimension. Finally, since the variation of the ICS in
the κ dimension is relatively small, the m b 3 has a smaller impact, which is also
b
reflected in the Zt,3 series.
The forecast of Zbt,1 , Zbt,2 , Zbt,3 modeled with VAR(2) together with esti-
mated fixed m b 0, m
b 1, m
b 2, m
b 3 give a forecast of the ICS.
Implied Basket Correlation Dynamics 19
Here we show that using the correlation forecast one can improve the original
dispersion strategy (14) and test it empirically over the 2-years sample period
20100801 - 20120802. We compare the payoff of the strategy without hedging
with the naïve hedging strategy and propose its improvement, the advanced
strategy.
To obtain the value of the naïve hedge position to be held over ∆t days
from t + τ − ∆t till t + τ we make a ∆t-days ahead DSFM forecast ρbt+τ (1, t + τ )
and use it as ρt+τ in (14). Thus the size of the position is defined by
XX
h
Dt+τ = wi wj σ̃t,i (τ )σ̃t,j (τ ) {ρ̃t (τ ) − ρbt+τ (1, t + τ )} . (27)
i j6=i
h
Table 3. Summary statistics for payoffs Dt+τ (no hedge), Dt+τ − Dt+τ (naïve hedge),
adv (advanced hedge) from 20100101 to 20120801, ∆t is one day, best results (highest
Dt+τ
min, max, mean and smallest stdd.) are given in italic
in dispersion strategy), do not hedge. Thus we can write the payoff of the
advanced strategy at t + τ as follows:
(
h
Dt+τ − Dt+τ , if ρbt+τ (1, t + τ ) ≥ ρ̃t (τ )
adv
Dt+τ = (29)
Dt+τ , if ρbt+τ (1, t + τ ) < ρ̃t (τ ).
Since a variance swap contract costs nothing to initiate (we ignore trans-
actions costs), the presented series of daily payoffs correspond to daily P&L
of the hypothetical trade where swaps expire daily over the whole period from
20100801 till 20120802. We compare the cash flows from three strategies. As
one can see in Table 3, the advanced strategy outperforms the other two by
having the smallest maximal losses, highest maximal gains (τ = 0.25, 0.5) and
the highest (second highest for τ = 1) average payoff over the studied sample
period.
In this simplified setting the financing costs are not taken into account for
both strategies.
6 Conclusions
In this study we investigated the implied correlation (IC) of the DAX index
basket and introduced a hedging approach for the dispersion trading strategy
Implied Basket Correlation Dynamics 21
References
Allen, P. and Granger, N. (2005). Correlation vehicles. techniques for trading equity correla-
tion., Technical report, JPMorgan European Equity Derivatives Strategy.
Bai, Z. D. (1999). Methodologies in spectral analysis of large dimensional random matrices,
Statistica Sinica 9: 611–677.
Bakshi, G., Cao, C. and Chen, Z. (2000). Do call and underlying prices always move in the
same direction?, Review of Financial Studies 13(3): 549–584.
Bakshi, G., Kapadia, N. and Madan, D. (2003). Stock return characteristics, skew laws, and
differential pricing of individual equity options, Review of Financial Studies 16: 101–
22 Wolfgang Karl Härdle and Elena Silyakova
143.
Bickel, P. J. and Levina, E. (2008a). Covariance regularization by thresholding, Ann.
Statist. 36(6): 2577–2604.
Bickel, P. J. and Levina, E. (2008b). Regularized estimation of large covariance matrices,
Ann. Statist. 36(1): 199–227.
Blair, B. J., Poon, S.-H. and Taylor, S. J. (2001). Forecasting S&P100 volatility: the in-
cremental information content of implied volatilities and high-frequency index returns,
Journal of Econometrics 105(1): 5 – 26.
Bollerslev, T., Engle, R. F. and Wooldridge, J. M. (1988). A capital asset pricing model
with time-varying covariances, Journal of Political Economy 96(1): 116–31.
Bossu, S., Strasser, S. and Guichard, R. (2005). Just What You Need To Know About
Variance Swaps, Technical report, JPMorgan Equity Derivatives report.
Bourgoin, F. (2001). Stress-testing correlations: An application to portfolio risk manage-
ment, in C. Dunis, A. Timmermann and J. Moody (eds), Developments in forecast
combination and portfolio choice, New York: Wiley.
Breeden, D. T. and Litzenberger, R. H. (1978). Prices of state-contingent claims implicit in
option prices, Journal of Business 51(4): 621–51.
Britten-Jones, M. and Neuberger, A. J. (2000). Option prices, implied price processes, and
stochastic volatility, Journal of Finance 55(2): 839–866.
Campa, J. M. and Chang, P. (1998). The forecasting ability of correlations implied in for-
eign exchange options, Journal of International Money and Finance 17(6): 855 – 880.
Carr, P. and Madan, D. (1998). Towards a theory of volatility trading, Reprinted in Option
Pricing, Interest Rates, and Risk Management, Musiella, Jouini, Cvitanic, University
Press, pp. 417–427.
Carr, P. and Wu, L. (2009). Variance risk premiums, Review of Financial Studies
22(3)(3): 1311–1341.
Christensen, B. and Prabhala, N. (1998). The relation between implied and realized volatil-
ity, Journal of Financial Economics 50(2): 125 – 150.
Cont, R. and Da Fonseca, J. (2002). Dynamics of implied volatility surfaces, Quantitative
Finance 2: 45–60.
Cox, J. C., Ross, S. A. and Rubinstein, M. (1979). Option pricing: A simplified approach,
Journal of Financial Economics 7: 229–263.
Demeterfi, K., Derman, E., Kamal, M. and Zou, J. (1999). More than You Ever Wanted to
Know about Volatility Swaps, Technical report, Goldman Sachs Quantitative Strategies
Research Notes.
Driessen, J., Maenhout, P. J. and Vilkov, G. (2009). The price of correlation risk: Evidence
from equity options, Journal of Finance 9: 1377–1406.
Engle, R. (2002). Dynamic conditional correlation: A simple class of multivariate generalized
autoregressive conditional heteroskedasticity models, Journal of Business & Economic
Statistics 20(3): 339–50.
Engle, R. F., Shephard, N. and Sheppard, K. (2008). Fitting vast dimensional time-varying
covariance models, Economics Series Working Papers 403, University of Oxford, De-
partment of Economics.
URL: [Link]
Fan, J., Fan, Y. and Lv, J. (2008). High dimensional covariance matrix estimation using a
factor model, Journal of Econometrics 147: 186–197.
Implied Basket Correlation Dynamics 23
Fengler, M. R., Härdle, W. K. and Mammen, E. (2007). A semiparametric factor model for
implied volatility surface dynamics, Journal of Financial Econometrics 5(2): 189–218.
Fleming, J. (1998). The quality of market volatility forecasts implied by S&P100 index
option prices, Journal of Empirical Finance 5(4): 317 – 345.
Hafner, R. (2004). Stochastic Implied Volatility, Springer Verlag, Heidelberg.
Hall, P., Müller, H.-G. and Wang, J.-L. (2006). Properties of principal component methods
for functional and longitudinal data analysis, Ann. Statist. 34(3): 1493–1517.
Härdle, W., Müller, M., Sperlich, S. and Werwatz, A. (2004). Nonparametric and Semipara-
metric Models, Springer Verlag, Heidelberg.
Härdle, W. and Simar, L. (2012). Applied Multivariate Statistical Analysis, 4rd edn,
Springer Verlag, Heidelberg.
Indritz, J. (1963). Methods in analysis, Macmillan New York.
Laloux, L., Cizeau, P., Bouchaud, J.-P. and Potters, M. (1999). Noise dressing of financial
correlation matrices, Physical Review Letters 83: 1467–1470.
Ledoit, O. and Wolf, M. (2003). Improved estimation of the covariance matrix of stock
returns with an application to portfolio selection, Journal of Empirical Finance
10(5): 603–621.
Lopez, J. and Walter, C. (2000). Is implied correlation worth calculating? Evidence from
foreign exchange options and historical data, Journal of Derivatives 7(3): 65 – 81.
Muggeo, V. M. R. (2003). Estimating regression models with unknown break-points, Statis-
tics in Medicine 22(19): 3055–3071.
Park, B., Mammen, E., Härdle, W. and Borak, S. (2009). Dynamic semiparametric factor
models, J. Am. Statist. Assoc. 104(485): 284–298.
Plerou, V., Gopikrishnan, P., Rosenow, B., Amaral, L. A. N., Guhr, T. and Stanley, H. E.
(2002). Random matrix approach to cross correlations in financial data, Phys. Rev. E
65: 066126.
URL: [Link]
Ramsay, J. and Silverman, B. W. (2010). Functional Data Analysis (Springer Series in
Statistics), 2nd edn, Springer Verlag, Heidelberg.
Skintzi, V. and Refenes, A. (2005). Implied correlation index: A new measure of diversifica-
tion, Journal of Futures Markets 25(2): 171–197.
Song, S., Härdle, W. K. and Ritov, Y. (2014). High dimensional nonstationary time series
modelling with generalized dynamic semiparametric factor model, Econometrics Journal
17: 1–32.
Sperlich, S., Linton, O. B. and Härdle, W. K. (1999). Integration and backfitting methods
in additive models-finite sample properties and comparison, Test 8(2): 419–458.
Yao, F., Müller, H.-G. and Wang, J.-L. (2005). Functional data analysis for sparse longitu-
dinal data, J. Am. Statist. Assoc. 100: 577–590.
24 Wolfgang Karl Härdle and Elena Silyakova
A Appendix
A.1 Realized versus model free implied volatility
τ = 0.25 τ = 0.5 τ =1
BASF 0.00000 0.00000 0.15298*
Commerzbank 0.00000 0.00005 0.66073*
Continental AG 0.99998* 0.99998* 0.99998*
Deutsche Bank 0.00001 0.00000 0.06985*
Deutsche Börse 0.95674* 0.95064* 0.96357*
Fresenius Medical Care 0.45640* 0.27716* 0.81540*
Henkel 0.90404* 0.99997* 0.99680*
K+S 0.00000 0.00000 0.99981*
Lanxess 0.27625* 0.05776* 0.99989*
Linde 0.76162* 0.87214* 1.00000*
RWE 0.40725* 0.21673* 0.05305*
ThyssenKrupp 0.01733 0.00272 0.69073*
Volkswagen 0.99998* 0.99998* 0.99998*
Table 4. The results of t-test for the equality of sample averages with H0 : RV = M F IV
against the alternative RV < M F IV of DAX constituents for which the H0 is not rejected
at 5% significance level at least for one τ . * denotes test results, which cannot reject H0 .
The test was performed for the volatilities of the DAX index and its 30 constituent stocks
computed over the time period 20100104 - 20121228 for 3 different maturities/estimation
windows: τ = 0.25, 0.5, 1. The test results are presented for the subsample of 13 DAX
constituents, for which H0 cannot be rejected at least for one τ .
Implied Basket Correlation Dynamics 25
τ = 0.25 τ = 0.5 τ =1
σ σ̃ σ − σ̃ σ σ̃ σ − σ̃ σ σ̃ σ − σ̃
Adidas 27.56 30.06 -2.50 28.35 31.07 -2.71 29.72 31.45 -1.73
Allianz 29.38 31.93 -2.55 30.44 33.03 -2.58 32.72 33.94 -1.22
BASF 28.95 31.08 -2.13 29.73 31.62 -1.89 31.32 31.58 -0.26
Bayer 27.39 30.74 -3.35 27.93 31.23 -3.31 28.85 31.17 -2.31
Beiersdorf 33.90 37.09 -3.20 34.53 37.96 -3.43 35.95 37.94 -1.99
BMW 19.57 23.95 -4.38 19.79 24.17 -4.38 20.26 23.90 -3.64
Commerzbank 46.47 52.77 -6.30 47.40 52.54 -5.14 51.02 51.70 -0.68
Continental AG 41.45 39.82 1.63* 43.84 40.55 3.29* 48.28 41.60 6.69*
Daimler 34.18 37.55 -3.36 34.93 38.45 -3.52 36.93 38.85 -1.93
Deutsche Bank 39.26 43.56 -4.30 39.76 43.93 -4.18 42.52 43.51 -1.00
Deutsche Börse 30.03 30.40 -0.37 31.09 31.10 0.00* 32.90 31.48 1.43*
Deutsche Post 31.61 33.63 -2.02 32.05 34.18 -2.13 33.13 34.67 -1.54
Deutsche
25.50 27.54 -2.04 26.20 28.26 -2.06 27.66 29.12 -1.46
Telekom
[Link] 23.62 26.26 -2.64 24.11 26.53 -2.41 24.67 27.48 -2.81
Fresenius Med-
28.12 29.21 -1.09 28.67 29.68 -1.01 30.19 30.04 0.16*
ical Care
Fresenius SE 19.01 22.44 -3.43 19.21 23.17 -3.96 19.89 23.62 -3.73
HeidelbergCement21.16 23.56 -2.40 21.61 23.69 -2.08 22.77 24.08 -1.31
Henkel 39.04 39.00 0.05* 40.96 39.49 1.46* 45.03 39.99 5.04*
Infineon 23.15 26.08 -2.94 23.57 26.70 -3.13 24.30 27.05 -2.75
K+S 40.78 43.75 -2.98 41.75 45.03 -3.27 45.88 45.08 0.80*
Lanxess 29.99 30.84 -0.85 30.86 31.78 -0.92 33.51 32.52 1.00*
Linde 39.90 39.92 -0.02 41.02 40.55 0.47* 42.83 41.07 1.76*
Lufthansa 22.18 25.18 -2.99 22.72 26.32 -3.61 23.47 26.77 -3.30
Merck 23.83 26.06 -2.23 24.43 26.40 -1.97 25.51 26.39 -0.89
Munich Re 23.35 26.54 -3.19 23.88 27.91 -4.03 25.28 29.10 -3.82
RWE 27.43 29.15 -1.72 27.93 29.71 -1.78 28.75 30.38 -1.63
SAP 21.45 23.67 -2.22 21.73 25.14 -3.42 22.00 26.82 -4.82
Siemens 25.94 28.53 -2.59 26.82 29.78 -2.95 28.63 30.35 -1.72
ThyssenKrupp 36.32 38.37 -2.04 36.78 38.71 -1.93 38.80 38.90 -0.10
Volkswagen 37.91 36.31 1.61* 39.48 36.90 2.59* 41.94 36.91 5.03*
DAX Index 21.72 25.38 -3.66 22.30 26.67 -4.37 23.40 27.81 -4.41
√ √ √ √
Table 5. Mean of the RV (σ) and M F IV (σ̃) and their difference RV − M F IV
(σ − σ̃), for DAX index and its 30 constituent stocks computed over the time period
20100104 - 20121228 for 3 different maturities/estimation windows: τ = 0.25, 0.5, 1).
σ − σ̃ ≥ 0 are marked with *.
26 Wolfgang Karl Härdle and Elena Silyakova
Fig. 7. DAX σB,t,τ (solid line) vs ρt,τ (dashed line), scatter plot σB,t,τ vs ρt,τ , for t, τ
from 20100104 till 20121228, estimated with (11) and (2) over 1 month (τ = 0.083), 3
months (τ = 0.25) and 6 months (τ = 0.5) window. Shaded area: Aug 2011 market fall.
Switch point for two regression line is defined as described in Section 4.
Implied Basket Correlation Dynamics 27
Fig. 8. DAX σ bt,B (1, τ ) (solid line) vs ρbt (1, τ ) (dashed line), scatter plot σ
bt,B (1, τ ) and
ρbt (1, τ ), for t, τ from 20100104 till 20121228, estimated from IVs with (17) for option with
1 month (τ = 0.083), 3 months (τ = 0.25) and 6 months (τ = 0.5) maturity. Shaded area:
Aug 2011 market fall.
28 Wolfgang Karl Härdle and Elena Silyakova
For both (24) and (25) kernel bandwidths hµ = (hµ,1 , hµ,2 )⊤ and hφ = (hφ,1 , hφ,2 )⊤
are to be selected. As suggested in Härdle et al. (2004) we use the penalizing
function approach to select optimal hopt
µ , minimizing mean integrated squared
error (MISE):
T Jt
( L
)2
1X 1 X X Wh∗ ,t,j (Xt,j )
Yt,j − Zbt,l m
b l (Xt,j ) wh∗ ,t (Xt,j )ΞAIC ,
T t=1 Jt j=1 l=1
T Jt
(30)
with the Akaike (1970) Information Criterion (AIC) as penalizing function
ΞAIC (q) = exp(2q) and Wh∗ ,t,j (Xt,j ) defined by
Kh (0)
Wh∗ ,t,j (Xt,j ) = P Jt , (31)
Jt−1 k=1 Kh Xt,k − Xt,j
for every Xt,j , 1 ≤ t ≤ T , 1 ≤ j ≤ Jt .
Implied Basket Correlation Dynamics 29
is defined by:
Jt
X
ph∗ ,t (Xt,j ) = Jt−1 Kh Xt,k − Xt,j , (32)
k=1
bt,1
Z Zbt,2 Zbt,3
20090803 - 20100730 (the 1st year) -2.991 (1) -6.982 (1) -5.710 (3)
20100802 - 20110729 (the 2nd year) -1.666* (3) -3.090 (2) -4.480 (1)
20110802 - 20120801 (the 3rd year) -3.511 (2) -3.796 (3) -3.480 (2)
20090803 - 20120801 (the entire sample) -4.025 (1) -6.912 (3) -8.979 (1)
Table 9. The estimated parameters for VAR(2) model for DAX ICS factors. * marks esti-
mates which are not significant at 5% level.