stock volatility regimes - carol alexander · 1 i introduction modelling regimes in market...
TRANSCRIPT
Modelling Regime Specific Stock Volatility Behaviour
Abstract
Any GARCH model with a single volatility state identifies only one mechanism governing the dynamic
response of volatility to market shocks, and the conditional standardized higher moments are constant,
unless modelled explicitly. Therefore such models cannot capture state dependent behaviour of
volatility and neither can they explain why the equity index skew persists into long dated options.
However the family of Markov switching (MS) GARCH models, which specify several deterministic
mean-reverting volatility states, have endogenous time-varying conditional variance, skewness and
kurtosis. Within this model class the normal mixture (NM) GARCH model, a MS GARCH with
constant state probabilities, is simplest to estimate. This paper extends previous theoretical and
empirical work on the NM GARCH model by introducing a state dependent leverage effect., which is
essential for modelling equity market volatility. With this construction we can explain the typical
empirical characteristics of equity index returns and implied volatility skews, even with a zero or
constant volatility risk premium. An empirical study of fifteen GARCH models of four European
equity indices between 1991 and 2005 identifies two-state asymmetric NM GARCH as the best fitting
model. The models show that the volatility behaviour is broadly similar in all indices during usual
market circumstances. But there is a small probability (between 2% and 5%) of the market crashing,
when the volatility behaviour of different indices becomes quite diverse, and the mean-reversion and
leverage effects are quite different from those in the usual regime.
JEL Classification Codes: C32, G13.
Keywords: GARCH, normal mixture, leverage effect, mean-reverting volatility, equity skew, stock market regimes, market crash
1
I Introduction
Modelling regimes in market volatility is of interest to most finance practitioners. For their assessment of
risk capital and for setting traders’ limits, risk managers need to understand how volatility may respond
differently to market shocks during a predominantly stable period, compared with a crash or recovery
period. When computing economic capital, and to enable traders' limits to be raised when markets are
considered to be exceptionally volatile, risk models are subjected to ‘stress tests’. These tests are also
required by most banking regulators but they are often performed using ad hoc adjustments. However, a
more precise approach to stress testing would specify an intensity for the shock to volatility and the
probability associated with the stressful scenario, and this requires a regime dependent volatility model.
Managing volatility following large price falls in an index is essential for governments and policy makers as
well, who are concerned about systemic risk, where insolvency in one sector of the economy may lead to
an economic crisis as in the global crash of 1987. Regulators need to understand how markets behave
under stressful conditions if their aim is to avert insolvency in a particular sector of the economy. Finally,
an understanding of best discrete time volatility model should help option prices to select from the
plethora of possible stochastic volatility models available, many of which will fit the market implied
volatility smile equally well.
A main characteristic of volatility is ‘mean-reversion’, whereby volatility persists after a market shock but
in the absence of any further shocks it will eventually return to a long-run volatility level. Mean-reversion
is captured in the standard approach to modelling volatility behaviour, i.e. the generalized autoregressive
conditional heteroscedasticity (GARCH) framework introduced by Engle (1982) and Bollerslev (1986).
However, normal GARCH models do not explain why both conditional and unconditional returns on
financial assets have such high skewness and kurtosis. For this reason Bollerslev (1987) extended the
GARCH model to include the Student’s t conditional distribution and Fernandez and Steel (1998)
enhanced it further using the skewed t-distribution.
Bekaert and Wu (2000), Wu (2001), Hansen, Lunde and Nason (2003) and many others argue that it is
important to also characterise the ‘leverage effect’ in stock markets, whereby volatility responds more to
negative shocks than to positive shocks of the same magnitude. Nelson (1991) introduced the first
GARCH process with a leverage effect. In this exponential model the returns are conditionally symmetric
(assuming volatility is known) but unconditional asymmetry is obtained from the dynamics of the variance
process. Amongst other GARCH processes with leverage effects are the absolute value GARCH based on
Taylor (1986, pp. 78-79) and Schwert (1989), the asymmetric and nonlinear asymmetric GARCH
introduced by Engle (1990) and Engle and Ng (1993)1, the GJR model of Glosten, Jagannathan and
1 As kindly noted by one of the referees, the asymmetric GARCH model was initially suggested by Engle (1990) and subsequently discussed in Engle and Ng (1993).
2
Runkle (1993), the asymmetric power GARCH model of Ding, Granger and Engle (1993), the threshold
GARCH of Zakoïan (1994), the quadratic GARCH of Sentana (1995), the Box-Cox transformed
GARCH of Hentschel (1995) and Hwang and Basawa (2004), which nests several different models, the
model of Brännäs and de Gooijer (2004) that extends QGARCH and, more recently, the model of Lanne
and Saikkonen (2007) that generalizes the AGARCH model.
All the above models assume that the GARCH volatility process has regime-invariant parameters and they
do not allow different types of shocks (of the same size) to precipitate different types of responses. Yet a
rumour may have an enormous effect but die out very quickly whereas an announcement of important
changes to economic policy (if represented by a shock of the same size as the rumour) may produce a less
volatile but more persistent response, after which the general level of volatility is raised. If the mean-
reversion and leverage effects are state dependent, the parameters estimated in a single state GARCH
process can represent only an average of these. Most single-state GARCH models also have constant
conditional skewness and conditional kurtosis. As a result, the option implied volatility smiles and skews
that are generated from these models are often unrealistic. Bates (1991), Christoffersen, Heston and
Jacobs (2006) and others argue that, for a model to capture the empirical characteristics of option implied
volatility skews, it is essential to account for time-variability in the physical conditional skewness and
kurtosis. So another reason for using more than one volatility state in a GARCH model is that both
conditional skewness and kurtosis can be endogenously time-varying.
More recent research examines Markov Switching (MS) GARCH and its more tractable relation: normal
mixture (NM) GARCH. Switching ARCH models were first introduced by Cai (1994) and Hamilton and
Susmel (1994). The straightforward generalization of this model to GARCH processes has estimation
problems due to the dependence of the conditional variance on the entire history of states. A solution was
suggested by Gray (1996) by replacing the lagged variance in the GARCH equation with its conditional
expectation; also the error term is allowed to follow a Student’s t distribution with switching degrees of
freedom parameter. Later on an enhancement was suggested by Klaassen (2002) making use of the full
information set available when computing the conditional probability of a state; see also Marcucci (2005)
for a model that allows for regime-specific means. However, these models are not very tractable.
One type of MS-GARCH was introduced by Francq, Roussignol and Zakoïan (2001); here there is only
one variance process but the parameters are state-dependent. The existence of the moments and the L2-
structures of this model were described in Francq and Zakoïan (2005), whilst the autocovariance structure
of the squared residuals were discussed in Francq and Zakoïan (2007); this paper also proposes a GMM
procedure for the estimation of the parameters. Henneke, Rachev and Fabozzi (2006) and Bauwens,
3
Preminger and Rombouts (2006, 2007) use Bayesian techniques to estimate this type of MS-GARCH
processes, whilst Dueker (1997) discusses an estimation method using a collapsing procedure to
approximate the likelihood function for a simplified version of MS-GARCH.
An improved MS-GARCH model was introduced and its properties presented by Haas, Mittnik and
Paolella (2004b) assuming that each state is characterized by its own variance and that the variance
process of each state depends only on its lagged values (as opposed to lagged values of the other variance
processes) and the squared residuals; for this model ML estimation is feasible. Weak stationarity properties
of the (1,1) model and conditions for the existence of the higher moments, relaxing the assumption of
initial finite variance, were discussed in Liu (2006); Abramson and Cohen (2007) also develop stationarity
properties for the models of Klaassen (2002) and Haas, Mittnik and Paolella (2004b) – for models of
order (p,q) – using a technique based on backward recursion.
Liu (2007) introduces a general model that allows for asymmetries in the variance equations; the focus of
this paper is the discussion of the stationarity properties and existence of higher moments of a
generalization of the model of Haas, Mittnik and Paolella (2004b) – a MS Box-Cox transformed threshold
GARCH(1,1), to be more precise.
All the above models are rather complex to estimate; a simpler model that ignores the autocorrelation in
the state variable (but, as opposed to some MS models, allows for different means in the states) is normal
mixture GARCH. Here the error term follows a mixture distribution. Early versions of NM-GARCH
models were discussed in Vlaar and Palm (1993), Bauwens, Bos and van Dijk (1999), Bai, Russell and
Tiao (2001, 2003) and Ding and Granger (1996). Concentrating on the individual means in the mixture
density, Wong and Li (2001) introduced a NM-ARCH model with individual AR processes in the mean
equations. This model was later extended to GARCH processes by Lanne and Saikkonen (2003); they also
assume that the mixing weights are functions of past observations. The general format of this model was
introduced by Haas, Mittnik and Paolella (2004a) and Alexander and Lazar (2006), providing strong
evidence that these models provide a better fit to exchange rate data than single-state GARCH models.
Bayesian estimation of the NM-GARCH was discussed in Bauwens and Rombouts (2007). Multivariate
extensions were considered by Haas, Mittnik and Paolella (2006), Fong, Li and An (2006) and Bauwens,
Hafner and Rombouts (2007) – the latter uses an estimation technique based on the EM algorithm.
Yet none of these models, with the exception of Henneke, Rachev and Fabozzi (2006) and Liu (2007),
have leverage effects in the state dependent GARCH volatility processes. The only source of skewness in
the physical returns densities is the different means in the normal components of the mixture conditional
4
density. Hence such models are not suitable when the leverage effect is known to be very important, as in
individual stock and stock index markets.
This paper examines the theoretical and empirical behaviour of state-dependent asymmetric volatility
processes in a normal mixture GARCH model. A main strength of the model is that it allows the
asymmetric volatility response to stock price shocks to be quantified in both usual and stressful market
circumstances. When two-state normal mixture models are applied to financial markets, often two states
can be differentiated: a ‘normal’ state that occurs most of the time and a ‘stressful’ state that occurs only
rarely. At every point of time the state is selected randomly, independently of the previous state. However,
as we will see later, following this random and unknown state-selection and after a return is realized from
the given distribution, it is possible to estimate the probability that a state was selected, and this will be the
ex-post inference about the probability of the state at time t, and it will be closely related to the actual state
of the world. Also, it allows the conditional higher moments to change in time. Implementing such a
model would allow risk capital assessments to be more refined and help risk managers to derive regime
specific limits for traders. It would also bring new insights, relevant to investors and policy makers, about
the likelihood of a market crash and the returns/volatility behaviour during a crash period. Two
asymmetric models are considered, namely the AGARCH and GJR extension of NM-GARCH, based on
the papers of Engle (1990), Engle and Ng (1993) and Glosten et al. (1993). One of the models, the NM-
GJR is a special case of the model of Liu (2007), with δ = 1 and with no autocorrelation for the state
variable; thus the theoretical results therein can be applied for the NM-GJR model in this paper. However,
this is not a restricted model of Henneke, Rachev and Fabozzi (2006), even though both consider GJR
variance processes.2 The other model discussed in this paper, NM-AGARCH, is not nested within the
family of models considered by Liu (2007).
Our empirical study on four major European equity market indices demonstrates the superiority of a two-
state asymmetric normal mixture GARCH model to capture the empirical characteristics of stock index
returns. We restrict our study to asymmetric NM-GARCH models as MS models don’t necessarily
perform better than mixture models3 and our focus is on the regime-specific asymmetric behaviour of the
variance process. We also compare the risk neutral skews generated by single component GARCH models
with the skews that are generated by the two-state asymmetric GARCH models. Even without a risk
premium the volatility smile implied by asymmetric normal mixture GARCH models exhibits a
pronounced skew that persists into long-dated options.
2 This is due to that NM-GARCH is a special case of the MS-GARCH of Haas, Mittnik and Paolella (2004b), which is a different formulation than the model of Henneke, Rachev and Fabozzi (2006). 3 A thorough theoretical and empirical comparison of MS and NM-GARCH models is done by Haas, Mittnik and Paolella (2004b) by evaluating the fit of these models for three currencies. Using the BIC criterion, NM models are preferred in 2 out of 3 cases. When the forecasting performance of these models is studied, then again in most cases the NM models proved to dominate. The only criterion according to which MS models are (marginally) preferred is the fit of the conditional higher moments.
5
The rest of the paper is organized as follows: Section II defines the general asymmetric normal mixture
GARCH model and describes two sources of asymmetry in these models; Section III characterizes the
equity index data for four major equity markets and presents the estimation methodology; Section IV
reports our empirical results. We start with our estimations for fifteen different GARCH models,
including two-state asymmetric and symmetric normal mixture GARCH models, and symmetric and
skewed t-GARCH with both symmetric and asymmetric variance processes. Then several model selection
criteria are applied to identify the best model. Choosing the two-state normal mixture AGARCH and GJR
models as best overall, the estimated parameters are used to infer various aspects of volatility behaviour in
European stock markets. Section V simulates prices of European calls and puts of various maturities and
compares the implied equity index skews of the FTSE index simulated by symmetric and asymmetric
GARCH models with one and two states. Section VI summarizes and concludes.
II The Asymmetric Normal Mixture GARCH Model
The model extends the GARCH processes studied in Haas, Mittnik and Paoella (2004a) and Alexander
and Lazar (2006) to include a leverage effect in the GARCH variance equations. Specifically, the model
has one equation for the mean and K variance equations. For simplicity the conditional mean equation is
written yt = εt, but this can easily be extended to allow for explanatory variables because their coefficients
can be estimated separately. The error term εt is assumed to have a conditional normal mixture density
with zero mean, which is a weighted average of K normal density functions with different means and
variances. We write:
( )2 21 1 1 1ε ~ ,..., ,μ , ...,μ ,σ , ...,σt t K K t KtI NM p p− ,
1 11, μ 0
K K
i i ii i
p p= =∑ ∑= = (1)
and the conditional density of the error term is:
( ) ( )1
η ε φ εK
t i it ti
p=∑= (2)
where φit represent normal density functions at time t with different means μi and different time-varying
variances 2σ it for i = 1,…, K. The mixing law ( )1 , ..., Kp p p ′= has elements that may be interpreted as the
relative frequency of each state occurring over a long period of time; for reasons of identifiability we
assume that 1 ≥ p1 ≥ p2 ≥… ≥ pK ≥ 0. There are K conditional variance components and these can follow
any GARCH process, but for the purpose of this paper we assume that there are three possibilities:
(i) NM-GARCH(1,1): 2 2 2
1 1σ ω α ε β σit i i t i it− −= + + for i = 1,…, K (3)
(ii) NM-AGARCH(1,1) (based on the Engle, 1990 and Engle and Ng, 1993 model):
( )22 21 1σ ω α ε λ β σit i i t i i it− −= + − + for i = 1,…, K (4)
6
(iii) NM-GJR(1,1) (based on the Glosten et al., 1993 model): 2 2 2 2
1 1 1 1σ ω α ε λ ε β σit i i t i t t i itd−− − − −= + + + for i = 1,…, K; (5)
where td− = 1 if εt < 0, and 0 otherwise.
For the normal densities to be well identified, we also assume that:4
|μi – μj| + |ωi – ωj|+ |αi – αj| + |βi – βj| + |λi – λj| > 0 for i ≠ j.
The main difference between NM-GARCH and the MS-GARCH model of Haas, Mittnik and Paolella
(2004b) is that the latter allows for autocorrelation in the state variable; in other words, for the NM-
GARCH model the transition matrix is of rank 1, so it can be expressed as
( ) ( )1Δ Δ ' where 1, ...,1 'ij t tP p P i j p−⎡ ⎤= = = = = =⎣ ⎦ 1 1
Also, the NM-GARCH adds asymmetry by allowing for non-zero means for the different components.
The overall conditional variance can be expressed as
2 2 2
1 1σ σ μ
K K
t i it i ii i
p p= =∑ ∑= + (6)
and, given the data, it is possible to compute an ex-post estimate of the probability of the ith state at an
arbitrary time t, expressed by:
( )( )
,
1
φ ε
φ εi i t
i t K
j j tj
pp
p=∑
= (7)
This is a time-varying probability, giving a realistic ex-post inference about the probability of state i
occurring at time t. Thus at any point in time we have an ex-ante probability of the states, given by the
mixing law, and an ex-post estimate of the state-probabilities, given by the above formula.
The weak stationarity conditions for the two models are stated in the Appendix. For K > 1, the existence
of second, third and fourth moments are assured by imposing less stringent conditions than in the single
component (K = 1) models. Namely, it is not necessary to assume that each component is stationary in
the conventional sense. For instance, it may happen that the more volatile component would be
considered non-stationary according to standard interpretation (i.e. αi + βi > 1 for that component).
However, the model has to be considered as a whole, and interpreting its components in isolation would
be misleading. 5
4 Also note that we do not require that ωi > 0; actually it often happens that the ω parameter of one of the components is slightly negative. However, as long as it is small in absolute value, it does not affect the positivity of the variance series (also, see Alexander and Lazar, 2006) 5 When the models are restricted to NM-GARCH(1,1), then the stationarity conditions are equivalent to those given by Alexander and Lazar (2006). Also, they are equivalent to those given by Haas, Mittnik and Paolella (2004a) and Liu (2006) when the ωi > 0
7
There are two distinct sources of asymmetry in the model:
• Persistent Asymmetry: This arises in all three models when the conditional returns density is a
mixture of normal density components having different means; it is generated by the difference
between the expected returns under different market circumstances. The Appendix shows that
even the unconditional density will have non-zero skewness, and that this increases with the
differentiation of the component means. For instance, when K = 2, there is positive skewness in
the overall conditional returns density if the component with the higher probability has low
volatility and a negative mean (as it generally happens), and negative skewness in the overall
conditional returns density if the component with the higher probability has low volatility and a
positive mean.6
• Dynamic Asymmetry: This only occurs in models (ii) and (iii) and is due to the leverage
parameters in the component variance processes. If the leverage parameter λi is positive, the
conditional variance in this component is higher following a negative unexpected return than
following a positive unexpected return. In equity markets, where a negative unexpected return
increases the leverage of a firm we expect positive values of the leverage parameter. However,
some studies (e.g. Koutmos et al., 1998 and Chortareas et al., 2000) report negative leverage
coefficients in stock indices of developing economies.7
Taken together, these two sources of skewness in the physical conditional returns density offer a much
richer structure for capturing the shape of equity index skews than is given by traditional GARCH models.
The unconditional skewness and excess kurtosis are both non-zero and the conditional higher moments
are also non-zero and time-varying. See the Appendix for the derivation of the unconditional higher
moments.
The conditional normal mixture densities can be interpreted as merely a device to increase the flexibility of
the returns density, because the model will capture the behaviour of returns during different regimes and
identify the long-run probability of each state, based on time series data. The model is considerably easier
to estimate than the class of Markov switching GARCH models introduced by Hamilton and Susmel
(1994) even with the restrictions and improvements introduced by Cai (1994), Gray (1996), Klaassen
condition is imposed (in this case the stationarity condition given in the Appendix reduces to n > 0 only). Since the GJR-GARCH is a special case of the model of Liu (2007), its stationarity conditions can be deducted from Liu’s results. When we also assume that ωi > 0, then our stationarity condition reduces to n > 0 (see Appendix), which is equivalent to the weak stationarity results of Liu (2007) restricted to NM-GJR models 6 As suggested by one of the referees, this can be seen by expressing the third moment as
( )( ) ( )( )3 2 23 1 2 2 1 1 2 1 2 1 2μ μ 3 μ μ σ σm p p p p⎡ ⎤= − − + − −⎣ ⎦ .
7 Note that the ith variance component depends on the dispersion of the unexpected return, not around its mean μi in the individual density, but around the overall mean 0. Hence there is a third effect that induces skewness in each component conditional return density but not in the overall conditional return density.
8
(2002) and Haas, Mittnik and Paoella (2004b). The difficulty with estimating most of these models (except
the last one) lies in the co-dependencies of the state variances. However, the normal mixture GARCH
models considered here have a very straightforward relationship between the individual variances, because
they are tied to each other only through their dependence on the error term. Hence an advantage of
normal mixture GARCH is that quite complex volatility feedback mechanisms can be included and the
model remains easy to estimate.
III Data and Parameter Estimation
Our results are based on the daily closing prices of four major European equity market indices: CAC40
(total number of observations is 3733), DAX30 (3730), FTSE100 (3739) and DJ Eurostoxx 50 (3810)
from 1 January 1991 to 21 October 2005. This period encompasses a number of stock market crises,
including the Asian crisis in 1997, the Russian crisis in 1998, the technology market crash in 2000 and the
terrorist attack on the US in 2001. Table 1 summarises the general characteristics of the daily returns. The
skewness is negative and the excess kurtosis is positive and in most indices these are highly significant. At
first, it can be noticed that during this period the FTSE index was less volatile in general than the other
markets.
Table 1: Summary Statistics of the Stock Market Indices8
CAC DAX FTSE Eurostoxx
Volatility 21.24% 22.82% 16.41% 20.13%
Skewness –0.099* –0.271*** –0.113** –0.166***
Excess kurtosis 2.86*** 4.03*** 3.19*** 4.25***
Maximum moment exponent 5.58** 4.59*** 6.70* 5.66***
For each index, we estimate the conditional variance parameters on the residuals εt from constant
conditional mean equations.9 Then, maximizing the likelihood function, or equivalently, maximizing
( ) ( )1
|ε ln η εT
tt
L=∑= ⎡ ⎤⎣ ⎦θ
the optimal parameter values are found, given the data. The updating formula has the following form,
where g is the gradient vector, H the Hessian matrix and s represents the step-length:
( ) ( )11m m m ms
−
+ = − ⎡ ⎤⎣ ⎦θ θ H θ g θ
8 The standard errors (s.e.) of the sample estimates of the mean, variance, skewness and excess kurtosis parameters are as follows: s.e. of the sample mean = σ T , s.e. of the sample variance = 22 σ T , s.e. of the sample skewness ≈ T/6 , s.e. of the sample
excess kurtosis ≈ T/24 , where T represents the total number of observations. In Table 1 *** represent results significantly different from zero at the 0.1% level, ** at 1% and * at 5%. The last row reports the HKKP estimate for the maximum moment exponent; here ***, ** and * denote results that are significantly higher than 4 at 0.1%, 1% and 5% significance level, respectively. 9 We fitted ARMA models to the series; the best fitting model for all series was an ARMA(0,0) model. Thus we only removed the means of the series and continued with the demeaned series.
9
To compute the Hessian matrix and the gradient vector we can use either analytic or numerical first and
second order derivatives of the likelihood – see the Appendix for the numerical derivatives.10
IV Empirical Results
IV.1 Overview of Models
We fitted three symmetric and twelve asymmetric GARCH models to the equity index data. The first nine
models have a single variance component and the last six models have two variance components.11 The
models are:12
A. Models with normally distributed errors:
(1) GARCH
(2) AGARCH
(3) GJR
B. Models with symmetric Student’s t distributed errors:
(4) GARCH
(5) AGARCH
(6) GJR
C. Models with skewed Student’s t distributed errors:
(7) GARCH
(8) AGARCH
(9) GJR
D. Normal mixture GARCH models with zero means in the mixture component densities:
(10) NM-GARCH
(11) NM-AGARCH
(12) NM-GJR
E. General normal mixture GARCH models:
(13) NM-GARCH
(14) NM-AGARCH
(15) NM-GJR 10 The results were generated using C++ and Ox version 3.30 (Doornik, 2002) and the G@rch package version 3.0 (Laurent, S. and Peters, J.-P., 2002) 11 Several restricted versions of the normal mixture GARCH model were also fitted to the data (assuming a constant variance component or assuming constant difference between the two variance processes) – but these performed quite badly according to some of the selection criteria and thus these models are not discussed here. 12 All models are GARCH(1,1) specifications. To simplify notation, this will be implied in the following. Models (2), (3), (5), (6), (7), (11), (12) and (13) have one source of asymmetry, whilst models (8), (9), (14) and (15) have two sources. Also, all normal mixture models will be based on two normal distributions in the mixture, as important conclusions can be drawn from such a setup – see also Marcucci (2005) and Bauwens, Preminger and Rombouts (2007). We do not consider models with more than two components as those would increase the number of parameters from 10 to at least 16 and, as the results of Haas, Mittnik and Paolella (2004a,b) and Alexander and Lazar (2006) clearly indicate, in the case of three-component mixture models convergence is harder to achieve and also these models prove to be over-parameterized when it comes to out-of-sample forecasting.
10
The estimation results are reported in Tables 2 – 5.13 The upper figure in each cell reports the parameter
estimate and the lower figure is the t-ratio. Note that in these tables the first row reports the degrees of
freedom for the t-GARCH models (4) – (9) and the weight of the first component in the NM-GARCH
models for models (10) – (15). Similarly, the third row reports the skewness parameter for models (7) –
(9) and the mean of the first normal density for the normal mixture GARCH models.
IV.2 Ex-post State Probabilities
Figure 1 presents the daily Eurostoxx index returns for the period Jan 2004 – October 2005. It is expected
that the ex-post state probabilities will indicate a jump from the first state (characterized by a lower
variance) to the second state (characterized by a larger variance) when the absolute returns increase in
magnitude – or, from the second state to the first one, when absolute returns decrease. Figure 2 shows
that this is indeed the case – the graph presents the estimated conditional volatilities of the two
components plotted against the time-varying ex-post probability of the first state. Note that when the
returns increase in absolute value after a relatively tranquil period, the ex-post probability of the first state
decreases, suggesting a switch from the first, less volatile state to the second state. In such a situation both
conditional volatilities show an upward jump. The volatilities exhibit a downward jump when the switch is
from the second to the first state.
IV.3 Model Selection:
The following selection criteria are used:
(a) Bayesian Information Criterion: The model with the lowest BIC is chosen.
(b) Moment specification tests: Following Newey (1985), we test for normality in the standardized
residuals, checking the first four moments and for zero autocorrelations in their first four powers,
using a Wald test. Although this test has the disadvantage that it tends to reject the null too often
in small samples (Godfrey, 1988), this test was used for GARCH-type processes by Nelson
(1991), Harvey and Siddique (1999) and Brooks, Burke and Persand (2005) and a multivariate test
was developed by Ding and Engle (2001)14. We proceed with the transformation:
( )( )1
1Φ Φ ε
K
t i i ti
u p−
=∑=
where Φ is the cumulative normal density function; if the model is the true DGP, then this
transformation should lead to i.i.d. standard normal series.15 We test a total of 20 conditions (the
13 Note that the results in Tables 2 – 5 are for variance-annualized unexpected returns. That is daily returns are pre-multiplied by √250 before estimation. Thus, volatilities are quoted in annualized terms. 14 Interestingly, the author have shown using Monte Carlo simulations that the small sample properties of this test’s statistics are good. 15 A similar transformation was used by Haas, Mittnik and Paolella (2004a). This transformation is different from the one of Lanne and Saikkonen (2003), where the resulting series have unit variance and preserve autocorrelation, but are not i.i.d. normal.
11
first four moments of ut and the first four autocorrelations in its first four powers) and the test
statistics for the moment tests have a χ2(1) distribution. The tables report the number of tests
(out of 20) that are rejected at 1% significance level.
(c) Unconditional density fit: The density test is on the histogram fit between the model simulated
data and the original data. This is one of the most difficult tests for models to pass as it tests for
the unconditional distributional fit. The model returns are simulated16 and their histogram is
estimated using a nonparametric kernel approach. Several alternatives are available for the kernel,
our chosen function being that of Epanechnikov (1969). Then the model selection criterion is
based on the modified Kolmogorov-Smirnov (KS) statistic (Kolmogoroff, 1933, Smirnov, 1939,
Massey, 1951 and Khamis, 2000).
(d) ACF analysis: In contrast to (c) this test captures the dynamic properties of the model squared
returns – namely, the fit to the empirical autocorrelations of the squared returns. The Appendix
states the theoretical autocorrelation functions (ACF) of the different models and we apply the
Mean Squared Error (MSE) criterion to assess the fit of the models.
(e) Value at Risk (VaR) analysis: The VaR tests developed by Christoffersen (1998) are
performed. Using the estimated parameters, the in-sample VaR estimates are computed. Based on
these values, the occurrences of exceeding the VaR are counted. This is achieved by building
several indicator functions. The first one equals zero except when a return lower than minus the
VaR occurs, when it equals one:
| ;α;α
1,0,
t t t jt
r VaRI
otherwise−< −⎧
= ⎨⎩
, where α is the level of significance
Then there are a set of four indicator functions:
1;α ;α, ;α
1,0,
t tij t
I i and I jJ
otherwise− = =⎧
= ⎨⎩
, for i, j = 0, 1
The three test statistics, for unconditional coverage, independence and conditional coverage are as
follows:
( )0 ;α 1;α
2;α
α α
1 α α2 log ~ χ 11 π π
n n
ucLR⎛ ⎞⎛ ⎞ ⎛ ⎞−⎜ ⎟= − ⎜ ⎟ ⎜ ⎟⎜ ⎟−⎝ ⎠ ⎝ ⎠⎝ ⎠
( )00 ;α 01;α 10 ;α 11;α
2;α 2;α 2;α 2;α 2;α
01;α 01;α 11;α 11;α
1 π π 1 π π2 log ~ χ 1
1 π π 1 π π
n n n n
indLR⎛ ⎞⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞− −⎜ ⎟= − ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟⎜ ⎟− −⎝ ⎠ ⎝ ⎠ ⎝ ⎠ ⎝ ⎠⎝ ⎠
( )2;α ;α ;α ~ χ 2cc uc indLR LR LR= +
16 We simulate returns, based on the estimated parameters, and to ensure that the simulated density is not affected by small sample size we use 50,000 replications. Also, to avoid any influence of the starting values, each simulation has 1000 steps ahead in time but we only use the last simulated return.
12
where 1;α ;α1
T
tt
n I=∑= ; 0;α 0;αn T n= − ; α 1;απ /n T= ; ;α , ;α
1
T
ij ij tt
n J=∑= ;
( )01;α 01;α 00;α 01;απ /n n n= + ; ( )11;α 11;α 10;α 11;απ /n n n= + ; ( )2;α 01;α 11;απ /n n T= +
For simplicity, only the first two test statistics are reported and analyzed.
The results of these specification tests are shown in the last four rows of Tables 2 – 5 and in Table 6;
these can be interpreted as follows:
(a) Bayesian Information Criterion: Most series favour the skewed t-AGARCH and skewed t-GJR
models, except the FTSE for which the symmetric t-GJR model is preferable. According to the
BIC, normal mixture models are outperformed by t-GARCH models. Asymmetric NM-GARCH
models fare better than symmetric models, and also models with non-zero means have a lower
BIC than normal mixture models that assume zero means for the normal components. However,
this is a very simple criterion and it ignores the models’ ability to capture the characteristics of the
data.
(b) Moment specification tests: These tests show that the most basic models, i.e. (1) – (3) do not
capture the higher moments. But beyond this observation, the moment tests do not distinguish
well between the models. We find that most models have several rejections for these tests. None
of the models performs consistently well based on these moment tests.
(c) Unconditional density fit: This shows a clear preference for the NM-GARCH models (13),
(14) and (15) (i.e. with different component means). We conclude that it is necessary to model
both sources of asymmetry. It is important to notice that the t-GARCH models perform quite
badly according to this criterion. Often they have very unrealistic (negative) unconditional
kurtosis estimates, suggesting that for these models the fourth moment does not exist. To test for
the existence of the fourth moment, we performed the HKKP procedure derived by Huisman et
al. (2001, 2002) which estimates the maximum moment exponent based on the method
introduced by Hill (1975). The estimates for the maximum exponent are found in Table 1.17 As
the results illustrate, we can reject the hypothesis that the 4th moment does not exist for all series.
(d) ACF analysis: This test also favours the asymmetric NM-GARCH models. Again we see that
models based on the t distribution perform badly in many situations and quite often the ACF
estimates of these models are negative. This is probably because these models do not capture the
fourth moment. By contrast, all the normal mixture models perform very well according to this
criterion.
17 We have used OLS for the estimation of the maximum exponent instead of WLS because the WLS procedure suggested by Huisman et al. (2001, 2002) actually increased the level of heteroscedasticity of the residuals. To correct for heteroscedasticity we used White’s heteroscedasticity consistent covariance matrix. We used the 500 most extreme observations in the HKKP regression.
13
(e) VaR analysis: The independence and conditional coverage test results in Table 6 show that
normal GARCH models are in general very unsuitable for VaR estimations. Additionally,
restricted NM-GARCH models fail some of the VaR tests, again proving that they are not
suitable for VaR calculations. Symmetric and skewed t-GARCH models fail only few of the tests
but the NM-AGARCH and NM-GJR models achieve the best results, the latter passing both tests
for all indices.
In summary, the normal GARCH model is the worst fit by all criteria and the Student’s t-GARCH models
fit well according to the BIC criteria but often don’t capture the fourth moment of the returns,
performing badly according to the density fit and ACF criteria. The normal mixture models with different
component means fit better than those with identical means. We conclude that the in-sample fit of a two-
state normal mixture GARCH model with two sources of asymmetry, both persistent and dynamic, is
superior to the other models considered. However, for the dynamic asymmetry (i.e. the leverage effect) it
is not possible to decide which asymmetric specification for the two volatility components is preferred –
this depends on the specific index considered.
IV.4 Regime Specific Volatility Behaviour
Each of the two-state normal mixture AGARCH models reveals a lower volatility component that occurs
with a high probability (the ‘usual’ component) and a high volatility component with a very low
probability (the ‘crash’ component). Table 7 summarizes the characteristics of these volatility components
in each of the four European stock markets during the sample period considered. The results in this table
are based on the NM-AGARCH model (14). To obtain the annualized mean returns we multiply the
mean estimates by √250.
In each market the ‘usual’ component is characterized by a high associated probability, a positive mean
return and a low volatility. The ‘usual’ mean return is lowest in the FTSE, at only 4.6% per annum but the
unconditional volatility is also low: at 15% it is the lowest of all. Notably the Eurostoxx index has the
highest return (8.5%) and the second lowest volatility of the four markets (17%). On the other hand, the
CAC and DAX indices have higher volatility, around 21%, but their mean return is less than that of
Eurostoxx during their usual state. However, they have been in their ‘usual’ regime more often than the
FTSE and Eurostoxx: for the latter a ‘usual’ regime occurred only about 95% of the time between 1991
and 2005. In the ‘usual’ regime the least reactive and most persistent volatility is that of the FTSE series
but the Eurostoxx and DAX display slightly more reaction and less persistence than the other indices. The
leverage effect is most pronounced in the CAC index.
The ‘crash’ market regime occurred between about 2% and 5% of the time, depending on the index. The
CAC and the DAX have the highest unconditional volatility (over 40%) and the lowest return during a
14
crash (it is over –350% in annual terms for the CAC). This is probably because the CAC and DAX indices
are less liquid than the FTSE and Eurostoxx. By comparison, during the 1987 crisis the FTSE lost over
400% in annual terms. Yet in our sample the ‘crash’ mean returns are around –100% for the FTSE and –
160% for the Eurostoxx. All indices, and the FTSE and CAC in particular, are highly reactive to market
shocks in the ‘crash’ regime, yet the effect dies out soon because the persistence parameters are all low,
especially on the FTSE. The leverage effects are similar to the ‘usual’ leverage effects for the DAX and
Eurostoxx, but the highly reactive FTSE and CAC indices have an inverse leverage effect in a crash
regime. This indicates that a positive return will lead to higher volatility than a negative return, but only
during the crash regime. A possible explanation of this is that investors anticipate further falls after a
modest recovery.
Table 7: Summary of Regime Specific Behaviour in Volatility
CAC DAX FTSE Eurostoxx
‘Usual’ Component
Probability 0.978 0.970 0.958 0.949 Annualized Mean Return 7.9% 6.5% 4.6% 8.5% Unconditional Volatility 21% 21% 15% 17% Volatility Reaction (α1) 0.0524 0.064 0.0482 0.0642
Volatility Persistence (β1) 0.9311 0.922 0.9362 0.9206 Leverage Effect (λ1) 0.1058 0.0639 0.0899 0.0443
‘Crash’ Component
Probability 0.022 0.030 0.042 0.051 Annualized Mean Return -353% -210% -106% -161% Unconditional Volatility 44% 43% 27% 32% Volatility Reaction (α2) 1.5497 0.3729 0.9401 0.2288
Volatility Persistence (β2) 0.6172 0.7155 0.5821 0.8332 Leverage Effect (λ2) -0.0311 0.0470 -0.0192 0.0641
V Equity Index Implied Volatility Skews
Out-of-the-money put options on an equity index are an attractive form of insurance for investors that
fear a general market decline; and with index option market makers in relatively short supply, the market
prices of these options are often far higher than the Black-Scholes (1973) model prices based on the at-
the-money volatility. Consequently the implied volatility of these options is commonly found to be higher
than the implied volatility of at-the-money call and put options and out-of-the-money calls. This leads to
the skew (or ‘smirk’) in equity index implied volatility that has been very pronounced since the global
stock market crash in 1987, as shown by Bates (1991), Rubinstein (1994), Jackwerth and Rubinstein
(1996), Derman and Kamal (1997), Tompkins (2001) and many others. This equity index implied
volatility skew is associated with a negatively skewed implied risk neutral returns density.
Since the global crash of 1987 the skewness in risk neutral index densities has, in general, been much
greater than the skewness estimated from historical data on stock index returns. The difference between
the physical and risk neutral skews has been the subject of extensive academic research, as in Bakshi et. al
(2003), Bates (1997, 2000) and many others. These papers argue that a volatility risk aversion adjustment
15
is necessary to reconcile the physical and risk neutral distributions of equity indices. The volatility risk
aversion can be recovered using empirical pricing kernels based either on unconditional historical returns,
as in Ait-Sahalia and Lo (2000) and Jackwerth (2000) or on conditional returns densities, as in Rosenberg
and Engle (2002). However, Bates (2003) argued that the difference between the risk-neutral and real-
world distributions cannot be explained, except perhaps by the existence of a time-varying volatility risk
premium.
One conclusion of this research was that the index skew is too pronounced and too persistent to verify
the standard approach and that more sophisticated time series models of the conditional densities of index
returns were required. The asymmetric normal mixture GARCH process considered here provides just
such a model: since time-varying conditional skewness and kurtosis are endogenous the model volatility
skew exhibits a term structure even in the physical measure. With single state GARCH models this is
impossible. The uncertainty over two possible volatility states, each of which exhibits volatility clustering
but with quite different characteristics, provides a strong justification for the inclusion of a time-varying
volatility risk premium.
A natural question to ask, therefore, is how these properties are reflected in the equity skews implied by
these models. In this section we compare the implied volatility skews in the physical measure generated by
asymmetric normal mixture GARCH models with those implied by other GARCH models. As we shall
see, leverage effects are clearly very important. In all markets we find very pronounced normal mixture
GARCH skews, even in the absence of a risk premium, and the skew persistence that is captured by the
difference in the means of the variance components is only of secondary importance.
For illustration we use the parameter estimates of the FTSE index returns given in Table 5 for a
representative selection of five of the fifteen models: normal GARCH, asymmetric t-GARCH, asymmetric
t-GJR, NM-GARCH and NM-GJR models. We simulate volatility skew surfaces using each of these five
models and compare their characteristics. Starting with S0 = 100 and using r = 0.03, we simulate the
dynamics of the index value as:
( )( )2Δ exp σ / 2 Δ ε Δt t t t tS S r t t−= − +
Then, for a fixed strike K and maturity T the time zero price of a European call option is computed as
( ) ( )( )exp max 0, TrT E S K− − . Simulating 50,000 times and computing the average call value gives the
estimate of the option price. Then, applying the inverse Black-Scholes formula gives the simulated implied
volatility at (K, T). We take a range of strikes between 80 and 130 and a range of maturities from 3 to 18
months. The results are shown in Figure 3.
16
We have used the same vertical scale from 10% – 25% volatility for each smile, as this makes the
comparison easier. The first two skews, from the normal and skewed t-GARCH models are unrealistic.
The volatility level is too low and there is no evidence of a negative skew even for the skewed t-GARCH
model. The GJR skew in figure (c) is more realistic, with substantially higher volatility for ITM calls than
OTM calls. Still, there is little evidence of a volatility term structure as, for a given moneyness, volatility is
almost constant as maturity increases. The NM-GARCH without leverage effect has more of a term
structure and the short-term skew is not exactly linear with respect the moneyness, a feature that is quite
common in index markets. Finally, the NM-GJR model produces the most realistic skew: it is more
pronounced and less linear than in the single component GJR model, and there is more variation of
volatility over time. We conclude that including non-zero means in the components of the mixture
provides a skew that persists into longer dated options but it is the leverage effect that primarily
determines the slope of the skew for short dated options.
VII Summary and Conclusions
The majority of generalised autoregressive conditional heteroscedasticity (GARCH) models specify a
single time-varying volatility state and thus offer only one scenario for market behaviour. Also their
standardized higher moment specification is not realistic: time-variation in conditional skewness and
kurtosis is ignored except in a few instances where it is specified exogenously to the GARCH model. This
paper considers the normal mixture GARCH model with two volatility states and endogenous time-
varying conditional higher moments and introduces additional skewness to model the asymmetry due to
leverage effects in equity markets.
We first ask whether this additional source of asymmetry is necessary, given that normal mixture
GARCH(1,1) models with different mean returns already exhibit time-varying conditional skewness and
kurtosis, in contrast to single-state GARCH models. The answer to this question is undoubtedly yes. Both
the statistical criteria and the simulations of the index skew justify the addition of both unconditional and
dynamic types of asymmetry. The different component means give a non-zero unconditional skewness,
but the addition of dynamic asymmetry is very highly significant and dramatically improves the time series
fit of the normal mixture GARCH models.
Empirical results on four European indices compare the fit of these models to single-state normal and
symmetric and skewed t-GARCH specifications. The overall conclusion from applying five statistical
criteria to fifteen fitted models is a clear superiority of asymmetric normal mixture GARCH models.
These models are then used to provide considerable insight to the regime specific behaviour of the
European equity indices. Between 1991 and 2005 the relative frequency of a ‘crash’ regime varied from
about 2% for the CAC to about 5% for the Eurostoxx50. In the ‘usual’ regime the indices have broadly
similar behaviour (except that leverage effects are strongest in the CAC and weakest in the Eurostoxx) but
17
the crash regime behaviour is more diverse. The CAC index displays the most extreme crash behaviour,
returning –350% in annual terms with an unconditional volatility of 44%; by contrast the FTSE returned
only –100% and the unconditional volatility is far lower. All indices are highly sensitive to market shocks
in the crash regime, particularly the CAC, but persistence in volatility is very low indicating that volatility
quickly returns to more normal levels. The leverage effect in the CAC and the FTSE is positive (but small)
during a crash period, indicating that investors can anticipate further falls after only a modest recovery.
Although the Eurostoxx has the highest crash probability, in the ‘usual’ regime it has the highest mean
return (8.5%) and a low unconditional volatility (17%), although this is not as low as the FTSE volatility.
Since the crash behaviour of the Eurostoxx is also the least extreme, apart from the FTSE, it may be the
best investment alternative of the indices considered.
If agents’ beliefs about the future can be informed by past behaviour, the rich structure that is revealed
from an estimation of a two-state asymmetric normal mixture GARCH model could provide invaluable
information not only to investors, but also to risk managers and policy makers. Stress tests are commonly
applied using ad hoc rules, such as changing volatilities to 100% without associating any probability to this
event. The NM-GARCH model that we have implemented in this paper has the potential to provide
detailed, objective stress tests of equity risk factor volatility and this is clearly an interesting subject for
further research. Finally we have demonstrated that single-state GARCH models imply unrealistic shapes
for the implied index skew surface. However, asymmetric normal mixture GARCH models, even without
a volatility risk premium, provide a rich behavioural structure that can match the empirical characteristics
of implied volatility skew surfaces. We conclude that option pricing based on two-state asymmetric
GARCH models is another area suitable for further research.
18
References
Abramson, A. and I. Cohen (2007): ‘On the Stationarity of Markov Switching GARCH Processes’, Econometric Theory 23, 485-500
Ait-Sahalia, Y. and A.W. Lo (2000): ‘Nonparametric Risk Management and Implied Risk Aversion’, Journal of Econometrics 94, 9-51.
Alexander, C. and E. Lazar (2006): ‘Normal Mixture GARCH(1,1): Applications to Exchange Rate Modelling’, Journal of Applied Econometrics 21, 307-336
Bai, X., J.R. Russell, and G.C. Tiao (2001): ‘Beyond Merton’s Utopia (I): Effects of Non-normality and Dependence on the Precision of Variance Using High-frequency Financial Data’, University of Chicago GSB Working Paper July 2001
Bai, X., J.R. Russell, and G.C. Tiao (2003): ‘Kurtosis of GARCH and Stochastic Volatility Models with Non-normal Innovations’ Journal of Econometrics 114(2), 349-360
Bakshi, G., N. Kapadia and D. Madan (2003): 'Stock Return Characteristics, Skew Laws, and the Differential Pricing for Individual Equity Options’, The Review of Financial Studies 16, 101-143
Bates, D.S. (1991): ‘The Crash of '87: Was It Expected? The Evidence from Options Markets’, Journal of Finance 46, 1009-1044
Bates, D.S. (1997): ‘The Skewness Premium: Option Pricing under Asymmetric Processes’, Advances in Futures and Options Research 9, 51-82
Bates, D.S. (2000): ‘Post-'87 Crash Fears in the S&P 500 Futures Option Market’, Journal of Econometrics 94, 181-238
Bates, D.S. (2003): ‘Empricial Option Pricing: A Retrospection’, The Journal of Econometrics 116, 387-404
Bauwens, L., C.S. Bos, and H.K. van Dijk (1999): ‘Adaptive Polar Sampling with an Application to a Bayes measure of Value-at-Risk’, Tinbergen Institute Discussion Paper TI 99-082/4
Bauwens, L., C.Hafner and J. Rombouts (2007): ‘Multivariate Mixed Normal Conditional Heteroskedasticity’, Computational Statistics and Data Analysis 51, 3551-3566
Bauwens, L., A. Preminger and J. Rombouts (2006): ‘Regime Switching GARCH Models’, CORE Discussion Paper 2006/11
Bauwens, L., A. Preminger and J. Rombouts (2007): ‘Theory and Inference for a Markov-Switching GARCH Model’, CIRPÉE Working Paper 07-33
Bauwens, L. and J.V.K. Rombouts (2007): ‘Bayesian Inference for the Mixed Conditional Heteroskedasticity Model’, The Econometrics Journal 10, 408-425
Bekaert, G. and G. Wu (2000): ‘Asymmetric Volatility and Risk in Equity Markets’, The Review of Financial Studies, 13(1), 1-42
Black, F. and M. Scholes (1973): ‘The Pricing of Options and Corporate Liabilities’, Journal of Political Economy 81, 637-659
Bollerslev, T. (1986): ‘Generalized Autoregressive Conditional Heteroskedasticity’, Journal of Econometrics 31, 309-328
Bollerslev, T. (1987): ‘A Conditionally Heteroskedastic Time Series Model for Speculative Prices and Rates of Return’, Review of Economics and Statistics 69, 542-547
Brännäs, K. and J.G. de Gooijer (2004): ‘Asymmetries in Conditional Mean and Variance: Modelling Stock Returns by asMA-asQGARCH’, Journal of Forecasting 23, 155-171
19
Brooks, C., S.P. Burke and G. Persand (2005): ‘Autoregressive Conditional Kurtosis’, Journal of Financial Econometrics 3(3), 1-23
Cai, J. (1994): ‘A Markov Model of Switching-Regime ARCH’, Journal of Business & Economic Statistics 12, 309-316
Christoffersen, P. (1998): ‘Evaluating Interval Forecasts’, International Economic Review 39, 841-862
Christoffersen, P., S. Heston and K. Jacobs (2006): ‘Option Valuation with Conditional Skewness’, Journal of Econometrics 131, 253-284
Chortareas, G.E., J.B. McDermott and T.E. Ritsatos (2000): ‘Stock Market Volatility in an Emerging Market: Further Evidence from the Athens Stock Exchange, Journal of Business Finance and Accounting 27(7), 983-1002
Derman, E. and M. Kamal (1997): ‘The Patterns of Change in Implied Index Volatilities’, Quantitative Strategies Research Notes, Goldman Sachs
Ding, Z. and R.F. Engle (2001): ‘Large Scale Conditional Covariance Matrix Modelling, Estimation and Testing’, Academia Economic Papers 29(2), 157-184
Ding, Z. and C.W.J. Granger (1996): ‘Modeling Volatility Persistence of Speculative Returns: A New Approach’, Journal of Econometrics 73, 185-215
Ding, Z., C.W.J. Granger and R.F. Engle (1993): ‘Long Memory Property of Stock Market Returns and a New Model’, Journal of Empirical Finance 1, 83-106
Doornik, J.A. (2002): ‘Object-Oriented Matrix Programming Using Ox’, 3rd ed. London, Timberlake Consultants Press and Oxford, available from www.nuff.ox.ac.uk/Users/Doornik
Dueker, M.J. (1997): ‘Markov Switching in GARCH Processes and Mean Reverting Stock Market Volatility’, Journal of Business and Economic Statistics 15, 26-34
Engle, R.F. (1982): ‘Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of United Kingdom Inflation’, Econometrica 50(4), 987-1007
Engle, R.F. (1990): ‘Stock Volatility and the Crash of ’87: Discussion’, The Review of Financial Studies 3(1), 103-106
Engle, R.F. and V.K. Ng (1993): ‘Measuring and Testing the Impact of News on Volatility’, Journal of Finance 48, 1749-1778
Epanechnikov, V. (1969): ‘Nonparametric Estimation of a Multidimensional Probability Density’, Teoriya Veroyatnostej i Ee Primeneniya 14, 156-162
Fernandez, C. and M. Steel (1998): ‘On Bayesian Modelling of Fat Tails and Skewness’, Journal of the American Statistical Association 93, 359-371
Fong, P.W., W.K. Li and H.-Z. An (2006): ‘A Simple Multivariate ARCH Model Specified by Random Coefficients’, Computational Statistics & Data Analysis 51, 1779-1802
Francq, C., Roussignol, M. and J.-M. Zakoïan (2001): ‘Conditional Heteroskedasticity Driven by Hidden Markov Chains’, Journal of Time Series Analysis 22, 197-220
Francq, C. and J.-M. Zakoïan (2005): ‘The L2-Structures of Standard and Switching-Regime GARCH Models’, Stochastic Processes and their Applications 115, 1557-1582
Francq, C. and J.-M. Zakoïan (2007): ‘Deriving the Autocovariances of Powers of Markov-Switching GARCH Models, with Applications to Statistical Inference’, forthcoming in Computational Statistics & Data Analysis
Godfrey, L.G. (1988): ‘Misspecification Tests in Econometrics’, Cambridge University Press
20
Glosten, L. R, R. Jagannathan and D.E. Runkle (1993): ‘On the Relationship between Expected Value and the Volatility of Excess Returns on Stocks’, Journal of Finance 48, 1779-1801
Gray, S. F. (1996): ‘Modelling the Conditional Distribution of Interest Rates as a Regime-Switching Process’, Journal of Financial Economics 42, 27-62
Haas, M., S. Mittnik and M.S. Paolella (2004a): ‘Mixed Normal Conditional Heteroskedasticity’, Journal of Financial Econometrics 2, 211-250
Haas, M., S. Mittnik and M.S. Paolella (2004b): ‘A New Approach to Markov Switching GARCH Models,” Journal of Financial Econometrics, 2, 493-530
Haas, M., S. Mittnik and M.S. Paolella (2006): ‘Multivariate Normal Mixture GARCH’, Center for Financial Studies Discussion Paper 2006/09
Hamilton, J.D. and R. Susmel (1994): ‘Autoregressive Conditional Heteroscedasticity and Changes in Regime’, Journal of Econometrics 64, 307-333
Hansen, P.R., A. Lunde and J.M. Nason (2003): ‘Choosing the Best Volatility Models: The Model Confidence Set Approach’, Oxford Bulletin of Economics and Statistics 65(Supplement), 839-861
Harvey, C.R. and A. Siddique (1999): ‘Autoregressive Conditional Skewness’, Journal of Financial and Quantitative Studies, Vol. 34, No. 4, 465-487
Henneke, J.S., S.T. Rachev and F.J. Fabozzi (2006): ‘MCMC Based Estimation of Markov Switching ARMA-GARCH Models’, University of Karlsruhe Working Paper
Hentschel, L. (1995): ‘All in the Family: Nesting Symmetric and Asymmetric GARCH Models’, Journal of Financial Economics 39, 71-104
Hill, B.M. (1975): ‘A Simple General Approach to Inference about the Tail of a Distribution’, The Annals of Statistics 3(5), 1163-1174
Huisman, R., K.G. Koedijk, C.J.M. Kool and F. Palm (2001): ‘Tail-Index Estimator in Small Sample’, Journal of Business and Economic Statistics 19, 208-216
Huisman, R., K.G. Koedijk, C.J.M. Kool and F. Palm (2002): ‘The Tail-Fatness of FX Returns Reconsidered’, De Economist 150, 299-312
Hwang, S.Y. and I.V. Basawa (2004): ‘Stationarity and Moment Structure for Box-Cox Transformed Threshold GARCH(1,1) Processes’, Statistics & Probability Letters 68, 209-220
Jackwerth, J. (2000): ‘Recovering Risk Aversion from Option Prices and Realized Returns’, Review of Financial Studies 13, 433-451.
Jackwerth, J.C. and M. Rubinstein (1996): ‘Recovering Probabilities from Option Prices’, Journal of Finance 51, 1611-1631
Khamis, H. J. (2000): ‘The Two-Stage δ-Corrected Kolmogorov-Smirnov Test’, Journal of Applied Statistics 27(4), 439-450
Klaassen, F. (2002): ‘Improving GARCH Volatility Forecasts with Regime-Switching GARCH’, Empirical Economics 27, 363-394
Kolmogoroff, A. (1933): ‘Sulla Determinazione Empirica Di Una Legge Di Distribuzione’, Giornale dell’Istituto Italiano degh Attuari 4, 83-91
Koutmos, G., C. Negakis and P. Theodossiou (1993): ‘Stochastic Behaviour of the Athens Stock Exchange’, Applied Financial Economics 3(2), 119-126
Lanne, M. and P. Saikkonen (2003): ‘Modeling the U.S. Short-Term Interest Rate by Mixture Autoregressive Processes’, Journal of Financial Econometrics 1, 96-125
21
Lanne, M. and P. Saikkonen (2007): ‘Modeling Conditional Skewness in Stock Returns’, forthcoming in The European Journal of Finance
Laurent, S. and J.-P. Peters (2002): ‘G@RCH 2.2: an Ox Package for Estimating and Forecasting Various ARCH Models’, Journal of Economic Surveys 16(3), 447-485
Liu, J.C. (2006): ‘Stationarity of a Markov-Switching GARCH Model’, Journal of Financial Econometrics 4(4), 573-593
Liu, J.C. (2007): ‘Stationarity for a Markov-Switching Box-Cox Transformed Threshold GARCH Process’ Statistics and Probability Letters 77, 1428-1438
Marcucci, J. (2005): ‘Forecasting Stock Market Volatility with Regime-Switching GARCH Models’, Studies in Nonlinear Dynamics and Econometrics 9(4), Article 6
Massey, F. J. Jr. (1951): ‘The Kolmogorov-Smirnov Test for Goodness of Fit’, Journal of the American Statistical Association 46(253), 68-78
Nelson, D.B. (1990): ‘Stationarity and Persistence in the GARCH(1,1) Model’, Econometric Theory 6, 318-334
Nelson, D.B. (1991): ‘Conditional Heteroscedasticity in Asset Returns: a New Approach’, Econometrica 59, 347-370
Newey, W.K. (1985): ‘Generalized Method of Moments Specification Testing’, Journal of Econometrics 29, 229-256
Rosenberg, J.V. and R.F. Engle (2002): ‘Empirical Pricing Kernels’, Journal of Financial Economics 64, 341–372.
Rubinstein, M. (1994): ‘Implied Binomial Trees’, Journal of Finance 49, 771-818
Schwert, G.W. (1989): ‘Why Does Stock Market Volatility Change Over Time?’, Journal of Finance 44, 1115-1153
Sentana, E. (1995): ‘Quadratic ARCH Models’, Review of Economic Studies 62, 639-661
Smirnov, N. (1939): ‘Sur Les Ecarts De La Courbe De Distribution Empirique’ (Russian, French summary), Matematicheskii Sbornik 48(6), 3-26
Taylor, S. (1986): ‘Modelling Financial Time Series’, John Wiley & Sons
Tompkins, R.G. (2001): ‘Implied Volatility Surfaces: Uncovering Regularities for Options on Financial Futures’, The European Journal of Finance 7, 198-230
Vlaar, P.J.G. and F.C. Palm (1993): ‘The Message in Weekly Exchange Rates in the European Monetary System: Mean Reversion, Conditional Heteroscedasaticity, and Jumps’, Journal of Business and Economic Statistics 11(3), 351-60
Wong, C.S. and W.K. Li (2001): ‘On a Mixture Autoregressive Conditional Heteroscedastic Model’, Journal of the American Statistical Association 96 (455), 982-995
Wu, G. (2001): ‘The Determinants of Asymmetric Volatility’, The Review of Financial Studies 14(3), 837-859
Zakoïan, J.-M. (1994): ‘Threshold Heteroskedastic Models’, Journal of Economic Dynamics & Control 18(5), 931-955
22
Appendix: Theoretical Properties of Asymmetric NM-GARCH Models
I: Moments
We use the following notations: ( ) ( )2 2ε σt tx E E= = and ( )2σi ity E= for i = 1, …, K.
Taking expectations of (3) and (6) yields the result for NM-AGARCH and taking expectations of (4) and
(6) yields the result for the NM-GJR model. We get:
( ) ( )
*2
1 12 2
1
ωμ(1 β )ε σδ1
(1 β )
K Ki i
i ii i i
t tK
i i
i i
pp
x E Ep
= =
=
∑ ∑
∑
+−
= = =⎛ ⎞
−⎜ ⎟−⎝ ⎠
,
( ) ( )1 *1 β ω δi i i iy x−= − + i = 1,…, K
where 2
* ω α λ for NM - AGARCHω
ω for NM -GJRi i i
ii
⎧ += ⎨
⎩
( )( )α for NM - AGARCH
δα λ 1 ρ for NM -GJR
i
ii i tE d−
⎧⎪= ⎨ + +⎪⎩
and ρ is the correlation between td− and 2εt .18
The third moment is ( ) ( ) ( )3 3 2
1 1ε ε μ 3 μ
K K
t i i t i i i ii i
h E p E p y= =∑ ∑= = = + and the skewness can be expressed as:
3/2
hs
x= .
The excess kurtosis in both models is:
( )( )
4
2 22
εκ 3 3
εt
t
E zxE
= − = − where ( )1
41
3 'ε1 3 't
sz E
−
−
−= =
−p B f
p B g
The fourth moment uses the following notation:
( )1 , , 'Kp p=p … ( )2 2 4
16μ μ
K
i i i ii
s p y=∑= +
( )
21 1 1 11 1 1 12 1 1 1
22 2 21 2 2 2 22 2
21 2
1 β 2δ β 2δ β 2δ β2δ β 1 β 2δ β 2δ β
2δ β 2δ β 1 β 2δ β
K
K K Kij
K K K K K K K K K KK
e e e
e e eb
e e e
⎡ ⎤− − − −⎢ ⎥− − − −⎢ ⎥= =⎢ ⎥⎢ ⎥
− − − −⎢ ⎥⎣ ⎦
B
……
…
where and ij ij je a p=
18 In our empirical application we have approximated ( )tE d − by 0.5 and ρ by 0.
23
and ( )
11 2 1 2
1 1 1 2 11
21 2 1 2
12 1 2 22
1 1 2 2
11 2
δ ββ δ δ β11 β β 1 β β 1 β β
δ βδ β β δ11 β β 1 β β 1 β β
δ β δ β β δ11 β β 1 β β 1 β β
KK Kk k
k k Kk
KK Kk k
k k Kij k
KK K k K k
kK K K kk K
pp p
pp p
a
p p p
=≠
=≠
=≠
∑
∑
∑
⎡ ⎤− − −⎢ ⎥− − −⎢ ⎥⎢ ⎥⎢ ⎥− − −
− − −⎢ ⎥= =⎢ ⎥⎢ ⎥⎢ ⎥⎢ ⎥− − −
− − −⎢ ⎥⎣ ⎦
A
…
…
…
1 1 1 1γ 2δ β
γ 2δ βK K K K
d
d
+⎛ ⎞⎜ ⎟= ⎜ ⎟⎜ ⎟+⎝ ⎠
g and 1 1 1 12δ β
2δ βK K K K
w c
w c
+⎛ ⎞⎜ ⎟= ⎜ ⎟⎜ ⎟+⎝ ⎠
f
Also: 1 1
δ δ1 β β
K K k j ki ij
j k j kk j
pd a
= =≠
∑ ∑⎛ ⎞⎜ ⎟=⎜ ⎟−⎝ ⎠
and 2
2
α for NM - AGARCHγ
α λ δ for NM -GJRi
ii i i
⎧= ⎨
+⎩
Furthermore:
( ) ( ) ( )2 2 2 2 4 2 2
2
ω 2ω α 6α λ α λ 4λ 2ω α λ 2 ω α λ β for NM - AGARCH
ω 2ω δ 2ω β for NM - GJRi i i i i i i i i i i i i i i i
i
i i i i i
x h yw
x y
⎧ + + + − + + +⎪= ⎨+ +⎪⎩
1 1 1 β βK K k jk
i ij jj k j kk j
p rc a y q
= =≠
∑ ∑⎡ ⎤⎛ ⎞⎢ ⎥⎜ ⎟= +⎢ ⎥⎜ ⎟−⎝ ⎠⎣ ⎦
with 2
1μ
K
k kk
q p=∑=
( ) ( ) ( )
( ) ( )( )
2 2
2 2 2 2 2 2
ω ω ω α ω α α α λ 4λ λ λ 2α α λ λfor NM - AGARCH
β ω α λ β ω α λ ω α λ ω α λ α α λ λ
ω ω ω δ ω δ β ω β ω for NM -GJR
i k i k k i i k i i k k i k i k
ik i i k k k k k i i i i k k k i i i k i k
i k i k k i i i k k k i
x h
r y y
x y y
⎧ ⎡ ⎤+ + + + + − +⎣ ⎦⎪⎪= ⎨+ + + + + + +⎪
+ + + +⎪⎩
II: Parameter Constraints
We require the following set of conditions for the existence and non-negativity of variance and the
finiteness of the third moment.19 These conditions are required for the existence and positivity of the
overall variance and individual variances as well. However, as kindly noted by one of the referees, these
conditions don’t guarantee the existence of the third moment. These conditions only assure that, if the
third moment exists, then it will be finite. Nelson �1990� gave sufficient and necessary conditions for
the existence of the moments of the GARCH�1,1� model, proving that for the normal GARCH model
given by the equation
( )2 2 21 1σ ω αη β σt t t− −= + +
the pth moment of the error term exists iff
19
24
( ) /221αη β 1.
p
tE −⎡ ⎤+ <⎢ ⎥⎣ ⎦
For example, when we assume that α = 0.09875 and β = 0.9 then the second moment exists, but the third
and higher moments do not. Unfortunately, no such sufficient and necessary conditions exist for the NM-
GARCH model. Also, no straightforward parameter constraint exists for the existence and finiteness of
the fourth moment; we simply put ( )40 εtE< < ∞ .
In both the NM-AGARCH and NM-GJR models we require that:
( )*
2
1 1
ωμ 01 β
K Ki i
i ii i i
pm p
= =∑ ∑= + >
−,
( )( )1
1 δ β0
1 βK i i i
i i
pn
=∑
− −= >
− and *ω δ 0i i
mn
+ >
Also, we must have that 1
10 1, 1, , 1, 1, 0 α , 0 β 1
K
i i i ii
p i K p−
=∑< < = − < < ≤ <… .
III: Numerical Derivatives of the Asymmetric NM-GARCH Models
The only difference from the NM-GARCH model numerical derivatives (Alexander and Lazar, 2006) is
the first and second order derivatives of 2σ it with respect to γi and these are as follows:
2 21σ σβit it
i−∂ ∂
= +∂ ∂it
i i
zγ γ
; 2 2 2 2
1σ σβ' '
it itit iw −∂ ∂
= +∂ ∂ ∂ ∂i i i iγ γ γ γ
with Tit it itw A A= + .
NM-AGARCH: ( ) ( )( )2 21 1 11, ε λ , 2α ε λ ,σ 't i i t i it− − −= − − −itz . The starting values (t = 0) are:
( )2
2 2 20σ 1, λ , 2α λ , 'ii i is s
∂= +
∂ iγ, where
2
2 1ε
T
ttsT
=∑
=
1
'21
0 0 0 00 0 0 00 2(ε λ ) α 0
σ
t i iit
it
A −
−
⎡ ⎤⎢ ⎥⎢ ⎥⎢ ⎥− −=⎢ ⎥
⎛ ⎞⎢ ⎥∂⎜ ⎟⎢ ⎥∂⎝ ⎠⎣ ⎦iγ
with ( )
2 22 20
2 2 2
0 0 0 10 0 2λ λσ 10 2λ 2α 2α λ' 1 β1 λ 2α λ 2
i ii
i i i ii
i i i
s
s s
⎡ ⎤⎢ ⎥+∂ ⎢ ⎥=⎢ ⎥∂ ∂ −⎢ ⎥+⎣ ⎦
i iγ γ.
NM-GJR: ( )2 2 21 1 1 11,ε , ε ,σ 't t t itd−
− − − −=itz . The starting values for this expression (t = 0) are:
( )2
2 2 20σ 1, , 0.5 , 'i s s s∂
=∂ iγ
, where
2
2 1ε
T
ttsT
=∑
=
25
'21
0 0 0 00 0 0 00 0 0 0
σit
it
A
−
⎡ ⎤⎢ ⎥⎢ ⎥⎢ ⎥=⎢ ⎥
⎛ ⎞⎢ ⎥∂⎜ ⎟⎢ ⎥∂⎝ ⎠⎣ ⎦iγ
with ( )
22 20
2
2 2 2
0 0 0 10 0 0σ 10 0 0 0.5' 1 β1 0.5 2
i
i
s
s
s s s
⎡ ⎤⎢ ⎥∂ ⎢ ⎥=⎢ ⎥∂ ∂ −⎢ ⎥⎣ ⎦
i iγ γ.
IV: ACF of the Squared Errors in the Asymmetric NM-GARCH Models20
( ) ( )( )2 2 2 2 2 2
2 224 22
ε ,ε ε ερ ε ,ε
εεt t k t t k k
k t t ktt
Cov E x c xCorr
z xE xVar− −
−
⎡ ⎤ − −⎣ ⎦= = = =−⎡ ⎤ −⎣ ⎦
, where
2 2 2 2 2 2
1 1 1 1ε ε μ σ ε μ
K K K K
k t t k i i i it t k i i i iki i i i
c E x p p E x p p b− −= = = =∑ ∑ ∑ ∑⎡ ⎤ ⎡ ⎤= = + = +⎣ ⎦ ⎣ ⎦
NM-AGARCH: ( )( )
21 1
21 0 0
ω α λ α β , 1
ω α λ α β 2α λ , 1
ik i i i i k i ik
i i i i i i i i i
b x c b k
b x c b h k
− −= + + + >
= + + + + − =
NM-GJR: ( ) 1 1ω α 0.5λ βik i i i k i ikb x c b− −= + + +
The starting values are: 0c z= and ( )10 'i i i ib c d z z−= + + +e B f g .
20 Since the variance of the NM(K)-GARCH(1,1) model can be expressed as a GARCH(K,K) variance, according to Bollerslev (1986) the autocorrelations can also be written as an AR(K) process.
26
Table 2. Estimation results for the CAC 40 Index
Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) p1(d.f. for (4)-(9)) 11.5408 12.2000 12.1752 12.1359 12.6697 12.7269 0.9356 0.9790 0.9765 0.9141 0.9780 (8.60) (8.88) (8.79) (7.99) (8.27) (8.13) (40.62) (78.45) (70.10) (30.24) (79.76)
μ1(Sk..for (7)-(9)) -0.0724 -0.0728 -0.0751 0.0084 0.0050 (-2.96) (-2.99) (-3.08) (2.48) (2.63) ω1 6.4E-4 9.6E-5 5.9E-4 3.9E-4 -8.2E-5 4.6E-4 3.9E-4 -5.8E-5 4.7E-4 2.2E-4 -1.2E-4 4.0E-4 1.7E-4 -1.1E-4 (6.43) (0.66) (5.84) (3.24) (-0.43) (4.01) (3.24) (-0.31) (4.04) (2.44) (-0.75) (4.06) (2.06) (-0.69) α1 0.0663 0.0495 0.0166 0.0600 0.0545 0.0166 0.0606 0.0553 0.0173 0.0486 0.0517 0.0157 0.0446 0.0524 (9.83) (10.97) (3.02) (7.95) (7.60) (2.17) (8.05) (7.66) (2.26) (7.59) (7.90) (2.25) (7.20) (8.07) λ1 0.1060 0.0664 0.1082 0.0732 0.1067 0.0732 0.1094 0.0692 0.1058 (7.08) (7.91) (5.89) (6.25) (5.87) (6.26) (6.17) (6.57) (6.22) β1 0.9186 0.9345 0.9348 0.9310 0.9319 0.9346 0.9303 0.9312 0.9340 0.9386 0.9313 0.9344 0.9432 0.9311 (112.80) (150.07) (141.81) (108.12) (113.70) (115.80) (108.16) (112.93) (115.80) (124.47) (121.15) (123.20) (131.68) (122.93) p2 0.0644 0.0210 0.0235 0.0859 0.0220 μ2 -0.0890 -0.2235 ω2 0.0148 0.0087 0.0174 0.0085 -0.0018 (1.70) (0.18) (0.35) (1.88) (-0.12) α2 0.5600 1.6171 1.5287 0.5340 1.5497 (0.87) (0.52) (0.45) (1.33) (0.85) λ2 -0.0375 -0.4967 -0.0311 (-0.45) (-0.33) (-0.62) β2 0.6794 0.6336 0.6352 0.6893 0.6172 (2.72) (1.04) (0.95) (4.33) (1.68) Unconditional σ 20.55% 20.18% 19.65% 20.86% 20.22% 19.47% 20.69% 20.55% 19.79% 20.80% 21.30% 20.30% 20.81% 21.77%
Unconditional σ1 19.46% 20.35% 19.39% 19.23% 20.74%
Unconditional σ2 34.89% 47.97% 43.86% 31.93% 43.71%
Unconditional τ -0.0881 -0.0861 -0.0883 -4.32E-6 -1.41E-5 -1.33E-5 -0.172 -0.235
Unconditional k 1.2503 0.8485 1.1207 3.3426 2.6090 4.8650 – – – 4.0435 12.8803 6.0332 4.8605 11.8355
BIC -0.4634 -0.4728 -0.4722 -0.4842 -0.4941 -0.4931 -0.4844 -0.4943 -0.4934 -0.4768 -0.4851 -0.4838 -0.4794 -0.4875
Moment tests 1% 3 3 3 2 1 1 3 1 1 4 3 3 4 1
Density 1.3101 1.4771 0.9944 0.9228 1.1713 1.5195 0.8417 0.9824 0.8340 0.7382 0.8662 0.9965 0.5757 1.1570
ACF 0.0500 0.1848 0.1033 0.3976 0.0677 0.3012 – – – 0.0665 0.0375 0.0463 0.1077 0.0393
27
Table 3. Estimation results for the DAX 30 Index
Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) p1(d.f. for (4)-(9)) 9.002572 9.537122 9.596825 9.392559 9.8487 9.9290 0.9660 0.9762 0.9680 0.9613 0.9701 0.9604 (11.72) (11.94) (11.80) (11.18) (11.40) (11.25) (100.65) (118.45) (98.95) (92.86) (97.75) (89.04)
μ1(Sk..for (7)-(9)) -0.1056 -0.1064 -0.1069 0.0044 0.0041 0.0043 (-4.41) (-4.43) (-4.46) (2.29) (2.12) (2.19) ω1 8.8E-4 4.7E-4 8.9E-4 3.4E-4 1.4E-4 4.4E-4 3.5E-4 1.7E-4 4.6E-4 1.5E-4 3.2E-5 2.5E-4 1.4E-4 2.9E-5 2.3E-4 (8.02) (3.80) (7.83) (3.37) (0.95) (4.05) (3.42) (1.17) (4.17) (2.32) (0.29) (3.18) (2.21) (0.27) (3.11) α1 0.0825 0.0668 0.0320 0.0789 0.0772 0.0413 0.0787 0.0776 0.0414 0.0608 0.0664 0.0363 0.0595 0.0640 0.0355 (11.59) (13.59) (4.93) (8.48) (8.27) (3.83) (8.53) (8.33) (3.92) (9.19) (8.98) (4.24) (9.18) (8.92) (4.26) λ1 0.0764 0.0768 0.0629 0.0731 0.0628 0.0730 0.0633 0.0549 0.0639 0.0538 (7.82) (7.53) (4.64) (5.01) (4.69) (5.00) (4.93) (4.63) (4.89) (4.54) β1 0.8982 0.9135 0.9080 0.9155 0.9142 0.9121 0.9153 0.9137 0.9121 0.9282 0.9200 0.9216 0.9291 0.9220 0.9230 (100.01) (134.74) (115.60) (96.70) (96.17) (93.55) (97.01) (96.16) (93.91) (125.62) (113.55) (113.67) (127.09) (114.79) (115.37) p2 0.0340 0.0238 0.0320 0.0387 0.0299 0.0396 μ2 -0.1096 -0.1328 -0.1039 ω2 0.0354 -0.0150 0.0394 0.0259 0.0335 0.0317 (0.49) (-0.04) (0.48) (0.59) (0.39) (0.59) α2 0.4442 0.2226 -0.1441 0.3941 0.3729 -0.1193 (0.58) (0.31) (-0.31) (0.70) (0.48) (-0.71) λ2 0.7468 0.8197 0.0470 0.7002 (0.35) (0.63) (0.19) (0.75) β2 0.7120 0.4451 0.7110 0.7345 0.7155 0.7155 (1.30) (0.84) (1.18) (1.75) (1.04) (1.48) Unconditional σ 21.31% 20.86% 20.22% 24.85% 22.75% 20.97% 24.10% 23.31% 21.45% 21.83% 21.52% 20.11% 21.40% 21.71% 20.22%
Unconditional σ1 20.60% 20.54% 18.98% 20.11% 20.59% 19.01%
Unconditional σ2 44.30% 46.39% 41.64% 40.68% 42.70% 38.01%
Unconditional τ -0.1517 -0.1467 -0.1464 -7.97E-6 -8.40E-6 -1.01E-5 -0.1678 -0.1697 -0.1673
Unconditional k 1.6567 1.1240 1.5100 – 12.6898 – – 1.7023 – 5.1276 3.5368 3.0636 4.6434 3.9313 2.7148
BIC -0.4339 -0.4393 -0.4410 -0.4884 -0.4925 -0.4927 -0.4912 -0.4955 -0.4957 -0.4853 -0.4878 -0.4875 -0.4870 -0.4891 -0.4893
Moment tests 1% 2 2 3 2 1 2 0 1 1 3 2 2 2 2 2
Density 2.2739 2.3948 1.9538 1.4085 1.8255 2.4724 1.2864 1.4249 1.3516 1.0503 1.2268 2.1285 0.9681 1.0154 1.5470
ACF 0.1889 0.3591 0.3116 – 0.9764 – – 0.0570 – 0.0584 0.1345 0.1800 0.0443 0.0620 0.2201
28
Table 4. Estimation results for the FTSE 100 Index
Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) p1(d.f. for (4)-(9)) 13.7316 14.1768 14.0745 14.4429 14.6183 14.6046 0.9462 0.9598 0.9587 0.9288 0.9583 0.9564 (6.47) (6.67) (6.74) (6.42) (6.59) (6.69) (32.97) (42.96) (40.14) (1.00) (42.80) (38.70)
μ1(Sk..for (7)-(9)) -0.0679 -0.0657 -0.0669 0.0049 0.0029 0.0036 (-2.87) (-2.80) (-2.85) (1.00) (1.68) (1.97) ω1 2.5E-4 -4.7E-5 2.4E-4 2.1E-4 -1.2E-4 2.1E-4 2.0E-4 -1.1E-4 2.1E-4 1.3E-4 -1.7E-4 1.6E-4 1.1E-4 -1.6E-4 1.6E-4 (4.40) (-0.54) (5.39) (3.35) (-1.10) (4.25) (3.34) (-1.02) (4.26) (2.52) (-1.74) (3.72) (2.24) (-1.65) (3.78) α1 0.0734 0.0558 0.0155 0.0674 0.0547 0.0114 0.0674 0.0547 0.0122 0.0570 0.0493 0.0074 0.0532 0.0482 0.0069 (10.94) (8.54) (2.42) (8.09) (7.23) (1.61) (8.19) (7.32) (1.75) (7.47) (7.25) (1.25) (7.17) (7.26) (1.21) λ1 0.0785 0.0781 0.0862 0.0820 0.0849 0.0810 0.0902 0.0756 0.0899 0.0738 (6.39) (8.96) (5.63) (7.28) (5.66) (7.26) (6.07) (7.72) (6.03) (7.73) β1 0.9171 0.9316 0.9350 0.9247 0.9331 0.9379 0.9247 0.9333 0.9379 0.9305 0.9348 0.9416 0.9344 0.9362 0.9432 (118.21) (129.34) (137.35) (101.89) (112.72) (121.93) (103.05) (114.24) (122.91) (109.31) (118.63) (133.69) (1.00) (121.85) (137.95) p2 0.0538 0.0402 0.0413 0.0712 0.0417 0.0436 μ2 -0.0632 -0.0670 -0.0796 ω2 0.0101 0.0109 0.0113 0.0056 0.0076 0.0051 (1.19) (0.83) (0.65) (1.00) (0.78) (0.78) α2 0.7933 0.9230 0.9226 0.6722 0.9401 0.9888 (2.12) (1.70) (1.82) (1.00) (1.88) (1.94) λ2 -0.0109 -0.0159 -0.0192 -0.3350 (-0.24) (-0.02) (-0.44) (-0.54) β2 0.5785 0.5644 0.5786 0.6250 0.5821 0.6658 (2.82) (2.12) (1.70) (1.00) (2.59) (4.37) Unconditional σ 16.24% 15.38% 15.05% 16.13% 15.17% 14.79% 15.95% 15.37% 15.01% 15.86% 15.21% 14.81% 15.72% 15.51% 15.24%
Unconditional σ1 15.01% 14.49% 14.03% 14.74% 14.74% 14.39%
Unconditional σ2 26.69% 27.26% 27.27% 24.36% 27.04% 26.89%
Unconditional τ -0.0742 -0.0713 -0.0726 -7.26E-6 -8.67E-6 -1.13E-5 -0.1354 -0.1186 -0.1576
Unconditional k 3.9916 1.5162 3.0582 5.5489 2.7292 8.2826 – – – 7.8609 3.8048 3.6864 7.9404 4.3925 4.4979
BIC -1.0583 -1.0689 -1.0704 -1.0687 -1.0794 -1.0812 -1.0685 -1.0791 -1.0810 -1.0638 -1.0738 -1.0760 -1.0657 -1.0750 -1.0777
Moment tests 1% 4 1 1 2 1 2 2 1 0 2 1 1 2 0 0
Density 60.7739 1.2624 0.6579 0.7213 1.2385 1.7367 1.0483 1.1607 0.9953 0.6626 1.0954 1.6242 0.8146 0.8484 0.8520
ACF 0.4478 0.0988 0.1832 0.8115 0.0976 0.6602 – – – 0.2547 0.1275 0.1529 0.2636 0.1087 0.1239
29
Table 5. Estimation results for the Eurostoxx 50 Index
Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) p1(d.f. for (4)-(9)) 8.1884 8.5490 8.5869 8.5988 8.8517 8.9319 0.9506 0.9567 0.9571 0.9444 0.9493 0.9490 (10.91) (10.77) (10.86) (10.33) (10.22) (10.27) (69.16) (82.20) (78.95) (64.15) (66.80) (67.90)
μ1(Sk..for (7)-(9)) -0.1017 -0.1029 -0.1049 0.0057 0.0054 0.0055 (-4.25) (-4.31) (-4.39) (3.21) (2.98) (3.07) ω1 4.5E-4 1.6E-4 4.5E-4 2.2E-4 9.1E-5 2.7E-4 2.3E-4 1.1E-4 2.8E-4 1.1E-4 4.0E-5 1.6E-4 1.1E-4 5.4E-5 1.7E-4 (8.23) (1.88) (8.30) (3.34) (0.95) (3.89) (3.38) (1.18) (4.01) (2.30) (0.60) (2.95) (2.26) (0.80) (3.03) α1 0.0691 0.0586 0.0299 0.0743 0.0720 0.0410 0.0752 0.0734 0.0419 0.0624 0.0661 0.0394 0.0605 0.0642 0.0384 (11.79) (13.24) (4.98) (8.77) (8.52) (4.02) (8.89) (8.61) (4.19) (9.37) (9.26) (4.64) (9.17) (8.97) (4.55) λ1 0.0709 0.0630 0.0508 0.0623 0.0504 0.0635 0.0428 0.0510 0.0443 0.0520 (7.70) (6.97) (4.24) (4.61) (4.32) (4.70) (4.01) (4.57) (4.14) (4.66) β1 0.9167 0.9270 0.9230 0.9209 0.9208 0.9198 0.9198 0.9194 0.9185 0.9249 0.9208 0.9203 0.9261 0.9206 0.9196 (131.78) (168.25) (147.83) (109.75) (110.01) (108.43) (109.38) (108.94) (107.88) (124.03) (119.86) (117.61) (121.98) (112.35) (110.74) p2 0.0494 0.0433 0.0429 0.0556 0.0507 0.0510 μ2 -0.0967 -0.1018 -0.1030 ω2 0.0140 -0.0055 0.0253 0.0098 0.0084 0.0089 (0.85) (-0.07) (0.67) (1.03) (0.48) (0.58) α2 0.3306 0.2710 -0.0625 0.3155 0.2288 0.1122 (0.88) (0.67) (-0.20) (1.05) (0.64) (0.26) λ2 0.4416 0.6205 0.0641 0.1900 (0.77) (0.60) (0.39) (0.69) β2 0.7860 0.4909 0.7008 0.7996 0.8332 0.8416 (3.37) (2.92) (1.63) (4.44) (3.24) (3.15) Unconditional σ 17.89% 17.78% 17.02% 21.60% 19.57% 18.06% 21.17% 20.37% 18.80% 18.65% 18.26% 17.32% 18.21% 18.47% 17.64%
Unconditional σ1 17.43% 17.27% 16.25% 16.91% 17.28% 16.43%
Unconditional σ2 34.54% 33.29% 33.09% 31.77% 32.09% 31.09%
Unconditional τ -0.1591 -0.1564 -0.1579 -1.05E-5 -8.26E-6 -1.20E-5 -0.2015 -0.1882 -0.2101
Unconditional k 1.5502 1.2574 1.5431 – 14.7870 – – 3.7850 – 5.1022 3.2070 2.8952 4.7779 3.8641 3.3103
BIC -0.7481 -0.7553 -0.7547 -0.7979 -0.8014 -0.8015 -0.8008 -0.8045 -0.8047 -0.7895 -0.7910 -0.7912 -0.7927 -0.7936 -0.7943
Moment tests 1% 2 2 3 2 2 2 2 1 2 4 4 4 3 2 3
Density 2.1384 2.2230 1.8409 1.1173 1.5988 2.2568 1.0173 1.1309 1.0852 1.2707 1.2732 1.4062 0.7634 0.9386 0.7297
ACF 0.2479 0.4044 0.3427 – 0.8580 – – 0.2339 – 0.0508 0.3303 0.3867 0.0720 0.1501 0.2778
30
Notes for tables 2, 3, 4 and 5: The models are: (1) GARCH, (2) AGARCH and (3) GJR, all three with normally distributed errors; (4) GARCH, (5) AGARCH and (6) GJR, all three with symmetric Student’s t distributed errors; (7) GARCH, (8) AGARCH and (9) GJR, all three with skewed Student’s t distributed errors; (10) NM-GARCH, (11) NM-AGARCH and (12) NM-GJR, all three normal mixture GARCH models with zero means in the mixture component densities; (13) NM-GARCH, (14) NM-AGARCH and (15) NM-GJR, all three general normal mixture GARCH models. Numbers in parenthesis represent t-values. A ‘–’ indicates negative kurtosis, or an unreasonably high value for the ACF statistic; meaning that in that model the fourth moment is not defined.
31
Table 6. In-sample VaR results
Test Significance Data (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15)
CAC 7.4*** 6.6** 7.4*** 2.3 1.9 1.5 0.0 0.1 0.2 2.8* 5.2** 5.2** 0.0 0.8
1% DAX 5.9** 5.2** 3.9** 0.4 0.6 0.8 0.0 0.1 0.0 2.4 4.6** 4.6** 0.0 0.6 0.2
FTSE 2.8* 2.3 1.5 1.1 0.0 0.2 0.0 0.2 0.3 1.5 1.1 0.6 0.0 0.1 0.0
EUROS 14*** 9.1*** 9.1*** 2.9* 0.6 0.6 0.0 0.0 0.0 6.7*** 2.9* 4.6** 0.0 0.0 0.0
CAC 9.0*** 11*** 17*** 0.1 0.0 0.1 1.3 1.9 3.7* 0.9 5.8** 5.8** 0.0 0.0
Independence 0.5% DAX 14*** 11*** 13*** 2.0 0.9 1.4 0.2 0.2 0.0 0.9 2.6 2.6 2.7* 1.9 0.8
FTSE 7.8*** 4.0** 5.8** 1.4 0.3 0.9 0.0 0.0 0.0 2.6 0.9 1.9 0.0 0.1 0.2
EUROS 19*** 13*** 15*** 3.7* 3.0 3.0 1.7 0.2 0.8 4.5** 0.8 1.2 0.2 0.2 0.1
CAC 7.2*** 5.3** 7.2*** 3.7* 2.3 2.3 0.2 1.2 0.4 1.0 0.2 0.2 1.0 1.0
0.1% DAX 14*** 9.3*** 9.3*** 2.3 0.4 0.4 1.2 0.4 0.0 0.2 1.2 1.2 1.0 1.0 0.2
FTSE 14*** 11*** 14*** 1.2 3.7* 5.3** 0.4 1.2 3.7* 1.2 1.2 1.2 0.4 0.0 0.4
EUROS 41*** 24*** 31*** 1.1 1.1 1.1 0.2 0.0 0.3 0.2 0.2 0.0 1.0 1.0 1.0
CAC 0.0 0.1 1.6 0.2 1.1 1.1 0.8 0.8 0.9 1.3 1.5 1.5 0.7 1.0
1% DAX 4.0** 4.2** 1.8 3.0* 2.8* 2.7 3.7* 4.0** 0.8 2.1 4.4** 4.4** 3.5* 2.8* 0.6
FTSE 1.2 1.2 1.1 1.0 0.7 0.9 0.8 0.7 0.6 1.1 1.0 1.0 0.8 0.8 0.7
EUROS 0.7 1.1 1.1 2.0 2.7* 2.7* 3.6* 0.7 0.7 1.3 1.3 0.1 0.8 0.8 0.8
CAC 0.6 0.7 0.8 0.2 0.2 0.2 0.1 0.1 0.1 0.3 0.5 0.5 0.2 0.2
Conditional 0.5% DAX 3.7* 0.9 0.8 6.5** 2.3 2.1 3.4* 3.4* 3.0* 0.3 0.4 0.4 0.1 0.1 0.1
Coverage FTSE 0.6 0.4 0.5 0.3 0.2 0.3 0.2 0.2 0.2 0.4 0.3 0.3 0.2 0.2 0.2
EUROS 3.0* 0.7 0.7 1.6 0.4 0.4 0.3 0.2 0.3 0.4 0.3 0.3 0.2 0.2 0.2
CAC 0.1 0.0 0.1 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
0.1% DAX 0.1 0.1 0.1 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
FTSE 0.1 0.1 0.1 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
EUROS 0.3 0.2 0.2 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
Note: The models are: (1) GARCH, (2) AGARCH and (3) GJR, all three with normally distributed errors; (4) GARCH, (5) AGARCH and (6) GJR, all three with symmetric Student’s t distributed errors; (7) GARCH, (8) AGARCH and (9) GJR, all three with skewed Student’s t distributed errors; (10) NM-GARCH, (11) NM-AGARCH and (12) NM-GJR, all three normal mixture GARCH models with zero means in the mixture component densities; (13) NM-GARCH, (14) NM-AGARCH and (15) NM-GJR, all three general normal mixture GARCH models. The reported test statistics follow a χ2(1) distribution. *, ** and *** indicate values significant at 10%, 5% and 1% significance level, respectively.
32
Fig. 1. The Returns on the Eurostoxx Index
-0.5
-0.4
-0.3
-0.2
-0.1
0
0.1
0.2
0.3
0.4
0.5
Jan-2004 Jan-2005
Fig. 2. The Conditional Volatilities and the Ex-post Time-varying Probability of the First State
for the Eurostoxx index
0%
50%
100%
Jan-2004 Jan-2005-1
-0.5
0
0.5
1
volatility 1
volatility 2
p 1
33
Fig. 3. Simulated FTSE Index Skews up to 3 Months
(a) GARCH
(b) GARCH with skewed t distribution
(c) GJR with skewed t distribution
34
(d) NM-GARCH
(e) NM-GJR
Note: Implied volatility skews for call options are simulated using 50,000 runs. Zero interest rate is assumed. The parameters for the simulation are taken from Table 3.