stock volatility regimes - carol alexander · 1 i introduction modelling regimes in market...

Modelling Regime Specific Stock Volatility Behaviour

Abstract

Any GARCH model with a single volatility state identifies only one mechanism governing the dynamic

response of volatility to market shocks, and the conditional standardized higher moments are constant,

unless modelled explicitly. Therefore such models cannot capture state dependent behaviour of

volatility and neither can they explain why the equity index skew persists into long dated options.

However the family of Markov switching (MS) GARCH models, which specify several deterministic

mean-reverting volatility states, have endogenous time-varying conditional variance, skewness and

kurtosis. Within this model class the normal mixture (NM) GARCH model, a MS GARCH with

constant state probabilities, is simplest to estimate. This paper extends previous theoretical and

empirical work on the NM GARCH model by introducing a state dependent leverage effect., which is

essential for modelling equity market volatility. With this construction we can explain the typical

empirical characteristics of equity index returns and implied volatility skews, even with a zero or

constant volatility risk premium. An empirical study of fifteen GARCH models of four European

equity indices between 1991 and 2005 identifies two-state asymmetric NM GARCH as the best fitting

model. The models show that the volatility behaviour is broadly similar in all indices during usual

market circumstances. But there is a small probability (between 2% and 5%) of the market crashing,

when the volatility behaviour of different indices becomes quite diverse, and the mean-reversion and

leverage effects are quite different from those in the usual regime.

JEL Classification Codes: C32, G13.

Keywords: GARCH, normal mixture, leverage effect, mean-reverting volatility, equity skew, stock market regimes, market crash

1

I Introduction

Modelling regimes in market volatility is of interest to most finance practitioners. For their assessment of

risk capital and for setting traders’ limits, risk managers need to understand how volatility may respond

differently to market shocks during a predominantly stable period, compared with a crash or recovery

period. When computing economic capital, and to enable traders' limits to be raised when markets are

considered to be exceptionally volatile, risk models are subjected to ‘stress tests’. These tests are also

required by most banking regulators but they are often performed using ad hoc adjustments. However, a

more precise approach to stress testing would specify an intensity for the shock to volatility and the

probability associated with the stressful scenario, and this requires a regime dependent volatility model.

Managing volatility following large price falls in an index is essential for governments and policy makers as

well, who are concerned about systemic risk, where insolvency in one sector of the economy may lead to

an economic crisis as in the global crash of 1987. Regulators need to understand how markets behave

under stressful conditions if their aim is to avert insolvency in a particular sector of the economy. Finally,

an understanding of best discrete time volatility model should help option prices to select from the

plethora of possible stochastic volatility models available, many of which will fit the market implied

volatility smile equally well.

A main characteristic of volatility is ‘mean-reversion’, whereby volatility persists after a market shock but

in the absence of any further shocks it will eventually return to a long-run volatility level. Mean-reversion

is captured in the standard approach to modelling volatility behaviour, i.e. the generalized autoregressive

conditional heteroscedasticity (GARCH) framework introduced by Engle (1982) and Bollerslev (1986).

However, normal GARCH models do not explain why both conditional and unconditional returns on

financial assets have such high skewness and kurtosis. For this reason Bollerslev (1987) extended the

GARCH model to include the Student’s t conditional distribution and Fernandez and Steel (1998)

enhanced it further using the skewed t-distribution.

Bekaert and Wu (2000), Wu (2001), Hansen, Lunde and Nason (2003) and many others argue that it is

important to also characterise the ‘leverage effect’ in stock markets, whereby volatility responds more to

negative shocks than to positive shocks of the same magnitude. Nelson (1991) introduced the first

GARCH process with a leverage effect. In this exponential model the returns are conditionally symmetric

(assuming volatility is known) but unconditional asymmetry is obtained from the dynamics of the variance

process. Amongst other GARCH processes with leverage effects are the absolute value GARCH based on

Taylor (1986, pp. 78-79) and Schwert (1989), the asymmetric and nonlinear asymmetric GARCH

introduced by Engle (1990) and Engle and Ng (1993)1, the GJR model of Glosten, Jagannathan and

1 As kindly noted by one of the referees, the asymmetric GARCH model was initially suggested by Engle (1990) and subsequently discussed in Engle and Ng (1993).

2

Runkle (1993), the asymmetric power GARCH model of Ding, Granger and Engle (1993), the threshold

GARCH of Zakoïan (1994), the quadratic GARCH of Sentana (1995), the Box-Cox transformed

GARCH of Hentschel (1995) and Hwang and Basawa (2004), which nests several different models, the

model of Brännäs and de Gooijer (2004) that extends QGARCH and, more recently, the model of Lanne

and Saikkonen (2007) that generalizes the AGARCH model.

All the above models assume that the GARCH volatility process has regime-invariant parameters and they

do not allow different types of shocks (of the same size) to precipitate different types of responses. Yet a

rumour may have an enormous effect but die out very quickly whereas an announcement of important

changes to economic policy (if represented by a shock of the same size as the rumour) may produce a less

volatile but more persistent response, after which the general level of volatility is raised. If the mean-

reversion and leverage effects are state dependent, the parameters estimated in a single state GARCH

process can represent only an average of these. Most single-state GARCH models also have constant

conditional skewness and conditional kurtosis. As a result, the option implied volatility smiles and skews

that are generated from these models are often unrealistic. Bates (1991), Christoffersen, Heston and

Jacobs (2006) and others argue that, for a model to capture the empirical characteristics of option implied

volatility skews, it is essential to account for time-variability in the physical conditional skewness and

kurtosis. So another reason for using more than one volatility state in a GARCH model is that both

conditional skewness and kurtosis can be endogenously time-varying.

More recent research examines Markov Switching (MS) GARCH and its more tractable relation: normal

mixture (NM) GARCH. Switching ARCH models were first introduced by Cai (1994) and Hamilton and

Susmel (1994). The straightforward generalization of this model to GARCH processes has estimation

problems due to the dependence of the conditional variance on the entire history of states. A solution was

suggested by Gray (1996) by replacing the lagged variance in the GARCH equation with its conditional

expectation; also the error term is allowed to follow a Student’s t distribution with switching degrees of

freedom parameter. Later on an enhancement was suggested by Klaassen (2002) making use of the full

information set available when computing the conditional probability of a state; see also Marcucci (2005)

for a model that allows for regime-specific means. However, these models are not very tractable.

One type of MS-GARCH was introduced by Francq, Roussignol and Zakoïan (2001); here there is only

one variance process but the parameters are state-dependent. The existence of the moments and the L2-

structures of this model were described in Francq and Zakoïan (2005), whilst the autocovariance structure

of the squared residuals were discussed in Francq and Zakoïan (2007); this paper also proposes a GMM

procedure for the estimation of the parameters. Henneke, Rachev and Fabozzi (2006) and Bauwens,

3

Preminger and Rombouts (2006, 2007) use Bayesian techniques to estimate this type of MS-GARCH

processes, whilst Dueker (1997) discusses an estimation method using a collapsing procedure to

approximate the likelihood function for a simplified version of MS-GARCH.

An improved MS-GARCH model was introduced and its properties presented by Haas, Mittnik and

Paolella (2004b) assuming that each state is characterized by its own variance and that the variance

process of each state depends only on its lagged values (as opposed to lagged values of the other variance

processes) and the squared residuals; for this model ML estimation is feasible. Weak stationarity properties

of the (1,1) model and conditions for the existence of the higher moments, relaxing the assumption of

initial finite variance, were discussed in Liu (2006); Abramson and Cohen (2007) also develop stationarity

properties for the models of Klaassen (2002) and Haas, Mittnik and Paolella (2004b) – for models of

order (p,q) – using a technique based on backward recursion.

Liu (2007) introduces a general model that allows for asymmetries in the variance equations; the focus of

this paper is the discussion of the stationarity properties and existence of higher moments of a

generalization of the model of Haas, Mittnik and Paolella (2004b) – a MS Box-Cox transformed threshold

GARCH(1,1), to be more precise.

All the above models are rather complex to estimate; a simpler model that ignores the autocorrelation in

the state variable (but, as opposed to some MS models, allows for different means in the states) is normal

mixture GARCH. Here the error term follows a mixture distribution. Early versions of NM-GARCH

models were discussed in Vlaar and Palm (1993), Bauwens, Bos and van Dijk (1999), Bai, Russell and

Tiao (2001, 2003) and Ding and Granger (1996). Concentrating on the individual means in the mixture

density, Wong and Li (2001) introduced a NM-ARCH model with individual AR processes in the mean

equations. This model was later extended to GARCH processes by Lanne and Saikkonen (2003); they also

assume that the mixing weights are functions of past observations. The general format of this model was

introduced by Haas, Mittnik and Paolella (2004a) and Alexander and Lazar (2006), providing strong

evidence that these models provide a better fit to exchange rate data than single-state GARCH models.

Bayesian estimation of the NM-GARCH was discussed in Bauwens and Rombouts (2007). Multivariate

extensions were considered by Haas, Mittnik and Paolella (2006), Fong, Li and An (2006) and Bauwens,

Hafner and Rombouts (2007) – the latter uses an estimation technique based on the EM algorithm.

Yet none of these models, with the exception of Henneke, Rachev and Fabozzi (2006) and Liu (2007),

have leverage effects in the state dependent GARCH volatility processes. The only source of skewness in

the physical returns densities is the different means in the normal components of the mixture conditional

4

density. Hence such models are not suitable when the leverage effect is known to be very important, as in

individual stock and stock index markets.

This paper examines the theoretical and empirical behaviour of state-dependent asymmetric volatility

processes in a normal mixture GARCH model. A main strength of the model is that it allows the

asymmetric volatility response to stock price shocks to be quantified in both usual and stressful market

circumstances. When two-state normal mixture models are applied to financial markets, often two states

can be differentiated: a ‘normal’ state that occurs most of the time and a ‘stressful’ state that occurs only

rarely. At every point of time the state is selected randomly, independently of the previous state. However,

as we will see later, following this random and unknown state-selection and after a return is realized from

the given distribution, it is possible to estimate the probability that a state was selected, and this will be the

ex-post inference about the probability of the state at time t, and it will be closely related to the actual state

of the world. Also, it allows the conditional higher moments to change in time. Implementing such a

model would allow risk capital assessments to be more refined and help risk managers to derive regime

specific limits for traders. It would also bring new insights, relevant to investors and policy makers, about

the likelihood of a market crash and the returns/volatility behaviour during a crash period. Two

asymmetric models are considered, namely the AGARCH and GJR extension of NM-GARCH, based on

the papers of Engle (1990), Engle and Ng (1993) and Glosten et al. (1993). One of the models, the NM-

GJR is a special case of the model of Liu (2007), with δ = 1 and with no autocorrelation for the state

variable; thus the theoretical results therein can be applied for the NM-GJR model in this paper. However,

this is not a restricted model of Henneke, Rachev and Fabozzi (2006), even though both consider GJR

variance processes.2 The other model discussed in this paper, NM-AGARCH, is not nested within the

family of models considered by Liu (2007).

Our empirical study on four major European equity market indices demonstrates the superiority of a two-

state asymmetric normal mixture GARCH model to capture the empirical characteristics of stock index

returns. We restrict our study to asymmetric NM-GARCH models as MS models don’t necessarily

perform better than mixture models3 and our focus is on the regime-specific asymmetric behaviour of the

variance process. We also compare the risk neutral skews generated by single component GARCH models

with the skews that are generated by the two-state asymmetric GARCH models. Even without a risk

premium the volatility smile implied by asymmetric normal mixture GARCH models exhibits a

pronounced skew that persists into long-dated options.

2 This is due to that NM-GARCH is a special case of the MS-GARCH of Haas, Mittnik and Paolella (2004b), which is a different formulation than the model of Henneke, Rachev and Fabozzi (2006). 3 A thorough theoretical and empirical comparison of MS and NM-GARCH models is done by Haas, Mittnik and Paolella (2004b) by evaluating the fit of these models for three currencies. Using the BIC criterion, NM models are preferred in 2 out of 3 cases. When the forecasting performance of these models is studied, then again in most cases the NM models proved to dominate. The only criterion according to which MS models are (marginally) preferred is the fit of the conditional higher moments.

5

The rest of the paper is organized as follows: Section II defines the general asymmetric normal mixture

GARCH model and describes two sources of asymmetry in these models; Section III characterizes the

equity index data for four major equity markets and presents the estimation methodology; Section IV

reports our empirical results. We start with our estimations for fifteen different GARCH models,

including two-state asymmetric and symmetric normal mixture GARCH models, and symmetric and

skewed t-GARCH with both symmetric and asymmetric variance processes. Then several model selection

criteria are applied to identify the best model. Choosing the two-state normal mixture AGARCH and GJR

models as best overall, the estimated parameters are used to infer various aspects of volatility behaviour in

European stock markets. Section V simulates prices of European calls and puts of various maturities and

compares the implied equity index skews of the FTSE index simulated by symmetric and asymmetric

GARCH models with one and two states. Section VI summarizes and concludes.

II The Asymmetric Normal Mixture GARCH Model

The model extends the GARCH processes studied in Haas, Mittnik and Paoella (2004a) and Alexander

and Lazar (2006) to include a leverage effect in the GARCH variance equations. Specifically, the model

has one equation for the mean and K variance equations. For simplicity the conditional mean equation is

written yt = εt, but this can easily be extended to allow for explanatory variables because their coefficients

can be estimated separately. The error term εt is assumed to have a conditional normal mixture density

with zero mean, which is a weighted average of K normal density functions with different means and

variances. We write:

( )2 21 1 1 1ε ~ ,..., ,μ , ...,μ ,σ , ...,σt t K K t KtI NM p p− ,

1 11, μ 0

K K

i i ii i

p p= =∑ ∑= = (1)

and the conditional density of the error term is:

( ) ( )1

η ε φ εK

t i it ti

p=∑= (2)

where φit represent normal density functions at time t with different means μi and different time-varying

variances 2σ it for i = 1,…, K. The mixing law ( )1 , ..., Kp p p ′= has elements that may be interpreted as the

relative frequency of each state occurring over a long period of time; for reasons of identifiability we

assume that 1 ≥ p1 ≥ p2 ≥… ≥ pK ≥ 0. There are K conditional variance components and these can follow

any GARCH process, but for the purpose of this paper we assume that there are three possibilities:

(i) NM-GARCH(1,1): 2 2 2

1 1σ ω α ε β σit i i t i it− −= + + for i = 1,…, K (3)

(ii) NM-AGARCH(1,1) (based on the Engle, 1990 and Engle and Ng, 1993 model):

( )22 21 1σ ω α ε λ β σit i i t i i it− −= + − + for i = 1,…, K (4)

6

(iii) NM-GJR(1,1) (based on the Glosten et al., 1993 model): 2 2 2 2

1 1 1 1σ ω α ε λ ε β σit i i t i t t i itd−− − − −= + + + for i = 1,…, K; (5)

where td− = 1 if εt < 0, and 0 otherwise.

For the normal densities to be well identified, we also assume that:4

|μi – μj| + |ωi – ωj|+ |αi – αj| + |βi – βj| + |λi – λj| > 0 for i ≠ j.

The main difference between NM-GARCH and the MS-GARCH model of Haas, Mittnik and Paolella

(2004b) is that the latter allows for autocorrelation in the state variable; in other words, for the NM-

GARCH model the transition matrix is of rank 1, so it can be expressed as

( ) ( )1Δ Δ ' where 1, ...,1 'ij t tP p P i j p−⎡ ⎤= = = = = =⎣ ⎦ 1 1

Also, the NM-GARCH adds asymmetry by allowing for non-zero means for the different components.

The overall conditional variance can be expressed as

2 2 2

1 1σ σ μ

K K

t i it i ii i

p p= =∑ ∑= + (6)

and, given the data, it is possible to compute an ex-post estimate of the probability of the ith state at an

arbitrary time t, expressed by:

( )( )

,

1

φ ε

φ εi i t

i t K

j j tj

pp

p=∑

= (7)

This is a time-varying probability, giving a realistic ex-post inference about the probability of state i

occurring at time t. Thus at any point in time we have an ex-ante probability of the states, given by the

mixing law, and an ex-post estimate of the state-probabilities, given by the above formula.

The weak stationarity conditions for the two models are stated in the Appendix. For K > 1, the existence

of second, third and fourth moments are assured by imposing less stringent conditions than in the single

component (K = 1) models. Namely, it is not necessary to assume that each component is stationary in

the conventional sense. For instance, it may happen that the more volatile component would be

considered non-stationary according to standard interpretation (i.e. αi + βi > 1 for that component).

However, the model has to be considered as a whole, and interpreting its components in isolation would

be misleading. 5

4 Also note that we do not require that ωi > 0; actually it often happens that the ω parameter of one of the components is slightly negative. However, as long as it is small in absolute value, it does not affect the positivity of the variance series (also, see Alexander and Lazar, 2006) 5 When the models are restricted to NM-GARCH(1,1), then the stationarity conditions are equivalent to those given by Alexander and Lazar (2006). Also, they are equivalent to those given by Haas, Mittnik and Paolella (2004a) and Liu (2006) when the ωi > 0

7

There are two distinct sources of asymmetry in the model:

• Persistent Asymmetry: This arises in all three models when the conditional returns density is a

mixture of normal density components having different means; it is generated by the difference

between the expected returns under different market circumstances. The Appendix shows that

even the unconditional density will have non-zero skewness, and that this increases with the

differentiation of the component means. For instance, when K = 2, there is positive skewness in

the overall conditional returns density if the component with the higher probability has low

volatility and a negative mean (as it generally happens), and negative skewness in the overall

conditional returns density if the component with the higher probability has low volatility and a

positive mean.6

• Dynamic Asymmetry: This only occurs in models (ii) and (iii) and is due to the leverage

parameters in the component variance processes. If the leverage parameter λi is positive, the

conditional variance in this component is higher following a negative unexpected return than

following a positive unexpected return. In equity markets, where a negative unexpected return

increases the leverage of a firm we expect positive values of the leverage parameter. However,

some studies (e.g. Koutmos et al., 1998 and Chortareas et al., 2000) report negative leverage

coefficients in stock indices of developing economies.7

Taken together, these two sources of skewness in the physical conditional returns density offer a much

richer structure for capturing the shape of equity index skews than is given by traditional GARCH models.

The unconditional skewness and excess kurtosis are both non-zero and the conditional higher moments

are also non-zero and time-varying. See the Appendix for the derivation of the unconditional higher

moments.

The conditional normal mixture densities can be interpreted as merely a device to increase the flexibility of

the returns density, because the model will capture the behaviour of returns during different regimes and

identify the long-run probability of each state, based on time series data. The model is considerably easier

to estimate than the class of Markov switching GARCH models introduced by Hamilton and Susmel

(1994) even with the restrictions and improvements introduced by Cai (1994), Gray (1996), Klaassen

condition is imposed (in this case the stationarity condition given in the Appendix reduces to n > 0 only). Since the GJR-GARCH is a special case of the model of Liu (2007), its stationarity conditions can be deducted from Liu’s results. When we also assume that ωi > 0, then our stationarity condition reduces to n > 0 (see Appendix), which is equivalent to the weak stationarity results of Liu (2007) restricted to NM-GJR models 6 As suggested by one of the referees, this can be seen by expressing the third moment as

( )( ) ( )( )3 2 23 1 2 2 1 1 2 1 2 1 2μ μ 3 μ μ σ σm p p p p⎡ ⎤= − − + − −⎣ ⎦ .

7 Note that the ith variance component depends on the dispersion of the unexpected return, not around its mean μi in the individual density, but around the overall mean 0. Hence there is a third effect that induces skewness in each component conditional return density but not in the overall conditional return density.

8

(2002) and Haas, Mittnik and Paoella (2004b). The difficulty with estimating most of these models (except

the last one) lies in the co-dependencies of the state variances. However, the normal mixture GARCH

models considered here have a very straightforward relationship between the individual variances, because

they are tied to each other only through their dependence on the error term. Hence an advantage of

normal mixture GARCH is that quite complex volatility feedback mechanisms can be included and the

model remains easy to estimate.

III Data and Parameter Estimation

Our results are based on the daily closing prices of four major European equity market indices: CAC40

(total number of observations is 3733), DAX30 (3730), FTSE100 (3739) and DJ Eurostoxx 50 (3810)

from 1 January 1991 to 21 October 2005. This period encompasses a number of stock market crises,

including the Asian crisis in 1997, the Russian crisis in 1998, the technology market crash in 2000 and the

terrorist attack on the US in 2001. Table 1 summarises the general characteristics of the daily returns. The

skewness is negative and the excess kurtosis is positive and in most indices these are highly significant. At

first, it can be noticed that during this period the FTSE index was less volatile in general than the other

markets.

Table 1: Summary Statistics of the Stock Market Indices8

CAC DAX FTSE Eurostoxx

Volatility 21.24% 22.82% 16.41% 20.13%

Skewness –0.099* –0.271*** –0.113** –0.166***

Excess kurtosis 2.86*** 4.03*** 3.19*** 4.25***

Maximum moment exponent 5.58** 4.59*** 6.70* 5.66***

For each index, we estimate the conditional variance parameters on the residuals εt from constant

conditional mean equations.9 Then, maximizing the likelihood function, or equivalently, maximizing

( ) ( )1

|ε ln η εT

tt

L=∑= ⎡ ⎤⎣ ⎦θ

the optimal parameter values are found, given the data. The updating formula has the following form,

where g is the gradient vector, H the Hessian matrix and s represents the step-length:

( ) ( )11m m m ms

−

+ = − ⎡ ⎤⎣ ⎦θ θ H θ g θ

8 The standard errors (s.e.) of the sample estimates of the mean, variance, skewness and excess kurtosis parameters are as follows: s.e. of the sample mean = σ T , s.e. of the sample variance = 22 σ T , s.e. of the sample skewness ≈ T/6 , s.e. of the sample

excess kurtosis ≈ T/24 , where T represents the total number of observations. In Table 1 *** represent results significantly different from zero at the 0.1% level, ** at 1% and * at 5%. The last row reports the HKKP estimate for the maximum moment exponent; here ***, ** and * denote results that are significantly higher than 4 at 0.1%, 1% and 5% significance level, respectively. 9 We fitted ARMA models to the series; the best fitting model for all series was an ARMA(0,0) model. Thus we only removed the means of the series and continued with the demeaned series.

9

To compute the Hessian matrix and the gradient vector we can use either analytic or numerical first and

second order derivatives of the likelihood – see the Appendix for the numerical derivatives.10

IV Empirical Results

IV.1 Overview of Models

We fitted three symmetric and twelve asymmetric GARCH models to the equity index data. The first nine

models have a single variance component and the last six models have two variance components.11 The

models are:12

A. Models with normally distributed errors:

(1) GARCH

(2) AGARCH

(3) GJR

B. Models with symmetric Student’s t distributed errors:

(4) GARCH

(5) AGARCH

(6) GJR

C. Models with skewed Student’s t distributed errors:

(7) GARCH

(8) AGARCH

(9) GJR

D. Normal mixture GARCH models with zero means in the mixture component densities:

(10) NM-GARCH

(11) NM-AGARCH

(12) NM-GJR

E. General normal mixture GARCH models:

(13) NM-GARCH

(14) NM-AGARCH

(15) NM-GJR 10 The results were generated using C++ and Ox version 3.30 (Doornik, 2002) and the G@rch package version 3.0 (Laurent, S. and Peters, J.-P., 2002) 11 Several restricted versions of the normal mixture GARCH model were also fitted to the data (assuming a constant variance component or assuming constant difference between the two variance processes) – but these performed quite badly according to some of the selection criteria and thus these models are not discussed here. 12 All models are GARCH(1,1) specifications. To simplify notation, this will be implied in the following. Models (2), (3), (5), (6), (7), (11), (12) and (13) have one source of asymmetry, whilst models (8), (9), (14) and (15) have two sources. Also, all normal mixture models will be based on two normal distributions in the mixture, as important conclusions can be drawn from such a setup – see also Marcucci (2005) and Bauwens, Preminger and Rombouts (2007). We do not consider models with more than two components as those would increase the number of parameters from 10 to at least 16 and, as the results of Haas, Mittnik and Paolella (2004a,b) and Alexander and Lazar (2006) clearly indicate, in the case of three-component mixture models convergence is harder to achieve and also these models prove to be over-parameterized when it comes to out-of-sample forecasting.

10

The estimation results are reported in Tables 2 – 5.13 The upper figure in each cell reports the parameter

estimate and the lower figure is the t-ratio. Note that in these tables the first row reports the degrees of

freedom for the t-GARCH models (4) – (9) and the weight of the first component in the NM-GARCH

models for models (10) – (15). Similarly, the third row reports the skewness parameter for models (7) –

(9) and the mean of the first normal density for the normal mixture GARCH models.

IV.2 Ex-post State Probabilities

Figure 1 presents the daily Eurostoxx index returns for the period Jan 2004 – October 2005. It is expected

that the ex-post state probabilities will indicate a jump from the first state (characterized by a lower

variance) to the second state (characterized by a larger variance) when the absolute returns increase in

magnitude – or, from the second state to the first one, when absolute returns decrease. Figure 2 shows

that this is indeed the case – the graph presents the estimated conditional volatilities of the two

components plotted against the time-varying ex-post probability of the first state. Note that when the

returns increase in absolute value after a relatively tranquil period, the ex-post probability of the first state

decreases, suggesting a switch from the first, less volatile state to the second state. In such a situation both

conditional volatilities show an upward jump. The volatilities exhibit a downward jump when the switch is

from the second to the first state.

IV.3 Model Selection:

The following selection criteria are used:

(a) Bayesian Information Criterion: The model with the lowest BIC is chosen.

(b) Moment specification tests: Following Newey (1985), we test for normality in the standardized

residuals, checking the first four moments and for zero autocorrelations in their first four powers,

using a Wald test. Although this test has the disadvantage that it tends to reject the null too often

in small samples (Godfrey, 1988), this test was used for GARCH-type processes by Nelson

(1991), Harvey and Siddique (1999) and Brooks, Burke and Persand (2005) and a multivariate test

was developed by Ding and Engle (2001)14. We proceed with the transformation:

( )( )1

1Φ Φ ε

K

t i i ti

u p−

=∑=

where Φ is the cumulative normal density function; if the model is the true DGP, then this

transformation should lead to i.i.d. standard normal series.15 We test a total of 20 conditions (the

13 Note that the results in Tables 2 – 5 are for variance-annualized unexpected returns. That is daily returns are pre-multiplied by √250 before estimation. Thus, volatilities are quoted in annualized terms. 14 Interestingly, the author have shown using Monte Carlo simulations that the small sample properties of this test’s statistics are good. 15 A similar transformation was used by Haas, Mittnik and Paolella (2004a). This transformation is different from the one of Lanne and Saikkonen (2003), where the resulting series have unit variance and preserve autocorrelation, but are not i.i.d. normal.

11

first four moments of ut and the first four autocorrelations in its first four powers) and the test

statistics for the moment tests have a χ2(1) distribution. The tables report the number of tests

(out of 20) that are rejected at 1% significance level.

(c) Unconditional density fit: The density test is on the histogram fit between the model simulated

data and the original data. This is one of the most difficult tests for models to pass as it tests for

the unconditional distributional fit. The model returns are simulated16 and their histogram is

estimated using a nonparametric kernel approach. Several alternatives are available for the kernel,

our chosen function being that of Epanechnikov (1969). Then the model selection criterion is

based on the modified Kolmogorov-Smirnov (KS) statistic (Kolmogoroff, 1933, Smirnov, 1939,

Massey, 1951 and Khamis, 2000).

(d) ACF analysis: In contrast to (c) this test captures the dynamic properties of the model squared

returns – namely, the fit to the empirical autocorrelations of the squared returns. The Appendix

states the theoretical autocorrelation functions (ACF) of the different models and we apply the

Mean Squared Error (MSE) criterion to assess the fit of the models.

(e) Value at Risk (VaR) analysis: The VaR tests developed by Christoffersen (1998) are

performed. Using the estimated parameters, the in-sample VaR estimates are computed. Based on

these values, the occurrences of exceeding the VaR are counted. This is achieved by building

several indicator functions. The first one equals zero except when a return lower than minus the

VaR occurs, when it equals one:

| ;α;α

1,0,

t t t jt

r VaRI

otherwise−< −⎧

= ⎨⎩

, where α is the level of significance

Then there are a set of four indicator functions:

1;α ;α, ;α

1,0,

t tij t

I i and I jJ

otherwise− = =⎧

= ⎨⎩

, for i, j = 0, 1

The three test statistics, for unconditional coverage, independence and conditional coverage are as

follows:

( )0 ;α 1;α

2;α

α α

1 α α2 log ~ χ 11 π π

n n

ucLR⎛ ⎞⎛ ⎞ ⎛ ⎞−⎜ ⎟= − ⎜ ⎟ ⎜ ⎟⎜ ⎟−⎝ ⎠ ⎝ ⎠⎝ ⎠

( )00 ;α 01;α 10 ;α 11;α

2;α 2;α 2;α 2;α 2;α

01;α 01;α 11;α 11;α

1 π π 1 π π2 log ~ χ 1

1 π π 1 π π

n n n n

indLR⎛ ⎞⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞− −⎜ ⎟= − ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟⎜ ⎟− −⎝ ⎠ ⎝ ⎠ ⎝ ⎠ ⎝ ⎠⎝ ⎠

( )2;α ;α ;α ~ χ 2cc uc indLR LR LR= +

16 We simulate returns, based on the estimated parameters, and to ensure that the simulated density is not affected by small sample size we use 50,000 replications. Also, to avoid any influence of the starting values, each simulation has 1000 steps ahead in time but we only use the last simulated return.

12

where 1;α ;α1

T

tt

n I=∑= ; 0;α 0;αn T n= − ; α 1;απ /n T= ; ;α , ;α

1

T

ij ij tt

n J=∑= ;

( )01;α 01;α 00;α 01;απ /n n n= + ; ( )11;α 11;α 10;α 11;απ /n n n= + ; ( )2;α 01;α 11;απ /n n T= +

For simplicity, only the first two test statistics are reported and analyzed.

The results of these specification tests are shown in the last four rows of Tables 2 – 5 and in Table 6;

these can be interpreted as follows:

(a) Bayesian Information Criterion: Most series favour the skewed t-AGARCH and skewed t-GJR

models, except the FTSE for which the symmetric t-GJR model is preferable. According to the

BIC, normal mixture models are outperformed by t-GARCH models. Asymmetric NM-GARCH

models fare better than symmetric models, and also models with non-zero means have a lower

BIC than normal mixture models that assume zero means for the normal components. However,

this is a very simple criterion and it ignores the models’ ability to capture the characteristics of the

data.

(b) Moment specification tests: These tests show that the most basic models, i.e. (1) – (3) do not

capture the higher moments. But beyond this observation, the moment tests do not distinguish

well between the models. We find that most models have several rejections for these tests. None

of the models performs consistently well based on these moment tests.

(c) Unconditional density fit: This shows a clear preference for the NM-GARCH models (13),

(14) and (15) (i.e. with different component means). We conclude that it is necessary to model

both sources of asymmetry. It is important to notice that the t-GARCH models perform quite

badly according to this criterion. Often they have very unrealistic (negative) unconditional

kurtosis estimates, suggesting that for these models the fourth moment does not exist. To test for

the existence of the fourth moment, we performed the HKKP procedure derived by Huisman et

al. (2001, 2002) which estimates the maximum moment exponent based on the method

introduced by Hill (1975). The estimates for the maximum exponent are found in Table 1.17 As

the results illustrate, we can reject the hypothesis that the 4th moment does not exist for all series.

(d) ACF analysis: This test also favours the asymmetric NM-GARCH models. Again we see that

models based on the t distribution perform badly in many situations and quite often the ACF

estimates of these models are negative. This is probably because these models do not capture the

fourth moment. By contrast, all the normal mixture models perform very well according to this

criterion.

17 We have used OLS for the estimation of the maximum exponent instead of WLS because the WLS procedure suggested by Huisman et al. (2001, 2002) actually increased the level of heteroscedasticity of the residuals. To correct for heteroscedasticity we used White’s heteroscedasticity consistent covariance matrix. We used the 500 most extreme observations in the HKKP regression.

13

(e) VaR analysis: The independence and conditional coverage test results in Table 6 show that

normal GARCH models are in general very unsuitable for VaR estimations. Additionally,

restricted NM-GARCH models fail some of the VaR tests, again proving that they are not

suitable for VaR calculations. Symmetric and skewed t-GARCH models fail only few of the tests

but the NM-AGARCH and NM-GJR models achieve the best results, the latter passing both tests

for all indices.

In summary, the normal GARCH model is the worst fit by all criteria and the Student’s t-GARCH models

fit well according to the BIC criteria but often don’t capture the fourth moment of the returns,

performing badly according to the density fit and ACF criteria. The normal mixture models with different

component means fit better than those with identical means. We conclude that the in-sample fit of a two-

state normal mixture GARCH model with two sources of asymmetry, both persistent and dynamic, is

superior to the other models considered. However, for the dynamic asymmetry (i.e. the leverage effect) it

is not possible to decide which asymmetric specification for the two volatility components is preferred –

this depends on the specific index considered.

IV.4 Regime Specific Volatility Behaviour

Each of the two-state normal mixture AGARCH models reveals a lower volatility component that occurs

with a high probability (the ‘usual’ component) and a high volatility component with a very low

probability (the ‘crash’ component). Table 7 summarizes the characteristics of these volatility components

in each of the four European stock markets during the sample period considered. The results in this table

are based on the NM-AGARCH model (14). To obtain the annualized mean returns we multiply the

mean estimates by √250.

In each market the ‘usual’ component is characterized by a high associated probability, a positive mean

return and a low volatility. The ‘usual’ mean return is lowest in the FTSE, at only 4.6% per annum but the

unconditional volatility is also low: at 15% it is the lowest of all. Notably the Eurostoxx index has the

highest return (8.5%) and the second lowest volatility of the four markets (17%). On the other hand, the

CAC and DAX indices have higher volatility, around 21%, but their mean return is less than that of

Eurostoxx during their usual state. However, they have been in their ‘usual’ regime more often than the

FTSE and Eurostoxx: for the latter a ‘usual’ regime occurred only about 95% of the time between 1991

and 2005. In the ‘usual’ regime the least reactive and most persistent volatility is that of the FTSE series

but the Eurostoxx and DAX display slightly more reaction and less persistence than the other indices. The

leverage effect is most pronounced in the CAC index.

The ‘crash’ market regime occurred between about 2% and 5% of the time, depending on the index. The

CAC and the DAX have the highest unconditional volatility (over 40%) and the lowest return during a

14

crash (it is over –350% in annual terms for the CAC). This is probably because the CAC and DAX indices

are less liquid than the FTSE and Eurostoxx. By comparison, during the 1987 crisis the FTSE lost over

400% in annual terms. Yet in our sample the ‘crash’ mean returns are around –100% for the FTSE and –

160% for the Eurostoxx. All indices, and the FTSE and CAC in particular, are highly reactive to market

shocks in the ‘crash’ regime, yet the effect dies out soon because the persistence parameters are all low,

especially on the FTSE. The leverage effects are similar to the ‘usual’ leverage effects for the DAX and

Eurostoxx, but the highly reactive FTSE and CAC indices have an inverse leverage effect in a crash

regime. This indicates that a positive return will lead to higher volatility than a negative return, but only

during the crash regime. A possible explanation of this is that investors anticipate further falls after a

modest recovery.

Table 7: Summary of Regime Specific Behaviour in Volatility

CAC DAX FTSE Eurostoxx

‘Usual’ Component

Probability 0.978 0.970 0.958 0.949 Annualized Mean Return 7.9% 6.5% 4.6% 8.5% Unconditional Volatility 21% 21% 15% 17% Volatility Reaction (α1) 0.0524 0.064 0.0482 0.0642

Volatility Persistence (β1) 0.9311 0.922 0.9362 0.9206 Leverage Effect (λ1) 0.1058 0.0639 0.0899 0.0443

‘Crash’ Component

Probability 0.022 0.030 0.042 0.051 Annualized Mean Return -353% -210% -106% -161% Unconditional Volatility 44% 43% 27% 32% Volatility Reaction (α2) 1.5497 0.3729 0.9401 0.2288

Volatility Persistence (β2) 0.6172 0.7155 0.5821 0.8332 Leverage Effect (λ2) -0.0311 0.0470 -0.0192 0.0641

V Equity Index Implied Volatility Skews

Out-of-the-money put options on an equity index are an attractive form of insurance for investors that

fear a general market decline; and with index option market makers in relatively short supply, the market

prices of these options are often far higher than the Black-Scholes (1973) model prices based on the at-

the-money volatility. Consequently the implied volatility of these options is commonly found to be higher

than the implied volatility of at-the-money call and put options and out-of-the-money calls. This leads to

the skew (or ‘smirk’) in equity index implied volatility that has been very pronounced since the global

stock market crash in 1987, as shown by Bates (1991), Rubinstein (1994), Jackwerth and Rubinstein

(1996), Derman and Kamal (1997), Tompkins (2001) and many others. This equity index implied

volatility skew is associated with a negatively skewed implied risk neutral returns density.

Since the global crash of 1987 the skewness in risk neutral index densities has, in general, been much

greater than the skewness estimated from historical data on stock index returns. The difference between

the physical and risk neutral skews has been the subject of extensive academic research, as in Bakshi et. al

(2003), Bates (1997, 2000) and many others. These papers argue that a volatility risk aversion adjustment

15

is necessary to reconcile the physical and risk neutral distributions of equity indices. The volatility risk

aversion can be recovered using empirical pricing kernels based either on unconditional historical returns,

as in Ait-Sahalia and Lo (2000) and Jackwerth (2000) or on conditional returns densities, as in Rosenberg

and Engle (2002). However, Bates (2003) argued that the difference between the risk-neutral and real-

world distributions cannot be explained, except perhaps by the existence of a time-varying volatility risk

premium.

One conclusion of this research was that the index skew is too pronounced and too persistent to verify

the standard approach and that more sophisticated time series models of the conditional densities of index

returns were required. The asymmetric normal mixture GARCH process considered here provides just

such a model: since time-varying conditional skewness and kurtosis are endogenous the model volatility

skew exhibits a term structure even in the physical measure. With single state GARCH models this is

impossible. The uncertainty over two possible volatility states, each of which exhibits volatility clustering

but with quite different characteristics, provides a strong justification for the inclusion of a time-varying

volatility risk premium.

A natural question to ask, therefore, is how these properties are reflected in the equity skews implied by

these models. In this section we compare the implied volatility skews in the physical measure generated by

asymmetric normal mixture GARCH models with those implied by other GARCH models. As we shall

see, leverage effects are clearly very important. In all markets we find very pronounced normal mixture

GARCH skews, even in the absence of a risk premium, and the skew persistence that is captured by the

difference in the means of the variance components is only of secondary importance.

For illustration we use the parameter estimates of the FTSE index returns given in Table 5 for a

representative selection of five of the fifteen models: normal GARCH, asymmetric t-GARCH, asymmetric

t-GJR, NM-GARCH and NM-GJR models. We simulate volatility skew surfaces using each of these five

models and compare their characteristics. Starting with S0 = 100 and using r = 0.03, we simulate the

dynamics of the index value as:

( )( )2Δ exp σ / 2 Δ ε Δt t t t tS S r t t−= − +

Then, for a fixed strike K and maturity T the time zero price of a European call option is computed as

( ) ( )( )exp max 0, TrT E S K− − . Simulating 50,000 times and computing the average call value gives the

estimate of the option price. Then, applying the inverse Black-Scholes formula gives the simulated implied

volatility at (K, T). We take a range of strikes between 80 and 130 and a range of maturities from 3 to 18

months. The results are shown in Figure 3.

16

We have used the same vertical scale from 10% – 25% volatility for each smile, as this makes the

comparison easier. The first two skews, from the normal and skewed t-GARCH models are unrealistic.

The volatility level is too low and there is no evidence of a negative skew even for the skewed t-GARCH

model. The GJR skew in figure (c) is more realistic, with substantially higher volatility for ITM calls than

OTM calls. Still, there is little evidence of a volatility term structure as, for a given moneyness, volatility is

almost constant as maturity increases. The NM-GARCH without leverage effect has more of a term

structure and the short-term skew is not exactly linear with respect the moneyness, a feature that is quite

common in index markets. Finally, the NM-GJR model produces the most realistic skew: it is more

pronounced and less linear than in the single component GJR model, and there is more variation of

volatility over time. We conclude that including non-zero means in the components of the mixture

provides a skew that persists into longer dated options but it is the leverage effect that primarily

determines the slope of the skew for short dated options.

VII Summary and Conclusions

The majority of generalised autoregressive conditional heteroscedasticity (GARCH) models specify a

single time-varying volatility state and thus offer only one scenario for market behaviour. Also their

standardized higher moment specification is not realistic: time-variation in conditional skewness and

kurtosis is ignored except in a few instances where it is specified exogenously to the GARCH model. This

paper considers the normal mixture GARCH model with two volatility states and endogenous time-

varying conditional higher moments and introduces additional skewness to model the asymmetry due to

leverage effects in equity markets.

We first ask whether this additional source of asymmetry is necessary, given that normal mixture

GARCH(1,1) models with different mean returns already exhibit time-varying conditional skewness and

kurtosis, in contrast to single-state GARCH models. The answer to this question is undoubtedly yes. Both

the statistical criteria and the simulations of the index skew justify the addition of both unconditional and

dynamic types of asymmetry. The different component means give a non-zero unconditional skewness,

but the addition of dynamic asymmetry is very highly significant and dramatically improves the time series

fit of the normal mixture GARCH models.

Empirical results on four European indices compare the fit of these models to single-state normal and

symmetric and skewed t-GARCH specifications. The overall conclusion from applying five statistical

criteria to fifteen fitted models is a clear superiority of asymmetric normal mixture GARCH models.

These models are then used to provide considerable insight to the regime specific behaviour of the

European equity indices. Between 1991 and 2005 the relative frequency of a ‘crash’ regime varied from

about 2% for the CAC to about 5% for the Eurostoxx50. In the ‘usual’ regime the indices have broadly

similar behaviour (except that leverage effects are strongest in the CAC and weakest in the Eurostoxx) but

17

the crash regime behaviour is more diverse. The CAC index displays the most extreme crash behaviour,

returning –350% in annual terms with an unconditional volatility of 44%; by contrast the FTSE returned

only –100% and the unconditional volatility is far lower. All indices are highly sensitive to market shocks

in the crash regime, particularly the CAC, but persistence in volatility is very low indicating that volatility

quickly returns to more normal levels. The leverage effect in the CAC and the FTSE is positive (but small)

during a crash period, indicating that investors can anticipate further falls after only a modest recovery.

Although the Eurostoxx has the highest crash probability, in the ‘usual’ regime it has the highest mean

return (8.5%) and a low unconditional volatility (17%), although this is not as low as the FTSE volatility.

Since the crash behaviour of the Eurostoxx is also the least extreme, apart from the FTSE, it may be the

best investment alternative of the indices considered.

If agents’ beliefs about the future can be informed by past behaviour, the rich structure that is revealed

from an estimation of a two-state asymmetric normal mixture GARCH model could provide invaluable

information not only to investors, but also to risk managers and policy makers. Stress tests are commonly

applied using ad hoc rules, such as changing volatilities to 100% without associating any probability to this

event. The NM-GARCH model that we have implemented in this paper has the potential to provide

detailed, objective stress tests of equity risk factor volatility and this is clearly an interesting subject for

further research. Finally we have demonstrated that single-state GARCH models imply unrealistic shapes

for the implied index skew surface. However, asymmetric normal mixture GARCH models, even without

a volatility risk premium, provide a rich behavioural structure that can match the empirical characteristics

of implied volatility skew surfaces. We conclude that option pricing based on two-state asymmetric

GARCH models is another area suitable for further research.

18

References

Abramson, A. and I. Cohen (2007): ‘On the Stationarity of Markov Switching GARCH Processes’, Econometric Theory 23, 485-500

Ait-Sahalia, Y. and A.W. Lo (2000): ‘Nonparametric Risk Management and Implied Risk Aversion’, Journal of Econometrics 94, 9-51.

Alexander, C. and E. Lazar (2006): ‘Normal Mixture GARCH(1,1): Applications to Exchange Rate Modelling’, Journal of Applied Econometrics 21, 307-336

Bai, X., J.R. Russell, and G.C. Tiao (2001): ‘Beyond Merton’s Utopia (I): Effects of Non-normality and Dependence on the Precision of Variance Using High-frequency Financial Data’, University of Chicago GSB Working Paper July 2001

Bai, X., J.R. Russell, and G.C. Tiao (2003): ‘Kurtosis of GARCH and Stochastic Volatility Models with Non-normal Innovations’ Journal of Econometrics 114(2), 349-360

Bakshi, G., N. Kapadia and D. Madan (2003): 'Stock Return Characteristics, Skew Laws, and the Differential Pricing for Individual Equity Options’, The Review of Financial Studies 16, 101-143

Bates, D.S. (1991): ‘The Crash of '87: Was It Expected? The Evidence from Options Markets’, Journal of Finance 46, 1009-1044

Bates, D.S. (1997): ‘The Skewness Premium: Option Pricing under Asymmetric Processes’, Advances in Futures and Options Research 9, 51-82

Bates, D.S. (2000): ‘Post-'87 Crash Fears in the S&P 500 Futures Option Market’, Journal of Econometrics 94, 181-238

Bates, D.S. (2003): ‘Empricial Option Pricing: A Retrospection’, The Journal of Econometrics 116, 387-404

Bauwens, L., C.S. Bos, and H.K. van Dijk (1999): ‘Adaptive Polar Sampling with an Application to a Bayes measure of Value-at-Risk’, Tinbergen Institute Discussion Paper TI 99-082/4

Bauwens, L., C.Hafner and J. Rombouts (2007): ‘Multivariate Mixed Normal Conditional Heteroskedasticity’, Computational Statistics and Data Analysis 51, 3551-3566

Bauwens, L., A. Preminger and J. Rombouts (2006): ‘Regime Switching GARCH Models’, CORE Discussion Paper 2006/11

Bauwens, L., A. Preminger and J. Rombouts (2007): ‘Theory and Inference for a Markov-Switching GARCH Model’, CIRPÉE Working Paper 07-33

Bauwens, L. and J.V.K. Rombouts (2007): ‘Bayesian Inference for the Mixed Conditional Heteroskedasticity Model’, The Econometrics Journal 10, 408-425

Bekaert, G. and G. Wu (2000): ‘Asymmetric Volatility and Risk in Equity Markets’, The Review of Financial Studies, 13(1), 1-42

Black, F. and M. Scholes (1973): ‘The Pricing of Options and Corporate Liabilities’, Journal of Political Economy 81, 637-659

Bollerslev, T. (1986): ‘Generalized Autoregressive Conditional Heteroskedasticity’, Journal of Econometrics 31, 309-328

Bollerslev, T. (1987): ‘A Conditionally Heteroskedastic Time Series Model for Speculative Prices and Rates of Return’, Review of Economics and Statistics 69, 542-547

Brännäs, K. and J.G. de Gooijer (2004): ‘Asymmetries in Conditional Mean and Variance: Modelling Stock Returns by asMA-asQGARCH’, Journal of Forecasting 23, 155-171

19

Brooks, C., S.P. Burke and G. Persand (2005): ‘Autoregressive Conditional Kurtosis’, Journal of Financial Econometrics 3(3), 1-23

Cai, J. (1994): ‘A Markov Model of Switching-Regime ARCH’, Journal of Business & Economic Statistics 12, 309-316

Christoffersen, P. (1998): ‘Evaluating Interval Forecasts’, International Economic Review 39, 841-862

Christoffersen, P., S. Heston and K. Jacobs (2006): ‘Option Valuation with Conditional Skewness’, Journal of Econometrics 131, 253-284

Chortareas, G.E., J.B. McDermott and T.E. Ritsatos (2000): ‘Stock Market Volatility in an Emerging Market: Further Evidence from the Athens Stock Exchange, Journal of Business Finance and Accounting 27(7), 983-1002

Derman, E. and M. Kamal (1997): ‘The Patterns of Change in Implied Index Volatilities’, Quantitative Strategies Research Notes, Goldman Sachs

Ding, Z. and R.F. Engle (2001): ‘Large Scale Conditional Covariance Matrix Modelling, Estimation and Testing’, Academia Economic Papers 29(2), 157-184

Ding, Z. and C.W.J. Granger (1996): ‘Modeling Volatility Persistence of Speculative Returns: A New Approach’, Journal of Econometrics 73, 185-215

Ding, Z., C.W.J. Granger and R.F. Engle (1993): ‘Long Memory Property of Stock Market Returns and a New Model’, Journal of Empirical Finance 1, 83-106

Doornik, J.A. (2002): ‘Object-Oriented Matrix Programming Using Ox’, 3rd ed. London, Timberlake Consultants Press and Oxford, available from www.nuff.ox.ac.uk/Users/Doornik

Dueker, M.J. (1997): ‘Markov Switching in GARCH Processes and Mean Reverting Stock Market Volatility’, Journal of Business and Economic Statistics 15, 26-34

Engle, R.F. (1982): ‘Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of United Kingdom Inflation’, Econometrica 50(4), 987-1007

Engle, R.F. (1990): ‘Stock Volatility and the Crash of ’87: Discussion’, The Review of Financial Studies 3(1), 103-106

Engle, R.F. and V.K. Ng (1993): ‘Measuring and Testing the Impact of News on Volatility’, Journal of Finance 48, 1749-1778

Epanechnikov, V. (1969): ‘Nonparametric Estimation of a Multidimensional Probability Density’, Teoriya Veroyatnostej i Ee Primeneniya 14, 156-162

Fernandez, C. and M. Steel (1998): ‘On Bayesian Modelling of Fat Tails and Skewness’, Journal of the American Statistical Association 93, 359-371

Fong, P.W., W.K. Li and H.-Z. An (2006): ‘A Simple Multivariate ARCH Model Specified by Random Coefficients’, Computational Statistics & Data Analysis 51, 1779-1802

Francq, C., Roussignol, M. and J.-M. Zakoïan (2001): ‘Conditional Heteroskedasticity Driven by Hidden Markov Chains’, Journal of Time Series Analysis 22, 197-220

Francq, C. and J.-M. Zakoïan (2005): ‘The L2-Structures of Standard and Switching-Regime GARCH Models’, Stochastic Processes and their Applications 115, 1557-1582

Francq, C. and J.-M. Zakoïan (2007): ‘Deriving the Autocovariances of Powers of Markov-Switching GARCH Models, with Applications to Statistical Inference’, forthcoming in Computational Statistics & Data Analysis

Godfrey, L.G. (1988): ‘Misspecification Tests in Econometrics’, Cambridge University Press

20

Glosten, L. R, R. Jagannathan and D.E. Runkle (1993): ‘On the Relationship between Expected Value and the Volatility of Excess Returns on Stocks’, Journal of Finance 48, 1779-1801

Gray, S. F. (1996): ‘Modelling the Conditional Distribution of Interest Rates as a Regime-Switching Process’, Journal of Financial Economics 42, 27-62

Haas, M., S. Mittnik and M.S. Paolella (2004a): ‘Mixed Normal Conditional Heteroskedasticity’, Journal of Financial Econometrics 2, 211-250

Haas, M., S. Mittnik and M.S. Paolella (2004b): ‘A New Approach to Markov Switching GARCH Models,” Journal of Financial Econometrics, 2, 493-530

Haas, M., S. Mittnik and M.S. Paolella (2006): ‘Multivariate Normal Mixture GARCH’, Center for Financial Studies Discussion Paper 2006/09

Hamilton, J.D. and R. Susmel (1994): ‘Autoregressive Conditional Heteroscedasticity and Changes in Regime’, Journal of Econometrics 64, 307-333

Hansen, P.R., A. Lunde and J.M. Nason (2003): ‘Choosing the Best Volatility Models: The Model Confidence Set Approach’, Oxford Bulletin of Economics and Statistics 65(Supplement), 839-861

Harvey, C.R. and A. Siddique (1999): ‘Autoregressive Conditional Skewness’, Journal of Financial and Quantitative Studies, Vol. 34, No. 4, 465-487

Henneke, J.S., S.T. Rachev and F.J. Fabozzi (2006): ‘MCMC Based Estimation of Markov Switching ARMA-GARCH Models’, University of Karlsruhe Working Paper

Hentschel, L. (1995): ‘All in the Family: Nesting Symmetric and Asymmetric GARCH Models’, Journal of Financial Economics 39, 71-104

Hill, B.M. (1975): ‘A Simple General Approach to Inference about the Tail of a Distribution’, The Annals of Statistics 3(5), 1163-1174

Huisman, R., K.G. Koedijk, C.J.M. Kool and F. Palm (2001): ‘Tail-Index Estimator in Small Sample’, Journal of Business and Economic Statistics 19, 208-216

Huisman, R., K.G. Koedijk, C.J.M. Kool and F. Palm (2002): ‘The Tail-Fatness of FX Returns Reconsidered’, De Economist 150, 299-312

Hwang, S.Y. and I.V. Basawa (2004): ‘Stationarity and Moment Structure for Box-Cox Transformed Threshold GARCH(1,1) Processes’, Statistics & Probability Letters 68, 209-220

Jackwerth, J. (2000): ‘Recovering Risk Aversion from Option Prices and Realized Returns’, Review of Financial Studies 13, 433-451.

Jackwerth, J.C. and M. Rubinstein (1996): ‘Recovering Probabilities from Option Prices’, Journal of Finance 51, 1611-1631

Khamis, H. J. (2000): ‘The Two-Stage δ-Corrected Kolmogorov-Smirnov Test’, Journal of Applied Statistics 27(4), 439-450

Klaassen, F. (2002): ‘Improving GARCH Volatility Forecasts with Regime-Switching GARCH’, Empirical Economics 27, 363-394

Kolmogoroff, A. (1933): ‘Sulla Determinazione Empirica Di Una Legge Di Distribuzione’, Giornale dell’Istituto Italiano degh Attuari 4, 83-91

Koutmos, G., C. Negakis and P. Theodossiou (1993): ‘Stochastic Behaviour of the Athens Stock Exchange’, Applied Financial Economics 3(2), 119-126

Lanne, M. and P. Saikkonen (2003): ‘Modeling the U.S. Short-Term Interest Rate by Mixture Autoregressive Processes’, Journal of Financial Econometrics 1, 96-125

21

Lanne, M. and P. Saikkonen (2007): ‘Modeling Conditional Skewness in Stock Returns’, forthcoming in The European Journal of Finance

Laurent, S. and J.-P. Peters (2002): ‘G@RCH 2.2: an Ox Package for Estimating and Forecasting Various ARCH Models’, Journal of Economic Surveys 16(3), 447-485

Liu, J.C. (2006): ‘Stationarity of a Markov-Switching GARCH Model’, Journal of Financial Econometrics 4(4), 573-593

Liu, J.C. (2007): ‘Stationarity for a Markov-Switching Box-Cox Transformed Threshold GARCH Process’ Statistics and Probability Letters 77, 1428-1438

Marcucci, J. (2005): ‘Forecasting Stock Market Volatility with Regime-Switching GARCH Models’, Studies in Nonlinear Dynamics and Econometrics 9(4), Article 6

Massey, F. J. Jr. (1951): ‘The Kolmogorov-Smirnov Test for Goodness of Fit’, Journal of the American Statistical Association 46(253), 68-78

Nelson, D.B. (1990): ‘Stationarity and Persistence in the GARCH(1,1) Model’, Econometric Theory 6, 318-334

Nelson, D.B. (1991): ‘Conditional Heteroscedasticity in Asset Returns: a New Approach’, Econometrica 59, 347-370

Newey, W.K. (1985): ‘Generalized Method of Moments Specification Testing’, Journal of Econometrics 29, 229-256

Rosenberg, J.V. and R.F. Engle (2002): ‘Empirical Pricing Kernels’, Journal of Financial Economics 64, 341–372.

Rubinstein, M. (1994): ‘Implied Binomial Trees’, Journal of Finance 49, 771-818

Schwert, G.W. (1989): ‘Why Does Stock Market Volatility Change Over Time?’, Journal of Finance 44, 1115-1153

Sentana, E. (1995): ‘Quadratic ARCH Models’, Review of Economic Studies 62, 639-661

Smirnov, N. (1939): ‘Sur Les Ecarts De La Courbe De Distribution Empirique’ (Russian, French summary), Matematicheskii Sbornik 48(6), 3-26

Taylor, S. (1986): ‘Modelling Financial Time Series’, John Wiley & Sons

Tompkins, R.G. (2001): ‘Implied Volatility Surfaces: Uncovering Regularities for Options on Financial Futures’, The European Journal of Finance 7, 198-230

Vlaar, P.J.G. and F.C. Palm (1993): ‘The Message in Weekly Exchange Rates in the European Monetary System: Mean Reversion, Conditional Heteroscedasaticity, and Jumps’, Journal of Business and Economic Statistics 11(3), 351-60

Wong, C.S. and W.K. Li (2001): ‘On a Mixture Autoregressive Conditional Heteroscedastic Model’, Journal of the American Statistical Association 96 (455), 982-995

Wu, G. (2001): ‘The Determinants of Asymmetric Volatility’, The Review of Financial Studies 14(3), 837-859

Zakoïan, J.-M. (1994): ‘Threshold Heteroskedastic Models’, Journal of Economic Dynamics & Control 18(5), 931-955

22

Appendix: Theoretical Properties of Asymmetric NM-GARCH Models

I: Moments

We use the following notations: ( ) ( )2 2ε σt tx E E= = and ( )2σi ity E= for i = 1, …, K.

Taking expectations of (3) and (6) yields the result for NM-AGARCH and taking expectations of (4) and

(6) yields the result for the NM-GJR model. We get:

( ) ( )

*2

1 12 2

1

ωμ(1 β )ε σδ1

(1 β )

K Ki i

i ii i i

t tK

i i

i i

pp

x E Ep

= =

=

∑ ∑

∑

+−

= = =⎛ ⎞

−⎜ ⎟−⎝ ⎠

,

( ) ( )1 *1 β ω δi i i iy x−= − + i = 1,…, K

where 2

* ω α λ for NM - AGARCHω

ω for NM -GJRi i i

ii

⎧ += ⎨

⎩

( )( )α for NM - AGARCH

δα λ 1 ρ for NM -GJR

i

ii i tE d−

⎧⎪= ⎨ + +⎪⎩

and ρ is the correlation between td− and 2εt .18

The third moment is ( ) ( ) ( )3 3 2

1 1ε ε μ 3 μ

K K

t i i t i i i ii i

h E p E p y= =∑ ∑= = = + and the skewness can be expressed as:

3/2

hs

x= .

The excess kurtosis in both models is:

( )( )

4

2 22

εκ 3 3

εt

t

E zxE

= − = − where ( )1

41

3 'ε1 3 't

sz E

−

−

−= =

−p B f

p B g

The fourth moment uses the following notation:

( )1 , , 'Kp p=p … ( )2 2 4

16μ μ

K

i i i ii

s p y=∑= +

( )

21 1 1 11 1 1 12 1 1 1

22 2 21 2 2 2 22 2

21 2

1 β 2δ β 2δ β 2δ β2δ β 1 β 2δ β 2δ β

2δ β 2δ β 1 β 2δ β

K

K K Kij

K K K K K K K K K KK

e e e

e e eb

e e e

⎡ ⎤− − − −⎢ ⎥− − − −⎢ ⎥= =⎢ ⎥⎢ ⎥

− − − −⎢ ⎥⎣ ⎦

B

……

…

where and ij ij je a p=

18 In our empirical application we have approximated ( )tE d − by 0.5 and ρ by 0.

23

and ( )

11 2 1 2

1 1 1 2 11

21 2 1 2

12 1 2 22

1 1 2 2

11 2

δ ββ δ δ β11 β β 1 β β 1 β β

δ βδ β β δ11 β β 1 β β 1 β β

δ β δ β β δ11 β β 1 β β 1 β β

KK Kk k

k k Kk

KK Kk k

k k Kij k

KK K k K k

kK K K kk K

pp p

pp p

a

p p p

=≠

=≠

=≠

∑

∑

∑

⎡ ⎤− − −⎢ ⎥− − −⎢ ⎥⎢ ⎥⎢ ⎥− − −

− − −⎢ ⎥= =⎢ ⎥⎢ ⎥⎢ ⎥⎢ ⎥− − −

− − −⎢ ⎥⎣ ⎦

A

…

…

…

1 1 1 1γ 2δ β

γ 2δ βK K K K

d

d

+⎛ ⎞⎜ ⎟= ⎜ ⎟⎜ ⎟+⎝ ⎠

g and 1 1 1 12δ β

2δ βK K K K

w c

w c

+⎛ ⎞⎜ ⎟= ⎜ ⎟⎜ ⎟+⎝ ⎠

f

Also: 1 1

δ δ1 β β

K K k j ki ij

j k j kk j

pd a

= =≠

∑ ∑⎛ ⎞⎜ ⎟=⎜ ⎟−⎝ ⎠

and 2

2

α for NM - AGARCHγ

α λ δ for NM -GJRi

ii i i

⎧= ⎨

+⎩

Furthermore:

( ) ( ) ( )2 2 2 2 4 2 2

2

ω 2ω α 6α λ α λ 4λ 2ω α λ 2 ω α λ β for NM - AGARCH

ω 2ω δ 2ω β for NM - GJRi i i i i i i i i i i i i i i i

i

i i i i i

x h yw

x y

⎧ + + + − + + +⎪= ⎨+ +⎪⎩

1 1 1 β βK K k jk

i ij jj k j kk j

p rc a y q

= =≠

∑ ∑⎡ ⎤⎛ ⎞⎢ ⎥⎜ ⎟= +⎢ ⎥⎜ ⎟−⎝ ⎠⎣ ⎦

with 2

1μ

K

k kk

q p=∑=

( ) ( ) ( )

( ) ( )( )

2 2

2 2 2 2 2 2

ω ω ω α ω α α α λ 4λ λ λ 2α α λ λfor NM - AGARCH

β ω α λ β ω α λ ω α λ ω α λ α α λ λ

ω ω ω δ ω δ β ω β ω for NM -GJR

i k i k k i i k i i k k i k i k

ik i i k k k k k i i i i k k k i i i k i k

i k i k k i i i k k k i

x h

r y y

x y y

⎧ ⎡ ⎤+ + + + + − +⎣ ⎦⎪⎪= ⎨+ + + + + + +⎪

+ + + +⎪⎩

II: Parameter Constraints

We require the following set of conditions for the existence and non-negativity of variance and the

finiteness of the third moment.19 These conditions are required for the existence and positivity of the

overall variance and individual variances as well. However, as kindly noted by one of the referees, these

conditions don’t guarantee the existence of the third moment. These conditions only assure that, if the

third moment exists, then it will be finite. Nelson �1990� gave sufficient and necessary conditions for

the existence of the moments of the GARCH�1,1� model, proving that for the normal GARCH model

given by the equation

( )2 2 21 1σ ω αη β σt t t− −= + +

the pth moment of the error term exists iff

19

24

( ) /221αη β 1.

p

tE −⎡ ⎤+ <⎢ ⎥⎣ ⎦

For example, when we assume that α = 0.09875 and β = 0.9 then the second moment exists, but the third

and higher moments do not. Unfortunately, no such sufficient and necessary conditions exist for the NM-

GARCH model. Also, no straightforward parameter constraint exists for the existence and finiteness of

the fourth moment; we simply put ( )40 εtE< < ∞ .

In both the NM-AGARCH and NM-GJR models we require that:

( )*

2

1 1

ωμ 01 β

K Ki i

i ii i i

pm p

= =∑ ∑= + >

−,

( )( )1

1 δ β0

1 βK i i i

i i

pn

=∑

− −= >

− and *ω δ 0i i

mn

+ >

Also, we must have that 1

10 1, 1, , 1, 1, 0 α , 0 β 1

K

i i i ii

p i K p−

=∑< < = − < < ≤ <… .

III: Numerical Derivatives of the Asymmetric NM-GARCH Models

The only difference from the NM-GARCH model numerical derivatives (Alexander and Lazar, 2006) is

the first and second order derivatives of 2σ it with respect to γi and these are as follows:

2 21σ σβit it

i−∂ ∂

= +∂ ∂it

i i

zγ γ

; 2 2 2 2

1σ σβ' '

it itit iw −∂ ∂

= +∂ ∂ ∂ ∂i i i iγ γ γ γ

with Tit it itw A A= + .

NM-AGARCH: ( ) ( )( )2 21 1 11, ε λ , 2α ε λ ,σ 't i i t i it− − −= − − −itz . The starting values (t = 0) are:

( )2

2 2 20σ 1, λ , 2α λ , 'ii i is s

∂= +

∂ iγ, where

2

2 1ε

T

ttsT

=∑

=

1

'21

0 0 0 00 0 0 00 2(ε λ ) α 0

σ

t i iit

it

A −

−

⎡ ⎤⎢ ⎥⎢ ⎥⎢ ⎥− −=⎢ ⎥

⎛ ⎞⎢ ⎥∂⎜ ⎟⎢ ⎥∂⎝ ⎠⎣ ⎦iγ

with ( )

2 22 20

2 2 2

0 0 0 10 0 2λ λσ 10 2λ 2α 2α λ' 1 β1 λ 2α λ 2

i ii

i i i ii

i i i

s

s s

⎡ ⎤⎢ ⎥+∂ ⎢ ⎥=⎢ ⎥∂ ∂ −⎢ ⎥+⎣ ⎦

i iγ γ.

NM-GJR: ( )2 2 21 1 1 11,ε , ε ,σ 't t t itd−

− − − −=itz . The starting values for this expression (t = 0) are:

( )2

2 2 20σ 1, , 0.5 , 'i s s s∂

=∂ iγ

, where

2

2 1ε

T

ttsT

=∑

=

25

'21

0 0 0 00 0 0 00 0 0 0

σit

it

A

−

⎡ ⎤⎢ ⎥⎢ ⎥⎢ ⎥=⎢ ⎥

⎛ ⎞⎢ ⎥∂⎜ ⎟⎢ ⎥∂⎝ ⎠⎣ ⎦iγ

with ( )

22 20

2

2 2 2

0 0 0 10 0 0σ 10 0 0 0.5' 1 β1 0.5 2

i

i

s

s

s s s

⎡ ⎤⎢ ⎥∂ ⎢ ⎥=⎢ ⎥∂ ∂ −⎢ ⎥⎣ ⎦

i iγ γ.

IV: ACF of the Squared Errors in the Asymmetric NM-GARCH Models20

( ) ( )( )2 2 2 2 2 2

2 224 22

ε ,ε ε ερ ε ,ε

εεt t k t t k k

k t t ktt

Cov E x c xCorr

z xE xVar− −

−

⎡ ⎤ − −⎣ ⎦= = = =−⎡ ⎤ −⎣ ⎦

, where

2 2 2 2 2 2

1 1 1 1ε ε μ σ ε μ

K K K K

k t t k i i i it t k i i i iki i i i

c E x p p E x p p b− −= = = =∑ ∑ ∑ ∑⎡ ⎤ ⎡ ⎤= = + = +⎣ ⎦ ⎣ ⎦

NM-AGARCH: ( )( )

21 1

21 0 0

ω α λ α β , 1

ω α λ α β 2α λ , 1

ik i i i i k i ik

i i i i i i i i i

b x c b k

b x c b h k

− −= + + + >

= + + + + − =

NM-GJR: ( ) 1 1ω α 0.5λ βik i i i k i ikb x c b− −= + + +

The starting values are: 0c z= and ( )10 'i i i ib c d z z−= + + +e B f g .

20 Since the variance of the NM(K)-GARCH(1,1) model can be expressed as a GARCH(K,K) variance, according to Bollerslev (1986) the autocorrelations can also be written as an AR(K) process.

26

Table 2. Estimation results for the CAC 40 Index

Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) p1(d.f. for (4)-(9)) 11.5408 12.2000 12.1752 12.1359 12.6697 12.7269 0.9356 0.9790 0.9765 0.9141 0.9780 (8.60) (8.88) (8.79) (7.99) (8.27) (8.13) (40.62) (78.45) (70.10) (30.24) (79.76)

μ1(Sk..for (7)-(9)) -0.0724 -0.0728 -0.0751 0.0084 0.0050 (-2.96) (-2.99) (-3.08) (2.48) (2.63) ω1 6.4E-4 9.6E-5 5.9E-4 3.9E-4 -8.2E-5 4.6E-4 3.9E-4 -5.8E-5 4.7E-4 2.2E-4 -1.2E-4 4.0E-4 1.7E-4 -1.1E-4 (6.43) (0.66) (5.84) (3.24) (-0.43) (4.01) (3.24) (-0.31) (4.04) (2.44) (-0.75) (4.06) (2.06) (-0.69) α1 0.0663 0.0495 0.0166 0.0600 0.0545 0.0166 0.0606 0.0553 0.0173 0.0486 0.0517 0.0157 0.0446 0.0524 (9.83) (10.97) (3.02) (7.95) (7.60) (2.17) (8.05) (7.66) (2.26) (7.59) (7.90) (2.25) (7.20) (8.07) λ1 0.1060 0.0664 0.1082 0.0732 0.1067 0.0732 0.1094 0.0692 0.1058 (7.08) (7.91) (5.89) (6.25) (5.87) (6.26) (6.17) (6.57) (6.22) β1 0.9186 0.9345 0.9348 0.9310 0.9319 0.9346 0.9303 0.9312 0.9340 0.9386 0.9313 0.9344 0.9432 0.9311 (112.80) (150.07) (141.81) (108.12) (113.70) (115.80) (108.16) (112.93) (115.80) (124.47) (121.15) (123.20) (131.68) (122.93) p2 0.0644 0.0210 0.0235 0.0859 0.0220 μ2 -0.0890 -0.2235 ω2 0.0148 0.0087 0.0174 0.0085 -0.0018 (1.70) (0.18) (0.35) (1.88) (-0.12) α2 0.5600 1.6171 1.5287 0.5340 1.5497 (0.87) (0.52) (0.45) (1.33) (0.85) λ2 -0.0375 -0.4967 -0.0311 (-0.45) (-0.33) (-0.62) β2 0.6794 0.6336 0.6352 0.6893 0.6172 (2.72) (1.04) (0.95) (4.33) (1.68) Unconditional σ 20.55% 20.18% 19.65% 20.86% 20.22% 19.47% 20.69% 20.55% 19.79% 20.80% 21.30% 20.30% 20.81% 21.77%

Unconditional σ1 19.46% 20.35% 19.39% 19.23% 20.74%

Unconditional σ2 34.89% 47.97% 43.86% 31.93% 43.71%

Unconditional τ -0.0881 -0.0861 -0.0883 -4.32E-6 -1.41E-5 -1.33E-5 -0.172 -0.235

Unconditional k 1.2503 0.8485 1.1207 3.3426 2.6090 4.8650 – – – 4.0435 12.8803 6.0332 4.8605 11.8355

BIC -0.4634 -0.4728 -0.4722 -0.4842 -0.4941 -0.4931 -0.4844 -0.4943 -0.4934 -0.4768 -0.4851 -0.4838 -0.4794 -0.4875

Moment tests 1% 3 3 3 2 1 1 3 1 1 4 3 3 4 1

Density 1.3101 1.4771 0.9944 0.9228 1.1713 1.5195 0.8417 0.9824 0.8340 0.7382 0.8662 0.9965 0.5757 1.1570

ACF 0.0500 0.1848 0.1033 0.3976 0.0677 0.3012 – – – 0.0665 0.0375 0.0463 0.1077 0.0393

27

Table 3. Estimation results for the DAX 30 Index

Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) p1(d.f. for (4)-(9)) 9.002572 9.537122 9.596825 9.392559 9.8487 9.9290 0.9660 0.9762 0.9680 0.9613 0.9701 0.9604 (11.72) (11.94) (11.80) (11.18) (11.40) (11.25) (100.65) (118.45) (98.95) (92.86) (97.75) (89.04)

μ1(Sk..for (7)-(9)) -0.1056 -0.1064 -0.1069 0.0044 0.0041 0.0043 (-4.41) (-4.43) (-4.46) (2.29) (2.12) (2.19) ω1 8.8E-4 4.7E-4 8.9E-4 3.4E-4 1.4E-4 4.4E-4 3.5E-4 1.7E-4 4.6E-4 1.5E-4 3.2E-5 2.5E-4 1.4E-4 2.9E-5 2.3E-4 (8.02) (3.80) (7.83) (3.37) (0.95) (4.05) (3.42) (1.17) (4.17) (2.32) (0.29) (3.18) (2.21) (0.27) (3.11) α1 0.0825 0.0668 0.0320 0.0789 0.0772 0.0413 0.0787 0.0776 0.0414 0.0608 0.0664 0.0363 0.0595 0.0640 0.0355 (11.59) (13.59) (4.93) (8.48) (8.27) (3.83) (8.53) (8.33) (3.92) (9.19) (8.98) (4.24) (9.18) (8.92) (4.26) λ1 0.0764 0.0768 0.0629 0.0731 0.0628 0.0730 0.0633 0.0549 0.0639 0.0538 (7.82) (7.53) (4.64) (5.01) (4.69) (5.00) (4.93) (4.63) (4.89) (4.54) β1 0.8982 0.9135 0.9080 0.9155 0.9142 0.9121 0.9153 0.9137 0.9121 0.9282 0.9200 0.9216 0.9291 0.9220 0.9230 (100.01) (134.74) (115.60) (96.70) (96.17) (93.55) (97.01) (96.16) (93.91) (125.62) (113.55) (113.67) (127.09) (114.79) (115.37) p2 0.0340 0.0238 0.0320 0.0387 0.0299 0.0396 μ2 -0.1096 -0.1328 -0.1039 ω2 0.0354 -0.0150 0.0394 0.0259 0.0335 0.0317 (0.49) (-0.04) (0.48) (0.59) (0.39) (0.59) α2 0.4442 0.2226 -0.1441 0.3941 0.3729 -0.1193 (0.58) (0.31) (-0.31) (0.70) (0.48) (-0.71) λ2 0.7468 0.8197 0.0470 0.7002 (0.35) (0.63) (0.19) (0.75) β2 0.7120 0.4451 0.7110 0.7345 0.7155 0.7155 (1.30) (0.84) (1.18) (1.75) (1.04) (1.48) Unconditional σ 21.31% 20.86% 20.22% 24.85% 22.75% 20.97% 24.10% 23.31% 21.45% 21.83% 21.52% 20.11% 21.40% 21.71% 20.22%

Unconditional σ1 20.60% 20.54% 18.98% 20.11% 20.59% 19.01%

Unconditional σ2 44.30% 46.39% 41.64% 40.68% 42.70% 38.01%

Unconditional τ -0.1517 -0.1467 -0.1464 -7.97E-6 -8.40E-6 -1.01E-5 -0.1678 -0.1697 -0.1673

Unconditional k 1.6567 1.1240 1.5100 – 12.6898 – – 1.7023 – 5.1276 3.5368 3.0636 4.6434 3.9313 2.7148

BIC -0.4339 -0.4393 -0.4410 -0.4884 -0.4925 -0.4927 -0.4912 -0.4955 -0.4957 -0.4853 -0.4878 -0.4875 -0.4870 -0.4891 -0.4893

Moment tests 1% 2 2 3 2 1 2 0 1 1 3 2 2 2 2 2

Density 2.2739 2.3948 1.9538 1.4085 1.8255 2.4724 1.2864 1.4249 1.3516 1.0503 1.2268 2.1285 0.9681 1.0154 1.5470

ACF 0.1889 0.3591 0.3116 – 0.9764 – – 0.0570 – 0.0584 0.1345 0.1800 0.0443 0.0620 0.2201

28

Table 4. Estimation results for the FTSE 100 Index

Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) p1(d.f. for (4)-(9)) 13.7316 14.1768 14.0745 14.4429 14.6183 14.6046 0.9462 0.9598 0.9587 0.9288 0.9583 0.9564 (6.47) (6.67) (6.74) (6.42) (6.59) (6.69) (32.97) (42.96) (40.14) (1.00) (42.80) (38.70)

μ1(Sk..for (7)-(9)) -0.0679 -0.0657 -0.0669 0.0049 0.0029 0.0036 (-2.87) (-2.80) (-2.85) (1.00) (1.68) (1.97) ω1 2.5E-4 -4.7E-5 2.4E-4 2.1E-4 -1.2E-4 2.1E-4 2.0E-4 -1.1E-4 2.1E-4 1.3E-4 -1.7E-4 1.6E-4 1.1E-4 -1.6E-4 1.6E-4 (4.40) (-0.54) (5.39) (3.35) (-1.10) (4.25) (3.34) (-1.02) (4.26) (2.52) (-1.74) (3.72) (2.24) (-1.65) (3.78) α1 0.0734 0.0558 0.0155 0.0674 0.0547 0.0114 0.0674 0.0547 0.0122 0.0570 0.0493 0.0074 0.0532 0.0482 0.0069 (10.94) (8.54) (2.42) (8.09) (7.23) (1.61) (8.19) (7.32) (1.75) (7.47) (7.25) (1.25) (7.17) (7.26) (1.21) λ1 0.0785 0.0781 0.0862 0.0820 0.0849 0.0810 0.0902 0.0756 0.0899 0.0738 (6.39) (8.96) (5.63) (7.28) (5.66) (7.26) (6.07) (7.72) (6.03) (7.73) β1 0.9171 0.9316 0.9350 0.9247 0.9331 0.9379 0.9247 0.9333 0.9379 0.9305 0.9348 0.9416 0.9344 0.9362 0.9432 (118.21) (129.34) (137.35) (101.89) (112.72) (121.93) (103.05) (114.24) (122.91) (109.31) (118.63) (133.69) (1.00) (121.85) (137.95) p2 0.0538 0.0402 0.0413 0.0712 0.0417 0.0436 μ2 -0.0632 -0.0670 -0.0796 ω2 0.0101 0.0109 0.0113 0.0056 0.0076 0.0051 (1.19) (0.83) (0.65) (1.00) (0.78) (0.78) α2 0.7933 0.9230 0.9226 0.6722 0.9401 0.9888 (2.12) (1.70) (1.82) (1.00) (1.88) (1.94) λ2 -0.0109 -0.0159 -0.0192 -0.3350 (-0.24) (-0.02) (-0.44) (-0.54) β2 0.5785 0.5644 0.5786 0.6250 0.5821 0.6658 (2.82) (2.12) (1.70) (1.00) (2.59) (4.37) Unconditional σ 16.24% 15.38% 15.05% 16.13% 15.17% 14.79% 15.95% 15.37% 15.01% 15.86% 15.21% 14.81% 15.72% 15.51% 15.24%

Unconditional σ1 15.01% 14.49% 14.03% 14.74% 14.74% 14.39%

Unconditional σ2 26.69% 27.26% 27.27% 24.36% 27.04% 26.89%


Unconditional k 3.9916 1.5162 3.0582 5.5489 2.7292 8.2826 – – – 7.8609 3.8048 3.6864 7.9404 4.3925 4.4979

BIC -1.0583 -1.0689 -1.0704 -1.0687 -1.0794 -1.0812 -1.0685 -1.0791 -1.0810 -1.0638 -1.0738 -1.0760 -1.0657 -1.0750 -1.0777

Moment tests 1% 4 1 1 2 1 2 2 1 0 2 1 1 2 0 0

Density 60.7739 1.2624 0.6579 0.7213 1.2385 1.7367 1.0483 1.1607 0.9953 0.6626 1.0954 1.6242 0.8146 0.8484 0.8520

ACF 0.4478 0.0988 0.1832 0.8115 0.0976 0.6602 – – – 0.2547 0.1275 0.1529 0.2636 0.1087 0.1239

29

Table 5. Estimation results for the Eurostoxx 50 Index

Model (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) p1(d.f. for (4)-(9)) 8.1884 8.5490 8.5869 8.5988 8.8517 8.9319 0.9506 0.9567 0.9571 0.9444 0.9493 0.9490 (10.91) (10.77) (10.86) (10.33) (10.22) (10.27) (69.16) (82.20) (78.95) (64.15) (66.80) (67.90)

μ1(Sk..for (7)-(9)) -0.1017 -0.1029 -0.1049 0.0057 0.0054 0.0055 (-4.25) (-4.31) (-4.39) (3.21) (2.98) (3.07) ω1 4.5E-4 1.6E-4 4.5E-4 2.2E-4 9.1E-5 2.7E-4 2.3E-4 1.1E-4 2.8E-4 1.1E-4 4.0E-5 1.6E-4 1.1E-4 5.4E-5 1.7E-4 (8.23) (1.88) (8.30) (3.34) (0.95) (3.89) (3.38) (1.18) (4.01) (2.30) (0.60) (2.95) (2.26) (0.80) (3.03) α1 0.0691 0.0586 0.0299 0.0743 0.0720 0.0410 0.0752 0.0734 0.0419 0.0624 0.0661 0.0394 0.0605 0.0642 0.0384 (11.79) (13.24) (4.98) (8.77) (8.52) (4.02) (8.89) (8.61) (4.19) (9.37) (9.26) (4.64) (9.17) (8.97) (4.55) λ1 0.0709 0.0630 0.0508 0.0623 0.0504 0.0635 0.0428 0.0510 0.0443 0.0520 (7.70) (6.97) (4.24) (4.61) (4.32) (4.70) (4.01) (4.57) (4.14) (4.66) β1 0.9167 0.9270 0.9230 0.9209 0.9208 0.9198 0.9198 0.9194 0.9185 0.9249 0.9208 0.9203 0.9261 0.9206 0.9196 (131.78) (168.25) (147.83) (109.75) (110.01) (108.43) (109.38) (108.94) (107.88) (124.03) (119.86) (117.61) (121.98) (112.35) (110.74) p2 0.0494 0.0433 0.0429 0.0556 0.0507 0.0510 μ2 -0.0967 -0.1018 -0.1030 ω2 0.0140 -0.0055 0.0253 0.0098 0.0084 0.0089 (0.85) (-0.07) (0.67) (1.03) (0.48) (0.58) α2 0.3306 0.2710 -0.0625 0.3155 0.2288 0.1122 (0.88) (0.67) (-0.20) (1.05) (0.64) (0.26) λ2 0.4416 0.6205 0.0641 0.1900 (0.77) (0.60) (0.39) (0.69) β2 0.7860 0.4909 0.7008 0.7996 0.8332 0.8416 (3.37) (2.92) (1.63) (4.44) (3.24) (3.15) Unconditional σ 17.89% 17.78% 17.02% 21.60% 19.57% 18.06% 21.17% 20.37% 18.80% 18.65% 18.26% 17.32% 18.21% 18.47% 17.64%

Unconditional σ1 17.43% 17.27% 16.25% 16.91% 17.28% 16.43%

Unconditional σ2 34.54% 33.29% 33.09% 31.77% 32.09% 31.09%


Unconditional k 1.5502 1.2574 1.5431 – 14.7870 – – 3.7850 – 5.1022 3.2070 2.8952 4.7779 3.8641 3.3103

BIC -0.7481 -0.7553 -0.7547 -0.7979 -0.8014 -0.8015 -0.8008 -0.8045 -0.8047 -0.7895 -0.7910 -0.7912 -0.7927 -0.7936 -0.7943

Moment tests 1% 2 2 3 2 2 2 2 1 2 4 4 4 3 2 3

Density 2.1384 2.2230 1.8409 1.1173 1.5988 2.2568 1.0173 1.1309 1.0852 1.2707 1.2732 1.4062 0.7634 0.9386 0.7297

ACF 0.2479 0.4044 0.3427 – 0.8580 – – 0.2339 – 0.0508 0.3303 0.3867 0.0720 0.1501 0.2778

30

Notes for tables 2, 3, 4 and 5: The models are: (1) GARCH, (2) AGARCH and (3) GJR, all three with normally distributed errors; (4) GARCH, (5) AGARCH and (6) GJR, all three with symmetric Student’s t distributed errors; (7) GARCH, (8) AGARCH and (9) GJR, all three with skewed Student’s t distributed errors; (10) NM-GARCH, (11) NM-AGARCH and (12) NM-GJR, all three normal mixture GARCH models with zero means in the mixture component densities; (13) NM-GARCH, (14) NM-AGARCH and (15) NM-GJR, all three general normal mixture GARCH models. Numbers in parenthesis represent t-values. A ‘–’ indicates negative kurtosis, or an unreasonably high value for the ACF statistic; meaning that in that model the fourth moment is not defined.

31

Table 6. In-sample VaR results

Test Significance Data (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15)

CAC 7.4*** 6.6** 7.4*** 2.3 1.9 1.5 0.0 0.1 0.2 2.8* 5.2** 5.2** 0.0 0.8

1% DAX 5.9** 5.2** 3.9** 0.4 0.6 0.8 0.0 0.1 0.0 2.4 4.6** 4.6** 0.0 0.6 0.2

FTSE 2.8* 2.3 1.5 1.1 0.0 0.2 0.0 0.2 0.3 1.5 1.1 0.6 0.0 0.1 0.0

EUROS 14*** 9.1*** 9.1*** 2.9* 0.6 0.6 0.0 0.0 0.0 6.7*** 2.9* 4.6** 0.0 0.0 0.0

CAC 9.0*** 11*** 17*** 0.1 0.0 0.1 1.3 1.9 3.7* 0.9 5.8** 5.8** 0.0 0.0

Independence 0.5% DAX 14*** 11*** 13*** 2.0 0.9 1.4 0.2 0.2 0.0 0.9 2.6 2.6 2.7* 1.9 0.8

FTSE 7.8*** 4.0** 5.8** 1.4 0.3 0.9 0.0 0.0 0.0 2.6 0.9 1.9 0.0 0.1 0.2

EUROS 19*** 13*** 15*** 3.7* 3.0 3.0 1.7 0.2 0.8 4.5** 0.8 1.2 0.2 0.2 0.1

CAC 7.2*** 5.3** 7.2*** 3.7* 2.3 2.3 0.2 1.2 0.4 1.0 0.2 0.2 1.0 1.0

0.1% DAX 14*** 9.3*** 9.3*** 2.3 0.4 0.4 1.2 0.4 0.0 0.2 1.2 1.2 1.0 1.0 0.2

FTSE 14*** 11*** 14*** 1.2 3.7* 5.3** 0.4 1.2 3.7* 1.2 1.2 1.2 0.4 0.0 0.4

EUROS 41*** 24*** 31*** 1.1 1.1 1.1 0.2 0.0 0.3 0.2 0.2 0.0 1.0 1.0 1.0

CAC 0.0 0.1 1.6 0.2 1.1 1.1 0.8 0.8 0.9 1.3 1.5 1.5 0.7 1.0

1% DAX 4.0** 4.2** 1.8 3.0* 2.8* 2.7 3.7* 4.0** 0.8 2.1 4.4** 4.4** 3.5* 2.8* 0.6

FTSE 1.2 1.2 1.1 1.0 0.7 0.9 0.8 0.7 0.6 1.1 1.0 1.0 0.8 0.8 0.7

EUROS 0.7 1.1 1.1 2.0 2.7* 2.7* 3.6* 0.7 0.7 1.3 1.3 0.1 0.8 0.8 0.8

CAC 0.6 0.7 0.8 0.2 0.2 0.2 0.1 0.1 0.1 0.3 0.5 0.5 0.2 0.2

Conditional 0.5% DAX 3.7* 0.9 0.8 6.5** 2.3 2.1 3.4* 3.4* 3.0* 0.3 0.4 0.4 0.1 0.1 0.1

Coverage FTSE 0.6 0.4 0.5 0.3 0.2 0.3 0.2 0.2 0.2 0.4 0.3 0.3 0.2 0.2 0.2

EUROS 3.0* 0.7 0.7 1.6 0.4 0.4 0.3 0.2 0.3 0.4 0.3 0.3 0.2 0.2 0.2

CAC 0.1 0.0 0.1 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0

0.1% DAX 0.1 0.1 0.1 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0

FTSE 0.1 0.1 0.1 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0

EUROS 0.3 0.2 0.2 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0

Note: The models are: (1) GARCH, (2) AGARCH and (3) GJR, all three with normally distributed errors; (4) GARCH, (5) AGARCH and (6) GJR, all three with symmetric Student’s t distributed errors; (7) GARCH, (8) AGARCH and (9) GJR, all three with skewed Student’s t distributed errors; (10) NM-GARCH, (11) NM-AGARCH and (12) NM-GJR, all three normal mixture GARCH models with zero means in the mixture component densities; (13) NM-GARCH, (14) NM-AGARCH and (15) NM-GJR, all three general normal mixture GARCH models. The reported test statistics follow a χ2(1) distribution. *, ** and *** indicate values significant at 10%, 5% and 1% significance level, respectively.

32

Fig. 1. The Returns on the Eurostoxx Index

-0.5

-0.4

-0.3

-0.2

-0.1

0

0.1

0.2

0.3

0.4

0.5

Jan-2004 Jan-2005

Fig. 2. The Conditional Volatilities and the Ex-post Time-varying Probability of the First State

for the Eurostoxx index

0%

50%

100%

Jan-2004 Jan-2005-1

-0.5

0

0.5

1

volatility 1

volatility 2

p 1

33

Fig. 3. Simulated FTSE Index Skews up to 3 Months

(a) GARCH

(b) GARCH with skewed t distribution

(c) GJR with skewed t distribution

34

(d) NM-GARCH

(e) NM-GJR

Note: Implied volatility skews for call options are simulated using 50,000 runs. Zero interest rate is assumed. The parameters for the simulation are taken from Table 3.

stock volatility regimes - carol alexander · 1 i introduction modelling regimes in market...

Documents