extreme value theory extreme value theory for a 1-in-200 event · • comparison of evt to other...

17
15/11/2011 1 The Life Conference Seth Eshun Will Machin James Sharpe Andrew Smith The Life Conference - Seth Eshun, Will Machin, James Sharpe, Andrew Smith Members of Extreme Events Working Party Extreme Value Theory © 2010 The Actuarial Profession www.actuaries.org.uk Extreme Value Theory For a 1-in-200 event 21 st November 2011 Contents Introduction to EVT How to fit EVT to a given set of data Case study – EVT fitted to FTSE 100 Comparison of EVT to other methods of determining 1 in 200 Simulation based exercise to validate EVT 1

Upload: others

Post on 10-Mar-2021

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

1

The Life Conference Seth Eshun Will Machin James Sharpe Andrew SmithThe Life Conference - Seth Eshun, Will Machin, James Sharpe, Andrew SmithMembers of Extreme Events Working Party

Extreme Value Theory

© 2010 The Actuarial Profession www.actuaries.org.uk

Extreme Value TheoryFor a 1-in-200 event

21st November 2011

Contents

• Introduction to EVT

• How to fit EVT to a given set of data

• Case study – EVT fitted to FTSE 100

• Comparison of EVT to other methods of determining 1 in 200

• Simulation based exercise to validate EVT

1

Page 2: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

2

Introduction to EVT

• Extreme Value Theory(EVT) is a statistical approach that allows a practitioner to model the occurrence of extreme events with relatively small amounts of extreme datarelatively small amounts of extreme data.

• A key difference between EVT and other statistical approaches is that, in EVT we fit a distribution to a subset of the available data, whiles in other statistical approaches, we fit a chosen distribution to the entire set of data.

• This difference can be viewed positively or negatively. Negatively as judgement is required in selecting the subset of data to use. Positively, as EVT allows us to concentrate on the part of the distribution we are most interested.

2

Introduction to EVT

• There are two EVT approaches:– Block Maxima Models (BMM): this is the traditional approach and it

involves the following:involves the following:– Grouping the data into samples/blocks;– Calculating the maximum observation in each block;– Fitting the Generalised Extreme Value (GEV) distribution to the maxima of the blocks;

and– Estimating the risk measure we are interested in from the fitted Generalised Extreme

Value (GEV) distribution.

– The Peak over Threshold: this is a more recent technique which involves the following:

– Selecting a threshold that defines which observation are included in modelling;– Calculate the exceedances (this is the excess of the observations over the threshold);– Fit the Generalised Pareto Distribution (GPD) to the exceedances or use the Hill

estimators to fit the excedances ; and – Compute the measure of risk that is desired.

• The POT is preferred over the BMM because data is used more efficiently.

3

Page 3: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

3

Introduction to GPD

• The GPD distribution is based on a concept similar to that of the central limit theorem.

• It assumes that the maximum values of a sample of independent and• It assumes that the maximum values of a sample of independent and identically distributed random variable would approximately follow a GPD distribution.

• Thus GPD is commonly used to model the tail of other distributions. It is defined by three parameters and its cumulative distribution function is:

– μ is the location parameter;

– σ > 0 the scale parameter; and

– ξ is the shape parameter.

4

Introduction to GPD

• The following are examples of tail behaviours where the GPD can be used as an approximation:

Decreasing E ponential s ch as the gamma normal and log normal Achie ed b– Decreasing Exponential: such as the gamma, normal and log-normal. Achieved by setting the shape parameter of the GPD to zero;

– Decreasing Polynomial: such as Student's t and Cauchy. Achieved by setting the shape parameter of the GPD to a positive number;

– Finite: such as the beta distribution. Achieved by setting the shape parameter of the GPD to a negative number.

• As the name suggests, the GPD simplifies to other distributions under specific conditions. For example:p p

– When ξ =0, it simplifies to an exponential distribution;

– When ξ >0, it simplifies to an ordinary Pareto distribution; and

– When ξ <0, it simplifies to a Pareto type II distribution.

5

Page 4: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

4

How to apply EVT to a set of data

Exploratory data l ianalysis

Choose threshold

Fit Generalised Pareto

6© 2010 The Actuarial Profession www.actuaries.org.uk

t Ge e a sed a etoDistribution

Maximum Likelihood

Probability Weighted Moments

How to apply EVT to a set of data

• The first step is to analyse the available data to assess whether the data, is fat tailed enough for us to apply EVT to it. The key tools used in this analysis are the Q-Q plots and the mean excess plots.

Th Q Q l t i hi l f i h th l d t f ll i– The Q-Q plot is a graphical means of assessing whether a sample data follows a given probability distribution.

– The mean excess function express the mean of the excess over a threshold as a function of the threshold. The mean excess function for the GPD is a linear function of the threshold u.

• The next step in applying EVT is selecting an appropriate threshold.

– A lower threshold reduces the variance of the estimates of the GPD model. However, a lower threshold can introduce bias in the data.

– A higher threshold reduces bias but increase the volatility of the estimate of the GPD di t ib tidistribution.

– The mean excess analysis may be used to select an optimum threshold.

– An alternative approach is to fit the GPD to the data using different thresholds.

• Finally the parameters of the GPD can be estimated by any of the the following approaches:

– Maximum likelihood;

– Probability-weighted moments.7

Page 5: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

5

Case Study - Fitting GPD to Equity returns

• This section of the presentation looks at applying the Generalised Pareto distribution (GPD) to equity returns

• Two data sets were used:Data Range

DMS Annual equity returns 1900-2008MSCI Monthly year-on-year returns 1969-2011

8© 2010 The Actuarial Profession www.actuaries.org.uk

Results – GPD distribution

• Estimates of the 99.5% quantile, using a threshold of 9%

Data DMS estimate MSCI estimate

UK ‐42% ‐37%

9© 2010 The Actuarial Profession www.actuaries.org.uk

USA ‐43% ‐40%

Belgium ‐52% ‐66%

Denmark ‐38% -42%

Page 6: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

6

Exploratory data analysis – QQ plots of negative annual returns vs. Exponential (GPD ξ=0) UK

10© 2010 The Actuarial Profession www.actuaries.org.uk

Figure 1: QQ plot of UK simple returns Figure 2: QQ plot of UK log returns

QQ plots - Interpretation

• QQ plot with a straight diagonal line of data points from the bottom left of the chart to the top right indicates an p gexponential distribution is a relatively good fit to the tail of this data

• A concave (ie starting bottom left and curving round in the top half of C shape to a horizontal line) shape to the QQ plot indicates fatter tail than the exponential distribution; so would suggest fitting a GPD with a ξ > 0

11© 2010 The Actuarial Profession www.actuaries.org.uk

; gg g ξ

• A convex shape (ie starting bottom left and curving round in the right half of a U shape to a vertical line) to the QQ plot indicates a thinner tail than the exponential distribution with ξ < 0

Page 7: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

7

QQ plots - Interpretation – UK data

• From the QQ plots, the exponential distributions appears a relatively good fit to the negative annual returnsy g g

• This indicates that annual returns in the UK from 1900 -2008 have not been particularly fat tailed

• Similar results seen for other major EU equity returns, with only German log returns indicating a tail potentially fatter than the exponential

12© 2010 The Actuarial Profession www.actuaries.org.uk

Exploratory data analysis – Mean Excess plots of negative annual returns

13© 2010 The Actuarial Profession www.actuaries.org.uk

Figure 3: QQ plot of UK simple returns Figure 4: QQ plot of UK log returns

Page 8: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

8

Mean Excess Plots - Interpretation

• The mean excess plot is a tool used to aid the choice of threshold and also to determine the adequacy of thethreshold and also to determine the adequacy of the GPD model in practice.

• A characteristic of a fat tailed GPD type distribution with positive shape parameter is a straight line from bottom left to top right of the mean excess plot

• A mean excess plot with a downwards sloping line from

14© 2010 The Actuarial Profession www.actuaries.org.uk

top left to bottom right indicates thin tailed behaviour. A straight horizontal line indicates exponential type behaviour

• There is some significant up and down behaviour in the UK mean excess plots, with a broad upwards trend

Generalised Pareto Distribution – Calibrated to UK annual returns

• The GPD is calibrated to the annual returns based on the points over a certain threshold

• Calibrations are made at each threshold

• Five plots are presented:1. The number of points above each threshold

2. The 99.5th percentile for each calibration at each threshold

3. The shape parameter ξ for each calibration at each threshold

4. The standard error for the shape parameter for each calibration at each

15© 2010 The Actuarial Profession www.actuaries.org.uk

p pthreshold

5. The maximum log likelihood for each calibration at each threshold

Page 9: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

9

Generalised Pareto Distribution – Calibrated to UK annual returns – Maximum Likelihood Estimate

16© 2010 The Actuarial Profession www.actuaries.org.uk

Generalised Pareto Distribution – Calibrated to UK annual returns – Maximum Likelihood Estimate

• Starting in the top left, the first plot shows how the number of points in the tail decreases as the calibration moves further into the tail

The second plot shows the calibration of the 99 5th percentile at each of the• The second plot shows the calibration of the 99.5th percentile at each of the thresholds. We can see how this varies as the calibration moves into the tail. There is a significant spike at the -10% threshold where a number of points are clustered. Otherwise the calibration of the 99.5th percentile is fairly stable with changes in threshold

• The top right plot shows the shape parameter for the calibration at each threshold. This stays around the 0 level for most of the thresholds, spiking upwards as the number of points in the calibration falls

17© 2010 The Actuarial Profession www.actuaries.org.uk

• The bottom left plot shows the standard error of the shape parameter estimate. The standard error rises as the number of points in the calibration falls

• The final plot shows the maximum log likelihood, which may indicate better calibration for higher values

Page 10: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

10

Generalised Pareto Distribution – Calibrated to UK annual returns – Probability Weighted Moments

18© 2010 The Actuarial Profession www.actuaries.org.uk

Generalised Pareto Distribution – Calibrated to UK annual returns – Probability Weighted Moments

• Starting in the top left, the first plot shows how the number of points in the tail decreases as the calibration moves further into the tailtail decreases as the calibration moves further into the tail

• The second plot shows the calibration of the 99.5th percentile at each of the thresholds. We can see how this falls from around 45% to around 30% as the threshold increases

• The bottom left plot shows the shape parameter for the calibration at each threshold. This is generally over 0 for most of the thresholds.

• The bottom right plot shows the standard error of the shape parameter estimate The standard error rises as the number of points in the calibration

19© 2010 The Actuarial Profession www.actuaries.org.uk

estimate. The standard error rises as the number of points in the calibration falls

Page 11: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

11

Confidence intervals around estimated 99.5th

percentiles – UK annual simple returns

• Using three different thresholds with the maximum likelihood calibration method, confidence intervals have been estimated for the 99 5th percentile stress level99.5th percentile stress level

Thresholds

UK 99.5th percentiles -1% -5% -10%

Lower CI -29.2% -28.6% -27.2%

Best Estimate -42.2% -42.9% -51.3%

Upper CI -97.7% -97.7% -97.7%

20© 2010 The Actuarial Profession www.actuaries.org.uk

• Confidence intervals are very wide reflecting significant uncertainties in calibration of the 99.5th percentile stress

• Confidence intervals created using profile likelihood method [McNeil et al ., 2005]

Software used for calibration

• Statistics package R has been used using the QRM library

• Other R libraries such as the fExtremes library were also investigated

• The work was partly re-created in excel as a check on the results

• The R libraries are freely available online and the code used to produce the results in this presentation and excel

21© 2010 The Actuarial Profession www.actuaries.org.uk

used to produce the results in this presentation and excel checking tool will be made available

Page 12: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

12

Effect of Threshold and Fitting Methodon 99.5%-ile Estimate – UK MSCI Data 12/69-08/08

53%

54%

eturn)

47%

48%

49%

50%

51%

52%

d 99.5%‐ilefall (sim

ple re

MOM simple

PWM simple

MOM log

PWM log

Follow red circles on next slide

44%

45%

46%

0% 10% 20% 30% 40% 50%

Estimated

Threshold

22

EVT Fit comparison to Other Fitted Distributions

3

4

e fr

om

N(0

,1)Data

40% fall0.5%-ilenormallognormallog PIV

-1

0

1

2

-100% -80% -60% -40% -20% 0% 20% 40% 60% 80% 100%

Pe

rce

nti

le

Annual simple equity return

log PIVlogHYPlog sym PIVsym HYPsym PIVlog sym HYPHYPPIVEVT loEVT hi

23© 2010 The Actuarial Profession www.actuaries.org.uk

-4

-3

-2

Page 13: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

13

Mean Estimates of 99.5%-ile (lognormal λ = 0.25)Scaled so true median = 0, true 99.5%-ile = 1

1.2

ns) 

Components of EVT Quantile Estimates

Oddly the bias is worst when

0.4

0.6

0.8

1

Estimate (5000 sim

ulatio

observations

Oddly, the bias is worst when we fix the shape parameter ξto its *correct* limiting value of zero.This relates to the purpose of our exercise, which is to estimate the 99.5%-ile and not the limiting parameters.For finite thresholds a bias in

0

0.2

50 60 70 80 90 100

Mean

 Eof 100 o

Threshold Rank99.5%‐ ile (ξ=0)99.5%‐ ile (PWM)99.5%ile (MOM)Mean exceedanceThreshold

For finite thresholds, a bias in the estimate of ξcompensates for slow convergence to the GPD limit

24

How the mean estimated 99.5%-ile variesaccording to the underlying model

1.05Mean Fitted 99.5% ‐ile, PWM

0.95

1

λ=‐0.5

λ= ‐0.25

λ=0

λ=0.25

λ=0 5

0.9

50 60 70 80 90 100

Threshold Rank

λ=0.5

25

Page 14: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

14

Fine Tuning the MethodEffect of Left/Right Position, MOM vs PWM

Mean Estimated 99.5%‐ileImpact of Fitting Method PWM estimates extrapolate

the shape of the QQ-plot close

1

1.2

1.4

ξ=0 Right

ξ=0 Left

PWM Left

PWM Right

the shape of the QQ plot close to the threshold. This gives rise to volatility in estimates as a function of the threshold.MOM estimates pay more attention to the extreme points which gives greater variability across data sets but more stability as a function of

0.8

‐0.5 ‐0.25 0 0.25 0.5

Lognormal parameter  λ

MOM Left

MOM Right

threshold for a given data set.Overall, for our purpose, our experiments show MOM gives slightly more precision than PWM for estimating extreme percentiles.

26

Estimated 99.5%-ile Standard DeviationTrue distribution matters more than threshold

0 25

Standard  Deviation of Estimated 99.5%‐ile

0.1

0.15

0.2

0.25

λ=0.5

λ=0.25

λ=0

λ=‐0 25

0

0.05

50 60 70 80 90 100

Threshold Rank

λ= 0.25

λ=‐0.5

27

Page 15: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

15

Probability {next observation ≤ estimated percentile}Coverage probability aka back-test probability

99.50%

Coverage Probabilities for Estimated 99.5% ‐ile

99.00%

99.25%

λ= ‐0.5λ = ‐0.25λ =0λ =0.25

98.75%

50 60 70 80 90 100

Threshold Rank

λ =0.5

28

Percentile Bias and Coverage Bias(Example based on standard normal)

99.5%

100.0%The target 99.5%-ile is 2.5758Suppose our noisy estimate lies in a

f id h 1 0

97.5%

98.0%

98.5%

99.0%

2 2 5 3 3 5

range of width =1.0If the percentile is unbiased then the estimate is 2.5758±0.5, but the average coverage (green oblong) is only 99.0% not 99.5%. This means that on a long back-test, the next observation would exceed the estimated 99.5%-ile a full 1% of the time2 2.5 3 3.5

Normal  Quantile

True distribution function 

Target

Unbiased quantile

Unbiased coverage

time.If we correct for the coverage bias (purple oblong) then the required percentile estimate is 2.8422±0.5The extra 0.2664 can be interpreted as the capital required for parameter risk

29

Page 16: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

16

Relating Coverage Bias Correction to Mean and Standard Deviation

Standard Deviation and Mean Percentile Estimates

Aft C Bi C ti

Standard Deviation and Mean Percentile Estimates 

Before Coverage Bias Correction

1.1

1.2

Estimated 99.5%‐ile

After Coverage Bias Correction

λ=‐0.5

λ=0.25

λ=0

1.1

1.2

n Estim

ated 99.5%‐ile

g

λ=‐0.5

λ=‐0.25

λ=0

λ

0.9

1

0 0.1 0.2 0.3 0.4

Mean

 

Standard Deviation of Estimated 99.5%‐ile

λ=‐0.25

λ=‐0.5

0.9

1

0 0.1 0.2 0.3 0.4

Mean

Standard Deviation  of Estimated 99.5%‐ile

λ=0.25

λ=‐0.5

30

Correction for Coverage BiasEffect on Mean Estimated 99.5%-ile

1.2

The theoretical coverage bias correction

1.1

1.15

n Estim

ated 99.5%‐ile

Model Risk

Parameter Risk

depends on the underling distribution which is (sadly) unknown.We can quantify model risk using a robust estimate based on a bias correction for the “worst case” distribution λ=0.5.This then overstates required capital (coverage probability > 99.5%) for other distributions.

1

1.05

‐0.5 ‐0.25 0 0.25 0.5

Mean

Lognormal Shape Parameter λ

Corrected  Coverage BiasUpward‐Only Coverage Bias 

31

Page 17: Extreme Value Theory Extreme Value Theory For a 1-in-200 event · • Comparison of EVT to other methods of determining 1 in 200 • Simulation based exercise to validate EVT 1. 15/11/2011

15/11/2011

17

Questions or comments?

Expressions of individual views by members of The Actuarial Profession and its staff are encouraged.

The views expressed in this presentation are those of the presenter.

32© 2010 The Actuarial Profession www.actuaries.org.uk

References

McNeil, A., Frey, R., and Embrechts, P. (2005).Quantitative Risk Management: Concepts, Techniques andTools.Princeton University Press, Princeton.

33© 2010 The Actuarial Profession www.actuaries.org.uk