end of training project report - uclouvain of training project report september10, 2008 patterns and...
TRANSCRIPT
End of Training Project
Report
September 10, 2008
Patterns and causes of sea-ice variability in the decadal
ECCO-GODAE (version 3) coupled ocean/sea-ice state estimate
Olivier Lecomte
ENSI08 - HY
Patterns and causes of sea-ice variability in the decadal
ECCO-GODAE (version 3) coupled ocean/sea-ice state estimate
Program of Atmospheres, Oceans, and Climate, Department of
Earth, Atmospheric and Planetary Sciences,
Massachusetts Institute of Technology, Cambridge, Massachusetts
Supervisors:
Patrick Heimbach, Department of Earth and Atmospheric Science,
MIT Building 54, Room 1518, 77 Massachusetts Avenue, Cambridge, MA, 02139
E-mail: [email protected]
Carl Wunsch, Department of Earth and Atmospheric Science,
MIT Building 54, Room 1518, 77 Massachusetts Avenue, Cambridge, MA, 02139
E-mail: [email protected]
2
Contents
1 Introduction 5
2 Sea ice concentration fields – a first description 7
2.1 Observations from passive microwave radiometry vs. Estimates from ECCO-GODAE, v3.35 7
2.2 Cost/misfit calculations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3 Sea ice concentration variability 17
3.1 Inter-annual Southern Ocean components . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
3.2 Sector correlations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
4 Atmospheric mechanisms 23
5 Oceanic mechanisms 26
5.1 Trends over the 15 years . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
5.2 EOF Analyses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
5.2.1 Sea Surface Temperature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
5.2.2 Sea Surface Salinity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
5.2.3 Mixed-Layer Depth . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
5.3 Correlation Analyses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
6 Conclusions 36
A EOF/PCA and SVD analysis of Climatic Data 38
A.1 What is it all about and when to use each method . . . . . . . . . . . . . . . . . . . . . . 38
A.2 The Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
A.3 How to do it . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
A.3.1 Principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
3
CONTENTS
A.3.2 How to compute it . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
A.3.3 How to understand it physically . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
A.3.4 SVD of coupled fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
B Dates of Minimum and Maximum SIC 45
C AAO, SOI and Nino-3 Indices 46
D EOFs calculation MATLAB code 48
4
Chapter 1
Introduction
While the decline of Arctic sea-ice cover over the last two decades has been the subject of much media
attention (2007 marked the lowest coverage since the beginning of the combined SMMR/SSMI satellite
record1, Serreze et al. (2007)), a fundamentally different behaviour of the Southern Ocean (SO) sea-ice
cover has received little notice. Recent estimates of SO sea-ice concentration changes report a weak
upward trend in SO sea-ice concentration of 0.21 ± 0.12 106 km2/decade for the period 1979 to 2004
(e.g. Zwally et al. (2002); Kwok and Comiso (2002); Cavalieri and Parkinson (2008)). In fact the austral
summer of 2007/08 satellite observations reveal the highest Antarctic summer sea-ice extent since 1979).
SO sea-ice concentration exhibits a strong seasonal cycle with little perennial ice surviving from
one year to the next as sea-ice retreats during austral summer, it also exhibits significant interannual
variability. At the same time, regional analyses in the Weddell, Indian, Pacific, Ross, Amundsen, and
Bellinghausen sectors reveal strong regional variability with significant differences in character from
sector to sector. So despite previous work indicating an upward trend, there are large uncertainties on
current trend estimates because of seasonal and regional variability.
The nature and cause of regional variability of SO sea-ice concentration remains very much an open re-
search question. Various groups have analyzed (i) co-variability between different sectors around Antarc-
tica, as well as (ii) linkages between sea ice concentration anomalies and climate indices. With regard
to (i), two views dominate current thinking. One is the existence of a so-called Antarctic Circumpolar
Wave (ACW) which travels around Antarctica with an 8 to 10 year period, (e.g. White and Peterson
(1996)). The second focuses on an Antarctic Dipole (AD), a seesaw between Weddell and Amundsen
sector sea ice concentration anomalies, (cf. Yuan and Martinson (2000); Holland et al. (2005)). With
1Scanning Multichannel Microwave Radiometer (SMMR) and Special Sensor Microwave Imager (SSMI) satellites provideinformation on the global sea ice cover from 1978 to present in 1978.
5
CHAPTER 1. INTRODUCTION
regard to (ii), among the most studied atmospheric teleconnections linking Antarctic sea-ice anomalies
to climate indices are the Antarctic Oscillation, AAO (or Southern Annular Mode, SAM), as well as
ENSO-related indices such as the Southern Oscillation Index (SOI), and the Nino-3 index, (Carleton
(2003); Liu et al. (2004)). While such teleconnections are clearly corroborated by these studies, it seems
these by themselves cannot explain the bulk of of Antarctic sea-ice variability, in particular its regional
trend patterns.
Compared to the above analyses the role of the ocean has been little studied, probably in part due
to the lack of continuous observations in large parts of the SO. Here we use results from a new version
of the Estimating the Circulation and Climate of the Ocean – Global Ocean Data Assimilation (ECCO-
GODAE) effort. It is based on a dynamically consistent quasi-global coupled ocean-sea-ice state that has
been forced to most of the oceanic observations available since the early 1990s, such as satellite altimetry,
a GRACE-based mean dynamic topography, Reynolds SST fields, Argo profiling floats, CTD casts and
XBT observations from the WOCE program and many others (for a more comprehensive summary see
e.g. Wunsch and Heimbach (2007)).
Sea-ice observations have not been used to constrain the sea-ice component. The estimated sea
ice state needs special scrutiny which is a first step in our planned analyses. Its misfit to the combined
SMMR/SSMI sea-ice concentration fields obtained from the National Snow and Ice Data Center (NSIDC)
and retrieved via an updated version of the so-called ”Bootstrap algorithm”2 will be assessed for the
years 1992-2006 spanning the state estimate.
Thus, we will start by studying both data sets (observations and model data) in order to give a good
description of the Sea-Ice Concentration (SIC) fields, of their over-all trends and of the misfit between the
ECCO estimate and the satellite records. Our investigation will then follow along two routes. On the one
hand we will use climate indices available from the NOAA Climate Prediction Center (NOAA/CPC) to
infer regional patterns of co-variability between sea-ice anomalies and those indices in the spirit of earlier
studies. On the other hand we will take full advantage of the estimated ECCO-GODAE ocean state to
explore the role of the ocean. Near-surface properties such as upper ocean heat content, near-surface
salinity, mixed layer depths will receive special attention, as they provide the most immediate link to
sea-ice thermodynamical properties. Using a combination of statistical tools (EOF, SVD, CCA analyses)
and physical insights underlying Southern Ocean dynamics and thermodynamics, we will explore the role
of the ocean in driving Antarctic sea-ice concentration changes.2The bootstrap technique as described by Comiso (1986) and Comiso and Sullivan (1986) uses basic radiative transfer
equations and takes advantage of unique multichannel distributions of sea ice emissivity.
6
Chapter 2
Sea ice concentration fields – a first
description
2.1 Observations from passive microwave radiometry vs. Esti-
mates from ECCO-GODAE, v3.35
National Snow and Ice Data Center (NSIDC) provides with daily sea-ice concentration fields from January
1, 1992 to December 31, 2007 (obtained by satellite observation). However within the context of the
comparison of the observations with ECCO-GODAE output data, we will stop the study to the end of
2006 as there are no model data for 2007 at the time of writing. Observations for 2007 can be analysed
separately. The purpose of this first part is to display the data in a readable way in order to study
the sea-ice coverage evolution over the 15 years. NSIDC gives sea-ice concentration fields for the whole
ocean, plus the land-ice ones. So the first step consists in multiplying the ice concentration field by the
land mask (to keep the sea-ice coverage only) and displaying the Southern Ocean area of interest, below
latitude 50S. It is also interesting to display the different sectors of the Southern Ocean (Weddell sea,
Indian sector, Pacific sector, Amundsen sector, Bellingshausen sector and Ross sea) for the purpose of
further correlation studies.
7
CHAPTER 2. SEA ICE CONCENTRATION FIELDS – A FIRST DESCRIPTION
Figure 2.1: Display of SIE sector by sector over Southern ocean (fractional coverage by cell), scaled from0 (blue) to 1 (red). Top figure is observation, bottom one is model. Date is August 31st, 1992
Figure 2.2: Display of SIE sector by sector over Southern ocean (fractional coverage by cell), scaled from0 (blue) to 1 (red). A Lambert Azimuthal EqualArea projection is used. Left figure is observation, rightone is model. Date is August 31st, 1992
8
CHAPTER 2. SEA ICE CONCENTRATION FIELDS – A FIRST DESCRIPTION
We get a better quality of display when using a map projection tool (using Lambert Azimuthal
EqualArea but non conform projection). See figure 2.1 and 2.2 as examples.
The same kind of data is produced by the ECCO Model, so that we can apply the same process to the
output data and compare the results by plotting the sea-ice coverage for both model data and satellite
observations. Thus the time series of area-integrated ice concentration were computed and displayed
as movies showing the annual ice melting and forming cycle for every year from 1992 to 2006. Figures
2.3 shows the Southern-Ocean sea ice coverage evolution over 1992. These figures illustrate quite well
the seasonal cycle in four steps. Months of minimum and maximum coverage are respectively February
and September like it is almost for every year (exact months of minimum/maximum concentration were
determined from the sea ice integrated area signal and are displayed in the tab, appendix B), which is
consistent with Parkinson (2004); Cavalieri and Parkinson (2008). The first thing to notice (see figure
2.3) is that the model clearly underestimates the observations during austral summer. This implies that
sea ice will melt faster than in observations. Also If one looks at the SIC maps of September 1992 (figure
2.3.c), a pronounced polynia in the form of a hole in sea ice concentration in the Weddell Sea is apparent
which seems to point to an artifact in the model (right panel). Therefore when the misfit between
observations and model will be computed, large values of the error should show up in the corresponding
areas, if the anomalies are recurrent (when watching the SIC maps movies we see that indeed they are).
9
CHAPTER 2. SEA ICE CONCENTRATION FIELDS – A FIRST DESCRIPTION
(a) February 1992
(b) June 1992
(c) September 1992
(d) December 1992
Figure 2.3: Sea-Ice coverage (monthly averaged) for both observations (left) and model (right).
10
CHAPTER 2. SEA ICE CONCENTRATION FIELDS – A FIRST DESCRIPTION
For each year, the monthly averaged ice concentration was calculated (to set the data free of non-
relevant daily noise), and expressed both in square meters and in fractional area of the globe. Indeed the
ice coverage can be derived from the model output by summing the coverage (sea ice concentration in
each model grid-cell multiplied by the cell area) over the whole grid. Then by comparison with the area
of the entire grid (i.e. part of the global grid restricted to Southern Ocean) we get the fractional area of
the globe that ice covers. Figure 2.4 presents the sea-ice area integrated over the whole Southern-Ocean
along year 1992 for both observations and model data. Figure 2.5 is the same plot but extended to the
15-year period with the linear regressions associated with each curve that give a general trend for sea-ice
extent over the 15 years.
0 2 4 6 8 10 120
2
4
6
8
10
12
14
16x 10
12
Figure 2.4: Sea ice area variation along year 1992 (in m2). Blue curve represents the observations andred one corresponds to the model. X-axis in months.
0 20 40 60 80 100 120 140 160 1800
2
4
6
8
10
12
14
16x 10
12
Figure 2.5: Sea ice area variation along the 15 years sample, from 1992 to 2006 (in m2). Blue curverepresents the observations and red one corresponds to the model. X-axis in months.
11
CHAPTER 2. SEA ICE CONCENTRATION FIELDS – A FIRST DESCRIPTION
From these two figures we clearly identify the seasonal cycle. On the maps of the time series of the
raw data we could see that the model underestmates SIC during summer, now we see that in terms of
integrated area the model ( always) underestimate the observations. This is what the shift between the
two curves suggests. Also, the linear regressions performed on both plots give the following results:
Observation trend = 4.8232.109 m2/mth, Linear regression initial value = 8.4110.1012 m2
Model trend = -7.2907.108 m2/mth, Linear regression initial value = 6.7679.1012 m2
Thus, in addition to the noticeable shift between the curves there is an inconsistancy regarding the sign
of the trends. What will subsequently be interesting to do is to check if this inconsistancy persists when
the seasonal cycle is filtered out.
It is also necessary to remove the seasonal cycle to get a significant information regarding sea ice
inter-annual variability. In order to do this and to look at the decadal tendencies, two methods have
been implemented. The first one, which follows Fourier analysis principle, consists in fitting the data
with a periodic signal like sine and/or cosine to extract the seasonal variation of SIC. Let us assume that
this seasonal signal is close to the following:
f(x) = a0 + a1 cos(xw) + b1 sin(xw) (2.1)
where f would be either the sea ice area or the fractional coverage, x the months and w the frequency
of the seasonal cycle. Then performing the non-linear regression (based on dichotomia) of the data with
this function we find the coefficients a0, a1 and b1. The results are presented in figures 2.6 and 2.7.
We find values equal to 0.5232 and 0.5234 for w respectively for observations and model data. Now by
definition:
w = 2 π/T (2.2)
where T is the seasonal cycle period (in month). With such values for w, we find periods about 12
months as expected.
Once we get this seasonal signal characterized by the latter formula, we remove it from the whole
signal and perform a simple linear regression on the residual (figure 2.8). However one can notice that
a 12 months variability remains in the residual, for the reason that we only retain the first harmonic in
the Fourier decomposition of the signal, so its annual variability has not been entirely removed. Further
in this study we compute the EOFs decomposition of the SIC time series, that is why we need to remove
most of the seasonal signal so that it is not the dominant signal in the different EOF modes (in other
12
CHAPTER 2. SEA ICE CONCENTRATION FIELDS – A FIRST DESCRIPTION
words, to be able to see other modes of variability than the seasonal one). Thus, althougt this analysis
gives us a more precise idea of the sesonal cycle, it is not satisfactory. An other way to remove the annual
cycle is to run a 12-months running average on the data. This has been done in figure 2.9. On these
pictures we also see the linear regression performed on the residual signal. In this case it seems the main
part of the seasonal variability has disappeared. A summary of the standard deviations and trends from
the different linear regressions is given in the following tables.
Obs./Fourier Residual Obs./12mths running mean
Trend from linear regression (m2/mth) 4.5550 108 1.4192 109
Std of ’Residual-Regression’ (m2) 9.2529 1011 2.2769 1011
Mod./Fourier Residual Mod./12mths running mean
Trend from linear regression (m2/mth) -5.2498 109 -4.2392 109
Std of ’Residual-Regression’ (m2) 1.2777 1012 2.5384 1011
The two methods give consistant results one with each other (trends have different amplitudes but
the same sign). However, similarly to raw data, filtered model data suggest a different global tendency
(decreasing integrated SIC anomaly) from the one of the observations (increasing). That means the ice
extent in the model is decreasing over the 15 years, while observations indicate the opposite (which is
qualitatively consistent with trends published by other groups, Parkinson (2004); Cavalieri and Parkinson
(2008), although they are quantitatively different since they are calculated on a different spanning time).
Thus the inconsistancy noticed while performing linear regressions on raw data is still observable when
the annual cycle is removed, which means it is not an artifact coming from the period of calculation of
the trend but a real discrepency between observations and model.
13
CHAPTER 2. SEA ICE CONCENTRATION FIELDS – A FIRST DESCRIPTION
0 20 40 60 80 100 120 140 160 180
2
4
6
8
10
12
14
16x 10
12
Figure 2.6: Non-linear regression of theobservations with fourier function. a0=8.8581012m2, a1=-2.037 1012m2, b1=-6.6311012m2,w=0.5234. X-axis in months, Y-axisin m2.
0 20 40 60 80 100 120 140 160 180
0
2
4
6
8
10
12
14
16x 10
12
Figure 2.7: Non-linear regression of the Modeldata with fourier function. a0=6.786 1012m2,a1=-2.882 1012m2, b1=-7.142 1012m2,w=0.5232. X-axis in months, Y-axis in m2.
0 20 40 60 80 100 120 140 160 180−2.5
−2
−1.5
−1
−0.5
0
0.5
1
1.5
2x 10
12
0 20 40 60 80 100 120 140 160 180−4
−3
−2
−1
0
1
2
3x 10
12
Figure 2.8: Fourier Residual (Obs.-Regression for left figure and Mod.-Regression for right figure) andlinear regression of the residual. X-axis in months, Y-axis in m2.
0 20 40 60 80 100 120 140 160 1808.3
8.4
8.5
8.6
8.7
8.8
8.9
9
9.1
9.2
9.3x 10
12
0 20 40 60 80 100 120 140 160 1805.8
6
6.2
6.4
6.6
6.8
7
7.2
7.4x 10
12
Figure 2.9: 12-months running mean applied to SIC integrated area and linear regression of these signals.Left figure is observations, right figure is model. X-axis in months, Y-axis in m2.
14
CHAPTER 2. SEA ICE CONCENTRATION FIELDS – A FIRST DESCRIPTION
2.2 Cost/misfit calculations
To give oneself an idea of the amplitude of the misfit, standard deviation maps of the difference between
SIC fields from observations and model can be computed. This is what figure 2.10 and 2.11 show. The
quantity Obs.-Mod. was computed at each grid cell and maps of this error were separated by seasons.
Then standard deviation of the error at each grid point was calculated over periods of 15 years × 3
months/year.
Figure 2.10: Obs.-Mod. standard deviation map by season, calculated over periods of three month andover the 15 years.
Figure 2.11: Obs.-Mod. standard deviation map. Here Standard deviation was calculated over everyMarch (left) and September (right) months of the 15 years.
One would have expected to see strong standard deviations where model underestimate observations
during austral summer and at the location of the Weddell sector artifact we mentionned before. That is
15
CHAPTER 2. SEA ICE CONCENTRATION FIELDS – A FIRST DESCRIPTION
indeed what happens if we look at winter and summer maps, but there is something else to notice. Strong
values of standard deviation show up in the Southern Ocean periphery during austral winter periods.
Now studying sea ice state, we can speak in terms of concentration (SIC), thickness (characterizing sea
ice vertical state and growth), integrated area or even Sea Ice Edge (SIE, characterizing meridional sea
ice state and growth). Well the previous remark suggest that the model underestimate the SIE too.
However, very weak stantard deviations of the misfit can be noticed in some more inner parts of the
southern ocean; which is a good point to study and understand why the sea ice state estimate is quite
good in these areas and especially why the estimate is not as good elsewhere.
16
Chapter 3
Sea ice concentration variability
3.1 Inter-annual Southern Ocean components
To describe SIE variability, an Empirical Orthogonal Function (EOF) analysis (see appendix A) was per-
formed on the data (both observations and model data). First, to remove the ”intra-annual” variability,
a 12-months running mean was applied to every grid point of the field prior to the EOF analysis. Then
as usual for EOF compuations, the data was centered (mean over the 15 years was removed from it so
that we analyse an anomaly). The covariance matrix of the data was computed as its eigenvectors (via
singular value decomposition) which are also the so-called EOF ”components” or ”modes”. The follow-
ing figures present these patterns with the associated ”time-series” or ”expansion coefficients”. The first
EOF pattern (EOF1obs, fig. 3.1.a) of the observations, which explains the largest part of the variance of
the signal1 exhibits a dipole structure which is likely to be the signature of the so called Antarctic Dipole,
defined as an out-of-phase relationship between the ice and temperature anomalies in the central/eastern
Pacific and Atlantic sectors of the Antarctic, Yuan and Martinson (2001). The associated time series (or
expansion coefficients, fig. 3.2.a) seem to point out periods about 40 or 50 months which is somewhat
close to the characteristic period of an annular mode. In our following correlation analyses we will pay
particular attention to the one between these EOF time series and the Southern Oscillation Index (SOI)
and the Antarctic Oscillation index (AAO).
One of the drawbacks of EOF analysis is that we cannot always associate the components with true
physical phenomena. This is illustrated by the following components of the observations. On EOF2obs
1There we must understand largest part of the remaining variance. Indeed the first mode explains only for 23% of thetotal variance of the signal, but one must keep in mind that the seasonal cycle has been filtered thus removing a huge partof the signal fluctuations
17
CHAPTER 3. SEA ICE CONCENTRATION VARIABILITY
(fig. 3.1.b), one can notice some kind of tripole over the same area as the Antarctic Dipole one but it is
not obvious how to associate it with something physically real. Similarly EOF3obs (fig.3.1.c) is difficult
to interpret.
The first EOF of model (EOF1mod) also diplays a dipole-shaped pattern quite alike EOF1obs except
that it seems to be shifted towards east. In terms of scale (spatial variations in the EOFs and amplitude
of expansion coefficients), EOF1mod and EOF1obs are quite consistent, which may mean that in terms
of variance the model captures a significant part (that still has to be quantified) of the observed sea ice
variability. However, the next EOFs of both observations and model are no longer similar and cannot
be compared.
On the second EOF of the model (explaining a non-negligible part of 17.19% of the total variance)
one can also see an artifact (blue dot in the middle of Weddell Sea) that will be confirmed later on, when
analysing Mixed-Layer Depth fields.
18
CHAPTER 3. SEA ICE CONCENTRATION VARIABILITY
120
o W
60o W 0 o
60
o E
120o E
180 oW
80 oS
70 oS
60 oS
50 oS
40o S
−0.06
−0.04
−0.02
0
0.02
0.04
0.06
120
o W
60o W 0 o
60
o E
120o E
180 oW
80 oS
70 oS
60 oS
50 oS
40o S
−0.06
−0.04
−0.02
0
0.02
0.04
0.06
(a) EOF1obs (23.32%) and EOF1mod (23.06%)
120
o W
60o W 0 o
60
o E
120o E
180 oW
80 oS
70 oS
60 oS
50 oS
40o S
−0.06
−0.04
−0.02
0
0.02
0.04
0.06
120
o W
60o W 0 o
60
o E
120o E
180 oW
80 oS
70 oS
60 oS
50 oS
40o S
−0.06
−0.04
−0.02
0
0.02
0.04
0.06
(b) EOF2obs (11.93%) and EOF2mod (17.19%)
120
o W
60o W 0 o
60
o E
120o E
180 oW
80 oS
70 oS
60 oS
50 oS
40o S
−0.06
−0.04
−0.02
0
0.02
0.04
0.06
120
o W
60o W 0 o
60
o E
120o E
180 oW
80 oS
70 oS
60 oS
50 oS
40o S
−0.06
−0.04
−0.02
0
0.02
0.04
0.06
(c) EOF3obs (9.64%) and EOF3mod (10.30%)
Figure 3.1: Main components from EOF analysis applied to the observations (left) and model (right)and fraction of explained variance.
19
CHAPTER 3. SEA ICE CONCENTRATION VARIABILITY
0 20 40 60 80 100 120 140 160 180−6
−4
−2
0
2
4
6
0 20 40 60 80 100 120 140 160 180−6
−4
−2
0
2
4
6
(a) EOF1obs and EOF1mod time series
0 20 40 60 80 100 120 140 160 180−6
−4
−2
0
2
4
6
0 20 40 60 80 100 120 140 160 180−6
−4
−2
0
2
4
6
(b) EOF2obs and EOF2mod time series
0 20 40 60 80 100 120 140 160 180−4
−3
−2
−1
0
1
2
3
4
0 20 40 60 80 100 120 140 160 180−4
−3
−2
−1
0
1
2
3
4
(c) EOF3obs and EOF3mod time series
Figure 3.2: Time series (or expansion coefficients) associated with the main modes of variability of bothobservation (left) and model (right).
20
CHAPTER 3. SEA ICE CONCENTRATION VARIABILITY
3.2 Sector correlations
SVD Analysis (see Appendix A, SVD of coupled fields) between different sectors was computed (from
the observations) in order to identify the so-called ”Antarctic Dipole” and to look at the different modes
of covariability between these areas. Here the EOFs are calculated from the covariance matrix obtained
from the product of the matrix data of one region by the transpose of another one. The following tab
(fig.3.3) gives a short summary of the results.
Figure 3.3: SVD Analysis over the Antarctic Dipole ”supposed area”. The tab present the percent ofexplain covariance by every mode (each mode is a mode of covariability between the two concernedsectors). One mode suppose one given pattern for each sector, with the associated time-series. Thesimple cross-correlation between these time-series is also indicated.
This SVD analysis confirms what the first EOF of the observations inferred. The dipole pattern was
observed over the Ross, Amundsen, Bellingshausen and Weddell sectors with a positive (resp. negative)
anomaly in the Ross and Amundsen seas (resp. Belligshausen and Weddell seas). Indeed the SVD analysis
applied over the Antarctic Dipole ”supposed areas” (i.e. applied to pairs of sectors which are supposed
to be out of phase) results in quite strong anticorrelations between the time series associated with the
patterns of the first mode of covariability between the concerned areas. Especially Weddell/Amundsen
and Ross/Bellingshausen sectors, where the first mode explains about 50% of the covariance, show
SVD analyses with time series cross-correlations around -0.5. On the contrary the same SVD analysis
21
CHAPTER 3. SEA ICE CONCENTRATION VARIABILITY
applied over Weddell/Bellingshausen or Amnudsen/Ross shows positive cross-correlations (also around
0.5) between the time series. Since these time series are associated with the respective maps of the two
sectors where the analysis is performed, these results clearly corroborate the presence of the dipole.
This analysis does not give us much more information than the EOF one does, but it enables us
to verify our results thanks to a different way of calculation. We can therefore conclude that what we
observe is correct and that there is no lapse in the computation of each method.
22
Chapter 4
Atmospheric mechanisms
Thanks to the preliminary works, we have the time-series associated with the main components of SIC
fields for the Antarctic Ocean, but also those associated with the main pattern of the EOF analysis
applied on each sector (we perform exactly the same EOF analysis except that we restrain the data to
one chosen region). Regarding the atmospheric variability, we have the AAO (Antarctic Oscillation), SOI
(Southern Oscillation Index) and Nino-3 monthly indices from the National Oceanic and Atmospheric
Administration (NOAA, USA). A precise definition of these indices can be found in appendix C. Now as
the EOF time series (particularly for the first mode of variability) somehow represent the temporal evo-
lution of SIC (when the EOF themselves make the spatial pattern of variability explicit) it is particularly
interesting to look for correlations between them and the atmospheric indices. More precisly, what was
computed was the ”lag-correlation” between the time series and the atmospheric indices. Recall that the
correlation coefficient ρ(l) for lag l between two time sequences X = {x1, x2,..., xn} and Y = {y1, y2,...,
yn} of equal length n, is defined as follows:
ρ(l) =∑n
t=l+1(xt − x)(yt−l − y)√∑n
t=l+1(xt − x)2√∑n−l
t=1(yt − y)2
x =1
n− l
n∑
t=l+1
xt, y =1
n− l
n−l∑t=1
yt
where x, y denote the mean of X and Y, respectively. This way we do not only compute the straight
correlation between the two time sequences but also the correlation when one of the signal is delayed
or set ahead with respect to the other. These kind of simple calculations are often very useful when
searching for linkages or dynamic processes between time sequences. A summary of the results is given
23
CHAPTER 4. ATMOSPHERIC MECHANISMS
in figure 4.1 displaying the lag-correlation values of the three atmospheric indices with SIC depending
on the lag l.
The largest correlation seems to be between SIC and SOI, which is fairly consistent with Yuan (2004)
who has shown that AAO and ENSO explain a significant part of Antarctic sea ice variability. At
zero-lag, the correlation between SIC and SOI are quite strong for the whole Southern Ocean, but also
regionally for the Weddell, Ross and Amundsen seas. The other areas present weaker lag-correlation
values (about 10 months for Pacific sector, 12 months for indian and Bellingshausen seas).
The Lag-correlation values with Nino-3 have weaker amplitudes but seem to be exactly out of phase
with the previous one (correlations have opposite signs). This was somewhat expected since SOI and
Nino-3 are indeed supposed to be out of phase by nature of the ENSO phenomenon.
Lag-correlation with AAO is quite alike the one with SOI in the whole Southern Ocean, Weddell and
Bellingshausen seas. Especially, the lags are quite consitent relatively to those observed for correlation
of SIC with SOI and Nino-3 (-10 months in Bellingshausen sea,-2 months in Weddell sea) which might
infer teleconnections between SAM and ENSO through sea ice (but this has never been truly studied
and is absolutely not an established fact).
What would eventually be interesting is to look closer at the lags between SIC and the atmospheric
indices in the vicinity of the Antarctic Dipole to find out insights on its variability. However this kind of
investigation has already been well conducted, Yuan and Martinson (2001), and considering the time we
had for this research project we preferred to focus on its Model/Observations comparison aspect since we
would not have time enough to push the study as far as the current state of the art. Nonetheless, these
calculations were very useful to verify that our EOF analyses and correlation computations give consistent
results and to ensure that we could apply them in the following without worrying about numerical our
mathematical issues. Indeed the direct study of SIC teleconnections with oceanic mechanisms would
have been much more difficult to validate since they have not truly been studied so far.
24
CHAPTER 4. ATMOSPHERIC MECHANISMS
Figure 4.1: Lag-correlation between the time series of the main component (first EOF) of SIC andAtmospheric indices (AAO, SOI, Nino-3) by sector. X-axis is lag in months, Y-axis is the cross-correlationat each lag.
25
Chapter 5
Oceanic mechanisms
5.1 Trends over the 15 years
Here we use outputs from the ECCO-GODAE model again, namely the three following 2D-fields: Sea
Surface Temperature (SST), Sea Surface Salinity (SSS) and Mixed-Layer Depth (MLD). We have the
time series of these fields also from 1992 to 2006.
Before looking at the correlation between these oceanic quantities and SIC we are interested in
analysing these fields by themselves, that is to say looking at their mean, trends and main EOF patterns.
To compute the trends, a linear regression was applied to each grid point of the three fields, so that:
oceanic field(i, j) = a(i, j) + b(i, j)× t
where t is the time in months. Thus the operation results in one ”a coefficient” and one ”b coefficient”
map for each kind of field. The following pictures display (fig. 5.1) the trend maps (patterns at zero-time
(a coefficient) and trend patterns over the 15 years (b coefficient)) of SST, SSS and MLD.
The ”a coefficient maps” give us an insight of the fields temporal mean (but it is not the real time-
mean itself). They are interesting especially for Mixed-Layer Depth (MLD) fields (fig. 5.1.c) because
they provide us with the evidence that the artifact in the Weddell sea is indeed an artifact, insofar as
negative values of MLD appear in the area (which is impossible). For Sea Surface Temperature (SST, fig.
5.1.a) we see that Ross, Amundsen and Bellingshausen regions seem to be hotter than Indian and Pacific
ones. As for Sea Surface Salinity (SSS, fig. 5.1.b), one can observe a circularly shaped pattern with weak
values of salinity close to the coasts and the antarctic continent. The trends map are also interesting,
26
CHAPTER 5. OCEANIC MECHANISMS
for both SSS and MLD fields they display quite uniform maps except at the location of the Weddell sea
artifact where trends are different from anywhere else (positive trend while trends are negative in the
rest of the southern ocean for exemple). On the SST trend map, we can also see a region with stronger
positive trend values between the Pacific sector and the Ross sea. Normally the EOFs computed further
should also corobarate these trends.
27
CHAPTER 5. OCEANIC MECHANISMS
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S −2
0
2
4
6
8
10
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S
−6
−4
−2
0
2
4
6
x 10−3
(a) SST trend maps
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S 32.5
33
33.5
34
34.5
35
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S −4
−3
−2
−1
0
1
2
3
4x 10
−3
(b) SSS trend maps
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S −200
−100
0
100
200
300
400
500
600
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S −4
−3
−2
−1
0
1
2
3
4
(c) MLD trend maps
Figure 5.1: Trend maps of the three oceanic fields: SST, SSS and MLD. ”a coefficient” maps on the left,”b coefficient” maps on the right.
28
CHAPTER 5. OCEANIC MECHANISMS
5.2 EOF Analyses
5.2.1 Sea Surface Temperature
In order to study the correlation of these oceanic fields with SIC, an EOF analysis was performed on
them so that we can apply the correlation analysis between the time series associated with the different
modes. The EOF analysis applied to SST gives curious results at first look. In terms of trend, it is
completely consistent with the previous maps. On the first mode (fig. 5.2.a), the ”positive anomaly”
area between Ross and Pacific sectors combined with an over-all upward tendancy of the associated time
series is consistent with what figure 5.1.a suggested. However SST and sea ice are tightly linked; and
to produce sea ice, temperature must decrease below the freezing point so the cooler the water is the
more chances there are to have sea ice. Thus one would have expected to find EOFs quite alike SIC ones
with an opposite sign (it is also what Yuan and Martinson (2001) suggest when defining the Antarctic
Dipole), but it is not the case. More precisly, considering the first EOF of model SIC data, the spatial
pattern presented the Antarctic Dipole shifted relatively to its real location. Then we would expect to
find the opposite dipole in the first spatial pattern of SST but instead we find this kind of pattern only
on the second EOF (fig. 5.2.b). Even so, we will see further that the correlation between the time series
associated with the first EOFs of SIC and SST is quite strong.
To explain this, we have to look closer and see that the variance of the second EOF of SST explains
for 20.03% of the total variance, which is almost as large as percentage of explained variance by the first
EOF (29.91%). So finally we do find a dipole-like pattern consistent with EOF analysis of SIC, but in
mode 2, and it does not prevent the first modes (of SIC and SSS) to be highly correlated. Indeed during
winter when SIC reaches its maximum, SIC and SST are strongly linked (spatially and temporally), but
during summer when there is no longer sea ice then the spatial correlation between both fields collapses.
That is why we observe a strong temporal correlation (inferred by the EOF time series) but different
spatial patterns.
29
CHAPTER 5. OCEANIC MECHANISMS
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S
−0.03
−0.02
−0.01
0
0.01
0.02
0.03
0 20 40 60 80 100 120 140 160 180−30
−20
−10
0
10
20
30
(a) EOF1sst, explaining 29.91% of the total variance of the signal
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S
−0.03
−0.02
−0.01
0
0.01
0.02
0.03
0 20 40 60 80 100 120 140 160 180−25
−20
−15
−10
−5
0
5
10
15
20
(b) EOF2sst, explaining 20.03% of the total variance of the signal
Figure 5.2: EOF components of SST (left figures) with the associated time series (right figures) and theirfraction of explained variance.
30
CHAPTER 5. OCEANIC MECHANISMS
5.2.2 Sea Surface Salinity
The positive trend of the first SSS EOF component (EOF1sss, Fig. 5.3a), and the positive anomaly in
the Weddell Sea are consistent with the trend maps (Fig. 5.1c). The problem is that this area is located
where the artifact is in the MLD fields, and it is not obvious determining whether it is due to it or not.
In fact one could have made the same comment for SIC first component (from model data), the negative
anomaly area in the Antarctic Dipole is the one where the signal is the strongest and also where the
MLD artifact seems to show up. As a consequence it seems likely that these signals are related or a
consequence of the MLD artifact.
At least what can be noticed is that for SIC and SSS these strong signals appear only on the first
EOFs whereas the artifact is clearly present on every MLD components (see next paragraph). Thus,
if there is indeed an artifact in the data due to a problem in the model, one is likely to find its origin
looking a bit closer at MLD data.
31
CHAPTER 5. OCEANIC MECHANISMS
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S
−0.04
−0.03
−0.02
−0.01
0
0.01
0.02
0.03
0.04
0 20 40 60 80 100 120 140 160 180−15
−10
−5
0
5
10
(a) EOF1sss, explaining 30.81% of the total variance of the signal
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S
−0.05
−0.04
−0.03
−0.02
−0.01
0
0.01
0.02
0.03
0.04
0.05
0 20 40 60 80 100 120 140 160 180−15
−10
−5
0
5
10
(b) EOF2sss, explaining 18.50% of the total variance of the signal
Figure 5.3: EOF components of SSS (left figures) with the associated time series (right figures) and theirfraction of explained variance.
32
CHAPTER 5. OCEANIC MECHANISMS
5.2.3 Mixed-Layer Depth
Like it was mentioned in the previous part, the dominant signal in all MLD components is the artifact-
looking one in the Weddell sea (fig. 5.3) . That could mean MLD estimate from the model is tightly
linked with the origin of the problem and it would not be surprising. Indeed MLD combined with SST
gives us a good idea of the heat content under the ice, and sea ice forms and melts depending on this
heat content. Considering EOF1sst and EOF1mld (fig. 5.3.a) conjointly, one can see a positive SST
anomaly in the Weddell sea associated with big values of MLD (artifact), which implies a great heat
content under the ice in this area. That could consequently cause a melting of the ice and also the SIC
strong negative anomaly in the Weddell part of EOF1mod (fig. 3.1.a). The issue becomes to know why
the MLD in this area is abnormally big.
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S
−0.03
−0.02
−0.01
0
0.01
0.02
0.03
0 20 40 60 80 100 120 140 160 180−1.5
−1
−0.5
0
0.5
1
1.5x 10
4
(a) EOF1mld, explaining 38.39% of the total variance of the signal
120
o W
60o W 0 o
60
o E
120o E
180 oW
84 o
S 72 o
S 60 o
S 48 o
S 36 o
S
−0.03
−0.02
−0.01
0
0.01
0.02
0.03
0 20 40 60 80 100 120 140 160 180−1
−0.5
0
0.5
1
1.5x 10
4
(b) EOF2mld, explaining 17.02% of the total variance of the signal
Figure 5.4: EOF components of MLD (left figures) with the associated time series (right figures) andtheir fraction of explained variance.
33
CHAPTER 5. OCEANIC MECHANISMS
5.3 Correlation Analyses
Thanks to the EOF analysis already performed we have the time series associated with the components
of the SIC, SST, SSS and MLD fields. The next figure (5.5) shows the lag-correlation plots between these
expansion coefficients. It would have probably been more rigorous to perform an SVD decomposition
analysis between SIC and the other fields, but this simple lag-correlation analysis provide us with a fairly
good insight of how these physical parameters are related.
The first remark one can do is about the general strong zero-lag correlation between SIC in the
whole Southern Ocean and SST. On the contrary one could have expected an anticorrelation since the
colder the water is the more ice it is likely to be formed, but since the EOF patterns already have
opposite signs (see EOF1obs and EOF1sst) it is consistent. If one consider the regional correlations, the
Amundsen, Ross and Weddell sectors present a strong correlation between SIC and SST (at zero-lag).
The other areas display non-significant lag-correlations except Bellingshausen sea where the 10-months
lag anticorrelation is about -0.5.
For the correlation values with SSS and MLD, they are almost all equal to zero, which does not
give us a lot of information. The lag-correlation are not really significant either. There is a positive
10-months lag-correlation between SIC and SSS in Bellingshausen Sea that we might be tempted to
relate with the lag-correlation in SST. Indeed as the correlations have opposite signs we could associate
this to a density-conservation phenomenon, if there is a increase (resp. decrease) in the SST anomaly
then a decrease (resp. increase) in SSS anomaly would occur to compensate the the SST change and
maintain density. However the latter lag-correlation value is very weak (less than 0.2) and what is more
if one consider for example Pacific sector, there the two lag-correlations have the same sign, and the
explanation is no longer plausible.
Also in the Weddell plot of lag-correlation between SIC and MLD, the negative correlation at zero-lag
is something one could have expected. With respect to the fact that the MLD artifact is located in the
Weddell Sea, we could expect a strong anticorrelation in the vicinity of the area, insofar as a sharp
increase in MLD anomaly should induce a melting of the ice (and MLD time series variations are indeed
quite sharp, see figure 3.4). Even so the anticorrelation value is not that important. This has to be linked
with the first remark of this section, a lag-correlation calculation between the time series coming from a
true SVD analysis of coupled fields may have given more significant results (it was not done because of
a lack of time but it would be an interesting investigation to perform for a further study). Last but not
least, the outputs of the model are likely to be affected by the model issues quite differently depending on
34
CHAPTER 5. OCEANIC MECHANISMS
their nature and the considered area. That could consequently make the correlation results non-uniform.
Figure 5.5: Lag-correlation between the time series of the main components (first EOF) of SIC andOceanic fields (SST, SSS, MLD) by sector. X-axis is lag in months, Y-axis is the cross-correlation ateach lag.
35
Chapter 6
Conclusions
At the beginning of the study, the goal of the project was two-folded. The first aspect was to study
satellite observations of sea ice concentration to complete existing works about Southern Ocean sea ice
variability and the second was to compare the ECCO-GODAE v.3 sea ice state estimate with these
observations in order to assess its quality. As it was mentionned in the introduction, several groups have
studied the teleconections between sea ice variability and atmospheric phenomena like El Nino/Southern
Ocean or the Antarctic Oscillation (AAO) very well, but nobody really studied the potential linkages
with oceanic phenomena. Thus the first step was to look at the SIC variability in itself to coroborate
what had been done before on the present field and in the same way to check that our different methods
of data analysis were giving suitable results (like for EOF analysis for example). Then knowing our
routines worked properly we could apply them to the outputs of the model (SIC estimate, SST, SSS,
MLD) and intend to find correlations between the results. Mainly due to time constraints, we were not
able to carry out these two works completely. However some relevant indications on the model estimates
were established.
With respect to the Antarctic sea ice variability characterization, the study did not go further than
the conclusions plublished by other groups like Parkinson (2004) who achieved the same kind of analysis
based on both observations and data provided by models. Once the seasonal cycle is removed from the
SIC or sea ice integrated area signals, the residuals exhibits a positive trend of 1400 km2month−1 for the
observations and -4000 km2month−1 for the ECCO estimate, an inconsistancy that remains unexplained,
but may be also linked with the MLD articfact. The EOF and SVD analyses performed on SIC fields
enabled to confirm the existence of the Antarctic Dipole (ADP) through the correlation calculations
between time series of Bellingshausen-Weddell and Ross-Amundsen seas. This ”wavenumber-2 mode”
36
CHAPTER 6. CONCLUSIONS
was also naturally found in the second EOF of SST (although we would primarily have expected to find
it on the first EOF pattern). The ice variablity, somehow represented by the time series associated with
the first of its EOF components, presents some strong correlations with the Southern Annular Mode
(SAM) and ENSO-related indices (0.6 with AAO and SOI, -0.5 with Nino-3 at zero-lag for the whole
Southern Ocean) which is consistent with previous work (e.g. Yuan (2004)). The overall correlation, or
rather the overall anticorrelation between SIC and SST at zero-lag are as strong as expected although
they weaken in some regions like Indian, Pacific or Bellingshausen seas. One must keep in mind that SIC
and SST have a different nature in the correlation study (one is observations while the other was derived
from the model), so this is probably resulting from some issues with the sea ice estimate in the model.
The amplitude of the cross-correlation of SIC and SSS or MLD are less significant. These oceanic fields
estimated by the model were also very useful to corroborate and exhibits some defects first observed in
the ECCO-GODAE SIC estimate. It seems that the model always underestimates the observations ice
integrated area. Furthermore the standard deviation maps of the misfit between observations and model
show that it also undervalues the Ice Extent (Sea Ice Edge). These maps also exhibits strong values of
the misfit in the vicinity of Weddell sea where an articfact from model data were identified afterwards.
Indeed, it was noticed that strong anomalies in Mixed-Layer Depth (especially in the Weddell sea) could
generate the artifacts observed on the other fields derived from the model. To figure out what the root of
the problem is, one should know much better how the model works to understand why the mixed-layer
depth sharply deepens in these areas, thus drasticly changing the heat content under the ice and causing
it to melt abnormaly. However, this is currently far from being within the range of our study and would
require much more time than we had. The research obviously needs to be continued, and for further
analysis, one should go into details between sea variability and oceanic processes. Subsequently the first
step would likely be understanding the model code and why it creates the MLD issue. Once the problem
is solved, the correlation calculations would have to be reprocessed with true SVD of coupled fields
analysis in order to establish sufficiently reliable lag-correlation plots. The most interesting part of the
work would then be to come back to the ice variability aspects by combining the resulting curves. Looking
at the cross-correlation values depending on both lags and regions of Antarctic, one may hopefully be
able to identify dynamical processes propagating from one sector to another (through the annular mode
or different kinds of waves) and potentially explaining a substantial part of the southern ocean sea ice
variability.
37
Appendix A
EOF/PCA and SVD analysis of
Climatic Data
Except some comments and some changes in the problem’s formalism, the main part of the following
was taken from ”A Manual for EOF and SVD analyses of Climatic Data”, by H. Bjornsson and S. A.
Venegas. Some sections are also refering to ”Statistical Analysis in Climate Research”, by Hans von
Storch and Franci W. Zwiers.
A.1 What is it all about and when to use each method
This appendix contains a fairly dicussion of two methods for analysing the spatial and temporal variability
of geophysical fields. The two methods covered are the method Empirical Orthogonal Functions (EOFs),
also known as Principal Component Analysis (PCA), and the method of Singular Value Decomposition
(SVD). The EOF/PCA analysis is a method of choice for analysing the variability of a single field, i.e. a
field of only one scalar variable (SLP, SST etc). The purpose is to find the spatial patterns of variability,
their time variation, and gives a measure of the ”importance” of each pattern. As the method is tightly
linked with eigenvalues and eigenvectors calculations, the SVD analysis can be used for both single fields
EOFs determination and coupled fields analyses. The SVD applied to two fields together will identify
the modes modes of behavior in which the variations of the to fields are strongly coupled. It should be
stressed that even though the methods presented below break the data into ”modes of variability”, these
modes are primarily data modes, and not necessarily physical modes. Whether they are physical will be
matter of subjective interpretation. One more remark about terminology before we get started, the EOF
38
APPENDIX A. EOF/PCA AND SVD ANALYSIS OF CLIMATIC DATA
method finds both time series and spatial patterns. Most authors refer to the patterns as the ”EOFs”,
but some refer to them as the ”Principal component loading patterns” or just ”principal components”.
The time series are referred to as ”EOF time series”, ”expansion coefficient time series”, ”expansion
coefficients”, ”principal component time series” or just ”principal components”. In what is described
herein, we will refer to the patterns as the EOFs and the time series as ”principal components (PCs)”
or ”expansion coefficients”.
A.2 The Data
Let us assume that we have measurements of some variable at locations x1,x2,...xp taken at times
t1,t2,...tn. For each time tj (j=1,...n) we can think of the measurements xi (i=1,...p) as a map or
field.
Figure A.1: The matrix F. Each clumn is a map, and each row is a time series of observations for a givenlocation.
We store these measurements in a matrix F as n maps each being p points long (so typically if the
maps are k by l 2D fields, then the data has to be reshaped so that it become vectors of size k*l=p). We
can then interpret each of the p rows in F as a time series for a given location (see figure 43). The EOF
analysis is performed using F as the data matrix.
In the above example the variable in F might be sea ice concentration (SIC) over the southern ocean.
In this case we would apply the EOF analysis directly on F. We might also be interested in examining the
coupling between the SIC and sea surface temperature (SST) measured at locations y1,y2,...yr taken at
times t1,t2,...tn. These SST measurements are then ordered into another matrix S, in the same fashion
as before, and the two matrices, F and S, analysed using SVD method. Notice that the locations of these
measurements of the SIC and the SST does not have to be the same, so we could also apply the process
on two fields of a same climatic variable but in different places to see if the variability in the two areas
39
APPENDIX A. EOF/PCA AND SVD ANALYSIS OF CLIMATIC DATA
are close or not (this is what we use while looking at the SIC covariability between the different sectors
of southern ocean for example).
A.3 How to do it
A.3.1 Principle
Let us assume that we removed the mean from each of the p time series in F, so that every row has zero
mean.1
We form the covariance matrix of F by calculating R= 1N−1FtF and then we solve the eigenvalue
problem
RC = CΛ. (A.1)
Λ is a diagonal matrix containing the eigenvalues λi of R. The ci column vectors of C are the
eigenvectors of R corresponding to the eigenvalues λi. Both Λ and C are of the size pn bby n. Each of
these eigenvectors can the be regarded as a map, and they are the EOFs we were looking for. In what
follows we always assume that the eigenvectors are ordered according to the size of the eigenvalues (EOF1
is the eigenvector associated with the biggest eigenvalue, and the one associated with the second biggest
eigenvalue is EOF2, etc). Each eigenvalue λi, gives a measure of the fraction of the total variance in R
explained by the mode. This fraction is found by dividing the λi by the sum of all the other eigenvalues
(the trace of Λ).
The eigenvector matrix C has the property that CtC=CCt=I (where I is the identity matrix). This
means the EOFs are uncorrelated over space. Another way of stating this is to say that the eigenvectors
are orthogonal to each other. Hence the name Empirical Orthogonal Functions (EOF).
The pattern abtained when an EOF is plotted as a map, represents a standing oscillation. The time
evolution of en EOF shows how this pattern oscillates in time. To see how EOF1 ’evolves’ in time we
calculate:
~a1 = F ~c1 (A.2)
The n components of the vector ~a1 are the projections of the maps in F on EOF1, and the vector is a
time series for the evolution of EOF1. In general, for each calculated EOFj, we can find a corresponding
~aj . These are the principal component time series (PC’s) or the expansion coefficients of the EOFs. Just
1Removing the time means has nothing to do with the process of finding eigenvectors, but it allows us to interpret Ras the covariance matrix, and hence understand the results. Strictly speaking you can find EOFs without removing anymean, or you can remove the mean of each map, but not each time series.
40
APPENDIX A. EOF/PCA AND SVD ANALYSIS OF CLIMATIC DATA
as the EOFs were uncorrelated in space, the expansion coefficients are uncorrelated in time.
We can reconstruct the data from the EOFs and the expansion coefficients:
F =p∑
j=1
~aj(EOFj) (A.3)
A common use of EOFs is to reconstruct a ’cleaner’ version of the data by truncating this sum at some
j = N ¿ p, that is, we only use the EOFs of the few largest eigenvalues. The rationale is that the first
N eigenvectors are capturing the dynamical behavior of th system, and the other eigenvectors of the
smallest eigenvalues are just due to random noise. Needless to say this assumption is not always true.
A.3.2 How to compute it
The remaining issue is: How to solve the eigenvector and eigenvalue problem? One approach is to use
Singular Value Decomposition (SVD). Indeed any m×n matrix A can be given a SVD:
A = USV †
where U is m×n, S is n×n, V is n×n, and V† is the conjugate transpose of V (U and V are orthogonal
matrices). The first min(m,n) columns of U and V are orthgonal vectors of dimension n and m and
are called left and right singular vectors, respectively. Matrix S is a diagonal matrix with non-negative
elements sii=si, i=1,...,min(m,n), called singular values. All other elements of S are zero. Therefore:
AA†U = USV †V SU†U = US2
A†AV = V SU†USV †V = V S2
That is, the columns of V are the eigenvectors of A†A, the squares of the singular values si are the
eigenvalues of A†A, and the columns of U are the eigenvectors of AA† that correspond to these eigenvalues.
Note that all this is true when m≥n, when m<n we first write the SVD of A† = U ′S′V ′† where U’ is
n×m, S’ is m×m, and V’ is m×m, all with the properties as described above. Thus A=V’S’U’†.
Let us mention also a theorem that is often useful when computing eigenvalues and eigenvectors:
Theorem 1 Let A be any m×n matrix. If λ is a nonzero eigenvalue of multiplicity s of A†A with s
linearly independent eigenvectors ~e1, ..., ~es, then λ is also an s-fold eigenvalue of AA† with s linearly
independent eigenvectors A~e1, ..., A~es.
41
APPENDIX A. EOF/PCA AND SVD ANALYSIS OF CLIMATIC DATA
So, let us take again the data matrix F. In order to remove the temporal mean from the data (i.e.
we consider the anomaly field) we compute the matrix:
χ = F (I − 1n
J)
where I is the n×n identity matrix and J is the n×n matrix composed entirely of units. Therefore the
covariance matrix is:
Σ =1n
χχ†
As a consequence, If the sample size n is larger than the dimension of the problem p, then the EOFs are
calculated directly as the normalized eigenvectors of the p×p matrix Σ. On the other hand, if the sample
size, n, is smaller than the dimension of the problem, p, the EOFs may be obtained by first calculating
the normalized eigenvectors ~g of the n×n matrix Σ = 1n (I − 1
nJ)F †F (I − 1nJ), and then according to
the previous theorem, computing the EOFs as
~e =F (I − 1
nJ)~g‖F (I − 1
nJ)~g‖ .
Now the SVD of the p×n centred data matrix
χ = F (I − 1n
J),
is
χ = USV †
and therefore1n
χ†χV =1n
V S2.
So the eigenvectors of Σ are the columns of V and the corresponding eigenvalues are 1nS2. One can
finally retrieve the EOFs from these eigenvectors thanks to theorem 1 and the expansion coefficients by
calculating ~aj = F × EOFj since they are nothing but the projection of F onto the j-th EOF. It is also
interesting to compute the variance ratio of each EOF from the eigenvalues λi = 1ns2
i by computing
λi∑λi
.
This last quantity provide us with the ’amount of variance explained’ by each EOF.
42
APPENDIX A. EOF/PCA AND SVD ANALYSIS OF CLIMATIC DATA
A.3.3 How to understand it physically
Let us take a physical (and unrealistic) example. We have Pacific SST data, one SST map per month
sampled over a period of a few decades, that we put in a data matrix without removing the temporal
mean. We perform an EOF analysis and find that most of the data is explained by 2 eigenvectors only.
The first one (EOF1) is a map of contours that are positive in the northern hemisphere and negative
in the southern hemisphere. The corresponding PC is periodic with a period of about 12 months. We
therefore identify EOF1 as the annual cycle. EOF2 shows many contour lines close to the equator but
few elsewhere and the associated PC is quasi-periodic with a period of few years. We identify this
eigenvector to be associated with El Nino. From this data we would get the picture of Pacific SSTs and
their evolution in time: most of what happens is described by the SST change associated with the annual
cycle and the slower SSt change associated with El Nino. And since no other eigenvectors contribute
much (all other eigenvalues are very small) we assume that everything else found in the observations is
just noise. The EOF method thus allows us to view all the complicated variability in the original SST
data and explain it by two processes.
In practice it is inadvisable to keep the seasonal cycle in the data, since that signal is likely to
dominate everything else. That is why we often begin the analysis by removing the seasonal cycle from
the data using an appropriate filtering method.
A.3.4 SVD of coupled fields
The singular Value Decomposition (SVD) method can be thought of as a generalization to rectangular
matrices of the diagonalization of a square symmetric matrix (like in EOF anlysis). I is usally applied
in geophysics to two combined data fields, such as SLP and SST. The method identifies pairs of coupled
spatial patterns, with each pair explaining a fraction of the covariance between the txo fields. Hence, to
perform this decomposition, we construct the temporal cross-covariance matrix between two space and
time dependent data fields. As an example, let us assume we have two data matrices, S and I (S being
SST fields and I Ice concentration fields). Let us suppose that S is p×n and I is q×n (that means S and
I have been reshaped like F in the previous parts, so that their rows contain a time series for a particular
location, and their columns contain a map for a given time). Then if both S and I are centred in time,
then the cross-covariance matrix is C = S†P . This matrix does not need to be square as the two fields
may be defined on a different number of grid points.
The SVD
C = ULV †
43
APPENDIX A. EOF/PCA AND SVD ANALYSIS OF CLIMATIC DATA
of the cross-covariance matrix yields two spatially orthogonal sets of singular vectors (spatial patterns
analogous to the eigenvectors or EOFs, but one for each variable) and a set of singular values associated
with each pair of vectors (analogous to the eigenvalues). The singular vectors for S are the columns of
U, and the singular vectors for P are the columns of V. Each pair of spatial patterns describe a fraction
of the square covariance (SC) between the two variables (i.e. is a mode of co-variability between the
two fields). The first pair of patterns describes the largest fraction ot the SC and each succeeding pair
describes a maximum fraction of SC that is unexplained by the previous pairs. Just as with the EOFs,
these patterns represent standing oscillations in the data fields and we can find the expansion coefficients,
i.e. time series describing how each mode of variability oscillates in time. For S and P we respectively
calculate A = SU and B = PV (the k-th expension coefficient for each variable is computed by projecting
the corresponding original data field onto the k-th singular vector). The columns of the matrices A and
B contain the expension corfficients of each mode. The diagonal of L contains the singular values and
the total squared covariance in C is given by the sum of the squared diagonal values of L. This gives a
simple way of assessing the relative importance of the singular modes, through the squared covariance
fraction (SCF) explained by each mode. If li=L(i,i) is the i-th singular value, the fraction of squared
covariance explained by the corresponding singular vectors ~ui and ~vi is given by
SCFi =li
trace(L).
Finally the correlation value (r) between the k-th expansion coefficients ot the two variables indicates
how strongly related the k-th coupled patterns are.
44
Appendix B
Dates of Minimum and Maximum
SIC
Figure B.1: Area and dates of minimum/maximum coverage for every year from 1992 to 2006 for Ob-servations and Model.
45
Appendix C
AAO, SOI and Nino-3 Indices
∗The Antarctic Oscillation (AAO) is the dominant pattern of non-seasonal tropospheric circulation
variations south of 20S, and it is characterized by pressure anomalies of one sign centered in the Antarctic
and anomalies of the opposite sign centered about 40-50S. The AAO is also referred to as the Southern
Annular Mode (SAM). The daily AAO index is constructed by projecting the daily 700mb geopotential
height anomalies poleward of 20S onto the loading pattern of the AAO, the loading pattern of the AAO
being defined as the leading mode of Empirical Orthogonal Function (EOF) analysis of monthly mean
700 hPa height during 1979-2000 period (see fig. C.1).
∗The Southern Oscillation Index (SOI) is one measure of the large-scale fluctuations in air pressure
occurring between the western and eastern tropical Pacific (i.e., the state of the Southern Oscillation)
during El Nio and La Nia episodes. It is computed using monthly mean sea level pressure anomalies at
Tahiti (T) and Darwin (D). The SOI [T-D] is an optimal index that combines the Southern Oscillation
into one series.
∗El Nio-Southern Oscillation (ENSO; commonly referred to as simply El Nio) is a global coupled
ocean-atmosphere phenomenon. The Pacific ocean signatures, El Nio and La Nia are important tem-
perature fluctuations in surface waters of the tropical Eastern Pacific Ocean. NINO3 is an index that
measures the strength of an ENSO event: it is the SST anomaly averaged over [5S,5N] and [150W,90W],
i.e. the eastern equatorial Pacific. The atmospheric signature of ENSO is the Southern Oscillation (SO).
46
APPENDIX C. AAO, SOI AND NINO-3 INDICES
Figure C.1: Taken from NOAA Climate Prediction Center Website.
47
Appendix D
EOFs calculation MATLAB code
During my research work, I had to implement a huge number of codes to obtain the results presented in
this report. To avoid overloading the report and for the sake of paper saving I just put the EOF statistical
tool Matlab code in appendix.
function [alpha, EOFs,var_ratio,extracted_mean]=EOF(k,n,data,EOF_nb)
%-------------------------------------------------------------------------%
% alpha = Time-series associated with the EOFs
% EOFs = Components resulting from Singular Value Decomposition after
% re-normalization
% var_ratio = amount of variance explained by each EOF
% extracted_mean = Temporal mean of the data, necessary for field
% reconstruction from EOFs
%-------------------------------------------------------------------------%
% data = Input data
% n = Number of samples
% k = Dimension of the problem, number of positions of the problem
% EOF_nb = Number of EOFs wanted in output
%-------------------------------------------------------------------------%
khi=reshape(data,k,n); % Data formating
khi(find(isnan(khi)))=0;
48
APPENDIX D. EOFS CALCULATION MATLAB CODE
% khi2=detrend(khi’)’; % Matlab routine removing the temporal mean (mean of
% the rows) from the data.
% To remove temporal mean (mean of the rows) manually
I=eye(n,n);
J=ones(n,n);
khi2=khi*(I-J./n);
extracted_mean=khi-khi2;% Get back the mean for later total reconstruction
[U,S,V]=svd(khi2,0); % Singular Value Decomposition of data matrix
Calculation and normalization of the EOFs
for j=1:EOF_nb
EOFs(:,j)=(khi2*V(:,j))/norm(khi2*V(:,j));
alpha(:,j)=khi2’*EOFs(:,j);
end
L=S.^2/n;
var_ratio=diag(L)/trace(L); % Calculation of the amount of variance
% explained by each mode
49
Bibliography
Carleton, A., 2003: Atmospheric teleconnections involving the Southern Ocean. J. Geophys. Res,
108 (C4), 8080.
Cavalieri, D. J. and C. L. Parkinson, 2008: Antarctic sea ice variability and trends, 19792006. Journal
of Geophysical Research, 113, C07 004, doi:10.1029/2007JC004564.
Holland, M., C. Bitz, and E. Hunke, 2005: Mechanisms Forcing an Antarctic Dipole in Simulated Sea
Ice and Surface Ocean Conditions. Journal of Climate, 18 (12), 2052–2066.
Kwok, R. and J. Comiso, 2002: Southern Ocean Climate and Sea Ice Anomalies Associated with the
Southern Oscillation. Journal of Climate, 15 (5), 487–501.
Liu, J., J. Curry, and D. Martinson, 2004: Interpretation of recent Antarctic sea ice variability. Geophys.
Res. Lett, 31 (2), 1–4.
Parkinson, C., 2004: Southern Ocean sea ice and its wider linkages: insights revealed from models and
observations. Antarctic Science, 16 (04), 387–400.
Serreze, M., M. Holland, and J. Stroeve, 2007: Perspectives on the arctic’s shrinking sea-ice cover.
Science, 317, 1533–1536, doi:10.1126/science.1139426.
White, W. and R. Peterson, 1996: An Antarctic circumpolar wave in surface pressure, wind, temperature
and sea-ice extent. Nature, 380 (6576), 699–702.
Wunsch, C. and P. Heimbach, 2007: Practical global oceanic state estimation. Physica D, 230(1-2),
197–208.
Yuan, X., 2004: ENSO-related impacts on Antarctic sea ice: a synthesis of phenomenon and mechanisms.
Antarctic Science, 16 (04), 415–425.
50
BIBLIOGRAPHY
Yuan, X. and D. Martinson, 2000: Antarctic Sea Ice Extent Variability and Its Global Connectivity.
Journal of Climate, 13 (10), 1697–1717.
Yuan, X. and D. Martinson, 2001: The Antarctic dipole and its predictability. Geophys. Res. Lett,
28 (18), 3609–3612.
Zwally, H., J. Comiso, C. Parkinson, D. Cavalieri, and P. Gloersen, 2002: Variability of Antarctic sea ice
1979–1998. J. Geophys. Res, 107 (10.1029).
51