atmospheric data assimilation.pptx

Upload: jason-pajimola-punay

Post on 03-Apr-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    1/70

    ATMOSPHERICDATA ASSIMILATION

    By Roger Daley*

    Journ al of the Meteoro log ical Soc iety o f Japan, Vol. 75, No. 1B, pp.319-329, 1997

    (Manu scr ipt received 23 May 1995, in revis ed from 15 February 1996)

    *Naval Research Laboratory, 7 Grace Hopper Avenue, Monterey CA 93943-5502, USA

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    2/70

    DEFINITION

    Data assimilation is an analysis technique

    in which the observed information is

    accumulated into the model state by taking

    advantage of consistency constraints with laws of

    time evolution and physical properties.

    -F. Bouttier and P. Courtier

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    3/70

    OBJECTIVE (1)

    to produce a regular, physically consistent

    4 dimensional representation of the state of the

    atmosphere from a heterogeneous array of in situand remote instruments which sample imperfectly

    and irregularly in space and time.

    -Roger Daley

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    4/70

    OBJECTIVE (2)

    to provide a dynamically consistent

    motion picture of the atmosphere and

    oceans, in three space dimensions, with

    known error bars.

    -M. Ghil and P. Malanotte-Rizzoli

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    5/70

    OBJECTIVE (3)

    Extracts the signal from noisy observations

    (filtering)

    Interpolates is space and time (interpolation)

    Reconstructs state variables that are not

    sampled by the observation network

    (completeness)

    -R.Daley

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    6/70

    KEEP IN MIND (1) What is the purpose of the DA weather

    prediction, physical understanding, signal

    detection, environmental monitoring, etc?

    What are the physical characteristics of thephenomenon of interest?

    What are its temporal and spatial

    characteristics and what relations exist

    between state variables? What are the characteristics of other

    physical phenomena which might obscure

    the desired signal?

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    7/70

    KEEP IN MIND (2)

    What are the characteristics of the observing

    system?

    Is the observing system largely under the

    control of the scientist (as in field

    experiment) or is it given?

    Is it possible to influence the design of the

    observing system, can DA techniques be

    used in the observation system design?

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    8/70

    KEEP IN MIND (3)

    All models and observations are approximate

    The resulting analyses will be approximate

    The observations must be combined in some

    optimal fashion

    It is better to have enough observations tooverdetermine the problem

    The model is used to provide the preliminary

    estimate

    The final estimate should fit the observations withintheir (presumed) observation error.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    9/70

    MAXIMUM LIKELIHOOD

    ESTIMATION (1)

    Zero dimensional/scalar case and a definevariable x

    Observation x o and a forecast x f (produced bya model)

    Observation error: o= x ox Model error: f= x fx These errors are assumed to be random,

    unbiased, normally distributed.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    10/70

    MAXIMUM LIKELIHOOD

    ESTIMATION (2)

    A variable w/c is normally distributed with mean 0

    and variance 2 has a probability distribution

    () = 2 0.5 _1exp(2/22)

    The joint probability distribution of errors is

    (0, ) = 20.5 _1exp(2

    /22 2

    /22)

    error variance

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    11/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    12/70

    MINIMUM VARIANCE

    ESTIMATION

    Unbiased linear estimate ofxx e = cox o + cfxf(co, cf non-negative)

    co + cf = 1 Unbiased linear estimate error:e= x ex Expected error variance ofx: = (co)22+ (cf)22 Best LinearUnibiasedEstimate

    x e = (2x o + 2x f )(2+ 2)-1

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    13/70

    MAXIMUM LIKELIHOOD ESTIMATE

    VS.

    MINIMUM VARIANCE ESTIMATE The maximum likelihood estimate

    x a = x f + 2(2 + 2)-1(x o - x f)-finds the mode Best LinearUnibiasedEstimate

    x e = (2x o + 2x f )(2+ 2)-1-finds the mean

    When the error probabilities are normally-distributed as in() = 2 0

    .5 _1exp(2/22) , the mean and themode and the minimum variance and maximum likelihoodestimates are the same.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    14/70

    THE L2 NORM (1)

    In most meteorological practice, L2 norms are usedbecause they lead to linear analysis equations.

    L2 norm estimation yields the mean, L1 estimationgives the median and Lestimation determines themid-range.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    15/70

    THE L2 NORM (2)

    Example: estimation based on 5 observations.

    Assume that each observation has the same

    observation error variance 2

    The 5 observation values:

    -22.5, 1.1, 1.2, 1.3 and 650

    It seems likely that there must have been severe

    measurement problems in the first and lastobservations.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    16/70

    THE L2 NORM (3)

    The mean value of the observations is 126.3 (L2)

    The median value is 1.2 (L1)

    The mid-range value is 313.95 (L)

    In this example, minimization with respect to the L1norm gives the most credible estimate.

    The L1 norm is much superior to the L2 norm when itcomes to detecting and removing gross errors.

    In atmospheric data assimilation, there are situationswhere errors are not normally distributed.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    17/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    18/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    19/70

    All observing systems have their limitations, problems,

    and failures, resulting in the reported measurementsbeing sometimes incorrect.

    Such data must be identified and rejected by the data

    assimilation system in order to avoid corruption of theanalysis.

    Due to the amount of data handled this is done byautomatic routines, both in the form of preprocessing andduring the data assimilation stage.

    Xiang-Yu Huang and Henrik Vedel

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    20/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    21/70

    THREE-DIMENSIONAL

    SPATIAL ANALYSIS

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    22/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    23/70

    THE VECTOR CASE (2)

    Assume that the observation and forecast error are

    unbiased, normally-distributed and not mutually

    correlated. That is,

    = = 0

    T is the matrix transpose.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    24/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    25/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    26/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    27/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    28/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    29/70

    GENERAL OBSERVATION NETWORKS(3) THE MODIFIED COST FUNCTION

    Orig: J = 0.5[xo-xa]TR-1[xo-xa] + 0.5[xf-xa]T[Pb]-1[xf-xa]

    Mod: J = 0.5{[yo-H(xa)]TR-1[yo-H(xa)] + 0.5[xf-xa]T[Pb]-1[xf-xa]

    Reason: the observed and forecast variables are not

    necessarily the same. Observed variable asyo and forecast variable as xf.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    30/70

    GENERAL OBSERVATION NETWORKS(4) THE MODIFIED MAXIMUM LIKELIHOOD ESTIMATE

    Orig: xa = xf+ Pb[Pb + R]-1[xo-xf]

    Mod: xa = xf+ PbHT[HPbHT+ R]-1[yoH(xf)]

    Note: H(x) is frequently a non-linear operator, but it

    can be linearized by defining the tangent linear operatorH = ()

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    31/70

    GENERAL OBSERVATION NETWORKS(5) THE MODIFIED ANALYSIS ERROR COVARIANCE

    Orig: [Pa]-1 = R-1 + [Pb]-1

    Mod: [Pa]-1 = HTR-1H + [Pb]-1

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    32/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    33/70

    THE OPTIMAL INTERPOLATION (OI)

    TECHNIQUES (2)

    OI is often simplified so that it does not produce a whole

    domain analysis, but rather a number of local analyses at

    each gridpoint or small grid volume.

    OI methodology is sufficiently powerful to perform

    credible multivariate analysis that is, where the

    observations of one variable (temperature, say) are used

    in the analysis of another variable (wind, say).

    This is done by incorporating linear diagnostic relations

    (such as geostrophic and hydrostatic balance) between

    two variables in the forecast error covariance Pb

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    34/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    35/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    36/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    37/70

    THE FORECAST/ANALYSIS CYCLE (1)

    We modify the notation to introduce forecast x

    and

    analysis x vectors at time

    We define a model, which marches forward in time from

    time to time x = M (x )

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    38/70

    THE FORECAST/ANALYSIS CYCLE

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    39/70

    THE FORECAST/ANALYSIS CYCLE(3) THE CYCLE

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    40/70

    DYNAMICALLY-GENERATED ANALYSIS WEIGHTS(1) THE KALMAN FILTER

    While the forecast x

    is responsive to all the complexitiesof atmospheric flow simulated by the model, the forecast

    error covariancesbspecified in the OI and 3DVAr

    algorithms are completely insensitive to the flow.

    Modern assimilation techniques attempt to generate the

    analysis weights dynamically, explicitly using the model

    M.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    41/70

    DYNAMICALLY-GENERATED ANALYSIS WEIGHTS(2) THE KALMAN FILTER

    One way of doing this, is through an explicit evolution

    equation for the forecast error covariance.

    Defining the tangential linear model =

    ()

    and

    assuming that model M is imperfect, the forecast error at timetn is related to the analysis error covariance at time tn-1 by

    b =

    T

    Where Qn= is the model error covariance and

    nis the model error

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    42/70

    DYNAMICALLY-GENERATED ANALYSIS WEIGHTS(3) THE KALMAN FILTER

    The fundamental equation of the Kalman filter algorithm:

    b =

    T

    (an equation for the propagation of second moment error statistics)

    The non-linear form of Kalman filter is referred to asextended Kalman filter (EKF)

    The EKF is a sequential algorithm that makes use of the

    past and present observations.

    Statistical error moments higher than the second are

    generated.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    43/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    44/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    45/70

    THANK

    YOU

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    46/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    47/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    48/70

    THE INVERSE PROBLEM (2)

    The objective of an inverse problem is to find the bestmodel m such that (at least approximately)

    d = G(m)

    where G is an operator describing the explicitrelationship between the observed data, d , and the modelparameters.

    In various contexts, the operator G is calledforwardoperator, observation operator, orobservation function.

    In the most general context, G represents the governingequations that relate the model parameters to theobserved data (i.e. the governing physics).

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    49/70

    SIMPLE KALMAN FILTER EXAMPLE (1)SOURCE: http://credentiality2.blogspot.com/2010/08/simple-kalman-filter-example.html

    Kalman filters are a way to take a bunch of noisy

    measurements of something, and perhaps also some

    predictions of how that something is changing, and maybeeven some forces we're applying to that something, and to

    efficiently compute an accurate estimate of that

    something's true value.

    http://credentiality2.blogspot.com/2010/08/simple-kalman-filter-example.htmlhttp://credentiality2.blogspot.com/2010/08/simple-kalman-filter-example.htmlhttp://credentiality2.blogspot.com/2010/08/simple-kalman-filter-example.htmlhttp://credentiality2.blogspot.com/2010/08/simple-kalman-filter-example.htmlhttp://credentiality2.blogspot.com/2010/08/simple-kalman-filter-example.htmlhttp://credentiality2.blogspot.com/2010/08/simple-kalman-filter-example.htmlhttp://credentiality2.blogspot.com/2010/08/simple-kalman-filter-example.htmlhttp://credentiality2.blogspot.com/2010/08/simple-kalman-filter-example.html
  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    50/70

    SIMPLE KALMAN FILTER EXAMPLE (2)

    Let's say we want to measure the temperature in a room. We

    think it's about 72 degrees, 2 degrees. And we have a

    thermometer that gives uniformly random results within a

    range of 5 degrees of the true temperature.

    We take a measurement with the thermometer and it reads

    75. So what's our best estimate of the true temperature?

    Kalmanfilters use a weighted average to pick a point

    somewhere between our 72 degree guess and the 75 degreemeasurement.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    51/70

    SIMPLE KALMAN FILTER EXAMPLE (3)

    Here's how we choose the optimal weight, given the accuracy ofour guess and the accuracy of the thermometer:

    weight =temperature variance

    temperature variance + thermometer variance

    0.29 =

    +

    If the weight is large (approaching 1.0), we mostly trust ourthermometer. If the weight is small, we mostly trust our guess

    and ignore the thermometer. 29% weight means we'll trust our guess more than the

    thermometer, which makes sense, because we think our guessis good to 2 degrees, whereas the thermometer was onlygood to 5

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    52/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    53/70

    SIMPLE KALMAN FILTER EXAMPLE (4)

    how confident are we in our estimate of 72.87 degrees?

    estimate variance =temperature variancethermometer variance

    temperature variance + thermometer variance

    1.43 = +

    So we think our estimate is correct to 1.43 degrees

    we have a guess that the temperature in the room is

    72.87 degrees, 1.43 degrees.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    54/70

    SIMPLE KALMAN FILTER EXAMPLE (5)

    Now from the guess that the temperature in the room is

    72.87 degrees, 1.43 degrees.

    And we still have a thermometer that tells the temperature 5 degrees.

    That's basically the situation where we started, so we can run the wholealgorithm again:

    First we compute the weight, using our new, more accurate guess

    confidence:

    weight = temperature variancetemperature variance + thermometer variance

    0.22 =.43

    .43+

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    55/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    56/70

    SIMPLE KALMAN FILTER EXAMPLE (7)

    And the new confidence level:

    estimate variance =temperature variancethermometer variance

    temperature variance + thermometer variance

    1.11 = .43.43+

    So after the second measurement, we estimate that the

    actual temperature is

    72.46 degrees, 1.11 degrees

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    57/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    58/70

    HIGH FREQUENCY INTERFERENCE (1)

    The atmosphere has a number of timescales.

    In general, only limited frequency band will be important

    for a given atmospheric DA application.

    For synoptic and planetary scale forecast/analysespurposes, it is timescales of approximately one hour to

    one week which are of interest

    For mesoscale or convective scale modeling, the

    relevant timescales are minutes to hours For climate/environmental monitoring purposes,

    timescales from weeks to decades are of interest.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    59/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    60/70

    HIGH FREQUENCY INTERFERENCE (3)

    In atmosphere, high frequency modes generally haverelatively low amplitude.

    However, when models are integrated from analyses

    produced from observations by some analysisalgorithm, high frequency oscillations of an amplitudemuch larger than observed in nature may be excited.

    These oscillations may completely obscure thephenomena of interest or may cause perfectly goodobservations to be rejected.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    61/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    62/70

    HIGH FREQUENCY INTERFERENCE (5)

    Illustration of the effect of initialization.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    63/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    64/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    65/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    66/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    67/70

    THE PRIMITIVE EQUATIONS

    V(dot)+f k + = ()

    +=

    +

    =

    The LHS of the equation s have been linearized about a state of

    rest w ith a domain-averaged temperature f ield.

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    68/70

    INTIALIZATION

    Primitive equations are written as:

    z (dot) + Wzz = Rz

    y (dot) + Wyy = Ry

    Notes: Amplitudes of high frequency modes denoted by vectorz and the

    corresponding frequencies by the diagonal matrix Wz Amplitudes of slower (Rossby) modes denoted by vectory and the

    corresponding frequencies by the diagonal matrix Wy

    Rz and Ry are projections of Rv and R onto the fast and slowmodes, respectively and each is a non-linear function of both zand y

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    69/70

  • 7/28/2019 Atmospheric DATA ASSIMILATION.pptx

    70/70

    SUPPRESSION OF HIGH FREQUENCIES IN

    KF AND 4DVAR

    KF

    -high frequencies can be suppressed using normal

    mode theory of Q

    4DVAR

    -most successful is using the Machenhauer condition

    as a constraint in the minimization of J = Jp + Jn (from n=0to N)