`bottom-up' and `top-down' modelling dominant mode analysis: data-based mechanistic...
Post on 28-Mar-2015
221 Views
Preview:
TRANSCRIPT
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
`Bottom-up' and `Top-down' Modelling Dominant Mode Analysis:
Data-Based Mechanistic Modelling and methodology Global Carbon Cycle Examples Conclusions
Dominant Mode Analysis: Simplicity Out of Complexity
Peter YoungCentre for Research on Environmental Systems and Statistics
Lancaster Universityand
Centre for Resource and Environmental StudiesAustralian National University
Prepared for Complexity Seminar, May 26th 2004
• Two Illustrative examples
Data-Based Mechanistic (DBM) Modelling:
An Inductive Approach to Modelling Stochastic, Dynamic Systems
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Data-Based Mechanistic (DBM) Modelling
The general approach of DBM modelling is as follows:1. Define the objectives of the modelling exercise.2. Prior to the acquisition of data (or in situations of insufficient data), develop a
simulation model that satisfies the defined objectives. Then, evaluate the sensitivity of the model to uncertainty and develop a reduced order, dominant mode model that captures the most important aspects of its dynamic behaviour.
3. When sufficient data become available: identify and estimate from these data a parsimonious, dominant mode, stochastic model that again satisfies the defined objectives; reflects the information content in the data; and can be interpreted in physically meaningful terms.
4. Attempt to reconcile the models obtained in 1. and 2.
Data-Based Mechanistic (DBM) modelling has been developed at Lancaster over the past twenty years but is derived from research dating back to the early 1970’s.
Note: not all stages in this modelling process will be required; this will depend on the nature of the problem; the available date; and the modelling objectives.
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
The DBM Approach to Modelling When Data are Available
The most parametrically efficient (parsimonious, identifiable) model structure is first inferred statistically from the available time series data in an inductive manner, based on a generic class of dynamic models.
After this initial inductive modelling stage is complete, the model is interpreted in a physically meaningful, mechanistic manner based on the nature of the system under study and the physical, chemical, biological or socio-economic laws that are most likely to control its behaviour.
This inductive approach can be contrasted with the alternative hypothetico-deductive modelling approach (including simple ‘grey-box’ modelling), where the physically meaningful model structure is based on prior assumptions; and the parameters that characterise this simplified structure are estimated from data only after this structure has been specified by the modeller.
By delaying the mechanistic interpretation of the model in this manner, the DBM modeller avoids the temptation to attach too much importance to prior, subjective judgement when formulating the model equations.
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Methods of Recursive Time Series Analysis
The methodological tools used in DBM modelling are largely based on recursive methods (Gauss, 1826: see Young, 1984) of time series analysis that perform various tasks:
In the case of time-invariant parameter (stationary) models, the statistical identification of the Transfer Function model (an ODE model in the continuous-time case) structure and the optimal estimation of the constant parameters that characterise this structure. In the case of nonstationary and nonlinear models, the estimation of Time Variable (TVP) or State-Dependent (SDP) Parameters that, respectively, characterise the model in these cases. On-line forecasting and data-assimilation (Bayesian recursive estimation).In addition, MCS-based tools for uncertainty and sensitivity analysis are used to evaluate the implications of uncertainty in: (i) derived physically meaningful parameters (e.g. time constants, residence times, decay rates, partition percentages, etc. ); and (ii) model predictions.
Dominant Mode Analysis and Modelling
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Example 1: Uncertainty and Dominant Mode Analysis of the Enting-Lassey Global Carbon
Cycle Model
Research carried out by Stuart Parkinson and Peter Young (1996) (based on a Simulink version of a E-L simulation model developed by Stuart
Parkinson)
Key to Success in Environmental and Biological Systems Analysis
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Globally Averaged Carbon Cycle Climate Data
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
The Enting-Lassey Global Carbon Cycle Model
Stratosphere
Troposphere
Biosphere
Oceanic
Deep Ocean
mixed layer
Fossil
Fuel
Nuclear
Explosions
Cosmic
Rays
(sub-divided into
18 layers)
Biosphere
Young
Old
Light arrows indicate
anthropogenic effects
C decay from all boxes
14
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Comparison of Uncertainty Bounds for Deterministic Model Stochastic Uncertainty Analysis Using MCS
COMPARISON OF UNCERTAINTY IN FUTURE CO2: IPCC SCENARIO IS92A
YEAR UNCERTAINTY INATMOSPHERIC CO2
/ PPMV RANGE
SOURCE
2050 494 to 510
520 to 550491+/-86487+/-44
16
3017288
Enting (1994) pers.comm
Wigley & Raper (1992)Figure 13.5, dashed lineFigure 13.5, dotted line(uncertainties halved)
2100 667 to 719
740 to 800615 to 683664+/-14465+/-71
52
6068288142
Enting (1994) pers.comm
Wigley & Raper (1992)Enting & Lassey (1993)Figure 13.5 dashed lineFigure 13.5, dotted line(uncertainties halved)
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Comparison of Reduced Order and 23rd Order EL Model Impulse Responses
0 10 20 30 40 50 60 70 80 90 1000.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
0.5
0.55Comparison of Reduced Order and EL Model Outputs
Year
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Comparison of Reduced Order and 23rd Order EL Model Responses to Measured Fossil Fuel Input
1860 1880 1900 1920 1940 1960 1980270
280
290
300
310
320
330
Year
Comparison of Model Responses to Fossil Fuel Input
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Parallel Pathway Decomposition of Reduced Order Model: Physical Interpretation
Instantaneous
+
Fossil FuelCarbon Dioxide
Emissions
+
AtmosphericCarbon Dioxide
ResidenceTime4.3 y
ResidenceTime19.1 y
ResidenceTime467 y
3.2%
8.9%
87.1%
0.8%
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
MCS Analysis of Reduced Order Model Showing Uncertainty in the Residence Time Estimates
4.28 4.3 4.32 4.34 4.36 4.38 4.4 4.420
0.5
1
MCS Analysis: Estimated Residence Times
18.9 19 19.1 19.2 19.30
0.5
1
Normalised Frequency
466 466.5 467 467.5 468 468.50
0.5
1
Residence Time (years)
Example 2: Dominant Mode Analysis of the Lenton Global Carbon Cycle (GCC) Simulation
Model
Research currently being carried out by Andrew Jervis, Peter Watkins and Peter Young
(based on a Simulink version of a GCC simulation model developed by Tim Lenton, CEH, Bush, Edinburgh)
Key to Success in Environmental and Biological Systems Analysis
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Lenton Global Carbon Cycle (GCC) Simulation: Top Level of Complete Hierarchical Model
pCO2
Temperature
Ocean C sink
Ocean
Land-use CO2
pCO2
Temperature
Land C sink
Land
Total CO2
Land-use C02
Human influence
Total CO2
Land C sink
Ocean C sink
pCO2
Temperature
Atmosphere
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Lenton GCC Model, Second Level: Sub-Model Inside Green Block of Complete Model
1Land C sink
Land-use CO2
pCO2
T_s
P
R_p
L(C_v)1
VegetationL(C_v)
T_s
R_s1
Soil
3Temperature
2pCO2
1Land-use CO2
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Lenton GCC Model, Third Level: Model Inside Green Block of Sub-Model
Photosynthesis
Respiration
LitterfallNet flux into
vegetation3 L(C_v)1
2 R_p
1 P
C_vout
to workspace1
vfout To Workspace4
C_v0
Steady state vegetation
reservoir
f(u)
R_p(T,C_v)
u-288.15
P(CO2,DT)1
f(u)
P(CO2,DT)
Mux
Mux
k_D
Loss fraction
k_t*u
L(C_v)
s
1
s
1
Integrator
3T_s
2pCO2
1Land-use CO2
T_s
DT
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Second Order DBM Model Estimated from the Response of the High Order Simulation Model to an 1000 Unit Impulsive
Emissions Input
Full simulation model DBM second order reduction of full simulation model
100 200 300 400 500 600 700 800 900 10000
5
10
15
20
25
30
Date
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Second Order DBM Model Mimics the Behaviour of the High Order Simulation Model
pCO_2 Data Full simulation model output DBM second order reduction of full simulation model
1850 1900 1950 20000
1
2
3
4
5
6
7
8
9
10
Date
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Second Order DBM Model Based on the Impulse Response Estimation Mimics the Response of the High Order Simulation
Model to the Historic Emissions Input
Full simulation model DBM second order reduction of full simulation modelError+9 plotted above
1850 1900 1950 20000
1
2
3
4
5
6
7
8
9
10
Date
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Block Diagram and Physical Interpretation of Second Order DBM Reduced Order Model
DBM Modelling from the Global Data
Based on recent research by Jervis and Young
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Globally Averaged Carbon Cycle Climate Data
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Parameterisation of the Identified Non-Parametric SDP Model
The initial non-parametric modelling phase suggests the following SDP differential equation model form:
€
dy(t)
dt= a1{T(t )}y(t) + b0u(t −τ )+ c +ξ(t)
where is a very small, coloured noise process and is a pure time delay. A variety of parameterisations of the SDP were investigated but linear and sigmoidal relation-ships in were found to be most suitable.
Note the pure time delay: this suggests that higher order dynamic processes are possibly operative but their aggregative effect is to introduce the pure time delay (a common effect in real systems).
€
a1{T(t )}
€
T(t )
€
ξ(t )
€
τ
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Parametric Model with External Temperature as an Input: Estimation and Validation
data median 5 percentile 95 percentile Validation origin
1860 1880 1900 1920 1940 1960 1980 2000-0.5
00.5
Error (pa)
1860 1880 1900 1920 1940 1960 1980 20000
1
2
3
4
5
6
7
8
9
10
pCO_2 (pa)
Note the expanded scale on error plot
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Wigley (1993, 2000) Model Fitting Results
A Controlled Future?
Hypothetical but showing how SDP models can provide a vehicle for assessing future carbon dioxide emissions policy
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Achieving Stabilization Concentration Profiles
One approach to a more discerning definition of emission scenarios is to consider the ‘inverse problem’ (e.g. Wigley, 2000): i.e. compute the emission scenarios that are able to achieve a range of ‘stabilization concentration profiles’ for atmospheric , such as those utilized in the studies carried out by the IPCC (Schimel, et al 1996).
An alternative and more flexible approach that we have used is to exploit automatic control theory and so generate an emissions scenario (control input) that achieves some required objective, as defined by a specified ‘optimal’ criterion function. Here, the control system is designed to adjust the emissions input so that one of the required stabilization concentration profiles suggested by Wigley is followed as closely as possible.
€
CO2
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Optimal PIP Control of Emissions for Future Atmospheric Carbon Dioxide and Global Temperature Stabilization: Simulink Diagram
5 Year PureTime Delay
uu
Controlled Input to Workspace4
+
+
+
Sum
1/s
Emissions-CO2 Model
temp
Temp. Anomalyto Workspace3
yh
CO2 to Workspace
Scope
-+
Sum2
mean(yss)
Constant1
e
s+f
CO2-Temp. AnomalyModel
1
Outport
a
Gain2
c
Gain
d
Constant
[T,uss]
Historic Emissions up to2000 from Workspace
k
1-z -1
Integral Control
[Td,yd]
Desried CO2 Profile fromWorkspace
+-
Sum4
Switch to introduce PIP
control in2000
+-
Sum6
1
g
'Plus' ControlInput Prefilter
h Proportional Control
Note:most conservative linear model
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
PIP Emissions Policy for Future Atmospheric Carbon Dioxide and Global Temperature Stabilization using the Linear DBM Model
pCO2 Co2 emissions Temperature anomaly5 percentile 95 percentile
1900 1950 2000 2050 2100 2150 2200
0
2
4
6
8
10
12
14
16
18
pCO2, CO2 Emissions and Temperature anomaly
Date
Historic PIP Control Policy
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Comparison with Wigley’s Stabilization Results
We see that the objectives have been realized by 2200, with an emissions scenario stabilizing at a level of 7.4 Gt y-1, with a 5%-95% percentile range between 6.1 and 8.5 Gt y-1; i.e. between 3.6 to 6 Gt y-1 higher than that computed by Wigley for the same profile: i.e. the same result is achieved with a much smaller and more practically feasible reduction in the emissions. Note also that, unlike Wigley, we are able to provide a estimate of the uncert-ainties involved
We believe that this is a very important result. Wigley’s specifications for the emissions would be impossible to achieve in practice; whereas those suggested here are reasonably realistic (provided, of course, that binding international agreements are negotiated and maintained - which is an entirely different matter!)
€
CO2
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Large simulation models are products of the scientist's (hopefully inspired) imagination, most often constructed within a deterministic-reductionist (`bottom-up') paradigm using a hypothetico-deductive approach to modelling.
The responses to external stimuli of such models (and the systems they seek to simulate) are dictated by the dominant modes of the system behaviour, which are normally few in number. As a result, there is only sufficient information in the data (simulated or real) to identify and estimate the parameters that characterise the dominant modal behaviour: i.e. the large model is not fully identifiable from the data.
Data-Based Mechanistic (DBM) `Top-Down' models, identified and estimated in relation to observational data (simulated or real) using an inductive approach, capture the essence of the modally-dominant mechanisms.
CONCLUSIONS
RESearch
C
The Key to Success in
Environmental and Biological
Systems Analysis
Large simulation models can be reduced to their modally dominant form by DBM model reduction Dominant Mode Analysis (DMA) applied to data obtained from experiments conducted on the simulation model. This can help the modeller to better understand the strengths and limitations of the simulation model and what parts of it may be estimated from real data.
Often, the DBM models obtained from either simulated or real data will not have sufficient fine detail to allow for `what-if' studies. However, such detail can be added to the model in the same way that it would be in the synthesis of a simulation model (e.g. adding a wier to a DBM flow-routing model).
All of the analyses reported here were carried out using our MatlabToolbox CAPTAIN: http://www.es.lancs.ac.uk/cres/captain/.
CONCLUSIONS continued
top related