spatial interpolation & geostatistics

26
Geo327G/386G: GIS & GPS Applications in Earth Sciences Jackson School of Geosciences, University of Texas at Austin Spatial Interpolation & Geostatistics Lag Mean + Distance between pairs of points (Z i –Z j ) 2 / 2 + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + Lag 11/5/2019 1

Upload: others

Post on 18-Mar-2022

31 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Spatial Interpolation & Geostatistics

Lag Mean

+

Distance between pairs of points

(Zi–

Z j)2 /

2

++++

+

++

+

+

+

++

+

+

++ +

+ +

++

+

++ + ++

++

+

++

+ +

Lag

11/5/2019 1

Page 2: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Kriging – Step 1

Describe spatial variation with Semivariogram

x

y

7

56

67

5

5

34

2

++

Distance between pairs of points

(Zi – Zj)2 / 2

+++ +

+

++

+

+

+

++

+

+

++ +

++

++

+

++ + +

+

+

++

+

++

+ +

“Point cloud”

11/5/2019 2

Map

Page 3: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Kriging – Step 2Summarize spatial variation with a function

Several choices possible; curve fitting defines different types of Kriging (circular, spherical, exponential, gaussian, etc.)

+

Distance between pairs of points

(Zi–

Z j)2 /

2

+++ +

+

++

+

+

+

++

+

+

++ +

++

++

+

++ + +

+

++

+

++

+ +

11/5/2019 3

Page 4: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Kriging – Step 2

Key features of fitted variogram:

Nugget

Distance (d)

Sill

RangeSe

miv

aria

nce

Nugget: semivariance at d = 0

Range: d at which semivariance is constant

Sill: constant semivariance beyond the range

11/5/2019 4

Page 5: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Kriging – Part II

Goal: predict values where no data have been collectedRelies on first establishing:

DEPENDENCY – z is, in fact, correlated with distance

STATIONARITY – z values are stochastic (except for spatial dependency they are randomly distributed) and have no other dependence – use “detrending” or transformation tools if not Gaussian

DISTRIBUTION – works best if data are Gaussian. If not they have to first be made close to Gaussian.

11/5/2019 5

Page 6: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

ESRI Geostatistical Analyst Products

Map types:

Prediction – contours of interpolated values

Prediction Standard Errors – show error distribution, as quantified by minimized RMS error (see below)

Probability – show probability of values exceeding a specified threshold

Quantile – show where thresholds overestimate or underestimated predictions

11/5/2019 6

Page 7: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

ESRI Geostatistical Analyst Products

Maps: (e.g. max. ozone concentration, 1999)

Prediction

Prediction Standard Errors

Probability

Quantile

11/5/2019 7

Figure from ESRI “Intro. to Modeling Spatial Processes Using Geostatistical Analyst”

Page 8: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Some Kriging Products

Prediction map – interpolated values

Probability map- showing where critical values exceeded

11/5/2019 8

Figures from ESRI “Using Geostatistical Analyst”

Dark orange and red indicate a probability greater than 62.5% that radiocesium contamination exceeds the critical threshold in forest berries.

Prediction map of radioceasium soil contamination levels in of Belarus after the Chernobyl nuclear power plant accident.

Probability MapPrediction Map

Page 9: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Kriging – Part II

Goal: predict values where no data have been collected Relies on first establishing:

DEPENDENCY – z is, in fact, correlated with distance

STATIONARITY – z values are stochastic (except for spatial dependency they are randomly distributed) and have no other dependence – use “detrending” or transformation tools if not Gaussian

DISTRIBUTION – works best if data are Gaussian. If not they have to first be made close to Gaussian.

11/5/2019 9

Page 10: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Spatial Dependence:

Semivariogram and Semivariogram Surface

11/5/2019 10

Figures from ESRI “Intro. to Modeling Spatial Processes Using Geostatistical Analyst”

Strong Spatial Dependence Weak Spatial Dependence

Page 11: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Spatial Dependence:

Cross-Validation Diagnostic

Use a subset of the data to test measured vs. predicted values

11/5/2019 11

Strong Spatial Dependence Weak Spatial Dependence

Figures from ESRI “Intro. to Modeling Spatial Processes Using Geostatistical Analyst”

Page 12: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Kriging – Part II

Goal: predict values where no data have been collected

Relies on first establishing:DEPENDENCY – z is, in fact, correlated with distance

STATIONARITY – z values are stochastic (except for spatial dependency they are randomly distributed) and have no other dependence – use “detrending” or transformation tools if not Gaussian

DISTRIBUTION – works best if data are Gaussian. If not they have to first be made close to Gaussian.

11/5/2019 12

Page 13: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

2. Stationarity

Test with semivariogram & cross-validation plots

11/5/2019 13

Semivariogram and temperature distribution map of winter temperature data for the USA.

Figure from ESRI “Using Geostatistical Analyst”

• Violates Stationarity –detrend first

stationarity is the assumption that the covariance between two points is the same for a given distance and direction, regardless of which two points are chosen.

Page 14: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

2. STATIONARITY - Randomness Data variance and mean is the same at all localities (or within a

neighborhood of nearest points); data variance is constant in the neighborhood of investigation

Correlation (covariance) depends only on the vector that separates localities, not exact locations, number of measurement or direction (anisotropy)

11/5/2019 14

Figures from ESRI “Intro. to Modeling Spatial Processes Using Geostatistical Analyst”

Page 15: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

California Ozone Demo.

Data in “Geostat_demo” folder

11/5/2019 16

Page 16: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

ArcGIS Kriging Processing Steps

1. Add and display the data

2. Explore the data’s statistical properties

3. Select a model to create a surface – make a prediction map!

4. Assess the result

5. Compare to other models

11/5/2019 17

Page 17: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Data Exploration

1. Examine the distribution – normal (Gaussian)? Transformation to normal required?

Histograms and QQPlots

2. Identify trends, if any

Trend Analysis

3. Understand spatial autocorrelation and directional influences

Semivariogram analysis

11/5/2019 18

Page 18: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Data Exploration:

Examine the Distribution

Normal (Gaussian) distribution? (central value, spread, symmetry;

mean and median the same?) Standard Normal: mean = 0, standard dev. =1

Transformation to normal required?

Histogram tool, QQPlot tool (compare real and standard normal distributions)

11/5/2019 19

= nearly Normally Distributed data

unimodal, near-symmetric

straight line

Page 19: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Data Exploration:

Identify Trends, If Any

Underlying trends affect Kriging assumption of randomness – remove and work with “residuals”

Trend Analysis tool

11/5/2019 20

Strong ~NE-SW “U-shaped” Trend

Weaker ~NW-SE linear Trend

Page 20: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Data Exploration:

Spatial Autocorrelation & Directional Influences

Variogram Analysis:

Look for correlation with distance

Look for directional trends among pairs of points

Semivariogram/ Covariance Cloud tool

11/5/2019 21

Directional Influence shown here – pairs in NNW-SSE direction show largest covariance over shortest distances

Exhibits Spatial Autocorrelation

Page 21: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

ArcGISKriging Processing Steps

1. Add and display the data

2. Explore the data’s statistical properties

3. Select a model to create a surface – make a prediction map!

4. Assess the result

5. Compare to other models

11/5/2019 22

Page 22: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Mapping Ozone Concentration

1. Incorporate results of Data Exploration into Model selection

This example:

remove underlying trends discovered during data exploration that have a rational explanation. (Analysis is then performed on residuals and trend surface is added back into final surface) = “Detrending”

Remove directional trends between pairs of points – in certain directions closer points are more alike than in other directions = “anisotropy removal”

Lag Size – Important! Rule of Thumb: (Lag size) x (# of lags) = 0.5 largest distance (range) among points

11/5/2019 23

Page 23: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Mapping Ozone Concentration –Interpolation & Cross Validation

2. Define search neighborhood for interpolation (c.f. I.D.W.)

Use a search ellipse (or circle) to find nearest neighbors; specify radii of ellipse, min. & max. number of points per sectors

3. Examine Cross Validation plotPredicted vs. Measured for subset(s) of the data

“Mean error” should be close to zero

“RMS error” and “mean standardized error” should be small

“RMS standardized error” should be close to one.

11/5/2019 24

Page 24: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

ArcGIS Kriging Processing Steps

1. Add and display the data

2. Explore the data’s statistical properties

3. Select a model to create a surface – make a prediction map!

4. Assess the result – Cross Validation Plots

5. Compare to other models

11/5/2019 25

Page 25: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Comparing Model Results

Cross validation comparisons:

11/5/2019 26

• “Mean error” should be close to zero

• “RMS error” and “mean standardized error” should be small

• “RMS standardized error” should be close to one.

Page 26: Spatial Interpolation & Geostatistics

Geo327G/386G: GIS & GPS Applications in Earth SciencesJackson School of Geosciences, University of Texas at Austin

Probality Mapping with Indicator Kriging

Task: Make a map that show the probability of exceeding a critical threshold, e.g. 0.12 ppm ozone for an 8 hr. period

Technique:

Transform data to a series of 0s and 1s according to whether they are above or below the threshold

Use a semivariogram on transformed data; interpret indicator prediction values as probabilities

11/5/2019 27