gaussian processes in spatial statistics - university of...

18
Gaussian processes in spatial statistics Emiko Dupont 29 January 2018 Emiko Dupont Gaussian processes in spatial statistics

Upload: others

Post on 10-Jun-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Gaussian processes in spatial statistics

Emiko Dupont

29 January 2018

Emiko Dupont

Gaussian processes in spatial statistics

Page 2: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

What is a Gaussian process/Gaussian random field?Definition:Stochastic process {Z (s) | s ∈ D}, D︸︷︷︸

spatialdomain

⊂ Rd

Any finite collection {Z (s1), . . . ,Z (sk)} is multivariate normal:Z (s1)...

Z (sk)

∼ N

µ(s1)

...µ(sk)

, Cov(Z (si ),Z (sj))

Emiko Dupont

Gaussian processes in spatial statistics

Page 3: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

What is a Gaussian process/Gaussian random field?Definition:Stochastic process {Z (s) | s ∈ D}, D︸︷︷︸

spatialdomain

⊂ Rd

Any finite collection {Z (s1), . . . ,Z (sk)} is multivariate normal:Z (s1)...

Z (sk)

∼ N

µ(s1)

...µ(sk)

, Cov(Z (si ),Z (sj))

Note:In particular, Z (s) ∼ N(µ(s),Var(s)) for all s ∈ D

Emiko Dupont

Gaussian processes in spatial statistics

Page 4: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

What is a Gaussian process/Gaussian random field?

Spatial field: {Z (s) | s ∈ D}, D ⊂ R2

White noise

Z(s) ∼iid N(0, σ2)Any finite collection {Z(s1), . . . ,Z(sk)} ∼ N(0, σ2I )

Emiko Dupont

Gaussian processes in spatial statistics

Page 5: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

What is a Gaussian process/Gaussian random field?

Spatial field: {Z (s) | s ∈ D}, D ⊂ R2

Z (s) = concentration of mineral at location s

µ(s) = µCov(Z(s1),Z(s2)) = exp(−|s2 − s1|2/ R2︸︷︷︸

rangeparameter

)

Emiko Dupont

Gaussian processes in spatial statistics

Page 6: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

What is a Gaussian process/Gaussian random field?

Spatial field: {Z (i) | i = 1, . . . ,N}, N regions

Z (i) = relative risk of lung cancer in region i

Covariance: Neighbouring regions more similar than those far apart

Emiko Dupont

Gaussian processes in spatial statistics

Page 7: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

What are Gaussian processes used for?

Improve inference:

Identify spatial correlation structure/clustering

More powerful inference by pooling data

Prediction: Given observations of Z (s) at locations s1, . . . , sn

Estimate∫AZ (s)ds (e.g. total quantity of ore across region A)

Reconstruct entire field Z (s) (e.g. global sea surface temperature)

Emiko Dupont

Gaussian processes in spatial statistics

Page 8: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Applications

geology (e.g. estimating mineral concentration for mining)

environmental sciences (e.g. assessing time trends/spatial trends inflood risk/sea ice concentration/sea temperature...)

ecology (e.g. assess fish stock to avoid overexploitation)

epidemiology (e.g. understanding spatial distribution of diseases)

econometrics (e.g. financial time series modelling)

...

Emiko Dupont

Gaussian processes in spatial statistics

Page 9: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Gaussian process models

What’s so special about Gaussians?

A Gaussian is completely determined by its mean and covariance

Gaussians behave nicely under addition, conditioning etc.

Gaussians are often good approximations of other distributions

Emiko Dupont

Gaussian processes in spatial statistics

Page 10: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Gaussian process models

What’s so special about Gaussians?

A Gaussian is completely determined by its mean and covariance

Gaussians behave nicely under addition, conditioning etc.

Gaussians are often good approximations of other distributions

Common assumption:

Isotropy: Covariance depends only on |s1 − s2|

Cov(Z (s1),Z (s2)) = C (|s1 − s2|)︸ ︷︷ ︸covariance function

e.g. exponential/spherical/Matern

Typically: nearby points are more similar than those far apart

Emiko Dupont

Gaussian processes in spatial statistics

Page 11: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Estimating the spatial structure

Given z = (z1, . . . , zn) observations of Z (s) at locations s1, . . . , sn.

Assumption: Mean and variance known up to unknown parameters.

Goal: Estimate parameters

Emiko Dupont

Gaussian processes in spatial statistics

Page 12: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Estimating the spatial structure

Given z = (z1, . . . , zn) observations of Z (s) at locations s1, . . . , sn.

Model:z | β, α, θ︸ ︷︷ ︸

unknownparameters

∼ N(Xβ, αV (θ))

X observed covariates at locations s1, . . . , sn

For example:Z (s) = sea surface temperature at location sX = salinity at locations s1, . . . , snExponential covariance function with unknown range parameter θ = R

Emiko Dupont

Gaussian processes in spatial statistics

Page 13: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Estimating the spatial structure

Given z = (z1, . . . , zn) observations of Z (s) at locations s1, . . . , sn.

Model:z | β, α, θ︸ ︷︷ ︸

unknownparameters

∼ N(Xβ, αV (θ))

X observed covariates at locations s1, . . . , sn

Parameter estimation:

Maximum likelihood: (β, α, θ) = argmax f (z | β, α, θ)︸ ︷︷ ︸likelihood of data

Bayesian method: posterior ∝ prior × likelihood

Emiko Dupont

Gaussian processes in spatial statistics

Page 14: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Prediction: Kriging

Goal:Given: z = (z1, . . . , zn) observations of Z (s) at locations s1, . . . , snPredict z0 = Z (s0) in unobserved location s0

Emiko Dupont

Gaussian processes in spatial statistics

Page 15: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Prediction: Kriging

Goal:Given: z = (z1, . . . , zn) observations of Z (s) at locations s1, . . . , snPredict z0 = Z (s0) in unobserved location s0

Assumption: Covariance structure is known

Model

z ∼ N(Xβ,Σ), z0 ∼ N(xT0 β, σ20), Cov(z , z0) = τ

x0,X = observed covariates at locations s0, s1, . . . , snβ = unknown coefficients of covariatesσ20 , τ,Σ = known covariances

Emiko Dupont

Gaussian processes in spatial statistics

Page 16: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Prediction: Kriging

Goal:Given: z = (z1, . . . , zn) observations of Z (s) at locations s1, . . . , snPredict z0 = Z (s0) in unobserved location s0

Prediction: Choose z0 = λT z so that

z0 is unbiased (E(z0) = z0)

Mean squared prediction error E((z0 − z0)2) = Cov(z0) is minimised

Emiko Dupont

Gaussian processes in spatial statistics

Page 17: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

Tools for estimation and prediction of Gaussian processes

Frequentist methods

Directly optimise likelihood/REML/prediction error

nlme (linear mixed model formulation of Gaussian process) (uses MLor REML)

mgcv (GAM formulation) (uses penalised likelihood method)

Bayesian methods

Markov Chain Monte Carlo (WinBUGS/JAGS/Stan)

INLA for Gaussian Markov random fields (GMRFs) (uses integratednested Laplace approximation)

Emiko Dupont

Gaussian processes in spatial statistics

Page 18: Gaussian processes in spatial statistics - University of Bathpeople.bath.ac.uk/wg270/ITT7/EmikoDupontGaussianProcesses.pdf · Gaussian process models What’s so special about Gaussians?

ReferencesA. E. Gelfand, P. Diggle, P. Guttorp, and M. Fuentes,Handbook of spatial statistics, CRC press, 2010.

C. Paciorek, Technical vignette 3: Kriging, interpolation, anduncertainty: Department of biostatistics, Harvard School of PublicHealth, Version, 1 (2008).

H. Rue, A. Riebler, S. H. Sørbye, J. B. Illian, D. P.Simpson, and F. K. Lindgren, Bayesian computing with inla: Areview, arXiv preprint arXiv:1604.00860, (2016).

R. L. Smith, Environmental statistics, Facultad de CienciasEconomicas, Universidad Nacional del Cuyo, 1999.

Y. Sun, B. Li, and M. G. Genton, Geostatistics for largedatasets, in Advances and challenges in space-time modelling ofnatural events, Springer, 2012, pp. 55–77.

S. N. Wood, Generalized additive models: an introduction with R,CRC press, 2017.

Emiko Dupont

Gaussian processes in spatial statistics