stochastic monte carlo methods for non-linear statistical inverse problems benjamin r. herman...

16
Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New York [email protected]

Upload: morgan-walton

Post on 01-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Stochastic Monte Carlo methods for non-linear statistical inverse problems

Benjamin R. HermanDepartment of Electrical EngineeringCity College of New [email protected]

Page 2: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Overview

• Aerosol lidar system

• Retrieval of aerosol properties from lidar measurements

• Metropolis-Hastings algorithm for uncertainty analysis

• Techniques for its efficient implementation

Page 3: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

CCNY Aerosol/Water vapor lidar

Transmits light at 1064, 532, and 355nm

Detects signals at transmitted wavelengths as well as two wavelengths shifted from 355nm by Raman scattering from N2 and H2O

1: Three wavelength pulsed Nd:YAG Laser2: Newtonian Telescope3: Receiver optics and photo-detectors4: Licel transient recorder, analog and photon counting

Page 4: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Optical coefficients and lidar equations

tatmtmtatr RTRTRRR

ROCERP ,,),(),(

)(),( 22

2

Rmama RRRT

0 || d),(exp,

ratarmtmrtNr RTRTRTRTRR

ROCERP ,,,,,,

)(),( Ram,2 2

Elastic lidar equation

Raman lidar equation

tcoefficien Extinction -

ion transmissOptical -

tcoefficienr backscatteRaman Nitrogen -

molecular)or aerosol (

tcoefficienr backscatte Elastic -

|

|

Ram,

|

2

ma

ma

N

ma

T

function overlap Geometric -)(

tcoefficienn calibratio System -

energy pulseLaser -

h wavelengtReceived -

h wavelengtdTransmitte -

signal Optical - ),(

Range -

RO

C

E

RP

R

r

t

r

Page 5: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

λ=1064nm λ=1064nm

λ=355nm λ=355nm

λ=532nm λ=532nm

Multi-mode aerosol model and optical coefficients

M

iiniiin rmrKrrnVy

10

d);(),;()( θ

),,,,,,,( 1111 MMMM mrVmrV θ

Aerosol is modeled as being composed of different constituents, each with a size distribution and index of refraction

Volume concentrationMedian

RadiusGeometric standard deviation

Refractive index

Optical coefficient dependency

Aerosol state vector

Page 6: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

How can uncertainty in the high dimension aerosol model be understood to assess uncertainty of aerosol properties?

)ˆ(

)()|ˆ()ˆ(

marginal

priorlikelihoodposterior y

θθyy|θ

f

fff

εθyy )(ˆ

y

ε

Uncertainty analysis

Estimated optical coefficients (retrieval results from lidar signal processing)

Estimation error))(ˆ()ˆ(likelihood θ|θyyθ,y θ|ε ff

Uncertainty is expressed in the form of a posterior probability density function (PDF)

Page 7: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Stochastic Monte Carlo

dfKf nn )(),()(1

Sequence of Random Variables Θn with PDFs fn

)|(),,|( 11|21,2,1| nnnnnnnnnn ff

Kff nnnn 1||1

Markov Chain

Stationary distributions dfKf )(),()(

Monte Carlo simulation

Begin with Θ0 and generate subsequent Θ according to some K

Page 8: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Metropolis-Hastings Algorithm

Designed so that a target distribution π(θ) is stationary

1) Generate candidate Φ given current outcome Θ from a random number generator congruent with a proposal distribution whose PDF is q(φ|θ)

2) Accept Φ as the next outcome with probability,

)|()(

)|()(,1min),(

q

q

3) If Φ is not accepted then Θ is retained as the next outcome

Initial distributions f0(θ) will converge to π(θ) with only lose requirements of the proposal distribution. Its choice affects the rate of convergence but generally not the stationary distribution.

Types of Proposal Distributions•Random walk generators•Independent generators

Page 9: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Implementation

•Ensemble of parallel chains starting from an initial distribution

•Random walk generation for first several links

•Ensemble is analyzed partway along the chain, clusters are determined and used to form a hybrid proposal distribution

Hybrid proposal distribution

Randomly choose walk generation or generation from a distribution corresponding to one of the clusters, thereby allowing teleportation between regions. This aspect is essential for dealing with multiple local maxima in the posterior PDF.

Page 10: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Ensemble evolution animation

Example in a reduced single mode aerosol size distributionThe target distribution PDF is only a function of median radius and geometric standard deviation

Page 11: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Speeding convergence by initial resampling

Ensemble members are not getting represented by clusters•Results in tendency not to teleport

c

c

N

iiiww

N

iiiww

qpqp

qpqp

1

1

)()|()(

)()|()(

,1min

Hybrid B also includes a small probability of sampling from the initial distribution

Page 12: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Comparison with forward Monte Carlo analysis

Numerically integrated posterior cumulative distribution functions (CDFs) (Black) and CDFs estimated from the simple forward Monte Carlo method applied using the maximum a posteriori inverse (Blue) and the Ensemble Metropolis-Hastings algorithm (Red).

Page 13: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Extension to bi-modal particle size distributionsSpeeding convergence by fitting proposal PDF to the target PDF

Eigenvalue conditioningS must be positive definite to be a valid covariance matrix. This is not guaranteed if the target distribution is not Gaussian, and so it is remedied through eigenvector decomposition. Negative Eigenvalues are set to an upper limit and S is recomposed.

T

N

VVS

0

01

First derivative fitting

Second derivative fitting

Multivariate Gaussian PDF

cg T )()()(ln 121 θθSθθθ

)ln(1 gjiij S

)(ln θSθθ θ g

rMean vecto :

matrix Covariance :

θ

S

Page 14: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Implementations

Higher acceptance probability with uncorrelated random walk is due to scaling down the fitted variance

Acceptance rate is used for determining at which link to cluster the ensemble to form the hybrid proposal distribution

Full second order derivative fitting compared with uncorrelated partially fitted random walk generation

Page 15: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

After 10000 links there still remain some members around a local maximum

Link 1000

All ensemble members had been together since ~link 500

Hybridization only once

Periodic re-hybridization to speed convergence

Page 16: Stochastic Monte Carlo methods for non-linear statistical inverse problems Benjamin R. Herman Department of Electrical Engineering City College of New

Things to think about

• How to choose when to form the multi-component independent generator from clusters

• Can the independent generator be periodically reformulated indefinitely while still maintaining convergence to the target distribution

• Which clustering method works best

• Is there an optimal compromise between random walk with uncorrelated variance versus complete fitting of all mixed derivatives