
Lecture 2

Probability and Measurement Error, Part 1

Syllabus

Lecture 01  Describing Inverse Problems
Lecture 02  Probability and Measurement Error, Part 1
Lecture 03  Probability and Measurement Error, Part 2
Lecture 04  The L2 Norm and Simple Least Squares
Lecture 05  A Priori Information and Weighted Least Squares
Lecture 06  Resolution and Generalized Inverses
Lecture 07  Backus-Gilbert Inverse and the Trade-off of Resolution and Variance
Lecture 08  The Principle of Maximum Likelihood
Lecture 09  Inexact Theories
Lecture 10  Nonuniqueness and Localized Averages
Lecture 11  Vector Spaces and Singular Value Decomposition
Lecture 12  Equality and Inequality Constraints
Lecture 13  L1, L∞ Norm Problems and Linear Programming
Lecture 14  Nonlinear Problems: Grid and Monte Carlo Searches
Lecture 15  Nonlinear Problems: Newton's Method
Lecture 16  Nonlinear Problems: Simulated Annealing and Bootstrap Confidence Intervals
Lecture 17  Factor Analysis
Lecture 18  Varimax Factors, Empirical Orthogonal Functions
Lecture 19  Backus-Gilbert Theory for Continuous Problems; Radon's Problem
Lecture 20  Linear Operators and Their Adjoints
Lecture 21  Fréchet Derivatives
Lecture 22  Exemplary Inverse Problems, incl. Filter Design
Lecture 23  Exemplary Inverse Problems, incl. Earthquake Location
Lecture 24  Exemplary Inverse Problems, incl. Vibrational Problems

Purpose of the Lecture

review random variables and their probability density functions

introduce correlation and the multivariate Gaussian distribution

relate error propagation to functions of random variables

Part 1

random variables and their probability density functions

random variable, d
no fixed value until it is realized

d = ?   (indeterminate)
d = 1.04   (one realization)
d = 0.98   (another realization)

random variables have systematics

tendency to take on some values more often than others

[Figure: (A) N realizations of data di plotted against index i; (B) histogram of counts versus d; (C) probability density function p(d) versus d]

[Figure: p(d) versus d, with a shaded strip of width Δd at d]

the probability that the variable lies between d and d+Δd is p(d)Δd

in general, probability is an integral

probability that d is between d1 and d2:

P(d1 ≤ d ≤ d2) = ∫ from d1 to d2 of p(d) dd

the probability that d has some value is 100%, or unity, so the probability that d is between its minimum and maximum bounds, dmin and dmax, is one:

∫ from dmin to dmax of p(d) dd = 1
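A minimal MatLab sketch of this integral, assuming a p.d.f. tabulated in a column vector p at points d with uniform grid spacing Dd (the example p.d.f. and variable names are illustrative, not from the slides):

% tabulate an example p.d.f. on 0 <= d <= 10 (a Gaussian with mean 5, sigma 1)
Dd = 0.01;
d = Dd*[0:1000]';
p = exp(-0.5*((d-5).^2))/sqrt(2*pi);

% probability that d is between d1 and d2: Riemann sum approximating the integral
d1 = 4;
d2 = 6;
in = (d>=d1) & (d<=d2);
P12 = Dd*sum(p(in));

% the probability that d has some value is unity
Ptotal = Dd*sum(p);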

How do these two p.d.f.’s differ?

[Figure: two p.d.f.'s, p(d) versus d, each plotted on 0 to 5]

Summarizing a probability density function

typical value

“center of the p.d.f.”

amount of scatter around the typical value

“width of the p.d.f.”

Several possibilities for a typical value

point beneath the peak, or "maximum likelihood point" or "mode", dML

[Figure: p(d) versus d with dML marked beneath the peak]

point dividing the area in half, or "median", dmedian

[Figure: p(d) versus d with dmedian marked; 50% of the area lies on either side]

balancing point, or "mean" or "expectation", <d>

[Figure: p(d) versus d with the balancing point <d> marked]

can all be different

dML ≠ dmedian ≠ <d>

formula for "mean" or "expected value"

step 1: usual formula for the mean of N data:

<d> ≈ (1/N) Σi di

step 2: replace the data with its histogram, with Ns counts in the bin centered at ds:

<d> ≈ Σs ds (Ns/N)

step 3: replace the histogram with the probability distribution, P(ds) ≈ Ns/N:

<d> ≈ Σs ds P(ds)

If the data are continuous, use the analogous formula containing an integral:

<d> = ∫ d p(d) dd
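The three steps can be verified numerically; a minimal MatLab sketch (the synthetic data and bin setup are illustrative):

% step 1: usual formula for the mean, applied to N realizations of data
N = 10000;
dr = 5 + randn(N,1);          % synthetic data
dbar1 = sum(dr)/N;

% step 2: replace the data with its histogram; Ns counts in bins centered at ds
Dd = 0.1;
ds = [0:Dd:10]';
Ns = hist(dr,ds)';
dbar2 = sum(ds.*Ns)/N;

% step 3: replace the histogram with the probability distribution P(ds) = Ns/N
P = Ns/N;
dbar3 = sum(ds.*P);           % dbar1, dbar2, dbar3 agree to within binning error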

quantifying width

This function grows away from the typical value:

q(d) = (d − <d>)²

so the function q(d)p(d) is:
small if most of the area is near <d>, that is, a narrow p(d)
large if most of the area is far from <d>, that is, a wide p(d)

so quantify width as the area under q(d)p(d)

[Figure: q(d), p(d), and the product q(d)p(d) for two p.d.f.'s of different width (panels A through F)]

variance

σ² = ∫ (d − <d>)² p(d) dd

width is actually the square root of the variance, that is, σ

estimating mean and variance from data

usual formula for the "sample mean":

<d> ≈ (1/N) Σi di

usual formula for the square of the "sample standard deviation":

σ² ≈ (1/N) Σi (di − <d>)²

MatLab scripts for mean and variance

from a tabulated p.d.f. p (column vectors d and p on a grid with spacing Dd):

dbar = Dd*sum(d.*p);       % mean: integral of d p(d)
q = (d-dbar).^2;           % q(d) = (d - <d>)^2
sigma2 = Dd*sum(q.*p);     % variance: area under q(d)p(d)
sigma = sqrt(sigma2);

from realizations of data dr:

dbar = mean(dr);
sigma = std(dr);
sigma2 = sigma^2;
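The tabulated-p.d.f. script above assumes column vectors d and p on a uniform grid with spacing Dd and unit total probability; a minimal setup sketch (the particular shape is illustrative):

% set up a tabulated p.d.f. for use with the script above
L = 101;
Dd = 0.1;
d = Dd*[0:L-1]';              % tabulation points, 0 to 10
p = exp(-0.5*((d-5)/1.5).^2); % any non-negative shape
p = p/(Dd*sum(p));            % normalize so that Dd*sum(p) = 1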

two important probability density functions:

uniform

Gaussian (or Normal)

uniform p.d.f.

p(d) = 1/(dmax − dmin) on dmin ≤ d ≤ dmax

probability is the same everywhere in the range of possible values

box-shaped function

[Figure: box-shaped p(d), constant at 1/(dmax − dmin) between dmin and dmax]

Gaussian (or "Normal") p.d.f.

bell-shaped function, with large probability near the mean, <d>; the variance is σ²

p(d) = (1/(√(2π)σ)) exp( −(d − <d>)² / (2σ²) )

[Figure: bell-shaped Gaussian p.d.f. with its width 2σ marked about the mean]

probability between <d> ± nσ:
n = 1: 68.3%
n = 2: 95.4%
n = 3: 99.7%
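A sketch that evaluates the Gaussian formula and checks the ±2σ probability numerically (the particular mean and σ are illustrative):

% Gaussian p.d.f. with mean dbar and standard deviation sigma
dbar = 50;
sigma = 10;
Dd = 0.01;
d = [0:Dd:100]';
p = exp(-((d-dbar).^2)/(2*sigma^2))/(sqrt(2*pi)*sigma);

% probability between <d>-2*sigma and <d>+2*sigma; about 0.954
in = abs(d-dbar) <= 2*sigma;
P2 = Dd*sum(p(in));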

Part 2

correlated errors

uncorrelated random variables

no pattern between the values of one variable and the values of another

when d1 is higher than its mean, d2 is higher or lower than its mean with equal probability

joint probability density function, uncorrelated case

[Figure: joint p.d.f. p(d1, d2) centered on (<d1>, <d2>)]

[Figure: uncorrelated joint p.d.f. p(d1, d2)]

no tendency for d2 to be either high or low when d1 is high

in the uncorrelated case, the joint p.d.f. is just the product of the individual p.d.f.'s:

p(d1, d2) = p(d1) p(d2)
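For tabulated p.d.f.'s, this product rule is an outer product; a minimal MatLab sketch (grids and shapes illustrative):

% tabulate p(d1) and p(d2) on a common grid
Dd = 0.1;
d = [0:Dd:10]';
p1 = exp(-((d-4).^2)/(2*1.0^2))/(sqrt(2*pi)*1.0);
p2 = exp(-((d-6).^2)/(2*0.5^2))/(sqrt(2*pi)*0.5);

% uncorrelated joint p.d.f.: element P(i,j) = p1(d(i)) * p2(d(j))
P = p1*p2';

% check: total probability is unity
Ptotal = Dd*Dd*sum(P(:));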

[Figure: uncorrelated joint p.d.f. p(d1, d2), with widths 2σ1 and 2σ2 marked about (<d1>, <d2>)]

[Figure: correlated joint p.d.f. p(d1, d2)]

tendency for d2 to be high when d1 is high

[Figure: correlated joint p.d.f. p(d1, d2), characterized by the widths 2σ1 and 2σ2 and an orientation angle θ; panels (A), (B), (C) show cases with different correlation]

formula for covariance:

cov(d1, d2) = ∫∫ (d1 − <d1>)(d2 − <d2>) p(d1, d2) dd1 dd2

+ positive correlation: high d1 tends to occur with high d2
− negative correlation: high d1 tends to occur with low d2

for a joint p.d.f., the mean is a vector and the covariance is a symmetric matrix

diagonal elements: variances; off-diagonal elements: covariances

estimating covariance from a table D of data

Dki: realization k of data-type i; in MatLab, C = cov(D)
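A minimal sketch of this estimate with a synthetic two-column table D (the way the correlation is built in is illustrative):

% table D of data: row k is realization k, column i is data-type i
K = 10000;
a = randn(K,1);                             % component shared by both data types
D = [a+0.5*randn(K,1), a+0.5*randn(K,1)];   % positively correlated columns

% estimated covariance matrix: variances on the diagonal,
% covariances off the diagonal
C = cov(D);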

univariate p.d.f. formed from a joint p.d.f.

p(d) → p(di): the behavior of di irrespective of the other d's

integrate over everything but di; for two variables:

p(d1) = ∫ p(d1, d2) dd2   and   p(d2) = ∫ p(d1, d2) dd1

[Figure: joint p.d.f. p(d1, d2); integrating over d2 gives p(d1), integrating over d1 gives p(d2)]
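For a joint p.d.f. tabulated in a matrix P (rows indexed by d1, columns by d2, grid spacing Dd), the integrations become row and column sums; a minimal sketch:

% tabulated joint p.d.f. on a common grid for d1 and d2
Dd = 0.1;
d = [0:Dd:10]';
P = exp(-((d-4).^2)/2) * exp(-((d-6).^2)/2)';  % any non-negative surface
P = P/(Dd*Dd*sum(P(:)));                        % normalize

% integrate over everything but di
p1 = Dd*sum(P,2);       % p(d1): integrate over d2 (sum along each row)
p2 = Dd*sum(P,1)';      % p(d2): integrate over d1 (sum down each column)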

multivariate Gaussian (or Normal) p.d.f., with mean vector <d> and covariance matrix C:

p(d) = ((2π)^N det C)^(−½) exp( −½ (d − <d>)ᵀ C⁻¹ (d − <d>) )
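A sketch that evaluates this formula for N = 2 at a single point (mean, covariance, and evaluation point are illustrative):

% multivariate Gaussian p.d.f. evaluated at a point dvec
dbar = [4; 6];                % mean vector <d>
C = [1.0, 0.4; 0.4, 0.5];     % symmetric covariance matrix
N = 2;

dvec = [4.5; 5.5];
r = dvec - dbar;              % deviation from the mean
p = exp(-0.5*(r'*(C\r))) / sqrt(((2*pi)^N)*det(C));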

Part 3

functions of random variables

functions of random variables

data with measurement error → data analysis process → inferences with uncertainty

functions of random variables

given p(d), with m = f(d), what is p(m)?

univariate p.d.f.: use the chain rule

rule for transforming a univariate p.d.f.:

p(m) = p(d(m)) |dd/dm|

multivariate p.d.f.: the Jacobian determinant, J, is the absolute value of the determinant of the matrix with elements ∂di/∂mj

rule for transforming a multivariate p.d.f.:

p(m) = p(d(m)) J,   with J = |det[∂di/∂mj]|

simple example

data with measurement error → data analysis process → inferences with uncertainty

one datum, d, with a uniform p.d.f. on 0 < d < 1

one model parameter, m = 2d

m = 2d and d = ½m
d = 0 → m = 0 and d = 1 → m = 2
dd/dm = ½
p(d) = 1, so p(m) = ½

[Figure: p(d) = 1 on 0 < d < 1; p(m) = ½ on 0 < m < 2]
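The transformation rule can be checked by direct simulation; a minimal sketch (bin setup illustrative):

% realize uniform d on (0,1) and transform to m = 2d
Nr = 100000;
d = rand(Nr,1);
m = 2*d;

% empirical p(m): normalized histogram; flat at about 1/2 on (0,2)
Dm = 0.05;
mc = [Dm/2 : Dm : 2-Dm/2]';   % bin centers
pm = hist(m,mc)'/(Nr*Dm);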

another simple example

data with measurement error → data analysis process → inferences with uncertainty

one datum, d, with a uniform p.d.f. on 0 < d < 1

one model parameter, m = d²

m = d² and d = m^½
d = 0 → m = 0 and d = 1 → m = 1
dd/dm = ½m^(−½)
p(d) = 1, so p(m) = ½m^(−½)

[Figure: (A) p(d) versus d on 0 to 1; (B) p(m) versus m on 0 to 1]

multivariate example

data with measurement error → data analysis process → inferences with uncertainty

2 data, d1 and d2, each with a uniform p.d.f. on 0 < d < 1

2 model parameters, m1 = d1 + d2 and m2 = d1 − d2

[Figure (A): the square 0 < d1, d2 < 1 in the (d1, d2) plane, with the axes m1 = d1 + d2 and m2 = d1 − d2 overlain]

p(d) = 1

m = Md with M = [1 1; 1 −1]

d = M⁻¹m, so ∂d/∂m = M⁻¹ and J = |det(M⁻¹)| = ½ (since |det(M)| = 2)

p(m) = p(d(m)) J = ½
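A sketch of the same bookkeeping in MatLab, together with a simulation of the transformed variables:

% linear transformation m = M*d for uniform d1, d2 on (0,1)
M = [1, 1; 1, -1];
J = abs(det(inv(M)));         % Jacobian determinant, equals 1/2

% simulate: each column of dmat is one realization [d1; d2]
Nr = 100000;
dmat = rand(2,Nr);
mmat = M*dmat;                % row 1: m1 = d1+d2; row 2: m2 = d1-d2
% the points (m1,m2) fill a square of area 2 with uniform density 1/2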

[Figure: (A) the square 0 < d1, d2 < 1 in the (d1, d2) plane; (B) its image in the (m1, m2) plane, a square rotated 45° with corners at (0,0), (1,1), (2,0), and (1,−1); (C) the univariate p.d.f. p(d1); (D) the univariate p.d.f. p(m1)]

Note that the shape in m is different from the shape in d.


The shape is a square with sides of length √2. The amplitude is ½. So the area is ½ × √2 × √2 = 1.


moral

p(m) can behave quite differently from p(d)
