two new approaches to smoothing over complex regions · outline smoothing over complex regions...

24
Two new approaches to smoothing over complex regions David Lawrence Miller Mathematical Sciences University of Bath useR! 2009, Rennes

Upload: trinhthuan

Post on 27-Aug-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

Two new approaches to smoothing overcomplex regions

David Lawrence Miller

Mathematical SciencesUniversity of Bath

useR! 2009, Rennes

Outline

Smoothing over complex regionsIntroSolutions

Schwarz-Christoffel transform

Multidimensional ScalingDetailsSimulation Results

Conclusions

Outline

Smoothing over complex regionsIntroSolutions

Schwarz-Christoffel transform

Multidimensional ScalingDetailsSimulation Results

Conclusions

Smoothing in 2 dimensions

I Have some geographical region and wish to find outsomething about the biological population in it.

I Response is eg. animal distribution, wish to predict basedon (x , y) and other covariates eg. habitat, size, sex, etc.

I This problem is relatively easy if the domain is simple.

Smoothing over complex domains

I Smoothing of complex domains makes this a lot moredifficult.

I Problem of leakage.I Euclidean distance doesn’t always make sense.I Models need to incorporate information about the intrinsic

structure of the domain.

−4

−3.5

−3

−2.

75

−2.

5

−2

−1.

75

−1.

5

−1

−0.

75

−0.

5

0

0.5

0.75

1

1.25

1.5

1.75

2

2.25

2.5

2.75

3

3.25

3.5

3.75

4

−4 −3.75

−3.25 −3

−2.75

−2.5

−2.25

−2

−1.75

−1.25

−1

−0.75

−0.5 −0.25 0

0.25

0.5 0.75 0.75 1 1.25

1.5

1.75 2 2.25

2.5

2.7

5

3

3.25

3.75

4

(modified) Ramsay test function Thin plate spline fit

Smoothing with penalties

I Objective function takes the form:

n∑i=1

(zi − f (xi , yi ; θ))2 + λ

∫Ω

Pf (x , y ; θ)dΩ

I f is the function you want to estimate, made up of somecombination of basis functions.

I P is some squared derivative penalty operator, usuallyP = ( ∂2

∂x2 + ∂2

∂y2 )2.

I This can be generalized to an additive model or GAM.

Possible solutions to leakage problems

I FELSPLINE (Ramsay, (2002).)I Domain morphing (Eilers, (2006).)I Within-area distance (Wang and Ranalli, (2007).)I Soap film smoothers (Wood et al, (2008).)

Why morph the domain?

I Takes into account within-area distance.I Gives a known domain that is easier to smooth over.I Potentially less computationally intensive.

However:

I Don’t maintain isotropy - distribution of points odd.I Not clear what this does to the smoothness penalty.

−2 0 2

−2

−1

0

1

2

3

4

2

34

5

6

7

8

9

10

11

12

13

14

1516

17

18

1

−4 −2 0 2 4

0

1

2

3

4

5

6

72

34 5

67

89

101112

13

14151617

18 1

Outline

Smoothing over complex regionsIntroSolutions

Schwarz-Christoffel transform

Multidimensional ScalingDetailsSimulation Results

Conclusions

The Schwarz-Christoffel transform

I Take a polygon in some domain W and morph it to a newdomain W ∗.

I We then have a function for the mapping, ϕ(x , y).I ϕ(x , y) is a conformal mapping.I Do this by starting at the new domain and working back to

the polygon.I Can draw a polygonal bounding box around some arbitrary

shape.

φ(x)

φ (x)-1

WW*

The mappingI Use a bounding box around the horseshoe.

1

4

8

5

2

3

6

7

I Morphing the horseshoe shape still gives a slightly odddomain however, we are still doing better than before.

Horseshoe plots

Truth

−4 −3.

75

−3.5

−3.25

−3

−2.

75

−2.5

−2.25

−2

−1.

75

−1.5

−1.25

−1

−0.

75

−0.5

−0.25 0

0.25

0.5

0.75

1

1.25

1.5

1.75

2

2.25

2.5

2.75

3

3.25

3.5

3.75

4

SC+PS

−4

−3.75

−3.5 −3.

25 −3

−2.

75 −

2.5

−2

−1.

75

−1.5

−1.25

−1

−0.

75

−0.5

−0.25 0

0.25

0.5 0.75

1

1.2

5 1

.5

1.7

5

2

2.25

2.5

2.75 3

3.25 3.5

3.7

5 4

SC+TPRS

−4

−3.75

−3.5

−3.

25 −

3

−2.

75

−2.

5

−2

−1.75

−1.

5 −1.25

−1

−0.

75

−0.

5

0

0.25

0.5

0.75

1 1.2

5

1.5

1.7

5 2

2.25

2.5 2.75 3 3.25

3.5

3.7

5 4

Soap film

−4

−3.75 −3.5

−3.25

−3 −

2.75

−2.5

−2

−1.75

−1.5

−1.25

−1

−0.

75

−0.5

0

0.25

0.5

0.75

1

1.2

5 1

.5

1.7

5 2

2.25

2.5

2.75 3 3.25

3.5

3.75 4

Problems

I Small:I Implementation is Matlab+R. (YUCK!)

I BIG:I Weird artifacts.I Morphing of domain appears to cause features to be

smoothed over.I Arbitrary selection of vertices.

A more realistic domain

0.0 0.4 0.8

0.0

0.4

0.8

truth

x

y

0.0 0.4 0.8

0.0

0.4

0.8

sc+tprs prediction

x

y

0.0 0.4 0.8

0.0

0.4

0.8

tprs prediction

x

y

0.0 0.4 0.8

0.0

0.4

0.8

soap prediction

x

y

A more realistic domain - what’s happening?

I Weird “crowding” effect.I Different with each vertex choice. All bad.

Outline

Smoothing over complex regionsIntroSolutions

Schwarz-Christoffel transform

Multidimensional ScalingDetailsSimulation Results

Conclusions

Multidimensional scaling and within-area distances

I Idea: use MDS to to arrange points in the domainaccording to their “within-domain distance.”

Scheme:

I First need to find the within-area distances.I Perform MDS on the matrix of within-area distances.I Smooth over the new points.

Multidimensional scaling refresher

I Double centre matrix of between point distances, D,(subtract row and column means) then find DDT .

I Finds a configuration of points such that Euclideandistance between points in new arrangement isapproximately the same as distance in the domain.

I Already implemented in R by cmdscale.

−3 −2 −1 0 1 2 3

−3

−2

−1

01

23

x

y

−4 −2 0 2 4 6

−1

01

23

newcoords[,1]

new

coor

ds[,2

]

Finding within-area distancesI Use a new algorithm to find the within area distances.

1 2 3 4 5 6

01

23

4

x

y

1 2 3 4 5 6

01

23

4

x

y

1 2 3 4 5 6

01

23

4

x

y

1 2 3 4 5 6

01

23

4

x

y

Ramsay simulations

−1 0 1 2 3

−1.0

−0.5

0.0

0.5

1.0

truth

x

y

−1 0 1 2 3

−1.0

−0.5

0.0

0.5

1.0

MDS

x

y

−1 0 1 2 3

−1.0

−0.5

0.0

0.5

1.0

tprs

x

y

−1 0 1 2 3

−1.0

−0.5

0.0

0.5

1.0

soap

x

y

A different domain

−3 −2 −1 0 1 2 3

−2

−1

0

1

2

truth

x

y

−3 −2 −1 0 1 2 3

−2

−1

0

1

2

mds

x

y

−3 −2 −1 0 1 2 3

−2

−1

0

1

2

tprs

x

y

−3 −2 −1 0 1 2 3

−2

−1

0

1

2

soap

x

y

Outline

Smoothing over complex regionsIntroSolutions

Schwarz-Christoffel transform

Multidimensional ScalingDetailsSimulation Results

Conclusions

Conclusions

I Seems that the S-C transform does not have much utility.I MDS shows more promise, easier to transfer to higher

dimensions.I MDS does not impose strict boundary conditions so

leakage still possible.I Pushing the data into more dimensions might be useful to

separate points.I After initial “transform” calculation, both methods only use

the same computational time as a thin plate regressionspline. (Soap is expensive.)

References

I S.N. Wood, M.V. Bravington, and S.L. Hedley. Soap filmsmoothing. JRSSB, 2008

I H. Wang and M.G. Ranalli. Low-rank smoothing splines oncomplicated domains. Biometrics, 2007

I T.A. Driscoll and L.N. Trefethen. Schwarz-ChristoffelMapping. Cambridge, 2002

I T. Ramsay. Spline smoothing over difficult regions. JRSSB,2001

I P.H.C. Eilers. P-spline smoothing on difficult domains.University of Munich seminar, 2006

I J.C. Gower. Adding a point to vector diagrams inmultivariate analysis. Biometrika, 1968.

Slides available at http://people.bath.ac.uk/dlm27