global and local spatial autocorrelation - epfl · 2018-12-14 · 7 spatial autocorrelation: a...

41
1

Upload: others

Post on 10-Jun-2020

19 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

1

Page 2: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

2

Landscape genomicsPhysalia courses , November 26-30, 2018, Berlin

Global and local spatial autocorrelation

Dr Stéphane Joost – Oliver Selmoni (Msc)

Laboratory of Geographic Information Systems (LASIG)

Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland

Page 3: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

3

OUTLINE

• Introduction: spatial dependence

• Measuring spatial dependence : Moran’s I

• Weighting

• Significance

• Local spatial autocorrelation

• Examples

• Conclusion

Page 4: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

4

MEASURING SPATIAL DEPENDENCE

• We want to measure how similar are the differentvalues of a given variable for a set of spatiallydistributed individuals…

To quantify the spatial regularity of a givenphenomenon

To determine the range of spatial dependence

What is spatial dependence ?

Page 5: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

5

Spatial dependence

Photo: copa2014.gov.br / CC BY 3.0

- A yellow people is more likely to interact with another yellow people

- Similarly, a r&w people is more likely to interact with another r&w people

- The membership determined the spatial distribution of people

- Spatial dependence induced by this membership is perceptible in space through colors

Page 6: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

6

Tracing the Origin and Spread of Agriculture in Europe (Pinhasi et al. 2005, PLOS Biology)

Radiocarbon data, 765 Neolithic Sites

Page 7: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

7

SPATIAL AUTOCORRELATION: A PARADOX

• Spatial depdendence can be measured bymeans of indices of spatial autocorrelation

• A paradox• First law of geography: “Everything is related

to everything else, but near things are morerelated than distant things”, W.Tobler (1970)

• This means that natural phenomena (e.g. temperature) as wellas socio-demographic ones (e.g. population density) are not spatially distributed at random

• But to measure the spatial structure of these phenomena, wehave to use classical statistical tools requiring a random spatial distribution of samples and independence between them

Page 8: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

8

STANDARD STATISTICS

• Classical statistics are non-spatial

• Based on a neutral geographic space hypothesis

• Geographic space should be the simple neutral support for studied phenomena

• Theoretically, the location of a set of observations in space should not influence their attributes

Page 9: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

9

SPATIAL DEPENDANCE - AUTOCORRELATION

• Geographical space is not neutral and consequently manystatistical tools are not appropriate

• For instance, ordinary linear regressions (OLR) should beimplemented only if observations are selected at random

• When observations show spatial dependence, estimated values for the whole data set are distorted/biased

• Indeed, sub-regions including a concentration of individuals withhigh values will have an important impact on the model and lead to a global overestimation of the variable under study over the wholearea

• In other words, a strong correlation between two variables at a single location of the territory will influence the measure of the relationship over the whole territory

Page 10: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

10

Genetic diversity

NW

Turkey

Older sites

Gradient

Older sites show higher genetic diversity

Page 11: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

11

We have to use standard statistical tools with caution with spatial data

(remember this paradox)

Page 12: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

12

Developed with standard statistics

• Geary’s C

• Ripley’s K

• Join Count Analysis

• Moran’s I

INDICES TO MEASURE SPATIAL

AUTOCORRELATION

Page 13: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

13

MEASURING SPATIAL AUTOCORRELATION

• Spatial autocorrelation indices express the degree of spatial structuring of a given variable

• Spatial autocorrelation indices permit to quantify the spatial regularity of a geographically distributedphenomenon

• Spatial autocorrelation is positive when values measuredat neighbouring points resemble each other

• It is negative in case of dissimilarity

Page 14: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

14

Neighborhood relationship andspatial weighting

• We want to know how similar is the value v of object o in comparison with the value v of other objects located in itsneighbourhood ? (to measure spatial dependence)

• We need to define this neighbourhood: it may be a distance (100m around each object) or a given number of neighbours(4 nearest neighbours for instance)

• This criterion (100m or 4 nearest neighbours) defines the spatial weight

Longitude

Latitude

Page 15: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

● Spatial autocorrelation is characterized by a correlation between measures

of a given phenomenon located close to each other

Neighborhood relationships

5km

Etc.

To quantify the spatial dependence and produce a measure of global spatial autocorrelation, it is

necessary to take into account the neigborhood of each of the considered geographic objects

Page 16: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

Several ways to define the spatial neighborhood

7

5kmk = 7 nearest neignbors

Adaptive kernel

d = 5000m

Fixed kernel

Bandwidth

Page 17: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

How to define the neighborhood of polygons

E

2

Contig order 1

Cont order 2

1

X

Page 18: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

Introduction aux systèmes d’information géographique

Spatial weighting

Spatial weighting scheme

Fixed kernel

Adaptive kernel

Bandwidth d

k nearest neighbors

Contig order n

Spatial weighting file

0 54 comvd_prec ide1 2 1895.706441 13 2031.241321 4 2062.650711 35 2365.333631 3 2474.904191 29 2533.849931 19 3041.12639…

n = 1

135

29

132

3

4

k = 7

Page 19: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

19

• Moran's autocorrelation coefficient I is an extension of Pearson product-moment correlation coefficient

Where

• N is the number of observation units

• Wij is a spatial weight applied to define the comparison between locations i and j

• Xi is the value of the variable at a location i

• Xj is the value of the variable at a location j

• X is the mean of the variable

−−=

i j i iji

i j jiji

XXW

XXXXWNI

2

,

,

)()(

))((

MORAN’S I

Page 20: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

20

Moran’s I as a regression coefficient

• Anselin (1996): Moran’s I can be interpreted as a regression coefficient

• → regression of WZ on Z

• The weighted value of Z (mean of Z according to the weighting criteria) on Z (the observed value at the point of interest)

• This interpretation provides a way to visualize the linear association between Z and WZ in the form of a bivariate scatterplot

• Anselin (1996) referred to this plot as the Moran scatterplot

• He pointed out that the least squares slope in a regression through the origin is equal to Moran’s I

Page 21: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

21

2

7

4

5

3

9 5

4

8

2

Weighting criteria: 2 nearest neighboursThe value of the variable is displayed in the centroids

C

B

A

F

D

E

G

H

J

I

WeightingValue

(X)

Mean of nearest neighbours values

(Y)

A B+D/2 7 3

B A+E/2 4 5

C D+A/2 5 4.5

D E+G/2 2 6

E F+D/2 3 2

F B+E/2 2 3.5

G H+I/2 9 4.5

H G+I/2 4 7

I G+H/2 5 6.5

J F+I/2 8 3.5

PROCESSING MORAN’S IRegression of Y on X

Page 22: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

22

Weig

hte

d v

alu

e (

mean o

f N

eare

st

Neig

hbours

)

Value

C

B

A

F

D

E

G

H

J

I

MORAN’S SCATTER PLOT

a = -0.103x + 5.0548

Weighting Value Mean of NN

A B+D/2 7 3

B A+E/2 4 5

C D+A/2 5 4.5

D E+G/2 2 6

E F+D/2 3 2

F B+E/2 2 3.5

G H+I/2 9 4.5

H G+I/2 4 7

I G+H/2 5 6.5

J F+I/2 8 3.5

2

Page 23: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

23

The slope of the

regression (0.734)is Moran’s I for the

Variable DATC14BP

I = SLOPE OF THE REGRESSION

Weighted variable: mean of neighbouringspatial units accordingto the selectedweighting criteria (50k)

Investigatedvariable

Page 24: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

24

• Moran’s I statistics ranges from -1 to 1

• A value close to 1 shows a strong positive spatial autocorrelation

• A value close to -1 shows a strong negative spatial autocorrelation (opposition betweenindividuals)

• 0 = no spatial autocorrelation = independencebetween individuals = neutral geographic space

MORAN’S I RANGE OF VALUES

Page 25: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

25

• We need a statistical test to determine whether data attached to individuals are distributed at random over the territory

• We use random permutations between individuals to perform this test

• How does the observed situation behave in comparison with all other possible configurations ?

MORAN’S I SIGNIFICANCE

Page 26: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

26

•For each run, the attributes of all individuals in the data set are randomly moved between allpossible locations

•Many runs are performed by means of Monte-Carlo method (e.g. 500, 1000, or more permutations)

•More runs = more significance

RANDOM PERMUTATIONS

Page 27: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

27

2

7

4

5

3

9 5

4

8

2

C

B

A

F

D

E

G

H

J

I

• Moran’s I is calculatedfor each run

• The total number of possible permutations is n! (here 10! = 3’628’800)

RANDOM PERMUTATIONS

Etc.

Page 28: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

Moran’s I as a coefficient of regression

Page 29: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

Visualization of the spatial structure

I = 0.79

Page 30: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

1 2

34

56

7 8

910

11 1213

14 1520

16

21

22

17

18 19

23

2425 26

27

2829

30

31 32

33

34 35

36 37 38

39

40

41

42 4344

45

4647

4849

50

51

52

54

53

1

2

34

5

67 8

9

10

1112

1314 15

20

16

21

22

1718 19

23

2425

26

27

28

29

30

31

32

33

34 35

36

37

38

39

40

41

42 43 44

45

4647

4849

50

51

52

54

53

1

2

3

4

5

67 8

9

10

111213

1415

20

16

21

22

1718

19

23

24

25

26

27

28

29

30

31

32

33

34 35

36

37

38

39

40

41

42

43 4445

46

47

48

49

50

5152

54

53

1

2

3

4

5

678

9

10

111213

1415

20

16

21

22

1718

19

23

24

25

26

27

2829

30

31

32

33

3435

36

37

38

39

40

41

42

43 4445

46

47

48

49

50

5152

54

53

Draw 1: I= I1

Observed situation : I= I0

Draw 2: I= I2

Draw 3: I= I3

Draw 4: I= I4…

54! possible configurations

Moran’s I significance and random permutations

Page 31: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

Histogram of random situations and p-values

+1-1

Neutral geographic space Mean of Moran’s I

Obtained from

random permutations

Observed Moran’s I

Page 32: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

How to calculate the significance and the p-value

A Moran’s I of 0.79 translates a spatial

structure significantly different from a

random spatial distribution

Null hypothesis

p-value = 𝑁𝑏 𝐼𝑎𝑙 ≥ 𝐼𝑜𝑏𝑠+1

𝑁𝑏 𝑝𝑒𝑟𝑚𝑢𝑡𝑎𝑡𝑖𝑜𝑛𝑠+1

𝑜𝑢𝑁𝑏 𝐼𝑎𝑙 ≤ 𝐼𝑜𝑏𝑠 + 1

𝑁𝑏 𝑝𝑒𝑟𝑚𝑢𝑡𝑎𝑡𝑖𝑜𝑛𝑠 + 1

p-value = 0+1

999+1= 0.001

Page 33: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

33

• It is also possible to calculate local Moran’s indices

• Moran’s I is decomposed into several local coefficients, whose sum over the whole studied area is proportionalto the global Moran’s I

• The significance is assessed locally (same procedure like Global I except that the value of the point of interest remains fixed, and neighbours are among N-1 values)

LISA : LOCAL INDICATORS OF

SPATIAL ASSOCIATION

Page 34: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

34

Calculation of Local I45 44 44

43 42 39

38 32 34

𝐼𝑖 =𝑍𝑖𝑆2

𝑗=1

𝑛

𝑤𝑖𝑗𝑧𝑗 , 𝑗 ≠ 𝑖

𝒚𝒊 𝒛𝒊 𝒘𝒊𝒋 𝒘𝒊𝒋𝒛𝒋

45 4.889 0 0

43 2.889 0.25 0.722

38 -2.111 0 0

44 3.889 0.25 0.972

42 1.889 0 0

32 -8.111 0.25 -2.028

44 3.889 0 0

39 -1.111 0.25 -0.278

34 -6.111 0 0

1 -0.611

Rook weighting scheme

Mean = 40.1S2 = Variance = 21.861

• Here weights are standardized (they sum to 1). We have 4 values (because of the Rook criterion).• For this location, 𝑧𝑖 = 42 - 40.111 = 1.889 • The sum of the weights multiplied by the deviations from the mean = -0.611• 𝐼𝑖 = 1.889/21.861 x -0.611 = -0.053• Local I values sum up to global Moran’s I

y = value

Z = dev. from the mean (40.1)

w = weight

Page 35: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

35

LISA

Thus it is possible:

– to map indices and corresponding p-values to show how local spatial autocorrelation varies over the territory

– to highlight local regimes of spatial autocorrelation

Page 36: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

36

Local Index of Spatial Association (DATC14BP)

Min = - 0.36

Page 37: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

37

Significance map

9999 permutations

Page 38: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

38

LISA CLUSTERS

HIGH-HIGH

HIGH-LOWLOW-LOW

LOW-HIGH

Page 39: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

39

LOCAL SPATIAL AUTOCORRELATION ON DATC14BP

Page 40: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

40

IN SUMMARY• To assess spatial dependence:

– Calculate global or local indicators of spatial autocorrelation– Indicators based on the resemblance between points of

interest and their neighborhood– Significance: establish a comparison with a neutral

geographic space (null hypothesis = the spatial distribution of attributes is random)

• We calculate spatial dependence with theories relying on contradictory arguments (space is not neutral vs spacemust be neutral)

• → Geographically Weighted Regression (GWR)• Includes spatial weighting in the processing of the

regression

Page 41: Global and local spatial autocorrelation - EPFL · 2018-12-14 · 7 SPATIAL AUTOCORRELATION: A PARADOX • Spatial depdendence can be measured by means of indices of spatial autocorrelation

41

REFERENCES

• Anselin, L. (1995). Local indicators of spatial association — LISA. Geographical Analysis, 27:93–115.

• Anselin, L. (1996). The Moran scatterplot as an ESDA tool to assess local instability in spatial association. In Fischer, M., Scholten, H., and Unwin, D., editors, Spatial Analytical Perspectives on GIS in Environmental and Socio-Economic Sciences, pages 111–125. Taylor and Francis, London.

• Legendre, P. (1993) Spatial Autocorrelation: Trouble or New Paradigm? Ecology, Vol. 74, No. 6, pages 1659-1673