spatial autocorrelation (3) global spatial autocorrelation

44
© 2003 Luc Anselin, All Rights Reserved Spatial Autocorrelation (3) Global Spatial Autocorrelation Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign http://sal.agecon.uiuc.edu

Upload: others

Post on 30-Apr-2022

38 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Spatial Autocorrelation (3)Global Spatial Autocorrelation

Luc AnselinSpatial Analysis Laboratory

Dept. Agricultural and Consumer EconomicsUniversity of Illinois, Urbana-Champaign

http://sal.agecon.uiuc.edu

Page 2: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Outline

�Principles�Join Count Statistic�Moran’s I�Moran Scatterplot�Bivariate/Space-Time Extension�Geary’s c

Page 3: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Principles

Page 4: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

General Cross-Product Statistic

�Combinatorial Data Analysis� N objects S = { o1, o2 ,..., oN }� measures of “similarity” between

objects• correlation matrix, distance matrix

� are indications significantly different ?� association between two proximity

matrices A = { aij } and B = { bij }� gamma statistic ΓAB = Σi Σj aijbij

Page 5: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Gamma Index of SpatialAutocorrelation

�Match Between Locational Similarityand Value Similarity� locational similarity: W� value similarity: A� Γ = Σi Σj wij.aij

�Types of Value Association� cross-product: xi.xj

� squared difference: (xi - xj)2

� absolute difference: | xi - xj |

Page 6: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Randomization Strategy

�Empirical Distribution Function� permute arrangement of objects

• associate values with locations• associate locations with values

� recomputed A and ΓAW

� compare observed Γ to distribution of ΓAW

�Pseudo-Significance� p = (T + 1) / (M + 1)

• M: # permutations; T: # times ΓAW ≥ Γ

Page 7: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Reference Distribution

Page 8: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Pros and Cons of Randomization

�Advantages� non-parametric

• no distributional assumptions

� easy to compute� easy to interpret

�Disadvantages� sample specific

• no generalization to population

� precision of pseudo significance arbitrary• 1/(99+1) yields 0.01,1/(999+1) yields 0.001

� sensitive to random number generator

Page 9: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Join Count Statistic

Page 10: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Join Count Statistics

�Binary Data�Match Value-Location

� count joins in value that match joins inspace = neighbors

� xi = 1 or 0, B(lack) or W(hite)

�Types of Joins� BB 1—1 xi.xj

� WW 0—0 (1-xi)(1-xj)� BW 1—0 (xi - xj)2

• three types exhaust sample

Page 11: Spatial Autocorrelation (3) Global Spatial Autocorrelation

Observed (left) and randomized (right)distribution for African Independence 60

BB = 18 BB = 9

Page 12: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Classic Join Count Statistics

�Statistics� BB: (1/2) Σi Σj wij.xi.xj

� WW: (1/2) Σi Σj wij.(1-xi).(1-xj)� BW: (1/2) Σi Σj wij.(xi - xj)2

� BB + WW + BW = (1/2) S0= (1/2) Σi Σj wij

�Weights� wij = 1 or 0 (binary, symmetric)

Page 13: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Inference for Join Counts

�Use Sampling Theory� binomial with or without replacement� free and non-free sampling

�Randomization/Permutation� compute reference distribution� assess “extreme” value� pseudo-significance

Page 14: Spatial Autocorrelation (3) Global Spatial Autocorrelation

Reference distribution for BB Count(999 permutations)

P[BB ≥ 18] = 0.04

Page 15: Spatial Autocorrelation (3) Global Spatial Autocorrelation

Moran’s I

Page 16: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Moran’s I

�Moran’s I Spatial AutocorrelationStatistic� cross-product statistic

I = (N/S0) Σi Σj wij. zi.zj / Σi zi2

with zi = xi - µ and S0 = Σi Σj wij

�Inference� normal distribution� randomization� permutation

Page 17: Spatial Autocorrelation (3) Global Spatial Autocorrelation

Observed (left) and randomized (right)distribution for Columbus Crime

Moran’s I = 0.486 Moran’s I = -0.003

Page 18: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Reference Distribution (CRIME)

Page 19: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Reference Distribution (OPEN)

Page 20: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Interpretation of Moran’s I

�Positive Spatial Autocorrelation� I > -1/(n-1), or z > 0� spatial clustering of high and/or low

values• no distinction between high or low

�Negative Spatial Autocorrelation� I < -1/(n-1), or z < 0� checkerboard pattern, “competition”

Page 21: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Moran’s I for Rates

�Variance Instability� violates assumption of stationarity� spurious spatial correlation

�EB Adjustment (Assuncao-Reis)� standardize each rate

• zi = (ri – R/P) / SE

� use standardized rate

Page 22: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Spatial (lag) Correlogram

�Visualization� spatial autocorrelation statistics for

increasing orders of contiguity� use z-values

�Interpretation� identification of spatial process� range of association

• possible indication of misspecified spatialweights and/or nonstationarity

Page 23: Spatial Autocorrelation (3) Global Spatial Autocorrelation

Moran’s I by W z-value by W

Page 24: Spatial Autocorrelation (3) Global Spatial Autocorrelation

Moran Scatterplot

Page 25: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Moran Scatterplot

� Linear Spatial Autocorrelation� linear association between value at i and

weighted average of neighbors:Σj wij yj vs. yi , or Wy vs y

� four quadrants• high-high, low-low = spatial clusters• high-low, low-high = spatial outliers

�Moran’s I� slope of linear scatterplot smoother� I = z’Wz / z’z

Page 26: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Moran Scatterplot

Page 27: Spatial Autocorrelation (3) Global Spatial Autocorrelation

Significance Envelope

Page 28: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

EB Adjusted Moran Scatterplot

Page 29: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Use of Moran Scatterplot

�Classification of SpatialAutocorrelation

�Local Nonstationarity� outliers� high leverage points� sensitivity to boundary values

�Regimes� different slopes in subsets of the data

Page 30: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Spatial Clusters

Page 31: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Spatial Outliers

Page 32: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Moran Scatterplot - Regimes

Page 33: Spatial Autocorrelation (3) Global Spatial Autocorrelation

Bivariate/Space-Time

Page 34: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Moran Scatterplot Extensions

�Generalized Moran Scatterplot� Regression slope of Wz2 on z1

• both variables standardized• = visualization of multivariate Moran statistic

(Wartenberg)

� Significance testing• permutation• permutation envelope (2.5% and 97.5% from

permutation reference distribution)

�Four Types of Association� High-high, Low-low; High-low, Low-high

Page 35: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Bivariate Moran

Page 36: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Reference Distribution Bivariate Moran

Page 37: Spatial Autocorrelation (3) Global Spatial Autocorrelation

Moran Scatterplot Matrix

Page 38: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Space-Time Association

�Serial Autocorrelation� z at i,t with z at i,t+1

• standard scatterplot

�Spatial Autocorrelation� z at i,t with z at j,t� z at i, t+1 with z at j,t+1

• Moran scatterplot

�Space-Time Correlation� z at i,t with z at j,t+1

• bivariate Moran scatterplot

Page 39: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Space-Time Scatterplots

Page 40: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Geary’s c Statistic

Page 41: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Geary’s c Spatial AutocorrelationStatistic

�Geary’s c� squared differences

c = (N-1). Σi Σj wij.(xi-xj)2

/ 2(S0). Σi zi2

with zi = xi - µ and S0 = Σi Σj wij

�Inference� normal distribution� randomization� permutation

Page 42: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Moments of Geary’s c

�Normal approximation� E[c] = 1

• does not depend on W or y, nor on sample size

� Var[c] = [ (2S1 + S2)(n-1) – 4S02 ]

/ [ 2(n+1)S02]

• normal approximation does not depend on r.v.,randomization approximation does

� Inference� z = { c – E[c] } / SD[c]� normal approximation

Page 43: Spatial Autocorrelation (3) Global Spatial Autocorrelation

© 2003 Luc Anselin, All Rights Reserved

Interpretation of Geary’s c

�Positive Spatial Autocorrelation� c < 1, or z < 0� spatial clustering of high and/or low

values• no distinction between high or low• opposite sign from Moran’s I

�Negative Spatial Autocorrelation� c > 1, or z > 0� checkerboard pattern, “competition”

Page 44: Spatial Autocorrelation (3) Global Spatial Autocorrelation