derivation of biophysical variables from fine re for … · derivation of biophysical variables...

DERIVATION OF BIOPHYSICAL VARIABLES FROM FINE RE SOLUTION IMAGERY

FOR CO-PROCESSING WITH SOCIO-ECONOMIC DATA

IN AN URBAN AREA

Gabor Zsigoïics

A thesis submitted in conformity with the requirements for the degree of Master of Science,

Graduate Department of Geography, University of Toronto

Q Copyright by Gabor Zsigovics 2000

National Library m*m of Canada Bibliothèque nationale du Canada

Acquisitions and Acquisitions et Bibliographie Services services bibliographiques

395 Wellington Street 395. rue Wdlingtori OnawaON K l A W ûtiawaON K 1 A W canada CMada

The author has granted a non- exclusive licence aliowing the National Library of Canada to reproduce, loan, distribute or seii copies of this thesis in microform, paper or electronic formats.

The author retains ownership of the copyright in this thesis. Neither the thesis nor substantial extracts fiom it may be printed or otheMrise reproduced without the author's permission.

L'auteur a accordé une licence non exclusive permettant à la Bibliothèque nationale du Canada de reproduire, prêter, distribuer ou vendre des copies de cette thèse sous la forme de microfiche/nlm, de reproduction sur papier ou sur format électronique.

L'auteur conserve la propriété du droit d'auteur qui protège cette thèse. Ni la thèse ni des extraits substantiels de celle-ci ne doivent être imprimés ou autrement reproduits sans son autorisation.

Abstract

Derivation of biophysical variables from fine resolution imagery for CO-processing

with socio-econornic data in an urban area.

Master of Science, 2000,

Gabor Zsigovics,

Graduate Department of Geography, University of Toronto.

The integration of the social and physical sciences is a prevalent issue,

especially in urban studies. Recent advances in remote sensing and G.I.S.

provide the technological background to derive spatially and temporally detailed

information about the biophysical environment (such as land use ana land

cover). There is great potential in using such detailed and up-to-date information

in modelliny phenomena related to 'quality of life'.

This study demonstrates the feasibility of deriving land cover

characteristics from remotely sensed data and explores their relationships with

some socio-economic characteristics. Vegetation amount and pattern measures

are derived from both satellite images and aerial photography (at various

resolutions between lm and 100m). Relationships between these biophysical

variables and socio-economic variables are then investigated using various

qualitative and quantitative tools. Finally, prediction of some of the socio-

economic characteristics are explored applying statistical model selection

criteria.

Acknowledgements

I would like to express my sincere gratitude to Dr. Ferenc Csillag who

"coached" me so well during the last two years. Although I entered the program

with very little computational experience and even less statistical background,

with perseverance and Dr. Csillag's unwavering belief that I would pull through

the technical roadblocks, I have mastered a small chunk of a very exciting part of

the current geographical expertise.

The GUESS Research groups constant support was much appreciated.

Many thanks to Scott, Hannah, Zuzu, Marcy, Rebecca, Andy, Richard, Peter,

and Ken who al1 helped me a great deal.

It is impossible not to thank those who have helped me corne this far:

llona and Frank, my parents. They gave me that last little push to finally

complete the writing of the thesis. A big thank-you to my sister, Agnes, who

helped me refresh some of those trigonometric equations, and even looked over

some of my bibliography.

And to Monica, rny girlfriend, I owe the world. Let it be visits to PGB or

coffee breaks. her radiance made my "bad cornputer days" seem quite

acceptable.

This research was in part supported by the generous contribution of the

National Sciences and Engineering Research Council of Canada ("NSERC)

through an Industrial Scholarship.

iii

Table of Contents

Page #

Abstract ........................................................................................ ii

Acknowledgements.. ...................................................................... iii

Table of Contents.. ........................................................................ .iv

List of Figures.. .............................................................................. vii

List of Tables.. .............................................................................. .x

Abbreviations ................................................................................ .xi

I Introduction

1.1 Importance of the integration of the social and physical

sciences.. ...................................................................... -1

1 -2 Rationale.. ..................................................................... -3 1 -3 Objectives.. ................................................................... ..4

................................................................. 1.4 Site description 5

1 -4.1 Physical environment.. .......................................... -5 1 -4.2 Social environment.. ............................................. .6 1 -4.3 Biological environment.. ........................................ ..6

II Derivation of vegetation cover and pattern

2.1 Remote sensing of urban environments. .............................. .8

2.2 Sources of remotely sensed data.. ..................................... .ll 2.2.1 Aerial photographs.. .............................................. 1 1

2.2.2 Satellite imagery .................................................. .12

Page #

2.3 Resampling .................................................................... -13

2.4 Texture layers ................................................................. -13

2.4.1 Optimal kernel size for calculating texture ................... 15

2.5 Vegetation indices ............................................................ -17

2.6 Image classification .......................................................... -18

2.6.1 Selection of training sites ....................................... -19

2.6.2 Training signature statistics and analysis .................... 20

2.6.3 Supervised classification ......................................... 21

2.6.4 ISODATA clustering .............................................. 23

2.6.5 Hybrid classification ............................................... 23

2.6.6 Classified images ................................................. -24

2.7 Accuracy assessment ....................................................... -29

2.7.1 Producer's and user's accuracy ................................ 31

2.8 Satellite image (LANDSAT Thematic Mapper) ......................... 33

......................... 2.9 Contagion: A measure of spatial arrangement 35

III Exploriny the relationship between biophysical and socio-economic variables

3.1 Sources of socio-economic data ........................................... 38

3.1 . 1 Census Data ......................................................... 38

3.1.2 Social Deprivation Index ......................................... -39 3.2 Exploratory Data Analysis (EDA) .......................................... -40

3.2.1 QualitativeNisual assessrnent of variables ................... 40

3.2.2 Univariate statistics ................................................ -47

3.2.2.1 Descriptives ............................................... 47

...................... 3.2.2.2 Spatial dependence: Moran's I -49

3.2.2.3 Spatial dependence: Semi-variograms ............ 50

3.2.3 Multivariate statistics ............................................... 54

3.2.3.1 Scatterplots ............................................... 54

Page #

3.2.3.2 Decile plots .............................................. 58

3.2.3.3 Pearson's correlation tables ........................ 61

IV Co-processing of biophysical and socio-economic data using simple statistical

rnodels

Spatial linear regression model (SLM) .................................. 63

Regression trees ............................................................ -64

Residual diagnostics ....................................................... -65

Which model? - Model selection strategies ........................... 74

4.4.1 Akaike's Information Criterion (AIC) ......................... 74

Finding the appropriate resolution ....................................... 76

............................................. Census Tract Level -77

Non-spatial scaling of socio-economic variables to

a regular grid ...................................................... 83

References ................................................................................. -91

Appendix 1 ................................................................................... 98

List of Figures

Page #

Figur .e 1 : The study area as seen from the air. This area is the subject of a large research initiative nained the Sout h-east Toronto (SETO) Health Mapping Project. .............................................................. ..7

Figure 2: Flowchart of steps to derive vegetation cover from aerial photographs and satellite imagery of different resolutions ....................................... .........9

Figure 3: False-colour composite of the study area taken by the LANDSAT TM satellite in July of 1992.. ........................................................... 12

Figure 4: Accuracy of supervised classification at 1 m resolution as a function of kernel size used to derive the texture layer. ................................... 16

Figure 5: Accuracy of supervised classification at 5m resolution as a function of kernel size used to derive the texture layer.. .................................. -1 6

Figure 6: Assessrnent of the effect of changing the threshold for the vegetation classes in supervised classification. .................................. ..22

Figure 7(a-f.): Vegetation cover maps at 2m, Sm, 10m, 25m, Som, and 1 OOm resolutions ............................................................................... 25-7

Figure 8: Percentage vegetation cover as a function of resolution calculated for the entire SETO area and three smaller subsets (Subset A - Rosedale; Subset B -East York; Subset C - Downtown).. ...................... 28

Figure 9: Location of subsets used for accuracy assessment.. ........................ -29

Figure 10: Vegetation cover derived from the satellite image ............................ 34

Figure 1 1 : Contagion as a function of vegetation cover computed for the East York subset on a square grid using classified images at different resolutions.. ................................................................................ -36

Figure 12: Contagion as a function of vegetation cover computed for the three subsets (Subset A- Rosedale, Subset B-East York, Subset C- Downtown). ............ .. ................................................................. -37

Figure 13 (a-e): Spatial distribution of selected census variables.. ................... 42-4

Figure 14: Histograms of selected census and biophysical variables.. .............. 45-6

Figure 1 5 (a-b): Variograms for census and biophysical variables. ................ -52-3

Figure 16: Scatterplot matrix of selected socio-economic and biophysical variables.. .................................................................................... 56

vii

Paçe #

Figure

Figure

Figure

17: 3-0 bar graph displaying the relationship between SINGLE. ..................................................................... POPDENS. and VEG 56

18: Scatterplot of vegetation cover derived from 2m resolution aerial photograph and median family income ............................................... 57

19 (a-f): Decile plots of census variables as a function of vegetation cover ......................................................................................... -58

Figure 20: Spatial interaction of neighbouring units taken into consideration in a SLM ........................................................................................... 63

Figure 21 (a-d): Reduction in deviance plot. in addition to three pruned tree models ................................................................................. 66

Figure 22: Residual plots for the SLM predicting LNSINGLE from LNTM25M at the EA level ................... .... ......... - 6 8

Figure 23: Residual plots for the RT model predicting LNSINGLE from LNTM25M at the EA level ............................................................... 69

............ Figure 24 (a-b): Predicted maps of LNSINGLE for both the RT and SLM 70

Figure 25: Residual plots for the SLM predicting LNPOPDENS from LNTM25M at the EA level ............. ...... ..................................................... 71

Figure 26: Residual plots for the RT model predicting LNPOPDENS from LNTM25M at the EA level ............................................................... 72

Figure 27 (a-b): Predicted maps of LNPOPDENS for both the RT and SLM ....... 73

Figure 28: AIC as a function of the number of terminal nodes .......................... 75

Figure 29: Scatterplot matrix for selected variables at the CT level ................... 78

Figure 30: Residual plots for the SLM predicting LNSINGLE from LNVEG2M at the CT level ............................................................................. 79

Figure 31 : Residual plots for the RT model predicting LNSINGLE from LNVEG2M at the CT level ....................... ........ ........................... 80

Figure 32: Predicted maps of LNSINGLE using the RT and SLM models ............................................... at the CT level .......................... .. 81

Figure 33: Observed map of LNSINGLE at the CT level ................................. 82

Figure 34: SINGLE scaled to a lOOm grid using a tree regression determined at the EA level ............................................................................. 84

viii

Page #

Figure 35: Observed SINGLE for the area corresponding to that of Fig . 34 ...... -85

Figure 36: The spatial distribution of high vegetation amount area in sclcio-economic ................................................... feature space located in figure 17 89

List of fables Page #

'able 1 : Digital number (DN) differences of means. between the two rows of images calculated over a 50 pixal wide and 500 pixel long area located

.................................. on the adjacent sides of the two rows of images 11

Table 2: Classification scheme used in supervised classification of the fine resolution imagery ................................................................. -19

Table 3: Transformed Divergence separability values for classes of the ................................................................ supervised classification -21

................ Table 4: Comparison of Khat (Kappa) accuracy with overall accuracy 31

Table 5: Accuracy assessment according to user's and producer's accuracies ..... 32

Table 6: Jarman's (1 983) weights for a social deprivation index ........................ -40

Table 7: Descriptive summaries for all of the variables selected for the study ....... 48

Table 8: Moran's I coefficients for selected variables ...................................... -50

Table 9: Pearson's correlation coefficients for selected biophysical and socio-economic variables .......................... .. .................... -61

Table 10: Pearson's correlation coefficients for a smaller subset ........................ 60

Table 1 1 : .4 1C values for various model runs ................................................. 75

Table 1 2: Pearson's correlation coefficients for selected socio-economic and biophysical variables at the CT level .......................................... -78

Abbreviations

Abbreviation iûescri~tion of variable - - - - - 1

POPDENS l~ooulation densitv - - - - 1

l MM1 POP ,

UNIDEG INCOME SINGLE INDEX

I Percent vegetation cover derived from the satellite image using clusterina

Proportion of population who are immigrants Proportion of population with a university diploma Median family income Proportion of population living in single-detached homes Social de~rivation index . . . . .

C . -

NDVl IPercent vegetation cover derived from the satellite image using NOVI LN(NAME) l ~ h e LN preceding variable names denotes that the natural logarithrn

VEG (2M- 1 OOM)

1 Itransformation is applied.

Percent vegetation cover at different resolutions from aerial ohotoara~hs

I

Introduction

1.1 Importance of the integration of the social and physical sciences

Recently, interest has grown substantially in the study of urban

ecosystems. This is not surprising given the percentage of people living in large

urban centres. Scientists from both the physical and social sciences are

recognising that in order to gain a more complete understanding of these highly

heterogeneous systems, it is necessary to integrate their respective fields.

Their prominent research question is: "How do the spatial structure of economic,

ecological and physical patterns in an urban area relate to one another, and

how do they change over time?" (Pickett et al., 1997). Physical scientists need

"to learn from social scientists what the institutional, organisational, and

interactive features of humans are that should be added to ecosystem models to

make them more complete and useful" (Pickett et al., 1997) while social

scientists require "that the principles of bidogy to be taken into consideration in

an interpretation of social phenomena; that human society is not entirely an

artificial creation.." (Machalis et al., 1996, citing Sorokin 1928, 207). The major

current research agendas reflect this need for integration. For example, the

U.S. National Science Foundation has set up two of its new long term ecological

research (LTER) sites in urtian areas (Baltimore and Phoenix), while the

Canadian National Centres of Excellence (NCE) program is strictly funding

interdisciplinary research.

A promising field, aiding the integration of the physical with the social, is

remote sensing that now provides unprecedented detail of the physical

environment. Some of the more specific research agendas include "pixelisation

of the social" and vice versa (Geoghegan et al., 1998). For example, Wood and

Skole (1 998) linked satellite, census, and survey data to study deforestation in

the Amazon. The use of census data has also found use in image classification,

whereby socio-economic data are used in the selection of the training sites

(Mesev, 1998).

Biophysical variables derived from remote sensing images have been

found to be useful in estimating census data (Lo, 1995). Grove (1996), studying

the relationship between patterns of social stratification and vegetation structure

of an urbanhural watershed, found that selected socio-economic features at the

census block level explained a large proportion of the variance in vegetation

cover and percent imperviousness. A cluster analysis by Fernandes et al. (In

Press) has reinforced the findings of Grove ( 1 996). Although Grove's research

indicated a positive association between socio-economic status and vegetation

cover, Ryznar (1998) found a negative relationship between socio-economic

status and the percentage change in vegetation over a longer time period in the

city of Detroit. This negative relationship was attributed to the fact that

abandoned urban areas experienced a rapid proliferation of vegetation, such as

weeds.

The integration will lead not only to more varied and perhaps more

appropriate methodology, but it will allow the full description of the highly

heterogeneous urban environmen: with more appropriate concepts. Concepts

such as the ecosystem concept have been redefined to "human ecosystemsn fcr

applications to human populations (Machilis et al., 1997). The ecological

gradient paradigm has been a very useful analytical tool to provide cross-

sectional analysis of urban areas (McDonell and Pickett, 1990; Foresman et ai.

1997). Increasingly. investigators are using biophysical data to measure the

quaiity of life in neighbourhoods (Lo and Faber, 1997). From these examples it

is evident that interdisciplinary studies provide insight into the patterns and

processes of urban ecosystems.

Although previous research has shown significant relationships between

biophysical and socio-economic data, the relationships are not at al1 trivial.

Fu rther, exploratory analyses are needed to assess the relationships between

biophysical and socio-economic data at various scales. This study

demonstrates the feasibility of deriving land cover characteristics from remotely

sensed data and explores their relationships with some socio-economic

charactaristics derived from the census. Vegetation amount and pattern

measures are derived from both satellite images and aerial photography (at

various resolutions between 1 m and 1 OOm). Relationships between these

biophysical variables and census variables are then investigated using various

qualitative and quantitative tools. Finally, modelling of some of the socio-

economic characteristics are explored applying statistical mode1 selection

criteria.

1.2 Rationale

One of the major limitations of studies employing socio-economic data

(collected through censuses) is that, due to confidentiality issues, the finest unit

at which data are available is at the enurneration area (EA) level. This means

that a single value of a certain variable, such as median family income, is

associated to an area that could include a very diverse population. The

variance is effactively reduced to zero within a unit, potentially leading to issues

of ecological fallacy when relationships determined at one scale are associated

with associations at another scale (Openshaw, 1984). Furthermore, the census

units have fixed boundaries that do not necessarily reflect the patterns of human

or physical processes.

The second major limitation of census data is the collection frequency. In

Canada, a nation-wide census is conducted every five years. However, many

urban processes (e.g. urban sprawl) operate under shorter time scales. Thus,

the need for of up-to-date socio-economic data at various scales is becoming

more apparent (Lo and Faber, 1997).

On the other hand, remotely sensed imagery is now becoming not only

more frequently obtained (3 days in the case of the new IKONOS satellite

(Mulroney, 2000)) but also at a spatial resolution where one can almost discern

individual humans. The imagery is relatively inexpensive (in comparison to

ground suweys), therefore, it has the potential to become the ideal source of

data for a wide variety of applications.

The largest limitation of the imagery is that the variables studied by social

scientists are not readily derived from such images. The relationships are

cornplex, however, variables such as land use, urban temperature, vegetation

indices. and building structure derived from remotely sensed images have

already been demonstrated to have some association with socio-economic

variables such as median income (Lo and Faber, 1997; Grove, 1996). This

study will further explore the relationship between biophysical and socio-

economic variables in a smalfer part of a large metropolitan area.

1.3 Objectives

The purpose of this study is to:

1. Derive vegetation cover and pattern from satellite imagery and aerial

photographs of various resolution.

II. Explore the relationships between biophysical and socio-economic

variables.

I 1 1 . lmplement simple statistical models using both biophysical and socio-

econornic data.

1 -4 Site description

The study area is situated just east of the downtown core of the City of

Toronto, Canada (Fig. 1). Covering an area of about 16 km2, it has a

heterogeneous mix of both social and physical environments. This area is in

part chosen because of this great heterogeneity and in part due to the fact that

a large health project (Southeast Toronto (SETO) Health Mapping Project) is

focusing on the area, which might benefit from the results of this study.

1.4.1 Physical environment

The physical environment is characterised by a gently rising lake bed (of

ancient Lake Iroquois), divided by the Don River Valley which empties into Lake

Ontario. The relatively flat lakebed rises from the present Lake Ontario

shoreline at 75m a.s.1. to the old Iroquois shoreline at 150m a.s.l..at the northern

edge of the study area (Westgate et a/., :999). Wide remnant valleys incise this

northern area (Rosedale). The built physical environment adds another layer to

the relief. The area adjacent to Yonge Street is mostly covered with high rise

offices and apartmentslcondominiums. Moving east, Cabbagetown and

Riverdale, on the east side of the Don River, are characteristic of tree covered

residential areas. Regent Park to the south and Rosedale to the north create a

polarised axis of wooded estates and apartment blocks. The main

thoroughfares for traffic are the GardinerIDon Valley Park expressways, Yonge,

BloodDanforth, Mt. Pieasant /Jarvis, Queen, College streets.

The macroclimate of Toronto can be described as continental, with cold

winters (-3OC) and warm, humid surnmers (26OC) with winds dominated by the

westerlies (Munn et al., 1999). However, the macroclimate is significantly

altered by the urban structure. The urban heat island is well established,

especially over the western part of the study area (Munn et al., 1999). Heat

islands are not only associated with higher temperatures but also give rise to air

circulation patterns that can trap pollutants (Oke, 1987).

1.4.2 Social environment

The study area comprises a great mixture of communities, some of the

wealthiest (Rosedale) and poorest (Regent Park) neighbourhoods of Toronto

and of Canada as well. According to the 1991 Census of Canada. there were

122,830 people living in the area. The arbitrarily determined enurneration areas

(EA) divide this total population into 335 EAs, each consisting of on average

400 persons. In turn these 335 EAs are grouped into 28 nested census tracts

(CT), with an average population of 4200 persons.

The socio-economic environment conforms in many aspects to the

physical environment. The wealthiest neighbourhoods are those of Rosedale,

Riverdale, and Cabbagetown with tree covered single story houses (average

income $68,000). On the other hand, the poorest neighbourhoods are Regent

Park and Don Mount Court, which are public housing initiatives (average income

<$20,000). Many of the tenants in these public housing units are recent

immigrants to Canada.

1.4.3 Biological environment

Most of the original tree cover has been removed by human activities,

except for large tracts of woodlands in the northern parts of the study area

(Rosedale). Deciduous trees such as sugar maple, beech, and basswood and

conifers such as white pine and white cedar dominate these forests

(Eckenwalder, 1999). The other well-established neighbourhoods such as

Riverdale also have considerable tree cover with many gardens of introduced

species from various parts of the world. However, these vegetation

cornmunities created by people are not as structurally complex and do not fomi

a closed canopy. The marshlands still occupying the shoreline in the 19"

century around Ashbridge's Bay have been converted to industrial land-use,

wiping out a whole ecosystem (Zimmerman, 1999).

Figure 1: The study area (marked with the thick red line) as seen from the air. This area is the subject of a large research initiative named the South-east Toronto (SETO) Health Mapping Project.

II

Derivation of vegetation covei and pattern

The aim of this chapter is to present the methods of deriving vegetation

cover and vegetation pattern from satellite imagery and aerial photography. A

quick overview of the methods is included as a flowchart in Figure 2. First, the

usefulness of remote sensing will be discussed in the urban context. Sections

will follow on pre-processing, including resampling and texture layer calculations.

The classification of imagery, using supervised and unsupervised methods, will

be presented next, including accuracy assessment of the binary vegetation

maps. Finally, the importance and computation of a measure of vegetation

pattern will be discussed.

2.1 Remote sensing of urban environments

With the introduction of commercially available remotely sensed data,

manÿ areas of science have embraced this new data source with great

enthusiasm. One of the great advantages of rernotely sensed data is that it

aliows the researcher to study a rnuch larger region than traditional field studies.

Fields such as landscape ecology, whose focus is the larger ecological unit

covering many hectares, have just began to mature with the spread of this new

data source. Remotely sensed images not only cover a larger area, but they

also offer greater detail (1 m pixel resolution in the case of the new IKONOS

satellite). For most regions of the Earth, one is able to choose from imagery

collected by sensors of variable resolutions (1 m to 1 km). This scalability allows

for issues of scale to be addressed explicitly. The fine spatial resolution is often

matched by high temporal resolution (3 days in the case of the IKONOS

satellite). Although remotely sensed images offer a much wider geographical

scale, fine spatial and temporal resolutions, in most cases they do not directly

Figure 2: Flowchart of steps to derive vegetation cover frorn aerial photographs and satellite imagery of different resolutions. Of importance is the loop from accuracy assessrnent back to the classification process. The final vegetation cover maps are derived after many iterations of the classification process in which training signatures are continually redef ined,

provide the researcher with the variables that they are studying. These variables

of interest need to be derived from the images that are often represented by

digital numbers (DN), corresponding to reflectance. For example, land use/land

cover studies classify images using a classification system of interest (Gong and

Howarth, 1992); landscape ecologists derive leaf area index (LAI) or biomass

information through vegetation indices (Peterson et al., 1987); social scientist

estimate population counts (Lo, 1995), etc.

This study also harnesses the advantages of remotely sensed images and

evaluates their utility for urban studies. The main purpose of this derivation is to

obtain biophysical variables (vegetation cover, vegetation fragmentation and a

vegetation index) which would have been very difficult to obtain with ground level

collection and then evaluate their usefulness in CO-processing with socio-

economic data.

Althoug h vegetation cover has been mapped f rom remotely sensed

images (Gaydos, 1992), the mapping of vegetation in urban areas has not

received as much attention by the remote sensing community as it warrants,

pointed out Nowak (1994). Yet a more accurate knowledge of the spatial

distribution of vegetation cover can be vital for understanding the hydrology,

climatology of urban systems and as this study is hypothesisiny, it can help in the

identification of socio-economic status. Unfortunately, one of the most popular

classification systems, the USGS defined systern (Anderson, 1976) is very

deficient in labelling vegetation in urban areas. It is very much biased towards

classes of residential and commercial even if there were a significant canopy

layer. This bias is understandable given the fact that this system was designed

to help in the classification of the entire United States of which only 1% is

urbanised. The approach adopted in this study is to classify rernotely sensed

images obtained both from aeroplane and satellite platforms, using a land cover

classification based on tone and its spatial variation (texture).

2.2 Sources of remotely sensed data

2.2.1 Aerial photographs

Aerial photography at a scale of 1 :20,000 (pixel resolution of lm) is

obtained from the Colour Orthophoto Collection of the University of Toronto Map

LiSrary7s online database. The pictures were taken by the Triathlon Mapping

Corporation (1 995) who scanned, oriented, and rectified the diapositives. Seven

images are identified in the photo-collection that cover the south-east part of the

City of Toronto. A quick visual inspection reveals that the digital numbers are not

norrnalised between the images in the two rows (the southern row, covering part

of the lake, being somewhat brighter). A quantitative cornparison of a 50 cell

wide band on the edge of the two rows reveals that in the green and blue bands,

the digital numbers (DN) are higher

Table 1. Digital number (DN) differences of means, between the two rows of images calculated over a 50 pixel wide and 500 pixel long area located on the adjacent sides of the two rows of images. The numbers in brackets indicate the standard deviation.

Row 2 Row 1

in the southern row (Table 1). However, subsequent work with the images also

shows variation in intensity within the same photograph. These are important

considerations in the classification process since tonal differences in the same

image will require well-defined training areas for the various classes.

Due to the reflectance differences and to avoid working with very large

databases (>50MB) the photos from the two rows are kept separate. However,

photographs of the same row are merged (mosaicked) into one larger database.

RED 150.1 (46.1) 150.1 (37.8)

GREEN 141.57 (60.1 8) 147.9 (37.9)

BLUE 1 38.2 (53.94) 141 -5 (34.2)

N 25896 261 29

2.2.2 Satellite imagery

The satellite image was taken by the LANDSAT THEMATIC MAPPER

(TM) satellite, with a resolution of 30m, in July of 1992 (Fig. 3). This image,

comprising three bands (TM2-green:0.52-0.6pm; TM3-red:0.63-0.69pm; TM40

NI R:O.76-O.gpm), has been resampled to 25m resolution.

Satellite imagery provides a useful way of studying the resolution

dependence of the classification of vegetation. For CO-processing with socio-

economic data at the €A level, it may turn out that the use of very fine resolution

(1 m - 1 0m) is not warranted. Although the images were taken a few years before

the aerial photographs, it is assumed that the vegetation cover between the two

years remained relatively constant. This assumption is reasonable given the fact

t hat the study area includes long established neighbourhoods.

Figure 3: False-colour composite of the study area taken by the LANDSAT TM satellite in July of 1992. With this cotour coding , vegetation appears as red.

2.3 Resampling

To assess the effect of resolution on accuracy and percent cover, the

aerial photographs are resampled from l m resolution to Zm, Sm, 10m, 25m,

50m, and lOOm resolutions. The resampling algorithm uses an arithmetic

average over the specified window size that moves across the image without

overlapping. Although this is a very crude approximation in what truly happens in

sensors of coarser resolutions, it has been used widely in remote sensing studies

of scale (G raniero, 1 999; Burchfield, 2000).

2.4 Texture layers

A major limitation of traditional classification algorithms is that the spatial

structure of the data is not taken into consideration. However, as Tobler's First

Law of Geography states, it is to be expected that pixels near each other will

belong to more similar classes than pixels farther away frorn each other.

Therefore, classes that overlap in spectral space could be separated based on

spatial information on who are the neighboun. A great deal of effort has been

lately devoted to improve per-pixel classification by the inclusion of

neig h bou rhood information (Pesaresi, 2000; Chica-Olmo and Abaraca-

Hernandez, 2000; Graniero, 1999). It should be noted that most classification

implementations still use per-pixel methods, however, they niake use of

contextual layers that explicitly define the spatial structure.

There are several ways to obtain contextual information. Semivariograms,

correllelograms. etc. are used extensively (Arai, 1 993; Miranda et al., 1998;

Woodcock et al. 1988), however, probably the most often used technique is the

derivation of texture measures using some predefined algorithm (Hudak and

Wessman, 1998; Graniero. 1999; Pesaresi, 2000). Texture is closely linked to

tone and its spatial distribution. Image areas with tonal homogeneity are more

easily described by their characteristic tone (colour), whereas highly

heterogeneous areas in terris of tone are more easily described by their texture

(Haralick, 1 979).

Texture can be computed in several ways. Usually, it is computed with

the use of a grey-level CO-occurrence matrix that gives a count of tonal

adjacency. This matrix is then computed under a moving average filter with a

certain kernel size, and the central value of the filter is assigned a value

calculated with an equation such as entropy, contrast or angular moment. A

recent study has shown that most of these measures are highly correlated, with

contrast the most successful at improving classification (Pesaresi, 2000).

Interestingly, the early suggestions of using larger kernel sizes to rnimic the way

humans perceive texture, have not been followed because users have found

these iarger kernels to be inappropriate for heterogeneous areas such as cities

(Hodgson, 1998). This study makes use of a contrast measure given by Haralick

et al. (1 973) (Eq. 1).

CONTRAST = 1 (i-j12 ' Pi, / Z Po ;

where Pii is the estimated probability of going frorn grey level i to grey level j.

Contrast calculates the probability of occurrence of two brightness values

separated by a given distance in a given direction and within a specified kernel or

window size. The algorithm implementing the measure was written by Graniero

(1996). It takes a raw 8-bit image and under a specified moving kernel

calculates contrast using a horizontal displacement vector. Edge areas are

important in texture calculation because the algorithm can not compute the

measure where more than half of the kernel is displaced off the image.

Consequently, a band of zeros is created at the edge, which is as thick as one-

half the size of the kernel. This can seriously effect the analysis if the images are

small or if the images are later mosaicked.

As it will become evident later, vegetation, especially, tree cover is

spectrally very similar to shadows cast by trees and buildings. It is hypothesised

that the texture layers will improve the separability of these two classes.

2.4.1 Optimal kernel size for calculating texture

Contrast is computed over a range of kernel sizes from 3 to 25 pixels.

The most appropriate kernel size is detemined during the classification process

using accuracy assessment by the Kappa statistic. However, the ideal kernel

size can also be deduced theoretically by considering the needs of the

classification process. Although vegetation cover exhibits great heterogeneity in

the urban landscape, it is safe to assume that tree canopies are in the range of

3m to 20m in diameter. Thus, the optimal kernel size will most likely be in this

range.

The ideal kernel size is assessed on a smaller subset by performing

supervised classification using texture layers calculated at different kernel sizes.

The resülts indicate that the accuracy is maximised at kernel sizes of seven and

nine, at 1 rn resolution (Fig. 4). The effect of texture is also assessed for coarser

resolutions. At 5m resolution, where kernel sizes of three and five correspond to

15m and 25m wide areas, reveal that at a kernel of three, accuracy is improved,

however, at a kernel of five, it decreases (Fig. 5). These findings support results

from the literature which emphasise that texture is most useful for fine resolution

imagery (Graniero, 1999). Consequently, texture layers are computed only for

1 ml 2m, and 5rn resolutions.

Kemal Jze

Figure 4: Accuracy of supervised classification at l m resolution as a function of kernel size used to derive the texture layer. The accuracy is 0.52 without the use of texture layers. The ideal kernel size seems to be seven or nine pixels wide.

Figure 5: Accuracy of supervised classification at 5m resolution as a function of kernel size used to derive the texture layer. At a kernel of five (25m) accuracy is actually reduced from the case where no texture layer is used.

2.5 Vegetation indices

Indices of vegetation amount have been widely employed by ecologists,

however, they have also been utilised in mapping of vegetation as well. For

example, Gaydos (1992) classified aerial photographs by computing the

Normalised Difference Vegetation Index (NDVI) and then selecting a threshold

for the vegetation class. NDVl is based on the fact that chlorophyll absorbs red

light (RED) and mesophyll tissue strongly reflects infrared radiation (IR).

Therefore, the 1R.R ratio can provide a measure of the abundance of "green"

vegetation within a particular pixel. NDVl is a useful parameter for estimating

biomass or vegetation vigour. It is easily computed using the following equation

(Eq-2).

NDVl is computed for the satellite image using the TM3 and TM4 bands. The

values are scaled to O - 100. To convert the NDVl layer into a map of vegetation

a cut-off of 30 is arbitrarily specified as the boundary between vegetated and

non-vegetated. Deriving vegetation maps in this fashion eliminates the need to

perform classification on the map. The big limitation in this study is the lack of a

near infrared band for the aerial photographs. However, even with the help of a

IR band, Gaydos (1992) found a high percentage of confusion between the

vegetation and shadow classes.

2.6 Image classification

The literature abounds with studies whose main objective is the derivation

of land use from remotely sensed images (Gong and Howarth, 1992; Gong and

Howarth, 1990a; Haack, 1 987; Green et al. 1994). In contrast, the needs of this

study dictate classification according to land cover. While the derivation of land

use can be a much more complicated issue due to the difficulty of assigning land

covers s hared between land uses to culturally defined classes, the derivation of

land cover can be equally challenging in some instances. Although the objective

is sirnply to derive a binary map of vegetated and non-vegetated surfaces, the

problern of inadequate spectral resolution and the effect of shadows, create a

similar situation faced by researchers primarily interested in land use.

All classification schemes should incorporate two phases in their

implementation (Gong and Howarth, 1990a). In the first phase, an appropriate

classification scheme should be chosen. The scheme should either implement

an already existing scheme such as the very popular Anderson classification

system (Campbell, 1996) cr introduce a new or modified scheme that will be

ideal for the project at hand. Choosing an existing system will allow future users

to compare classifications. The nature of this study dictates that instead of land

use categories, land cover categories should be chosen. Moreover, the nature of

the data (colour photograph with red, green, and blue bands) limits the

classification to be based on colour (tonal variation) and texture. Consequentlÿ,

classes are chosen based on surface types and colour (Table 2). For some of

the coarser resolution imagery, the number of classes is reduced because of the

inability to find homogeneous sites for some of the classes like roads and grey

rooftops. The second phase of a classification process comprises the actual

implementation of the classification using the categories decided upon in phase

one.

Tree canopy Dark-g reen

Grass Light-green

Shadow (large) Dark-grey Shadows of large buildings

Shadow (thin) Light-grey Shadows of small buildings

Urban(highly Orange Mainly concrete, white paint ref lect ive) Urban (grey rooftop) Red A great mix of surfaces that looked grey

Urban (Roads) Purple

Table 2: Classification scheme used in supervised classification of the fine resolution imagery.

2.6.1 Selection of training sites

A good classification is predicated on the quality of the training data. One

of the first impressions during the classification is the large geographic sîale of

the study area, even when it is partitioned into two halves. To capture the slight

differences in reflectivity within the same class over the whole image, training

pixels are collected from across the whole image. Large images can have quite

dramatic changes of intensity moving from one end of the image to the other.

This in turn is due to camera limitations and bidirectional reflectance effects.

One of the assumptions of the Maximum Likelihood Classifier is that the

training classes are normal. In order to achieve such ideal distributions, training

sites are located in homogenous areas, in most cases as groups of pixels.

Selecting groups of pixels may be a violation of independently selected pixels,

because blocks of training data contain cells which have neighbouring cslls of

similar values and thus they are autocorrelated to some distance and not totally

independent. Many producers of supervised classification collect training data in

contiguous blocks that may violate the independent assumption of the Gaussian

classifier. In order to obtain training data where each pixel contributes

information independently to the class signature, training data should be

collected randomly by single pixels. Campbell (1 981 ) found a significant

difference between block collected training data and single pixel training data.

Moreover, Gong and Howarth (1990a) also found that single pixel, randomly

collected training data produced better separability between classes. However,

the effect of the difference of using randomly picked field data as opposed to

collecting contiguous training data on the classification results varied throughout

the growing season (Campbell, 1981). in fact, the difference between

classification from June and September resulted in only a slight difference. The

effect of spatial autocorrelation on the training areas will most likely be

dependent on the resolution of the imagery. In this study, the assumption is

made that the error due to the difference between single pixel versus block

training is orders of magnitude smaller than that of the error caused by the

confusion of vegetation and shadows.

2.5.2 Training signature statistics and analysis

A good classification is rooted in the quality of the signatures. Hixson et

al. (1 980) provide evidence that differences in the selection of training data are

more important influences on accuracy than are differences among different

classification procedu res. Signature separability is a powerful anal ysis tool that

can assess the quality of signatures. If the separability between two classes is

high, they wiil most likely produce an accurate classification with minimal

confusion. A common measure of separability is the Transformed Divergence

index, which is calculated from the means and covariance matrices of each

spectral class or training site. Statistical distances can be calculated not only

between classes but also between training sites of the same class. This

separability is an indirect estimate of the Iikelihood of correct classification

between groups of different band combinations (Haack et al., 1987). A

separability of 1.5 or higher is acceptable, but in order to have very good

classification results 1.9 or higher is recommended (max. = 2.0) (Gong and

Howarth, 1990b).

The key to an accurate vegetation map is to obtain high signature

separabilities between vegetation and shadow classes (Table 3). Unfortunately,

the separability is quite low (1.45) between the tree class and large shadow class

due to the fact the two classes had very similar spectral properties and in many

cases similar textural properties. However, the separability is even lower

between the trees and grass classes. For this reason, the grass and tree

classes are later merged into one, vegetation class. It appears that the texture

layer is successful at discriminating between vegetation and thin shadows (1.99).

These thin shadows are cast by smaller, one or two storey buildings.

Table 3: Transformed Divergence separability values for classes of the supervised classification. The lowest separability (boldface) is observed between trees and grass, and trees and large shadows.

2 6.3 Supervised classification

Separability

Vegetation jG rass) Urban (Highly ref lec tive) Urban (Grey rooftop) S hadow(large) Urban (Roads) Shadow(thin)

The classifier chosen for this study is the popular Maximum Likelihood

Classification (MLC). The MLC is a per-pixel classifier that assigns each pixel to

a class with the highest likelihood of belonging to its probability density function

that is Gaussian. The probability density function is calculated from the means

Urban (Grey rooftop)

1.99 1.86 1.99

Urban (Highly ref Iect ive)

1 -85

2.00 1.98 2.00

Shadow (larqe)

2.00 _ 1.99

Vegetation (Trees)

1.22

2.00

1.95

1.45 1 -99 1.99

Urban (Roads)

- 2.00

Vegetation (Grass)

2.00

1 -98

1.94 1.99 2.00

and variances of the training data. The classifier does not take into consideration

the location of pixels, or the spectral characteristics of surrounding pixels. Error

may enter into this classification method if the classes do not display multivariate

normal frequency distributions (Campbell, 1996). Studies of fine resolution

images have highlighted the fact that per-pixel classifien such as MLC, which

were developed for coarse resolution MSS data, are not appropriate at finer

scales and that new classifiers should be developed (Gong and Howarth,

1990a). However, until new algorithms are developed and tested, most

classifications of fine resolution imagery are dependent on the use of contextual

information derived from the images to improve the traditional classifiers.

Maximum Iikelihood classification allows the specification of the threshold

for the probability of accepting a pixel into a class. The effect of the change of

this variable is assessed similar to the selection of the optimal kernel size. Of

course, as the threshold is increased the amount of correctly classified pixels

increases, however, the errors of commission rise along with it and the errors of

omission decrease. The threshold for which there is a comparatively similar

decrease of omission errors and an increase of commission errors is 4 (Fig. 6).

This threshold is implemented in al1 classifications.

U>

60000 50000 - . / - -Commission % 40000 . - +-Omission O g 30000 . - Correct

Thnshold for vegetation class

Figure 6: Assessrnent of the effect of changing the threshold (in spectral space) for the vegetation class in supewised classification. The errors due to commission steeply rise beyond THRS.

2.6.4 ISODATA clustering

Unsupervised classification entails an unguided approach to image

segmentation. It is automated and it uses an iterative procedure whereby pixels

are assigned to the closest class (minimum Euclidean distance), after which the

means of the classes are recalculated and the pixels reassigned to the closest

classes until a pre-specified confidence limit is reached for al1 pixels (Jensen,

1996). One of the big limitations of clustering is the lack of control over class

selection and the subsequent assignment of classes to various land use or land

cover classes. This assignment is especially difficult for images of 10m

resolution or coarser.

2.6.5 Hybrid classification

Preliminary classifications of a subset in the downtown portion of the study

area reveal that the optimal classification is that using a supervised approach.

The user-guided approach allows for a better separation of the shadow and

vegetation classes. On the other hand, during the classification of the entire

study area, clustering proves more effective for the areas outside the downtown

core, where the confusion between shadow and vegetated pixels is much lower

than in the downtown core. Thus, a hybrid approach is implemented. A bitmap

is created for the downtown core, under which a supervised classifica!ion is

carried out, whereas the other parts of the image are classified by ISODATA

clustering. Depending on the resolution, 15-25 clusters are created by the

ISODATA algorithm, which are then assigned to either the vegetated or non-

vegetated classes. This assignment of classes gets increasingly difficult as the

resolution is coarsened. Due to the difficulty of finding homogeneous classes, al1

images with 25m or coarser resolution are classified using the unsupervised

approach. For classification of the finer resolution imagery (1 m. 2m, 5m), four

layers are used: red, green, blue and texture layers. For the coarser imagery,

only the three visible bands are ernployed. The use of al1 three spectral bands is

most likely superfluous as they contain redundant (multicollinearity, ~~>0 .98 )

information.

Although there are several classes created during the classification

process, they are al1 either assigned to the vegetation class or the urban class.

These final classified images for the two rows of imagery are mosaicked to

produce the final classified map for the whole study area (F ig. 7a-f).

2.6.6 Classified images

The proportion of vegetation to urban exhibits great variability over the

study area. Vegetation cover increases from zero cover in some of the downtown

blocks to over 100% in areas of Rosedale. This proportion, however, is scale

sensitive. There is evidence of the vegetation class to shrink in percentage

cover as the resolution coarsens (mainfy due to the mixed pixel effect), in areas

where vegetation at a fine resolution is fragmented and of low total cover (e-g.

East York) (Fig. 8). On the other hand, in areas of high vegetation cover. the

proportion actually increases (Fig. 8). It is interesting to observe that the

proportion of vegetation to urban increases and then levels off for the downtown

subset (Fig. 8). This could be caused by some of the parks in the subset being

aggregated with other nearby treed areas. The percentage cover of vegetation

over the whole study area is constant (-23%) until about 10m resolution, then

dips to 17% (at 50m res.) before rising again to 20% (at 100m) (Fig. 8).

Generally, classes that are clumped at a fine resolution will grow in proportion, as

they are aggregated (Cao and Lam, 1997). It should be stressed that these

proportions can be very much influenced by the training class selection in a

supervised approach and by cluster assignment in unsupervised classification.

For example, the assignment of clusters to the

Figure 7 (a.): Vegetation cover at 2m resolution.

7 (b.) Vegetation cover at 5m resolution.

25

7 (c.) Vegetation cover at 10m resolution.

7 (d.) Vegetation cover at 25m resolution.

7 (e.) Vegetation cover at 50m resolution.

7 (f.) Vegetation cover at 100m resolution.

27

correct land cover class is especially difficult at coarse resolutions due to the

mixed pixel effect. Also, at coarse resolutions, the availability of homogeneous

areas for training class selection is rninimised. Since separate classifications are

perforrned on the two rows of images, there is some discrepancy between the

two rows in the final classified images.

A 100 f à? V 8

8 o i m jWm . SET0

O r SUBSETA c 60 O - - i A SUBSEl-8

40 hA Q) x SUESETC Ul a3 X

X I*' p , 20 t 0 0 0 lxX A

t

O ' A * Y

Resolution (m)

Figure 8: Percentage vegetation cover as a function of resolution calcukted for the entire SET0 area and three smaller subsets (Subset A - Rosedale; Subset B - East York; Subset C - Downtown (see Fig. 9 for the location of the subsets)). The percent cover rises in areas (Subset A) where at a fine resolution the vegetation is clumped and decreases in areas where it is fragmented (Subset 6).

2.7 Accuracy assessment

Accuracy is assessed by the comparison of the classified images with

those of three visually classified areas (Fig. 9). Two of these areas are in the

downtown core and the other is in a suburban area. Visual classification is

su perior to automated classification, because trained interpreters can directly

Figure 9: Location of subsets used for accuracy assessment (Subset D, E,F). Subsets A,B, and C are used to calculate contagion (see section 2.9).

identify the classes of interest (Martin and Howarth, 1989). However, error

enters the visual interpretation in several ways. First, vegetation is delineated

with the use of a cursor, which sometimes goes one or two pixels over or under

the perceived boundary of vegetation. Second, even at l m resolution, some of

the vegetation is hard to distinguish from other objets of similar colour such as

green roofs. In addition, if vegetation is concealed by shadows, it is omitted.

However, some of the shadows cast by trees that are true shadow pixels could

be included in the ?ruew image. These error sources are minimised, yet they still

have an effect in the accuracy assessment of the classified images.

The coincidence matrix (also called contingency or confusion matrix) is a

universal tool in remote sensing accuracy assessment. The elements on the

main diagonal of the matrix represent the number of pixels of the same class that

overlap and the elements off the main diagonal are the pixels of the same class

that do not overlap (Congalton et a/., 1983). In most cases, column headings are

those of the reference data and row headings of the classified data. Reference

images are often collected in situ or derived from high resolution photography or

imagery. Due to the immensity of data volume for high resolution databases,

most analyses rely on stratified random sampling to obtain the reference data

(Congalton, 1991). However, as pointed out before, the present study utilised

small subsets as reference maps obtained by visual classification. This could

bias the estimates in areas further away from the subsets but overall it provides a

very large number of pixels to be used in the assessment.

Accuracy can be computed in several ways. For the combined accuracy

of al1 the classes, overall accuracy ana the Kappa coefficient have been widely

implemented. Overall accuracy has been the traditional coefficient of accuracy,

however, it overestimates accuracy by only taking into consideration the

elements on the main diagonal (Table 4). Congalton et a/. (1 983) was one of the

first to recommend the use of Cohen's Kappa coefficient (Eq. 3), as it also takes

into consideration the elements off the main diagonal, the errors of omission and

commission. The Khat statistic gives an indication of accuracy after random

agreement is removed from consideration.

Kappa coefficient: r

N ' x x , - 1 (x*' x,,) 1=1 1=1

,where N is the total number of classified pixels, Xii is the element on the main

diagonal, Xi+and X+; are the row and column totals respectively.

Table 4: Cornparison of Khat (Kappa) accuracy with overall accuracy. By not considering the off-diagonal celfs, overall accuracy overestimates the more reasonabie accuracy given by the Khat statistic.

SET0 KHAT OVERALL

2.7.1 Producer's and user's accuracy

While the Kappa statistic gives an overall estimation of accuracy, it can be

very biased when there are only two classes with disproportionate coverages. In

such cases the larger class will naturally have very high agreement, which then

biases the Kappa statistic. Therefore, individual class accuracies as catculated

by the producer's and user's accuracies might give a better indication of how

close the classification is to the "truth". Producer's accuracy indicates how many

of the classified pixels of a reference class actually belonged to that reference

class. These are the errors of omission and usually occur when class

boundaries (in feature space) are underestimated. On the other hand, user's

accuracy is a measure of how usable the classified map is, that is are the pixels

on the classified maps accurate (these are the erron of commission).

Resolution(m) 1

2 1 S 0.671 1 0.620 0.897 1 0.916

25 0.385

, 0.798

1 O 0.657 . 0.912

50 1 100 0.389 1 0.295 0.846 1 0.830

Table 5 indicates that from a user's perspective for fine resolutions, the

probability that a pixel on the classified map is vegetation in reality as well is

around 80 O h in âll three subsets. This probability decreases to 40 % for the

coarser resolutions. It is interesting to observe that the user's accuracy for the

25m resolution satellite image the accuracy is comparable to that of the 25m

resolution aerial photography, and in two of the subsets even superior. This is

quite encouraging for the satellite imagery since it means that for many

applications, gains may not be realised by the use of fine resolution aerial

photography. In terms of producer's accuracy, the values are much lower than

for the user's accuracy because of the confusion created by the shadow class

and the errors that entered into the delineation of the "true" map. Both user's

and producer's accuracies are influenced by the percentage of coverage. For

example, the producer's accuracy for Subset E at 2m, with 6.1% vegetation

cover, is less than in either Subset D or F, where the percentage cover is much

higher. The higher the percentage cover of a class, the higher the likelihood for

agreement.

Table 5: Accuracy assessrnent according to user's and producer's accuracies computed for three subsets (please see Fig. 9 for the location of the subsets). Of interest are the relatively high accuracies of the satellite image (TM-25).

PRODUCER'S (Omission) SUBSET D SUBSET E

% Vegetation Cover at 2m

19.5 6.1

Resoiution of Vegetation 2m

0.52 0.38 0.69

0.76

0.83 0.77

SUBSETF USER'S (Commission) SUBSET D

SUBSET E SUBSET F

. 34.8

19.5

6.1 34.8

5m

0.61 0.58 0.66

0.70

0.79 0.80

50rn

0.81 0.50 0.34

10m

0.83 0.72 0.59

0.64

0.70 0.77

i

lOOm

0.87 0.39 0.35

25m

0.49 0.70 0.37

0.51

0.39 0.m

TM-25m

0.36 0.34 0.60

0.82

0.57 0.78

0.49

0.46 0.71

0.33

0.40 0.53

2.8 Satellite image

The costlbenefit ratio, in terms of time and finances, is in favour of the

derivation of vegetation cover from the satellite irnagery. In addition, the

accuracy assessrnent indicates that vegetation cover derived from satellite

imagery is just as accurate as finer resolution aerial photography.

Although the process of classification is the same for satellite and aerial

photographs, the coarser resolution and the availability of an IR band merit sorne

further discussion. Due to the relatively coarse resolution, the number of land

cover classes used is five as opposed to eight for the fine resolution aerial

photography.

The false colour composite image provides great aid in the delineation of

vegetated versus non-vegetated or non-photosynthesising surfaces (Fig. 3).

Light blue signifies cernent, asphalt, or bare soi1 surfaces that are the

characteristic of primary materials in urban areas. Red areas are vegetated

areas, with higher intensities of red signifying higher percentage of vegetation.

Brownish red areas are mixed pixel areas where urban materials are mixing with

vegetation. Sometimes, an analyst can be at odds with what to do with such

mixed pixels. Of course, if one trains for these areas as vegetation al1 of the

mixed pixels will be classified as vegetation, if one does not train for them,

vegetation will be omitted in the resulting classification which are contained by

the mixed pixels.

Supervised and unsupervised classifications result in very similar results

(Fig. 10). Therefore, to Save time, clustering is chosen for the TM images. Two

of the five classes identified by the clustering are easily assigned to the tree and

grass classes, however, a third class contains the mixed pixels of residential

areas. The inclusion of these pixels significantly increases the percentage of

tree cover. This can be a very serious limitation of coarse imagery for tasks such

as vegetation mapping because individual trees blend into the urban fabric. The

near-infrared band does eliminate most of the shadow-vegetation confusion,

however, at 25m resolution shadows are also considerably blended into the

urban fabric.

A comparison of the 25m satellite and the 25m aerial photo derived

classifications show very low consistency (-40%). This could be caused by

producer's errors during the classification or by inconsistencies in the geometric

registration of either of the images to the UTM grid.

Figure 10: Vegetation cover derived from the satellite image at 25m resolution.

2.9 Contagion: A measure of spatial arrangement

One way of characterising urban vegetation structure is to assess its

cover. However, spatial arrangement of this vegetation cover may also be an

important variable to consider when studying relationships between biophysical

and socio-economic characteristics. "Configuration of land use is one of the

major contributors to the quality of life (Geoghegan et ai., 1997)".

Many different landscape indices have been proposed by ecologists to

measure habitat fragmentation, species diversity, etc (Riitters et al., 1995). They

have also found some application in urban studies (Burchfield, 2000; Geoghegan

et al., 1997). One commonly used index is contagion, which measures the

degree to which classes are clumped within a landscape. At low contagion, the

landscape exhibits high fragmentation whereas high contagion indicates low

fragmentation. Contagion is a modification of the entropy equation and is similar

to the first-order join count statistics.

m m

Contagion = 2 ' ln (m) - Z Z Pij ' ln (Pij) ; i=1: j=l

,where m is the total number of land classes and Pi, is the total number of times

attribute i is adjacent to attribute j. The values of Pu form the entries of the so

called grey-level CO-occurrence matrix (GLCM). The second term of the equation

is the entropy equation, which is a maximum when al1 pixels of attribute i are as

far apart from one another as possible. (Baker and Gai, 1992)

Contagion is calculated in the GRASS GIS program (Baker and Cai. 1992)

for three different subsets of the classified image (Fig. 9). All three areas

measure 600m by 600m. These areas are divided up into 36, 100m wide,

squares. Contagion, as well as vegetation cover, is calculated within each

square. In one of the subsets, the effect of resolution is assessed by calculating

contagion and vegetation cover using different resolutions.

As is evidenced by the graphs in figures 1 1 and 12, the relationship

between contagion and cover is very predictable which is an artefact of the

contagion algorithm. Contagion is a minimum at approximately 50% cover

because this is when the likelihood of maximally mixing two classes is the

highest. These findings have also been documented in the literature (Gustafson

and Parker, 1992; Benson and MacKenzie, 1995) and point to the fact that one

should be careful when using indices such as contagion, when characterising the

spatial arrangement of landscapes. Therefore, contagion is not used later in the

modelling.

Figure 11: Contagion as a function of vegetation cover computed for the East York subset (Subset 6) on a square grid using classified images at different resolutions. It appears that contagion is highest at 2m resolution.

O 20 40 60 80 1 0 0 120

Vegetation cover (%)

Figure 12: Contagion as a function of vegetation cover cornputed for the three subsets (Subset A- Rosedale, Subset B- East York, Subset C- Downtown). The shape of the plot is characteristic of contagion/cover relationships.

As presented in this chapter the derivation of biophysical variables such

as vegetation cover and vegetation pattern are feasible from both satellite

imagery and aerial photography. The time and effort required to process the

aerial photographs are in excess as that for the satellite image. Despite the

coarser resolution of the satellite image the accuracy of the classified maps is

comparable to that of the accuracy of maps derived from aerial photographs.

However, it remains to be seen whether there is significant difference in the

results of statistical analyses between different resolutions when these

vegetation rnaps are aggregated to the €A level.

111

Exploring the relationship between

biophysical and socio~onomic variables

The following chapter first describes the sources of socio-economic data

and the reason why the variables are chosen. Other sections follow on

exploratory data analysis, including qualitative and quantitative assessment.

Spatial dependence is measured using Moran's I statistic and by the semi-

variogram. Multivariate analysis includes both scatterplot analysis and

correlation table calculation.

3.1 Sources of socio-economic data

3.1 .1 Census data

A number of variables are selected from the 1991 Census of Canada

database at both the enumeration area (€A) and census tract (CT) level of

census geography. The choice of variables is dictated by suggestions from the

literature. For example, Grove (1 996) in his study of the reciprocal relationships

between vegetation structure and social stratification in an urban watershed

selected variables suggested by Shevky and Bell (1 955) that were classified as

belonging to one of three different groups of indices, namely a socio-economic,

household and ethnicity index. The socio-economic index included variables

reflecting income and education levels. The household index captured variables

such as marital status, home-ownership, and type of household. The ethnicity

index reflected the race and immigrant status. Finally, based on these three

indices a fourth composite social area index was built. Ryznar (1998), who

studied the relationship between vegetation change and the social environment,

in addition to socio-econornic variables included demographic indicators such as

fertility and age structure. Lo and Faber (1997) assessed the feasibility of

integrating Landsat Thematic Mapper and census data for quality of life

assessment using the following socio-economic variables: population density,

per capita income, median home value, and percentage of college graduates.

The variables selected for this analysis are as follows: population density,

number of single-detached homes, number of immigrants, number of university

graduates, and median household income.

The count variables are divided by the total population of the EA and then

multiplied by 100. One of the limitations of EA level data is the problern of

missing or suppressed values, especially for income related variables. Out of

302 EAs analysed for this study 94 have their values suppressed due mainly to

conf identiality issues.

3.1 -2 Social deprivation index

Socio-economic indices may be much better measures of the socio-

economic status of a geographical area than a single census variable. Social

deprivation indices have been snown to correlate with health in residential areas

(Mays and Chinn, 1989; Carstairs and Morris, 1989). Frohlich and Mustard

(1 996) computed a socio-economic index from six census variables. They found

a high correlation between the socio-econornic index and a health index.

However, Grove (1996) found that the utility of composite indices in the study of

the physical environment and the social environment needs a lot more research.

Grove (1996) found much higher correlations between single census variables

and biophysical variables than a composite index and biophysical variables.

The choice of variables for a social deprivation index should reflect the

main domains of socio-economic characteristics: dwelling, education,

employment, income, and mobility (Frohlich and Mustard, 1996). Jarman's

(1983) index, which reflects four of these characteristics, is calculated using

weights that were originally derived from a survey of General Practitioners (GPs)

in the UK.

Census category description Household of 4 or more people Lone parent families Rental housina units

Table 6: Jarman's (1 983) weights for the social deprivation index. Weights derived from a survey of General Practitioners in the UK.

--

' Weights 2.88 3.01 3.6 -

People 15+ without secondary education People 15+ unemployed People 15+ not in labour force Famifies with low incorne

All of the variables are first standardised and then assessed for

skewness. The removal of skewness has been identified as a prerequisite for

the generation of a composite index (Gilthorpe, 1995). Families with low income

are not calculated into the equation because of the many (94) suppressed

values.

2.9 3.34 3.34 3

3.2 Exploratory data analysis (EDA)

3.2.1 QualitativeNisual assessrnent of variables

A quick way to gain soma understanding about the spatial distribution of

the variables is to map them. The maps presented on the next few pages are

created using quartiles (Fig. 13 a-e). lt should be stressed that the pattern

shown is heavily influenced by the boundaries of the EAs with larger units

tending to dominate. The chloropleth maps also make it seem as if the variable

being mapped applies to every part of the study region, when it is evident from

the aerial photograph (Fig. 1). that there is great heterogeneity in land uses, let

alone socio-econom ic characteristics. In addition, chloroplet h maps can be

misleading because alternative choices of shading and class intervals can lead

to widely different visual interpretation (Bailey and Gatrell, 1995, p.256; Langford

and Unwin, 1994).

Keeping these reservations in mind, there are very evident trends on the

maps presented in figures 13(a-e). Population density displays a pattern where

the highest densities are observed in €As, which occupy apartment buildings

(Fig. 13a). Outside these very small EAs, the density is somewhat higher on the

east side of the study area (Riverdale) as opposed to areas of Rosedale to the

north and Cabbagetown, just south of it. SINGLE shows a highly clumped

distribution (Fig.13b). The downtown sections of the study area have almost no

single-detached homes, whereas, Rosedale and Riverdale have much higher

percentages of single-detached homes. A somewhat different spatial

distribution, yet still highly autocorrelated, can be observed in the case of the

proportion of university graduates (Fig. 13 c). EAs in the highest quartile are

found mainly in Rosedale, Cabbagetown and a few in Riverdale. Those in the

lowest quartile are found near the public housing projects and the apartment

EAs. The distribution of the percentage of immigrants exhibits more of a random

pattern, however, recent immigrants also tend to congregate in the public

housing projects, apartment blocks and the south-east portion of the study area

(Fig. 13d). Last, the social deprivation index manifests a distribution of deprived

areas in the south of the study area and much better off areas in the north,

Rosedale, Cabbagetown (Fig. 13e). It is interesting to observe one of the

Rosedale EAs to be in the third quartile, which might indicate that the index with

these weights is not really suitable for this area.

The histograms of the variables reveal their one-dimensional distribution.

Three of the socio-economic variables (POPDENS, INCOME, and SINGLE) and

al1 of the biophysical variables (VEG) reveal skewness to the right (Fig. 14).

SINGLE 0.01 0.01 - 6.26

-5-5 6.26 - 18.88 W 18.88 - 06.58 Suppressed

Figure 13 (a-e): Spatial distribution of selected census variables. a. Population density is highest in small EAs containing apartments. b. The lowest quartile of SINGLE is al1 zeros since these represent the values of the apartment and downtown EAs.

Figure 13 (a-e): Spatial distribution of selected census variables. c. UNlDEG is highest in Rosedale and Cabbagetown. d. The distribution of IMMIPOP displays a pattern of higher proportions near public housing projects and in the southeast part of the study area.

Figure 13 (a-e): Spatial distribution of selected census variables. e. Socially deprived areas are those in the highest quartile. These are found near the public housing projects and near the lake.

. . . . - . OS2 OSL OS O

r . . . . . i

OZ1 08 OV O

O 20 40 60 80

VEG lOM

Figure 14 (Cont.): Histograms of vegetation cover variables at various resolutions. They are al1 skewed to the right and subsequently transformed with the natural log.

3.2.2 Univariate statistics

3.2.2.1 Descriptives

The distribution with the largest range is that of POPDENS. Its maximum

is at 1891299 pers./km2 and minimum at 408.4 pers./km2 (Table 7). Such high

population densities are observed because €As covering only apartment

buildings have relatively high populations and very small areas. One should be

careful not to attribute these maximums of population densities to larger areas of

the city. The standard deviation of POPDENS is 249145.7 pers./km2. SINGLE.

IMMIPOP, and UNIDEG have roughly similar ranges. The mean value of

SINGLE is 5.0 %, not surprising given the fact that a large proportion of the

study area covers the downtown core. The mean total immigrant population is

quite high at 36.9 %, reflecting a high influx of recent immigrants. The highest

value UNIDEG is 62.5 % with a mean of 21.0 %. Median family income ranges

from about $ 5000 to $ 194074. These extremes represent the poorest and

richest people living not only in Toronto but also in Canada. The mean of

JNCOME is $ 27014 with a standard deviation of $ 25719.7. The social

deprivation index ranges from -364.9 (higher standard of living) to 631 -4 (lower

standard of living). The mean, median, and standard deviation are 33.5, 18.4,

and 206.1 respectively.

The descriptive summaries for the biophysical variables are also included

in Table 8. For VEG, derived from the aerial photographs, the range widens

from 75.7 to 100 with coarsening resolution as a result of the derived cover to

aggregate at coarse resolutions. The mean of these same variables increases

from 17.3 (at 2m resolution) to 23.1 (at 25m res.) and then decreases to 16.2 (at

1 OOm res.). The standard deviation increases with coarsening resolution from

13.9 to 29.8.

The descriptive summaries like their corresponding histograms reveal that

a majority of the variables are heavily skewed to the right. Correcting for

1 1 1 I 1 1 1 1

SKEWNESS 1 2.85 1 3.38 I 0.70 1 0.44 1 1.61 1 0.53 1 1

CENSUS

MIN

MAX

TOTAL

STD

INCOME

5216.45

INDEX

-364.94

POPDENS !SINGLE

408.35 1 0.01

1891 299.00

302

2491 45.70

KURTOSIS

VEGCOVER

MIN

IMMIPOP

0.01

86.58

302

1 1.81

12.23

VEG2M

0 .O0

1

TOTAL

STD

Table 7: Descriptive summaries for al1 of the variables selected for this study. For INCOME the 94 suppressed EAs are rernoved.

UNIDEG

0.01

14.73

73.43

MEDl A N

MAX

SKEWNESS

KURTOSIS

skewness is important, since most of the traditional statistical techniques,

including Pearson correlations require variables to have normal distributions.

The variables exhibiting high skewness (A) are transformed using the natural

logarithm. To avoid taking the logarithm of zero a small increment is added to

variables with zero entries. In some cases (SINGLE, VEG), the transformed

variables still exhibit a non-normal distribution, due to a large number of EAs

having values close to zero. Other transformations are attempted, however, the

distributioris are still not close to normal, so the variables are left using the log

transformation.

122.23

302

17.66

14.83

75.71

302

13.90

1

19.98

88.89

1.29

2.28

62.51

302

14.66

4.48

VEG2SM

0.00

14.79

VEGSM

0.00

302

15.09

2.00

VEGlûM

0.00

13.68

100.00

194074.01

208

2571 9.72

I 7.03 1 4.25

VEG5ûM 1VEG100M

0.00 1 0.00

1 1 1

1.75

100.00 I

1.81

1 -97

631 -41

302

206.1 1

TMVEG25M

0.00

302

16.69

1.61

1 -44

1.13 i 0.86

1.20 1 0.67

1

AVGNOVl

0.01

0.00

100.00

302

10.48

1 1

302

27.59

1 -85

3.32

1 -38

1.24

302

30.04

0.78

-0.01

2.53

100.00

10.65

47.91

302

29.80

302

20.89

3.2.2.2 Spatial dependence: Moran's I

The notion that many of the socio-economic and biophysical variables are

spatially dependent should not be surprisiny, especially to geographers. For

exarnple, wealthier individuals will most likely live closer to other wealthy

individuais than to poorer. Such dependence on neighbouring values violates

the assumption of independence required for rnodels such as linear regression.

Therefore. it is a good to have a firm idea of the magnitude and the significance

of spatial autocorrelation. Unfortunately, different measures of spatial

autocorrelation yield different values for its magnitude as well as significance.

One of the most established methods to calculate spatial dependence is to

compute Moran's I (Eq. 5) which is similar to the Pearson correlation statistic

with the addition of a neighbourhood matrix, W. Thus, Moran's I and its close

relative Geary's C are very much influenced by the selection of the

neighbourhood matrix. It should be defined as tightly as possible and with the

correct weighting to accurately reflect the relationship between neighbours. The

neighbourhood matrix in this study is defined based on first order neighbours.

Moran's I has mean of -f/(n-1) when there is no association. A value near one

attests to very strong positive autocorrelation whereas a value near negative one

shows very strong negative autocorrelation. For socio-economic variables, it is

normal to observe correlation coefficients of 0.1 5-0.20 (Griffith, 1996). This

coefficient is useful only for summarising autocorrelation and is not suitable for

prediction.

where Wij is the neighbourhood matrix.

The autocorrelation coefficients for al1 of the variables indicate significant

association (Table 8). Proportion of single detached houses (SINGLE) and

TM25M exhibited the highest autocorrelations, 0.701 and 0.435 respectively.

b

Table 8: Moran's I autocorrelation values for selected variables. The neighbourhood matrix is defined based on first order adjacency. All coefficients are highly significant, SINGLE exhibiting the highest coefficient. For the VEG variable, the coefficients decrease but then rise as the resolution is coarsened from 2rn to 1 OOm.

Moran's 1 LNPOPDENS LNSINGLE

INDEX LNVEG2M LNVEGSM LNVEG10M LNVEG25M LNVEG50M LNVEG1 OOM LNTM25M LNAVGNDVI

3.2.2.3 Spatial dependence: Semi-variance

Another way of visualising and determining the covariance of values in

Coefficient 0.295 0.701

0.41 5 0.341 0.299 0.202 0.1 98 0.293 -

0.327 0.435 0.1 75

space is by the plotting of variograms. Empirical variograms display the change

Signif icance I

0.0000 0.0000

0.6600 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 O .O000 0.0000

of semivariance with distance. The theory behind semivariance is based on the

assumption that the variance calculated between two points at a fixed distance

will increase and reach a global limit at a certain range (see Atkinson and Lewis

(2000) for an excellent review). Beyond the range the values are said not to be

dependent on each other. Also, it is assumed that this covariance structure is

constant throughout the study area and does not change with direction.

In order to reduce the effect of outliers, the variogram is estimated using a

robust method (Cressie and Hawkins, 1980) (Eq. 6).

where N(h) is the set of al1 pairwise Euclidean distances i-j = hl (N(h)l is the

number of distinct pairs in N(h), and 4 and zj are data values at spatial locations

i and j, respectively.

The variograrns are well defined for al1 of the variables except for SINGLE

and INDEX, which have linearly increasing semi-variance plots (Fig. 15a-b).

The POPDENS, INCOME, UNIDEG, and IMMlPOP census variables exhibit

ranges (distance at which the plots level) in the 600m to 800m range. These

values are reasonable given the fact that at these distances neighbourhoods

share similar characteristics. The semi-variance plot for VEG2M seems to be

cyclic with 400m cycles. This type of variogram is sometimes observed for

urban areas where the pattern of land use repeats every city block (Gaydos,

1 992).

, 1 r 1 --7

OOOOS OOOOE OOOOL O

OOE 002 001 O

ewurcb

3.2.3 Multivariate statistics

3.2.3.1 Scatterplots

Although scatterplots are visuaf depictions of the relationship between

two variables rather than a quantitative assessment, they are presented in this

section because they are a good first approximation of bivariate reiationships.

The scatterplot matrix exhibits somewhat dense data clouds, however, clear

linear relationships can be discerned between a number of variables (Fig. 16). It

is not surprising that there are linear relationships amongst a number of socio-

economic variables. For example, there is a positive linear relationship between

INCOME, SINGLE, and UNIDEG. One would expect high-income families to

live in single-detached homes and possessing post-secondary education than to

live in apartments and have lower levels of education. There is also a positive

linear relationship between IMMIPOP and INDEX, explained by the fact that

recent immigrants live in areas of lower socio-economic conditions. On the

other hand, negative linear relationships can be observed between UNIDEW

INCOME and IMMIPOP/ INDEX.

In assessing the relationship between the biophysical and socio-

economic variables, clear associations can only be discerned between VEG and

SINGLE, and VEG and INCOME, both of them positive. The positive

relationship between VEG and SINGLE is evident even from the aerial

photograph (Fig. 1 ) where single-detached homes are found in vegetated areas.

A less apparent, negative relationship can be obseived between population

density and vegetation cover. Such a relationship can be expfained by the fact

that areas with high population densities are those which contain apartment

buildings which are found in areas with less vegetation cover than single-

detached houses.

The three dimensional bar graph in figure 17 gives a sense of how

vegetation cover is related to POPDENS and SINGLE in the study area. As

expected the highest percentage of vegetation cover is characteristic of €As with

a high proportion of SINGLE. However, there is a srnaller increase of TM2SM

values further down along the SINGLE axis indicating that there is not

necessarily a ctear linear relationship between the variables.

Figure 17: 3-0 bar graph displaying the relationship between SINGLE, POPDENS, and VEG. The increase in TM25M moderately low proportion of single-detached homes is interesting to observe. It could be a result of EAs with a high percentage of parks and public green-spaces.

1- LNPOPDENS

Figure 16: Scatterplot matrix of selected socio-economic and biophysical variables. VEG seems to be related to INCOME, SINGLE, and LNPOPDENS,

When the 94 EAs with zero entries for INCOME and two more outliers

located in Rosedale are removed from the database, the relationship between

vegetation cover and income is quite clearly Iinear.

Figure 18: Scatterplot of vegetation cover derived from 2m resolution aerial photograph and median family income. All 94 suppressed €As and two outliers are removed. The relationship is clearly linear.

3.2.3.2 Decile plots

An alternative way to plat the relationship between the biophysical and

socio-economic variables is to plot the average of a dependent variable for every

decile of the independent variable. Some extra insight is gained by such

analysis. For example, the negative association between VEG and POPDENS

is only evident from the median (5" decile) (Fig. 19 a-1). It should be noted that

the median for the VEG variable is 14.8; first and third quartiles are 7.0 and 25.4

respectively; the maximum is 75.7. The positive association between VEG and

SINGLE is also most apparent from the median up, and it is very similar to a

quadratic function. Likewise, the positive relationship between VEG and

INCOME is only evident from the 7" decile of the vegetation variable. There is

no clear linear association between VEG and UNIDEG and VEG and IMMIPOP.

Figure 19 (a-9: Decile plots showing averages of selected socio- economic variables for every decile of VEGZM. The linear associations observed on the scatterplots (Fig. 16) are apparent only in the upper half of the distribution for UNIDEG, POPDENS, INCOME, SINGLE, and IMMIPOP.

It is interesting to observe the cyclic relationship between VEG and INDEX.

Lows in the deprivation index are observed at both ends of the VEG distribution.

There is also a high at the 7m decile.

3.2.3.3 Pearson's correlation tables

As indicated by the scatterplots there are significant linear correlations

between a number of the socio-economic variables. The highest significant

negative correlations are obsewed between UNIDEG and INDEX (-0.68) and

SINGLE and POPDENS (-0.62) (Table 9) . These numbers are self-evident as

university graduates are more likely to be found in socially privileged areas than

in socially deprived areas. Also, higher proportions of single-detached houses

c m be found in areas with lower population density. The correlations between

the biophysical and socio-economic variables reflect the following trends.

Vegetation cover (derived from imagery at al1 resolutions) is highly correlated

with proportion of single houses (SINGLE) (-0.40 to -0.63) and population

density (0.32 to 0.68). The highest correlations are observed between

vegetation cover layer derived from the satellite imagery and SINGLE (0.68) and

POPDENS (-0.63). It appears that the strength of the correlation between VEG

(derived from the aerial photographs) and SINGLE/ POPDENS weakens as the

resolution in coarsened from 2m to 25d50m and then increases as the

resolution reaches 1 00m.

The correlations between INCOME and the other variables is presented

for the smaller database where those EAs with zero income and the two outlying

EAs of Rosedale have been removed (Table 10). Again the highest correlation

is between UNIDEG and INDEX (-0.73). Moderately high and highly significant

correlations are obsewed between INCOME and UNIDEG (0.66), SINGLE

(0.51 ), INDEX(-0.5)' VEG2M(0.46), and POPDENS (-0.43). Vegetation cover is

correlated highest with SINGLE (0.61 ), INCOME (0.46). and POPDENS (-0.41 ).

It is fair to conclude that most of the variables can be classified into two

categories. I NCOME, SINGLE, UNIDEG, and VEG show positive correlations

between each other whereas they are negatively associated with fMMlPOP and

I N DEX. Thus, based on these correlations and scatterplots high vegetation

cover appears to be a characteristic of well-to-do areas. Based on similar

results, Lo and Faber (1997) suggest the use of greeness as a desirable quality

and as a important determinant of quality of life. Of course, this does not at al1

imply that low-income people and recent immigrants prefer areas with less

vegetation, however, it is an indication that vegetation is a resource that is not

distributed equally along the urban landscape (Grove 1 996).

1 1 INCOME 1 SINGLE 1 IMMlPOP 1 UNIDEG 1 INDEX2 / VEGPMT 1 LNDENS 1

SINGLE 0.51 1 .O0 -0.21 0.21 -0.03 0.61 -0.50

IMMlf'OP -0.28 4.21 1 .O0 -0.43 0.32 -0.1 2 0.43

UNIDEG 0.66 0.21 4.43 1 .O0 -0.73 0.23 -0.19

INDEX2 4.50 1 -0.03 0.32 4.73 1 .O0 0.00 -0.07 VEG2MT 0.46 0.6 t -0.12 0.23 0.00 1 .O0 -0.41 LNDENS 4.43 4.50 0.43 4.19 -0.07 -0.41 1 .O0

Table 10: Pearson's correlation coefficients for a subset where the suppressed INCOME EAs are removed. Boldfaced values are significant at p=0.001. INCOME is significantly correlated to al1 other variables. VEG is positively correlated to INCOME, SINGLE, and UNIDEG and negatively to POPDENS.

IV

Co-processing of biophysical and socio-economic data using simple

statistical models

There are a large number of public, private, and academic interests,

including city planning offices, health organisations, and universities, who rely on

up-to-date socio-economic data (Jensen, 1996; Treitz et al., 1 992). However, the

high cost of door to door surveys implies that the national censuses, taken every

five years in Canada, are usually the rnost up-to-date sources of socio-economic

data. One way to alleviate this data shortage is to take advantage of up-to-date,

cost-effective, and scalable remotely sensed data and model socio-economic

characteristics based on documented relationships between biophysical and

socio-economic data. Many studies, including this one have, shown significant

relationships between biophysical data and socio-economic characteristics

(Grove, 1996; Ryznar, 1998, Lo and Faber, 1997).

Based on the exploratory work presented in Chapter 3, significant

correlations are present between vegetation cover and proportion of single-

detached homes and between vegetation cover and population densiiy.

Therefore, as a potential application for CO-processing biophysical with socio-

economic data, a conceptual model will be implemented whereby the two socio-

economic variables will be predicted from the vegetation cover variable.

This prediction will be tested with two statistical models, which hold great

potential in the rnodelling of urban phenornena. The first model presented is a

spatial linear model (SLM), followed by a regression tree (RT) model. The SLM

is ideal for modelling variables with a significant amount of spatial dependence

because it explicitly controls for the effect of spatial autocorrelation. As identified

in the exploratory data analysis, there is a significant leve! of spatial

autocorrelation (according to Moran's I statistic) in the variables selected for this

study, justifying the use of this model. However, not al1 of the relationships

plotted in the €DA section show a clear linear dependence and therefore the

SLM might not be appropriate always. Although the RT model is a piecewise

linear model, it is applicable to non-linear but monotonic relationships. These

two models will be compared with a model selecting strategy, which accounts for

model cornplexity in addition to goodness of fit. Finally, sections will follow on

finding the appropriate resolution for such modelling.

4.1 Spatial linear regression model (SLM)

Many mainstream scientific studies fail to account for the spatial

relationship between units of the same variable. However, it is hard to find

phenornena, which are not spatially dependent at some scale. This neglect for

spatial autocorrelation may lead to biased, more optimistic results because of the

extra strength, values near each other spatially impart to the regression (Griffith

and Can, 1996). Griffith (1996) estimates that when a variable exhibits a spatial

autocorrelation value of 0.25, the efficiency of the Ordinary Least Squared

estimator is zero.

Spatial modelling diffen from aspatial modelling by the inclusion of

neighbouring values into the regression equation. Therefore, the response

variable is not only a fundion of the predictors but of the neighbouring predictand

values as well (Fig. 2O).The way these neighbours are defined is a crucial step in

Figure 20: Spatial interaction of neighbouring units taken into consideration in a S M .

63

the modelling exercise (Haining, 1990). Griffith (1 996) suggests a conservative

matrix with as few neighbours as possible with appropriate weights. It is best to

experiment with different matrices. The neighbourhood chosen for this study is a

first order matrix with weights equal to one.

The other important specification in a spatial autoregressive model is the

type of covariance model used. The Conditional autoregressive (CAR)

covariance model is used as opposed to the Sirnultaneous autoregressive (SAR)

mode1 because the residuals from the SAR model are correfated with the

neighbouring data values, resulting in inconsistent least-squares parameter

estimateç, while the CAR model does not have this problem. The CAR model

requires symmetry, which is specified in the construction of the neighbourhood

matrix.

CAR: E = (I-~N)''D$ ;

where p and o are scalar parameters to be estimated, N is a weighted neighbour

matrix, and D is a diagonal matrix used to account for non-homogeneous

variance of the marginal distributions. For the CAR model, N is required to be

symmetrical, meaning that the effect of one neighbour on another should be

equal in magnitude both ways.

4.2 Regression Trees (RT)

Classification and Regression Tree (CART) is a relatively novel statistical

model (Breiman, et al. 1984). It can be very useful because it can take both

categorical and interval data. In the case of categorical data, it is called a

classification tree and in the case of ratio data, it is called a regression tree (RT).

RT rnodels fit a piecewise Iinear model to the data. They have not been widely

employed, even though, in many applications they are superior to linear models,

which can be very biased to non-tinear relationships. Trees take advantage of a

splitting procedure that partitions the data according to some predictor variables

so that the deviance of the response variable is minimised in the different

branches of the tree. The partitioning algorithm chooses only those predictor

variables, which can actually reduce the deviance of the model. Thus, a model

with a large number of variables is input into a tree, the CART acts as a forward

selecting regression. The leaves of a tree correspond to bins for the predictand

variable. That is there are as many unique predicted values as there are leaves.

For example, in a bivariate regression, 6.59 is assigned to the predictand when

the range for the predictor is 4.0 to 14.5, 3.45 when the range is 14.6 to 50.7, and

so on.

To reduce the complexity of the tree, the tree can be pruned back to

nodes beyond which the deviance is only minirnally reduced. The final tree size

in the case of the SINGLE model is 13 leaves (Fig. 21) and for the POPDENS

model seven leaves.

4.3 Residual diagnostics

The maps of the residuals of the models are presented along with the

common diagnostic plots such as histograms and QQ-plots of residual

distributions, and plots of observed values against the predicted values to reveal

the fit of the regression. In the case of the autoregressive mode1 (SLM) the

variation in the response variable is modelled with a linear component, a

neighbour covariance component (termed "signaln(Haining, 1 990, p.258). and a

residual variation component. Therefore, the predicted values are computed as

the fitted values from the linear component plus the signal.

Figure 21: a. This plot shows how the deviance decreases as the size of the tree is increased in a mode1 where SINGLE is predicted from TM25M. b. Pruned tree with 13 leaves. c. Pruned tree with five leaves. d. Pruned tree with two leaves.

CONCEPTUAL MODEL: LNSlNGLE - LNTM25M

The map of residuals from the SLM show a pattern where the €As around

the downtown core have positive residuals whereas the downtown EAs have

negative residuals which are larger in magnitude (Fig.22). Also some of the

highest residuals are observed in EAs with small areas. The qq-plot reveals that

towards the ends of the distribution, as well as in the centre, the fit is poor (Fig.

22). This lack of fit can be explained by the fact that for EAs where SINGLE is

very low, there is no linear relationship between it and VEG. The residuals of the

RT model show a somewhat better fit (Fig. 23). However, there are still a

number of EAs where the model fit is very poor. Comparing the observed and

predicted maps, it is evident that the RT model prediction looks much more like

the original distribution (Fig. 24).

CONCEPTUAL MODEL: LNPOPDENS - LNTM2SM

The SLM fit for the POPDENS model seems to be better than for the

SINGLE model. Again, the largest residuals are found in the downtown core,

however, those residuals outside of the core are mainly negative, as opposed to

positive residuals in the case of the SINGLE model. There is a high

concentration of positive residuals around the Cabbagetown area. These are

most likely apartment EAs with very high population densities. The residuals of

the RT model resernble the residual distribution of the SLM, also exhibiting a high

concentration of positive residuals in the Cabbagetown area. As with the

SINGLE prediction, the RT residuals are smaller in magnitude than those of the

SLM. However, a visual comparison of the observed versus predicted spatial

distributions reveals a closer resemblance by the SLM than by the RT model.

SLM - SINGLE

Quantiles of standard nomal

Figure 22: Residual diagnostic plots for the SLM predicting LNSINGLE from LNTM25M at the EA level. The residuals (both negative and positive) are higher in magnitude in the downtown core.

RT - SINGLE

Quantiies of standard normal

Figure 23: Residual diagnostic plots for the RT mode1 predicting LNSINGLE frorn LNTM25M at the €A levet. The fit is still poor towards the ends of the distribution, however, the residuals are smaller in magnitude in most EAs as compared to the SLM in Fig. 22.

(a.) - RT-Predided

SINGLE

O 1000 a1W) Men

(b.) - SLM - Predicted

Figure 24 (a-b): Predicted maps of LNSINGLE using the RT and SLM models. in cornparison to the observed distribution (Fig. 1 3b), the RT map more closely resembles the original pattern.

Observeci vs. predicted

Figure 25: Residual diagnostic plots for the SLM predicting LNPOPDENS from LNTM25M at the €A level. In comparison to the prediction of LNSINGLE, the fit is much better. Clumping of positive residuals in the Cabbagetown is evident. These are probably EAs with apartments.

- POPDENS

Observed a. predicted

---rm."

Figure 26: Residual diagnostic plots for the RT model predicting LNPOPDENS from LNTM25M at the EA level. The residuals are smaller in magnitude, however, the high concentration of positive residuals in Cabbagetown is still present.

(b.)- SLM - Predcted 900 O 900 1800 Meters -

Figure 27: Predicted maps of LNPOPDENS using the RT and SLM models.

4.4 Which model? - Model selection strategies

4.4.1 Akai ke's l nformation Criterion (AC)

The residual sum of squares has been the traditional way to compare

different statistical rnodels. However, the main limitation of this approach is that

it does not consider the size of the model. So, a model using 15 variables

showing a remarkably good fit may be selected as the better model than a more

restricted model. Thus, as Linhart and Zuchini (1 986) suggest. models should be

parsimonious and only include as many variables as can be reliabfy estimated.

Therefore, the discrepancy between a selected segmentation and the observed

phenomena, should not only be measured by the estimation component

(measuring the goodness of fit between observed and predicted) but the

approximation cornponent (measuring model complexity) (Csillag and Kabos,

1997). A simple way to express this duality of model discrepancy is through the

Akaike Information Criterion (AIC)(Eq.8) (Akaike, 1974). Model selection using

the AIC assures that complex models with too many variables will pay a penalty

for a decrease of the neg-loglikelihood. The equation looks simpls, however, it

has deep mathematical roots which are beyond the scope of this thesis.

AIC = 2 * neg-loglikelihood + 2 ' parameters ; (8)

To compare the results of the SLM and RT in a quantitative way the AIC is

computed using the neg-loglikelihood estimates of the models. The model with

the smallest AIC is deemed the best model. For predicting SINGLE, the SLM is

the better model at al1 resolutions whereas the RT is the better model for

estimating POPDENS at the coaner resolutions of the aerial photographs and

the satellite image derived VEG layers (Table 11). The lowest AIC values are

observed when the vegetation layer is derived from the satellite imagery. There

seems to be a change in AIC values with coarsening resolution, however, this

change is not predictable. In the case of the StM-LNSINGLE model, the AIC

increases and decreases as the resolution is coarsened from 2m to 100m

resolution. This pattern is reversed in the case of the CART-LNPOPDENS

model.

AIC SLM CART Resolution (m) LNSINGLE 1 LNPOPDENS LNSINGLE 1 LNPOPDENS

Table 11: AIC values for various model runs. According to these values, the RT outperforms the SLM in predicting LNPOPDENS, however, the SLM is better at predicting LNSINGLE. THE lowest AIC values are observed when predicting from the TM25M vegetation variable. In this case the RT is superior for both LNSINGLE and LNPOPDENS.

The change in AIC is also assessed as the complexity of the tree is

increased from a basic two branch tree îo the full sized tree. The AIC decreases

much like the deviance plot (Fig. 29), however, not as abruptiy.

Figure 28: AIC as a function of the number of terminal nodes. For best prediction the tree should be pruned at around 10 nodes because beyond this point the AIC is only marginally improved.

4.5 Finding the appropriate resolution

A pressing issue in geography, as well as most other disciplines studying

spatially distributed phenomena, is the question of scale. What happens to a

relationship when it is studied at various levels of aggregation? Is there an

optimal scale at which certain processes operate? Can relationships from one

level be used to explain relationships at another level? These questions fit under

the umbrella of the Modifiable Unit Area Problem (MAUP) which has two

important components.

The first component deals with the scale issue that is the change in

strength and significance of relationships when going from one level of

aggregation (scale) to another. Some of the first researchers to study this

aggregation problem were Gehlke and Biehl (1934) who realised that the

magnitude of the correlation they were studying increased dramatically when

they aggregated their data. Later, Robinson (1 954) showed mathematically what

can happen when correlations are computed at different levels of aggregation.

Since then, many papers in geography as well as fields such as epidemiology

have addressed this issue (Fotheringham and Wong, 1991 ;Openshaw, 1984;

Amrhein, 1995). The aggregation problem is also analogous to the term labelled

ecological bias. If one transplants a determined relationship from one scale to

another scale an ecological fallacy is most likely to occur since in most cases the

joint distributions of variables changes when going from one scale to the other.

The severity of this ecological fallacy depends on the methods of analysis

(Openshaw, 1984) and it could be entirely an artefact of violated assumptions

and misspecified models (Amrhein, 1995). Nonetheless. it is very tempting to

draw individual level associations from group level research. Under some

circumstances, it might be permissible. For instance, when the groups are small

enough, with homogeneous populations, the ecological bias might not be that

high. Ecological bias is severe if the variance of the underlying population is

high, then it is obvious that there is little chance that an association determined at

the group level can be attributed to an individual. Ecological bias might be

somewhat balanced by 'regression-dilution' bias (random errors in measuring the

modelled variables) which may be greater in individuals than for populations

(English, 1998). Due to the heterogeneous nature of the study area in terms of

its human population, the ecological bias is probably severe. Therefore,

individual level associations should not be inferred from this research.

The second aspect of MAUP research is the aggregation effect. This

effect is manifest when partitioned data at one scale are repartitioned using a

different set of regions. Openshaw and Taylor (1 979) has shown that it is

possible to obtain a wide range of correlations from the same dataset, by

partitioning the data differently. Unfortunately, the partitions of socio-economic

investigations are firmly set by the data collecting agencies who set boundaries

that are in most cases arbitrary and do not follow any ecological or socio-

economic boundaries (Openshaw, 1 984; Bracken, 1 994).

4.5.1 Census Tract Level

The exploration of the relationship between socio-economic and

biophysical characteristics has been so far carried out at the €A level of census

geography. This is the lowest level of census collection and with the smallest

fevel of ecological bias of individual relationships. However, for some lines of

research group level analysis is more appropriate (Logan and Molotch, 1 987).

For this reason, it might be useful to assess the relationship between socio-

economic and biophysical characteristics at another scale. Logically, the next

scale above the EA level would be at the census tract (CT) level. These two

levels occupy the bottom of a nested hierarchy of collection areas.

Many researchers have observed that as data are aggregated, the

strength of the association between variables increases (Clark and Avery, 1976).

This is also the case for the variables under study. The scatterplots reveal linear

associations between al1 variables except for IMMl and VEG (Fig. 29). The

Pearson correlation coefficients increase with aggregation (Table 11). This

stronger association is perhaps due to the fewer degrees of freedom.

Table 1 1 : Pearson's correlation coefficients for selected socio-economic and biophysical variables at the CT level. Boldfaced numbers indicate significance at p=0.001. The coefficients are higher for most variables than at the EA level.

-0.38 0.78

LNINCOME LNVEG2M

The results of the modelling indicate that despite stronger linear

-0-63 0.54

L

relationships, there are still a few very high residuals in some of the CTs. A

better fit is obsewed with the RT model than the SLM rnodel (Fig. 30-33).

LNPOPDENS LNSINGLE

-0.63 -0.38

Figure 29: Scatterplot matrix for selected variables at the CT level. Linear relationships are apparent between VEG and INCOME, VEG and SCHOOL (UNIDEG), VEG and SINGLE, and VEG and POPDENS. There are also other linear relationships between the socio-economic variables.

1 -00 -0.43

0.54 0.78

-0.43 1 -00

-0.65 -0.08

0.69 -0.35

-0.57 0.29

0.48 1 .O0

1

0.88 0.1 5

1.00 0.48

Quantiles of standard noma[

Figure 30: Residual diagnostic plots for the SLM predicting LNSINGLE from LNVEGZM at the CT level. The residuals are skewed to the left indicating a few very high negative residuals, which can be seen on the map.

CART - SINGLE

fi Observed vs. predicted

Quantiles of standard normal

Figure 31 : Residual diagnostic plots for the RT model predicting LNSINGLE from LNVEGZM at the CT level. Residuals are skewed to the left and they are higher near the lake.

(a .) - SLM - Pl

SINGLE

900 O 900 1800 Wtew - (b.) - CART - PREOlCTED

Figure 32: Predicted maps of LNSINGLE using the RT and SLM models at the CT level. (see fig. 33 for cornparison).

Figure 33: Observed map of LNSINGLE at the CT ievel.

4.5.2 Non-spatial scaling of socio-economic variables to a regular grid

As pointed out at the beginning of section 4.5 one of the big limitations of

census related research is the lack of data at scales finer than the €A level. A

possible solution to this lack of scalability of socio-economic data is the use of

biophysical data for scaling. It is evident from Chapter Two that biophysical data

obtained from remotely sensed images are readily scalable (from 0.5m to 1 km

resolutions). One possible way to interpolate socio-economic data to a grid is to

use the relationship determined at the €A level and predict the socio-economic

variable at a scale where there is biophysical data available. The results of such

a prediction are shown in Figure 34 (a-d). A tree model is specified at the EA

level for SINGLE and VEG. This tree is then extrapolated to a 100 m grid for

which VEG is available. The effect of the size of the tree is also assessed. The

results of the gridded layers can be compared to the EA level map layer (Fig. 35).

Of course this kind of interpolation is subject to ecologic bias. The

relationship determined at the EA level rnight not be appropriate at the 100m

level.

b. 2 nodes a. 5 nodes

d. 21 nodes

Figure 34: SINGLE scaled to a 100m grid using a tree regression determined at the EA level and then extrapolated to a 1 OOm grid for which VEG is catculated. Each of the images displayed here are created with different sizes of trees. The lighter pink and blue indicate high values of SINGLE.

Allvar3-region .sh p 0.01 0.01 - 4.66 4.66 - 10.88 10.88 - 20.46 20.46 - 86.58 Suppressed

Figure 35: Observed SINGLE for the scaled area at the EA level. As opposed to the grids in fig. 34, dark colours indicate high SINGLE values.

Conclusion

This final chapter will offer a review of the findings and discuss the

potential for the CO-processing of biophysical and socio-economic data in an

urban context. The integration of the social and physical sciences is at the

forefront of many research initiatives in North America. 60th sides have a lot to

offer to the other and many studies have already taken advantage of the wealth

of data and techniques offered by interdisciplinary research . The nature of this study is also interdisciplinary, utilising the tools of

geographic information systems. remote sensing, and statistics. It assesses the

feasibility of deriving land-cover characteristics from scalable remotely sensed

data and explores their relationships with some socio-economic characteristics.

There is indication in the literature that biophysical variables such as vegetation

cover have significant associations with quality of life indicators such as income,

household characteristics, and education status (Grove, 1996; Lo and Faber,

1997). While the nature of these relationships Is in question (causality versus

spu rious correlations), the advantage of frequently obtained, cost-effective, fine

resolution data derived from remote sensing to characterise the biophysical

environment is worthwhile to be investigated.

The area selected in this study is highly heterogeneous in terms of the

social environment. Some of the richest and poorest people of Canada inhabit

th is area, creating very sharp boundaries between the different socio-economic

g roups. This study explores the relationships between socio-economic and

biophysical variables for this very complex area.

Vegetation cover is derived from both aerial photography and satellite

imagery at various resolutions using supervised and unsupervised classification.

For the finer resolution aerial photography, the main obstacle in accurate

classification is the presence of shadows, which are spectrally very similar to

vegetation. At coarser resolutions, the mixed pixel effect is the limiting factor in

accurate classification. Despite its coaner resolution, the accuracy of the

vegetation cover layer derived from the satellite imagery (-75%) is comparable to

that of the accuracy of the vegetation layer derived from the finer resolution aerial

photography (-80%). The implication of this result is that much time and effort

can be saved by utilising satellite images due to ease of processing as opposed

to aerial photographs. The vegetation cover layen are al1 aggregated to the EA

level where each EA is assigned a value of percent cover.

The exploration of the relationship between vegetation amount and

selected socio-economic variables such as population density, proportion of

u niversity graduates, etc., reveals that vegetation amount is significantly

associated with population density, median farnily income, and proportion of

single-detached homes (see Fig. 19 (b,c,d) for the clear monotone relationships

exhibited on the decile plots). For these monotone relationships, quite high

Pearson's correlation coefficients are observed (-0.41 between vegetation cover

and population density; 0.46 between vegetation cover and median family

income; 0.61 between vegetation cover and proportion of single homes),

indicatirig tha'r the relationships are perhaps linear.

These findings are theri the basis of prediction of two of the socio-

eco nom ic characteristics. Two statistical models, a spatial linear model and a

tree regression model, are implemented. The spatial linear model is chosen

because it explicitly deals with the effect of spatial autocorrelation and the tree

regression model is selected because it is well suited for non-linear monotone

relationships. The residual diagnostics plots indicate a better fit of the regression

tree model results, however, quantitative assessment, using the Akaike

Information Criterion, indicates that the spatial linear model is in sorne instances

superior. The prediction is repeated at the census tract level with slightly better

results. The non-spatial interpolation of one of the socio-economic variables to a

lOOm resolution grid using tree regression serves as a potential solution to the

scaling of census data.

Some of the conclusions of this study are the following. First, it

demonstrates the feasibility of deriving vegetation amount and pattern from fine

resolution imagery. This result alone can aid in specific studies concerning the

distribution of urban vegetation. Rowntree and Nowak (1991) point out that

inventories of vegetation (trees) located on both private and public land are

lacking in most North American cities. Yet, the accurate knowledge of the

distribution of vegetation can be important for studies of carbon sequestration, for

models of hydrology, microclimates (trees are excellent at reducing the heat

islands created by buildings (Oke and Roth, 1989)) and air quality. Furthermore,

accurate knowledge of vegetation cover, as well as other urban land uses and

land covers, is much in need for city planners and those modelling the quality of

life. In fact, Lo and Faber (1997) advocate the use of biophysical variables as

measures of quality of Iife since these data provide the rneans of better relating

environmental quality to social quality. Ryznar (1998) goes as far as to Say that

"it may be possible to eliminate the use of census data altogether and rely on the

satellite images to detenine neighbourhood statusn. Of course, relying on only

environmental data may be too premature, as the exact nature of the

relationships is still unclear.

Second, there are clear monotone relationships between vegetation cover

and a number of socio-economic variables including population density and

income. This finding reinforces the works of Grove (1 996), Lo and Faber (1 997).

and Weber and Hirsch (1992) who have also found such relationships. The

exact nature of the relationships is not clear, however they are al1 monotonic.

The resultant relationships are most likely vety sensitive to scale as the census

tract level analysis indicates.

The third major conclusion is that vegetation cover derived from rernotely

sensed images and then aggregated to the EA level can be used to model

scarce socio-economic data. Although the implementation of the statistical

models is rather simple, it serves as a good template for further prediction. With

more accurate characterisation of the relationships, such predictions could be

used to update databases of socio-economic variables and allow for the scaling

of socio-economic data.

Further research must ascertain whether more precise relationships can

be found between socio-economic and biophysical variables. One potential way

of better characterising the relationship between biophysical and socio-economic

variables is to partition the study area into homogenous units in which there are

more definite linear or exponential associations present. lnstead of finding al1

encompassing relationships, many, locally specific relationships should be

sought. For example, the distinct regions identified in socio-economic feature

space on the 3-D plot in figure 17 could be used as a basis for partitioning the

urban landscape (Fig. 36).

egionl egion2 est

Figure 36: The spatial distribution of high vegetation amount area in socio-economic feature space located in figure 17. Region 1 corresponds to the smaller distinct region in fig. 17 and Region 2 corresponds to the larger one. The partitioning of feature space based on such three dimensional plotting technique could be a possible way of finding areas where there are clear relationships between biophysical and socio- economic variables.

The goal is to identify regions in feature space, which correspond to

geographically contiguous areas (Region 2 could be identified as such a region).

As urban ecosystems become the primary focus of both social and

physical scientists, the repeated assessment of biophysical environment and

understanding its relationship with complex socio-economic variables will be

essential for the successful management cf urban areas.

References

Akaike, H. 1974. A new look at statistical model identification. 1.E.E.E. Transactions on Automatic Control. AU- 1 9, 71 6-722.

Amrhein, C.G. 1995. Searching for the elusive aggregation effect: evidence from statistical simulations. Environment and Planning A. 27: 105-1 19.

Anderson, J.R., E. Hardy, J. Roach, and R. Witmer. 1976. A land use and land cover classification system for use with remote sensor data, U.S. Geoloclical Survev Professional Pa~er 964.

Arai, K., 1993. A classification method with a spatial-spectral variability. international Journal of Remote sensing. 14: 699-709.

Atkinson, P.M. and P. Lewis. 2000. Geostatistical Classification for Remote Sensing: An Introduction. Cornputers & Geosciences. 26: 361 -371 .

Bailey, T.C. and A.C. Gatrell. 1995. Interactive Spatial Data Analvsis. Longman Scientific & Technical, New York. pp. 41 3.

Baker, W., and Y. Cai. 1992. The r.le programs for multiscale anaiysis of landscape structure using the GRASS geographical information system. Landscape ecology. 7: 291 -302.

Benson, B.J. and M.D. MacKenzie. 1995. Effects of sensor spatial resolution on landscape structure parameters. Landscape Ecology. 10: 1 13-1 20.

Breiman, L., J. H. Friedman, R.A. Olshen, and C.J. Stone. 1984. Classification and Resression Trees. Monterey: Wadsworth and Brooks/Cole.

Burchfield, M.L. 2000. Characterisina Scale-De~endent Chanaes in Spatial Pattern of an Urban Land Cover Classification and its Effects on a Chemical Fate and Trans~ort Model. M.Sc. Dissertation. University of Toronto.

Campbell, J.B. 1981. Spatial Correlation Effects upon Accuracy of Supervised Classification of Land Cover. Photo. Eng. & Rem. Sen. 47: 355-363.

Campbell, J.B. 1996. Introduction to Remote Sensinq. The Guilford Press, New York.

Cao, C. and N.S. Lam. 1997. Understanding the Scale and Resolution Effects in Remote Sensing and GIS. In: Quattrochi, D.A. and M.F. Goodchild (eds.). Scale in Remote Sensina and GIS. Lewis Publishers, New York. pp. 57- 72.

Carstairs, V. and R. Morris. 1989. Deprivation: explaining differences in mortality between Scotland and England. British Medical Journal. 299: 886-9.

Chica-Olmo, M. and F. Abarca-Hernandez. 2000. Computing geostatistical imzge texture for remotely sensed data classification. Cornputers 8 Geosciences. 26: 373-383.

Congalton, R.G. 1988. Using Spatial Autocorrelation Analysis to Explore the Errors in Maps Generated from Remotely Sensed Data. Photo. Eng. 8 Rem. Sen. 54: 587-592.

Congalton, R.G. and R.G. Odenwald, and R.A. Mead. 1983. Assessing Landsat Classification Accuracy Using Discrete Multivariate Analysis Statistical Techniques. Photogrammetric Engineering and Remote Sensing. 49: 1671 -1 678.

Cressie, N. and D.M. Hawkins. 1980. Robust estimation of the variogram: 1. Mathematical Geology, 1 2: 1 1 5-1 25.

Csillag, F. and S. Kabos. 1997. How many regions? Toward a definition of reg ionalisation eff iciency. AutoCarto 1 3. Seattle, Washington.

Eckenwalder, J.E. 1999. Vascular Plants. In: Roots, BI., D.A. Chant and C.E. Heidenreich. (eds.). Soecial Places: The Chanaina Ecosvstems of the Toronto Reqion. UBC Press, Toronto. pp. 342.

Fernandes, R.A., C.G. Amrhein, and LE. Band. Identification of Aggregated Demographic Clusters within the Baltimore Ecosystem Study Long Term Ecological Research Site. In Press.

Foresman, T.W., S.T.A. Pickett, and W.C. Zipperer. 1997. Methods for Spatial and Temporal Land Use and Land Cover Assessrnent for Urban Ecosysterns and Application in the Greater Baltimore-Chesapeake region. U b a n Ecosystems. 1 : 20 1 -2 1 6.

Fotheringham, A.S. and D.W.S. Wong. 1991. The modifiable areal unit problem in multivariate statistical analysis. Environment and Planning A. 23: 1025- 1 044.

Frohlich, N. and C. Mustard. 1996. A Regional Comparison of Socioeconomic and Health Indices in a Canadian Province. Soc. Sci. Med.. 42: 1273- 1281.

Gaydos, L.J. 1991. Scale-de~endent Measurement of Veaetation - in Urban Areas. Ph. D. Dissertation. University of California, Santa Barbara.

Gehlke, C. E. and K. Biehl. 1934. Certain effects of grouping upon size of the correlation coefficient in census tract material. Journal of the Amen'can Statistical Association Supplement. 29: 1 69- 1 70.

Geoghegan, J., L. Pritchard (Jr.), Y-Ogneva-Hirnmelberger, R.R. Chowdhury, S. Sanderson, and B.L. Turner 11. 1998. "Socializing the Pixeln and "Pixelizing the Social" in Land-Use and Land-Cover Change. In: Liverman, D., E. F. Moran, R.R. Rindfuss, and P.C. Stern (eds.) Peode and Pixels: Linkinq Remote Sensinci and Social Science. pp. 51 -69.

Geoghegan, J., L.A. Wainger, and N.E. Bockstael. 1997. Spatial landscape indices in a hedonic framework: an ecological economics analysis using GIS. Ecological Economics. 23: 251 -264.

Gilthorpe, S.M. 1995. The importance of normalisation in the construction of deprivation indices. Journal of Epidemiology & Community Health. 49: S45-S50.

Gong, P. and P. J. Howarth. 1990a. Land-Cover Classification Accuracies at the Rural- U rban Fri nge. Photogrammetric Engineering and Remote Sensing. 56: 67-73.

Gong, P. and P.J. Howarth. 1990b. An Assessrnent of Some Factors fnfluencing Multispectral Land-Cover Classification. Photo. ~ n g . & Rem. Sen. 56: 597-603.

Gong, P., and P.J. Howarth. 1992. Land-use classification of SPOT HRV data using a cover-f requency method. international Journal of Remote sensing. 13: 1459-1471.

Graniero, P.A. 1996. Fastcon program to calculate a contrast measure. University of Toronto.

G raniero, P. A. 1 999. Suburban Land Use Classification Using Texture Analysis: High Resolution is the Key. Proceedings from the 1999 ASPUS Annual Con ference. 1 4: 840-848.

Green, K., D. Kempka, and L. Lackey. 1994. Using Remote Sensing to Detect and Monitor Land-Cover and Land-Use Change. Photo. Eng. & Rem. Sen. 60: 331-337.

Griffith D.A. and A.Can. 1996. Spatial statistical/Econometric versions of simple urban population density models. In: Arlinghaus, S.L.(ed.). Practical Handbook of Soatial Statistics. pp. 231 -250.

Griffith D.A.1996. Some guidelines for specifying the geographic weights matrix contained in spatial statistical models. In: Arlinghaus, S.L.(ed.). Practical Handbook of Spatial Statistics. pp. 65-82.

G rove, J. M. 1 996. The Relat ionshi~ between Patterns of Processes of Social Stratification and Veaetation of an Urban-Rural Watershed. Ph.D. Dissertation, Yale University.

Gustafson, E. J. and G.R. Parker. 1 992. Relationships between landcover proportion and indices of landscape spatial pattern. Landscape Ecology. 7: 101-110.

Haack, B., N. Bryant, and S. Adams. 1987. An Assessrnent of Landsat MSS and TM Data for Urban and Near-Urban Land-Cover Digital Classification. Remote Sensing of Environment. 21 : 20 1 -2 1 3.

Haining, R. 1990. Spatial data analvsis in the social and environmental sciences. Cambridge University Press, Cambridge.

Haralick, R.M. 1979. Statistical and Structural Approaches to Texture. Proceedings of the IEEE. 67: 786-804.

Haralick, R.M., K.Shanrnugam, and 1. Dinstein. 1973. Textural Features for l mage Classification. !EEE Transactions or: Systems, Man, and Cybernetics. 3: 6 1 0-621 .

Hixson, M., D. Scholz, and N. Fuhs. 1980. Evaluation of Several Schemes for Classification of Remotely Sensed Data. Photo. Eng. & Rem. Sen. 46: 1 547-1 553.

Hodgson, M.E. 1998. What Size Window for Image Classification? A Cognitive Perspective. Photogrammetric Engineering & Remote Sensing. 64: 797- 807.

Hudak, A. T. and C. A. Wessman. 1998. Textural Analysis of Historical Aerial Photography to Characterize Woody Plant Encroachment in South African Savanna. Remote Sensing of Environment. 66: 31 7-330.

Jensen, J. R. 1996. lntroductorv Diqital lmaae Processina A Rernote Sensinq Perspective. Prentice Hall, New Jersey.

Langford M. and D.J. Unwin. 1994. Generating and mapping population density surfaces within a geographical information system. The Cartographie Journal. 31 : June.

Linhart, H. and W.Zucchini. 1986. Model Selection. Wiley, New York. pp. 301.

Lo, C. P.. 1995. Automated Population and Dwelling Unit Estimation from High- Resolution Satellite Images: a GIS Approach. International Journal of Remote Sensing. 16: 1 7-34.

Lo, C.P. and B..!. Faber. 1997. lntegration of Landsat Thematic Mapper and Census Data for Quality of Life Assessrnent. Remote Sensing of Environment. 62: 143-1 57.

Logan, J.R. and H.L. Molotch. 1987. Urban fortunes: the political economv of place. University of California Press, Los Angeles.

Machlis, G. E., J. E. Force, and W. R. Burch. 1997. The Human Ecosystem Part 1: The Human Ecosystem as an Organizing Concept in Ecosystem Management. Society & Natural Resources. 10: 347-367.

Martin, L.R.G. and P.J. Howarth. 1989. Change-Detection Accuracy Assessrnent Using SPOT Multispectral lmagery of the Rural-Urban Fringe. Remote Sensing of Environment. 30: 55-66.

Mays, N. and S. Chinn. 1989. Relation Between All Cause Standardised Mortality Ratios and Two Indices of Deprivation at Regional and District Level in England. Journal of Epidemiology and Community Health. 43: 191 -1 99.

McDonnell, M.J. and S.T.A. Pickett. 1990. Ecosystem Structure and Function Along Urban-Rural Gradients: An Unexploited Opportunity for Ecology. ECO~O~Y. 7: 1 232-1 237.

Mesev, V. 1998. The Use of Census Data in Urban Image Classification. P hotogrammetric Engineering & Remote Sensing. 64: 43 1 -438.

Miranda, F. P., L. E. N. Fonseca, and J. R. Carr. 1998. Semivariogram Textural Classification of JERS-1 (Fuyo-l ) SAR Data Obtained over a Flooded Area of the Amazon Rainforest. International Journal of Remote Sensing. 19: 549-556.

Mulroney, C. 2000. Photos from space help find ideal sites. The National Post. B14, July 11.

Munn, R.E., M. Thomas, and D. Yap. 1999. Climate. In: Roots, B.I., D.A. Chant and C.E. Heidenreich. (eds.). Special Places: The Chanaina - Ecosvstems of the Toronto Reaion. UBC Press, Toronto. pp. 342.

Nowak, D.J. 1994. Urban Forest Structure: The State of Chicago's Urban Forest. In: McPherson, E.G., D.J. Nowak, RA. Rowntree (eds.). Chicaao's Urban Forest Ecosvstem: Results of the Chicaao Urban Forest Climate Proiect, US. Department of Agriculture, Forest Service. pp. 3-1 8.

Oke, T.R. 1987. Boundaw Laver Climates. Routledge, New York. pp. 435.

Oke, T.R., and M. Roth. 1989. Satellite-derived urban heat islands from three coastal cities and the utilization of such data in urban climatology. Remote sensing of Environment. 1 0: 1 699- 1 720.

Openshaw, S. 1984. Ecological fallacies and the analysis of areal census data. Environment & Planning A. 16: 1 7-3 1 .

Openshaw, S. and P.J. Taylor. 1979. A million or so correlation coefficients: three experiments on the Modifiable Areal Unit Problem. In Wrigley N.(ed.). Statistical Ap~lications in the Spatial Sciences. Pion, London.pp. 127-1 44.

Pesaresi, M. 2000. Texture Analysis for Urban Pattern Recognition Using Fine- resolution Panchromatic Satellite Imagery. Geographical & Environmental Modelling. 4: 43-63.

Pete:son,D. M. Spanner, S. Running, and K. Teuber. 1987. Relationship of Thematic Mapper Simulator data to Leaf Area Index of Temperate Con iferous Forests. Remote Sensing of Environment. 22: 323-4 1 .

Pickett, S.T.A., W.R. Burch(Jr.), S.E. Dalton, T.W. Foresman, J.M. Grove, R. Rowntree. 1997. A Conceptual Framework for the Study of Human Ecosystems in Urban Areas. Urban Ecosystems, 1 : 1 85-1 99.

Riitters, K.H., R.V. O'Neill, C.T. Hunsaker, J.D. Wickham, D.H. Yankee, S.P. Timmins, K.B. Jones, and B.L. Jackson. 1995. A factor analysis of landscape pattern and structure metrics. Landscape Ecology. 10: 23-39.

Robinson, W.S. 1954. Ecological Correlations and the Behaviour of Individuals. American Statistics Association. 30: 51 7-536.

Rowntree, R A . 1984. Ecology of the Urban Forest-Introduction to Part 1. Urban ECO~O~Y. 8: 1-1 1.

Rowntree, R A . , and D.J. Nowak. 1991. Quantifying the Role of Urban Forests in Removing Atmospheric Carbon Dioxide. Journal of Arboriculture. 17: 269- 275.

Ryznar, R.M. 1998. Urban Veaetation and Social Chanae: An Analvsis Usinq Remote Sensina and Census Data. Ph.0. Dissertation. The University of Michigan.

Treitz, P.M., P.J. Howarth, and P. Gong. 1992. Application of Satellite and GIS Technologies for Land-Cover and Land-Use Mapping at the Rural-Urban Fringe: A Case Study. Photo. Eng. & Rem. Sen. 58: 439-448.

Triathlon Mapping Corporation . 1995. Digital Colour Aerial Photographs at 1 :20,000.

Weber, C. and J. Hirsch. 1992. Some urban measurements from SPOT data: urban life quality indices. International Journal of Remote sensing. 13: 325 1 -3261.

Westgate, L A . , P.H. von Bitter, N. Eyles, J. H. McAndrews, V. Timmer, and K.W.F. Howard. 1999. The Physical Setting: A Story of Changing Environments through Time. In: Roots, B.I., D.A. Chant and C.E. Heidenreich. (eds.). Special Places: The Chanaina Ecosvstems of the Toronto Re~ion. UBC Press, Toronto. pp. 342.

Wood, C.H. and D. Skole. 1998. Linking Satellite, Census, and Suwey Data to Study Deforestation in the Brazilian Ammon. In Liverman, D., E. f. Moran, R.R. Rindfuss, and P.C. Stern (eds.) People and Pixels: Linkina Remote Sensinq and Social Science. pp. 70-93.

Woodcock, C. E., A. H. Strahler, and D. L. Jupp. 1988. The Use of Variograms in Remote Sensing: 1. Scene Models and Simulated Imagss. Remote Sensing of Environment. 25: 323-348.

Zimmerman, A. 1999. Ecology, Ecosysterns, and the Greater Toronto Region. In: Roots, B.I., D.A. Chant and C.E. Heidenreich. (eds.). Special Places: The Chanaina Ecosvstems of the Toronto Reaion. UBC Press, Toronto. pp. 342.

Appendix 1 : Statistical analysis using S-PLUS and S-PLUS-SpatialStats

Expioratory Data Analysis (EDA)

Classification and Regression Trees (CART)

#pick the lowest deviance ea.p.tree<-prune.tree(ea.tree1k=708) ea,tree.predict<-predict(ea.p.tree) #count number of leaves (=LEAF)

eatrsec-tree(S ING LE-VEG2M) summary(ea.p.tree) ea.p.tree AIC1 summary(ea.tree.predict)

par(mfrow=c(3,2)) plot(ea.p.tree) text(ea.p.tree) title("S1NGLE-VEG2M. k=708") plot(prune.tree(ea.tree)) title("Reduction of Deviance") image(interp(X,Y,predict(ea.p.tree))) title("CART prediction for SINGLE") ea.tree.residc-(SINGLE-ea.tree.predict) image(interp(X,Y,ea.tree.resid)) title("CART residuals for SINGLE") hist(ea.tree.resid) title("Histogram of CART residualsu) qqnorm(ea.tree.resid) qqline(ea.tree.resid) title("QC1-plot of CART residuals")

Spatial Linear Regression (SLM)

Xbuild neighborhood matrix ea.place<-cbind(X,Y) ea-quadc-quad.tree(eâ.place) ea.nhbr~-find.neighbor(x=ea.place,quadtree=ea.quad,m~.dist=400) ea.n hbrc-ea,nhbr[ea,nhbr[,3]!=0,] ea-snh brc-spatial.neighbor(row.id=ea.nhbr[l ],col.id=ea.nhbr[,2],symmetric=TRUE)

#get rid of extra pairs from neigborhood matrix created by Arcview ea2.adjust.snhbr~-spatial.condense(ea.adjust.snhbrlsymmet~=T)

#Check for islands check.islands(ea.snhbr,remap=F)

Spatial linear model ea.slmc-slm(log(SINGLE)- log(VEG2M),cov.family=CAR,spatial.arglist=list(neighbor=ea.snhbr))

summary(ea.slm) Irt(ea.slm ,pararneters=O) AIC

XVisuals for slm par(mfrow=c(3,2)) image(interp(X,Y,ea.pred)) titie("SLM prediction for log(SlNGLE)") image(interp(X,Y,ea.slm$residuals)) title("SLM residuals for log(S1NGLE)") hist(ea.slmSresiduals) title("Histogram of SLM residuals") qqnorm(ea.slmSresiduals) qqline(ea.slmSresiduals) title("QQ-plot of SLM residuals") plot(ea.slmSfitted,ea.slmSresiduals) abline(h=O) title("Residuals vs. fitted values") ea.signal.slrn c-log(S1NGLE)-ea.slm$fitted-ea.slm$residuals plot(log(SINGLE),fitted(ea.slm)+ea.signal.slm) abline(0.1)

#Histogram with various options hist(SINGLE,nclass=l O,xlab=aSINGLE",main=wEAw,xlirn~(O,30)lylim=c(O,1 15))

#Makes 30 perspective piot persp(interp(X,Y,LAKE),zlab="SlNG LE",eye=c(632000,4830000,5000))

#Partitions data geographically par(mfrow=c(2,2)) eda.treec-tree(S1NGLE-X+Ylmindev=.OO1 ) plot(prune.tree(eda.tree)) eda2.treec-prune. tree(eda.tree, best=l O) plot(eda2.tree,type="uU) text(eda2.tree.srt=90) par(pty="s0 j partition.tree(eda2.tree) points(X,Y)

#To select outliers plot(fitted(ea.lm),residuaIs(ea.lm)) outIiers~-identify(fitted(ea.lm),residuals~ea.lm),labels=EA.ID) outliers ea.fit2.lmc-lm(Y-X,subset=-outliers)

derivation of biophysical variables from fine re for … · derivation of biophysical variables...

Documents