the use of agent-based models in regional science by mark kimura

THE USE OF AGENT-BASED MODELS IN REGIONAL SCIENCE

A Dissertation

Presented to the Faculty of the Graduate School

of Cornell University

in Partial Fulfillment of the Requirements for the Degree of

Doctor of Philosophy

by

Manabu “Mark” Kimura

May 2002

TABLE OF CONTENTS

CHAPTER ONE: INTRODUCTION

Use of Computers in Regional Science

Why Simulation?

Role of Randomness

Agent-Based Models

Spatial Issues

CHAPTER TWO: BACKGROUND

CHAPTER THREE: INTRODUCTION TO AGENT-BASED MODELS

A Highly Simplified Model

A More Realistic Simplified Model

CHAPTER FOUR: GENERALIZED SPATIAL AGGLOMERATION MODEL

Properties of Location—Q-Vector

Decision Making Process in Migration

CHAPTER FIVE: APPLICATIONS OF THE GENERALIZED SPATIAL AGGLOMERATION MODEL

CHAPTER SIX: APPLICATION #1: A SIMPLE MODEL

Specifications

Simulation Results

Discussion

CHAPTER SEVEN: APPLICATION #2: RESOURCE DEPLETION AND POLLUTION ACCUMULATION

Specifications

Simulation Results and Discussions

CHAPTER EIGHT: APPLICATION 3: PROPAGATION

Specifications

CHAPTER NINE: APPLICATION 4: GEOGRAPHY

Specifications

CHAPTER TEN: CONCLUSIONS

BIBLIOGRAPHY

LIST OF TABLES

Table 6.1: Perfectly Rank-Sized Distribution

Table 6.2: Not Rank-Sized Hierarchical Distribution

LIST OF FIGURES

Figure 3.1: An example of Moore neighborhood.

Figure 3.2: A snapshot of the space.

Figure 3.3: Initial distribution of agents.

Figure 3.4: First village of 2.

Figure 3.5: First village of 3.

Figure 3.6: Emerged clusters at $t = 0.06$.

Figure 3.7: Three-dimensional representation of urban clusters at $t = 0.06$.

Figure 3.8: Three-dimensional representation of urban clusters at $t = 0.1$.

Figure 3.9: Migrants’ net payoff with negative externalities ($\pi = 0.9$, $d = 1$).

Figure 3.10: Emergence of cities with 98 family units, $\pi = 0.9$, $c = 0.01$, $t = 0.2$.

Figure 3.11: Further development of cities with 98 family units, $\pi = 0.9$, $c = 0.01$, $t = 0.5$.

Figure 4.1: Distribution of attractive forces around cell $i$ where $p = 1$ and $G_j = 1$ for all $j$.

Figure 4.2: Distribution of attractive forces around cell $i$ where $p = 1$ and $G_j = 1$ for all $j$ except for $\Gamma(Q_j) = 70$ at $d_{ij} = 15$.

Figure 6.1: Spatial patterns in equilibrium under various sets of parameters.

Figure 6.2: Evolution of spatial patterns at $c = 0.3$.

Figure 6.3: Effect of $p$ and $c$ on number of clusters.

Figure 6.4: Effect of $p$ and $c$ on average cluster size.

Figure 6.5: Variance of cluster sizes on the $p$-$c$ space.

Figure 6.6: Spatial patterns with travel cost uniformly distributed from 0 to 1.

Figure 6.7: RSR Index vs. $p$ under $c = 0.2$.

Figure 6.8: Set of $p$ and $c$ that offer realistic population distributions added to figure 6.5.

Figure 7.1: Initial distribution of resources for Case 1.

Figure 7.2: Formation of clusters when resources exist with low transport cost.

Figure 7.3: Formation of clusters when resources exist with high transport cost.

Figure 7.4: Formation of clusters and behavior of mega-cluster with both resources and pollution (low transport cost).

Figure 7.5: Formation of clusters under larger influence of pollution.

Figure 7.6: Initial distribution of resources for Case 5.

Figure 7.7: Behavior of agents with depletable resources concentrated in the center of the space.

Figure 7.8: Behavior of agents with renewable resources concentrated in the center of the space and with accumulation of pollutants.

the space and with pollutant purification.

Figure 8.1: Propagation of agents.

Figure 8.2: Propagation of agents in a larger space.

Figure 9.1: Initial condition for $q_{i,4}$ for all $i$ (North American Map).

Figure 9.2: Formation of urban clusters of North America simulated under various transport cost ranges.

Figure 9.3: Snapshots of simulations at t = 100 under 0 < p < 10.

CHAPTER ONE:

INTRODUCTION

The role of science is to expand our knowledge about the universe and supply the findings to the fields that make use of them. For example, mathematics arms physicists with rigorous logical reasoning; physics allows astronomers to understand how objects interact with each other; astronomy helps engineers construct a guidance system for a space shuttle; and, as a result, we go to outer space. Conversely, the progress of science has also been helped by engineering: in the late 20th century especially, the advance of computer capability changed the way most scientific work is done.

Likewise, regional science has also exploited the computer capabilities of

recent years in order to expand our knowledge about our economy in terms of space

and offer the findings to policy makers.

Use of Computers in Regional Science

In regional science, computers are used in three ways: (1) massive computation of simple mathematical problems, which involves data processing and the numerical solution of systems of nonlinear equations; (2) Geographic Information Systems (GIS); and (3) simulation. (1) Massive computation of simple mathematical problems includes implementations of traditional methods such as statistical analyses (e.g. econometrics¹), Input-Output (I-O) analysis and computable general equilibrium (CGE) models. Regression analyses and I-O analyses with real-life data require a series of calculations on large matrices, which are impractical to do by hand. A CGE model normally includes a number of nonlinear equations and therefore cannot be solved analytically; thus, we need to resort to approximation by numerical calculation. These methods have been extensively developed over the last several decades. (2) GIS, which is an analysis and visualization tool rather than a method, is used in regional science when one applies a model to real-world geography. GIS has been actively used for less than a decade and will be extensively used with existing and new methods. (3) Simulation is a common technique in the natural sciences and in engineering fields such as Operations Research, but its use has been rare in regional science. The first objective of this study is to introduce this new method to the field of regional science.

¹ Guy Orcutt (1990), who is known as the pioneer of microsimulation, designed the first multiple regression analyzer during World War II.

Why Simulation?

Simulation has advantages over analytical approaches under some situations.

First, it is easy to deal with dynamic phenomena—it is, in fact, often inevitable for a

model to be dynamic. Second, one can model systems that offer complex spatial

patterns and/or non-linear behaviors. Third, randomness—more precisely,

stochasticity—can explicitly be added to a model.

Role of Randomness

Traditionally, when economists deal with random phenomena they use

probabilistic approaches such as sunspot equilibria, but with simulation it is possible

to leave randomness truly random. In other words, every time we run a simulation

with the same environment we would observe a different result. The second

objective of this study is to show how models with stochastic factors work and how

randomness affects the results.

Agent-Based Models

An agent-based model can be defined as a system of numerous autonomous

agents who interact with each other according to each agent’s rules, which are often

local. Also, it is known that even very simple rules can cause extremely complex

behaviors. The use of agent-based models is an ideal approach to make use of all the

advantages mentioned above; namely, the model can be dynamic and simultaneously

capable of simulating complex and random phenomena.

One of the implications of using simulation would be that we could possibly

“experiment” with new policies before applying them to the real world. With agent-

based models, in particular, we could also experiment on an individual’s preferences

and responses to the environment. Indeed, with agent-based approaches, it is possible

to even model adaptive behaviors of economic agents without allowing traditional

economic assumptions such as rational expectation. The difficulty in, and significance

of, modeling the complex behaviors of the real-world economy were pointed out by

Isard (1956) in the preface of “Location and Space-Economy”:

“…A presentation of conditions of equilibrium in a theoretical system

may seem to imply a tendency toward the attainment of a state of equilibrium

in the real world. But in a full historic sense, actual economic life never does

realize a state of equilibrium. There are always changes impinging upon the

economy. The process of adjustment is constantly in operation. Witness, for

example, the adaptation of population to environment. There has never been a

complete adjustment which might be said to characterize an optimum or

equilibrium spatial distribution of population.”

When this was written in 1956, analytical tools were the only ones available to most researchers; in fact, the methods introduced in the book attack complex problems within the framework of analytical approaches. With agent-based models, however, it is not a demanding task to create a model that exhibits the kind of complex behaviors described in the quotation above. The third objective of this research is to demonstrate this.

In general, agent-based models are uniquely capable of handling the following: (1) Interaction between agents. Unlike traditional microeconomic theories (not to mention macroeconomics), where only a representative agent is considered and it interacts with no other agents or with a few at most, agent-based approaches can explicitly model the interaction between each agent and an arbitrary number of other agents; (2) Asymmetrical space. Agent-based models allow us to handle complex-shaped spaces. For example, the shape of the space could be as complex as a map of the United States; and (3) Heterogeneous agents. Since the concept of agent-based models is object-oriented, it is relatively easy to model heterogeneity of agents. Note that we integrate all of these into one set of simple mathematical expressions.

Spatial Issues

As implied in the last part of the citation, one of the dominant factors that

complicate the real-life economy is the existence of space, which is, of course, the

motivation of regional science. As a matter of fact, traditional methods in regional

science such as those in “Methods of Interregional and Regional Analysis” (1998) do

answer some economic questions with regard to location, but they by themselves have

the limitation that the geography must be highly simplified so that the questions can

be solved analytically. In other words, if we are to apply an existing method to the real

geography, there is no way for the method to recognize the complex

shapes/dimensions of political and economic units.

It is an advantage of agent-based models that the physical complexity of the space does not matter, because an agent’s local rules are concerned only with its neighborhood, which has a simple shape². For this reason, existing methods with a simple scalar dimension, such as the gravity model, could be soundly incorporated into an agent-based model and used for vectorial analyses.

The final objective of this study is to integrate a few existing regional science methods into agent-based models and construct a general spatial agglomeration model that can be used in other methods in regional science.

² Moreover, as will be shown in later chapters, the intricate spatial distribution of population emerges rather than being exogenously given. This concept of emergence, which is often seen in complexity science, is consistent with historical evidence.

CHAPTER TWO:

BACKGROUND

Krugman (1996) constructed an elaborate economic model using the idea of “monopolistic competition” and developed it into a migration model. His model, however, does not allow a region to have “size”; in other words, each region is a point in the space. Furthermore, the scopes of both trade and economic agents’ ability to obtain information about other regions are global. As a result, depending on the values of the parameters, centrifugal and centripetal forces cannot coexist in the whole system, and this prevents the model from offering realistic spatial patterns.

Page (1999), on the other hand, applied simple rules to agents’ migration and

acquired results both analytically and by simulations. He assumed a lattice as the

space with each border not connected to the other side; this border condition is a

strong assumption, which dominates some of the results presented in his study.

Although it is an interesting case to consider, it lacks generality and often hides the

effects of other important parameters. The scope of agents was either strictly global or

local.

Schweitzer and Steinbrink (1997) used a stochastic model to simulate urban

agglomeration based on empirical facts such as the Rank-Size rule and developed a

kinetic model, which generates realistic spatial distributions. Although the model is

elaborate in terms of physics and the result is consistent with Berlin’s spatial

distribution, it lacks economic rationale; the realistic results seem coincidental.

Therefore, the model does not appear to be applicable to general urban problems.

Axtell and Epstein (1996) developed an agent-based model on a closed lattice

space and demonstrated extensive experiments, which included birth, death, gender,

culture, conflict, disease and so on. In their model, however, the agglomeration of

agents is governed by the initial distribution of “sugar”, which is exogenously given,

so the resulting spatial patterns do not arise by emergence.

CHAPTER THREE:

INTRODUCTION TO AGENT-BASED MODELS

In this chapter, a spatial agglomeration model is presented as an example of agent-based models. The model is kept just simple enough to illustrate how one could construct an agent-based model for social phenomena and show its outcome.

In general, steps involved in developing an agent-based model would be:

1. Define a world where agents perform their activities.

2. Define behavioral rules of each agent. This often includes

a. How it behaves by itself,

b. How it interacts with other agents, and

c. How it responds to the changes in the world.

3. Define how the world responds to the actions taken by agents.

4. Build an algorithm.

5. Write a computer program for simulation.

Furthermore, in order to utilize the model for analysis, one may also:

6. Construct hypotheses.

7. Determine simulations that would test these hypotheses.

8. Run simulations.

9. Analyze the results.

In the following sections, these steps (except for 2.c and 3) are followed to

build the model.

A Highly Simplified Model

Specifications of Space

The space in which economic agents live is set to be a lattice of cells, each of which can contain multiple agents. Furthermore, we make the space borderless by connecting one horizontal edge to the other and one vertical edge to the other³, forming a torus, in order to avoid potential border effects, which are undesirable for the purpose of this chapter⁴.

The distance between any two cells is defined by the Moore neighborhood. Figure 3.1 shows an example of a Moore neighborhood⁵, where the distance from the central cell is marked in each cell. We choose this type of distance ($L_\infty$ distance) rather than Euclidean ($L_2$) distance because the space is a lattice and for ease of coding. It is also known that the choice of distance does not affect the qualitative results (Durrett and Levin, 1994).

³ This is a setting commonly seen in spatial interaction models.

⁴ For example, if we are to construct a model that includes trade between locations, all agents might simply gather at the central cell of the lattice to minimize the transport cost. In this case, there could be no place for randomness.

⁵ An alternative neighborhood is:

        4
      4 3 4
    4 3 2 3 4
  4 3 2 1 2 3 4
4 3 2 1 0 1 2 3 4
  4 3 2 1 2 3 4
    4 3 2 3 4
      4 3 4
        4

which is characterized by $L_1$ distance or Manhattan distance and which might be, as its name suggests, appropriate for certain problems in regional science.

4 4 4 4 4 4 4 4 4
4 3 3 3 3 3 3 3 4
4 3 2 2 2 2 2 3 4
4 3 2 1 1 1 2 3 4
4 3 2 1 0 1 2 3 4
4 3 2 1 1 1 2 3 4
4 3 2 2 2 2 2 3 4
4 3 3 3 3 3 3 3 4
4 4 4 4 4 4 4 4 4

Figure 3.1: An example of Moore neighborhood.
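The distance definition above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the dissertation's C++ code: the function names (`torus_chebyshev`, `torus_manhattan`, `distance_rows`) and the representation of a cell as an `(x, y)` tuple are hypothetical. The wrap-around step reflects the torus described in the text, and the Manhattan variant corresponds to the alternative neighborhood mentioned in the footnote.

```python
def torus_chebyshev(a, b, size=50):
    """L-infinity (Moore) distance between cells a and b on a size x size torus."""
    dx = abs(a[0] - b[0])
    dy = abs(a[1] - b[1])
    # On a torus, the shorter of the direct and the wrap-around path counts.
    dx = min(dx, size - dx)
    dy = min(dy, size - dy)
    return max(dx, dy)


def torus_manhattan(a, b, size=50):
    """L1 (Manhattan) distance on the same torus (the alternative neighborhood)."""
    dx = abs(a[0] - b[0])
    dy = abs(a[1] - b[1])
    return min(dx, size - dx) + min(dy, size - dy)


def distance_rows(center, radius, metric, size=50):
    """Rows of distances from `center`, laid out like Figure 3.1."""
    cx, cy = center
    return [
        [metric(center, ((cx + ox) % size, (cy + oy) % size), size)
         for ox in range(-radius, radius + 1)]
        for oy in range(-radius, radius + 1)
    ]


if __name__ == "__main__":
    # Centering on a corner cell shows that the wrap-around works.
    for row in distance_rows((0, 0), 4, torus_chebyshev):
        print(" ".join(str(d) for d in row))
```

Because both edges are joined, no two cells on the 50 × 50 lattice are more than 25 units apart under this distance.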

Figure 3.2 is a snapshot of a lattice of 50 × 50 (=2500) cells with most cells

occupied by one family unit (small white dot), some with 2-9 family units (numbered)

and others empty (black). In addition, notice that there is a non-inhabitable area

represented by 23 white cells in the center of the lattice. Let us consider this area as a

lake, swamp, desert or other form of infertile area. Also, we assume no natural

resources such as oil, coal, etc., so, in other words, there are only human resources.

Figure 3.2: A snapshot of the space.

Productivity of Agents

Consider each agent as a family unit and assume that there are $N$ homogeneous agents. The basic principle of each family’s activity is, as in conventional microeconomic problems⁶, that it attempts to maximize its productivity.

To obtain the productivity of family $\alpha$, consider first the gross regional product of location $i$ in which $\alpha$ resides. We conceive agglomeration to originate in economies of scale (e.g. specialization, localization, spatial juxtaposition, etc.) and, as stated above, labor is the only input of production; thus, if we let $n_i$ denote the population in location $i$, we set up the production function,

$$F_i(n_i) = n_i^2, \tag{1}$$

which indeed reveals increasing returns to scale. Therefore, since all $n_i$ agents are homogeneous, the average product of $i$, or the real wage of family $\alpha$, is

$$f_i(n_i) = \frac{n_i^2}{n_i} = n_i. \tag{2}$$

This suggests that in order for family $\alpha$ to earn a higher real wage, it needs to increase the number of the other family units around it by migrating to another location that has a larger population⁷.

⁶ Of course, we may introduce other factors, in fact as many factors as we want, including non-economic ones. However, the main concern of this chapter is a smooth transition from conventional economics; thus, it is appropriate to stick to the traditional microeconomic way. The generalized model in the next chapter allows us to include other factors systematically.

Law of Motions

Family $\alpha$ (which is in location $i$) relocates to location $j$ if such a move gives the best net gain among all possible moves, including no relocation. A net gain is obtained by subtracting the transport cost of migration from the real wage at the destination. Therefore, if $\alpha$ relocates from $i$ to $j$, the net gain of family $\alpha$ is

$$G^{\alpha}(n_j) = n_j - \pi d_{ij}, \tag{3}$$

where $\pi$ is the transport rate per unit distance and $d_{ij}$ is the distance between $i$ and $j$. Thus, $\alpha$ chooses the $j$ that maximizes $G^{\alpha}$.

⁷ The family could also propagate and increase the population in the same location. The idea of birth and death will be added to the models introduced in chapter 8.
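As a small illustration of the net-gain rule in equation (3), the following Python sketch compares a few candidate moves. The names `net_gain` and `best_option`, and the representation of a move as a `(label, n_dest, distance)` tuple, are assumptions made for this example only. Here `n_dest` is taken to be the population at the destination once the family has arrived, which is how the worked payoffs later in this chapter are computed.

```python
def net_gain(n_dest, distance, pi=0.9):
    """Equation (3): real wage at the destination minus the transport cost."""
    return n_dest - pi * distance


def best_option(options, pi=0.9):
    """Return the (label, n_dest, distance) tuple with the highest net gain.

    `options` should include the stay-put choice with distance 0.
    On ties, the first maximum in the list is returned (an assumption of
    this sketch; it does not reproduce a random tie-break).
    """
    return max(options, key=lambda opt: net_gain(opt[1], opt[2], pi))


if __name__ == "__main__":
    options = [
        ("stay", 1, 0),            # alone at home: gain 1
        ("adjacent cell", 2, 1),   # join one neighbor: gain 2 - 0.9
        ("two cells away", 2, 2),  # same village size, farther away
    ]
    for label, n, d in options:
        print(label, net_gain(n, d))
    print("chosen:", best_option(options)[0])
```

With $\pi = 0.9$ and one family per cell, joining an adjacent family is the best of these three options.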

Note that family $\alpha$ is assumed to perceive the gross regional product of every location, that is, to have perfectly global scope. This is a somewhat unrealistic assumption (and not common in agent-based models). Family $\alpha$, however, tends to choose a $j$ that is in the neighborhood of $i$ due to the existence of transport cost, and behaves as if the family ignored distant locations. The models presented in the following chapters take a more realistic approach toward agents’ scope.

Algorithm

Once we set up the characteristics of the space and the behaviors of agents, we

need to build a flow of simulation—or an algorithm.

In this model, first, parameters are initialized as follows:

• Dimension of Space: 50 × 50 = 2500 cells with 23 non-inhabitable cells

• Number of Family Units: 2477

• Initial Distribution of Family Units: Uniform, or one family unit per cell⁸

• Transport Rate Per Unit Distance ($\pi$): 0.9

The initial distribution of agents is shown in Figure 3.3. Second, the first family unit $\alpha$ is randomly chosen to consider relocation. (Let the location of $\alpha$ be denoted by $i$.) Note that this is the main part where randomness enters into this model. Third, family unit $\alpha$ evaluates all other locations by equation (3) and finds the location, $j_{\max}$, that gives the highest gain. If more than one location gives the highest gain, the family unit chooses one of them randomly; this is another part where randomness might take effect. Fourth, if the gain from relocating to location $j_{\max}$ is larger than that from staying in location $i$, the family moves to $j_{\max}$; otherwise, it stays in location $i$. It also stays if those two gains are the same. Last, the algorithm goes back to the random selection of a family unit and repeats this process. Note that when the next family is randomly chosen, every family has an equal chance of being selected, so the same family could be chosen again with a probability of 1/2477.

⁸ The assumption of uniform initial distribution of family units is based on extensive historical facts. See Flick (1934, p. 164).
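The algorithm described above can be sketched as follows. This is a minimal Python illustration, not the original C++ program: all names (`make_world`, `select_and_move`, `run`), the small 10 × 10 default lattice, the absence of non-inhabitable cells by default, and the fixed random seed are assumptions chosen to keep the example short. The population at a candidate destination is counted including the arriving family, as in the worked payoffs of this chapter.

```python
import random


def torus_distance(a, b, size):
    """Moore (L-infinity) distance on a size x size torus."""
    dx = abs(a[0] - b[0])
    dy = abs(a[1] - b[1])
    return max(min(dx, size - dx), min(dy, size - dy))


def make_world(size, blocked=()):
    """One family per inhabitable cell; `blocked` cells are left out entirely."""
    cells = [(x, y) for x in range(size) for y in range(size)
             if (x, y) not in blocked]
    pop = {cell: 1 for cell in cells}   # cell -> number of families
    home = list(cells)                  # family index -> current cell
    return pop, home


def select_and_move(pop, home, size, pi, rng):
    """One random selection; returns True if the chosen family relocates."""
    k = rng.randrange(len(home))        # every family is equally likely
    i = home[k]
    stay_gain = pop[i]                  # n_i - pi * 0
    best_gain, best_cells = float("-inf"), []
    for j, n_j in pop.items():
        if j == i:
            continue
        gain = (n_j + 1) - pi * torus_distance(i, j, size)
        if gain > best_gain:
            best_gain, best_cells = gain, [j]
        elif gain == best_gain:
            best_cells.append(j)
    # Move only on a strict improvement; equal gains mean the family stays.
    if best_cells and best_gain > stay_gain:
        j_max = rng.choice(best_cells)  # random tie-break among the best
        pop[i] -= 1
        pop[j_max] += 1
        home[k] = j_max
        return True
    return False


def run(size=10, pi=0.9, selections=100, seed=0):
    """Asynchronous update: one family is selected at a time."""
    rng = random.Random(seed)
    pop, home = make_world(size)
    moves = sum(select_and_move(pop, home, size, pi, rng)
                for _ in range(selections))
    return pop, home, moves


if __name__ == "__main__":
    pop, home, moves = run()
    print("moves:", moves, "largest village:", max(pop.values()))
```

Each selection scans every cell, which is affordable at this scale; a larger lattice would likely call for a smarter search over candidate destinations.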

Time

The unit of time in this model is assumed to be the average length between the

times when a family unit is chosen and when the same family unit is chosen again, so

for every unit of time, every agent has a chance to be selected once. Therefore, in this

model’s settings, one unit of time is equivalent to 2477 selections of family units since

there are 2477 family units in total. This definition of time allows us to change the

number of family units yet keep the normalized time consistent.
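The normalization of time described above amounts to dividing the number of random selections by the number of family units. A short Python sketch follows; the function names are hypothetical, and truncating to a whole number of selections in the reverse direction is an assumption of this example.

```python
def selections_to_time(selections, n_families=2477):
    """Normalized time: one unit equals n_families random selections."""
    return selections / n_families


def time_to_selections(t, n_families=2477):
    """Whole number of selections completed by time t (truncated)."""
    return int(t * n_families)


if __name__ == "__main__":
    for k in (148, 495, 2477):
        print(k, "selections ->", round(selections_to_time(k), 2))
```

For example, 148 selections correspond to roughly 0.06 units of time and 495 selections to roughly 0.2 units.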

This way of updating the system, letting only one agent move at a time, is called asynchronous update, and it allows us to have continuous time. The alternative is synchronous update, where all agents make their moves at once (usually with each agent not knowing how the other agents move), which is equivalent to discrete time. Synchronous update causes additional (and sometimes undesirable) effects (Durrett and Levin, 1994), but in certain economic cases it could be more appropriate.⁹

⁹ For instance, economic agents (consumers, investors, governments, etc.) often make their decisions based on information that becomes available only after a certain period of time or only periodically, such as census data. In this case, time could be considered discrete.

Figure 3.3: Initial distribution of agents.

Implementation of a Sample Simulation

The simulations presented in this research were implemented in C++ and run on a personal computer. When the program is started, it first shows the initial distribution of family units as shown in Figure 3.3. Then it chooses the first family unit $\alpha_1$ in location $i_1$ to consider relocation. Among all locations except for $i_1$, the adjacent ones ($d = 1$) give the highest payoff; namely,

$$G^{\alpha_1}\left(\left. n_j \right|_{d=1}\right) = \left. n_j \right|_{d=1} - \pi d_{ij} = 2 - 0.9 \times 1 = 1.1. \tag{4}$$

Also, the payoff from staying in $i_1$ is

$$G^{\alpha_1}(n_{i_1}) = n_{i_1} - \pi d_{ij} = 1 - 0.9 \times 0 = 1, \tag{5}$$

which is smaller than the payoff from relocation. Therefore, the family relocates to one of its adjacent locations (recall that when a family unit is indifferent between destinations, it chooses one of them randomly), making the first “village” that consists of two family units. The state at this point in time is shown in Figure 3.4, where there is one cell marked “2” next to a black cell without a white dot.

Figure 3.4: First village of 2.

It is conceivable that the very first family, and therefore any family, does not move at all and the initial distribution of families remains if $\pi$ is larger than a certain value. In fact, such a critical value exists and can be derived algebraically; that is, the first family moves only if

$$G^{\alpha_1}\left(\left. n_j \right|_{d=1}\right) = \left. n_j \right|_{d=1} - \pi d_{ij} = 2 - \pi \times 1 > 1 \tag{6}$$

or

$$\pi < 1. \tag{7}$$

In an anthropological sense, this could partially explain why, when and/or where the cost of migration was too high, ancient families did not form the groups that eventually led to a civilization, and further why civilizations emerged on plains.

After some random selections of families making villages of “2”, one family eventually joins a village of “2” and makes the first village with three family units (see Figure 3.5). Notice that the third family came from a cell two distance units away. Indeed, this is possible since

$$G^{\alpha}\left(\left. n_j \right|_{d=2}\right) = 3 - 0.9 \times 2 = 1.2 > 1, \tag{8}$$

although not from a cell three distance units away because

$$G^{\alpha}\left(\left. n_j \right|_{d=3}\right) = 3 - 0.9 \times 3 = 0.3 < 1. \tag{9}$$
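The reasoning behind equations (8) and (9) generalizes to a simple question: from how far away can a village attract a lone family? The Python sketch below is an illustration only; the function name `max_reach` and its parameters are hypothetical. It assumes the candidate family currently lives alone (so its gain from staying is 1) and that the village size after arrival is the current size plus one.

```python
def max_reach(village_size, pi=0.9, stay_gain=1.0):
    """Largest whole distance from which a lone family strictly gains by joining.

    village_size is the population before the newcomer arrives.
    """
    if pi <= 0:
        raise ValueError("pi must be positive for the reach to be finite")
    d = 0
    # Try one more unit of distance while the move still beats staying put.
    while (village_size + 1) - pi * (d + 1) > stay_gain:
        d += 1
    return d


if __name__ == "__main__":
    for size in (1, 2, 5, 10):
        print("village of", size, "reaches", max_reach(size), "cells")
```

Under these assumptions the reach grows with village size, which is consistent with larger clusters drawing families from ever farther away.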

Figure 3.5: First village of 3.

Similarly, families keep forming villages of 4, 5, 6 and so on. After 148 random selections (or $t = 0.06$), three clusters that range from 10 to 20 family units can be observed, as shown in figure 3.6. In this figure, a color bar is added in order to estimate the numbers of families in the cells with more than nine family units.

Figure 3.6: Emerged clusters at $t = 0.06$.

To provide different perspectives on the emerging spatial pattern of clusters, let us use a three-dimensional representation of the space. The three-dimensional graph in figure 3.7 corresponds to the two-dimensional picture in figure 3.6, and figure 3.8 shows the spatial pattern after 247 random selections or at $t = 0.1$. In these figures, the vertical axis is log-scaled in order to depict larger concentrations. Note how the spatial pattern changes and starts to form a clear hierarchical structure in spite of the simplicity of the model.

[Plot axes: x and y from 0 to 50; vertical axis log10POP from 0 to 4.]

Figure 3.7: Three-dimensional representation of urban clusters at $t = 0.06$.

[Plot axes: x and y from 0 to 50; vertical axis log10POP from 0 to 4.]

Figure 3.8: Three-dimensional representation of urban clusters at $t = 0.1$.

This hierarchical structure, however, does not last long in the highly simplified

model because all families eventually go to a single location (most likely the one with

the highest peak in figure 3.8). This is always the case because the model assumes

infinite capacity, which allows an urbanized area to simply keep attracting more

families. Obviously, it is unrealistic. In the next section, we tweak the model and

attempt to solve this problem.

A More Realistic Simplified Model

Extension of the Model—Diseconomies and Negative Externalities

We improve the highly simplified model in the last section by introducing diseconomies and other negative externalities such as high rent, crime, environmental pollution, etc. to the model. This can be done by adding a new term, $-c\,n_j^2$, to equation (3); namely,

$$G^{\alpha}(n_j) = n_j - \pi d_{ij} - c\,n_j^2, \tag{10}$$

where $c$ is the parameter that captures the magnitude of negative externalities. We keep $c$ small enough to allow negative externalities to take effect only when $n_j$ becomes reasonably large. In fact, because $c \ll 1$ and the term is quadratic in $n_j$, the term $-c\,n_j^2$ is negligible when $n_j$ is small and grows rapidly in magnitude as $n_j$ increases. Figure 3.9 illustrates this tendency under $\pi = 0.9$ and $d = 1$. We can observe that $G^{\alpha}(n)$ indeed starts to decline after obtaining a certain number of families (to be precise, at $n = 1/(2c)$) and eventually becomes incapable of attracting even a lone family in the nearest neighborhood.
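The turning point of the net gain with negative externalities can be checked numerically. In the Python sketch below, the function names are hypothetical and the default values ($\pi = 0.9$, $d = 1$) follow the settings quoted for Figure 3.9. The analytical peak comes from setting the derivative of $n - \pi d - c\,n^2$ with respect to $n$ to zero, which gives $1 - 2cn = 0$; the brute-force search is only a cross-check.

```python
def net_gain_ext(n, distance=1, pi=0.9, c=0.01):
    """Net gain with the negative-externality term -c * n**2."""
    return n - pi * distance - c * n ** 2


def peak_population(c):
    """Population at which the net gain peaks: 1 - 2*c*n = 0."""
    return 1.0 / (2.0 * c)


def brute_force_peak(c, pi=0.9, n_max=1000):
    """Whole number n in 1..n_max with the largest net gain (cross-check)."""
    return max(range(1, n_max + 1), key=lambda n: net_gain_ext(n, 1, pi, c))


if __name__ == "__main__":
    for c in (0.01, 0.02, 0.05):
        print("c =", c, "analytic peak:", peak_population(c),
              "search:", brute_force_peak(c))
```

For the three values of $c$ plotted in Figure 3.9, the peak falls at 50, 25 and 10 family units respectively.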

[Plot axes: n from 0 to 100 (horizontal), G(n) from -10 to 25 (vertical); curves for c = 0.01, c = 0.02 and c = 0.05.]

Figure 3.9: Migrants’ net payoff with negative externalities ($\pi = 0.9$, $d = 1$).

Implementation of a Sample Simulation

Based on the modification above, let us run a simulation with $c = 0.01$, with all other parameters kept the same as in the previous section. For a better observation of the effect of the new term, $-c\,n_j^2$, on the result, we start the simulation at $t = 0.1$ with the population distribution identical to that in figure 3.8.

Figure 3.10 shows the result after 495 random selections or $t = 0.2$. The primary node clearly seen in figure 3.8 has not developed; instead, other cities of equal size have emerged. In fact, any city stops growing when its population reaches 98 family units for the following reason: if an agent is alone in location $i$ and stays there, its net gain is 0.99. Similarly, if the family moves to the next cell¹⁰ to make a village (or city) of 98 families, it gains 1.06, which is larger than 0.99. But if the family made a village of 99 families, it would gain only 0.09, which is, of course, smaller than 0.99. Thus, any village that already has 98 families cannot accommodate another family.

¹⁰ We do not need to consider the cases where the family moves to a location more than one unit distance away because it is obviously more costly to make a village of 98 families by migrating from a location two units distance away or farther.
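The cap of 98 family units can be reproduced with a short search. The Python sketch below is illustrative; the function name `max_city_size` and the search limit are assumptions. It compares the gain of a lone family that stays put, $1 - c$, with the gain of moving one cell to form a city of $n$ families, $n - \pi - c\,n^2$, and returns the largest $n$ for which moving is strictly better.

```python
def max_city_size(pi=0.9, c=0.01, limit=10_000):
    """Largest city a lone family next door would still strictly prefer to join."""
    lone_gain = 1 - c * 1 ** 2          # n = 1, distance 0
    best = 1
    for n in range(2, limit):
        join_gain = n - pi * 1 - c * n ** 2
        if join_gain > lone_gain:
            best = n
    return best


if __name__ == "__main__":
    cap = max_city_size()
    print("cap:", cap)
    # Full cities and leftover families among 2477 family units.
    print(divmod(2477, cap))
```

With the chapter's parameters the search returns 98, and dividing the 2477 family units by this cap leaves 25 full cities and 27 families over.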

Let us move the simulation forward. Figure 3.11 is a snapshot of the same simulation at $t = 0.5$. We observe the further development of cities of 98 families. Notice also that some smaller villages have disappeared; the families there have chosen to migrate to make new cities of 98 families each. Actually, in equilibrium, with $c = 0.01$, the families most often make 25 cities of 98 and the residual families make one or more smaller village(s). In general, given a value for $c$, all families attempt to make a certain number of cities of equal size. Unfortunately, this still is an unrealistic result.

In the next chapter, we construct an even more realistic and general model for spatial agglomeration, which is capable of capturing arbitrarily many factors, in order to better simulate the real world.

[Plot axes: x and y from 0 to 50; vertical axis log10POP from 0 to 4.]

Figure 3.10: Emergence of cities with 98 family units, $\pi = 0.9$, $c = 0.01$, $t = 0.2$.

[Plot axes: x and y from 0 to 50; vertical axis log10POP from 0 to 4.]

Figure 3.11: Further development of cities with 98 family units, $\pi = 0.9$, $c = 0.01$, $t = 0.5$.

CHAPTER FOUR:

GENERALIZED SPATIAL AGGLOMERATION MODEL

This chapter involves two major tasks for the development of a generalized

agent-based model for spatial agglomeration. First, we mathematically formalize the

relationship between properties of a location (e.g. population, resources,

environmental pollution, etc.) and the perception of each agent in the location. This

will enable us to systematically add any quantifiable factors involved in the urban

agglomeration/diffusion phenomenon. Second, we examine the way individuals in the

real world make their decisions when they choose a place to live, and incorporate the

procedure into the algorithm. This will result in the formation of highly realistic

spatial patterns.

Properties of Location—Q-Vector

In this section, we establish a formal expression of quantifiable properties (e.g. population, resources, pollution level, etc.) that belong to a cell so that we can treat those properties uniformly and do not have to change the model itself when we add more properties. We first introduce the Q-vector of cell $i$. Write:

$$Q_i = \begin{pmatrix} q_{i,1} \\ q_{i,2} \\ q_{i,3} \\ \vdots \\ q_{i,M-1} \\ q_{i,M} \end{pmatrix} \tag{11}$$

where each element of $Q_i$ is a quantity or level of a property and $M$ is the number of properties we choose. For convenience, we reserve the first element, $q_{i,1}$, for the domestic population of agents throughout this study. For example, if one wishes to include resources and pollution into consideration, the Q-vector is written as:

$$Q_i = \begin{pmatrix} q_{i,1} \\ q_{i,2} \\ q_{i,3} \end{pmatrix} = \begin{pmatrix} \text{population of agents} \\ \text{amount of resources at cell } i \\ \text{pollution level at cell } i \end{pmatrix} \tag{12}$$
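One possible way to hold a Q-vector in code is sketched below in Python. This is an assumption about representation, not the dissertation's implementation: the class name `Cell`, the index constants and the helper `total` are all hypothetical. The point it illustrates is the one made in the text, namely that code written against the vector does not need to change when more properties are appended.

```python
from dataclasses import dataclass
from typing import List

# Position of each property inside the Q-vector (0-based here, 1-based in the text).
POPULATION, RESOURCES, POLLUTION = 0, 1, 2


@dataclass
class Cell:
    """One lattice cell; `q` holds the Q-vector (q_{i,1}, ..., q_{i,M})."""
    q: List[float]

    @property
    def population(self) -> float:
        # The first element is reserved for the population of agents.
        return self.q[POPULATION]


def total(cells, index):
    """Sum one property over all cells; works for any number of properties M."""
    return sum(cell.q[index] for cell in cells)


if __name__ == "__main__":
    cells = [Cell([1, 5.0, 0.0]), Cell([3, 2.0, 0.5])]
    print("population:", total(cells, POPULATION))
    print("resources:", total(cells, RESOURCES))
    print("pollution:", total(cells, POLLUTION))
```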

Properties such as amount of resources and pollution level are dynamic: they

change over time depending on human activities, other properties, and sometimes

themselves. The dynamics of properties is explicitly defined by an ordinary

differential equation (ODE) of Q-vector:

$$\begin{aligned} \dot{Q} &= f(Q) \\ Q_{t=0} &= Q_0 \end{aligned} \tag{13}$$ $^{11}$

where the second line of equation (13) is the initial condition.

For instance, if we are to define the dynamics of each property in (12) by the following rules:

• Population grows at a constant rate, $r_1$.

• Each agent consumes resources at a constant rate, $r_2$.

• Each agent generates pollutants at a constant rate, $r_3$.

then the ODE of $Q_i$ is written as:

11 One may point out that there is inflow or outflow of agents; therefore, the ODE for q1 should be

adjusted accordingly—We do not have to include inflow or outflow of agents here because equation

(13) defines the changes in Qi during the time between one migration and another. Recall that we

simulate continuous time by asynchronous update.

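$$\dot{Q}_i = \begin{pmatrix} \dot{q}_{i,1} \\ \dot{q}_{i,2} \\ \dot{q}_{i,3} \end{pmatrix} = \begin{pmatrix} r_1 \\ -r_2\, q_{i,1} \\ r_3\, q_{i,1} \end{pmatrix} \tag{14}$$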

Note that this single ODE indeed embraces the fact that both the amount of resources

and pollution level are influenced by the presence of agents. Once the amount of

resources reaches zero, equation (14) needs to be replaced by:

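$$\dot{Q}_i = \begin{pmatrix} \dot{q}_{i,1} \\ \dot{q}_{i,2} \\ \dot{q}_{i,3} \end{pmatrix} = \begin{pmatrix} r_1 \\ 0 \\ r_3\, q_{i,1} \end{pmatrix} \tag{15}$$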

Equation (14) is a system of first-order linear equations, so it can be written as:

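$$\dot{Q}_i = A Q_i + B \tag{16}$$

where

$$A = \begin{pmatrix} 0 & 0 & 0 \\ -r_2 & 0 & 0 \\ r_3 & 0 & 0 \end{pmatrix} \quad \text{and} \quad B = \begin{pmatrix} r_1 \\ 0 \\ 0 \end{pmatrix} \tag{17}$$

Therefore, the stability of equation (14) can also be obtained analytically by using the eigenvalues of $A$.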
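The linear form in equations (16) and (17) can be sketched numerically. The following is a minimal illustration only: the rate values, the starting vector, the step size and the function names are assumptions made for the example, and explicit Euler stepping is just one simple way to advance the system between two migrations. For this particular $A$, which is strictly lower triangular, all eigenvalues are zero.

```python
import numpy as np

# Hypothetical rates, chosen only for illustration.
r1, r2, r3 = 0.5, 0.2, 0.1

# Matrix A and vector B of equation (17).
A = np.array([[0.0, 0.0, 0.0],
              [-r2, 0.0, 0.0],
              [r3, 0.0, 0.0]])
B = np.array([r1, 0.0, 0.0])


def q_dot(Q):
    """Right-hand side of equation (16): dQ/dt = A Q + B."""
    return A @ Q + B


def euler(Q0, dt, steps):
    """Explicit Euler integration of equation (16); resources are not clamped here."""
    Q = np.asarray(Q0, dtype=float)
    for _ in range(steps):
        Q = Q + dt * q_dot(Q)
    return Q


# Eigenvalues of A, as used for the stability analysis mentioned in the text.
eigenvalues = np.linalg.eigvals(A)

# One unit of time for a cell with 10 agents, 100 units of resources, no pollution.
Q_end = euler([10.0, 100.0, 0.0], dt=0.01, steps=100)
```

Because the sketch does not stop resource depletion at zero, it corresponds to equation (14) only; the switch to equation (15) would need an explicit check on $q_{i,2}$.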

Decision Making Process in Migration

Our strategy for modeling migration is to simply make each agent “mimic” a

real person’s behaviors without making the model too complicated. Thus, we divide

the whole decision making process into two parts:


1. First, an agent decides which city$^{12}$ she wants to live in.

2. Then, she finds the location she actually settles in.

The concept of this two-stage process is quite intuitive: When we consider relocation,

we often do think of a city first (e.g. “I want to live in the New York Metropolitan

Area!”) without detailed information about every community or each block in the city.

Also, we tend to assess a city by its economic center—or cultural center, depending on

what the criterion is. Then we start looking into various neighborhoods using more

detailed criteria such as rent, security, access to downtown, etc. (e.g. “The rent in

Manhattan is too high although I want to visit there sometimes”), and finally find the

most reasonable place to live (e.g. “I decided to live in the Bronx”).

The decision making process stated above is a search problem: one searches

for one place to move to among other possibilities, given a certain amount of

information. In fact, the first stage is a global search, where an agent performs a

search throughout the whole space; the second stage is a local search, where the agent

has only limited scope. Thus, we model each of the two stages as a separate search

problem.

Global Search

For the first stage, an agent in cell $i$ evaluates each cell $j$ $(j = 1, 2, 3, \dots, i, \dots)$ in the whole space by the following utility function:

$$G_j = \Gamma(Q_j)\, D(d_{ij}) \tag{18}$$

12 From this chapter on, when we use the word, “city”, imagine a cluster that consists of more than a

single cell rather than a dimensionless cluster as in the previous chapter.

where $Q_j$ is the Q-vector of cell $j$. $\Gamma(Q_j)$ is a utility function $(\Gamma: \mathbb{R}^M \to \mathbb{R})$ which is to be defined appropriately. $D(d_{ij})$ is a function of the distance between cells $i$ and $j$.

As stated in chapter 1, one of our goals is to show how existing regional science models can be incorporated into agent-based models; so, we take a gravity model type approach$^{13}$ for $D(d_{ij})$; namely,

$$G_j = \frac{\Gamma(Q_j)}{d_{ij}^{\,p}} \tag{19}$$ $^{14}$

The parameter $p$ controls the contribution of distance, $d_{ij}$; in an economic sense, it could be considered as a function of transport cost.

The agent then chooses the cell $j_{\max}$ that gives the highest value of $G_j$ $^{15}$. We consider $j_{\max}$ as the economic center mentioned above. Although the agent evaluates all cells in the space for the sake of the algorithm’s consistency, it gives the same $j_{\max}$ as the case where she recognizes clusters and evaluates the economic centers of the clusters.
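The global search can be sketched in a few lines of Python. This is a minimal illustration and not the program used for the simulations: the function names, the array `gamma` holding $\Gamma(Q_j)$ for every cell, and the Euclidean distance on a torus are assumptions made for the example. The within-cell distance of 0.25 and the random tie-break follow footnotes 14 and 15.

```python
import random
import numpy as np


def torus_distance(size, a, b):
    """Euclidean distance between cells a and b on a size x size torus (assumed metric)."""
    dx = abs(a[0] - b[0])
    dy = abs(a[1] - b[1])
    dx = min(dx, size - dx)
    dy = min(dy, size - dy)
    return (dx * dx + dy * dy) ** 0.5


def global_search(gamma, i, p, rng=random):
    """Return j_max, a cell maximizing G_j = gamma[j] / d_ij**p as in equation (19)."""
    size = gamma.shape[0]
    best, best_cells = -float("inf"), []
    for x in range(size):
        for y in range(size):
            d = torus_distance(size, i, (x, y))
            if d == 0:
                d = 0.25  # distance from a cell to itself (footnote 14)
            g = gamma[x, y] / d ** p
            if g > best:
                best, best_cells = g, [(x, y)]
            elif g == best:
                best_cells.append((x, y))
    return rng.choice(best_cells)  # random choice among ties (footnote 15)


# Uniform attractiveness: the agent's own cell wins because of its short distance.
gamma = np.ones((50, 50))
home = global_search(gamma, (0, 0), p=1.0)

# One very prominent distant cell can still win the global search.
gamma[15, 0] = 70.0
far = global_search(gamma, (0, 0), p=1.0)
```

Evaluating every cell costs one pass over the grid per agent, which mirrors the statement that the agent evaluates all cells for the sake of the algorithm's consistency.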

Note that in terms of gravity models, $G_j$ can be viewed as the attractive force of $j$ on $i$. Figure 4.1 illustrates the distribution of attractive forces for $p = 1$. In this figure, $\Gamma(Q_j)$ is set to be unity for all $j$ to see the pure effect of $d_{ij}$ on attractive force. Observe how distance discounts $\Gamma(Q_j)$ for $d_{ij} > 1$; this implies that although

13 Another possible form for $D(d_{ij})$ would be $D(d_{ij}) = \sqrt{p/2\pi}\,\exp(-p\,d_{ij}^2/2)$, which is equivalent to the p.d.f. of Brownian motion. This form is appropriate if we assume that the agent has to search for a city by random walk within a limited length of time.

14 It should be noted that when $i = j$, the distance, $d_{ij}$, is set to be 0.25, which is the average of arbitrarily many distances from the center of the cell and within the cell.

15 If there is more than one cell with the highest $G_j$, then the agent randomly chooses one.


the search is still global, the agent is strongly biased for her local cells$^{16}$ and tends to dismiss distant cells$^{17}$.

Figure 4.1: Distribution of attractive forces around cell $i$ where $p = 1$ and $\Gamma(Q_j) = 1$ for all $j$. (Plot of $G_j$ against $d_{ij}$.)

The agent, however, will not dismiss a cell in a distant location if $\Gamma(Q_j)$ is very large. One such case is presented in figure 4.2. In this example, $\Gamma(Q_j)$ has been increased to 70 only at $d_{ij} = 15$, keeping everything else unchanged from the settings in figure 4.1. In this case, the agent will indeed choose the cell at $d_{ij} = 15$ because her net welfare at $d_{ij} = 15$ is higher than that in the original location although she is far from the cell. As shown in this example, the gravity type approach enables the agent

16 However, note that if p=0, the scope of the search is purely global.

17 This is analogous to the case in astronomy where a rocket experiences zero “gravity” when it is far

enough from the earth.


to “jump” to prominent cells in a distant location instead of dismissing it—just as a

filmmaking student in Syracuse would consider Los Angeles as well as New York

City.
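The comparison behind the figure 4.2 example can be checked by hand with equation (19). The short calculation below assumes $p = 1$, $\Gamma(Q_j) = 1$ everywhere except $\Gamma(Q_j) = 70$ at $d_{ij} = 15$, and the within-cell distance of 0.25 from footnote 14:

```latex
\begin{aligned}
G_i &= \frac{1}{0.25} = 4 && \text{(staying in the original cell)} \\
G_j &= \frac{1}{1} = 1 && \text{(an ordinary cell at } d_{ij} = 1\text{)} \\
G_j &= \frac{70}{15} \approx 4.67 && \text{(the prominent cell at } d_{ij} = 15\text{)}
\end{aligned}
```

Since $4.67 > 4$, the distant cell gives the highest attractive force under these assumptions, which is why the agent relocates there.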

Figure 4.2: Distribution of attractive forces around cell $i$ where $p = 1$ and $\Gamma(Q_j) = 1$ for all $j$ except for $\Gamma(Q_j) = 70$ at $d_{ij} = 15$. (Plot of $G_j$ against $d_{ij}$.)

Local Search

After choosing a city (more precisely, the economic center of the city), the agent begins to look for a place to settle in by performing a local search starting from $j_{\max}$. First, the agent evaluates eight adjacent cells $h$ $(h = 1, 2, \dots, 8)$ in her neighborhood by the following utility function:

$$L_h = \Lambda(Q_h) + \sum_{k \,\in\, h\text{'s neighborhood}} \Lambda(V \otimes Q_k) \tag{20}$$

and chooses $h_{\max 1}$ such that $L_h$ is the largest$^{18}$. The function $\Lambda(Q_h)$ $(\Lambda: \mathbb{R}^M \to \mathbb{R})$ is a utility function that gives a larger information set (or simply, more detailed information) than that of $\Gamma(Q_j)$ $^{19}$. The second term of the equation is the contribution of the agent’s neighborhood: it captures the assumption that she gains or suffers from adjacent cells. The existence of this term encourages the agent to stay with the cluster, ensuring its growth outwards. $V$ is a vector with $M$ non-negative elements, each of which corresponds to an element of $Q_k$. It controls, for each property, how much she gains or suffers from cells adjacent to her.

The agent then moves her focus to $h_{\max 1}$ and evaluates the eight adjacent cells of $h_{\max 1}$ to find a better cell. If she indeed finds a better cell ($h_{\max 2}$), she moves her focus to it again. She repeats this process until she hits the cell $h_{\max}$ where she cannot find a better cell in its neighborhood. Finally, the agent moves to $h_{\max}$.

18 As in the global search, if more than one possibility exists, she randomly chooses one.

19 This specification is important because otherwise the agent would find her final destination just by

the global search, which is against the idea of the model.
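The local search amounts to hill climbing over Moore neighborhoods, and one possible reading of it is sketched below. The names, the array layout (`Q[x, y]` is the Q-vector of a cell), and the torus wrap-around are assumptions for the example. The text leaves open whether the starting cell itself competes in the first step; this sketch assumes it does, so the agent only moves when a neighbor is strictly better. The test landscape at the end is hypothetical.

```python
import random
import numpy as np

# Offsets of the eight adjacent cells (Moore neighborhood).
OFFSETS = [(dx, dy) for dx in (-1, 0, 1) for dy in (-1, 0, 1) if (dx, dy) != (0, 0)]


def neighbors(cell, size):
    """Eight adjacent cells of `cell` on a size x size torus."""
    x, y = cell
    return [((x + dx) % size, (y + dy) % size) for dx, dy in OFFSETS]


def local_utility(Q, V, lam, h):
    """L_h of equation (20): own utility plus the weighted neighborhood contribution."""
    size = Q.shape[0]
    own = lam(Q[h])
    spill = sum(lam(V * Q[k]) for k in neighbors(h, size))  # V * Q_k is element-wise
    return own + spill


def local_search(Q, V, lam, start, rng=random):
    """Climb from `start` until no adjacent cell has a strictly larger L_h."""
    size = Q.shape[0]
    current = start
    current_value = local_utility(Q, V, lam, current)
    while True:
        scored = [(local_utility(Q, V, lam, h), h) for h in neighbors(current, size)]
        best = max(value for value, _ in scored)
        if best <= current_value:
            return current  # h_max: no better cell in the neighborhood
        current = rng.choice([h for value, h in scored if value == best])
        current_value = best


# Hypothetical landscape: a single peak at (3, 3), no neighborhood effect (V = 0).
size = 7
Q = np.zeros((size, size, 2))
for x in range(size):
    for y in range(size):
        Q[x, y, 0] = 10 - max(abs(x - 3), abs(y - 3))
V = np.zeros(2)
result = local_search(Q, V, lambda q: q.sum(), start=(0, 0))
```

Each step strictly increases $L_h$, so the loop terminates on a finite grid; like any hill climb, it may stop at a local rather than a global maximum.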

CHAPTER FIVE:

APPLICATIONS OF THE GENERALIZED SPATIAL AGGLOMERATION

MODEL

In the following chapters, four applications of the generalized spatial

agglomeration model are shown. For each application, specifications of the model

involve the following steps:

• Determine the Q-vector.

• Construct an ODE for the Q-vector and its initial condition(s).

• Define utility functions.

To determine the Q-vector, we simply have to list the quantifiable properties that

belong to a cell, including the population of agents. To construct ordinary differential

equations for the Q-vector’s elements, we examine the relationships between each

property and the rest of the properties and mathematically formulate those relationships. To define utility functions, we need to define $\Gamma(Q_j)$ in equation (18), and $\Lambda(Q_h)$ and vector $V$ in equation (20).

Unless mentioned, other settings are the same as in the highly simplified model

in chapter 3, except that we no longer assume the 23 non-inhabitable cells. That is, (1) the size of the space is 50×50; (2) the shape of the space is a torus; and (3) the number of agents is 2500 for the models without birth or death.


CHAPTER SIX:

APPLICATION #1: A SIMPLE MODEL

The purpose of presenting this application is to show common behaviors of the

generalized spatial agglomeration model by using a simple Q-vector, ODE and utility

functions. By referring to the results of this simple model, we can observe how the

results change when more advanced models are introduced.

Specifications

Q-Vector

In this application, we have two properties. As stated earlier, $q_1$ is the population of agents. The other property, $q_2$, is the negative externality and diseconomies as in the more realistic simplified model in chapter 3; namely, $q_2$ comprises the term $-cn^2$, where $c$ is the parameter that captures the magnitude of diseconomy. We do not assume exogenous factors such as resources and pollution. Thus, the Q-vector is:

$$Q_i = \begin{pmatrix} q_{i,1} \\ q_{i,2} \end{pmatrix} = \begin{pmatrix} n_i \\ -c\,n_i^2 \end{pmatrix} \tag{21}$$

ODE

There is no birth or death for agents, or endogenous change in population; accordingly, there is no change in $q_1$ or $q_2$. Therefore, the ODE of the Q-vector is:

$$\dot{Q}_i = \begin{pmatrix} \dot{q}_{i,1} \\ \dot{q}_{i,2} \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix} \tag{22}$$

The initial distribution of agents is random, so the initial condition of this ODE is:

$$Q_i(0) = \begin{pmatrix} q_{i,1}(0) \\ -c\,\bigl(q_{i,1}(0)\bigr)^2 \end{pmatrix} \tag{23}$$

where $q_{i,1}(0)$ is zero or a random positive integer (≤ 2500) such that:

$$\sum_{i=1}^{2500} q_{i,1}(0) = 2500 \tag{24}$$
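One way to generate an initial condition that satisfies equations (23) and (24) is sketched below. The text only says the initial distribution is random, so placing each agent in a uniformly chosen cell is an assumption, as are the function name, the seed and the default value of $c$.

```python
import numpy as np


def initial_q(size=50, agents=2500, c=0.3, seed=0):
    """Return an array of shape (size, size, 2) holding Q_i(0) for every cell."""
    rng = np.random.default_rng(seed)
    # Assumed rule: every agent independently picks one of the size*size cells.
    cells = rng.integers(0, size * size, size=agents)
    n = np.bincount(cells, minlength=size * size).reshape(size, size).astype(float)
    # Equation (23): q1 = n_i and q2 = -c * n_i**2.
    return np.stack([n, -c * n ** 2], axis=-1)


Q0 = initial_q()
total_agents = Q0[..., 0].sum()  # equals the number of agents, as equation (24) requires
```

Counting agents per cell with `np.bincount` guarantees that the populations are non-negative integers and that they sum to the number of agents placed.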

Utility Functions

For the global search, we assume that each agent is only concerned with

positive externalities. So,

$$\Gamma(Q_i) = q_{i,1} = n_i \tag{25}$$

Therefore,

$$G_i = \frac{\Gamma(Q_i)}{d_{ij}^{\,p}} = \frac{n_i}{d_{ij}^{\,p}} \tag{26}$$

For the local search, diseconomy enters:

$$\Lambda(Q_h) = q_{h,1} + q_{h,2} = n_h - c\,n_h^2 \tag{27}$$

Furthermore, we assume:

$$V = \begin{pmatrix} 1 \\ 0 \end{pmatrix} \tag{28}$$

This means each agent benefits from the economies in her surrounding cells without

loss and will not be distressed by the diseconomies emanating from those cells. Thus,

the utility function for the local search is:

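$$\begin{aligned} L_h &= \Lambda(Q_h) + \sum_{k \,\in\, h\text{'s neighborhood}} \Lambda(V \otimes Q_k) \\ &= n_h - c\,n_h^2 + \sum_{k \,\in\, h\text{'s neighborhood}} n_k \end{aligned} \tag{29}$$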

Simulation Results

The simulations in this study were performed using a software program

developed with a C++ language compiler on a personal computer. In this section, we

first observe the development of spatial patterns from various angles, followed by the

analyses.

Spatial Patterns in Equilibrium

Figure 6.1 shows spatial patterns in equilibrium under various sets of parameters $p$ and $c$. The parameter $p$ increases toward the bottom of the table and $c$ toward the rightmost column. Different colors indicate population densities, which correspond to the color bar at the bottom of the figure.

Roughly speaking, we can observe that the increase in $c$ causes a cluster to expand its horizontal size; conversely, the decrease in $c$ causes it to increase in density. This result is consistent with the predictions of the model itself. Similarly, the larger $p$ is, the smaller the distances between clusters. This is also a natural consequence, given that $p$ is the parameter that represents the relocation cost.

Notice that for each c there seems to be a transitional state between the state

where there is one mega-cluster and that where there are arbitrarily many small clusters.

In fact, outside that transitional state, the effect of p appears to be very small. There is

a possibility that this is a phase shift, which is commonly seen in the natural sciences.

Furthermore, in the transitional state, the system offers clear hierarchical structures.

Since this is a significant topic in regional science, it is examined in detail later in this

chapter.

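Figure 6.1: Spatial patterns in equilibrium under various sets of parameters. (Grid of panels: rows $p$ = 0.1, 0.3, 0.5, 0.7, 0.9, 1.1; columns $c$ = 0.01, 0.1, 0.3, 0.5, 0.7; color bar for population density with ticks 4, 16, 64, 256, 1024, 4096.)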


Evolution of Spatial Patterns

Another concern in spatial agglomeration is its dynamics, or how spatial

patterns emerge over time. Figure 6.2 shows the evolution of spatial patterns toward their equilibria for $p = 0.1, 0.3, 0.4, 0.5, 0.7$ with $c = 0.3$.

At $p = 0.1$, we can see that once a single dense cluster is formed, all agents who have chosen to relocate directly move to that cluster to create a mega-cluster.

As $p$ increases, however, this tendency changes. That is, agents first create small clusters all over the space and some of the small clusters start to grow further. This can be most prominently observed at $p = 0.5$ in the figure: at $t = 0.5$, there are only small clusters of equal size, but at $t = 1$, there have emerged a few larger clusters, which further attract other agents who may or may not belong to other clusters, thus creating a hierarchical structure in equilibrium.

The case where $p = 0.4$ elicits behavior somewhat in between that seen in the two examples above. Agents do make small clusters first; but, after creating a mega-cluster, some who are already in the “small” clusters also move to the mega-cluster, as do those who do not belong to any emerged clusters, resulting in a hierarchical structure. This implies that agents are repeating long-distance relocations to larger clusters.

At $p = 0.7$, agents choose to travel only for short distances to form relatively small clusters locally and do not create larger clusters.

The migration behaviors stated above are consistent with the classification of migrants by Ravenstein (1885); namely, (1) the long-journey migrant, (2) the migrant-by-stages, (3) the short-journey migrant and (4) the local migrant. In figure 6.2, the long-journey migrants are observed at $p = 0.1$ to $0.3$ and in late stages $(t \ge 1)$ at $p = 0.4$; the migrants-by-stages at $p = 0.4$ and $0.5$; and the short-journey migrants and local migrants are seen in early stages for $p = 0.4$ to $0.7$.

One could claim a condition for hierarchical structures of urban clusters from the fact that this model covers all types of migration patterns introduced by Ravenstein. Note, for example, that we observe realistic spatial patterns around $p = 0.4$ to $0.5$ under $c = 0.3$ (or the “transitional” area mentioned earlier in this section) where all or most classes of migrants exist. This might tell us that hierarchical structures take place when all types of migrants co-exist in the process of pattern formation.

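Figure 6.2: Evolution of spatial patterns at $c = 0.3$. (Grid of panels: rows $t$ = 0, 0.5, 1, 1.5, 2, 2.5; columns $p$ = 0.1, 0.3, 0.4, 0.5, 0.7; color bar for population density with ticks 4, 16, 64, 256, 1024, 4096.)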

Analysis

In order to examine the effects of the parameters $p$ and $c$ on the spatial patterns that emerge, a series of simulations were performed for analyses. Because the model is stochastic, analysis is often possible only by repeating the simulation many times under the same setting. For the analyses in this section, simulations were done 20 times for each set of $p$ and $c$.

Figure 6.3 shows the effect of $p$ and $c$ on the number of clusters in equilibrium. For any $c$, there emerges only one mega-cluster when $p$ is close to zero, and the number of clusters increases as $p$ increases. This is, of course, because the higher the transport cost, the harder it is for agents to travel long distances; therefore, they choose to form clusters near their original locations.

For small $c$, however, the number of clusters stays at one for relatively large $p$ (e.g. for $c = 0.1$, it takes $p = 0.5$ for the number of clusters to be more than one). This is because when $c$, which is the magnitude of diseconomy, is small, the attractive force of the mega-cluster is large enough to compensate for higher travel costs. Conversely, when $c$ is large, agents are more sensitive to transport cost.

As $p$ further increases, the increase in the number of clusters diminishes. This should be mainly because of the fact that the space is discrete. Notice that when $c$ is smaller, it requires fewer cells to hold the same number of agents, making it possible for the space to contain more distinct clusters. In fact, this is the reason that in equilibrium, smaller $c$ makes more clusters.

Figure 6.3: Effect of $p$ and $c$ on number of clusters. (Plot of number of clusters against $p$, one curve for each of $c$ = 0.1, 0.3, 0.5, 0.7, 0.9.)

Let us consider the role of the two parameters in terms of the size of clusters. Figure 6.4 shows the average size of clusters, measured by the number of agents that belong to a cluster, for the same sets of $p$ and $c$ as in figure 6.3. Since the total number of agents is 2500, each point in figure 6.4 is equal to 2500 divided by the number of clusters in figure 6.3.

For small $p$ and $c$, the size of clusters is 2500 for the same reason that the number of clusters is one, as explained above. Similarly, it decreases as $p$ increases for the same reason stated above. For large values of $p$, the differences between values of $c$ appear marginal compared to those for small values of $p$ (although differences do exist, as seen in figure 6.3).

Figure 6.4: Effect of $p$ and $c$ on average cluster size. (Plot of average size of clusters, in number of agents, against $p$, one curve for each of $c$ = 0.1, 0.3, 0.5, 0.7, 0.9.)

Recall that in figure 6.1, we have observed that certain combinations of $p$ and $c$ apparently result in hierarchical structures and other combinations of $p$ and $c$ make either one mega-cluster or many small clusters of approximately equal size. It would be of interest to quantitatively classify $p$ and $c$ by those three types of spatial patterns.

One way to assess the classifications above is to compute the variance of

cluster sizes. Notice that (1) if there is only one cluster in the space, the variance

should be equal to zero; (2) if there are only small clusters of approximately equal

size, the variance should still be small and (3), if the sizes of clusters describe a

hierarchy, the variance of clusters’ sizes should be larger than that of the two other

spatial patterns.

Figure 6.5 shows the variance of cluster sizes on the $p$-$c$ space. There clearly exists an area with large variance, so we now know that this model will generate a hierarchical structure as long as $p$ and $c$ are within that area. Below that area, where the variance is zero in the figure (the flat deep blue area with low $p$ and low $c$), is the domain where we observe a mega-cluster. Above the area (high $p$ and high $c$) is the domain where there emerge small clusters of approximately equal size. For the simulations performed for this analysis, the variances in this domain ranged approximately from 500 to 50,000.

Figure 6.5: Variance of cluster sizes on the $p$-$c$ space.

Discussion

The purpose of this chapter is to offer an example of the generalized spatial

agglomeration model developed in chapter 4, and illustrate the behaviors of the

generalized model with simple settings. It has been shown that the desired

characteristics of the space and the preferences of agents can be incorporated into the

generalized model using the Q-vector and utility functions for global and local

searches. It also has been shown that, in this simple version of the generalized model,

two parameters—those for travel cost and diseconomy—determine the resulting


spatial patterns that include (1) one mega-cluster, (2) many small clusters of

approximately equal size and (3) hierarchical structure.

Recall that, in the highly simplified model presented in chapter 3, hierarchical

spatial patterns cannot be sustained over time. In fact, the only possible spatial

patterns in equilibrium are the single mega-cluster or multiple clusters of exactly equal

size. Note that the utility functions of the model in this chapter are analogous to those in the highly simplified model in chapter 3; yet the model in this chapter has clearly been shown to sustain hierarchical structures, and in that sense to be more realistic.

Even though the generalized model in this chapter does produce relatively

realistic spatial patterns, the conditions under which those spatial patterns emerge are

restricted to a small range of parameters as seen in figure 6.5. In other words, a

hierarchical structure in this model would be inherently unstable, collapsing with very

little change in either travel cost or the coefficient of diseconomy.

An Experiment—Heterogeneous Distribution of Transport Cost

Recall that, earlier in this chapter, it was pointed out that hierarchical structures emerge when there are all kinds of migrants (as introduced by Ravenstein). The instability of hierarchical structure arises from the fact that only a small range of both $p$ and $c$ allows multiple migrant types to exist. One way to solve this problem is, for instance, to assign different travel costs to the agents.

Figure 6.6 shows sample spatial patterns that emerge under such a circumstance. The travel costs were uniformly assigned to agents with $p$ ranging from 0 to 1. This specification of travel cost clearly offers stable hierarchical structures with the primate city except for $c = 0.1$. The reason that it does not when $c$ is small can be explained using figure 6.5: when $c$ is very small, all agents form a single mega-cluster for every $p$ less than 1; in this case, the short-journey migrants do not exist, and thus there is no hierarchy. Similarly, one could also impose different distributions of transport costs and/or heterogeneous diseconomy coefficients if desired.

Figure 6.6: Spatial patterns with travel cost uniformly distributed from 0 to 1. (Panels for $c$ = 0.1, 0.3, 0.5, 0.7, 0.9; color bar for population density with ticks 4, 16, 64, 256, 1024, 4096.)

The Rank-Size Rule

Although we have charted, in figure 6.5, the area of $p$ and $c$ that generates hierarchy, let us further examine the model’s consistency with the real world.

The Rank-Size Rule is a commonly used theory, originally claimed by Zipf

(1949). According to this theory, the second largest city should contain 1/2 as many

people as the largest, the third should contain 1/3, the fourth should contain 1/4, and

so on; in other words, a city’s population weighted by its rank is expected to be

roughly constant. Using this empirical rule, one can evaluate the performance of an

urban agglomeration model.

To quantify “how consistent a population distribution is with the rank-size

rule,” one way is to use the variance of the product of the cluster size times its rank. If

a population distribution is perfectly consistent with the rank-size rule, the variance

should be equal to zero. As it becomes less consistent with the rank-size rule, the

variance should increase. In addition, in order for this index to be independent of the

total population size, divide the variance by the square of the total population. The

resulting value should be able to measure the closeness to the perfectly rank-sized

hierarchical structure. Let us call it the RSR index.

For example, consider the perfectly rank-sized distribution where the largest

cluster contains 1200 agents; the second largest contains 600 agents; the third contains

400 agents; and the fourth contains 300 agents, as shown in table 6.1. In this case, of

course, the RSR index is zero. In table 6.2, the population distribution is a similar

hierarchical structure, but it is slightly closer to the uniform distribution than the

previous case. In this case, the RSR index is 0.0036.

Rank   Size (total = 2500)   Weighted Size
1      1200                  1200
2      600                   1200
3      400                   1200
4      300                   1200
Variance                                  0
RSR index (normalized by 2500^2)          0

Table 6.1: Perfectly Rank-Sized Distribution

Rank   Size (total = 2500)   Weighted Size
1      1000                  1200
2      700                   1400
3      500                   1500
4      300                   1200
Variance                                  22500
RSR index (normalized by 2500^2)          0.0036

Table 6.2: Not Rank-Sized Hierarchical Distribution
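The RSR index described above can be sketched as follows. The function names are hypothetical, and the use of the sample variance (dividing by the number of clusters minus one) is an assumption; it is the convention that reproduces the variance of 22500 from the weighted sizes printed in table 6.2.

```python
from statistics import variance


def weighted_sizes(sizes):
    """Multiply each cluster size by its rank (1 = largest cluster)."""
    ranked = sorted(sizes, reverse=True)
    return [rank * size for rank, size in enumerate(ranked, start=1)]


def rsr_index(weighted, total_population):
    """Sample variance of the rank-weighted sizes, normalized by total population squared."""
    return variance(weighted) / total_population ** 2


# Table 6.1: a perfectly rank-sized distribution gives an index of zero.
table_6_1 = rsr_index(weighted_sizes([1200, 600, 400, 300]), 2500)

# Table 6.2: the weighted sizes exactly as printed in the table.
table_6_2 = rsr_index([1200, 1400, 1500, 1200], 2500)
```

Dividing by the squared total population makes the index comparable across simulations and data sets with different numbers of agents, which is the stated purpose of the normalization.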

Figure 6.7 illustrates how the RSR index changes over the values of $p$ under $c = 0.2$, where each point shows the average of the RSR indices obtained from 50 simulations. The RSR index appears to have its minimum level, or 0.006, at $p = 0.59$. According to data from the US Census 1996, the RSR index of the United States ranges from approximately 0.001 to 0.006$^{20}$, so the model successfully gives realistic spatial patterns as far as population distribution over cities is concerned.

Similarly, in figure 6.8, other values of $p$ that offer the minimum RSR indices are overlaid on figure 6.5. It is shown that the combinations of $p$ and $c$ that generate the most realistic distributions reside on the upper edge of the strip with high variances in the figure.

Figure 6.7: RSR index vs. $p$ under $c = 0.2$. (Plot of RSR index against $p$ for $p$ from 0.5 to 0.9.)

20 This value depends on how many cities are included in the calculation. I used the largest 10 to 50

cities.


Figure 6.8: Set of p and c that offer realistic population distributions added to figure

6.5.

CHAPTER SEVEN:

APPLICATION #2: RESOURCE DEPLETION AND POLLUTION

ACCUMULATION

The settings in the previous application have only homogeneous land and the

qualities of the land do not change over time. In other words, all elements in the Q-

vector are functions of population only. Hence, since there is no population growth, no

substantial dynamics is defined in the ODE of the Q-vector as shown in equations (22)

and (23).

In this chapter, resources and pollution are introduced as properties of the

space to demonstrate how the properties of cells are used in the generalized spatial

agglomeration model, and to observe how they affect the resulting spatial patterns the

agents create.

Specifications

Q-Vector

In addition to the two properties, $q_{i,1}$ and $q_{i,2}$, in the previous application, there are two new properties of cell $i$. Define $q_{i,3}$ as the amount of resources that are consumed by agents in $i$ and $q_{i,4}$ as the level of the pollution from which agents in $i$ would suffer. Thus, the Q-vector is:

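$$Q_i = \begin{pmatrix} q_{i,1} \\ q_{i,2} \\ q_{i,3} \\ q_{i,4} \end{pmatrix} = \begin{pmatrix} n_i \\ -c\,n_i^2 \\ c_r \times (\text{amount of resources at } i) \\ -c_p \times (\text{pollution level at } i) \end{pmatrix} \tag{30}$$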

where $c_r$ and $c_p$ respectively represent each agent’s sensitivities to resources and pollution.

ODE

As in the previous application, there is no birth-death process, so $q_{i,1}$ and $q_{i,2}$ do not change endogenously. Resources are consumed by each agent in $i$ at a constant rate $r_\phi$, so the amount $q_{i,3}$ depletes by $r_\phi\, q_{i,1}$ for each unit of time since $q_{i,1}$ is the population in cell $i$. Similarly, pollution level is raised by each agent in $i$ at a rate $r_\tau$, so $q_{i,4}$ increases by $r_\tau\, q_{i,1}$ for each unit of time. Therefore, the ODE of the Q-vector is written as:

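$$\dot{Q}_i = \begin{pmatrix} \dot{q}_{i,1} \\ \dot{q}_{i,2} \\ \dot{q}_{i,3} \\ \dot{q}_{i,4} \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \\ -r_\phi\, q_{i,1} \\ r_\tau\, q_{i,1} \end{pmatrix} \tag{31}$$

with $q_{i,3} \ge 0$ (note that the amount of resources cannot be negative, so once $q_{i,3}$ reaches zero, it stays zero).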
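Because the population of a cell is constant between two migrations, equation (31) can be advanced in closed form rather than with a numerical integrator. The sketch below illustrates this for a single cell; the function name and the tuple layout `(q1, q2, q3, q4)` are assumptions for the example, and `r_phi` and `r_tau` stand for $r_\phi$ and $r_\tau$.

```python
def advance_cell(q, dt, r_phi, r_tau):
    """Advance one cell's Q-vector (q1, q2, q3, q4) by dt under equation (31)."""
    q1, q2, q3, q4 = q
    # Resources fall linearly with the resident population and never go below zero.
    q3_new = max(q3 - r_phi * q1 * dt, 0.0)
    # Pollution accumulates linearly with the resident population.
    q4_new = q4 + r_tau * q1 * dt
    return (q1, q2, q3_new, q4_new)


# A single agent in a cell holding one unit of resources, with r_phi = 0.2:
# the resources are exhausted after five units of time and then stay at zero.
after_5 = advance_cell((1.0, -0.3, 1.0, 0.0), dt=5.0, r_phi=0.2, r_tau=0.1)
after_10 = advance_cell((1.0, -0.3, 1.0, 0.0), dt=10.0, r_phi=0.2, r_tau=0.1)
```

Clamping with `max` is what implements the side condition $q_{i,3} \ge 0$; without it the linear depletion would drive the resource term negative.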

The initial condition for the population is the same as before; $q_{i,1}(0)$ is either zero or a random positive integer (≤ 2500) that satisfies equation (24). As for the initial condition of resources, we will try different distributions. Also, we assume pollution does not exist at the beginning of time, so $q_{i,4}(0) = 0$ for all $i$. Thus, write:

$$Q_i(0) = \begin{pmatrix} q_{i,1}(0) \\ -c\,\bigl(q_{i,1}(0)\bigr)^2 \\ q_{i,3}(0) \\ 0 \end{pmatrix} \tag{32}$$

Utility Functions

For the global search, simply add $q_3$ and $q_4$ to equation (25) and take the linear sum; namely,

$$\Gamma(Q_i) = q_{i,1} + q_{i,3} + q_{i,4} \tag{33}$$

Hence, the utility function for the global search is:

$$G_i = \frac{\Gamma(Q_i)}{d_{ij}^{\,p}} = \frac{q_{i,1} + q_{i,3} + q_{i,4}}{d_{ij}^{\,p}} \tag{34}$$

For the local search, take the linear sum of all elements in the Q-vector.

$$\Lambda(Q_h) = q_{h,1} + q_{h,2} + q_{h,3} + q_{h,4} \tag{35}$$

Moreover, we assume that each agent cannot use the resources in her neighborhood

except for that in her own cell, but does suffer from the pollution produced in her

neighborhood. Therefore,

$$V = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 1 \end{pmatrix} \tag{36}$$

Thus, the utility function for the local search is:

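$$\begin{aligned} L_h &= \Lambda(Q_h) + \sum_{k \,\in\, h\text{'s neighborhood}} \Lambda(V \otimes Q_k) \\ &= q_{h,1} + q_{h,2} + q_{h,3} + q_{h,4} + \sum_{k \,\in\, h\text{'s neighborhood}} \bigl(q_{k,1} + q_{k,4}\bigr) \end{aligned} \tag{37}$$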

Simulation Results and Discussions

Case 1: Random endowment of resources, No pollution, cr=50, rφ=0.2

and p=0.1.

For our first example, let us focus on the effect of limited resources distributed over the space. The initial distribution of resources is random; $q_{i,3}(0)$ is a random value in [0,1] (see figure 7.1). The coefficient for resources, $c_r$, is 50, which enables a cell with enough resources to be more attractive than increased productivity simply by agglomeration. The depletion rate of resources is 0.2; therefore, a single agent takes at most five units of time to deplete all the resources in a cell. Transport cost is low enough $(p = 0.1)$ that agents are able to travel for long distances. The parameter $c$ is 0.3. Finally, we assume that agents’ decisions are not influenced by pollution for this example; thus $c_p = 0$. All other conditions are the same as before.

Figure 7.1: Initial distribution of resources for Case 1.

Figure 7.2 shows an example of simulations under these conditions. The agents first move to nearby cells that are initially endowed with more resources $(t = 1)$. As a result, those cells attract more agents due to their resources and agglomeration economies $(t = 2$ to $3)$. As the simulation advances, however, the resources in those clusters are depleted and the agents start to move out from the clusters $(t = 4)$. Then, the agents create new clusters where there are still resources $(t = 5$ to $7)$. When most of the resources in the whole space have been consumed, agents begin to decide more by economies of scale than by the amount of resources, making larger clusters $(t = 8$ to $10)$. Eventually, there emerges only one mega-cluster in the entire space $(t = 11)$.

Recall that when $p = 0.1$ and $c = 0.3$ in the simpler version presented in the

previous chapter, agents form a mega-cluster from the beginning of simulations (see

figure 6.2). There, it takes only about two units of time until most agents create the

final mega-cluster whereas with the presence of resources it takes about 10 units of

time.

Notice also that before the simulation reaches its equilibrium, the spatial

pattern exhibits a hierarchical structure $(t = 8)$. Although this is not a sustainable

hierarchical structure, it should be noted that there are factors other than the specific

range of transport costs that could cause hierarchical structures to emerge.

Figure 7.2: Formation of clusters when resources exist with low transport cost. (Panels for $t$ = 0 to 11; color bar for population density with ticks 4, 16, 64, 256, 1024, 4096.)

Case 2: Random endowment of resources, No pollution, cr=50, rφ=0.2

and p=0.5.

This is the case where $p = 0.5$, with all other parameters unchanged from the previous example. A sample result is shown in figure 7.3. Unlike the case with $p = 0.1$, the agents do not create "temporary" clusters that are fated to disappear before the equilibrium. Instead, they stay in their original locations or make only short trips to consume local resources first ($t = 1$–$2$). Only after most of the resources are depleted do the agents start forming clusters ($t = 3$), and eventually the system reaches its equilibrium ($t = 7$).

Clearly, the presence of resources delays the formation of clusters: compare the path that the same set of $p$ and $c$ describes in figure 6.2 with that in figure 7.3 (the spatial pattern at $t = 1$ in figure 6.2 is close to the one at $t = 4$ in figure 7.3). The spatial pattern at equilibrium appears to be unaffected by the resources.

[Figure 7.3 panels: snapshots at $t = 0, 1, \ldots, 7$. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 7.3: Formation of clusters when resources exist with high transport cost.

Case 3: Random endowment of resources, $c_p = 0.1$, $r_\tau = 0.1$, $c_r = 50$, $r_\phi = 0.2$ and $p = 0.1$.

We now add pollution to Case 1. The agents' sensitivity to pollution, $c_p$, is 0.1 instead of zero and the accumulation rate, $r_\tau$, is 0.1. All other parameters are the same as in Case 1.

Figure 7.4 is a sample result under these conditions. We observe that the result shows a behavior similar to Case 1 up to the point when the agents form a mega-cluster ($t = 0$–$12$). In Case 1, this would be the equilibrium and the cluster would stay there forever. With pollution, however, the cluster stays only for a while ($t = 14$–$19$) while agents accumulate pollutants in the cluster. When the pollution reaches a certain level, the agents start to "escape" from the cells with pollutants to other cells nearby ($t = 20$–$22$). After agents finish this group migration, they stay in the new cluster for a while ($t = 24$–$26$) and move again ($t = 28$–$30$), repeating the process described above.

[Figure 7.4 panels: snapshots at $t = 0, 2, \ldots, 30$. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 7.4: Formation of clusters and behavior of mega-cluster with both resources and pollution (low transport cost).

Case 4: Random endowment of resources, $c_p = 50$, $r_\tau = 1$, $c_r = 50$, $r_\phi = 0.2$ and $p = 0.1$.

In this example, $c_p$ and $r_\tau$ have been increased to 50 and 1 respectively, with other parameters held unchanged from Case 3. In other words, this is the case where pollution is dominant.

The distinctive spatial pattern under this condition is shown in figure 7.5. We clearly observe the formation of "orphaned" cells instead of clusters that consist of cells adjacent to each other. This happens because each agent is sensitive to the pollution in his neighborhood, not only in his own cell; thus, he avoids having neighbors in order to minimize the negative effect of pollution.

[Figure 7.5 panels: snapshots at $t = 0, 2, 4, 6$. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 7.5: Formation of clusters under larger influence of pollution.


Case 5: Skewed endowment of resources, no pollution, $c_r = 50$, $r_\phi = 0.2$ and $p = 0.5$.

This case is the same as Case 2 except that the initial distribution of resources is not random. The resources are initially distributed with weights given by a Gaussian distribution whose peak is at the center of the space, as shown in figure 7.6. This is an example where a specific part of the space has an absolute advantage that does not last (e.g., mining).
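A skewed endowment of this kind can be sketched as follows. This is only an illustration under stated assumptions: the function name `gaussian_endowment`, the width parameter `sigma`, and the choice to scale a uniform random draw by the Gaussian weight are hypothetical and are not taken from the original model.

```python
import math
import random


def gaussian_endowment(size, sigma, rng):
    """Return a size x size grid of resource levels in [0, 1].

    Assumption: each cell draws a uniform random value and scales it by a
    Gaussian weight that peaks at the center of the lattice.
    """
    center = (size - 1) / 2.0
    grid = []
    for row in range(size):
        line = []
        for col in range(size):
            dist_sq = (row - center) ** 2 + (col - center) ** 2
            weight = math.exp(-dist_sq / (2.0 * sigma ** 2))
            line.append(rng.random() * weight)
        grid.append(line)
    return grid


# Hypothetical parameters: a 50 x 50 lattice with a fairly narrow peak.
resources = gaussian_endowment(size=50, sigma=8.0, rng=random.Random(0))
```

With a small `sigma` almost all resources sit near the center, which is the situation in which a central cluster can form first and later lose its attraction as the resources are consumed.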

Figure 7.6: Initial distribution of resources for Case 5.

Figure 7.7 shows a result under this situation. First, due to the high concentration of resources, agents create the first cluster in the center ($t = 0.2$) and it grows ($t = 0.4$–$0.8$). However, as the central part of the space loses its resources, it also starts to lose inhabitants ($t = 1$). Eventually, it completely loses its attractive force and creates a sprawl ($t = 1.8$–$2.0$). Agents then develop satellite cities around the center of the space ($t = 3$–$7$).

This example demonstrates the model's potential to recreate complex phenomena in the real world. In the next example, we will see a somewhat different approach to simulating urban sprawl.

[Figure 7.7 panels: snapshots at $t = 0, 0.2, 0.4, \ldots, 2.0$ and $t = 3, 4, 5, 6, 7$. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 7.7: Behavior of agents with depletable resources concentrated in the center of the space.

Case 6: Skewed endowment of resources, $c_p = 1$, $r_\tau = 0.1$, $c_r = 50$, $r_\phi = 0$ (recoverable resources) and $p = 0.5$.

In the previous case, agents leave the center of the cluster because of the depletion of resources. In this example, resources are never depleted, or $r_\phi = 0$. (Imagine the resources to be wood or other kinds of plants that can be naturally replenished.) However, let us assume again that agents pollute indiscriminately, eventually expelling themselves ($c_p = 1$, $r_\tau = 0.1$).

A result is shown in figure 7.8. As in the previous example, agents start to create a mega-cluster in the center of the space ($t = 0.5$) and the cluster simply grows ($t = 1$–$3$) while they continue to pollute. Since the center of the cluster had immigrants first and has held agents longer than other areas, the pollution level there is the highest. Thus, the center loses its inhabitants first ($t = 4$), creating a hole in the cluster ($t = 4.5$–$6$). The hole continues to grow ($t = 6.5$–$7$). Eventually, the "ring" breaks into several smaller clusters that will continue to move around through the space ($t = 13$).

Notice that although the environment set for this example is quite different

from the previous case, it forms a similar spatial pattern in the early stages.

[Figure 7.8 panels: snapshots at $t = 0, 0.5, 1, \ldots, 7$ and $t = 13$. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 7.8: [...] the space and with accumulation of pollutants.

Case 7: Skewed endowment of resources, $c_p = 1$, $r_\tau = 0.1$, $c_r = 50$, $r_\phi = 0$ (recoverable resources), pollutant purification, and $p = 0.5$.

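In this example, the pollutants produced by agents diminish over time, as is sometimes the case in Nature. This can be represented by simply altering the fourth element of equation (31):

\[
\dot{Q}_i =
\begin{pmatrix} \dot{q}_{i,1} \\ \dot{q}_{i,2} \\ \dot{q}_{i,3} \\ \dot{q}_{i,4} \end{pmatrix}
=
\begin{pmatrix} 0 \\ 0 \\ -r_\phi\, q_{i,1} \\ r_\tau\, q_{i,1} - r_p \end{pmatrix}
\tag{38}
\]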

where $r_p$ is the rate of purification. Note that this rate does not depend on the population of agents since the purification is assumed to be accomplished by Nature.
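The resource and pollution dynamics of equation (38) can be illustrated with a simple forward-Euler update for a single cell. This is a minimal sketch, assuming that resources change at rate $-r_\phi q_{i,1}$ and pollution at rate $r_\tau q_{i,1} - r_p$; the function name `step_cell`, the step size `dt`, and the clamping of both quantities at zero are assumptions made for the illustration and are not stated in the text.

```python
def step_cell(population, resources, pollution, r_phi, r_tau, r_p, dt):
    """Advance one cell's resources and pollution by one Euler step.

    Assumed rates (one reading of equation (38)):
      d(resources)/dt = -r_phi * population
      d(pollution)/dt =  r_tau * population - r_p
    Both quantities are clamped at zero (an assumption, not in the source).
    """
    d_resources = -r_phi * population
    d_pollution = r_tau * population - r_p
    new_resources = max(0.0, resources + dt * d_resources)
    new_pollution = max(0.0, pollution + dt * d_pollution)
    return new_resources, new_pollution


# An occupied cell accumulates pollution; an abandoned cell is purified.
occupied = step_cell(10, 5.0, 0.0, r_phi=0.0, r_tau=0.1, r_p=0.5, dt=0.5)
abandoned = step_cell(0, 5.0, 2.0, r_phi=0.0, r_tau=0.1, r_p=0.5, dt=0.5)
```

Because purification does not depend on the population, a cell recovers only once enough agents have left it, which is the mechanism behind the periodic behavior described for this case.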

Figure 7.9 shows a sample simulation under $r_p = 1$. As in the previous case, the agents form a ring-like cluster ($t = 0$–$8$). Then, while agents are gone from the central part of the lattice, the pollutants are purged ($t = 10$–$12$). Since this area is still attractive because of its high concentration of (recoverable) resources, some agents move back to the central area, which they once abandoned ($t = 14$–$18$). They, however, produce pollutants again and evacuate that area ($t = 20$–$30$). The agents repeat this process and the system exhibits periodic behavior.

[Figure 7.9 panels: snapshots at $t = 0, 2, \ldots, 30$. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 7.9: [...] the space and with pollutant purification.

CHAPTER EIGHT:

APPLICATION 3: PROPAGATION

In the real world, population growth plays a significant role in the formation of

spatial patterns. One of the benefits of using agent-based models is that when we need

to deal with population growth, the procedure is straightforward—simply add more

agents.

The addition of new agents should be done by certain rules. For example, it is

reasonable to place the new agents in the cell in which other agents (or parents) live or

in their neighborhood. Also, the birth rate could be a constant or a function of the Q-

vector.

In this application, newborn agents are initially placed in the same cell as their parents. The birth rate is assumed constant: each agent gives birth to a new agent stochastically with a constant probability.
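The birth rule just described can be sketched as below. This is an illustrative sketch, not the original implementation: the function name `birth_step`, the dictionary representation of the lattice, and the use of one Bernoulli trial per agent per step are assumptions.

```python
import random


def birth_step(population, birth_prob, rng):
    """Apply one round of births in place and return the number of newborns.

    population: dict mapping a cell (row, col) to its number of agents.
    Each agent independently produces one newborn with probability
    birth_prob, and the newborn is placed in the parent's cell.
    """
    newborns = {}
    for cell, count in population.items():
        births = sum(1 for _ in range(count) if rng.random() < birth_prob)
        if births:
            newborns[cell] = births
    for cell, births in newborns.items():
        population[cell] += births
    return sum(newborns.values())


# Hypothetical usage: a single seed agent and a birth probability of 0.1.
cells = {(25, 25): 1}
rng = random.Random(1)
history = [birth_step(cells, 0.1, rng) for _ in range(20)]
```

With only a handful of agents the number of births per step fluctuates strongly, which is consistent with the remark that the timing of the population explosion is stochastic at the early stages.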

Specifications

Q-Vector

The Q-vector of this version is the same as the one in the last chapter, or equation (30), except that we do not consider pollution (for the sake of simplicity). Thus,

\[
Q_i =
\begin{pmatrix} q_{i,1} \\ q_{i,2} \\ q_{i,3} \end{pmatrix}
=
\begin{pmatrix} n_i \\ -c\, n_i^2 \\ c_r \times \text{amount of resources at } i \end{pmatrix}
\tag{39}
\]

ODE

The main difference from previous applications is that $\dot{q}_{i,1}$ is no longer always equal to zero. In fact, if we have a constant probability of birth, $\dot{q}_{i,1}$ can be any value in [0, 1] and it changes at every moment. However, on the average or when $q_{i,1}$ is large enough, $\dot{q}_{i,1}$ becomes equal to the constant probability. Thus,

\[
\dot{Q}_i =
\begin{pmatrix} \dot{q}_{i,1} \\ \dot{q}_{i,2} \\ \dot{q}_{i,3} \end{pmatrix}
=
\begin{pmatrix} r_n \\ -c\, r_n^2 \\ -r_\phi\, q_{i,1} \end{pmatrix}
\tag{40}
\]

where $r_n$ is the probability of birth. Note that we assume that a single agent can give birth to a new agent without a mate although this is not the case for human beings. To solve this problem, consider the agents as family units rather than individuals. For future studies, this can be improved by introducing gender and by imposing stricter conditions on propagation.

At $t = 0$ only one agent is placed in one cell (footnote 21), so the initial condition for equation (40) is:

\[
Q_i(0) =
\begin{pmatrix} q_{i,1}(0) \\ -c\,\bigl(q_{i,1}(0)\bigr)^2 \\ q_{i,3}(0) \end{pmatrix}
\tag{41}
\]

where $q_{i,1}(0) = 0$ for $i = 1, 2, \ldots, m-1, m+1, \ldots, N$ and $q_{m,1}(0) = 1$. The initial distribution of resources, or $q_{i,3}(0)$, is the same as Case 1 in the previous chapter; $q_{i,3}(0)$ takes a random value in [0, 1] for all $i$ (see figure 7.1).

21 Although the first agent is located in the center of the lattice in the figures in this chapter, where the "center" is does not affect the result since the space is a torus.

Simulation Results

Figure 8.1 shows sample results for three different transport costs ($p = 0.1$, 0.5 and 0.9). The probability of birth, $r_n$, was set to 0.1. The graph on the top row, for each level of transport cost, shows the population growth (dashed lines) and the change in the number of clusters (solid lines). Note that although the population explosions take place at different times for those three examples (e.g., at $t = 0$ for $p = 0.1$ and around $t = 20$ for $p = 0.5$), this is not because of different transport costs. Under the same probability of birth, they happen stochastically, especially at the early stages when there are but a small number of agents.

Initially, for each level of transport cost, when the agents have not formed

clusters, the population and the number of clusters are the same; thus, the dashed line

and the solid line overlap. For this period, individual agents appear to walk

randomly—they actually repeat short journeys seeking new resources in their

neighborhoods since resources are a more attractive force than agglomeration.

Eventually, resources in the whole space run short and agglomeration becomes more attractive to the agents. Consequently, the number of clusters diverges from the population trajectory (see the top graphs in figure 8.1) and the rate of growth of the number of clusters slows down. Finally, it starts to decline due to further agglomeration.
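The number of clusters plotted against the population can be computed in several ways. The sketch below is one possibility, under the assumptions that a cluster is a maximal group of occupied cells connected through Moore neighborhoods, that an isolated occupied cell counts as a cluster of its own, and that the lattice wraps around as a torus. The function name `count_clusters` is hypothetical.

```python
def count_clusters(occupied, size):
    """Count connected groups of occupied cells on a size x size torus.

    occupied: set of (row, col) cells that hold at least one agent.
    Two occupied cells belong to the same cluster when they are linked by a
    chain of Moore-neighborhood (8-cell) adjacencies, with wrap-around.
    """
    seen = set()
    clusters = 0
    for start in occupied:
        if start in seen:
            continue
        clusters += 1
        seen.add(start)
        stack = [start]
        while stack:  # depth-first flood fill of one cluster
            row, col = stack.pop()
            for d_row in (-1, 0, 1):
                for d_col in (-1, 0, 1):
                    neighbor = ((row + d_row) % size, (col + d_col) % size)
                    if neighbor in occupied and neighbor not in seen:
                        seen.add(neighbor)
                        stack.append(neighbor)
    return clusters


# Two adjacent cells form one cluster; the distant cell is a second one.
example = count_clusters({(0, 0), (0, 1), (5, 5)}, size=10)
```

Under this definition the cluster count equals the population as long as every agent sits alone in a cell with no occupied neighbors, and it falls below the population once agents start to share cells or settle next to each other.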

Compare $p = 0.1$ and $p = 0.5$. For $p = 0.1$, the number of clusters begins to decline, or the agents start to agglomerate, when the population is around 60, whereas for $p = 0.5$ the population is around 100. This is because when transport cost is high, the population needs to be dense in order for an agent to find another agent in her vicinity to form a new cluster.

[Figure 8.1 panels: columns for $p = 0.1$, $p = 0.5$ and $p = 0.9$. Top row: number of agents and number of clusters (vertical axis, 0 to 200) against time (horizontal axis, 0 to 100). Remaining rows: snapshots at $t = 10, 20, \ldots, 100$. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 8.1: Propagation of agents.

Larger Space

Recall that in the previous example, as agents diffuse, some of them reach the lattice's boundaries, which are connected to each other. Therefore, the resulting spatial patterns are equivalent to those from a space of infinite extent with a "seed" agent deployed every 50 cells. By expanding the unit size of the space, we should be able to observe the spatial patterns where the boundary effect is reduced.
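The wrap-around of the lattice can be made concrete with a distance function on a torus. The sketch below assumes a Euclidean metric and a square lattice of side `size`; the function name `torus_distance` and the choice of metric are assumptions for illustration only.

```python
import math


def torus_distance(a, b, size):
    """Shortest Euclidean distance between cells a and b on a torus.

    a, b: (row, col) tuples; size: side length of the square lattice.
    Along each axis the shorter of the direct and wrapped paths is used.
    """
    d_row = abs(a[0] - b[0])
    d_col = abs(a[1] - b[1])
    d_row = min(d_row, size - d_row)
    d_col = min(d_col, size - d_col)
    return math.hypot(d_row, d_col)


# On a 50 x 50 torus, column 49 is only one step away from column 0.
near = torus_distance((0, 0), (0, 49), size=50)
```

On a lattice of side 50, no two cells are more than 25 steps apart along either axis, so agents spreading from one seed soon meet their own "images"; doubling the side to 100 postpones that boundary effect.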

In figure 8.2, a case of $p = 0.5$ is presented. The only difference from the previous example (see figure 8.1) is that the size of the space is now 100×100 cells instead of 50×50. Initially (up to $t = 30$), agents simply disperse, forming a "cloud" of agents as in the previous case for $t = 0$ to 30. Then, at $t = 40$, they form the first clusters in the center of the cloud due to the denser population in that area. This continues as the cloud expands while the initial clusters grow further ($t = 50$). At $t = 60$, we observe the hierarchical spatial pattern, where the larger clusters are located in the center and small ones are located in the outer tier.

[Figure 8.2 panels: snapshots at $t = 10, 20, \ldots, 60$. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 8.2: Propagation of agents in a larger space.

CHAPTER NINE:

APPLICATION 4: GEOGRAPHY

In order for the model to be applied to specific regions, it is essential to incorporate real geography into the space. The geographical features might include complex coastlines, political borders, lakes, rivers, mountains, etc., each of which has unique effects on agents' preferences. In this chapter, an example is presented to show that those geographical features can be incorporated into the Generalized Spatial Agglomeration Model simply by adding a new property to the Q-vector.

Specifications

Q-Vector

The first three elements of the Q-vector ($q_1$, $q_2$ and $q_3$) are the same as equation (39) in the previous chapter. In order to include geography in the model, we add another element to the vector; that is,

\[
Q_i =
\begin{pmatrix} q_{i,1} \\ q_{i,2} \\ q_{i,3} \\ q_{i,4} \end{pmatrix}
=
\begin{pmatrix} n_i \\ -c\, n_i^2 \\ c_r \times \text{amount of resources at } i \\ g_i \end{pmatrix}
\tag{42}
\]

where $g_i$ is an index that represents the "habitability" and the attraction level of location $i$. For example, if location $i$ is on the ocean or the lake, $g_i$ is 0, making it impossible for agents to live in $i$; if location $i$ is a habitable, but not so attractive, land (e.g., infertile or mountainous areas), then $g_i$ could be 1; if it is land facing the ocean (thus extremely attractive), $g_i$ may be 10.

ODE

Unlike the model in the previous chapter, the birth rate here is not constant. Instead, we assume that the more urbanized the location is, the fewer children people decide to have; thus, the birth rate is now a function of population. One way to realize this would be to divide the birth rate by the population. Also, the ODEs for $q_{i,2}$ and $q_{i,3}$ are inherited from equation (40) and $q_{i,4}$ is assumed to be constant (footnote 22). So, the ODE is written as:

\[
\dot{Q}_i =
\begin{pmatrix} \dot{q}_{i,1} \\ \dot{q}_{i,2} \\ \dot{q}_{i,3} \\ \dot{q}_{i,4} \end{pmatrix}
=
\begin{pmatrix} r_n / q_{i,1} \\ -c\, r_n^2 \\ -r_\phi\, q_{i,1} \\ 0 \end{pmatrix}
\tag{43}
\]

The initial condition also inherits elements from the previous chapter with an additional element for geographic features; that is,

\[
Q_i(0) =
\begin{pmatrix} q_{i,1}(0) \\ -c\,\bigl(q_{i,1}(0)\bigr)^2 \\ q_{i,3}(0) \\ q_{i,4}(0) \end{pmatrix}
\tag{44}
\]

where $q_{i,1}(0) = 0$ for $i = 1, 2, \ldots, m-1, m+1, \ldots, N$ and $q_{m,1}(0) = 1$. As in equation (41), $q_{i,3}(0)$ takes a random value in [0, 1] for all $i$. The initial condition for $q_{i,4}$ is given externally for each $i$, as shown in figure 9.1. As suggested earlier, cells facing the ocean or lakes are given high values (the ones in California have even higher values because of the attraction of gold early in the history of the United States). However, coasts further north than the Great Lakes receive a handicap due to their cold climate.

22 This assumption should be relaxed if we consider the time scale of natural history.

Figure 9.1: Initial condition for $q_{i,4}$ for all $i$ (North American Map).

Utility Functions

Unlike the utility functions used in previous chapters, we do not simply take the linear sum of all elements of the Q-vector. Instead, we treat $q_{j,4}$ in a special manner since it affects how each agent perceives the other elements. For example, if location $j$ is uninhabitable ($q_{j,4} = 0$) because it is on the ocean, then that fact should invalidate the other elements. Hence, an effective form for $\Gamma(Q_j)$ might be:

\[
\Gamma(Q_j) = q_{j,4}\,\bigl(q_{j,1} + q_{j,2} + q_{j,3}\bigr)
\tag{45}
\]

Thus, the utility function for the global search is:

\[
G_j = \frac{q_{j,4}\,\bigl(q_{j,1} + q_{j,2} + q_{j,3}\bigr)}{d_{ij}^{\,p_a}}
\tag{46}
\]

The parameter p has the subscript a because we distribute various transport costs

over the agents, as in Chapter 6.

Similarly, for the local search, the utility level solely from location $h$ is:

\[
\Lambda(Q_h) = q_{h,4}\,\bigl(q_{h,1} + q_{h,2} + q_{h,3}\bigr)
\tag{47}
\]

Let us make the same assumptions for the neighborhood effect as before for population and resources. In addition, the geographic features of location $h$'s neighborhood by themselves do not affect one's preferences over location $h$. Hence,

\[
V = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix}
\tag{48}
\]

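Therefore, the total utility level for the local search is:

\[
\begin{aligned}
L_h &= \Lambda(Q_h) + \sum_{k \,\in\, h\text{'s neighborhood}} V \otimes \Lambda(Q_k) \\
    &= q_{h,4}\,\bigl(q_{h,1} + q_{h,2} + q_{h,3}\bigr) + \sum_{k \,\in\, h\text{'s neighborhood}} q_{k,4} \cdot q_{k,1}
\end{aligned}
\tag{49}
\]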
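The role of the habitability index in equations (45) through (49) can be illustrated with a short sketch. It assumes the reading in which $q_4$ multiplies the sum of the other three elements, the global utility divides by distance raised to the agent's transport cost $p_a$, and only the neighbors' population (weighted by their own $q_4$) enters the local sum. The function names and the tuple representation of the Q-vector are hypothetical.

```python
def gamma(q):
    """Utility from one location: q4 * (q1 + q2 + q3), as in (45) and (47).

    q is a tuple (q1, q2, q3, q4); q4 = 0 (e.g., ocean) zeroes the result.
    """
    q1, q2, q3, q4 = q
    return q4 * (q1 + q2 + q3)


def global_utility(q_j, distance, p_a):
    """Global-search utility as in (46); distance must be positive."""
    return gamma(q_j) / distance ** p_a


def local_utility(q_h, neighbor_qs):
    """Local-search utility as in (49): own cell plus neighbors' q4 * q1."""
    return gamma(q_h) + sum(q_k[3] * q_k[0] for q_k in neighbor_qs)


# Hypothetical values: an ocean cell, a coastal cell and two neighbors.
ocean = (5.0, -1.0, 3.0, 0.0)
coast = (2.0, -0.5, 1.0, 10.0)
neighbors = [(3.0, -1.0, 0.0, 1.0), (4.0, -2.0, 0.0, 0.0)]
coast_local = local_utility(coast, neighbors)
```

An ocean cell yields zero utility no matter how large its other attributes are, and an ocean neighbor contributes nothing to the local sum, which matches the intent that uninhabitable locations invalidate the other elements.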

Simulation Results

Figure 9.2 demonstrates simulation samples under three ranges of transport

costs, which are randomly distributed over agents. Every time a new agent is born, it

is assigned a random transport cost within the specified range. Also, the first agent is

located in Philadelphia as the snapshots on the top row indicate.

The leftmost column shows the behaviors of agents with a low transport cost range ($0 < p < 1$). Since the agents have higher mobility, the early agents quickly leave the east coast and find the attractive lands on the west coast ($t = 10$). Then the offspring of those agents explore other areas all over the continent and find other attractive areas such as the Great Lakes and Florida ($t = 30$–$40$). Those in California and around the Great Lakes develop urban clusters ($t > 50$). Since most agents tend to belong to a cluster due to their low transport costs, the whole population does not grow fast. (Recall that the birth rate is inversely proportional to population density.)

The rightmost column shows a case of agents with high transport costs. Since $p$ can be up to 100, most of the agents are "sluggish". In fact, in the simulation shown in the figure, the early agents stay on the east coast for a while ($t < 30$) and start to shape the first clusters there. Then, instead of jumping to California, the agents first make short trips to the nearer side of the Great Lakes ($t = 40$). Sometime between $t = 40$ and 50, a few agents with low transport costs are born and they move to the west coast while some other agents stay around the Great Lakes ($t = 50$–$60$). Those agents with high transport costs slowly expand their territories and create clusters in the eastern half of the continent, whereas those in California form their own cluster by reproducing and by attracting newborn agents with low transport costs ($t > 60$).

The middle column shows a case between those explained above. The early agents do form clusters around the east coast first ($t < 50$), but as soon as more mobile agents are born they move to California and create new urban clusters ($t = 50$–$70$). Eventually, agents on both the east and west sides of the continent develop equally large clusters.

[Figure 9.2 panels: columns for $0 < p < 1$, $0 < p < 10$ and $0 < p < 100$; rows for $t = 0, 10, \ldots, 100$. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 9.2: Formation of urban clusters of North America simulated under various transport cost ranges.

As far as these three cases are concerned, the one with the medium transport cost range ($0 < p < 10$) appears more realistic than the others. Figure 9.3 shows various results of simulations ($t = 100$) under this transport cost range. Naturally, the resulting spatial patterns are different from each other due to the fact that the model is stochastic. In spite of the variety of the results, the east and the west coasts tend to have roughly equal populations.

[Figure 9.3 panels: A, B, C, D, E, F. Legend values: 4096, 1024, 256, 64, 16, 4.]

Figure 9.3: Snapshots of simulations at t = 100 under 0 < p < 10.

CHAPTER TEN:

CONCLUSIONS

This study serves four major purposes: (1) to introduce agent-based models to

regional science, (2) to construct a generic spatial agglomeration model, (3) to show

that the model offers hierarchical structures, and (4) to explore the model’s potential.

This chapter summarizes the results from previous chapters with regard to these

purposes.

In chapter 3, a very simple agent-based model for spatial agglomeration is

introduced. Although the model is highly simplified and there are unrealistic

assumptions, it offers general ideas as to how agent-based models work.

In chapter 4, a generalized spatial agglomeration model is presented. The

model is designed so that the user can “customize” it not only for urban agglomeration

but also for a wide variety of agglomeration phenomena in general, which is realized

by combining global and local search for agents’ migration behaviors. When using

this model, the user can specify (1) the attributes of location, (2) their dynamics and

(3) each agent’s preferences in a consistent way.

Chapter 5 explains the common procedures for the model’s applications

demonstrated in the following chapters.

The first application is shown in chapter 6. The factors involved in each agent's decision-making process are as simple as those in the model in chapter 3, but the model is rewritten in terms of the generalized model and it now includes the "global-local hybrid" search as well. The effects of the two key factors, transport cost and carrying capacity, are examined. In particular, it is shown that there is a set of transport costs and carrying capacities that generate a realistic spatial distribution, which is validated using a newly introduced index that measures how well the spatial distribution matches the Rank-Size Rule. The model thus offers realistic spatial patterns, and the results show that this study provides a reasonable answer as to how the Rank-Size Rule can be valid in the real world.

Recall that, as pointed out in chapter 2, Krugman's model produces only uniform spatial patterns with no concept of area. Schweitzer and Steinbrink's model does produce realistic spatial patterns, but the behavioral rules of agents lack economic sense (as they themselves point out, their model is a "physical" model); therefore, one cannot expect to use this model for policy analyses.

In chapter 7, resources and pollution are added in order to show how the

dynamics of spatial or land attributes affects agents’ behaviors. We observe that

additional parameters such as the depletion rate of resources, accumulation rate of

pollution and initial distribution of resources dramatically alter the resulting spatial

patterns and the existence of equilibria. It is also shown that this model can simulate

urban sprawls.

In chapter 8, this model is proven to be capable of simulating the propagation

of agents.

In chapter 9, a real North American map is incorporated into the model. It is

done by simply treating geographic features of each location in the same way as other

attributes (such as population and resources) without modifying the model itself. The

results show the potential that the model could be applied to real life problems within

the framework of the generalized spatial agglomeration model.

Axtell and Epstein also incorporated additional features such as resources, pollution, propagation, gender, etc. Their model, however, is not constructed in a highly generalized way, so it is not easy to "use" the model for specific purposes. With the Generalized Spatial Agglomeration Model presented in this study, on the other hand, it has been shown that one can include any quantifiable attributes of the space into the model in a consistent way, by introducing new elements to the Q-vector and defining their ODEs.

This study is the first to vigorously utilize agent-based models in the field of

Regional Science. It has proven that agent-based models are not only useful in the

field (especially when combined with existing methods), but they could also offer

good answers to the complex questions that traditional models cannot solve. It is

expected that agent-based models will be explored further in the coming decades and

will eventually become a norm for the types of analyses where realistic modeling is

required.

BIBLIOGRAPHY

Durrett, R. and Levin, S. A. “Stochastic Spatial Models: A User's Guide to Ecological Applications.” Philosophical Transactions of the Royal Society of London B 343, 1994. 329-350.

Epstein, J. M. and Axtell, R. L. Growing Artificial Societies. Washington, D.C.: Brookings Institution Press, 1996.

Flick, A. C. (ed.). History of the State of New York, Vol. V. New York: Columbia University Press, 1934.

Isard, W. Location and Space Economy: A General Theory Relating to Industrial Location, Market Areas, Land Use, Trade and Urban Structures. Cambridge: MIT Press, 1956.

Isard, W. et al. Methods of Interregional and Regional Analysis. Burlington: Ashgate, 1998.

Krugman, P. The Self-Organizing Economy. Malden: Blackwell, 1996. 88-92.

Orcutt, G. “From Engineering to Microsimulation.” Journal of Economic Behavior and Organization 14, 1990. 9-10.

Page, S. “On the Emergence of Cities.” Journal of Urban Economics 45, 1999. 184-208.

Ravenstein, E. G. “The Laws of Migration.” Journal of the Royal Statistical Society 48(2), 1885. 167-235.

Schweitzer, F. and Steinbrink, J. “Urban Cluster Growth: Analysis and Computer Simulation of Urban Aggregations.” Self-Organization of Complex Structures: From Individual to Collective Dynamics Ed. F. Schweitzer. London: Gordon and Breach, 1997. 501-518.

Zipf, G. Human Behavior and the Principle of Least Effort. Cambridge: Addison-Wesley Press, 1949.
