on the use of rational choice theory in evolutionary biology · 2019-01-17 · the transfer of...
TRANSCRIPT
1
On the Use of Rational Choice Theory in Evolutionary Biology
Samir Okasha, University of Bristol
Introduction
The aim of this paper is to offer some philosophical reflections on the use of rational choice
theory within evolutionary biology. Since the 1960s, concepts and models from rational
choice theory have often utilized by biologists, usually with some modifications, to shed light
on evolutionary phenomena. This is particularly true in behavioural ecology – the science that
studies animal behaviour from a Darwinian perspective. For example, Bayesian decision
theory has been used to study foraging behaviour in animals (e.g. Houston and McNamara
1999); non-cooperative game theory has been used to study the evolution of strategic
behaviour, a development that gave rise to evolutionary game theory (see Maynard Smith
1982, Hammerstein 2012 ); and bargaining theory has been used to study the division of
reproduction within animal societies (Johnstone and Cant 2009, Roughgarden 2010). More
recently, there have been attempts to use ideas from social choice theory to shed light on
biological scenarios involving individuals with divergent interests living in groups (Okasha
2009, 2012; Conradt and List 2009; Bossert, Qi and Weymark 2013).
From one perspective, the fruitful transfer of ideas from rational choice theory to
evolutionary biology is surprising, given that non-human animals have rather limited powers
of rational deliberation. What explains it? The answer, I conjecture, stems from the fact the
concept of utility in rational choice theory plays a rather similar role to the concept of fitness
in evolutionary theory. Just as rational choice theorists typically assume that agents make
choices that maximize their utility, so evolutionary biologists assume, and in some cases can
show, that animals will makes choices that maximize their Darwinian fitness (roughly,
expected number of offspring) or some proxy for it. Thus the utility-maximizing paradigm of
the rational choice theorist corresponds closely to the fitness-maximizing paradigm of the
evolutionary biologist. At this abstract level, it is thus easy to see why rational choice models
should have biological applications.
A related phenomenon is the widespread use of teleological language to describe both
the process of Darwinian evolution and its products. Though the process that Darwin
described – descent with modification – is of course purely naturalistic, biologists have often
2
characterized it in quasi-teleological terms, e.g. as a ‘hill climbing’ process in which natural
selection drives populations up peaks in an ‘adaptive landscape’, thus effecting
‘improvements’. (Though this is controversial – see below). Still more commonly, the
products of the evolutionary process – the highly-adapted organisms that we see today – are
described in intentional or teleological terms. Thus the behaviour of even the simplest
organisms is often treated as purposive, in that it has the ‘aim’ of increasing the organism’s
fitness; and we find frequent talk of the ‘interests’ and ‘strategies’ of an organism or gene, of
‘conflicts of interests’ between organisms and genes, of ‘selfish’ and ‘altruistic’ behaviours.
Language of this sort, which pervades the professional and the popular biological literature, is
the informal counterpart to the explicit use of rational choice models to study animal
behaviour.
The transfer of ideas from rational choice theory in evolutionary biology raises a
number of interesting questions, both philosophical and technical, which have been the focus
of my research for the last six years or so. Here I offer an overview of some of this research,
with a focus on broad conceptual issues. Further details may be found in my previously
published papers, listed in the References section, some of which I draw on below.
The structure of this paper is as follows. Section 1 distinguishes two different ways in
which rational choice concepts have been applied to evolution, and discusses their relation.
Section 2 examines the long-standing opposition among some evolutionary theorists to the
notion of optimization, and explores the consequences for the evolution / rational choice
connection. Section 3 explores the parallel between utility and fitness, with a focus on
measurement-theoretic issues. Section 4 asks whether in applying rational choice theory to
biology, we should think of genes, individuals, groups, or some other units as the agents
doing the maximizing. Section 5 focuses on social behaviour, outlines Hamilton’s inclusive
fitness concept, and describes a recent attempt to integrate the concept into decision theory.
Section 6 discusses two scenarios in which it has been suggested that evolution will give rise
to non-rational behaviour. Section 7 concludes.
1. Evolution and the Rational Choice Metaphor: Product versus Process
The notion of optimization (or maximization) is central to rational choice theory. An agent
whose choice behaviour meets certain highly intuitive rationality constraints, notably
3
transitivity, may be represented as maximizing a utility function, or solving an optimization
problem. This simple idea lies at the heart of much of economic theory. The notion of
maximizing is also central to evolutionary biology, where ‘fitness’ is the quantity usually said
to be maximized. Thus it is natural to expect a connection between rational choice theory and
evolutionary theory, and the literature is replete with suggestions that there is such a
connection.
In fact, there are two quite different ways in which rational choice ideas can be
applied to evolution, depending on whether we focus on the process of evolution or its
product. Taken the former way, the idea is that natural selection may be viewed as a process
in which ‘mother nature’ chooses between alternative phenotypes, or traits, depending on
their biological fitness in the relevant environment. So the agent that does the choosing is
mother nature (i.e. a personification of natural selection); and the objects that she chooses
between are alternative genotypes or phenotypes (or in some versions of the metaphor,
alternative distributions of phenotypes in a population). Since mother nature’s choices will
presumably satisfy transitivity, they can be regarded as rational. Thus it seems that the
concept of rational choice may be applied, metaphorically, to any process of natural selection.
The metaphor helps articulate the intuition, popular among some biologists, that the
Darwinian process, though causal, is nonetheless still teleological in a sense.
Taken the second way, the idea is that the choice behaviour of evolved organisms
may be viewed through the lens of rational choice theory. The putative justification for this is
that since organisms have evolved by natural selection, their behaviour, including their choice
behaviour, is presumably adaptive, or “ecologically rational”, and should therefore satisfy
standard rational choice principles. Thus faced with a choice between alternative actions, an
evolved organism is expected to choose the fitness-maximizing one, and will thus exhibit
complete and transitive choice behaviour. Note that talk of an organism ‘choosing’, in this
context, is understood in an ‘as if’ sense: organisms are assumed to exhibit behavioural
plasticity, not to be capable of conscious deliberation. In this sense, organisms as simple as
bacteria can be regarded as making choices. The point of using rational choice theory is then
to help explicate, in a precise way, the assumption that an organism’s behaviour should be
adaptive.
These two ways of applying rational choice concepts to evolutionary biology are not
exclusive, for they have different foci. The former concerns the evolutionary process; the
4
suggestion is that this process may usefully be viewed as one in which some variants are
chosen, or preferred, over others, by a rational agent trying to achieve a goal. The second
concerns the behaviours that are produced by that process; the suggestion is that these
behaviours may usefully be viewed as if the organism performing them were a rational agent
trying to achieve a goal. In both cases, the ‘goal’ is supposed to be maximization of fitness,
though what exactly this means needs to be spelled out.
Interestingly, the notion of optimization has a rather chequered history within
evolutionary biology. Some biologists routinely assume that organisms will behave as if
maximizing their fitness, or some proxy for it (e.g. probability of survival); this is a standard
assumption in behavioural ecology and also in life-history theory, and models based on that
assumption enjoy considerable empirical success (cf. Krebs and Davies 1997, Stearns 1992).
However in other quarters the idea that natural selection optimizes is regarded as
demonstrably untrue, apart from in the simplest cases; since the 1960s this has been the
conventional view in mathematical population genetics, and has been reinforced by recent
developments in adaptive dynamics and evolutionary game theory (e.g. Ewens 2004, Rice
2005, Metz, Mylius and Diekmann 2008).
This apparent inconsistency is partly due to the fact that the two camps are focusing
on somewhat different issues, as Grafen (2006) has insightfully pointed out (though this is
not the whole story). In terms of our distinction above, the opponents of optimization are
generally focusing on the evolutionary process itself, while the proponents are generally
focusing on the behaviour of evolved organisms. In the light of this, it is worth probing more
deeply the validity of the rational choice analogy as applied to evolutionary biology. To do
this we focus briefly on the status of optimization ideas in biology, as applied to both the
process and the products of Darwinian evolution, examining the objections to them; and we
trace the implications for the rational choice analogy.
2. Optimization and its discontents
2.1 The process version
The optimizing view of the process of evolution by natural selection, and the problems it
faces, can be appreciated by recalling the ‘adaptive landscape’ metaphor of evolution.
Introduced by the American geneticist Sewall Wright (1932), the adaptive landscape pictures
5
evolution as a hill-climbing process in which natural selection pushes populations up ‘peaks’
in the landscape. The landscape (in one of its versions) is a based on a function from the
genetic composition of a population, represented by a vector of allele frequencies, to the
population’s mean fitness w . Peaks in the landscape correspond to populations with (locally)
maximal mean fitness. A simple adaptive landscape in three dimensions, involving just two
genes, is depicted below; note that any real case will be highly multi-dimensional.
Figure 1: The Adaptive Landscape
Wright argued that natural selection would tend to push a population up the steepest
peak in the adaptive landscape, by changing a population’s genetic composition so as to
increase its mean fitness; thus mean fitness was akin to a ‘potential function’ in physics. This
idea, and the adaptive landscape metaphor itself, became central to the thinking of many
evolutionary biologists in the 20th
century. However in fact, it is only true in special cases
that selection tends to increase w , a fact known since the 1960s (cf. Edwards 1995, Ewens
2004). If we consider selection at a single genetic locus, with fixed genotype fitnesses, then it
6
is true that w must always increase; but the argument cannot be generalized to more
complicated genetic architectures (e.g. two-locus cases), nor to cases where genotype
fitnesses vary over generations. Thus the ‘hill-climbing’ view of evolution is only defensible
in special cases.
This conclusion is reinforced by recent work on adaptive dynamics – a version of
evolutionary game theory that explicitly incorporates feedback between an evolving trait and
the ecological environment. This literature emphasizes that the fitness of a particular trait, or
genetic variant, is environment-relative, and that the relevant ‘environment’ typically
includes the frequency of the trait in the population, and the population density. (This is true
of many behavioural traits and most life-history traits.) In these circumstances, it is intuitively
clear that mean fitness need not be maximized. In fact, a stronger conclusion emerges,
namely that the evolutionary process, even when driven exclusively by selection, does not
necessarily maximize any quantity at all (so does not “obey an optimization principle”, as it is
often put). One way to see this is to note that the dynamics may be cyclical, never settling
down to an equilibrium (cf. Metz, Mylius and Diekmann 2008).
Despite all this, the idea that Darwinian evolution is an optimizing process still has
wide currency in biology, and is sometimes treated as trivially true. What explains this? The
answer, I think, is that a weaker claim is often intended, to the effect that at any given point in
time, selection will favour fitter over less fit variants, and will thus tend to modify a
population’s composition in the direction of higher fitness if the environment remains
constant. This weaker claim is indeed true, and follows from R.A. Fisher’s famous
‘fundamental theorem of natural selection’ (on the ‘modern interpretation’ of the theorem;
see Edwards 1994, Okasha 2009). Fisher claimed that this theorem revealed the tendency of
natural selection to “improve” organisms, an assertion with clear teleological overtones.
What renders this compatible with the fact that w does not always increase (a point that
Fisher had himself emphasized against Wright) is the fact that the ‘environment’ typically
changes over generations; so the ‘improving tendency’ need not actually be realized, and in a
population of constant size, it cannot be.
Where does this leave the rational choice metaphor as applied to the evolutionary
process? The answer, I think, is that the metaphor is partially tenable. At any point in time,
mother nature chooses between variant phenotypes according to their ‘fitness’ in the
environment at the time, and so is akin to a rational agent maximizing a utility function. But
7
mother nature has schizophrenic tastes: her utility function is continually changing, and so the
long-term evolutionary dynamics of the population cannot be predicted from her utility
function at any one point in time, except in those special cases where an optimization
principle, such as maximum w , does apply.
2.2 The product version
The optimizing view in its product version holds that since organisms are highly evolved,
their current behaviour should be adaptive, or ‘well designed’, hence at least locally optimal.
This viewpoint is partly motivated by theoretical arguments, and partly by the empirical
observation that much animal behaviour does in fact appear adaptive, and purposive.
Advocates of this viewpoint, who include many behavioural biologists, typically argue that
evolved behaviour has the function, or purpose, of maximizing the organism’s fitness. If this
is true, an evolved organism’s observed choices should presumably conform to the canons of
rational choice theory.
The status of this optimization assumption is a moot issue. Clearly the assumption is
predicated on a Darwinian or ‘adaptationist’ view of biology, which emphasizes the power of
natural selection to shape phenotypes, including behaviours, to meet environmental demands.
However general objections to adaptationism are not my focus here. For even granting the
validity of adaptationism, it is still unclear whether we should expect evolved behaviour to be
fitness-maximizing, or optimal. The issue is complicated for at least two reasons.
Firstly, though it is often asserted that fitness-maximizing behaviour is the inevitable
or at least the likely outcome of natural selection (e.g. Maynard Smith 1978), a general
demonstration of this proposition has not been provided in evolutionary theory. Recent work
by Alan Grafen has attempted to fill this lacuna, but it is only partly successful. In particular,
Grafen’s results do not apply to game-theoretic scenarios in which fitnesses are frequency-
dependent; and in fact, in such scenarios it is well-known that the evolutionary dynamics can
drive a population to a so-called ‘branching point’ at which evolved behaviours are not even
locally optimal, for they are at a minimum of the individual fitness function (cf. Abrams,
Matsuda & Harada 1993). Moreover, even in the absence of frequency-dependence, genetic
constraints mean that sub-optimal phenotypes can exist in a population at equilibrium, since
the optimal phenotype cannot breed true; Grafen’s results do nothing to escape this well-
8
known problem. So the theoretical justification for assuming that evolution will lead to
fitness-maximizing behaviour is at best imperfect.
A second issue concerns what ‘fitness’ means and what the ‘organisms’ (or
‘individuals’) are whose behaviour is supposed to maximize it. In the simplest case, an
‘organism’ refers to a multi-celled metazoan, e.g. a human, and ‘fitness’ may be defined as
the expected number of offspring produced over a lifetime. But complications abound. This
definition of fitness is known to be inapplicable in more complex cases, e.g. when there are
social interactions involved, or in stochastic environments involving aggregate risk. More
complicated fitness measures, e.g. ‘inclusive fitness’, then become needed; see sections 4 and
5 below. Moreover in some cases biologists apply the fitness concept to groups rather than to
individuals, or to the genes within individuals; this conceptual shift is needed to
accommodate a variety of empirical phenomena within the fitness-maximizing paradigm. So
we cannot simply take the concepts of individual, or organism, for granted; indeed these are
disputed concepts in biology. Nor can we take for granted that we know how to define the
‘fitness function’, if any, that natural selection will lead them to maximize.
This last point is relevant because workers in a numbers of fields have argued, and in
some cases produced models to show, that Darwinian evolution can in fact give rise to
irrational choice behaviour. Thus Sober (1998) and Skyrms (1995) both argued that in
certain cases, evolution and rational choice ‘part ways’, in that weakly or even strictly
dominated behaviours can be favoured by natural selection. Similarly, Robson (1996) argues
that in the face of environmental uncertainty, behaviours that violate expected utility
maximization may be selectively advantageous. And Houston, McNamara and Steer (2007)
devise models to show that irrational behaviour, e.g. intransitive preferences, can evolve in
certain circumstances. However as we will see later, these arguments for the ‘evolution of
irrationality’ typically make specific assumptions about who the maximizing agent is (e.g.
individual), what their utility function is (e.g. number of offspring), and what the state-space
of the decision problem is. By suitably altering these assumptions, the irrationality can often
be made to go away.
Where does this leave us as regards the validity of employing rational choice theory to
understand organismic behaviour? As I see it, the position is this. In so far as evolved
behaviour is locally optimal, hence fitness-maximizing, then it is valid to think of the
organism, in its behavioural choices, as akin to a rational agent seeking to maximize a utility
9
function. However the justification for this optimality assumption must ultimately be
empirical, not theoretical. Moreover we cannot assume a priori that we know which
biological entity (gene, individual, group etc.) is playing the role of the rational agent, nor
that we know what the agent’s utility function is; both of these are up for grabs. Despite these
caveats, the rational agent model remains a valuable way to think about evolved behaviour, in
my view.
3. The utility / fitness connection: foundational issues.
The observation that utility and fitness play similar roles in rational choice theory and
evolutionary theory respectively is by no means new. However the utility / fitness connection
has rarely been studied in detail. A natural starting point is to consider measurement-theoretic
issues. In the case of utility, we customarily distinguish between ordinal, cardinal, ratio and
absolute scales for measuring utility, depending on the class of transformations to which we
think we can subject the utility function without losing information. What about biological
fitness? What is the appropriate measurement scale for fitness?
In evolutionary theory, fitness is often defined on an absolute scale, e.g. as expected
number of offspring, or probability of survival, or intrinsic growth rate, or gametic
contribution to the next generation. These quantities are all absolutely measurable, in that the
actual numbers are meaningful. However it does not follow that for the purposes of
evolutionary analysis, all of this information is necessary. In fact, it many cases what matters
is how many offspring an organism or genotype leaves relative to others in the population;
this determines whether it will be favoured by natural selection or not, and is often sufficient
to determine the evolutionary dynamics.
In some evolutionary analyses, a purely ordinal fitness concept is sufficient. For
example, if competing strains of bacteria are growing in a nutrient broth, then under well-
known conditions the strain with the highest intrinsic growth rate will be the only one found
at equilibrium. So to predict the equilibrium state, we need only be able to order the strains
by growth rate; the actual growth rates do not matter. But in other cases ordinality is not
enough. For example, in a one-locus two-allele population genetics model we know that the
population will converge to a polymorphic equilibrium if the heterozygote is fitter than the
two homozygotes, (i.e. wAa > waa and wAa > wAA); but to determine the actual equilibrium
10
composition of the population we need to know the fitness differences between the three
genotypes, which requires cardinal measurability. And to determine the rate at which the
population approaches equilibrium, we need to know the ratios of the genotype fitnesses to
one another, which requires ratio-scaled measurability.
To focus the issue more generally, consider the Price equation, a standard formalism
for expressing the evolutionary change in a trait (or gene) over one generation in a
population:
∆ z = [Cov (wi, zi) + Exp (wiδi)] / w
where zi is the trait value (or gene frequency) of individual i, wi is the fitness of individual i,
z the mean trait value in the population, ∆ z the one generational change, δi the deviation in
trait value between individual i and its offspring; and Cov and Exp are the covariance and
expectation operators respectively (in the sense of population averages). (To apply the Price
equation to a discrete trait or gene, simply let zi be an indicator function for the presence of
the trait in individual i.) It is easily seen that the class of transformations to which the fitness
function w may be subject, while leaving ∆ z unchanged, involves multiplication by a
positive constant, implying that fitness is ratio-scale measurable. Note, though, that if we are
only interested in the sign of ∆ z (i.e. in whether a gene or trait will be favoured by selection
or not), then cardinal measurability is enough; since adding a constant to the fitness function
leaves the sign of ∆ z unchanged.
There are other contexts in which cardinality will also suffice. For example in ‘one
population’ evolutionary game theory, in which organisms are drawn at random from an
infinite population to play a symmetric game, the standard continuous-time replicator
dynamics are unaffected by addition of a constant to each payoff, and multiplication of each
payoff by a positive constant only changes the velocity with which the population moves
through the state space, not the orbits (cf. Weibull 1995 p.73-4). (Though other dynamics can
be affected, and in two population models the situation is different.) So for the purposes of
answering the questions that evolutionary game theory usually asks, fitnesses need only be
cardinally measurable. But in general, it cannot be assumed that adding a constant to the
fitness function makes no difference, as the zero point of fitness is meaningful. This is
particular clear in evolutionary ecology, where the fitness concept is used to study changes in
a population’s density as well as its genetic composition.
11
Besides measurability, rational choice theorists often ask whether utility is
interpersonally comparable, i.e. whether utility differences and /or levels can be meaningfully
compared across individuals. What about fitness? It might be thought that fitness is obviously
interpersonally comparable: surely the whole point of the fitness concept is to permit
comparison between the individuals in a population? However this reply is not to the point.
For in typical applications of the rational choice metaphor in biology, either there is only one
individual (“mother nature”), or there are many individuals but they all have the same utility
function, i.e. the analysis focuses on the behaviour of a ‘typical’ organism in the population.
This is true even in a game-theoretic context, so long as the organisms are playing a
symmetric game. So the fact that a population contains many organisms, whose fitnesses may
be compared, does not constitute an analogue of interpersonal comparability, so long as there
is a single fitness function. Different organisms exhibit different behaviours, hence have
different fitnesses; but this is simply the analogue of a rational agent receiving different
utility from different outcomes, which involves only intrapersonal comparison.
However some biological scenarios do involve multiple individuals with different
fitness functions. For example, in a symbiotic interaction, the individuals come from different
species (e.g. plants and fungi); the evolutionary analysis must then study both populations at
once. There are then two fitness functions, one for each population, from which the co-
evolutionary dynamics can be deduced. Or within a single population, there may be different
‘roles’, e.g. owner versus intruder, or predator versus prey, where a different fitness function
is associated with each role. (Such examples have often been studied using asymmetric game
theory.) More generally, biological scenarios that involve conflict of interest, e.g. between
queen and workers in an insect colony, or between nuclear and mitochondrial genes, or
between males and females, require us to consider multiple fitness functions. It is common to
use rational choice-theoretic concepts to describe such cases, e.g. by regarding the competing
genes or individuals as agents ‘trying’ to achieve incompatible goals.
In such ‘multiple individual’ cases, the analogue of the interpersonal comparability of
utility question does arise. What is the answer? Since biological fitness is in principle
absolutely measurable, it must therefore be fully comparable. However again, the real
question is whether all of this information is needed for evolutionary analysis. In the simplest
genetical models of co-evolution, which track genetic change in two separate populations of
interacting organisms, ratio-scale measurability is necessary, but different ratio-scales can be
used for the two populations. (This can be seen by applying the Price equation to two
12
separate populations.) Thus in these cases, fitness satisfies what social choice theorists call
“ratio-scale measurability without interpersonal comparability” (cf. Hammond 2005).
The situation is slightly different in two-population evolutionary game theory, which
is often used to model asymmetric interactions. In that case, whether affine transformation of
one player’s payoffs, independently of the other players, has an effect on the solution orbits
depends on both the evolutionary dynamics that are assumed, and on whether the
transformation involves adding a constant or multiplying by a positive constant; see Weibull
(1995 p.174) for useful discussion.
The overall moral, then, is that the appropriate measurability and comparability
assumptions for biological fitness depend on what questions are being asked, and what
modelling assumptions are made. In principle, biological fitness is absolutely measurable and
hence fully interpersonally comparable, but for many purposes not all of this information is
needed; weaker assumptions would suffice. The situation is interestingly different from in
rational choice theory, where different measurability and comparability assumptions about
utility are also used in different contexts (e.g. ordinal non-comparability in consumer theory,
but cardinal unit comparability in discussions of utilitarianism). In this case, the different
possible assumptions require justification, in that it is an open question whether they make
sense metaphysically. In the biological case, by contrast, the issue is not whether a given
measurability / comparability assumption is metaphysically permissible, but whether it is
sufficient to allow the evolutionary analysis to be done.
Besides measurability and comparability, a related foundational issue concerns the
‘revealed preference’ construal of utility maximization. In orthodox rational choice theory,
utility maximization is usually given a strictly ‘as if’ construal. So to say that an agent
maximizes their utility, or expected utility, is not to speculate about their psychological
makeup, but simply to say that their preferences (or choice behaviour) conform to certain
axioms. This behaviourist understanding of utility is a key element in ‘revealed preference
theory’, and routinely taught in microeconomics courses, though it has not gone unchallenged
(cf. List and Dietrich’s contribution to this conference.)
What about biological fitness? Should fitness-maximization in biology also be
understood on the ‘revealed preference’ model? In one respect the answer is clearly yes.
Biologists who study organismic behaviour using the fitness-maximizing paradigm are
obviously not suggesting that organisms are consciously trying to maximize their fitness (or
13
any other quantity); rather the suggestion is that natural selection leads them to behave as if
they were doing this. So the maximization is strictly in ‘as if’ sense, just like in utility theory.
But there is also a disanalogy. On the revealed preference view, preference is
conceptually prior to utility. Thus rational choice theorists begin their analysis of individual
behaviour by focusing on an agent’s binary preferences (or their hypothetical choice
behaviour) over a set of alternatives. The agent’s real-valued utility function is then
introduced as a representation of this preference relation. But in biology it is the other way
around. Talk of an organism’s ‘preferences’ and ‘interests’ is commonplace in biology, but
these notions are always derivative from the organism’s fitness function. To say that an
organism prefers outcome A to B, for example, means that the organisms receive a higher
fitness in A than B. So in this respect, the situation is the opposite from revealed preference
theory.
To illustrate this last point, consider the well-known conflict of interest that arises
between nuclear and mitochondrial genes in flowering plants over the sex-ratio of their host
plant. Mitochondrial genes are only transmitted in ovules, not pollen, so prefer that their plant
have an all-female brood. Nuclear genes generally prefer a fifty-fifty sex ratio (presuming
outbreeding and random mating). So the genes have opposing interests, and conflict is likely:
the mitochondrial genes will try to bias the sex ratio towards females, while the nuclear genes
will try to prevent this, leading to what is known as ‘intra-genomic conflict’. The key point is
this. Talk of the genes’ interests, preferences and aims is perfectly legitimate here, but it
derives from the underlying fitness functions. Suppose that in alternative A, the host plant’s
sex ratio is 3:1 in favour of females, while in alternative B it is 1:1; the total gametic output is
the same in both alternatives. Then, a mitochondrial gene has higher fitness in alternative A
than B, so ‘prefers’ A to B, so will ‘try’ to bring about A rather than B; while the reverse is
true for a nuclear gene. Thus the preference relations derive from the real-valued fitness
functions, not vice-versa.
Despite this difference, it could well be interesting to take a ‘preference first’
approach in evolutionary biology. For one aspect of the rationale for this approach in rational
choice theory is that preferences and choices are observable, while real-valued utility
functions are not. (This much is fairly uncontroversial.) This obviously applies to animal as
much as to human behaviour. So despite the conceptual priority of the real-valued fitness
function over the evolutionary ‘preferences’ that it induces, it could nonetheless be
14
interesting to see if the former can be deduced from the latter. This is discussed further, with
a concrete example, in section 5 below.
4: Who is the rational agent, and what is their utility function?
In rational choice theory, the agent whose behaviour is studied is typically the individual.
There are attempts to treat supra-individual entities, e.g. groups or committees, as rational
agents in their own right (e.g. List and Pettit 2011); and the preference aggregation problem
in social choice theory is sometimes viewed as an attempt to say when ‘society as a whole’
may be regarded as having a rational preference; but in general rational choice theory is
committed to methodological individualism. This makes sense, since it is individuals who
have beliefs, preferences and make choices, in the first instance at least.
In evolutionary biology matters are rather different. Suppose we are persuaded of the
usefulness of applying the rational choice metaphor in an evolutionary context (in its
‘product’ version). Which biological entity should we treat as akin to a utility-maximizing
rational agent? ‘The individual organism’ is the natural answer, and indeed the standard one
in the literature. But there are two related complications. Firstly, it is often unclear what the
‘individual organism’ actually is; in plants, fungi and marine invertebrates, for example, we
find entities whose status as individuals, as opposed to colonies, is controversial. For
example, some biologists would regard a large aspen grove as a single biological with non-
contiguous parts (the trees); while others regard the trees themselves as the individuals.
Secondly, even when it is clear what the individual is, evolutionary theorists recognise that
sub- or supra-individual entities, e.g. genes and groups, can be ‘units of selection’, i.e. subject
to natural selection and thus something that can potentially be treated as a fitness-maximizing
entity.
It is this second problem that we will focus on here. The problem is intimately related
to the familiar ‘levels of selection’ question in evolutionary biology, which asks what the
entities are that natural selection chooses between (cf. Okasha 2006). Candidate answers
include genes, chromosomes, cells, individual organisms, groups and even species. This
question arises from two factors. The first is that the Darwinian process can be characterized
in purely abstract terms, as a process of differential survival and replication, in a way that
leaves open what the entities actually are whose survival and replication is at issue. The
15
second is that empirically, organisms display certain traits, including behaviours, that do not
benefit them directly. Such traits do not appear designed to maximize the individual
organism’s fitness, so presumably have not evolved by a process of individual-level selection.
One example of such a trait is altruism, in which an organism performs an action
which benefits others at a personal cost. For example, in social insect colonies, workers
devote their whole lives to assisting the reproductive efforts of the queen, by tending the
larvae, foraging for food and defending the colony. Such workers typically leave no offspring
of their own (and are sometimes physiologically sterile), so have a personal fitness of zero.
One possible evolutionary explanation for the worker behaviour, first broached by Darwin
himself, is that the colony as a whole benefits; so the trait could have evolved by colony-level
selection. Alternatively it may be that the workers are looking after their indirect genetic
interests by helping relatives; see below.
Another example of a trait that is not individually beneficial is cytoplasmic male
sterility in hermaphroditic plants; this is related to the nuclear-mitochondrial conflict
mentioned above. Such plants typically need to make pollen and ova, in order to self.
However, sometimes they cease investing in pollen production completely, producing only
ovules. This ‘male sterility’ trait is detrimental for the fitness of the individual plant, yet is
quite common. The evolutionary explanation is that the genes responsible for the trait are
found in the mitochondria, which are only transmitted in females and thus have no interest in
their host plant producing pollen. The plant ends up with a sub-optimal trait – a lack of pollen
– but the mitochondrial genes reap a benefit.
4.1 Two ways of salvaging the rational choice metaphor
How do these complications affect the use of the rational choice metaphor in biology? At first
blush, they might seem to undermine it. If an organism exhibits traits that do not enhance its
fitness, surely it cannot be treated as akin to a utility-maximizing rational agent? But this is
too quick. What follows is a weaker conclusion, namely that the organism cannot be treated
as a utility-maximizer whose utility function is equal to its personal fitness function. This
leaves open the possibility that by appropriate choice of utility function, the rational choice
analogy can be restored, i.e. the organism may be maximizing some quantity other than its
own fitness. A second possibility is that the rational agent analogy does apply, but that the
playing the role of the rational agent, so doing the maximizing, is not the individual organism
but some other entity.
16
Interestingly, these two possibilities correspond to the two main ways that
evolutionary theorists have tried to account for the prevalence of traits that do not enhance
individual fitness. The first way is by invoking the inclusive fitness concept of Hamilton
(1964). Hamilton’s key contribution was to realise that in social situations, in which
organisms interact with genetic relatives, the classical measure of fitness – expected number
of offspring – is no longer the relevant quantity. An organism that behaves altruistically, by
sacrificing some of its personal fitness in order to help others, may be favoured by natural
selection, so long as the ‘others’ are relatives, rather than random members of the population.
The reason is that relatives share genes, so there is a certain probability that the recipient of
the altruistic act will itself be an altruist. Depending on the magnitude of the cost c incurred
by the altruist, the benefit b conferred on the recipient, and the coefficient of relationship r
between them, it is possible that the altruistic trait will spread by selection. (The coefficient
of relationship measures the genetic distance between altruist and recipient; it equals 1 for
clones, ½ for full-sibs and children, ¼ for grandchildren, 1/8 for cousins etc.).
This insight was captured in Hamilton’s famous rule (rb > c) for when a gene coding
for altruism will be favoured by natural selection; and in the idea that inclusive fitness, rather
than classical (or personal) fitness, is the quantity that organisms’ social behaviour should
appear designed to maximize. An organism’s inclusive fitness is defined as its personal
reproductive output, plus the weighted sum of the portion of each of its relatives’
reproductive output for which it is causally responsible; where the weights are the
coefficients of relatedness. Though it is the former idea – Hamilton’s rule – that enjoys pride
of place in most of the biological literature on social behaviour, it is the latter idea – inclusive
fitness maximization – that is more relevant for the rational agent view of evolved behaviour.
This latter idea, which is a kind of biological analogue of the ‘other regarding preferences’
posited by behavioural economists, suggests that the rational agent metaphor can be applied
to social behaviour, and to traits such as altruism, so long as the agent’s utility function is
taken to be inclusive fitness (cf. Grafen 2006, West and Gardner 2013, Queller 2011). This
idea is discussed further in section 5.
The second way that biologists have reconciled individually disadvantageous traits
with Darwinism is by arguing that the relevant biological unit, whose fitness is being
maximized, need not be the individual organism. Thus proponents of group selection, also
known as ‘multi-level selection’, argue that natural selection often acts at the group level,
favouring some groups over others, with the result that individuals will display traits that are
17
group-beneficial. An extreme version of this idea holds that certain biological collectives, e.g.
honey bee colonies, constitute ‘superorganisms’, in which the parts work largely for the good
of the whole, helping to maximize the group’s fitness. The remarkable complexity and
functional integration that such colonies exhibit lends credence to this view (cf. Holldobler
and Wilson 2009). Though widely dismissed in the 1960s, an attenuated version of the
superorganism idea has recently enjoyed a revival in evolutionary biology, in part because of
the realization that a multi-celled organism – the traditional ‘individual’ – is itself a highly
adapted, complex group of cells, and a eukaryotic cell is itself a group of smaller biological
units (cf. Maynard Smith and Szathmary 1995, Michod 1999). Since cells and multi-celled
organisms do clearly exist, the idea that evolution sometimes works for group advantage
cannot be dismissed.
The appeal to group advantage can be seen as an alternative way of trying to preserve
the rational agent model from the challenge posed by social behaviour. Whereas the inclusive
fitness approach retains the individual as the rational agent but posits a modified utility
function, the group selection approach treats the group rather than the individual as the
analogue of the rational agent, seeking to maximize a ‘group utility function’. Two
interesting questions them arise. Firstly, what is this group utility function? Second, what is
the relation between the ‘inclusive fitness’ and the ‘group advantage’ ways of trying to
accommodate social behaviour? I look at them briefly in turn.
In most though not all group selection models in biology, group fitness is defined as
average or total fitness of the individuals in the group; it is differential group fitness, so
defined, that drives the evolution of group-beneficial traits. (Many of these models are based
on a hierarchical expansion of the Price equation above, which involves partitioning the
covariance term into within and between group components; see Okasha 2006 for details.)
This suggests that the group utility function can be identified with average or total individual
fitness.1 If group fitness is defined this way, it follows that there is an interesting link between
the group fitness notion in biology and the utilitarian definition of ‘social utility’ in social
choice theory. In Okasha (2009) I exploited this analogy, by arguing that standard
axiomatizations of utilitarianism from social choice theory could be a given a biological
interpretation; this in turn allows us to characterize those situations in which group fitness
1 This is the choice of ‘group objective function’ used by Gardner and Grafen (2009) in their
study of the maximizing agent analogy as applied to groups.
18
cannot be defined as average or total individual fitness. My analysis was critiqued and
developed further by Bossert, Qi and Weymark (2013).
What about the relation between the inclusive fitness and group advantage
approaches? Biologists used to regard them as incompatible. Indeed, Hamilton’s work owed
much of its appeal to its ability to explain the evolution of pro-social traits from a strictly
individualistic perspective, without an appeal to group advantage (which for many biologists
remains a problematic notion.) This is the view expressed clearly in Dawkins (1976), for
example. However recently, many social evolutionists have converged on this idea that
inclusive fitness and group or multi-level selection models are ultimately equivalent, i.e. that
the choice is one of modelling preference, not empirical fact (e.g. Marshall 2011). This
equivalence thesis is supported by formal analyses showing that the correct condition for the
evolution of a pro-social behaviour can be expressed both ways. This is a surprising result, as
it suggests that it is partly a conventional matter whether the rational agent should be
identified with the individual or group. The matter is examined further in Okasha
(forthcoming).
4.2 The gene as rational agent
Certain traits that bring no advantage to the individual, such as the example of cytoplasmic
male sterility above, are not group advantageous either. To understand such traits, biologists
often adopt the ‘gene’s eye view’, and regard the trait as an adaptation designed to benefit the
gene that causes the trait, rather than the organism that expresses it. Usually, the interests of
any gene and its host organism co-incide; however in some cases a gene can profit at the
organism’s expense, by reducing the organism’s total fitness but accruing a larger share for
itself. This stems from the fact that, in sexually reproducing organisms, not all the genes in an
organism are transmitted intact to the next generation. Thus mitochondrial genes are only
transmitted in female gametes; while the nuclear genes are broken up by meiosis, so any
gamete only contain half of the complement of nuclear genes found in the host organism. As
a result, traits which are sub-optimal at the organism level can sometimes spread by natural
selection.
Such traits can be understood using the rational agent metaphor, so long as we take
the ‘agent’ to be the gene rather than the organism. The gene behaves as if it is trying to
maximize a utility function – which in this case, must be defined as the number of copies of
the gene transmitted in its host organism’s gametes. The gene ‘chooses’ the strategy that
19
maximizes this number; a side-effect of this is that the organism’s total gametic output may
be reduced, in which case there is a gene / organism conflict. So in this case, preserving the
rational agent view of evolution (in its ‘product’ version) requires descending to the level of
the gene, rather than ascending to the level of the group.
The idea of the gene as rational agent, ‘choosing’ between strategies so as to
maximize its representation in the next generation, may seem reminiscent of Richard
Dawkins’ ‘selfish gene’ concept. But there is in fact a crucial difference. Dawkins (1976)
applied his selfish gene analysis to all genes that evolve by natural selection, including those
that code for traits that benefit their host organism. His point was that all such genes are
‘trying’ to profit at the expense of their alleles in the population. However, our focus above
has been on genes that spread at the expense of their host organism. Such genes, which are
relatively rare, are sometimes called ‘ultra-selfish genes’, or ‘outlaw’ genes. Their
evolutionary dynamics can only be understood be focusing on gene-level, rather than
organism-level, advantage. It is only when studying genes of this sort, or their phenotypic
effects, that we are forced to treat the gene as the rational agent.
It is interesting to consider why it is usually possible to treat the whole organism as
the rational agent, i.e. why we only need descend to the level of the gene in special cases. The
reason is that organisms contain mechanisms designed to align the interests of their
constituent genes, i.e. to eliminate intra-individual conflict by ensuring that no ‘ultra-selfish’
genes can arise. Chief among these is fair meiosis (or Mendelian segregation), which ensures
that each nuclear gene has an equal chance of being passed to a gamete. When these
mechanisms do their job, they ensure that the only way a gene can benefit itself is by playing
a strategy (i.e. coding for a trait) which also benefits the host organism. It is then possible to
treat the whole organism as the adapted unit, since the interests of all of its constituent genes
become identical. Similarly, many apparent superorganisms, e.g. eusocial insect colonies,
contain mechanisms which serve to align the interests of the insects in the colony, i.e.
policing of selfish workers who try to lay their own eggs, rather than tending for the queen. If
the policing mechanism works well, then any worker can only further its own interests by
working for good of the whole colony. The general moral here, now widely accepted in
biology, is that for an entity to function as adapted unit, some mechanism is needed to align
the evolutionary interests of its constituents.
20
This moral is easily, and usefully, understood using the apparatus of rational choice
theory. In effect, conflicts of interest between genes and their host organisms (or organisms
and their host groups) arise when the preference orderings of the different genes, over the set
of alternative ‘outcomes’ that they can bring about, do not all coincide. Thus gene A would
prefer that the organism left 10 gametes, 80% of which contain A, than 15 gametes, 50% of
which contain A; however the organism itself, and all the other genes at unlinked loci, have
the reverse preference. Conflict-reducing mechanisms, such as fair meiosis and policing,
work by restricting the set of feasible outcomes; on the restricted set, interests are aligned.
Thus with Mendelian segregation in place, a gene cannot influence the proportion of the
organism’s gametes in which it is found, but only the total number of them; and it will prefer
that this number be as great as possible. Similarly, if a worker cannot lay eggs of its own, its
choices are restricted to actions that affect the queen’s reproductive output; it will prefer to
maximize this output. Thus we see how a simple rational choice analysis, drawing only the
concepts of preference ordering and feasible set, sheds light on the biology.
To sum up, organisms exhibit many traits that do not benefit them
individually. These are of two broad sorts: those that benefit relatives or fellow group
members, such as altruistic behaviours, and those that arise from ‘ultra-selfish genes’. Both
pose a prima facie challenge to the ‘rational agent’ view of evolution, but it can be overcome.
The former can be brought within the rational agent framework either by taking groups as the
agents or by letting the utility function be inclusive fitness; formally at least, these are
equivalent. The latter can be accommodated by taking genes, rather than their host organisms,
as the maximizing agents. In general, the organism can only be treated as the rational agent if
mechanisms are in place to unify the interests of its constituent genes.
5. Inclusive fitness maximization and rationality
Let us return briefly to the concept of inclusive fitness. As noted above, part of the appeal of
Hamilton’s inclusive fitness concept is that it promises to reconcile organisms’ social
behaviour, including altruistic acts, with the idea of individual maximization. This aspect of
Hamilton’s theory is emphasized by recent authors including Grafen (2006), West and
Gardner (2013) and Queller (2011). In effect, the idea is that an organism will choose
between different possible social actions by working out the inclusive fitness consequences of
each, then choosing the action with the highest inclusive fitness. Thus the key Darwinian idea
21
that evolved behaviour should appear ‘purposive’ can be retained, by taking the ultimate
purpose to be maximization of inclusive fitness.
This way of describing Hamilton’s theory suggests a natural connection with rational
decision theory, with inclusive fitness playing the role of the utility function. This connection
has been noted previously, but has rarely been developed in a systematic manner. One reason
for this is that biologists who have noted this connection, e.g. Grafen (2006), Queller (2011),
have not usually appreciated the ‘as if’ interpretation of utility maximization that is
customary in decision theory (i.e. the revealed preference approach). Thus although they
emphasize the connection between inclusive fitness and rational decision, they have not
sought to relate it explicitly to preferences or to patterns of choice behaviour.
In recent work, joint with John Weymark and Walter Bossert, we attempted to fill this
lacuna, by seeking to deduce the principle of inclusive fitness maximization from a more
primitive basis (Okasha, Weymark and Bossert 2014). To do this, we employed the
conventional decision-theoretic methodology of starting with an organism’s choice
behaviour, summarized in the form of a binary preference relation between actions; we then
sought axioms on this preference relation which are necessary and sufficient for the organism
to always choose between actions in accordance with inclusive fitness maximization. We
succeeded in axiomatically characterizing both inclusive fitness maximization itself, and a
variant that we dubbed ‘quasi-inclusive fitness maximization’ for use in situations when an
organism has incomplete information about degrees of relatedness.
A brief sketch of our model is this. We consider a set of individuals I = (1,...,n).
Individual 1 is the focal individual whose actions we are interested in; the other n-1 comprise
all the other individuals who might be affected by the focal individual’s action. There is a
fixed relatedness vector r = (r1,...,rn), where ri denotes the coefficient of relatedness between
the focal organism and individual i. Thus r1 = 1, and the higher the value of i, the closer the
genetic relation between the focal and the ith
individual. At a given point in time, the focal
individual can perform a number of different actions, each of which potentially affects the
personal fitness (expected no. of offspring) of each individual in the set I. We simply identify
an action with a payoff vector a = (a1,...an) Rn, where ai is the incremental personal fitness
gain or loss that individual i suffers as a result of action a. The set of all possible actions is
Rn; however only some of these may be feasible.
22
The focal individual’s choice behaviour is described by a binary preference relation
≿r on the set of feasible actions. The relation ≿r indicates, for any two actions in the feasible
set, which the focal individual would (weakly) prefer, given the relatedness vector r. The
inclusive fitness of a feasible action a is defined as i
riai ; that is, it is a weighted sum over
individuals of the action’s payoff to each individual, with weights given by the relatedness
profile. If the focal individual is an inclusive fitness maximizer, then its preference relation ≿r
is represented by the inclusive fitness function; this means that for any pair of feasible actions
a and b, a ≿r b if and only if i
riai i
ribi . If the organism is a quasi-inclusive fitness
maximizer, it evaluates actions using a weighted sum of the payoffs, but where the weights
are not the true relatendness coefficients but rather any monotonic transformation of them.
Quasi-inclusive fitness maximization can be thought of as an ‘attempt’ to maximize inclusive
fitness for an organism that lacks knowledge about exact degrees of relatedness, but does
know who it is more related to than who.
There is an interesting formal link between inclusive fitness maximization and a
‘weighted utilitarian’ social objective function in social choice theory (see d’Aspremont 1985
or Bossert and Weymark 2004). The latter evaluates a social action by a weighted sum of the
utilities that the action has for each member of society. (In single-person decision theory, the
same functional form could also represent the preferences of someone who takes some
account of the interests of others.) This suggests that an interesting axiomatic characterization
of inclusive fitness maximization might be found by employing axioms analogous to those
used to characterize the weighted utilitarian rule in social choice theory.
In Okasha, Weymark and Bossert (2013) we pursed this line of thought, and found
axioms on ≿r that are necessary and sufficient for the focal individual to be an inclusive
fitness maximizer, and a quasi-inclusive fitness maximizer. The former result is simpler,
using just three axioms that we call Ordering, Focal Individual Monotonicity and Haldane.
Ordering is just the familiar requirement that ≿r be reflexive and transitive; Focal Individual
Montonicity says that if the focal individual can increase its own payoff while leaving
unchanged the payoff of all others, it will strictly prefer to do so. Haldane says that the focal
individual will be prepared to sacrifice x units of its own payoff, if by doing do it can boost
the payoff to the ith
individual by x/ri. (Recall that ri is the relatedness of the focal individual
23
to the ith
individual.) This axiom derives its name from the biologist J.B.S. Haldane’s famous
quip that he would jump into a river to save two brothers or eight cousins. So long as the
focal individual obeys these three simple axioms, this guarantees that it is an inclusive fitness
maximizer.
Our characterization of quasi-inclusive fitness maximization is more complicated, but
has one key advantage. Instead of the Haldane axiom, which is informationally quite
demanding in that it requires that the focal individual ‘know’ its degree of relatedness to
every other individual, this characterization uses an axiom called Nepotism, which says just
that the focal individual would prefer to help more closely than less closely related
individuals. This is a reasonable idealization of the actual powers of kin discrimination of
many social animals.
The significance of these results is three-fold. Firstly, they suggest a possible means
by which natural selection could ‘fine tune’ choice behaviour so as to maximize inclusive
fitness. Secondly, they suggest a way of empirically testing the hypothesis that animals
maximize their inclusive fitness (or something close), by observations of their binary choices.
If it could be shown empirically that an animal’s choices fail to satisfy any of our axioms, we
could immediately infer that they are not an inclusive fitness maximizer. Thirdly, they help
forge a link, both formal and conceptual, between the biologists’ concept of inclusive fitness
maximization and rational choice theory. The chief novelty here is not the suggestion that
inclusive fitness plays the role of the utility function, which has been made before, but rather
the use of a revealed preference approach and the axiomatic method.
6. Challenges to the rational choice analogy
I turn briefly to two interesting challenges to the use of the rational choice analogy to
understand evolved behaviour. Unlike the general objections to optimization discussed in
section 2, these challenges are specific: they describe particular scenarios when the
expectation that natural selection will give to rational behaviour is allegedly violated.
Interestingly, though the challenges involve quite different considerations, a general moral
emerges.
24
6.1 Sober and Skyrms on the correlated prisoner’s dilemma
The first example is an argument made by both Sober (1998) and Skyrms (1995)
(independently, and in slightly different ways). Consider a simple prisoner’s dilemma game,
below. In a rational choice setting, in which the payoffs denote utilities, it is widely agreed
that in the one-shot game the rational agent should play D (defect), as it strongly dominates C
(cooperate). Thus the expected utility of D must exceed that of C. This is so even if the agent
believes that its opponent is likely to play the same strategy that it plays, presuming the truth
of ‘causal decision theory’ (Lewis 1981), as the two players are causally isolated.
Player 2
C D
C (6, 6) (0, 10)
Player 1
D (10, 0) (2, 2)
Figure 2: Prisoner’s Dilemma
Suppose we now transpose to an evolutionary setting. We consider a large population
of organisms engaged in a one-shot pair-wise interaction; an organism’s type (C or D) is
hard-wired genetically, and the payoffs now represent increments of Darwinian (personal)
fitness. Which type has the higher fitness? As Skyrms observes, this depends on the pairing
assumption that we make. Under random pairing, in which the probability of having a C
partner is same for both types, it is obvious that type D must be fitter. The expression for the
fitnesses of the two types are then:
WC = 6. P(C) + 0.P(D)
WD = 10.P(C) + 2.P(D)
Where P(C) and P(D) denote the probabilities of being paired with a co-operator and a
defector respectively; these probabilities are given by the overall frequency of each type in
the population. As Skyrms notes, these expressions for expected fitness are identical to the
corresponding expressions for the expected utility in the rational choice context, calculated
using standard (Savage-style) decision theory. Under random pairing, the type with the
25
highest expected fitness chooses the action that confers the highest expected utility, so
evolutionarily optimal behaviour is identical to rational behaviour.
Skyrms argues that matters are different if there is correlated pairing. We must then
calculate the expected fitness of each type using the conditional probabilities of having a
partner of a given type, which may differ for co-operators and defectors. The resulting
expressions are:
WC = 6. P(C/C) + 0.P(D/C)
WD = 10.P(C/D) + 2.P(D/D)
where P(X/Y) denotes the probability of having a partner of type X, given that one is of type
Y oneself. It is easy to see that if the correlation is strong enough, i.e. the conditional
probability of having a C partner is sufficiently greater for C types than D types, then the C
type may be fitter overall. This is an instance of the familiar statistical phenomenon known as
‘Simpson’s paradox’.
As Skyrms insightfully notes, the expressions for the expected fitness in the case of
correlated pairing correspond to Richard Jeffrey’s recommendation for how to calculate
expected utility in individual decision theory, known as ‘evidential decision theory’ (Jeffrey
1990). The problem, however, is that Jeffrey’s theory is widely agreed to be flawed as a
model of rational decision: it enjoins the agent to co-operate in the one-shot Prisoner’s
dilemma if the agent is sufficiently confident that its opponent will play the same strategy as
it, i.e. to deliberate ignore the difference between causation and correlation! Skyrms assumes,
surely correctly, that this is irrational. So he concludes that with correlated pairing, “rational
choice theory completely parts ways with evolutionary theory. Strategies that are ruled out by
every theory of rational choice can flourish under favourable conditions of correlation” (1995
p.106).
Sober (1998) develops the same point slightly differently, in the context of discussing
what he calls the “heuristic of personification” in evolutionary biology. This heuristic is the
idea that “if natural selection controls which of traits T, A1,…,An evolves in a given
population, then T will evolve, rather than the alternatives, if and only if a rational agent who
wanted to maximize fitness would choose T over A1,…,An.” (1998 p.409). Sober maintains
that this heuristic is usually unproblematic but fails in certain contexts, one of which is the
one-shot Prisoner’s dilemma. The rational agent will never play co-operate, since it is strictly
26
dominated, Sober reasons; however it is possible that natural selection will favour co-operate
over defect if the requisite correlation exists, i.e. if the probability of having a cooperative
partner is sufficiently greater for co-operators than for defectors. Thus the heuristic of
personification fails: the rational strategy and the evolutionarily optimal strategy do not co-
incide.
These arguments are intriguing, but there is an obvious response, developed in detail
by J. Martens (forthcoming). In applying the rational choice model to evolution, there is no
particular reason to equate the rational agent’s utility function with the personal fitness
function (or some affine transformation thereof.) Martens observes that if we take the agent’s
utility function to be its inclusive fitness, then the divergence between rationality and
evolution that Skyrms and Sober call attention to immediately disappears. This can be seen
by simply transforming the above payoff matrix by adding to each player’s payoff in each
cell the quantity ‘r times my partner’s personal payoff’; where ‘r’ is defined as a difference in
conditional probabilities [P(C/C) – P(D/C)] (which is a suitable generalization of Hamilton’s
original definition of ‘r’ for phenotypic rather than for genetic evolutionary models).2 Then,
the condition for the rational agent to prefer strategy C coincides exactly with the condition
for evolution to favour that strategy.
The immediate significance of Martens’ point is that Sober and Skyrms were too
quick to argue that evolution and rational choice ‘part ways’, in this example. For there is no
particular reason to equate utility with personal fitness, even if it is a convenient starting
assumption. So Sober and Skyrms’ policy of transposing a payoff matrix from a rational
choice to an evolutionary context, by re-interpreting the payoffs as ‘personal fitness’ instead
of ‘utility’, is not the only way to proceed. Given that, in social contexts, personal fitness is
not the quantity that evolution will lead to individuals to maximize, it makes more sense to
use inclusive fitness as the payoff. Alternatively, we could simply adopt a revealed
preference approach, and define rationality as consistency of choice, rather than as
maximization of expected utility for a specific choice of utility function. Had Skyrms and
2 Note that this definition of ‘r’ is well-motivated, for it is a special case of a definition widely used in the
biology literature, namely the linear regression of partner genotypic value on actor genotypic value. (If we take
this ‘value’ to be 1 for C and 0 for D, then this linear regression simply is the difference in conditional
probabilities given in the text.) This statistical definition of ‘r’ generalizes the original genealogical definition of
Hamilton’s 1964 papers.
27
Sober proceeded this way, their ‘parting of ways’ argument would not have got off the
ground.
In a way, what we have here is an evolutionary analogue of a moral familiar from
behavioural economics; namely that certain patterns of choice behaviour that may appear
anomalous, e.g. ‘altruistic punishment’, can be easily rationalized by positing other-regarding
preferences. Such behaviours only threaten the utility-maximization paradigm if we insist that
an agent’s utility function depends only on their own (material) payoff; and there is no good
reason for that assumption. The situation is similar for the Skyrms / Sober claim that the
evolutionarily optimal and the rational strategies fail to co-incide in the one-shot prisoner’s
dilemma with correlated pairing: this is true only if the rational agent is assumed to care only
about their own (personal) fitness.
6.2 Robson and McNamara on risk preferences
A quite different reason for thinking that natural selection may give rise to irrational
behaviour emerges from the analysis of optimal choice in the face of risk / uncertainty. The
problem of how a rational agent should choose between risky options (lotteries) has a long
history, of course; the standard answer is given by expected utility (EU) theory, first made
explicit by von Neumann and Morgenstern (1944). Though numerous alternatives to EU
theory have been developed in the last thirty years, primarily because systematic violations of
EU have been repeatedly discovered, EU remains the normative standard for how an ideally
rational agent should make risky choices.
Interestingly, the problem of optimal choice in the face of risk also arises in
evolutionary biology (cf. Okasha 2011). A typical problem in this area is as follows. An
animal can forage either in a resource-rich area where there is a high risk of predation, or a
resource-poor area where there is a lower risk. More resources mean greater survival and thus
more offspring and higher Darwinian fitness. Which strategy should an animal choose? This
question has attracted a large literature, empirical and theoretical, in behavioural ecology,
much of it based on statistical decision theory.
This prompts the question: what is the relation between rational choice and
evolutionarily optimal choice under uncertainty? Specifically, will natural selection lead
organisms to exhibit choices that satisfy expected utility maximization? Many authors have
assumed that the answer to this question must be ‘yes’; in part, I suspect, because ‘expected
28
number of offspring’ is a commonly-used measure of biological fitness in non-social
situations. Thus it is natural to assume that, when faced with a choice between two lotteries,
(e.g. ‘½ chance of 10 offspring, ½ chance of none’ and ‘4 offspring for certain’), natural
selection will lead an organism to prefer the option with the higher expected number of
offspring. On average, organisms with such a preference will leave more offspring than ones
with any other.
However, matters are not quite so simple. It is well-known in evolutionary theory that
the average performance of a behaviour or strategy does not always determine its
evolutionary fate: variability in performance is often relevant too. Thus J. Gillespie (1977)
showed, for example, that in the face of certain forms of stochasticity in payoff, the fittest
genotype will in fact be the one that (approximately) maximizes:
Exp (no. of offspring) – f [Var(no. of offspring)].
where ‘Exp’ and ‘Var’ denote expectation and variance respectively and ‘f’ is an increasing
function. (Which function depends on the exact pattern of stochasticity.) Gillespie’s formula
is strikingly reminiscent of M. Allais’s famous critique of EU theory; Allais maintained, to
the consternation of orthodox EU theorists, that a rational agent would discount the expected
utility of an action by some positive function of the variance in utility (Allais 1953). This
suggestive link between Gillespie’s evolutionary analysis and Allais’s rational choice
analysis is examined further in Okasha (2011).
The question of evolutionarily optimal preferences was studied by A. Robson (1996)
and J. McNamara (1995) who arrived (independently) at similar answers. Both conclude that
evolution will only favour organisms that maximize expected reproductive output if the risk
they face is ‘idiosyncratic’ rather than ‘aggregate’. Idiosyncratic risk means that each
organism in the population faces an independent lottery over the possible outcomes. This is
plausible in certain cases: the chance that an organism is a victim of predation while foraging
is likely to be independent across organisms. However some risks are aggregate. If there is a
5% chance of a very harsh winter in a given year, in which food will be critically scarce, then
all organisms face a 5% chance of starving. The risk is perfectly correlated across organisms.
Purely aggregate and purely idiosyncratic risk are opposite ends of a spectrum; real cases will
often lie somewhere in between.
29
The significance of the aggregate/idiosyncratic distinction arises because what matters
in evolution is how well one does relative to the population. An organism that leaves x
offspring in a state of the world where the population leaves many offspring, may contribute
a smaller fraction of the population, so be less ‘fit’ in the relevant sense, than an organism
leaving y < x offspring in a state where the population leaves few offspring. With
idiosyncratic risk, then if the population is large enough, the number of offspring produced
will be (effectively) constant across states of the world, by the law of large numbers, and thus
the strategy with the highest expected number of offspring will be fittest. But where risk is
aggregate, this is not so. In that case, McNamara (1995) shows that the optimal strategy
requires that an organism calculate ‘expected’ no. of offspring using a distorted probability
distribution p*(s) over the states of the world, that places less weight on states where the
population as a whole does well, and more on states where it does badly, compared to the true
distribution p(s).
McNamara and Robson both conclude that with aggregate risk, evolution will favour
irrational behaviour. Indeed Robson (1996) argues that evolution will favour individuals
whose choice behaviour violates stochastic dominance, and are not even ‘probabilistically
sophisticated’; since an individual’s evaluation of a lottery (over possible nos. of offspring)
will not even be a function of the marginal probabilities that it faces. Thus consider two
lotteries, A and B. In A, with probability ½ every organism in the population leaves 9
offspring, and with probability ½ everyone leaves 1. In B, each organism either leaves 8.5
offspring or 1 with equal probability, but with independence across organisms. From a single
organisms perspective, A may seem better than B, since A stochastically dominates B (and
yields a higher expected number of offspring). But it follows from the Robson / McNamara
analysis that lottery B is evolutionarily preferable. Natural selection should lead organisms to
prefer B to A, in apparent violation of basic canons of rationality. So again, natural selection
and rational choice appear to ‘part ways’.
An interesting response to this argument comes from Grafen (1999) and Curry (2001),
who argue (in effect) that it rests on a restrictive assumption, namely that an individual’s
utility function must depend only on its own reproductive output. Modulo this assumption, it
follows from Robson’s and McNamara’s analysis that no utility function exists such than an
EU maximizer will be led to make evolutionarily optimal choices. But if we relax that
assumption, then EU maximization can in fact be recovered, simply by letting the
individual’s utility function, in a given state of the world, to be its reproductive output
30
divided by the population average. The expected value of this quantity, over states of the
world, is the criterion that matters for evolution. An organism with preferences that maximize
this quantity – expected relative fitness – will be favoured by natural selection.
It may seem odd that the same evolutionary model can imply that EU maximization is
satisfied and that it is violated. How can this be? In fact, there is no contradiction here. When
Robson argues that EU maximization is violated and Curry argues that it can be rescued, they
are (in effect) using different state spaces to set up the decision problem. They are both using
the traditional von Neumann / Morgenstern set-up, of a finite set of prizes (outcomes) and a
set of lotteries on those prizes over which the agent has a preference ordering. Robson is
assuming that the basic prizes, from which the lotteries are constructed, specify how many
offspring an individual leaves. Curry is assuming that the basic prizes specify how many
offspring an individual has relative to the population average. On the first state space,
evolutionary optimality requires violation of EU maximization, but on the second it does not.
In effect, this means that are dealing with a biological analogue of the well-known
moral that apparent irrationalities of choice can often be removed by enlarging the state
space. This general moral is familiar from discussions of the status of the von Neumann /
Morgenstern independence axiom, or the Savage sure-thing principle, in decision theory. It is
well-know that apparent empirical violations of these axioms can often be ‘explained away’
by positing an enlarged state space. It is intriguing to find an analogue of this issue arising in
evolutionary biology; see Okasha (2011, 2013) for further discussion.
To conclude the section, we have looked briefly at two arguments which suggest that
evolution and rational choice may ‘part ways’, in the sense that what is evolutionarily optimal
may not always correspond to what is rational. Both can in principle be defused by suitable
choice of utility function and/or suitable choice of state space. (The same defusing strategy
can be applied to a recent attempt, by Houston, McNamara and Steer 2007, to show how
evolution may lead to intransitive preferences, as its authors point out.) It is tempting to
suggest that this will always be possible, i.e. that any putative ‘parting of ways’ may be
eliminated by similar means. However this needs to be judged on a case-by-case basis; there
is no a priori reason to think it is true.
31
7. Conclusion
The thematic and formal connections between rational choice theory and evolutionary
biology, and the fruitful transfer of ideas from one field to another, have often been noted but
rarely been explored in detail. In this article I have tried to outline some of the key issues,
philosophical and technical, raised by the connections between the two fields, drawing on my
previous work.
Some of the main morals to emerge are as follows: (i) there is an important distinction
between using a rational choice analogy to understand the process and the product of
evolution; (ii) the traditional objections to optimization in biology complicate but do not
undermine the rational choice analogy; (iii) there are close parallels, but also differences,
between the utility maximizing paradigm of rational choice theory and the fitness-
maximizing paradigm of evolutionary biology; (iv) some biological phenomena require us to
think of the gene, rather than the individual, as the maximizing agent; (v) altruistic
behaviours can be accommodated either by thinking of the group as the rational agent, or by
taking the individual to be maximizing its inclusive, rather than personal, fitness; (vi) it is
possible to formulate inclusive fitness maximization using a ‘revealed preference’ approach,
by deducing it from axioms on choice behaviour; (vii) apparent cases in which evolution and
rational choice ‘part ways’ can often be defused by suitable choice of utility function and/or
by changing the state space of the decision problem.
32
References
Abrams, P. A., Matsuda, H. and Harada, Y. (1993) ‘Unstable Fitness Maxima and Stable
Fitness Minima in the Evolution of Continuous Traits’, Evolutionary Ecology 7, 465-87.
Allais, M. (1953). ‘Le comportement de l'homme rationnel devant le risque: critique des
postulats et axiomes de l'école américaine’, Econometrica, 503-546.
Bossert, W. and Weymark, J.A. (2004) ‘Utility in Social Choice’, in Barberà, S., Hammond,
P.J. & Seidl, C. (eds.) Handbook of Utility Theory. Vol. 2: Extensions. Kluwer
Academic Publishers, Boston, pp. 1099–1177.
Bossert, W., Qi, C. and Weymark, J. A. (2013) ‘Extensive Social Choice and the
Measurement of Group Fitness in Biological Hierarchies’, Biology and Philosophy 28, 1, 75-
98.
Conradt, L. and List, C. (2013) ‘Group Decisions in Humans and Animals: a Survey’,
Philosophical Transactions of the Royal Society B, 364: 719-42.
Curry, P. (2001) ‘Decision Making under Uncertainty and the Evolution of Interdependent
Preferences’, Journal of Economic Theory 98: 357–69
d'Aspremont, C. (1985) ‘Axioms for social welfare orderings’, in: Hurwicz, L.,
Schmeidler, D., Sonnenschein, H. (Eds.), Social Goals and Social Organizations:
Essays in Memory of Elisha S. Pazner, Cambridge, CUP, pp. 19–76.
Dawkins, R. (1976) The Selfish Gene, Oxford: OUP.
Edwards, A. W. F. (1994) ‘The Fundamental Theorem of Natural Selection’, Biological
Reviews 69, 443-474.
Ewens, W. (2004) Mathematical Population Genetics, Springer.
Gardner, A. and Grafen, A. (2009) ‘Capturing the Superorganism: a Formal Theory of Group
Adaptation’, Journal of Theoretical Biology 22, 659-71.
Gillespie, J. (1977) ‘Natural Selection for Variances in Offspring Number: A New
Evolutionary Principle’, American Naturalist 111:1010–14.
Grafen, A. (1999) ‘Formal Darwinism, the Individual-as-Maximizing-Agent Analogy, and
33
Bet-Hedging’, Proceedings of the Royal Society B 266:799–803.
Grafen, A. (2006) ‘Optimization of Inclusive Fitness’, Journal of Theoretical Biology 238,
541-63.
Hamilton, W. D. (1964) “The Genetical Evolution of Social Behaviour I”, Journal of
Theoretical Biology, 7(1), 1-16.
Hammerstein, P. (2012) ‘Towards a Darwinian Theory of Decision Making: Games and the
Biological Roots of Behavior’, in Okasha, S. and Binmore, K. (eds.) Evolution and
Rationality, Cambridge: Cambridge University Press, p. 7-22.
Hammond, P.J. (2005) “Utility invariance in non-cooperative games”, Advances in public
economics: utility, choice and welfare, 31-50.
Holldobler, B. and Wilson, E.O. (2009) The Super-organism: The Beauty, Elegance and
Strangeness of Insect Societies, London: W.W. Norton.
Houston, A. I. and McNamara, J. (1999) Models of Adaptive Behaviour, Cambridge:
Cambridge University Press.
Houston, A., McNamara, J. & Steer, M. (2007) ‘Do We Expect Natural Selection to Produce
Rational Behaviour?’, Philosophical Transactions of the Royal Society of London B 362:
1531-43.
Jeffrey, R. (1990) The Logic of Decision 2nd
edition, Chicago: University of Chicago Press.
Johnstone, R. A. and Cant, M. A. (2009) ‘Models of Reproductive Skew: Outside Options
and the Resolution of Reproductive Conflict’, in R. Hager and C. Jones (eds) Reproductive
Skew in Vertebrates, Cambridge: Cambridge University Press 2009.
Krebs, J. and Davies, N. (1997) Behavioural Ecology: an Evolutionary Approach, Oxford:
Blackwell.
List, C. and Pettit, P. (2011) Group Agency, Oxford: OUP.
Lewis, D. (1981) ‘Causal Decision Theory’, Australasian Journal of Philosophy, 5–30.
Maynard Smith, J. (1978) ‘Optimization Theory in Evolution’, Annual Review of Ecology
and Systematics 9, 31-56.
Martens, J. (forthcoming) ‘Hamilton meets Causal Decision Theory’, British Journal for the
Philosophy of Science
34
Maynard Smith, J. (1982) Evolution and the Theory of Games, Cambridge: Cambridge
University Press.
Maynard Smith, J. and Szathmary, E. (1995) The Major Transitions in Evolution, Oxford:
Oxford University Press.
McNamara, J. (1995) ‘Implicit Frequency-Dependence and Kin Selection in Fluctuating
Environments’, Evolutionary Ecology 9: 185-203.
Marshall, J. (2011) ‘Group Selection and Kin Selection: Formally Equivalent Approaches’,
Trends in Ecology and Evolution 26, 7, 325-32.
Metz, J. A. J., Mylius, S. D. and Diekmann, O. (2008) ‘When Does Evolution Optimize?’,
Evolutionary Ecology Research 10, 629-54.
Michod, R. (1999) Darwinian Dynamics: Evolutionary Transitions in Fitness and
Individuality, Cambridge MA: MIT Press.
Okasha, S. (2006) Evolution and the Levels of Selection, Oxford: Oxford University Press.
Okasha, S. (2007) ‘Rational Choice, Risk Aversion and Evolution’, Journal of Philosophy
104, 5, 217-235.
Okasha, S. (2008) ‘Fisher’s Fundamental Theorem of Natural Selection’, British Journal for
the Philosophy of Science 59, 3, 319-51.
Okasha, S. (2009) ‘Individuals, Groups, Fitness and Utility: Multi-level Selection Meets
Social Choice Theory’, Biology and Philosophy 24, 5, 561-584.
Okasha, S. (2011) ‘Optimal Choice in the Face of Risk: Decision Theory meets Evolution’,
Philosophy of Science 78, 83-104.
Okasha, S. (2012) ‘Social Justice, Genomic Justice and the Veil of Ignorance: Harsanyi meets
Mendel’, Economics and Philosophy 28, 43-71.
Okasha, S. (2013) ‘The Evolution of Bayesian Updating’, Philosophy of Science 80(5), 745-
757.
Okasha, S. (forthcoming) ‘The Relation between Kin and Multi-level Selection: an Approach
Using Causal Graphs’, British Journal for the Philosophy of Science
35
Okasha, S. and Paternotte, C. ‘Group Adaptation, Formal Darwinism and Contextual
Analysis’, Journal of Evolutionary Biology 25, 6, 1127-1139.
Okasha, S., Weymark, J. A. and Bossert, W. (2014) ‘Inclusive Fitness Maximiziation: an
Axiomatic Approach’, Journal of Theoretical Biology 350, 24-31.
Queller, D. C. (2011). Expanded Social Fitness and Hamilton's Rule for Kin, Kith, and Kind.
Proceedings of the National Academy of Sciences, 108 (Supplement 2), 10792-99.
Rice, S. (2005) Evolutionary Theory: Conceptual and Mathematical Foundations, Sinauer.
Robson, A. (1996) ‘A Biological Basis for Expected and Non-expected Utility’, Journal of
Economic Theory 68, 397-424.
Roughgarden J. (2010) The Genial Gene, University of California Press.
Skyrms, B. (1995) Evolution of the Social Contract, Oxford: OUP.
Sober, E. (1998) “Three Differences between Deliberation and Evolution”, in P. Danielson
(ed.), Modelling Rationality, Morality, and Evolution, Oxford: OUP, pp.408-422.
Stearns, S. (1992) The Evolution of Life Histories, Oxford: Oxford University Press.
von Neumann, J. and Morgenstern, O. (1944) Theory of Games and Economic Behaviour,
Princeton: Princeton University Press.
Weibull, J. (1995) Evolutionary Game Theory, Cambridge MA: MIT Press.
West, S.A and Gardner, A. (2013) ‘Adaptation and Inclusive Fitness’, Current Biology 23,
R557-R584.
Wright, S. (1932) ‘The Roles of Mutation, Inbreeding, CrossBreeding and Selection in
Evolution’, in Proceedings of the Sixth International Congress of Genetics, pp.356-66.