optimising fungicide applications on winter wheat using genetic algorithms

Biosystems Engineering (2004) 88 (4), 401–410 doi:10.1016/j.biosystemseng.2004.04.012

Available online at www.sciencedirect.com

IT: Information Technology and the Human Interface

Optimising Fungicide Applications on Winter Wheat using Genetic Algorithms

D.J. Parsons1; D. Te Beest2

1 Silsoe Research Institute, Wrest Park, Silsoe, Bedford, MK45 4HS, UK; e-mail of corresponding author: [email protected]

2 Wageningen University, Postbus 16, 6700AA, Wageningen, The Netherlands; e-mail: [email protected]

(Received 24 July 2003; accepted in revised form 23 April 2004; published online 7 July 2004)

A genetic algorithm is used in a decision support system to select the combinations of chemicals and the timing of successive treatments for the optimal control of fungal diseases in winter wheat crops, using a simulation model to predict the performance of different treatments. The search space is large and discrete, making the use of conventional optimisation methods impractical. Furthermore, the user requirements specify that the method must supply lists of near-optimal solutions, which fits with the use of populations of solutions in the genetic algorithm. Substantial improvements in the performance of the algorithm were obtained by tuning the fitness, selection, reproduction and replacement methods for the optimisation of short-term and long-term decisions. These also ensured rapid convergence in the former and prevented premature convergence in the latter.

The algorithm has proved to be effective at finding optimal and near-optimal solutions within an acceptable time. When compared with exhaustive searches for cases where this is possible (short-term planning with restricted choices), it typically finds 5–8 of the top 10 plans and a similar number of the next 10. The results of the system in field and user trials have been good.

© 2004 Silsoe Research Institute. All rights reserved

Published by Elsevier Ltd

1. Introduction

In the UK, approximately two million hectares of land is used to grow wheat, and each crop receives on average 2.5 fungicide sprays (Thomas et al., 1997). Hence, appropriate fungicide application is an important part of cereal husbandry. However, there are around 20 appropriate chemical actives and at least one new chemical is introduced every year, replacing less effective old ones. These are formulated in combinations into hundreds of products. Each chemical has different levels of control against different diseases. Timing and dose both affect this degree of control (Paveley et al., 2000). Chemicals can be combined together to improve results. Disease progress is very complex and variable from year to year, site to site and variety to variety. Surveys (Thomas et al., 1997) have shown that, as a result, the use of fungicides by farmers is little affected by likely disease pressure or variety. Thus, some farmers are using inappropriate fungicide applications. It would benefit both the environment and the farmer’s profit if more appropriate timings and doses could be used.

Several European countries have made use of information technology to assist growers with disease control in wheat and other crops. One of the first systems was EPIPRE in the Netherlands (Zadoks, 1981; Rijsdijk, 1983). The name is derived from ‘epidemic prediction and prevention’. The EPIPRE system used empirical models to relate observed disease levels to likely losses. Use of the system has now declined. Germany has a web-based advisory system called proPlant Expert (www.proplantexpert.com). It functions as an expert advisory system for a range of crops, pests and diseases. PC-Plant Protection, developed in Denmark (Murali et al., 1999), covers control of weeds, pests and diseases in wheat and other crops, including barley, peas and oilseed rape, with an emphasis on reducing chemical use. Models are used to assess the need for control based on observations, and the system then recommends a product choice and dose. Approximately 2500 licences have been sold.

Within the UK, Decision Support Systems for Arable Crops (DESSAC) was set up as a LINK project, that is, one with joint government and industry funding. The DESSAC system is designed as a personal computer based platform to support the development of decision support systems, providing access to farm, field, pesticide and weather databases (Brooks, 1998). Wheat disease manager (WDM) is the first module and was developed in conjunction with the DESSAC platform. It is designed for use by farmers and their advisors to support decisions relating to the management of fungal disease on winter wheat. Its approach is somewhat different from the systems mentioned above, in that it uses semi-mechanistic crop and disease development models, and produces recommendations by optimising the control variables (chemicals, dose and timing) in the model.

There are four aspects to improving the application of fungicides to wheat crops: predicting the effect on the diseases; predicting the impact of reduced disease on increased yield; transferring this knowledge to the farmer or adviser; and determining the optimum application of fungicides. This paper focuses on the fourth aspect within WDM. The models on which it is based are described elsewhere and cover simulation of the wheat canopy (Milne et al., 2003), yield formation, foliar disease and disease of the stem base and ear.

The WDM module contains four components: the user interface, the process model, the fungicide module and the decision model. The user interface allows the user to enter observations of crop state and disease level into the farm database, and to set the requirements from the models. The process model is a collective term for the models outlined above, and receives information from the weather database, farm database (variety, sowing date and observations) and varieties database (disease resistance of wheat variety). The fungicide module handles the interactions with the database of information about chemicals, and validates spray programmes for compliance with constraints on spray timings and mixtures. It allows the user to choose the list of products to be considered by the decision model: typically a short list of 10–20 products preferred by the user. The decision model, which is the subject of this paper, suggests courses of action to the user by making use of all this information.

Winter wheat is attacked throughout its life by a number of fungal diseases of the stem, leaves and ear. The four major spring foliar diseases in the UK are Septoria tritici, yellow rust (Puccinia striiformis), brown rust (Puccinia recondita) and powdery mildew (Erysiphe graminis). All of these diseases can cause substantial losses of grain yield if not properly controlled. Stem base diseases such as eyespot can also cause yield losses and ear diseases including Fusarium and sooty moulds can reduce the value of the grain. The dispersal and development of these diseases are strongly influenced by weather variables, including temperature, humidity, wind speed and rainfall. The variables of importance differ from disease to disease.

The number of possible choices available to the farmer, even within a short time period, is very large: there are on the market over 200 fungicide products, containing between one and three active ingredients, with different levels of activity against each disease. Some of the products may also be mixed safely for application. Furthermore, there is growing interest in moving away from the standard doses towards adjusting the dose according to the disease risk. Thus, even if the choices are restricted to, for example, 20 well-known products, the number of possible two-spray programmes allowing one or two products per application over a four week period is almost 1 000 000 (20 single products and 380 mixtures on each date, giving 400² combinations, and 6 possible timing combinations with a minimum of 2 weeks between applications, as described in Section 2.2.2).
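The arithmetic behind the figure of almost 1 000 000 can be checked with a short sketch. The function name and its parameters are illustrative only. The count of 380 mixtures is taken here to be 20 × 19, that is, ordered pairs of distinct products; this is an inference from the number quoted rather than something stated in the text.

```python
from itertools import combinations


def count_two_spray_programmes(n_products=20, horizon_weeks=4, min_gap_weeks=2):
    """Count two-spray programmes with one or two products per spray."""
    singles = n_products
    # Assumption: 380 = 20 * 19, i.e. ordered pairs of distinct products.
    mixtures = n_products * (n_products - 1)
    per_date = singles + mixtures
    # Week numbers 0..horizon_weeks, with at least min_gap_weeks between sprays.
    weeks = range(horizon_weeks + 1)
    timings = [(a, b) for a, b in combinations(weeks, 2) if b - a >= min_gap_weeks]
    return per_date, len(timings), per_date ** 2 * len(timings)


per_date, n_timings, total = count_two_spray_programmes()
print(per_date, n_timings, total)  # 400 6 960000
```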

2. Decision model

2.1. Requirements

The objective of the decision model is to maximise the margin over fungicide costs, that is, the value of the grain produced less the cost of the chemicals and their application. This will be referred to simply as the margin. This was a requirement from the users, who wanted the optimisation to be in terms of a financial objective, not yield maximisation or disease minimisation.

The decision model interacts with the process model through a collection of control variables. These are fairly complex, consisting of time of application, in discrete steps; dose, which could be continuous; and chemicals, which may include mixtures. There is also a requirement from the users that the decision model should not simply provide a single optimum solution, but should give a ranked list of near-optimal spray programmes.

Taken together, these considerations motivated the choice of a genetic algorithm. Firstly, it is capable of being used with a black box simulation model. Secondly, it is intrinsically suited to discrete variables, whether numerical or combinatorial. Thirdly, the existence of a population can be used to generate a list of near-optimal solutions in a very natural way. A previous application to a silage harvesting problem with somewhat similar structure had shown that genetic algorithms were suited to problems of operation scheduling (Parsons, 1998). The details of the operation of genetic algorithms can be found in reference works such as Holland (1992), Goldberg (1989) and Davis (1991).

The decision model is required for strategic or long-term planning prior to the fungicide treatment period (the ‘spraying season’), which is taken to be 15 March to 15 July, and for tactical or short-term decision making during the season. The different requirements for these two uses result in differences in implementation. The intention is that the user should create and save one or more long-term plans, then use the short-term mode to revise one as the season progresses. Each combination of variety and sowing date within a farm will require a different plan.

For long-term planning, the time scale covers the whole of the spraying season (15 March–15 July: 122 days). The user can choose to apply up to 4 sprays within this period, each of which can contain up to three products, subject to restrictions on permitted mixtures, timing and doses. The algorithm must search the space of possible spray plans, which includes all permitted mixtures, doses and timings, thoroughly; for long-term planning, speed of operation is a secondary consideration. The use of the long-term mode is not restricted to pre-season planning: if the user feels that it is necessary, it can be used at any point in the season to create a new plan, in which only the sprays after that date will be optimised.

For short-term decision making, the emphasis is on revisions to an existing plan, which may be required because the crop and disease have not developed as forecast at the start of the season. The system will have been updated with weather data and observations of the state of the crop and the amount of infection to allow more accurate predictions to be made. Applications of sprays prior to the decision point will also have been recorded. It is important for the system to operate quickly when used in this way. This is achieved primarily by restricting the planning horizon to 4 weeks. When combined with other constraints, this limits the number of possible spray applications to one or two within that period. As described below, this allows a more efficient encoding of the problem for use by the genetic algorithm. The other decision variables are the same as for the long-term optimisation.

In addition, there are several constraints on the spray plans. Firstly, there are restrictions imposed by pesticide approvals, such as the maximum total dose of a product applied during a season or the minimum interval between spraying and harvest. These have the force of law, and the program is not permitted to break them. Secondly, there are published approved mixtures of products. These are not part of the formal approvals, but the program will not recommend mixtures that are not on the approved list, although the user can use the program to experiment with them. Finally, a decision was taken to impose a minimum interval between applications of 2 weeks, because this is approximately the interval before the emergence of the next unprotected leaf layer.

The algorithm also allows additional constraints to be imposed, such as fixing the date of one of the sprays, or fixing the contents of a spray. This could be used to answer questions such as ‘What is the best product to spray today?’ or ‘When would be the best time to spray product X?’

2.2. Implementation

The key features of the use of a genetic algorithm for any problem are the method of encoding the problem as a chromosome, the fitness function (related to the optimisation objective function), the method of selection of individuals for reproduction, the reproduction operators, and the method of replacement of old individuals by new ones.

The encoding was necessarily somewhat different for the short- and long-term optimisations, but initially they used similar fitness, selection, reproduction and replacement methods. These were chosen to give good performance in the short-term optimisation, but were found to be unsuitable for the long-term one. For the short-term it was necessary to introduce strong selection pressures to ensure that the optimum plan was found quickly, but to balance this with methods to preserve population diversity. However, these choices caused premature convergence to inferior solutions in the much larger long-term search space. A systematic investigation resulted in changes to most of the options. Substantial improvements were obtained, which are illustrated in the results section. This illustrates the importance of tailoring genetic algorithms to the problem being considered; they cannot simply be applied with the default settings.

2.2.1. Population size and run length

As the complexity of the problem varies dramatically, it is not possible to use a fixed population size and number of generations. After experimenting to find values that gave acceptable speed, reliability and diversity of the final population with different sizes of problems, heuristics were derived to adjust these according to the length of the chromosome, which is used as a measure of the complexity of the problem. The number of generations is proportional to the chromosome length, while the population size is proportional to its square.
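The form of these heuristics can be sketched as follows. The constants of proportionality and the lower bounds are not given in the text, so the values used here (`pop_coeff`, `gen_coeff`, `min_pop`, `min_gen`) are purely hypothetical placeholders; only the linear and quadratic dependence on chromosome length comes from the description above.

```python
def ga_run_parameters(chromosome_length, pop_coeff=0.5, gen_coeff=4.0,
                      min_pop=20, min_gen=20):
    """Scale population size and run length with chromosome length.

    Population size grows with the square of the length and the number of
    generations grows linearly. All coefficients are hypothetical.
    """
    population_size = max(min_pop, round(pop_coeff * chromosome_length ** 2))
    generations = max(min_gen, round(gen_coeff * chromosome_length))
    return population_size, generations


# Doubling the chromosome length quadruples the population and doubles the run.
print(ga_run_parameters(16))  # (128, 64)
print(ga_run_parameters(32))  # (512, 128)
```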


2.2.2. Encoding

The way in which the problem variables are encoded as a chromosome can have a profound effect on the efficiency of a genetic algorithm. In general, the chromosome should be as short as possible and avoid redundancy. For the genetic operators to work well, short schemata within the chromosome should carry significant information, as it is these that tend to be passed from generation to generation (Holland, 1992, Chapter 4). As far as possible, constraints should be encoded into the structure of the chromosome, rather than imposed after it is decoded.

A binary alphabet is used for the chromosome, that is, it consists of a string of 0s and 1s. It is convenient to consider this as divided into several genes, each determining a feature of the spray plan:

(1) date(s) of spray(s) as relative week numbers;
(2) number of chemicals in each spray;
(3) choice of each chemical; and
(4) dose of each chemical in steps of 1/4 dose.

All except the date genes are identical in both the short- and long-term optimisations. These are explained in more detail below, but a simple example will help to illustrate the construction of the chromosome. Consider the case where a short-term, one-spray programme with up to two products being mixed from a list of 16 possibilities is being optimised. A programme containing a mixture of a half dose of product number 9 and a full dose of product number 2 applied 2 weeks from the current date would encode as

01, 11, 1001, 0010, 1, 010

where the commas indicate the divisions between the genes. Taking the genes in that order, 01 means a dose of (1+1)/4, 11 means a dose of (3+1)/4, 1001 is product 9, 0010 is product 2, 1 means 1+1 products and 010 means week 2. Note that the doses and the number of products cannot be 0, so 1 is added to the value of the gene interpreted as a binary number.
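The decoding of this example can be reproduced with a minimal sketch. The function `decode_one_spray` and the layout of its result are illustrative inventions; the gene order (two doses, two products, number of products, week) is assumed to be fixed as in the worked example, which covers only the one-spray, two-product case.

```python
def decode_one_spray(chromosome, product_bits=4, date_bits=3):
    """Decode a one-spray chromosome laid out as in the worked example."""
    bits = chromosome.replace(",", "").replace(" ", "")
    pos = 0

    def take(n):
        # Read the next n bits as an unsigned binary integer.
        nonlocal pos
        value = int(bits[pos:pos + n], 2)
        pos += n
        return value

    # Doses and product count cannot be 0, so 1 is added to the raw value.
    doses = [(take(2) + 1) / 4 for _ in range(2)]
    products = [take(product_bits) for _ in range(2)]
    n_products = take(1) + 1
    week = take(date_bits)
    # Only the first n_products (product, dose) pairs are used.
    return {"week": week, "mixture": list(zip(products, doses))[:n_products]}


plan = decode_one_spray("01, 11, 1001, 0010, 1, 010")
print(plan)  # {'week': 2, 'mixture': [(9, 0.5), (2, 1.0)]}
```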

A temporal resolution of 1 week is used for the dates, that is, the date of a spray is simply the number of weeks from the start of the decision period. Given the inherent uncertainties and the fact that it may be impossible to spray on the optimum date due to weather or logistics, a finer resolution is inappropriate. It also helps to reduce the size of the search space, so reducing the time required to find a solution.

In short-term mode, this means that there are five possible one-spray timings and six possible two-spray timings within the 4 week horizon, given the minimum interval of 2 weeks between sprays. There is no simple, efficient representation for both one- and two-spray plans, so they are considered separately and the results are combined after optimisation. In each case the date or dates can be represented by a gene of three bits. For one spray this is treated as an integer representation of the week number relative to ‘today’, as in the example above. For two sprays it is mapped to a pair of week numbers using the representation shown in Table 1. Clearly, in both cases, there are a few unused bit combinations. These are treated as infeasible values.
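The counts of five and six timings, and the handling of unused bit patterns, can be illustrated with a short sketch. The dictionary `TABLE_1` transcribes the mapping given in Table 1; the function `decode_date_gene` and the use of `None` to mark an infeasible value are assumptions made for illustration.

```python
from itertools import combinations

HORIZON_WEEKS = 4
MIN_GAP_WEEKS = 2

# Weeks 0..4 give five one-spray timings.
one_spray_weeks = list(range(HORIZON_WEEKS + 1))
# Pairs at least two weeks apart give six two-spray timings.
two_spray_pairs = [p for p in combinations(one_spray_weeks, 2)
                   if p[1] - p[0] >= MIN_GAP_WEEKS]

# Gene value -> (first week, second week), transcribed from Table 1.
TABLE_1 = {0: (0, 2), 1: (0, 3), 2: (1, 3), 3: (1, 4), 4: (0, 4), 5: (2, 4)}


def decode_date_gene(gene, n_sprays):
    """Return the spray weeks for a 3-bit date gene, or None if infeasible."""
    value = int(gene, 2)
    if n_sprays == 1:
        return (value,) if value <= HORIZON_WEEKS else None
    return TABLE_1.get(value)


print(len(one_spray_weeks), len(two_spray_pairs))  # 5 6
print(decode_date_gene("010", 1))  # (2,)
print(decode_date_gene("101", 2))  # (2, 4)
print(decode_date_gene("111", 2))  # None
```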

In the long-term optimisation it is necessary to cover a much greater span of dates, and the user can specify consideration of up to four sprays, so enumerating the combinations as above would be much more complex. If only one spray is being optimised, the encoding is essentially the same as for the short-term case: a binary integer representing the week as an offset from ‘today’ or the start of the spraying season, whichever is later. The gene is increased to 4 bits to give it a range of 16 weeks (112 days). This is slightly shorter than the spraying season (122 days), but a spray programme containing a single spray in the last 10 days is highly unlikely to be effective.

Table 1
Representation of date pairs as binary genes in short-term optimisation

Date pair, week numbers    Representation
0 2                        0
0 3                        1
1 3                        10
1 4                        11
0 4                        100
2 4                        101

When considering more sprays, it would be possible to give each a gene representing its date relative to the starting point, but this would require multiple constraints to enforce the correct sequence and spraying intervals. Parsons (1998) found that an efficient encoding of dates for operation sequences was to use genes that represent the intervals between the operations. That is, the date gene for the second spray encodes the number of weeks after the first spray that it should be applied, and so on. Again, the genes are treated as integers, but an offset of 2 weeks is added to each to enforce the constraint on the minimum interval between sprays. When there is more than one spray, each date gene is 3 bits, so the range spanned by two sprays is again 16 weeks (7+2+7). If 4 bits were used, as for the single spray, most of the search space would be infeasible, or extend well outside the spraying season. It is not possible to reduce this further when there are three or four sprays without excluding some reasonable solutions. Inevitably, chromosomes arise that place the later sprays beyond the end of the spraying season. Rather than rejecting these, the late sprays are simply ignored, so that they become plans with fewer, widely-spaced sprays. If they result in high margins, these will survive in the population, so the user’s setting is effectively the maximum number of sprays to apply.
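The interval encoding for multiple sprays can be sketched as below. Two points are assumptions: the first gene is read as the week of the first spray with no offset (consistent with the 7+2+7 range quoted), and the end of the season is taken as week 17, the whole number of weeks in 122 days. The function name is illustrative.

```python
def decode_spray_weeks(date_genes, season_weeks=17, min_gap_weeks=2):
    """Convert interval-coded 3-bit date genes into spray week numbers.

    The first gene is the week of the first spray; each later gene is the
    interval since the previous spray, with the minimum gap added.
    Sprays falling beyond the end of the season are ignored.
    """
    weeks = []
    current = 0
    for i, gene in enumerate(date_genes):
        value = int(gene, 2)
        current = value if i == 0 else current + value + min_gap_weeks
        if current > season_weeks:
            break  # intervals are positive, so all later sprays are late too
        weeks.append(current)
    return weeks


# A four-spray chromosome whose last two sprays fall outside the season
# becomes a two-spray plan.
print(decode_spray_weeks(["011", "100", "111", "111"]))  # [3, 9]
print(decode_spray_weeks(["111", "111"]))                # [7, 16]
```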

The remainder of the chromosome consists of sets of genes representing the number and selection of chemicals in each spray and the corresponding doses. The user may choose to mix up to three chemicals in each spray, so up to 2 bits are required to encode this as a binary integer. If the number of chemicals is fixed at 1, this gene is omitted. Each spray is then represented by up to 3 gene pairs for chemical and dose. The chemical selection is represented by a binary integer, which simply indexes the chemical within the list of those available: either the full database or a subset chosen by the user. The length of the gene depends on the length of the list, and will contain unused values if this is not a power of 2. The dose of each chemical is represented by 2 bits, which are mapped to 1/4, 1/2, 3/4 or 1 full dose. If the user wishes to impose constraints, such as fixing the date of application or the chemical applied, this is easily accommodated by removing the corresponding gene.

As noted above, some chromosomes will not represent valid spray programmes. In each case it would be possible to convert these to valid spray plans, but this would introduce a bias, so instead they are treated as infeasible, that is, not in the permitted solution space of the optimisation. Similarly, plans may be infeasible because they violate practical constraints, such as dose-time approvals or approved mixtures, which are built into the pesticides database in the system. All infeasible spray plans are given a value of 80% of the margin for an unsprayed crop. This ensures that they will not survive in the population, but does not bias the distribution of objective values to the extent that using an extreme value, such as 0, would. This is significant with some types of fitness function.

2.2.3. Fitness normalisation

The objective function (the margin, as defined in Section 2.1) has to be converted to a fitness value that will directly control the probability of selection of individuals for reproduction. Various types of linear and nonlinear mapping are possible, and the most appropriate for a given problem depends on the distribution of values of the objective function both in the early stages and when close to convergence.

For the short-term optimisation, rank-based fitness scaling was found to work well. This applies a linear transformation to the rank of each individual within the population, so that the best has fitness 1.0 and the worst 0.5. This imposes a high selection pressure even in the late stages of optimisation, because it means that individuals in the middle of the ranking have a fitness around 0.75 even if their objective function value is close to the maximum. This can lead to premature convergence, but, by selection of suitable population sizes and number of generations, produced quick and reliable performance.

The long-term optimisation uses a linear scaling of the objective function values so that the best has fitness 1.0 and the worst 0.5. Although this appears very similar to the above, it differs significantly in a population where there is a cluster of members close to the maximum with others scattered at lower values, because all those close to the maximum will have fitness values close to 1.0. In this case, rank-based selection is biased towards the better members of the cluster, which can cause premature convergence, whereas linear scaling gives them almost equal probabilities of selection.
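The difference between the two scalings can be seen in a small sketch. The function names and the sample margins are invented for illustration; only the end points (1.0 for the best, 0.5 for the worst) come from the description above.

```python
def rank_fitness(margins, best=1.0, worst=0.5):
    """Linear transformation of rank: worst gets `worst`, best gets `best`."""
    n = len(margins)
    if n == 1:
        return [best]
    order = sorted(range(n), key=lambda i: margins[i])  # worst first
    fitness = [0.0] * n
    for rank, i in enumerate(order):
        fitness[i] = worst + (best - worst) * rank / (n - 1)
    return fitness


def linear_fitness(margins, best=1.0, worst=0.5):
    """Linear transformation of the objective values themselves."""
    lo, hi = min(margins), max(margins)
    if hi == lo:
        return [best] * len(margins)
    return [worst + (best - worst) * (m - lo) / (hi - lo) for m in margins]


# Hypothetical population: a cluster near the maximum plus two stragglers.
margins = [300.0, 350.0, 428.0, 429.0, 430.0]
print(rank_fitness(margins))    # [0.5, 0.625, 0.75, 0.875, 1.0]
print(linear_fitness(margins))  # the three clustered plans all exceed 0.99
```

With rank scaling the plan at £428 receives fitness 0.75 although it is within £2 of the best; with linear scaling the three clustered plans are almost equally likely to be selected.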

2.2.4. Selection

In each generation, a proportion of the population is selected for reproduction using a randomised procedure (‘roulette wheel selection’) in which the probability of an individual being selected is proportional to its fitness. The proportion used is 0.15 for the short term and 0.8 for the long term. Partial replacement was found to give better convergence and eliminate infeasible plans more rapidly than replacement of the whole population at each generation. However, the improvement over replacement of the whole population in each generation was marginal in the long-term case, and further reducing the proportion replaced did not improve the results.
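Roulette wheel selection can be sketched with the standard library, since `random.Random.choices` draws with probability proportional to the supplied weights. The function name is illustrative, and sampling with replacement and the lower bound of two parents are assumptions; the text does not say how these details were handled.

```python
import random


def roulette_select(fitness, proportion, rng):
    """Select indices for reproduction with probability proportional to fitness.

    `proportion` is the fraction of the population selected per generation
    (0.15 for the short term and 0.8 for the long term in the text).
    """
    n_select = max(2, round(proportion * len(fitness)))
    return rng.choices(range(len(fitness)), weights=fitness, k=n_select)


rng = random.Random(42)
fitness = [0.5, 0.6, 0.7, 0.8, 0.9, 1.0] * 5  # hypothetical population of 30
short_term_parents = roulette_select(fitness, 0.15, rng)
long_term_parents = roulette_select(fitness, 0.8, rng)
print(len(short_term_parents), len(long_term_parents))
```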

2.2.5. Reproduction

Reproduction consists of two operations: crossover and mutation. Crossover takes two individuals as parents and produces pairs of offspring by selecting one or more points in the chromosome at random and exchanging the strings for the two parents around these points. The short-term algorithm uses one-point crossover, whereas the long-term uses three crossover points.

Mutation randomly changes the value of one or more bits in the string, with an expectation of 1 bit per chromosome in the short term and 2 in the long term. Mutation encourages searching subspaces of the problem space that are not spanned by the original population.
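Both operators can be sketched on bit strings as follows. The function names are illustrative. Mutation is implemented here by flipping each bit independently with probability `expected_flips / length`, which gives the stated expectation; whether the original system used this scheme is an assumption.

```python
import random


def crossover(parent_a, parent_b, n_points, rng):
    """n-point crossover: swap alternate segments between two parents."""
    length = len(parent_a)
    # Distinct cut points strictly inside the chromosome.
    points = sorted(rng.sample(range(1, length), n_points))
    child_a, child_b = [], []
    src_a, src_b = parent_a, parent_b
    prev = 0
    for p in points + [length]:
        child_a.append(src_a[prev:p])
        child_b.append(src_b[prev:p])
        src_a, src_b = src_b, src_a  # switch source after each cut point
        prev = p
    return "".join(child_a), "".join(child_b)


def mutate(chromosome, expected_flips, rng):
    """Flip each bit independently so that the mean number of flips is as given."""
    rate = expected_flips / len(chromosome)
    return "".join(
        ("1" if bit == "0" else "0") if rng.random() < rate else bit
        for bit in chromosome
    )


rng = random.Random(7)
a, b = crossover("0" * 16, "1" * 16, n_points=3, rng=rng)  # long-term setting
print(a, b)
print(mutate(a, expected_flips=2, rng=rng))
```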

2.2.6. Replacement

After reproduction, new individuals replace old ones. Initially this used a rank-based replacement method similar to that employed in GENITOR (Whitley, 1989), in which new individuals are allowed into the population if they have higher fitness than the worst existing member. This gave very rapid convergence and sometimes resulted in a final population of the short-term optimisation containing only one or two distinct plans, thus failing to meet the users’ requirement for a set of options. In order to maintain diversity, alternative replacement methods in which new individuals only have to compete with sub-populations were used. This is known as tournament replacement and is applied in both versions, with slight differences.

In the long-term optimisation, parental replacement is used: the tournament consists only of the new individual and one of the parent chromosomes that generated it. The parent is replaced if the new individual has higher fitness. However, the algorithm is constructed with a small probability of allowing the inferior of the two individuals to survive. This probability decreases as the algorithm proceeds in a process known as annealing. In the short-term version the tournament contains nine other individuals selected to have the most similar chromosomes to the new one. The member of the tournament with the lowest fitness is removed. This type of tournament is known as crowding replacement, because it tries to prevent the formation of crowds of individuals with very similar genomes.

In both cases, the best member of the old generation is always allowed to survive. This is known as elitism and ensures that the maximum fitness never decreases.
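The two tournament schemes can be sketched as below. Several details are assumptions not given in the text: similarity is measured by Hamming distance, the annealing probability starts at a hypothetical `p0` and falls linearly to zero, and `fitness` is any function returning a number for a chromosome. Elitism is not shown; it would be applied on top of either scheme by protecting the current best member.

```python
import random


def hamming(a, b):
    """Number of bit positions in which two chromosomes differ."""
    return sum(x != y for x, y in zip(a, b))


def parental_replace(parent, child, fitness, generation, n_generations, rng, p0=0.1):
    """Long-term style: child competes with one parent, with annealing."""
    # Hypothetical schedule: chance of keeping the inferior one decays linearly.
    p_inferior = p0 * (1 - generation / n_generations)
    if fitness(child) > fitness(parent):
        better, worse = child, parent
    else:
        better, worse = parent, child
    return worse if rng.random() < p_inferior else better


def crowding_replace(population, child, fitness, tournament_size=9):
    """Short-term style: child competes with its most similar neighbours."""
    nearest = sorted(range(len(population)),
                     key=lambda i: hamming(population[i], child))[:tournament_size]
    weakest = min(nearest, key=lambda i: fitness(population[i]))
    # The lowest-fitness member of the tournament (child included) is dropped.
    if fitness(child) > fitness(population[weakest]):
        population[weakest] = child
    return population


ones = lambda c: c.count("1")  # toy fitness used only for this illustration
pop = ["0000", "0001", "1110", "1111"]
print(crowding_replace(pop, "0011", ones, tournament_size=2))
```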

2.3. Post-processing

The two versions of the genetic algorithm described above were successful at producing diverse populations of plans, but the upper ranks of the population were often still dominated by plans containing only a few cost-effective products with different timings, doses and mixtures. The users preferred to see a wider range of products used, so a post-processing step was added to reduce the repetition of products in the population. The following rule is applied to the list displayed to the user, working from the top down:

A plan is displayed provided that it contains at least one product that has not appeared in two previous plans.

This succeeds in producing a more acceptable list, because there is sufficient diversity in the population to ensure that plans with more than a few products have survived. Of course, some products may still appear frequently, if they do so in combination with a variety of others. Throughout the following section, which is concerned with the performance of the algorithm, the results considered are the full populations produced before application of the post-processing step.
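The display rule can be sketched as a single pass over the ranked list. Here each plan is reduced to the collection of product names it contains, and ‘previous plans’ is interpreted as plans already displayed; both the representation and that interpretation are assumptions.

```python
from collections import Counter


def filter_for_display(ranked_plans, max_repeats=2):
    """Keep a plan if any of its products has appeared in fewer than
    `max_repeats` of the plans already displayed."""
    shown = []
    seen = Counter()
    for plan in ranked_plans:
        products = set(plan)
        if any(seen[p] < max_repeats for p in products):
            shown.append(plan)
            seen.update(products)
    return shown


# Hypothetical ranked list, best first, with product names A, B and C.
ranked = [("A",), ("A",), ("A",), ("A", "B"), ("A", "B"), ("A", "B"), ("C",)]
print(filter_for_display(ranked))
# [('A',), ('A',), ('A', 'B'), ('A', 'B'), ('C',)]
```

Product A still appears four times in the displayed list, but only in combination with B, which matches the remark that some products may still appear frequently.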

3. Results

3.1. Short-term optimisation

The initial trials were conducted with the sprays restricted to single components. This allowed comparisons to be made with the results of exhaustive searches, which became impractical when the complexity of the problem was increased. Several scenarios were investigated, including early spring and late spring decisions. In the early spring scenario, a fixed spray in the late spring was sometimes included. Similarly, in the late spring scenario an early fixed spray was sometimes included.

Agronomically, the three main spray timings are often referred to as T1, T2 and T3. The earliest, T1, is around growth stage 31–33 (first node detectable to third node detectable, or approximately leaf 3 emergence), and is significant in the control of stem-base diseases, such as eyespot, as well as foliar diseases. The next, T2, is around growth stage 39 (flag leaf emergence) and is very important because the flag leaf is normally the largest contributor to yield. Finally, T3 comes around growth stage 59 (inflorescence emergence to anthesis) and may contribute to preserving the photosynthetic area and control of ear diseases. The T2 spray is rarely omitted; a two-spray programme will normally contain T1 and T2 or T2 and T3 depending on circumstances.

These tests gave consistently good results with the true optimum (found by exhaustive search) being achieved in almost every case, together with a collection of near-optimal solutions. Even where the optimum was not found, the difference in margin was much smaller than that arising from the natural variability in the system. Figure 1 shows a typical result for an early season scenario (around T1). The margins for the 24 best one-spray and two-spray plans are ranked. In this case, both sets show good diversity and the two-spray plans, corresponding to T1 and T2, are clearly superior. In this scenario the levels of disease arising from the combination of variety and weather were too high to be controlled effectively by a single spray.

Figure 2 compares the results for one spray with an exhaustive search. It can be seen that the genetic algorithm has found 6 of the top 10 solutions, which is typical of its performance. Below this there is a diverse population covering the range of the best 100 plans from the exhaustive search. The algorithm thus meets both the requirements placed on it: optimisation and diversity.

3.2. Long-term optimisation

Owing to the size of the search space used in the long-term optimisation, it was not practical to test the solutions by exhaustive evaluation, so a different procedure was used to test the performance of the algorithm. The stochastic nature of genetic algorithms means that the final result may be sensitive to the sequence of numbers generated by the pseudo-random number generator (RNG), which can be changed by the value chosen to seed it. If the genetic algorithm is robust, there should be little variation in the best member of the final population, but a poor algorithm will result in wide variation. The reduction in variability as a measure of performance was used as the basis for testing.

Each configuration of the genetic algorithm was run with five randomly selected seed values and the results were recorded. The configurations could then be compared in terms of the maximum margin obtained for all five runs and the range of maxima. A good configuration would have a large maximum and a small range. Given the uncertainties present in the field, a range of a few pounds is not significant, but over £10 is a cause for concern. The results that follow represent a scenario early in the spraying season, about T1, at which point the short-term optimisation can also be run for comparison. The tests were conducted with five different wheat varieties with different levels of resistance to the main foliar diseases, particularly yellow rust and Septoria tritici. Varieties with high resistance are likely to have less serious infections and so be less responsive to spraying. The varieties used were Brigadier and Riband (low resistance), Consort and Rialto (moderate resistance) and Claire (high resistance).
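The seed-based comparison can be sketched as a small harness. The function `compare_seeds` is illustrative, and `toy_run` is a hypothetical stand-in for a full optimisation run that simply returns a seed-dependent margin; it does not reproduce any result from the tables.

```python
import random


def compare_seeds(run_ga, seeds, concern_threshold=10.0):
    """Run one configuration with several seeds and summarise the best margins."""
    best = [run_ga(seed) for seed in seeds]
    spread = max(best) - min(best)
    return {
        "mean": sum(best) / len(best),
        "min": min(best),
        "max": max(best),
        "range": spread,
        # A range over 10 pounds is treated as a cause for concern.
        "concern": spread > concern_threshold,
    }


def toy_run(seed):
    """Hypothetical stand-in for one optimisation run (not the real model)."""
    return 420.0 + random.Random(seed).uniform(-3.0, 3.0)


summary = compare_seeds(toy_run, seeds=[1, 2, 3, 4, 5])
print(summary)
```

A good configuration would show a large `max` together with a small `range` across the five seeds.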

For the initial testing, to allow a closer comparison with the short-term optimisation, the length of the date gene was set to 2 bits for each spray in a multiple-spray programme and to 3 bits in a one-spray programme. The main settings that controlled the genetic algorithm, such as the selection, crossover and replacement methods, were also the same as the ones that were known to work well in the short-term version. This is referred to in the tables as the ‘original configuration’. It was found that the short-term optimisation (Table 2) was actually producing higher mean values than the long term (Table 3). This undesirable result was contrary to expectations, because the search space for the short-term version is a subspace of that for long term, so the latter should be able to find every solution found by the former. The long-term algorithm also showed a very large range, especially on the low-resistance varieties. Both of these were indicative of defects in the configuration of the algorithm.

Systematic testing of options within the genetic algorithm resulted in the configuration described in Section 2.2. This produced substantial improvements: the maximum and average margins now always


Fig 2. Comparison of best one-spray programs from exhaustive search (+) and genetic algorithm for an early season, short-term optimisation. The horizontal line passes through the 11th best plan from the exhaustive search; six of the genetic algorithm solutions lie above this line. Axes: rank (horizontal, 0 to 100); margin over fungicides, £ ha−1 (vertical, 300 to 450).

Fig 1. Comparison of best one-spray (+) and two-spray programs in the results of the genetic algorithm for an early season, short-term optimisation. Axes: rank (horizontal, 0 to 25); margin over fungicides, £ ha−1 (vertical, 300 to 450).

Table 2
Results from 5 runs of short-term optimisation with 2 sprays and mixtures up to 3 products

Variety      Margin, £
             Mean   Min   Max   Range
Brigadier    421    418   424   6
Riband       425    424   426   2
Consort      431    430   432   2
Rialto       436    435   438   3
Claire       468    465   470   5


exceeded those found by the short-term algorithm, as they should. The range of maxima between runs had also been reduced dramatically, showing that the performance was much more consistent (Table 4). When the date genes were restored to the length described in Section 2.2.2 (4 bits for single sprays and 3 bits per spray for multiple sprays), the margins improved substantially, as shown in Table 5, reflecting the fact that the search space included the full spraying season. The range of variation between runs remained small, as required. An inevitable consequence of these changes is that the long-term optimisation is slower than the short-term, by about a factor of 10. This is acceptable, given that it will be used occasionally for planning, whereas short-term optimisation is intended for tactical management decisions within the season.

Over the full range of 30 scenarios considered in this way, the changes described above improved the maximum margin by over 10% in 11 of them, and by over 20% in five. These results illustrate how important careful tuning of the settings and the choice of encoding are to the effective operation of a genetic algorithm. In this case, a partially redundant encoding (which generates some sprays beyond the end of the season) is preferable to a more compact one that fails to span the problem space.
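The trade-off between a compact and a partially redundant date encoding can be illustrated with a small sketch. The decoding rule (an unsigned binary integer multiplied by a fixed step), the 7-day step and the 70-day season are all assumptions made for illustration; the actual encoding is the one described in Section 2.2.2.

```python
def decode_date_gene(bits, step_days):
    """Decode a binary date gene into a day offset from the start of the window."""
    return int("".join(str(b) for b in bits), 2) * step_days


def coverage(n_bits, step_days, season_days):
    """Count decoded offsets that fall inside and beyond a season of given length."""
    offsets = [code * step_days for code in range(2 ** n_bits)]
    inside = [d for d in offsets if d <= season_days]
    beyond = [d for d in offsets if d > season_days]
    return {"last_reachable": max(inside), "inside": len(inside), "beyond": len(beyond)}


# Hypothetical numbers: 7-day steps and a 70-day spraying season.
compact = coverage(n_bits=3, step_days=7, season_days=70)
redundant = coverage(n_bits=4, step_days=7, season_days=70)
print(compact)
print(redundant)
```

Under these assumptions the 3-bit gene wastes no codes but cannot reach any date after day 49, whereas the 4-bit gene reaches the end of the season at the cost of five codes that decode to dates beyond it.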

4. Discussion

As the results show, the genetic algorithm used is successful in giving solutions to this practical problem. For short-term optimisation with single-component sprays, for which an objective comparison is available, the performance is very good and the diversity of choices in the final population meets the user requirements.

A genetic algorithm is well suited to this problem, because of its ability to handle different types of variables and produce a set of solutions. For use in real time, it also has the desirable ‘anytime’ property: the user can observe the objective function value and interrupt when the result is felt to be good enough for practical purposes.
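The ‘anytime’ property can be illustrated with a toy search that reports its best result after every generation, so that the caller may stop whenever the value is good enough. This is a generic sketch and not the SUGAL implementation: the function `anytime_search`, the bit-string representation, the operators and the toy objective are all assumptions.

```python
import random


def anytime_search(fitness, n_bits, pop_size=20, generations=200, seed=0):
    """Yield (generation, best_fitness, best_individual) after every generation.

    The caller may stop iterating at any point and keep the latest result.
    """
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for gen in range(generations):
        # Breed one child from two tournament winners (one-point crossover).
        a = max(rng.sample(pop, 2), key=fitness)
        b = max(rng.sample(pop, 2), key=fitness)
        cut = rng.randrange(1, n_bits)
        child = a[:cut] + b[cut:]
        child[rng.randrange(n_bits)] ^= 1  # single-bit mutation
        # Replace the worst member if the child is better.
        worst = min(range(pop_size), key=lambda i: fitness(pop[i]))
        if fitness(child) > fitness(pop[worst]):
            pop[worst] = child
        best = max(pop + [best], key=fitness)
        yield gen, fitness(best), list(best)


def toy_margin(bits):
    """Toy objective standing in for the simulated margin: the number of 1 bits."""
    return sum(bits)


history = []
for gen, value, plan in anytime_search(toy_margin, n_bits=12):
    history.append(value)
    if value >= 10:  # the user decides the result is good enough and interrupts
        break
print(len(history), history[-1])
```

Because the best solution so far is carried forward, the reported value never decreases, which is what makes interrupting the search safe.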

The plant-disease system is subject to large uncertainties: those intrinsic in biological systems and the effects of future weather. These mean that absolute precision in locating the optimum is less important than locating alternatives that can be compared for their robustness across several scenarios for future events. The DESSAC system provides multiple weather sets for each site, allowing such assessments to be made. At present this has to be done manually, by selecting each set in turn, but this will be automated in future.
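One way such a robustness comparison might be automated is sketched below. It is an assumption about how the assessment could be organised, not a description of DESSAC: the function `robustness_table`, the plan and weather-set labels and all margins are hypothetical.

```python
def robustness_table(plans, weather_sets, margin):
    """Evaluate each plan under every weather set and rank by worst-case margin."""
    rows = []
    for plan in plans:
        margins = [margin(plan, weather) for weather in weather_sets]
        rows.append({
            "plan": plan,
            "mean": sum(margins) / len(margins),
            "worst": min(margins),
        })
    return sorted(rows, key=lambda row: row["worst"], reverse=True)


# Hypothetical margins (£ per hectare) for two plans under three weather sets.
hypothetical_margins = {
    ("A", "dry"): 440, ("A", "average"): 435, ("A", "wet"): 380,
    ("B", "dry"): 420, ("B", "average"): 418, ("B", "wet"): 410,
}


def lookup_margin(plan, weather):
    """Stand-in for a run of the simulation model."""
    return hypothetical_margins[(plan, weather)]


ranked = robustness_table(["A", "B"], ["dry", "average", "wet"], lookup_margin)
for row in ranked:
    print(row)
```

In this made-up example plan A has the higher mean margin, but plan B has the better worst case, so a risk-averse user might prefer B.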


Table 4
Results from 5 runs of long-term optimisation with 2 sprays and mixtures up to 3 products using revised settings and 2-bit date encoding

Variety      Margin, £                     Difference from    Difference from
             Mean   Min   Max   Range      short-term mean    mean in Table 3
Brigadier    424    423   424   1          3                  13
Riband       427    423   429   6          2                  19
Consort      433    430   435   5          2                  13
Rialto       440    438   441   3          4                  11
Claire       468    468   468   0          0                  6

Table 3
Results from 5 runs of long-term optimisation with 2 sprays and mixtures up to 3 products using original settings and 2-bit date encoding

Variety      Margin, £                     Difference from
             Mean   Min   Max   Range      short-term mean
Brigadier    411    399   418   19         −10
Riband       408    398   417   19         −7
Consort      420    415   422   7          −11
Rialto       429    423   437   14         −7
Claire       462    459   467   8          −6

Table 5
Results from 5 runs of long-term optimisation with 2 sprays and mixtures up to 3 products using revised settings and 3-bit date encoding

Variety      Margin, £                     Difference from
             Mean   Min   Max   Range      mean in Table 4
Brigadier    514    512   516   4          90
Riband       509    508   511   3          82
Consort      518    516   519   3          85
Rialto       522    519   524   5          82
Claire       536    536   537   1          68
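The final column of Table 5 can be recomputed from the tabulated means, as in the short sketch below. The relative gains it prints are a derived quantity, not values reported in the source, and the function name `gains` is hypothetical.

```python
# Mean margins (£) taken from Tables 4 and 5.
table4_mean = {"Brigadier": 424, "Riband": 427, "Consort": 433,
               "Rialto": 440, "Claire": 468}
table5_mean = {"Brigadier": 514, "Riband": 509, "Consort": 518,
               "Rialto": 522, "Claire": 536}


def gains(before, after):
    """Absolute and relative change in mean margin for each variety."""
    return {
        v: {
            "difference": after[v] - before[v],
            "percent": 100.0 * (after[v] - before[v]) / before[v],
        }
        for v in before
    }


result = gains(table4_mean, table5_mean)
for variety, g in result.items():
    print(f"{variety:10s} {g['difference']:4d} {g['percent']:5.1f}%")
```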


4.1. Field trials

During development the system underwent 2 years of replicated plot trials on eight sites across the UK. In the first year, it was found to give a satisfactory selection of chemicals on most sites, but tended to spray slightly late on many of them, which was not sufficiently risk-averse where disease pressures were high. This was due to the performance of the underlying process model in its first year of trials. Subsequent revisions have improved performance in more recent trials. In the 2002 harvest season, trials intended to represent farm practice were conducted at six sites in the UK and Ireland, mainly on split fields, including two at one site (Paveley, N. personal communication). In comparison with the reference fields managed according to good agronomic practice, WDM achieved a higher yield on all fields (mean difference 0.5 t ha−1) and a higher margin on five of the seven (mean difference £7 ha−1). Simultaneous trials in Denmark also gave good results, but are omitted from the means above because the protocols were different.

4.2. User trials

Potential users of the system have been involved throughout its design and development to ensure that it meets their needs. Early designs were based on the results of extensive user interviews, and these were then altered in response to user comment and tested again. This was repeated throughout the project. In parallel with the field trials there have been 2 years of usability trials of prototypes, involving over 60 farmers and consultants in the second season. In total, over 200 individuals have taken part in the evaluation of the software.

Users have sometimes been surprised by the suggestions provided by the optimisation, or perceived a preference for certain chemicals. However, the suggestions were rarely bettered by manual adjustments to the spray programme. This confirmed that the optimisation was accurately reflecting the process model and showed a difference between intuition and the results of the model. The one feature of the genetic algorithm itself that caused comment was the fact that the list of solutions could differ on successive identical runs due to its stochastic nature. To prevent this, the random number generator is now reseeded with a fixed value at the start of each optimisation.
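The effect of reseeding with a fixed value can be shown with a minimal sketch. The seed value and the stand-in function `run_optimisation` are hypothetical; the source does not state the seed used or how the reseeding is implemented.

```python
import random

FIXED_SEED = 12345  # hypothetical value; the source does not give the seed


def run_optimisation(n_plans=5, seed=FIXED_SEED):
    """Stand-in for a stochastic optimiser: returns pseudo-random margins."""
    rng = random.Random(seed)  # generator reseeded at the start of every run
    return [round(rng.uniform(400, 450), 1) for _ in range(n_plans)]


first = run_optimisation()
second = run_optimisation()
print(first == second)
```

Because each run starts from the same generator state, identical inputs give an identical list of solutions, at the cost of exploring only one random trajectory per problem.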

5. Conclusions

The use of a genetic algorithm in a decision support system for the selection of fungicide spray plans in both the long-term strategic planning and short-term tactical revision modes of operation meets the requirement to produce a list of optimal and near-optimal programmes from which the user can select their preferred option. The performance of the algorithm has been demonstrated in systematic testing and the system has performed successfully in field trials.

In order to test the algorithm for optimisation problems that were too large for exhaustive search methods to provide a benchmark, a method based on using the variation between randomly seeded runs as a measure of robustness was developed.

It was found that substantial changes in the configuration of the genetic algorithm were required to achieve good, robust performance in the long-term and short-term modes. This illustrates the need to tailor the algorithm to the problem and to test the options carefully.

Acknowledgements

The DESSAC project was supported by the Ministry of Agriculture, Fisheries and Food (now Defra) through a number of programmes including LINK ‘Technologies for Sustainable Systems’ with joint funding from the Home-Grown Cereals Authority, the Biotechnology and Biological Sciences Research Council and Farmplan Computer Systems.

The genetic algorithm software used in this project is the SUGAL package written by Dr. Andrew Hunter at the University of Sunderland, England.


