representation chapter 4, essentials of metaheuristics, 2013 spring, 2014 metaheuristics byung-hyun...
TRANSCRIPT
Representation
Chapter 4, Essentials of Metaheuristics, 2013
Spring, 2014
Metaheuristics
Byung-Hyun Ha
R1
2
Outline
Introduction
Vectors
Directed encoded graphs
Trees and Genetic Programming
Lists
Rulesets
Bloat
Summary
3
Introduction
Representation of individual Approach to construct, tweak, and present individual for fitness
assessment
Metaheuristics as general framework Mostly, only representation differs with regard to different problems
4
Introduction
Examples of representation TSP
• Permutation-based (order-based)• e.g., 4-2-3-1 2-3-1-4 3-1-4-2 1-4-2-3
• Locus-based• The ith element represents the city following city i in the tour.• e.g., 4/3/1/2 ( 4-2-3-1, permutation-based)
• Random-key• 0.78:0.56:0.69:0.11 ( 4-2-3-1, permutation-based)
VRP (vehicle routing problems)• Using separator
• 6-9-0-2-4-7-5-0-8-1-3• Mutation and crossover?
Encoding and decoding
source: http://neo.lcc.uma.es/cEA-web/VRP.htmPhenotype Genotype
encoding
decoding
Tweak
5
Introduction
Tweak in representation Phenotype (E) genotype Tweak genotype (D) phenotype Determining fitness landscape
• Example: Hamming cliff and gray coding
Remember small change!• It can help metaheuristics, usually.
6
Introduction
Much of representation is an art, not a science! e.g., workflow (business process)
• How to encode and tweak?
source: http://www.tonymarston.net/php-mysql/workflow.html
7
Introduction
Properties, required (Talbi, 2009) Completeness
• All solutions should be represented
Connexity• A search path must exist between any two solutions (i.e., to global optimum)
Efficiency• Easiness to manipulate
Representation-solution mapping (Talbi, 2009) One-to-one Many-to-one
• Redundancy will enlarge the size of search space.
One-to-many (indirect encoding)• A good solution should be constructed from an individual.
8
Vectors
Initialization and bias Not difficult to initialize
• Some totally-random initialization method (covered already)
Bias?• e.g., solution for robot walking using heuristic (e.g., by motion capture)• But diversity is useful, particularly early on.• Some suggestions
1) Biasing is dangerous.
2) Start with values that aren’t all or exactly based on heuristic bias
Mutation Examples
• Gaussian convolution, bit-flip mutation, ...• Integer vector: Integer Randomization Mutation, Random Walk Mutation, ...
c.f., point mutation• Useful when there is less chance to get improvement by changing several
genes at a time• But, can be trapped in local optimum, e.g.,
9
Vectors
Recombination One- and Two-point Crossover, Uniform Crossover Line Recombination, Intermediate Recombination ...
Heterogeneous vectors? e.g., a function with real parameters and integer parameters
Phenotype-specific mutation or crossover e.g., Jung & Moon, The Natural Crossover for the 2D Euclidean TSP,
2002 Consider fitness landscape.
10
Directed Encoded Graphs
Graphs Examples
• Neural networks, finite-state automata, Petri nets, electrical circuits, ...
Types• Directed, undirected, with labels, with weights, cyclic, acyclic, recurrent, feed-
forward, sparse, dense, planar, ...• Those are constraints respecting Tweak.
Arbitrary-structured graph Our target of graph representation
Types of encoding Direct encoding
• Exact node and edge description in representation
Indirect (developmental) encoding• Some (production) rule to constructing graph, as a solution (discussed later)
11
Directed Encoded Graphs
Full adjacent matrix e.g., a recurrent directed graph structure, with
• no more than 5 nodes• no more than one edge between any two node• self-edges allowed• weights for edges
Mutation examples• One vector approach
• Algorithm 45. Gaussian Convolution Respecting Zeros• Using two vectors
• One for on/off, the other for weights
12
Directed Encoded Graphs
Arbitrary graph structure Initialization of graph (N, E)
• Determination of number of nodes and edges• e.g., using geometric distribution
• Creation of a node and an edge, depending on type of target graph
13
Directed Encoded Graphs
Arbitrary graph structure (cont’d) Further considerations in initialization
• e.g., connected and directed acyclic graph• c.f., general algorithms textbook
Mutation• e.g., do one of the followings, random number of times
• delete a random edge• add a random edge• delete a node and all its edges• add a node• relabel a node• relabel an edge
Recombination• c.f., goal of crossover is to transfer essential and useful elements to another• Determining elements to transfer
• Selecting subset of nodes and edges, or selecting subgraph• Coping with missing target of edge and with disjoint
14
Directed Encoded Graphs
Arbitrary graph structure (cont’d) Recombination (cont’d)
15
Directed Encoded Graphs
Vector vs. graph representation e.g., Relocation of containers in a bay for efficient loading
• Solution as a list of movements• e.g., (1-2), (3-2), (4-5), (6-5), (4-7), (4-6)• Weakness?
• Solution as a graph
a
b
c
d e
f
16
Trees and Genetic Programming
Genetic Programming How to use stochastic methods to search for and optimize small comput
er programs or other computational devices Concept of suboptimality, required
• Not simply right or wrong
Examples• Team soccer robot behavior, fitting math. equation to data set, finding finite-st
ate automata which matching given language
Representation Lists or trees, usually
• e.g., an artificial ant, sin(cos(x – sin x) + xx) for symbolic regression
17
Trees and Genetic Programming
Primitives in representation Basic functions (e.g., kick-toward-goal) or CPU operations (e.g., +) Constraints of context
• e.g., 4 + kick-toward-goal(), no sense• e.g., matrix-multiply, expecting exactly two children and ...
Tweaks need to maintain closure (valid individuals)
Fitness assessment Conversion data (genotype) to code (phenotype), and evaluate Examples
• Symbolic regression: sum of squared errors• Artificial ant: amount of food eaten
Tree-Style Genetic Programming Pipeline Sec. 3.3.3 One of popular algorithm for Genetic Programming (but not limited to)
18
Trees and Genetic Programming
Initialization New trees by repeatedly selecting from a function set
• Considering arity (predefined number of children)• e.g., Grow, Full, Ramped Half-and-Half, PTC2 algorithms
Ephemeral random constants• Handling constants for leaves (e.g., 0.2462, 0.9, –2.34, 3.14, “s%&e:m”)• Special leaf nodes to be transformed into randomly-generated constant
19
Trees and Genetic Programming
Recombination e.g., subtree crossover: swap two selected subtrees
• Non-homologous (i.e., highly mutative)
homologous: individual crossing over with itself will make copies of itself
Mutation Examples
• Subtree mutation: replacing random subtree with randomly-generated one• Replacing random non-leaf node with one of its subtrees• Picking random non-leaf node and swapping its subtrees• Mutating ephemeral random constants by introducing some noise• Swapping two disjoint subtrees
c.f., not popular because usually crossover is non-homologous
20
Trees and Genetic Programming
Forests e.g., forest of soccer robot team with each member as tree
Automatically defined functions (ADF) Not predefined functions but trees called by primary tree c.f., Modularity
• In case that we believe a good solution has repetitive part
Strongly-Typed Genetic Programming
21
Trees and Genetic Programming
Cellular encoding Indirect encoding (developmental encoding)
22
Lists
Grammatical Evolution: using predefined grammar for tree Trees generated by lists (indirect encoding)
• c.f., http://en.wikipedia.org/wiki/Backus-Naur_form
Pros and cons• Almost always valid tree, reduced size of search space• Tiny changes early in list result in gigantic changes (un-smoothness).
23
Rulesets
A policy as solution of problem Consisting of a set of rules e.g., stock trading program, entities in simulations
State-action rules Typical form
• a b ... y z• e.g., (left sonar value > 3.2) (forward sonar value 5.0) (turn left to 50)
An interpretation• Mapping from state space into actions
Under-specification and over-specification• Default rules, vote, ...
Fitness assessment• On a ruleset, or on a series of rules
24
Rulesets
Production rules Typical form
• a b c ... z Modular indirect encoding
• Describing large complex solution with lots of repetitions by small and compact rule (search) space
e.g., 8-node directed unlabeled graph structure as solution
25
Rulesets
Production rules (cont’d) e.g., Lindenmayer systems (L-systems)
• e.g., Koch Curve• F F + F – F – F + F• F: draw a line forward, +: turn left, –: turn right
F
F+F-F-F+F
F+F-F-F+F+F+F-F-F+F-F+F-F-F+F-F+F-F-F+F+F+F-F-F+F
F+F-F-F+F+F+F-F-F+F-F+F-F-F+F-F+F-F-F+F+F+F-F-F+F+F+F-F-F+F+F+F-F-F+F-F+F-F-F+F-F+F-F-F+F+F+F-F-F+F-F+F-F-F+F+F+F-F-F+F-F+F-F-F+F-F+F-F-F+F+F+F-F-F+F-F+F-F-F+F+F+F-F-F+F-F+F-F-F+F-F+F-F-F+F+F+F-F-F+F+F+F-F-F+F+F+F-F-F+F-F+F-F-F+F-F+F-F-F+F+F+F-F-F+F
26
Bloat
Code bloat or code growth A problem with variable-sized representation Far from optimum usually, memory consumption, ... and ugly
Common ways of handling Limiting size when individual is Tweaked Editing individual, to remove introns and the like Punishing individual for being very large
• e.g., linear parsimony pressure (problem?)• revised fitness f = r – (1 – )s, where r: fitness, s: size of individual
• e.g., non-parametric parsimony pressure
27
Summary
Phenotype & genotype
Encoding & decoding
Representations Vectors Graphs
+ Indirect-encoded graphs (edge encoding)
Trees+ Indirect-encoded trees (Grammatical Evolution)
Lists Rulesets
Bloat