modelling hybrid control systems with behaviour networks
DESCRIPTION
Modelling Hybrid Control Systems with Behaviour Networks. Pierangelo Dell’Acqua Anna Lombardi Dept. of Science and Technology - ITN Linköping University, Sweden L. M. Pereira Centro de Inteligência Artificial - CENTRIA Universidade Nova de Lisboa, Portugal. September, 2005. - PowerPoint PPT PresentationTRANSCRIPT
Modelling Hybrid Control Systems with Behaviour
Networks
Pierangelo Dell’AcquaAnna Lombardi
Dept. of Science and Technology - ITNLinköping University, Sweden
L. M. Pereira
Centro de Inteligência Artificial - CENTRIAUniversidade Nova de Lisboa, Portugal
ICINCO 05, Barcelona Spain
September, 2005September ‘05
Problem
We present an approach to model adaptive, dynamic hybrid
control systems based on behaviour networks.
• We extend/modify behaviour networks to make them self-
adaptive and dynamic.
• The approach is general, reconfigurable, and suitable for
environments that are dynamic and too complex to be
entirely predictable.
Outline
1. Hybrid control systems
2. Behaviour networks
3. Extended/modified behaviour networks
4. Modelling hybrid control systems
5. Discussion and future work
1. Hybrid control systems
Controller: discrete-time system
Interface
Plant: time-continuous system to be controlled
(possibly traditional controllers)
Generator Actuator
Plant
Controller
x(t)
x(tk) r(tk)
r(t)
Closed-loop configuration
Plant: described by differential equations
x(t) - state vector
r(t) - input vector
Generator Actuator
Plant
Controller
x(t)
x(tk) r(tk)
r(t)
r(t))f(x(t),(t)x
Generator Actuator
Plant
Controller
x(t)
x(tk) r(tk)
r(t)
Controller: described by difference
equations
s(tk+1) = ( s(tk), x(tk) )
r(tk) = ( s(tk) )
x(tk) – plant events
r(tk) - actions
Interface
Generator: converts continuous-time output of the plant into symbols
x(t) x(tk)
Actuator: receives symbols (actions) from the controller and converts them into an input to the plant
r(tk) r(t)
Generator Actuator
Plant
Controller
x(t)
x(tk) r(tk)
r(t)
2. Behaviour networks
Introduced by P. Maes in ‘89 for action selection in dynamic and
complex environments where the system has limited computational
and time resources.
A behaviour network is a network composed of specific competence
modules (rules) which activate and inhibit one another along the
links of the network.
• The activation/inhibition dynamics of the network is guided by
global parameters.
• Competence modules cooperate in such a way that the network
as a whole functions properly.
• This architecture is distributed, modular, robust and has an
emergent global functionality.
3. Extended/modified behaviour networks
To model hybrid control systems, we extended behaviour
networks to allow for:
• rules containing variables;
• internal actions;
• integrity constraints, and
• modules (sets of atoms and rules).
An extended behaviour network consists of 5 modules:
R - a set of rules formalizing the behaviour of the network
P - set of global parameters
H - internal memory
C - set of integrity constraints
G - set of goals
We call the state of the network the tuple S=(R,P,H,C,G).
We assume given a module Math formalizing the axioms of
elementary mathematics.
Atom takes the form:
m:X meaning that X belongs to module M whose name is m
m÷X meaning that X does not belong to M
Sequence of atoms:
Atom1 , Atom2 , . . . , Atomn (n 0)
# denotes the empty sequence
Goal is a sequence of atoms.
It expresses a condition to be achieved.
Integrity constraint is a sequence of atoms.
It expresses a condition that must not hold.
Language (simplified)
Rule is a tuple of the form: < prec; del; add; action; >
prec - preconditions
del, add - represent the internal effect of the rule
action - atom representing the external effect of the rule
- level of strength of the rule
Variables in rules are universally quantified:
• < h÷on, e:temp(x), math:x<20; # ; h:on; heating(on); 0.5 >
E denotes the environment
A substitution is a finite set of bindings: variable/expression
A substitution can be applied to an expression X, written as X, by
simultaneously replacing any variable v in X with t, for every binding
v/t in .
• if X = h:q(x,y,z,c) and = {x/3, z/6}, then X= h:q(3,y,6,c)
Two expressions X and Y are unifiable (i.e. two-way pattern matched),
written as XY, if there exists a substitution such that X=Y.
Substitutions can be applied to rules:
• if r = < h÷on, e:temp(x), math:x<20; # ; h:on; heating(on); 0.5 >
and = {x/12}, then
r = < h÷on, e:temp(12), math:12<20; # ; h:on; heating(on); 0.5 >
At every state, a rule in R has to be selected for execution.
A sequence L of atoms is true at a state S iff:
• for every m:X in L, XM and
• for every m÷X in L, there exists no for which XM.
A state S is safe if there exists no that makes an integrity
constraint in C true at S.
Executable rules
Applying a rule r=<prec; del; add; action; >
S=(M1,M2,M3,M4,M5) r(S)=(M’1,M’2,M’3,M’4,M’5)
every M’i is obtained from Mi :
• by removing X for every mi:X in del, and
• by adding Y for every mi:Y in add.
A rule r is executable at S iff:
• prec is true at S,
• r(S) is safe, and
• action is an atom with no variables.
r
Rules in a network are linked by three types of links.
Let x and y be rules.
Successor link x y
for every m:X in the add list of x and m:Y in the prec list of y such that XY
Predecessor link x y
for every successor link from y to x
Conflict link x y
for every m:X in the prec list of x and m:Y in the del list of y such that XY
Rule selection
Rules use these links to activate/inhibit each other.
Forward propagation
• There is input of activation energy coming from S and E towards
rules whose preconditions partially match them.
Backward propagation
• There is input of activation energy from G towards rules whose add
lists partially match G.
Inhibition
• There is inhibition by the goals that have been achieved. They
remove some activation energy from the rules that would undo
them.
Rules also inhibit and activate each other along the links in the
network.
The mathematical model is based on several global parameters in P
that are used to tune the spreading of activation energy through the
network:
- threshold of rules to become active,
- amount of energy that a true atom injects into the network,
- amount of energy that a goal injects into the network,
- amount of energy that achieved goals take away from the
network,
... etc.
Let r be a rule in R and a substitution.
The rule r becomes active when:
• it is executable,
• its activation level overcomes ,
• has the greatest activation level among all other executable
rules.
When an active rule has been executed, its activation level is set
to 0.
4. Modelling hybrid control systems
An adaptive, dynamic hybrid controller can be described by an
extended behaviour network.
P
RH Math
C
G
E Actuator
CU
The basic engine of CU can be described via the following cycle:
Cycle(n, R, P, H, C, G)
1. Load the rules of R into CU and calculate their activation level wrt. the global parameters in P.
2. If one rule becomes active, then execute its internal effect, and send its external effect (its action) to the actuator.
Let R', P', H', C', G' be the modules after the execution of the rule.
Cycle with (n+1, R', P', H', C', G').
3. If no rule becomes active, then lower the level of in P.
Cycle with (n+1, R, P', H, C, G).
Consider a virtual marine world inhabited by a variety of fish.
• Fish are situated in the environment, and sense and act over
it.
• The behaviour of a fish is reduced to search and eating food,
escaping and sleeping.
Actions of the fish: search, eat, escape and sleep.
Example: artificial fish
Modelling an artificial fish
Stimuli of the fish:
• hungry:
• fear:
• tired:
Input vector to the controller State of the controller
hungry(tk) food(tk)
x(tk) = fear(tk) s(tk) = satiated(tk)
tired(tk) safe(tk)
C = { } no constraints
G = { h:safe, h:satiated }
Behaviour of the fish:
< e:hungry(x), math:x>0.5, h÷food; # ; h:food; search; 0.5 >
< e:hungry(x), math:x>0.5, h:food; # ; h:satiated; eat; 0.5 >
< e:tired; # ; # ; sleep; 0.5 >
< e:fear(x), math:x>0.5; # ; h:safe; escape; 0.7 >
E - stimuli of the fish
H - internal state of the fish
R =
• Implementation: XSB Prolog + Java
• Extension of L to allow variables in the strength level of rules:
< h÷on, e:temp(x),math:x>20; # ; h:on; heating(on); 0.5*(20-x) >
• Engineering network of behaviour networks to model complex
environments
• Techniques developed in Logic Programming can be employed:
- preference reasoning to enhance action selection
- genetic algorithms to tune the global parameters of the network
- belief revision to resolve cases of conflicting rules if more rules
can become active
5. Future work