numerical solution of nonlinear 712 and jl@ control ...helton/billspapersscanned/hhk00.pdf ·...

14
Numerical Solution of Nonlinear 712 andJl@ Control Problems with Applicationto JetEngine Compressors Michael Hardt, Member IEEE, J. William Helton, Menrbe4IEEE, and Kenneth Kreutz-Delgado, Member IEEE Abstract-We describe an effective numerical approach to solring nonlinear 7(z or 7{- optimal control problems. Our principal goal will be to use this approach to solve the important problem of jet engine compressor control. The technique is demonstrated first with the tutorial example of the control of a pendulum. We then apply the numerical approach to the problem of controlling jet engine compressor stall and surge instabilities (three-dimensional Moore-Greitzer model) while imposing satu- ration constraints. Standard in this model is a curve of equilibria along which one may operate the engine. Here, the instabilities are hardest to control near the highest performance equilibria. Our numerical results tell us rather dramatically which equilibrium one can optimally control to and which are unmanageable. The magnitude of the rate saturation constraint on the controller turns out to dominate this phenomenon. We choose a high-performance manageable equilibrium E and compute the ?12 optimal larv which will control the system to E. We then describe plots which allow one to find a neighborhood of the equilibrium within which the closedloop system is guaranteed to remain. The technique should work with little modification in dimensions 4 and 5, at which point the "curse of dimensionality" forces restrictions. Index Terms-H-in{inity control, H-in{inity optimization, jet engines, nonlinear H2 control, numerical control, optimization methods. I. INTRoDUCTIoN tTt HE continuedpopularity of '112 and?l- control method- ! ologiesis in largepart due to their very good stabilization androbustness properties. There aremanytypes ofcontrol prob- lems which fit naturally into the '112 or Hn framework such as stabilization problems,trajectory tracking, risk-sensitive prob- lems, and disturbance attentuation. In the nonlinear regimes, computational methodsfor solving theseproblems are still in their early stages, and there is no generally accepted format for the best approach to solving them. We pursue two such ap- proachesfor solving nbnlinear '112 and 7l- control problems which we feel are well suited to handling saturation and other constraints of the problem, are efficient, and can accomodate higherdimensional systems to someextent. We usea combina- tion of optimization and multiple shootingalgorithmsto solve for a good approximationof the value function of the system throughout the state space. From this, one may readily define a Irlanuscript received December30,1997t revisedDecember 17, 1998.Rec- ommended by Associate Editor, K. Wise. This work was supported in part by the .\FOSR and the NSF. I\1. Hardt and K. Kreutz-Delgado are wilh the Department of Electrical and ComputerEngineering, UniversityofCalifornia, SanDiego, La Jolla,CA 92093 US.A, (e-mail: helton @ ucsd.edu). J. W. Helton is with the Department of Mathematics. University of Califomia, San Diego, La Jolla, CA 92093 USA. Publisher Item ldentifi er S I 063-6536(00)0 I 020-4. IEEETRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY. VOL. 8. NO. I. JANUARY2OOO closed-loop control law which is the numerical solution to the nonlinear712or 11* control problems. Also in SectionIY for purposes of completeness, we mention some other numerical methods which serve as alternative approaches. We illustrate these methods first in the testcase of a swinging pendulum. The underlying purposein the presentation of the pendulum test case is to clarify the use of the proposednu- merical approach with a simple example.Important subtleties of numericaloptimal control also become very evidenteven in this case. For example, theremay exist initial conditionsof the systemfor which thereexist two completelydifferent, equally optimal solutions.This can lead to situations where a small changein the initial conditionscan lead to radically different solution trajectories. Finally, we apply these numerical methods to the control of a jet engine compressor.The three-dimensional(3-D) Moore-Greitzer model is usedas a basisfor studyingthe com- pressor system, and we usethrottlefeedback control to robustly stabilize the system under the constraintsand performance criteria imposed. We calculate the solution of the nonlinear ?12 control problem subjectto rate saturation constraints and discuss considerations and differentapproaches to the problem in light of undesirable transient behaviorwhich can arise.The transient behavior of special concern is a large drop in the plenum pressure rise V, and these are punished by the H2 performance criterion. Our studies indicatethat the choice of performance index or cost function to be optimizedis largely secondary to the strong effects of actuator rate saturation.The strong influence of rate saturation is bestdisplayed later with Fig. 8 in our discussion of the jet enginecompressor. We describe how one may construct a globally optimal controller which meets these rateconstraints. The performanceof our final controller is then graphically dis- played with contour plots of the resulting value function and drops in the plenum pressure in different operating regions.This work represents the first time']12 optimal controllers were com- puted for this compressor problem. Much researchconducted lately has used throttle feedback and the 3-D Moore-Greitzer model to control the severeinsta- bilities present in a jet engine compressor. Liaw and Abed [l 9] used a throttle-basedlinear feedback law to develop a bifurca- tion-basedcontroller. This control law required sensingof only one of the state variables,namely the squaredamplitude of ro- tating stall,andit had local stabilityresults. Evekerer a/. [9] later extended this analysis to design a controllerwith a much larger operating region. It required,howeveq measurement of the time derivative of the massflow coefficient (D. Krstic er a/.[l5] then developed a backstepping controller which guaranteedglobal 1063-6536/00$10.00 @2000IEEE

Upload: others

Post on 31-Mar-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

Numerical Solution of Nonlinear 712 and Jl@ ControlProblems with Application to Jet Engine Compressors

Michael Hardt, Member IEEE, J. William Helton, Menrbe4IEEE, and Kenneth Kreutz-Delgado, Member IEEE

Abstract-We describe an effective numerical approach tosolring nonlinear 7(z or 7{- optimal control problems. Ourprincipal goal will be to use this approach to solve the importantproblem of jet engine compressor control. The technique isdemonstrated first with the tutorial example of the control of apendulum. We then apply the numerical approach to the problemof controlling jet engine compressor stall and surge instabilities(three-dimensional Moore-Greitzer model) while imposing satu-ration constraints. Standard in this model is a curve of equilibriaalong which one may operate the engine. Here, the instabilities arehardest to control near the highest performance equilibria. Ournumerical results tell us rather dramatically which equilibriumone can optimally control to and which are unmanageable. Themagnitude of the rate saturation constraint on the controller turnsout to dominate this phenomenon. We choose a high-performancemanageable equilibrium E and compute the ?12 optimal larvwhich will control the system to E. We then describe plots whichallow one to find a neighborhood of the equilibrium within whichthe closedloop system is guaranteed to remain. The techniqueshould work with little modification in dimensions 4 and 5, atwhich point the "curse of dimensionality" forces restrictions.

Index Terms-H-in{inity control, H-in{inity optimization, jetengines, nonlinear H2 control, numerical control, optimizationmethods.

I. INTRoDUCTIoN

tTt HE continued popularity of '112 and ?l- control method-

! ologies is in large part due to their very good stabilizationand robustness properties. There are many types ofcontrol prob-lems which fit naturally into the '112 or Hn framework such asstabilization problems, trajectory tracking, risk-sensitive prob-lems, and disturbance attentuation. In the nonlinear regimes,computational methods for solving these problems are still intheir early stages, and there is no generally accepted formatfor the best approach to solving them. We pursue two such ap-proaches for solving nbnlinear

'112 and 7l- control problemswhich we feel are well suited to handling saturation and otherconstraints of the problem, are efficient, and can accomodatehigher dimensional systems to some extent. We use a combina-tion of optimization and multiple shooting algorithms to solvefor a good approximation of the value function of the systemthroughout the state space. From this, one may readily define a

Irlanuscript received December 30,1997t revised December 17, 1998. Rec-ommended by Associate Editor, K. Wise. This work was supported in part bythe .\FOSR and the NSF.

I\1. Hardt and K. Kreutz-Delgado are wilh the Department of Electrical andComputerEngineering, University ofCalifornia, San Diego, La Jolla, CA 92093US.A, (e-mail: helton @ ucsd.edu).

J. W. Helton is with the Department of Mathematics. University of Califomia,San Diego, La Jolla, CA 92093 USA.

Publisher Item ldentifi er S I 063-6536(00)0 I 020-4.

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY. VOL. 8. NO. I. JANUARY 2OOO

closed-loop control law which is the numerical solution to thenonlinear 712 or 11* control problems. Also in Section IY forpurposes of completeness, we mention some other numericalmethods which serve as alternative approaches.

We illustrate these methods first in the test case of a swingingpendulum. The underlying purpose in the presentation of thependulum test case is to clarify the use of the proposed nu-merical approach with a simple example. Important subtletiesof numerical optimal control also become very evident even inthis case. For example, there may exist initial conditions of thesystem for which there exist two completely different, equallyoptimal solutions. This can lead to situations where a smallchange in the initial conditions can lead to radically differentsolution trajectories.

Finally, we apply these numerical methods to the controlof a jet engine compressor. The three-dimensional (3-D)Moore-Greitzer model is used as a basis for studying the com-pressor system, and we use throttle feedback control to robustlystabilize the system under the constraints and performancecriteria imposed. We calculate the solution of the nonlinear?12 control problem subject to rate saturation constraints anddiscuss considerations and different approaches to the problemin light of undesirable transient behavior which can arise. Thetransient behavior of special concern is a large drop in theplenum pressure rise V, and these are punished by the H2performance criterion.

Our studies indicate that the choice of performance index orcost function to be optimized is largely secondary to the strongeffects of actuator rate saturation. The strong influence of ratesaturation is best displayed later with Fig. 8 in our discussion ofthe jet engine compressor. We describe how one may constructa globally optimal controller which meets these rate constraints.The performance of our final controller is then graphically dis-played with contour plots of the resulting value function anddrops in the plenum pressure in different operating regions. Thiswork represents the first time']12 optimal controllers were com-puted for this compressor problem.

Much research conducted lately has used throttle feedbackand the 3-D Moore-Greitzer model to control the severe insta-bilities present in a jet engine compressor. Liaw and Abed [l 9]used a throttle-based linear feedback law to develop a bifurca-tion-based controller. This control law required sensing of onlyone of the state variables, namely the squared amplitude of ro-tating stall, and it had local stability results. Eveker er a/. [9] laterextended this analysis to design a controller with a much largeroperating region. It required, howeveq measurement of the timederivative of the mass flow coefficient (D. Krstic er a/.[l5] thendeveloped a backstepping controller which guaranteed global

1063-6536/00$10.00 @ 2000 IEEE

Page 2: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

HARDT et cL: NUMERICAL SOLUTION OF NONLINEAR ft2 AND ?l- CONTROL PROBLEMS

stability, and it had less restrictive measurement requirements byonly requiring sensing of the mass flow (D and the plenum pres-sure rise V. It was even shown later [16] that this controller isinverse optimal with respect to a performance index. This resultimplies robustness and stability margins for the system. Addi-tional other altemative approaches to controlling the jet enginecompressor may be found in [ll, t2l, t5l, t201, and [22]. Theresults in this paper were first announced in [2].

The previous research mentioned did not expressly considerthe effects that rate saturation might have on the problem. How-ever, there is new research focusing primarily on this issue. In[29], a local approximation of the Moore-Greitzer dynamicsis taken at the equilibrium in order to obtain a system of re-duced dimension. The reduced system and controller dynamicsare then studied together using phase space techniques to under-stand the effects of saturation constraints. Two other methods forhandling rate limits in other nonlinear settings by scheduling orswitching linear control laws are found in [8] and [28].

II. NONLINEARHz AND ?I- PnosLTN4s

The class of state equations that will be considered will benonlinear smooth time-invariant havine the form

where n(t) € X is the state, tt (t) € [/ is the controlled input,u(t) e L2l},Tl. T > 0, is the disturbance input, and z arethe to-be-controlled outputs or penalties. The system diagram isshown in [F ig. l ] .

A. Nonlinear?72 Control and tlrc HJB PDE

In the nonlinear H2 control problem, no disturbances enterinto the model (l), rrr : 0. The problem lies in minimizing acost functional of the form

V(t : ) -

which is often called the value fwtction.For numerical purposes, it is possible to incorporate the state

equations of the system into the cost functional, whereby tradi-tional nonlinear optimization software may be applied to it. Onemay also solve the problem by defining a Hamiltonian functionH ( r , p )

H(,,p) - pro(r) - ) r ' t ' r {*) tT@)p+f,n"67n61 (3)

which in the'Jlz case is convex. The solution to the ?12 con-trol problem will satisfy a Hamilton-Jacobi-Bellman (HJB)

inequality defined by fl(r. dVr ldr) ( 0. In the numericalmethods to be discussed, we will be seeking the null Hamil-tonian solution where H(r,dVr ld:r)

- 0.

The solution of the control problem is thus "simplified" to the

solution of a first-order PDE.

Fig. l. System diagram.

B. Bicharacteristic ODE's

It has long been known that this class of PDE's is equiva-lent to a system of ordinary differential equations (ODE's) ofdouble the dimension [3]. In the case of (3), the correspondingboundary value problem is described by the Hamiltonian vectorfields drldt and dpldt. The vector p is denoted as the set ofadjoint variables for the system, and it is also equal to the trans-pose of the gradient of the value function, p : )IVT ldr. Theboundary value problem consists of the following "bicharacter-istic" equations:

. 0H , , aH ,i :

* ( r , n ) n - - - * ( x , n )

r(o) : so p(T): aeEQD . (4)OT

The Hamiltonian will be constant and equal to zero along anoptimal trajectory, thus solving the HJB inequality.

This boundary value problem characterizes solving the finitetime H2 or Hn control problem with no final state constraints.We take the approach in this paper of indirectly requiring thestate r;(t) to converge to a desired equilibrium r* by includinga penalty function ofthe squared error ofthe state from the equi-librium into the performance. This has the effect that the systemwill converge asymptotically to c* with speed determined bythe gains chosen.

C. Optimal State Feedback Law

The vectors r andp may be used to define the optimal control

u. : -bT@)p

along each optimal trajectory. The numerical values for x, p,

and u make up an open-loop solution as all variables will onlybe known as a function of time and only at a discrete set ofpoints along the trajectory. Once one has calculated many such

trajectories, one can begin to approximate the numerical func-

tion c r-+ u, known only on a gnd, by a function u'(r) given

for all z (at least all c in some neighborhood) by a formula.

This is done to interpolate between grid points, so one has a

control law which can be quickly applied to any state. Prob-

ably many approximation methods exist. One is the easy-to-useMatlab neural-network toolbox. Later in the paper we use this to

derive the optimal feedback controller for the jet engine controlproblem.

99

l : : lQ , . t t " .w ) : o ( r ; ) + b1 ( r \ r " *b2 ( t : ) u

, = lt '{, ') l ,r,L 'u l

',,i {} lo' l,{r)ll '

at + el,(r))} (2)

(5)

x = f(x,u,w)

Page 3: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

-]tD-

..

. 100

D. Norilinear'17* Control

IEEE TRANSACNONS ON CONTROL SYSTEMS TECHNOLOCY, VOL. 8, NO. I, JANUARY 2OOO

Fig. 2. Multiple shooting methodfor all disturbance functions u.'(') e L2I},T|, ? > 0. The con-

stant value 0("(O)) satisfies 0 S 0@) ( oo, and p(t*) = 0,

where r* is the equilibrium for the system. z(l) is the response

of the system for the initial condition u (0).

As in (2), one may define the nonlinear 7l- control problem

as an optimization of a performance index. Unlike the'172

problem, however, the optimization is of a minimax-type which

pt"u.ntt the use of standard nonlinear optimization software'

One may similarly define a Hamiltonian function H(*,p)'

which is now in general nonconvex, for the nonlinear ?l- con-

trol problem

H( t : . y t ) :

+ lrtrr

1r1tr1r,1.

The solution to this problem, as in the nonlinear ?12 control

problem, nray be obtained by solving for a Cl solution V(:r)

io the so-called Hamilton-Jacobi-Isaacs (HJI) inequality'

H(r:,rIVr f rl:r) ( 0. The solution may also be calculated by

solving a set ofbicharacteristic equations as in (4)'

lll. PnoposED NUN'IERICAL SoLUTIoN oF NoNLINEAR 712

AND ?l€ PRosl-eN4s

The numerical approach to be discussed consists of several

methods which may be used independently or in combination

for solving the HJB and, in some cases, the HJI equation' These

methods will calculate the open-loop solution to an optimal tra-

jectory. The first method to be discussed is limited in that it can

only solve the nonlinear ?12 optimal control problem'

A. Direct Minimizatiort

To solve the'l1z control problem numerically with the di-

rect minimization approach, one discretizes the state and con-

trol variables in time over the trajectory. One may then formu-

late the minimization of the cost functional subject to the dif-

ferential and output constraints as a finite-dimensional nonlin-

early constrained optimization problem. The convexity of the

?12 problem allows conventional Newton-based methods such

as siquential quadratic programming (SQP) to be used to solve

the pioblem. Also, the direct minimization approach will frnd

the solution to the HJB equality, H(r,dVT ldr(a)) = 0' The

program DIRCOL from O. von Stryk t25l' t261, which uses

ihe nonlinear optimization software NPSOL [11], takes this ap-

proach and, in uOaition, it provides accurate estimates of the ad-

joint variables p.

In the nonlinear ?l- control problem, the model used (l) con-

tuii, Zr_uounded disturbance inputs tu. The objective here is to

ina u "on,toller

which guarantees that the closed-loop system

ias an Lz_gain less than or equal to a prespecified constant level

I fto* the disturbance inputs u/ to the output penalty functions

z l27l-We write this f2-gain condition as

rT 1T

l' ll"e)ll' ,tt I t2 / llr(r)ll' at + fr(r(o)) (6)Jo Jo

yf aQ:) *ir'l)rrOlu;fL') - r,1(r)af (")]p

B. Multiple Slrcoting

The second method multiple shooting may be applied to ei-

ther the ?lz control problem or the 7l- problem. The equations

to be solved are the two-point or multipoint boundary value

problem described by (4). An important property of this ap-

proach is that it can also handle nonconvex problems'

The prescribed boundary conditions for the problem are ini-

tial values of the state r(0) and final values of the adjoint vari-

ables p(?). The traditional shooting method integrates the state

equations beginning at c(0) and from a guess for p(0) for a

prespecified time ?. When the numerical value obtained for

/1f; ftorn the integration does not match the boundary condi-

tion p(T) : 0,p(:u(T)) I 0t, a zero-finding problem may be set

up in terms of the constraints, and Newton's method may then

bi used to iterate on p(0) until the boundary condition is satis-

fied.The strong sensitivity of initial value problems, and the small

convergence region for the Newton's method make solving

these problems via simple shooting very difficult' A mul-

tiple-shooting algorithm, however, is more stable and robust to

nonlinearities. This procedure consists of dividing up the time

interval [0, ?] into a series of grid points [0, tr 't2, '' ' , ?], and

a shooting mithod is performed between successive grid points

(see Fig. 2;. A set of starting values for the state and adjoint

vectors is required at each grid point in time, and continuity

conditions for the solution trajectory introduce additional

interior boundary conditions which are then incorporated into

one large zero-finding problem to be solved.

The program MUMUS by P. Hiltmann [13] is a mul-

tiple-shooting code which can solve two-point or multipoint

boundary value problems using a modifed Newton method'

This method is, in general, quicker than direct minimization'

lending itself more readily to iterated calculations of trajec-

tories. A drawback, however, is that the adjoint equations

must be programmed as input to the code' For complicated

problems, generating the equations by hand or with symbolic

manipulation packages may pose a formidable task'

C. Shooting Out

While the multiple shooting method is commonly used for

solving a boundary value problem, one may alternatively obtain

optim;l trajectories by stalting near the desired equilibrium r"

and at selected values p, then integrating the state and adjoint

(7)

Page 4: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

r1

t

HARDT et oL: NUIVIERICAL SOLUTION OF NONLINEAR fi: AND H - CONTROL PROBLEMS

differential equations outwards. This process is called slnotittg

out.We are, thus, able to obtain optimal trajectories for the state

without having to solve a difficult two-point boundary value

problem. A set of starting values (a guess for the solution tra-jectory) is also not needed as is the case in direct minimization

and multiple shooting. This technique was successfully used by

McEneaney in [21].The disadvantage of the shooting out method is that one has

less control in reaching a specified set of states. Remaining re-

gions of the state space, that are not able to be mapped out with

optimal trajectories via shooting out, may be mapped with the

previous multiple shooting approach using as starting values the

optimal trajectories already obtained.

D. Combined Method

When one wishes to map the value function over a large re-

gion of the state space for purposes of approximating well the

optimal closed-loop controller, it is necessary to calculate a fair

number of optimal trajectories. We can use the superiority of

direct minimization or shooting otrt methods to generate sev-

eral of these trajectories when little or no prior knowledge of

the solution is available in order to acquire starting values for

the multiple shooting method when solving for the remainder

of the trajectories. The multiple shooting method works well

when many different optimal trajectories are sought after since

the algorithm is fast. and one may initialize each run from the

previous run. One has much more control in mappin-e the value

function than one might have with shooting out, and the method

is, in general, quicker than direct optimization methods.

E. Homotopl"

In order to compute the value function V and the optimal

control u*, we must compute a family of optimal trajectories

which approximately (maybe only coarsely) sweep out most of

the state space. Supposing we have computed an optimal trajec-

tory ?,, from a state ?;1 to the equilibrium :r;", to quickly com-pute trajectories ?} from many initial states x.', simply pick r:2

near ?:1 and use ?", to init ialize a MUMUS or DIRCOL com-putation of ?"r. We have found this proceeds quickly to give us

many optimal trajectories (see Section VI-E;. Henceforth, we

refer to this as homotopying trajectories, since the point is to

move them gradually.When using the multiple shooting method, one may similarly

homotopy the initial conditions of the state variables, using boththe state and adjoint trajectories previously calculated to serveas siarting val.res for the new boundary value problem to besolved. By calculating trajectories over the portion of state spaceof interest, one will know u* and V at points over this entirearea. Multivariate approximation methods may then be used tofind fi* approximating u* as a function of the state z.

After the calculations have been made, the approximation u*of the closed-loop controller may then be implemented for usern real time.

IV. OrHER Nultrtlcnl METHoDS

There are many numerical approaches which have been triedon HJB and HJI equalities and inequalities and listing them allis beyond us. We shall try to list some of the main ones.

A. Finite Dffirences

The method of finite differences has been used successfullyfor solving HJB and HJI problems of low dimension. For an ac-count of finite difference methods and their relation to Markovchains see Kushner and Dupuis [7]. Also Falcone [10] and the

thesis by Dower under James [8] have used extensively finite

differences in this setting. It is not clear how these would ex-

tend to high dimensions in a manageable way, since rectangulargrids have a rigid geometry.

B. "Small" Basis of Functiorrs

The idea in this approach is to select basis functions fi, ex'

pand the value function V as Llt:r.if i, then some work is

required to find the proper coeffrcients ci. The main difFtculty

lies in selecting the basis functions f . The common approach

in conventional numerical partial differential equations (PDE's)

(in dimension <4) is to use basis functions which are piecewise

polynomials. In principle, in order to solve a first-order PDE like

HJB's, they should be continuous. The teselating (such as tetra-

hedra) of space into pieces which match up is a serious problem.

Doing this flexibly in three dimensions took substantial study'

but ofcourse is well developed now. The advantage is that once

everything is set up. mesh refinement can be done on regions of

space which may otherwise seem treacherous. However, doing

this in say five dimensions may well be very diffrcult.

One can select basis functions in a rvay which is ad hoc and

much less dift ' icult. For example, Beard and Mclain [4] use

physical intuit ion coming from a particular plant model to se-

lect ten or l2 basis functions. For example, if the model contains

sines and cosines and powers of ;r;, then their f 's wil l too.

For control purposes, one need only solve HJI inequalities,

provided the solutions l/ satisfy conditions which make them

what are called viscosity subsolutions. lf continuity is not

required tbr l" and jumps are allowed, the problem becomes

easier, but the control law typically has jumps.

Once a basis of /, is selected, one can find the c;'s in several

ways. One is a Galerkin procedure for solving HJI equations'

described for example in [4]. Another is by directly substituting

l' = Il1, ci f i intothe HJI inequality to obtain inequalities of

the form

r t ( f i : j = 1 , . . . . M \ , 0 ) U 0 V rl /

I"' l ' z o'j : 1

Here, q is quadratic in the 1, and one elects some finite set of r

to yield finitely many constraints (crucial areas can be heavily

gridded). Various optimization schemes can be applied'

C. Partly Analytical Methods

Very intriguing is backstepping' which applies to special

classes of systems and obtains analytical expressions for V and

Page 5: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

TEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 8, NO. I, JANUARY 2OOO

the controller u. Thus, when it applies, it has the potential for

beating the curse of dimensionality. One advantage of obtaining

analytical expressions for V and u is that often the solution

need not be recomputed with changes in the parameter values

or choice of equilibrium.Backstepping does not approach saturation directly, howeveq

although a highly experienced practitioner may well come up

with compromises. This makes it difficult to compare our efforts

with backstepping, since in the jet engine compressor problem

considered here, we will see that rate saturation dominates'

Similarly, feedback linearization is not able to address satu-

ration directly.

D. Characteristics and Direct Minimization

The method of characteristics, as they pertain to multiple

shooting and shooting out, and direct minimization methods

were presented in the previous section. They seem to have the

most promise among methods for "fully" solving HJB equa-

tions. The shooting out or multiple shooting methods can be a

bit difficult to enter into a Program at this point, since the adjoint

equations can be complicated. Direct methods, while slower, are

comparatively easy to run. All of these methods find the value

function V(:r;) at each point r; along a curve. In delicate regions,

one can pick many :r:'s, otherwise only a few :t;'s may be neces-

sary to approximate V(;r:) well. A careful approach can, thus,

help combat the curse of dimensionality.

The final desired result is a "sirnple" formula to approxi-

mate the function V in order to obtain a control law' Traditional

Galerkin methods reverse this in that they layout the approxi-

mation scheme first (before having any information about V)'

Since the Galerkin scheme must mesh very well with the numer-

ical PDE one is solving, this is much harder to do than merely

approximating tr/ given as data on a grid.

E. Miscellaneous

Other approaches include the technique using viabiliry ker'

nels as promoted by P. Saint-Pierre [24]. Bertsekas and Tsitsiklis

also solve HJB and HJI equations for discrete problems in their

book [6].

V. Ilt-usrRlrtoN: PeNowutvt

In order to demonstrate the basics and subtleties of applying

these numerical techniques to a control problem' we present

here a brief tutorial example of the optimal control of a pen-

dulum. Already in this simple problem, one finds many numer-

ical particularities which are important. For example, we dis-

covei multiple slrcets of the pendulum. These are regions of

the state space where numerically calculated oPtimal trajecto-

ries will stay within when iterating on the initial conditions' Yet

as one progresses far enough along the sheets, it may be the case

that the calculated trajectories are no lon-eer optimal'

Later, when we deal with the more complex problem of jet

engine compressor control, we must, for example, be careful to

ruli out multiple sheets or to find them. As we shall see in that

case, extensive searching did not reveal any such anomalies'

The dynamical equations of motion of a pendulum are

m I 2 g + m g l s i n ( 0 ) = u

where d is the position angle of the pendulum with resPect to

the bottom "rest" position. The pendulum rod is assumed to be

massless of length I with a point mass of rn at its end, and g is

the gravity constant.Let c1 - 0 and x2 = d, and for simplicity let I : rn : l'

Then we write the state and output equations as

i t :xz z=f ! " - " r l l , r ,i 2 : - g s i u ( x 1 ) l u L ( " - " r ) J

' - '

where c; is a desired equilibrium of the pendulum and u1 is

the control effort at the equilibrium' We specify a weighted

quadratic cost functional to be optimized as

1 f Tn'in j / tll" - *rllb + ll" - "Al'nir

dt + p(r(T)) (e), z J o

where Q, S ) 0 and n > 0 arb square matrices of appropriatedimension, and the norm llzllfT satisfies zT Qz for some vector

z and matrix Q. The penalty function on the final state is equal

to e(r(")) = ; l l r (") - rAl2s.

We wish to solve the HJB equality (3), H(t:,'-.}i ') = u'

for V. From the Hamiltonian, we construct the bicharacteristicequatio,ls (4) with initial and final boundary conditions

r ' (0 ) - ; r ;6 , p (T) = *Uf rn = S( ' r ' (? ) - : t :7 ) '' U.t:

Once these are solved, a closed-form expression for the optimal

control u*(lr;) along.the bicharacteristic trajectory can be deter-

mined from , : '![(") and the extremal value for u*

u* = -R-7br ' ( t )p. ( 1 0 )

A. Nonlinear Hz Solutiott and Comparison of Numerical

Methods

We compare both the direct minimizatiott approach

(DIRCOL) and the multiple shooting method (MUMUS) in

our numerical experiments. Of interest are their respective

run-times, the effects of varying numbers of grid points (in

time), and their convergence properties'

With the multiple shooting code MUMUS, for example, it

was found that run-times are not as affected by the choice of

discretization as the optimization approach of DIRCOL' Also'

given a good set of starting values, MUMUS can reach the solu-

iion'r'.ty quickly. The difficulty with MUMUS lies in obtaining

,""tonutl" starting values, in particular, those for the adjoint

variables. For the pendulum, perhaps due to its low dimension

and relatively mild nonlinearities' MUMUS was able to con-

verge well even with the obvious choice of straight-line starting

values. When iteratively calculating many trajectories with large

numbers of grid points, MUMUS thus was the obvious choice'

The direci optimization package DIRCOL makes it much

easier for the user to defrne the problem and to enter in nonlinear

equality and inequality constraints when desired' The use of this

uipto".tt is very useful in a preliminary analysis of an optimal

"onttol problem, even one as simple as the pendulum' Also' the

Page 6: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

200

-1

x1

Fig. 3. Homotopy in .r 1 (0 ). (.r:: (0 ) = 0 ) along two sheets of pendulum. The dotred lines indicate the continuation of a rfteet along which the numerical methodswill continue to calculate solutions though they are no longer optimal.

HARDT er a/.: NUMERICAL SOLUTION OF NONLINEAR ftr AND ft- CONTROL PROBLEIIS

250

HJB equality in the nonline^r Ttz control problem may havemore than one solution. An optimization code will converge tothe stabilizing solution of the HJB equality, while it is not guar-anteed that the shooting code will do the same. Since for smallnumbers of grid points the run-times of the two methods are verysimilar, a natural approach is to solve the problem initially withDIRCOL orby shooting orrt, then to init ialize MUMUS with itsoutput. This was the combined method previously described inSection IILD.

B. Multiple Sheets of the Value Function

The pendulum has been a heavily studied test problem. Evenin our case, this example sheds light on some rather interestingproperties of optimal control and of our choice of numericalmethods. When applying the previously described homotopymethod, we discover sheets along which the multiple shootingmethod will continue to produce solutions to the 772 controlproblem, though the calculated trajectories may no longer beoptimal.

In our analysis, the state o1 r€presents the position anglewhich determines the configuration of the system and 12 is theangle velocity. In the coordinates chosen, frr : 0.0 refers to thebottom "rest" position of the pendulum, and the desired finalequi l ibr ium is chosen as rDJ - ( rn.nyz) = ( -1.5,0) . t

In order to calculate V and u, the homotopy method is usedwith the multiple shooting program MUMUS to sweep out alarge region of the state space, repeatedly calculating trajec-

tories with slightly varying initial conditions. We take the op-

timal trajectory with init ial condition (r1(0),22(0)) as an ini-

tialization for another run of MUMUS with initial condition

lAll ansles are in radians.

150

100

-2-30 L-6

(r1(0) * 61,:r:2(0) * 62). The choice of (61,62) wil l determinehow accurate the resulting value function and control law willbe. In our experiment, a homotopy was perforrned over a largeregion of state space, roughly -3.9 < x7 1 2.4, -Il <. x2 I14. The final time ?, to which the trajectories are calculated,was chosen so that all of the optimal trajectories converged towithin a small tolerance level of the equilibrium. Finding an ap-propriate ? required some trial and error.

Our experiment consisted of two phases. The initial phaseconsisted of a homotopy in 11(0) along the two primary sheetsof the pendulum. One sheet traverses the pendulum in the clock-wise direction iterating the initial condition ("r(0), 12(0) = g)

in the negative rr1 direction. Each run uses the previous run asits initialization. The same is done in the counter-clockwise di-

rection. Since 11 is periodic, eventually the homotopy will cal-culate trajectories on two different sheets corresponding to the

same initial condition. These two trajectories will be different

since they will correspond to traversing the pendulum in oppo-site directions to reach the desired final endpoint and; hence,they will have different values for the value function. Recall

from (2) that the value functionV(r) is the infrmum of a cost

functional over all possible trajectories. The "true" I/ will be

the lowest of all the values obtained, while the higher values are

thrown out as they are not "optimal." More details on combining

different calculated values for V to construct the true value func-

tion are described by McEneaney l2ll.Fig. 3 represents a vertical slice along the r1 axis of the value

function. On the vertical axis is the value function V which rep-

resents the cost in reaching the desired equilibrium r*. This cost

is defined by the quadratic performance (9). Note that since

rs is periodicin2r = 6.3, c - (-3.9,0) is the same point

as r : (2.4, 0). The value function has the same value there,

Page 7: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

Fig. 4. Value function of pendulum.

y(-3.9,0) : VQ.q,0) = 185. At th is point , i t is equal ly

costly for the pendulum to travel counter-clockwise as it is to

travel clockwise in order to reach the desired equilibrium. The

state space is not a simply connected region, and it is therefore

to be expected that there exist points from which two equally

costly and optimal paths exist to the desired final state.

In the second phase, we iterate the initial values of the

trajectories we wish to compute in such a manner as to keep

(.rr(0),:r:2(0)) near a level set of I/(r). This way of choosing

(:r1(0),:r:2(0))'s is not essential but easy to implement. In

Fig. 4, we have graphed the value function for V(r) < 185'

Using an approximation method, we then obtain the desired

closed-loop control law.

VI. JET ENCINT STNIL ENO SURGE CONTNOI

The three state Moore-Greitzer model of a jet engine com-

pressor has been frequently used in the literature to study the

severe instabilities found in this class of systems. Surge is the

name given to oscillations in the annulus averaged flow of the

compression system while rotating stall is a disturbance which

consists ofregions ofrecluced or reversed flow that rotate around

the annulus of the compressor [9] (See [Fig. 5]). The specifrc

model that we will be using is [15]

i b : -V+ i t r r . - 3O .R1 -

, i i : * (o+1- ry ' v )IJZ

'

i l= oR(I - a ' - " ) ,

R(o) > o

The meaning of the state variables is as follows:

O znnulus-svsraged mass flow coeffrcient;

{/ plenum pressure rise;R squared amplitude of circumferential flow

metry;i!" compressor characteristic relating pressure rise V in

the plenum to the mass flow iD;"f proportional to the throttle area (control);

0, o system-dePendent Parameters.

IEEE TRANSACTIONS ON CONTROL SYSTEI'S TEC}I}iOLOGY, VOL. 8, NO. I, JANUARY 2OOO

A common approach in constructing a model for the system is

to represent i!" as a cubic in iD uhich is only an approximationto the actual phenomenon. This is a hypothetical approximationto the true compressor characteristic since the region (iD < 1) is

unstable and cannot be physicalli'observed, while in the stable

region (O > 1) experimental data is usually used in practical

settings to arrive at the compressor characteristic. A common

form of the cubic approximation is

\ ! ' : \ ! ' o + 1 + ; O - | O t '

The peak operating point of the compressor coincides with

the peak of the compressor characteristic in the region {Q > 0}

at Q = 1. At this point the system's linearized equations are

only marginally stable s'hen using the throttle parameter ? as

the control input. However, it has been shown that the system

may be stabilized globally at this equilibrium using a nonlinear

feedback [5].There exists a continuum of stable equilibria in the region

(O > 1). The equilibria in this region can be parametrized by(D = Oo,

The linearized Moore-Greitzer model is uncontrollable at these

equilibria, but it is stabilizable. The model at the R : 0 or

fll) no-stall equilibria, prevalent in the (iD > 1) region, cannot be

stabilized by throttle feedback in the region (@ :l 1). There

does, however, exist an unstable set ofequilibria in this region,

termed the stall-equilibria, for which the model can be stabilized

asym-by throttle feedback

iol IlYl = lv"-LRI , L

Here, the linearized equationsopen-loop system is unstable.

f $l = f *"',&.,1Lnl i I o I

iDs I3 (1- o3)oo | .1-o3 Iare controllable, though the

Page 8: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

HARDT er al.: \LiMER|C.\L SOLLTTIO)i OF NONLINEAR H: AND ft - CONTROL PROBLEMS

Fi-e. 5. Jet engine compressor.

A. Problem Statement

We shall use numerical methods to solve the nonlinear 7/2

control problem thus obtaining an optimal control law of the

form 1.(Q. V,l?). We began the analysis by studying a proper

choice of performance index or cost function. There were

two main problems also encountered at the outset which are

fundamental to the system and were not able to be remedied

by guessing different performances. The first main difficulty

that we encounter is what rve call the minimum-V problem.

This involves a large dip in the pressure variable V along

the closed-loop trajectory resulting from the controller 1.An objective is to design a controller which wil l keep the

system trajectory inside a given nei-ehborhood,Nr of the chosen

equil ibrium. Another major diff iculty is the rate saturation

problem. The controller must meet specified bounds on the

throttle rate i. Also, an absolute bound on ̂ 1 is required, but

our investigations lead us to suspect that this constraint often

becomes inactive uith realistic bounds on :i. Both problems

will worsen with larger init ial values of I l.

Another practical problem is to first choose a high-perfor-

mance equil ibrium defined by Oe and a control law'y(O. Q. i?)

which has the property that all states near to the equil ibrium are

kepr close to it. We il lustrate how one may compute a set A[ of

states with the property that a closed trajectory starting at a state

in.\i stays completely inside the specified neighborhood,A/.

In our model, we chose as parameters o = 1- p : 1. O,., =

0.1-169. The error coordinates of the system with respect to the

no-sta l lequi l ibr ia (O > 1) wi l l be def ined as d - (O - Qo) '

.q, = ({/ - V.(Ao)-). r = Il. Also 1y wil l denote the equil ibrium

control input. We began the analysis by studying a proper choice

of performance index or cost function.

B. Choice of Perfonnance ltdex

A preliminary investigation did not directly impose rate sat-

uration constraints and focused on the maximum pressure equi-

l ibrium (Oo : 1). The goal was to test various quadratic and

nonquadratic performance indices in order to find an optimal

controller having good transient properties remedying the min-

imum-V and rate saturation problems. This approach turned out

to be unsuccessful.The general form of the quadratic performance index used

was

r T

r,,i,, / {rr6t + c24,2 + car'2 * ct(l - ti2l I 't (12)'' .lo

while that of one with cross-terms was represented by

{ctd2 +rz( ' ,b - "ud) '

*csr2 *c+(l -rr)"} dt . (13)

Placing higher powers on the penalties of the state variables ac-tually worsened the minimum-V problem (See [Fig. 6]0. Thus,in the remainder of the investigation we used a quadratic per-formance index with all weights set equal to one. Additionally,rate saturation constraints will always be imposed.

C. Rate Saturation Constraints

Either numerical approach (DIRCOLA4UMUS) lends itselfeasily to imposing rate saturation constraints. A dummy state isintroduced which is equal to the control input. This then gives

an additional state equation with its derivative equal to the con-trol rate which becomes the new control for the system. In thecase of the multiple shooting algorithm, a box constraint on thecontrol rate i simply translates into an f statement when the

control law is defined. To simplify the problem dehnition, we

added a quadratic term in the control rate i to the performance

index. This permits us to use the standard approach of setting

the partial derivative with respect to the control of the Hamil-

tonian function to zero and thereby arrive at an expression for

the optimal level of 1. The performance index

r,,lr, / f:1$2 * t:2rfP I c;jr '2 { ct(^t - 1J)2 +,t; lz! ,tt (14)'r .lo

is then r.rsed for the remainder of our analysis.Fig. 7 shows the results of imposing a constraint on the con-

trol rate 7 with the constraints l?l < 0.1 and l"yl I t).2. The

control "y is restricted to start at its f inal equil ibrium value since

in practice the controller cannot change values discontinuously.

Evidence of the constraint is seen in the slightly larger dip that

the pressure r! is tbrced to take. This may be considered to be

the price tor rate saturation. The initial conditions for this ex-

periment were O : 1.1, !trr : 2.0, ft = 0.2, and the desired

equil ibrium is Os = 1.2, !p"(Oo) = 2.083, I l = 0 where

the equil ibrium pressure l ies at97Vo of its maximum value of

W,.(i.0) : 2.1-169. In the remaining portion of the paper, we

will continue to use a rate saturation constraint of l7l < 0.1 to-

gether with a quadratic performance index.

D. Choice of Equilibrium

In an attempt to improve the minimum-!! problem, a tradeoff

was explored by choosing different equilibria in the no-stall re-

gion (Os > 1) along the compressor characteristic. The benefit

of converging to an equilibrium further down the characteristic

curve is that the dip in the pressure V and the control rate ialong an optimal trajectory will be greatly reduced.

Fig. 8 pertains to the ?12 optimally controlled system with

the rate saturation constraint l '71 S 1.We study the influence

of this rate saturation constraint with four different equilibria

along the compressor characteristic (solid line). The figure

displays the behavior of the closed-loop system after ini-

tializing at points near the chosen equilibria. The state of

the closed-loop system follows a trajectory (moving in a

,r,iu/t

Page 9: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 8, NO. I. J.ANUARY 2OOO

E 4

' 0 2 4 6 8 1 0 1 2 1 4 1 6 1 8 2 0

Time

Fig. 6. Control -, versus rime for the ?l: optimal trajectory and different performance measures. The solid line is with respect to a quadratic performance

index. and the dashdot and dashed lines are with respect to p!rfo..on.. indeies with cross-terms with weights r'" = 1 and 5, respectively. The cirs-les represent

the sysrem initialized at R = 0 with a quadratic performance'

2.2

'6(L

1 . 5 10 4 6 8 1 0 1 2 1 4

tim€

Fig. 7. i! r,ersus time illustrating effects of introducing- different constraints on j,, the control rate. The solid line is the unconstrained trajectory while the dashdot

ani dashed line represent a constraint of lil S 0'2 and lil ( 0'1, respectively'

2.1

1 .8

1 .7

201 81 6

counter-clockwise direction) which eventually ends in that

equilibrium; these are the loops we see plotted. More specifi-

cally, the four equilibria that we consider are (O1, Vt, Rr) =

(o; . { r . (ok) .0) , wi th @6 eQual to 1.1, 1.2 ' 1 .3, 1.4 and

with respective init ialization points at (@6,{/k'0'12) for k:

1, . . . .4- Thus, the system begins at R = 0 '12 s igni fy ing a

modest size stall level which must eventually reach its equi-

librium value of R : 0.Striking is that the closed-loop optimal trajectory converging

to the 01 : 1.1 equilibrium exhibits a very large drop in pres-

sure before convergence. In contrast, the equilibrium with Q2 =

1.2 has a pressure- which is at 97Vo of the maximum possible

!{q:

::

:

:

:

:

::

i ::: ' : ')a;;,' . , / . , ,. , / t t: / i r: l i t. t . . I' ,1! ,' I ' I. , .. I t . t, l i ' i

l i t: t , '' l : t: l ! t : ': l t t :, l i t ,

, I i I

{ , ; i i\ z 't ' : , '

t .\ i I :' " ) : " " : " ":: :: :

I

i$$I t\lr\'r\|l

\

Page 10: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

HARDT ar aL: NUMERICAL SOLUTION OF NONLINEAR ft: AND ?{ - CONTROL PROBLEMS

pressure, and the trajectory initialized by our modest displace-

ment from it has only a small pressure drop. The explanation

for this dramatic change in performance of our optimal H2 con-

troller is that the rate saturation constraint becomes active for a

signihcant t ime along the trajectory converging to the O1 : l ' l

equilibrium, and does not become active along the other trajec-

tories.Table I compares key features of the trajectories in Fig. 8.

Listed are the values of the desired mass flow Ql which define

the equil ibria, the equil ibrium pressure V"(Ot)' the percentage

of the maximat equil ibrium pressure \i/(1.0) that the desired

equilibrium ties at, the minimum value that ttrr reaches along the

trajectory, and the maximal value of 1 and l'yl.

E. Caluilation-of Value Function

For purposes of calculating a value function (2), we chose

as a desired equilibrium Qo = 1'2, where the pressure is at

97Vo of the maximum equilibrium pressure. Furthermore, we

assumed a rate saturation constraint of lil S 0.1. Then 381 tra-

jectories were calculated from various initial values (O, {r, R) to

the equil ibrium (Oo, V"(iDo),0). The init ial values were equally

spaced wi th ranges O e {1.0,1 '4} . V € {1.6.2.2\ , and R e

{0.0, 0.2}. The final time ? to which the trajectories were calcu-

lated was set at 20.0 s, which allowed ample time for the system

to converge from any state in the given region. The average time

to calculate each trajectory with MUMUS was 6'3 s on an old

SPARC IPX. Varying the grid size in MUMUS was at times

also necessary to ensure convergence from initial values in the

state space, since areas of extremely high curvature in the tra-

jectories cause numerical troubles. Since each trajectory con-

iains the open-loop value for the control law, the various tra-

jectories were needed to calculate a reasonable approximation

2.2

2.1

1 . 9

1 . 8

1 .7

1 . 6

oo-

l . 5 L0.7

Fig. g. Controlling to different equilibria along compressor characteristic at a starting initial value of R = 0.12. Trajectories move in a counter-clockwise

direction.

1.51 .41 .30.8 0.9 1 1 .1 1.2Phi

TABLE ICottlpARtsoN oF DtFFERENT EQUILTBRTA

Perc. Min Ma:< Ma:t

Q1 V"(Ar) Vc(1.0) V t l t l

l . l 2 .1311.2 2.0831.3 1.9981.4 1.875

0.993 1.520 1.652 0.100

0.970 1.975 1.536 0.072

0.931 1.946 t.627 0.018

0.873 1.841 1.753 0.014

for the closed-loop control law. A slice of the value function at

{l? = 0.12} is displayed in Fig. 9.

Trajectories calculated from some areas ofthe state-space ex-

hibit poorer transient properties than from others' In particular,

the closer the initialization is to the peak of the compressor char-

acteristic, the more difficulty is encountered in controlling R to

0. In the presence of the rate saturation constraint lil S 0'1 and

starting close to the peak, the optimal trajectory would allow a

substantial increase in .R before controlling it to zero' Such tran-

sient behavior may often be impractical or nonimplementable as

the growth in .R signifies the growth of the stall cell' This effect

becomes more Pronounced with larger initial values of ft' We

see this as a consequence of the rate saturation constraint which

prevents the controller from keeping the stall instability from

growing.

F. State Feedback Contol

After the calculation of the value function, there are a number

of methods which may be used to approximate the control law in

(5). The calculated trajectories provide us with numerical values

oi iD, !!, l?, and 1" on a grid in the state space' One may then

approximate the data 1'(O, V , E) with the feedback control law

iliO, V, R) as described in Section II-C. In t23l' it was shown

107

Page 11: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

2

& r.s

1 .8

1 .7

1 . 6

F ig .9 . Acon tou rp l o to f as l i ceo f t heva lue func t i ona t { I l - 0 . 12 } .Thex ind i ca tes theequ i l i b r i um .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY. VOL. 8. NO. I. JANUARY 2OOO

Let :r = [O {, .n]" represent the state vector. Then the con-trol law is

how a neural network may effectively be used to approximatethe closed-loop controller in this manner. We took this sanre ap-proach using a two-layer ten-node feedforward neural networkto approximate 7.(O.V,/?) with very good results. This ap-proximation was done offline using the Matlab neural-networktoolbox. The approximation was done in a large box on the scaleof that displayed in Fig. 9 and with 1? in the interval between0.0 and 0.12. Only several points were excluded near the ex-treme left uppermost corner of the box where the system wasapproaching a full stall.

The resulting weight and bias matrices for the first and secondlayer are

1 .15 1 .2 1 ,25 1.3 1.35 1.4Phi

where the values z; (conesponding to the ten hidden nodes) arecalculated as

(16)

using the logistic sigmoid function g(a) = iTh. Note that thenumerical online burden for implementing this controller is ateach time step ten function evaluations and about 80 flops.

G. Evaluation of Results

Figs. 10-12 best demonstrate the effectiveness of the con-troller constructed in the previous section. Fig. l0 gives a goodindication about the quality of minimum-V performance onemay theoretically expect with the implementation of this feed-

back control law. In it we initialize n(0) to 0.12 and choose as a

desired equil ibrium (Os, itrr"(iDs), R) : (1.2,2.083,0.0) whichlies at 97 Vo of the peak operating pressure. The white neighbor-hood in the figure, for example, contains the set of initial stateswith the property that the controller 7* drives these to the equi-librium x along a trajectory upon which \[r remains above 2.0'Likewise, the next shaded neighborhood consist of states whichare 7* controllable with pressure always remaining above 1.95.

Fig. ll displays maxlO - O6l when starting from differentpoints throughout the state space while Fig. 12 contains the max-imum values of R when starting from B(0) : 0.t2. Note that inthe majority of the state space, R decreases monotonically withimplementation of the optimal closed-loop controller resulting

( ls)

zi:s(i ' t ' ' ,* '1")

6(t ) -

?r(r) -

- 10.2954-6.6018-2.951212.313510.8508-6.0692-8.83229.4260-3.4612-0.8463-3.28L212.48887.t9700.79740.7149

-5.9826-6.29204.55725.04255.75 1 1

6.20321.4250

-2.5187-6.8863-7.65630.17128.2157-6.2t971.4363-L.2278

9.8482-9.67583.16420.5173

-2 .598712.88.1510.8908

- 11 .2955-1 .01864.4439t.5t277.06700.43960.21390.0926

-0 .01960.28301.3205

-3.0503-3.4664

V = 0.0'18316

and b(2) = -1.1896.

t c l T1 D ' - ' :

Page 12: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

HARDT ?r al.: NUMERICAL SOLUTION OF \O\LINEAR H: AND H - CONTROL PROBLENIS

space.

2'2affiipfii";&hi o) <= 0.87y- -. ' : ; .gbs(Phi - PhL0) <= 0.69

tos(pni - Ph'Lo) <= o.s3

" r labs(Phi - PhL0) <= 0.38

2

& r .s

1 . 8

abs(Phi - Phi_o) <= 0. 1 3

1 . 2Phi

pbs(Phi - Phi_0) <= 0.25

iI

1 . 7 iII

in the large region labeled by Il < 0.12. Only when init ializin-s

in the the upper-left corner of the (O.tL) plane near the peak

of the compressor characteristic (Oo = 1.0. t[t. (<1111) = '2.1'lTI

does the nice convergence of I i deteriorate. When init ializing

at (O0 = 1.0, ! ! , (Or) = 2.1-17. I l = 0.12) . -B incre i tses t t r

0.88 which symbolizes alntost a full stall of the contpressor'

Fortunately, this was the only point in the region shou'n where

the closed-loop controller was not able to prevent an excessire

1 .25 1 . 3

growth in the value of Il. A similar effect is shown in Fig. I I

where the deviations from the equilibrium value of Qe become

nruch lar-ger near the peak of the compressor characteristic'

The three figures allow one to determine neighborhoods of

initial valr.res,\i which produce closed-loop trajectories inside

a given box .,\' of the specified equilibrium x at iDo : 1'2' If a

perfbrmance criterion is to keep the system state within a pre-

ipecitied region around the equilibrium, then one can determine

1 . 6 -1 I n < 1 . 1 5 1 . 2

Phi

F i g . l l . A c o n t o u r p l o t o f t h e m l x i m t t n l a b s o l u t e c l i t . l e r e n c e v a | u c s t r [ . I , f r o m t h e e q u i l i b r i u mtrajector ies in i l ia l ized at d i t ferent ( ' I ' . ' I ' i and at 1? = {) .12 in the state spi tce.

Page 13: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

IEEE TRANSACTIONS ON CONTROL SYSTEIIIS TECHNOLOCY. VOL. 8. NO, I. JANUARY 2OOO

1.02 't.04 1.06 1 . 1Phi

1 . 1 2 1 . 1 4 1 . 1 6 1 . 1 8

l;ig. | 2. A conrour plot of rhe maximum values of Il attained along closed-loop system trajectories initialized at different ( O. { ) and at Il = 0.12 in the state

spatce.

the set of initial values 16 for which this constraint is met withinrplementation of the closed-loop controller. In other words, welruve constructed a graphical display of the operating region oftlre cornpressor with implementation of the optimal closed-loopcontroller developed in this paper.

H- Tining and Scaling to Higher Dimensiotts

The numerical methods described here can easily solve op-timal control problems in higher dimensions. The run-times nec-essary to calculate optimal trajectories grow with increased di-rrrension, yet their growth is slow enough such that it generally

does not become a limiting factor. The "curse of dimensionality"lrowever becomes apparent when one attempts to "map out" thestate space by iteratively calculating oPtimal trajectories. Forexample, assume one were to use a higher order model of thejct engine compressor-with dimension 4 and calculate optimaltrajectories over a grid of initial conditions in four of the statevariables. Even a rough grid of eight different values for eachstate variable would require the calculation of 8a : 4096 tra-jectories. At l0 s/traj (SPARC IPX), this would require morethan I I h ofcalculation plus storage considerations.

A more intelligent selection of trajectories to be calculatedwould exploit the principle of optimaliry which tells us that in-termediate points reached along a trajectory also represent ini-tial conditions for optimal trajectories. Consequently, the valuefunction is automatically calculated along the entire trajectory.For example in .Ra, one may need only calculate 83 : 512 tra-jectories. Likewise in dimension 5, a cautious estimate on run

tinres is 20 sltraj on a SPARC IPX for 84 trajectories which

yields about 22 h of calculation. Thus, one can avoid much ex-

cess calculation by choosing the trajectories to more efficiently

nrap out the state space.

VII. Coxcr-ustoN

The purpose of this paper was to show that computationalmethods for solving nonlinear ?lz control problems can be ef-fective and practical in solving significant problems which donot lend themselves to standard linear techniques. We outline a

method by which this may be done using available numerical

software, then we apply this to the difhcult problem of jet en-gine compressor control. One who has developed a valid distur-

bance model might use these methods for nonlinear 1l- control.The dominating effect of rate saturation in the control ofjet en-gine compressors is recognized, and a closed-loop controller is

constructed which satisfies these constraints. Graphical contourplots highlight the system performance and transient behavior

one can predict given our model.

ACKNOWLEDGMENT

A variation of the computational scheme proposed has al-

ready been used on aerospace problems t71, tl4l in the context

of differential games. The authors are grateful to O. von Stryk

for describing his work on robots, as well as providing his pro-

gram. The authors also wish to express appreciation to P. Hilt-

mann and P. Gill for the use of their programs. Many thanks to

M. Breitner, M. Krsti6, D. Mayne, R. M. Munay, and to P. V.

Kokotovi6 and his students for their comments and suggestions.

REFERENCES

tll O. O. Badmus, S. Chowdbury, and C' N. Nett, "Nonlinear control of

surge in axial compression systems," Atttornatica, vol.32, no. I' pp'

59-70, 1996.[2] J. Baillieul, S. Dahlgren, and B. Lehman, "Nonlinear control designs for

systems with bifurcations with applications to stabilization and control

of compressors." in Proc. 34th IEEE Conf. Decision Contr' 1995' pp'

3062-3067.

Page 14: Numerical Solution of Nonlinear 712 and Jl@ Control ...helton/BILLSPAPERSscanned/HHK00.pdf · problem of jet engine compressor control. The technique is demonstrated first with the

HARDT er a1.: NUMERICAL SOLUTION OF NONLINEAR Hr AND ?l- COI{TROL PROBLEMS

T. Bagar and G. Olsder, Dvratnic Nottcooperative Game Theory,2nded. New York: Academic. 1995.R. W. Beard and T. W. Mcl-ain, "Successive Galerkin approximational gorithms for nonlinear optimal and robust control," I nt. J. C ont r., vol.71 . no . 5 , pp .717 -43 . 1998 .R. Behnken. R. D'Andrea, and R. M. lvlunay, "Control of rotating stall ina low-speed axial flow compressor using pulsed air injection: Modeling.simulations. and experimental validation." in Proc. 34th IEEE Conf. De-cision Contr. 1995, pp. 305G3061.D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dr-nanic Prograrn-ming. Boston, MA: Athena. 1996.M. H. Breitner. "Real-time applicable feedback controller for differen-tial games," presented at the 6th Int. Symp. Dynamic Games Applicat.,St-Jovite, Quebec, Canada, 1994.

l8l P. NI. Dower, "Power Gain Analysis and Control of Nonlinear Systems,"Ph.D. dissertation, Univ. Newcastle, Australia. Comput. Eng.

tgl K. t!'1. Eveker. D. L. Gysling. C. N. Nen, and O. P. Sharma, "Integrated

control of rotating stall and surge in aeroengines," presented at the I 995SPIE Conf. Sensors. Actuators. Contr. Aeropropulsion.M. Falcone and R. Ferreti, "Discrete time high-order schemes for vis-cosity solutions of Hamilton-Jacobi-Bellman equations," N ume r. M ath.,vol. 67, pp. 315-344,1994.P. E. Gill, W. Munay. M. A. Saunders. and M. H. Wright, Usels Guide

for NPSOL (Version 5.0).M. Hardt, J. \l '. Helton, and K. Kreutz-Delgado. "Numerical solution ofnonf inear 7t: and'lt* control problems with application to jet enginecontrof ," Proc. 3 6 t h I E E E Co nf. De c i s io rt Cutt r, v ol. 3, pp. 23 17 -2322.

1997.P. Hiltmann, K. Chudej, and M. Breitnec "Eine moditiziene Mehziel-methode zur Ltisung von Mehrpunkt-Randrvenproblemen: Benutzeran-leitung," Math Dept.. Techn. Univ. lrlunich. Cermany. Rep. 14, 1993.

t l4 l H. Kreim, B. Kugelmann. H. J. Pesch. and lv l . H. Brei tner, "Minimiz in-s

the maximum heating of a reentering space shuttle: An optimal con-trol problem with multiple control construints," Op timal Contr. Applicut.tulattuxls, vol. 17. no. I, pp.45-69. 1996.

ll5l M. Krsti6, J. Nl. Protz, J. D. Paduano. and P. V. Kokotovi6, "Backstep-

pin,r designs tbr jet engine stall and sur-ee control." in Pntc. 34th Conf.Decision Cutr.. 1995, pp. 3Ol9-3055.

l16l M. Krsti6, D. Fontaine. P. V. Kokotovi6. and J. D. Paduano, "Useful

nonlinearities and global stabilization of biturcations in a model ofjeten-tine surge and stall." IEEE Trmts. Auto,,tot. Contr., vol.43, no. 12,pp. 1 739- 1 7.15. 1 998.H. J. Kushner and P. C. Dupuis. Nunterit'ul lvlethods for Stochastic Con-trol Problems in Continuous Tinte. New York: Springer-Verlag, 1992.T. Lauvdal. R. M. Murray. and T. I. Fossen. "Stabilization of integratorchains in the presence of magnitude and rate saturations: A gainscheduling approach." in Proc. 36th ConJ'. Decision Contr., 1997. pp.400.1-4005.D. C. Liaw and E. H. Abed. Active control of compressor stall incep-tion: A bifurcation-theoretic approach, Bordeaux, France, 1992, vol. 32.Automat ica, no. I , pp. 109- l 15, 1996.C. A. Mansoux, J. D. Setiawan, D. L. Cysling. and J. D. Paduano, "Dis-

tributed nonlinear modeling and stability anllysis of axial compressorstall and stnge," in 1994 Amer Contr. Conf., vol. 2, July 1994, pp.2305-23t6.

t2ll W. M. McEneaney and K. D. lvlease, "Nonlinear'11* control ofaerospace plane ascent." in 34th IEEE Conf. Decision Contr., Dec.1995, pp. 3994-3995.

I22) l.D. Paduano, L. Valavani, A. H. Epstein, E. Ivt. Greitzer, and G. R.

Guenette. "Modeling for control of rotating stall," Automatica, vol. 30,no. 9, pp. 1357-73,1994.

t23l H. J. Pesch, l. Gabler, S. Miesbach. and M. H. Breitner, "Synthesis ofoptimal strategies for differential games by neural networks," Ann. lnt.Soc. Dynamic Games, vol.3. pp. 78-106, 1996.

[24] P. Saint-Pierre, "Numerical approximation of the value function for con-trol problems with state constraints: A viability approach," in Proc. /E{CSvnp. Nonlinear Control S.vst. Desrgn, vol. 2' 1995. pp. 529-534.

l25l O. von Stryk, "Numerische Lcisung optimaler steuerungs-prob-leme: Diskretisierung, Parameteroptimierung und Berechnung deradjungierten Variablen," Fortschrit Berichte VDI. Reihe 8 Nr. 441VDI-Verlag, Dtisseldorf, Germany. 1995.

126l -, Use/s Guide for DIRCOL A direct collocation method for thenumerical solution of optimal control problems,1995.

t27l A. J. van der Schaft, "Nonlinear state space ?l* control theory" Essayson Control Perspectives in the Theory and Applications,1993.

[28] A. R. Teel, O. E. Kaiser, and R. M. Munay, "Uniting local and globalcontrollers for the caltech ducted fan," in Proc. 36th Conf. DecisionContr, 1997, pp. 1539-1543.

[29] Y. Wang and R. lvl. Murray, "Effects of Noise, Magnitude Saturation,and Rate Limits on Rotating Stall Control," in Proc. 36th Conf. DecisionCont n, 1997, pp. 46824687 .

Michael Hardt (M'99) received the B.S. degreein mathematics and the M.A. degree in appliedmathematics and statistics from the University ofCalifornia, Los Angeles, in l99l and 1993, respec-tively. Until 1999, he has been a doctoral student atthe University of California, San Diego, studyingrobotics and control theory in the Department ofElectrical and Computer Engineering.

He has investigated recursive symbolic robot dy-namics algorithms and their uses in computer visionand the control of articulated mechanical sytems. His

interests extend to the use of various numerical methods for the optimal andnear-optimal control of nonlinear systems, including the problem ofbipedal locomotion.

J. William Helton (M'73) was born in Jacksonville, TX, in 19,14. He receivedthe bachelor's degree in mathematics from the University of Texas, Austin, in1964, and the master's and Ph.D degrees in mathematics from Stanford Univer-sity. Stanford. CA, in 1966 and 1968, respectively.

From | 968 to | 973 he was at the State University of New York, Stony Brook,as an Assistant and then Associate Professor. In 1974 he visited the Univenity ofCalifbmia, Los Angeles, for six months and subsequently moved to the Univer-sity of Califomia, San Diego, as an Associate Professor, where he is currentlyProfessor of Mathematics.

Dr. Helton was a recipient of a Guggenheim fellowship and an Outstandingpaper award from the TAC in 1986. He has delivered plenary addresses at theannual meeting of the AMS, at the EECTD, at several of the International Sym-posia on the Mathematical Theory of Networks and Systems, as well as at othermeetings. His earlier articles concemed circuit theory, distributed systems, and

aspects of the theory of operators on Hilbert space which come from circuits,systems, dift'erential and integral equations, and noncommutative differentialgeometry. His theoretical studies of amplifier design then lead to an interest in

I/- control. Recent work focuses on extending If- control to nonlinear sys-

tems, computer algebra aimed at linear systems research, and frequency domain

optimization theory and algorithms.

Kenneth Kreutz-Delgado (S'79-M'84-SM'93) received the B.A. and M.S. de-

grees in physics and the Ph.D. degree in engineering systems science, all from

the University of Califomia, San Diego (UCSD).

He is a member of the AAAS and IEEE Robotics, SMC' Control and Signal

Processing Societies, and is currently on the faculty of the UCSD Electrical

and Computer Engineering Department as an Associate Professor of Robotics

and Machine Intelligence. Before joining the faculty at UCSD' he was a Re-

searcher at the NASA Jet Propulsion Laboratory, California Institute of Tech-

nology, where he worked on the development of intelligent telerobotic systems

for use in space exploration and satellite sewicing and repair. His interest in

sensor-based intelligent systems that can function in unstructured and nonsta-

tionary environments is the basis for his current research activities in real-time

adaptive sensory-motor control, real-time recognition/classification from mul-

tipli time-series data sets, multibody systems theory (dynamics and kinematics),

and in the development of"emergenf intelligent behavior from disributed com-

plex systems.

t5l

t31

t41

t6l

t7)

I IO l

t l l l

tt2l

tt 3l

t t T l

I t 8 l

t te l

I20l