minimal vector padé approximation

Journal of Computational and Applied Mathematics 32 (1990) 27-37 North-Holland

27

Minimal vector Pad& approximation

Adhemar BULTHEEL and Marc VAN BAREL Department of Computer Science, Katholieke Universiteit Leuven, Celestijnenlaan ZOOA, B-3030 Heverlee, Belgium

Received 24 September 1989 Revised 6 November 1989

Abstract: We describe the minimal vector Padt approximation problem, which consists in finding PadC approximants with a common denominator for a number of series. These approximants are minimal in the sense that for a given order of approximation and a given discrepancy in numerator and denominator degrees, the degree of the rational approximant is minimal. Properties and solution methods are derived from an associated minimal partial realization problem.

Keywords: Minimal Pad& vector Pad&, simultaneous approximation, minimal partial realization.

1. Motivation

Suppose you have a scalar normal Pad6 table (without blocks) and you want for a given order of approximation 7 the “simplest” Pad6 approximant (PA) from the table. The answer depends of course on the criterion you choose to measure simplicity. Suppose you mean by it the one with the least degree (maximum of numerator and denominator degree). Then you get a PA which is on or near the main diagonal of the table. It is of course also possible to get other entries of the Pad6 table by allowing a discrepancy in numerator and denominator degree. We can, e.g., minimize the maximum of the numerator degree minus some shift s and the denominator degree. We shall call this the s-degree. For s = 0 we get solutions near the main diagonal. For s positive (negative) you select some diagonal below (above) the main diagonal of the table. We now get a problem that is parametrized in r and s, rather than in (Y and j3 (the numerator and denominator degrees). The (r, s)-net places a grid over the Pad6 table along the diagonals while the ((Y, /3)-net was a row-column grid. The problem is somewhat generalized if we do not require the approximants to be PAS, but only rational approximants of order at least r, for which the s-degree is minimized. Not every solution of this problem is a PA, but among all possible solutions, there is always a Pad6 form which can be uniquely selected by imposing extra conditions on the order of approximation, on the denominator degree or on the degree of the numerator. We call the problem as described above (parametrized in r and s) a minimal Pad&

approximation problem (mPA problem). In the scalar case, this does not add much new to the theory of Pad6 approximation. With some extra conditions, all the solutions of the mPA problem are also Pad6 approximants and conversely. In the (r, s)-table of (the uniquely defined) minimal Pad6 approximants, one can find essentially the same block structure as in the ordinary Pad6

0377-0427/90/$03.50 0 1990 - Elsevier Science Publishers B.V. (North-Holland)

28 A. B&heel, M. Van Bare1 / Minimal vector Pad6 approximation

table (see [4]). Other properties of PAS are transportable to mPAs as well. Our interest in the mPA problem stems from its usefulness to study generalizations of the (m)PA problem to the matrix case. Rational approximations to matrix sequences is a topic which has been studied extensively as the minimal partial realization (MPR) problem in linear system theory. This problem can be translated quite naturally into what we called the mPA problem, so that a number of results that are well known in system theory can be used to get corresponding results in the theory of (minimal) matrix PadC approximation (mMPA). Some results in this sense were obtained in [3] and in [14] a matrix two-point Pad& problem was considered. In this paper we want to give a slightly different approach to the problem of minimal vector PadC approximation (mVPA).

In this paper we address the general problem of specifying uniquely a minimal vector PadC approximant. The algebraic theory of simultaneous PadC approximation (which is virtually identical to one type of vector PadC approximation) has been investigated by Mahler [12], Coates [5], Jager [lo], de Bruin [6] and Nikisin [13]. In particular, Mahler noted the connection between the problems of simultaneous PadC approximation and Hermite-Pad& approximation. The connection between Kalman’s partial realization problem for SIMO (single-input-multi-output) systems and vector PadC approximation was noted by Graves-Morris [8]; some algorithms applicable to nondegenerate cases are given by Graves-Morris and Wilkins [9]. A more robust approach was adopted by Bultheel and Van Bare1 [3] in a more general matrix setting. These results were based upon earlier results of Dickinson et al. [7] and Anderson et al. [l] who described implicitly an algorithm for the general minimal partial realization problem for MIMO (multi-input-multi-output) systems. These ideas were clarified and an explicit algorithm, which generalizes the Euclidean algorithm to the case of matrix polynomials, was given in [15]. Similar results are also given by Antoulas in [2]. In [15] it is also shown how a unique (canonical) solution may be obtained by the algorithm.

A quite different approach to vector PadC approximation was given by Wynn [17] and developed by Graves-Morris and Jenkins [8]. The approximants they consider are almost unrelated to the approximants we shall consider here. The definition given by van Iseghem [16], however, is much more related. It turns out that her approximants are a special case of the ones defined here. The generalization being that we consider a different shift for each of the components separately, which means that we consider vectors with polynomial entries whereas considering polynomials with vector coefficients results naturally in an (almost) uniform shift for each of the components. To put it in yet another way: if, like in [16], we have a uniform degree for the numerator entries, then to match the degrees of freedom with the number of interpolation conditions, we can distribute these conditions equally over all components with possibly some extra conditions on the initial components. In our approach however, we shall be free to distribute the degrees of freedom (numerator degrees) and the interpolation conditions indepen- dently over whatever components we like.

2. Definitions and notations

Let F(z) = CrFkzk E K”[[z]] be a given formal power series. It has coefficients Fk E K”. These are 1 x m row vectors with entries in a field K. The ith component of Fk is denoted as

Fk.ie

A. B&heel, M. Van Bare1 / Minimal vector Pad6 approximation 29

We shall need a special notation to denote vectors of polynomials with a different degree for each of their components. Therefore we introduce the following constructs. Let k = [k,, . . . , k,] E N”. By zk we mean diag(zkl,..., zkm). Fk means the m-vector whose ith component is F&.

For any t E N”, Ci=,,Fkzk denotes the polynomial vector with ith component Ei,=,,Fk,,izkl. In general, a summation for the m-vector k ranging from some lower bound I to an upper bound u like in c;,,u, with ok some m-vectors with components u~,~, i = 1,. . . , m, means a componentwise summation so that the ith component of the result is C~;=,U~,,~. Inequalities between m-vectors like k < t also mean componentwise inequalities: ki < ti for all i = 1,. . . , m.

Addition of a scalar and a vector means the addition of the scalar to each component of the vector. Also < between a vector and a scalar means < for each component.

A polynomial P(z) E Kr[z], is said to have degree s = [sl,. . . , sm] E irn if its ith entry P,(z) has degree si. We have set N = I!4 U { - oo} and define deg 0 = - 00. deg P(z) = s means that s is the minimal index vector for which P(z) = C”,,,P,z”.

Next we introduce the idea of the s-degree of a rational form which will play an important role in this paper. For s E E” we say that a rational form Q(z)-‘P(z) with P(z) E K”[z] and

Q<z> E K[zl h as s-degree v = max{ deg Q(z), deg Pi(z) - si, i = 1,. . . , m } . We say that such a rational form is an approximant of order (at least) r E N” for F(z) if

F(z) - Q(z)-‘I’( z) = E+l~r+l + E+2~r+2 + . . . . (2.1)

The minimal vector Pad6 approximation (mVPA) problem can now be formulated as follows: given F(z) E K”[[z]], r E N” and s E Z”, find some P(z) E K”[ z] and Q(z) E K[ z] such that Q( z)-lp( z) has minimal s-degree among the approximants for F(z) of order at least r.

Note that Q(0) has to be nonzero because Q(0) = 0 implies P(0) = 0 and then we do not have a minimal approximant since it can be reduced. In general P(z) and Q(Z) should be coprime (i.e., the m components Pi(z) of P(z) and Q(z) should have no common factors) for the same reason. If Q(z) # 0, then Q( z)-‘P( z) is an approximant of order at least r if and only if

Q(z) F( z) - P(z) = R,+l~r+l + R,+Z~‘+2 + . . . .

It is clear that (P(z), Q(z)) = (CL,OFkzk, 1) always satisfies the order condition and therefore the set of order r approximants is not empty. For given s it is then possible to find in this set an approximant of minimal s-degree. Thus the mVPA problem always has a solution.

For the sake of completeness, we recall here what we call a vector Pad6 approximation (VPA) problem. The problem is that of finding some P(z) E K”[z] and Q(z) E K[z] such that for given aEf+Jm and p E N and F(z) E K”[[z]], the rational form Q( z)-lp( z) which approxi- matesF(z)uptoorderrwith ]r+l(=C(ri+l)>C(ai+l)+~=]~+l]+~whiledegP(z) G a and deg Q(z) < j3.

The readers who are not familiar with the MPR problem, should consult, e.g., [11,1,2,7,9,15]. To give a feeling for the work to be done in translating a MPR result into a mVPA result, we give a brief account of this. The MPR problem is defined as follows. Let M(z) = CFMkz-l E K”[[z-‘I] be a given formal power series in z-l. Let also r E N” be given. The problem is to find C(z) E K”[ z] and A(z) E K[z] such that

M(z) -A(z)_‘C(z) =Mr*+1Z-‘-l+M,*+2Z-r-2+ . . . .

Since deg Ci( z) is less than deg A(z), minimizing the (- l)-degree of A( z)-‘C(z) is the same as minimizing deg A(z). Note that again, because of minimality, A(z) and C(z) are coprime. In

30 A. B&heel, M. Van BareI/ Minimal vector Pad6 approximation

particular A(0) # 0 or C(z) # 0. Now, we replace z by z-i to transform this into a mVPA problem. As we can easily check, the problem turns into a mVPA problem for Ir( z) = z-iM( z-i) with a solution Q(Z) = z”A(z-‘) and P(z) = zy-’ C( z-l), where Y is the degree of A(z). Note however that Y need not be equal to deg Q(z), but it is equal to the (- l)-degree of the rational form Q( z)-‘P( z). As we mentioned in the introduction, the shift s is introduced to get other “nondiagonal” mVPAs. More details can be found in the proof of Theorem 3.1. This construction is slightly different from the one used in [3] as will be explained in Section 3.

3. Relation with minimal partial realization

Suppose that Q(Z)-‘P( ) z is an mVPA of F(z) for some given s and r and let its minimal s-degree be given by d. Then, for the i th component, we may write ( Fk = 0 for k < 0)

(3.1)

where d+s,

P,(z) = c pi,izj, j=O

Q(z) = ii Q/Sk, k=O

e(z) = E l$izj. j=O

If d + si > ri, (3.1) represents only a possible choice for the numerator P,(z). It does not contain conditions for the denominator. For simplicity, we restrict ourselves for the time being to the case where r = s + n with n E N. This means that the numerator P(z) can be computed from Q(z) and F(z), while th e d enominator has to satisfy the homogeneous block Hankel system

r Ml M2 * ** Mn-d]

=[o() . . . 01, Q,#O, d minimal,

(3 2)

where

Mk = ‘s+k-

Thus We have Mk = [Mk,i];=l E Km and Mk,i = F$,+k,i.

(3.3)

At this point, the mVPA problem is reduced to something which is formally the same as an n th-order minimal partial realization (MPR) problem for the vector sequence Mk, k = 1, 2,. . . .

This problem has been intensively studied in the system literature for the more general p X m

matrix case; references [1,2,7] are representative. Recently, a canonical solution for this problem was proposed which had a number of interesting properties [15]. In the analysis of the MPR problem, the Kronecker and dual Kronecker indices play an important role. For the vector case, there is only one dual Kronecker index v(n) associated with the sequence Ml,. . . , M,. It is the smallest d for which the system (3.2) has a solution with Q, # 0. By definition of the mVPA problem, it is the minimal s-degree of the mVPA of order s + n for the series F(z) whose

A. B&heel, M. Van Bare1 / Minimal vector PadP approximation 31

coefficients are related with the Mk sequence by (3.3). The Kronecker indices I = [ K~( n) . . -

am] E N” for the same sequence are defined as follows: I, is the smallest d such that

has a solution with VoT = [V,,, . . . l&-i 1 0 . . * 0] E K”. Since it will be clear from the context whether we mean Kronecker or dual Kronecker indices, we shall call v(n) as well as I Kronecker indices for short. It is a well-known property of Kronecker indices that 1 K(n) 1 = C,K,(P’Z) = V(n) (see, e.g., [2]).

Now we are ready to formulate the following theorem.

Theorem 3.1. Let F(z) = CTFkzk E K”[[z]]. For a given integer n 2 0, - n 6 s E Z” and r = s + n >, 0 (componentwise inequalities), let v = v(n) and K = I be the Kronecker indices of the

sequence M,, . . . , M, with kfk=FS+k (4. = 0 for i < 0). Then there exists a unique mVPA

Q(z)-‘P( z) with Q(0) = 1 f o or d er r and minimal s-degree v which satisfies

F(z) - Q(z)-+(z) =RK+y+l~K+S+Y+l + . * * .

Proof. The present proof supplements that of a corresponding theorem in [15]. It clearly shows the mechanism which we mentioned at the end of the previous section relating MPR problems and mVPA problems for a general shift s.

The results of [15] are for a right MPR problem, but the results can be easily translated for a left MPR problem as we have here. Computing a left MPR of order n for the sequence M,, Mz,. . . is essentially the same as computing a right MPR for the sequence MIT, MzT,. . . . Applied to the present situation, the result then reads: define for Mk = Fs+k the formal series

M(z)=MIz-I+ +.a +M,z-“+M,+~z-“-~+ .e..

Then there exists a unique (canonical) MPR A( z)-lC( z) of the sequence Ml,. . . , M, which satisfies

M(z) - A( z)-‘C( z) = tiK+v+l~-(K+Y+l) + - - - ,

with A(z) manic of degree v and C(z) E K”[ z]. Multiply with zs from the right, after z is replaced by z-l. Then you get

M(l/z)z” - A(l/z)-lC(l/z)z” = (i@K+,+l~K+Y+l + . -. )zs,

which implies

F(z) - Q(z)-?(z) =tiK+y+lzK+s+y+l + . . . ,

32

with

A. B&heel, M. Van BareI / Minimal vector Pad6 approximation

F(z) = 2 FkZk + M(l/z)z”, k=O

Q(z) =A(l/+, Q(0) = 1,

P(Z) = Q(z) i &zk i i

+ c(l/z)zS+“. k=O

The ith component Pj( z) of P(z) is a polynomial of degree at most si + v. This is surely true if si > 0. If si c 0, then, because we agreed that Fk,; = 0 for k < 0, we also get Mk,i = 0 for k < -si. Hence, since Ci( z) is given as the polynomial part of A(z) M,( z), it follows that Ci( z) is of degree at most v + si. Thus Pi(z) will be given by Ci(l/z)zs~+” which is a polynomial of degree at most si + v.

To show that the mVPA is also a VPA, we note that the number of degrees of freedom in the pair (P(z), Q(Z)), taking Q(0) = 1 into account, is equal to

; (Si+v+l)+v= ]s]+m(v+l)+v i=l

and the number of coefficients fitted is

&v+Ki+Si+l)= ]s]+m(v+l)+ ]K] = ]s]+m(v+l)+v. i=l

Witha=s+v,P=vandtheorderraK+++v,wehaveshownthat ]r+l]>]a+l]+/?,so that the canonical mVPA solution is also a VPA. 0

It is also possible to construct a matrix continued fraction whose convergents are the successive mVPAs for successive values of n.

Corollary. Under the conditions of the previous theorem, apply the algorithm CMPR described in

[15] to the data MT,. . . , MT; then it will construct unimodular matrices

‘(“) = Y,(z) c,(z) x,(z) A),(z) I E K(m+w(m+l)

such that the matrix continued fraction . . .

y-r GT + CL? . . .

-,x2’+& YIT + c,’

. . . 7 GT + cz’ . . .

-X+2+4 X;+Al

\

k k=O

Fkzk

has as successive convergents the uniquely defined m VPAs of the previous theorem for n = 0, 1, 2, . . . .

Proof. Using the correspondence used in the previous theorem, this result is easily obtained from [15]. 0

A. B&heel, M. Van Bare1 / Minimal vector Pad& approximation 33

4. Another approach

In the previous section, we have reduced the problem of mVPA for given s and r = s + n, to a problem of left MPR computation for the vector sequence Fs+ 1,. . . , Fs+,,, which is the same as computing a right MPR for the transposed sequence. The algorithm of [15] then gives a recursive method to compute the solutions for n = 0, 1,. . . . This means that during these computations s is kept constant while r increases in each step by 1 in all its components. In the scalar case, this corresponds to a recursion along a downward sloping diagonal in the Pad6 table. The algorithm CMPR of [15] performs on (formal) series some operations which are basically comparable to those performed in the Euclidean algorithm [3]. In the scalar case, it is also possible to use formally the same algorithm to move along an antidiagonal in the Pad6 table. This path of an antidiagonal corresponds for the vector case, just as in the scalar case, to choosing r constant and letting s = r - n vary for n = 0, 1,. . . . The systems (3.1) can then be reduced to

[e, *.. Q,] H(n, d) = 0, Q, # 0, d minimal,

where H(n, d) is a block Hankel of the same form as in (3.2), but now the Mk are defined by

Mk = I;l-k+l with the same short-hand notation as in (3.3). As it is formulated now, we do not have exactly an MPR problem. It were, if only we required Qd # 0 instead of Q, # 0. This means that we have to give up the reducedness condition of the approximant. We can only be sure that the linearized interpolation conditions (2.1) will be satisfied. This corresponds to the fact that in the scalar case when moving on an antidiagonal in the Pad6 table, it is possible to hit a singular block below its main antidiagonal. Here the reduced Pad6 approximants do not have the appropriate order of contact and it is only possible to satisfy the linearized conditions. This was the approach followed in [3] in the more general case of (right) matrix Pad6 approximation. We shall not repeat it here for the vector case.

5. The nice problem

Instead of choosing r = s + n with n integer, we shall now come to the more general situation wherer=s+t, t=[t, ,..., t,]~tY~, where the tj can now be different from each other. Also in this case, the reduction to the MPR problem can be done as before. Define again Mk = Fs+k as in (3.3) and let n be n = t,, = max{ t;, i = 1,. . . , m}. The system (3.2) based on the sequence M i, . . . , AI,, however, contains too many restrictions. Indeed the solution of that system gives orders of approximation s + t,, = s + n > s + t = r. Some of the entries in H( n, d) are specified and used in the determination of the solution whereas others are not specified. We refer to the latter as “don’t-care-entries”, which we shall indicate by “?‘. These unspecified elements are given in the matrix MT = [MT . . - MT] which looks like

4, **+ Mr,,i ? ? ? ?

Ml2 ..* ... . . . Mt,,2 ? ?

MT= &fli . . . . . . . . . . . . M fmx 1 l

34 A. B&heel, M. Van Bare1 / Minimal vector Padb approximation

where we supposed n = t,, = ti. Such a problem is the vector case of what is known in the theory of MPR as a nice problem. It is called a nice problem because some of the elements of MT are left unspecified, and this is always done so that Mk,i = ? implies Mk+l,i = ? for i=l ,***, m. It is possible to construct a recursive algorithm similar to the algorithm CMPR of [15] to solve such a nice problem, but due to space limitations, we shall not do it here.

For nice problems, the role of the Kronecker indices is naturally played by indices which are known as nice indices. In fact there are both nice and dual nice indices, but as in the case of Kronecker indices, we shall drop the adjective dual most of the time. Since we have here a row vector, there is only one (dual) nice index and this then coincides with the (dual) Kronecker index. These should now be defined for matrices that are not completely specified (i.e., one that contains ?-elements. See, e.g., [ll]). Consider the block Hankel matrix H(n) = [Mi+j_l, i, j = 1 ,*.., n] where M, is the last of the Mk that is not completely unspecified. The entries Mj for j > n contain only ?-elements. Then y(n) + 1 is the number of the first row in H(n) that is linearly dependent on the previous ones where the ?-elements are supposed to take the values that keep the rank of H(n) as low as possible. Note, however, that there may exist many choices for these elements that will give the minimal rank. Because of the Hankel structure and the nice specification scheme, the linear dependency of row y(n) implies the linear dependency of all the following rows. A similar convention is used to define the nice indices (and also Kronecker indices) corresponding to the columns. We call the integers I = [q(n), . . . , K~( n)] nice indices for the sequence M,, . . . , M,,ifinthematrixH(n)allthecolumnskm+i, k=O,l,...,~~(n)- 1, i=l,..., m, are linearly independent and the others are linearly dependent on these. Again here this dependency is defined for a choice of the ?-elements that keeps the rank of H(n) as low as possible. One clearly has rank H(n) = v(n) = 1 I ) = CK~( n). The choice of the nice indices rc(n) is not unique. One can take any set of v(n) independent columns as long as they satisfy the nice property, i.e., column km + i dependent implies (k + 1)m + i dependent. Even for a fixed choice of the ?-elements there may be different sets of nice indices K(H). On the other hand, the Kronecker indices are uniquely defined for a fixed choice of the ?-elements since they choose the first y(n) independent columns that can be found. We could qualify them as minimal nice indices. A different minimal extension may or may not give different Kronecker indices. It is clear that if we extend the sequence Mk for k to infinity with completely unspecified elements, then the definition of the nice indices will not change when they are obtained from matrices H( n + i), i > 0. For this reason we could drop the argument n from the notation and we shall speak of the nice indices v and K of the (infinite) sequence Mk or of the Hankel matrix H= H(cc).

As with the Kronecker indices, it is possible to prove the following theorem.

Theorem 5.1. Let the sequence Mk = Fs+k for s E h m be nicely specified up to M, and n = max{ ti } . Let v and K be a choice for the nice indices of this sequence. Then there exists a unique mVPA

Q(z)-‘0 > f d z o or er r = s + t which has minimal s-degree v and which satisfies

F(z) - Q(z)-+(z) = ~+y+l~K+S+Y+l + . . . ,

where F(z) = CrFkzk. The couple (P(z), Q(z)) can then be made unique by choosing Q(0) = 1.

Proof. This result can be derived from the corresponding result in MPR literature. The proof is essentially the same as for the Kronecker indices. See, e.g., [2]. 0

A. B&heel, M. Van Bare1 / Minimal vector Pad6 approximation 35

Notes. (1) For the Kronecker indices, it is guaranteed that the required order of approximation is met, but one may not conclude that for all i one has K~ + Y >, tj. This means that sometimes, to fix the unique solution, more of the coefficients Fk are used and sometimes less than actually needed to meet the order condition.

(2) The previous theorem says that for given s and t, we can find a unique mVPA for each choice of the nice indices (possibly depending on the choice of the extension made). It does not say anything about these mVPAs being the same or not.

(3) It is possible to parametrize all possible mVPAs for given s and t with a minimum number of free parameters. We shall not do this here.

We shall now describe how the nice indices change if one of the unspecified elements becomes specified. The jumps in Y correspond to the jumps in the denominator degree of the scalar PAS when one jumps over a (possibly trivial 1 X 1) block while moving on a downward sloping diagonal in the Pad6 table.

To obtain this, we follow the approach of Kalman [ll]. For the scalar case, Kalman uses the following simple but powerful lemma.

Lemma 5.2. Consider the matrix D(?) = [g f] with one unspecified element ?. (1) If rank A = rank[ A B] = rank[AT CT], then there exists exactly one value v of ? for which rank A = rank D(V). For any other value of ?, rank D(?) = rank A + 1.

(2) If rank[A B] > rank A or rank[AT CT] > rank A, then rank D(?) is independent of the value of ?.

This lemma, together with properties of Hankel matrices led Kalman to the conclusion in the scalar case (m = 1 and hence v = K) that by specifying one more element in the sequence M 1,. . . , M,,, we can come into different possible situations. If 2v > 12, then situation (2) of Lemma 5.2 applies to the Hankel matrix H = [Mi+i_l], with entries M,, . . . , M,,,?, . . . . Therefore the minimal degree v shall not change. This is plausible since to fix the unique solution, we used

M,,..., M2,, (see Theorem 5.1 with s = - 1) which is more than required for an approximant of order n. If 21, G n, then M,,, shall appear in H in at least one position where situation (1) of Lemma 5.2 applies. The minimal degree will not increase if M,,, is the specific value alluded to in (1) of the lemma. In that case, the sequence is extended without increasing the rank of the Hankel matrix H for this sequence. We shall call this value the minimal extension value. This means that for a minimal extension value of M, + I we have v(n + 1) = v(n). For any other value

of Ml?+, one gets v(n + 1) = n + 1 - v(n). Thus, in general, it holds that v(n + 1) = max{ v( n), n + 1 - v(n)}. See, e.g., [ll]. In terms of Pad6 approximants the choice of the minimal extension value corresponds to the fact that a Pad6 approximant based on the data M,, . . . , M, has an expansion which defines an extension M,, . . . , M,,, Gnfl, an+,, . . . of this sequence. This extension is chosen such that it keeps the minimal s-degree of the approximant. It is a minimal extension. If it happens that M, + 1 = fin+ 1, then the n th PA will be equal to the (n + 1)th if M n+l is given. This stays like that until some Mk turns out to be different from the minimal extension value. In that case the denominator degree increases as indicated.

The following theorem generalizes this property to the vector case.

36 A. B&heel, M. Van Bare1 / Minimal vector Pad6 approximation

Theorem 5.3. Consider the block Hankel matrix H = [Mi+j_l] with Mk E I?“‘, I?= KU {?}, a nicely specified sequence up to Mt. Let (v, K) be a choice for the nice indices of H. Mark with a x , all the rows and columns that are chosen independent according to this choice of the nice indices. That are the rows l,.. ., v and the columns km + i, k = 0,. . . , ~~ - 1, i = 1, . . . , m. The remaining

rows and columns are all marked with a l . Let u = ti + 1; then MU,i is the first unspecified element in column i. This element appears at several positions in the matrix H, viz. the positions (u-j, jm+i), j=O ,..., u - 1. If we specify this element, then we have one of the following cases :

(1) None of the positions where this element appears in H is of type (0, 0). Then the nice indices can be left unchanged whatever the specification of MU,i is.

(2) This element appears on at least one position which is of type (., .). Then either it takes a uniquely defined minimal extension value, in which case the nice indices can be left unchanged, or it is different from this value, in which case v and ~~ increase with an amount u - v - K~. This increase is precisely equal to the number of times the element MU,i appears on a (0, l )-position.

Proof. The elements MU,i appears in precisely u positions of the matrix H. Of these, only u - K~

appear on ~-columns, by definition of nice index K~. These are in the first u - K, rows. Since the first v rows are X-rows, there are at most max{ 0, u - v - K~} elements on (0, .)-positions. If the element is not on a ( ., .)-position, then, according to the previous lemma, the rank is independent of the choice of MU,i and this means that v does not change, and hence also K can be left unchanged. If the element appears on a (0, .)-position, then part (1) of Lemma 5.2 applies. This means that either MU,i takes the minimal extension value which keeps the rank down, in which case the nice indices need not be changed, or MU,i is not this value, and then all the rows and columns corresponding to these (0, .)-positions have to be changed into x -rows and -columns, which means that the nice indices change as indicated. One only needs to exploit the Hankel structure to verify that the uniquely defined extension value is the same for each of the (0, .)-positions. The theorem is then proved. 0

6. Conclusion

In this short note we have formulated the minimal vector PadC approximation problem. We reduced this problem to a minimal partial realization problem for a single-input-multi-output system. Such MPRs for SIMO systems were also considered in [9]. We also considered a nice problem which corresponds to different numerator degrees and different orders of approximation per component. We generalized a property about the jumps in numerator degree of PadC approximants to the vector case. Similar results hold for the MPR problem, even in the matrix case. See, e.g., [2].

Besides all the previously mentioned definitions of vector PadC approximants, this paper contains yet another approach to possible extensions of the notion of scalar Pad& approximants to the vector case.

Acknowledgement

The authors thank Peter Graves-Morris for making many valuable suggestions to make the paper more readable.

A. B&heel, M, Van Bare1 / Minimal vector PadP approximation 31

References

[l] B.M. Anderson, F.M. Brasch Jr and P.V. Lopresti, The sequential construction of minimal partial realizations from finite input-ouput data, SIAM J. Control 13 (1975) 552-571.

[2] A.C. Antoulas, On recursiveness and related topics in linear systems, IEEE Trans. Automat. Control 31 (1986) 1121-1135.

[3] A. Bultheel and M. Van Barel, A matrix Euclidean algorithm and matrix Pad6 approximation, Report TW104, Dept. Comput. Sci., Kath. Univ. Leuven, 1988.

[4] A. Bultheel and G.-L. Xu, The structure of the minimal Pad6 approximation table, Report TW117, Dept. Comput. Sci., Kath. Univ. Leuven, 1988.

[5] J. Coates, On the algebraic computation of functions, Indag. Math. 28 (1966) 421-461. [6] M. de Bruin, Generalized continued fractions and a multidimensional Pad6 table, Ph.D. Thesis, Amsterdam,

1974. [7] B.W. Dickinson, M. Morf and T. Kailath, A minimal realization algorithm for matrix sequences, IEEE Trans.

Automat. Control 19 (1974) 31-38. [8] P.R. Graves-Morris, Vector valued interpolants I, Numer. Math. 42 (1983) 331-348; Vector valued interpolants

II, ZMA J. Numer. Anal. 4 (1984) 209-224; (with C.D. Jenkins) Vector valued interpolants III, Constr. Approx. 2

(1986) 263-289. [9] P.R. Graves-Morris and J.M. Wilkins, A fast algorithm to solve Kalman’s partial realisation problem for

single-input, multi-output systems, in: E.B. Saff, Ed., Approximation Theory Tampa, Lecture Notes in Math. 1287 (Springer, Berlin, 1987) l-20.

[lo] H. Jager, A multidimensional generalization of the Pad6 table, Indug. Math. 26 (1964) 193-249. [ll] R.E. Kalman, On partial realizations, transfer functions and canonical forms, Actu Polytech. Scund. Math.

Comput. Sci. Ser. 31 (1979) 9-32. [12] K. Mahler, Perfect systems, Compositio Math. 19 (1968) 95-166. [13] E.M. N&kin, On simultaneous Pad6 approximants, Math. USSR Sb. 41 (1982) 409-425. [14] M. Van Bare1 and A. Bultheel, Minimal Pad6 sense matrix approximations around s = 0 and s = co, in: A. Cuyt,

Ed., Nonlinear Numerical Methods and Rational Approximation (Reidel, Dordrecht, 1988) 143-154. [15] M. Van Bare1 and A. Bultheel, A canonical matrix continued fraction solution of the minimal (partial) realization

problem, Linear Algebra Appl. 122-124 (1989) 973-1002. [16] J. van Iseghem, Approximants de Pad6 vectoriels, Ph.D. Thesis, Lille, 1987. [17] P. Wynn, Continued fractions whose coefficients obey a non-commutative law of multiplication, Arch. Rational

Mech. Anal. 12 (1963) 273-312.

minimal vector padé approximation

Documents