rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... ·...

22
Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 http://www.journalofinequalitiesandapplications.com/content/2013/1/580 RESEARCH Open Access Perturbation analysis of the stochastic algebraic Riccati equation Chun-Yueh Chiang 1 , Hung-Yuan Fan 2 , Matthew M Lin 3* and Hsin-An Chen 3 * Correspondence: [email protected] 3 Department of Mathematics, National Chung Cheng University, Chia-Yi 621, Taiwan Full list of author information is available at the end of the article Abstract In this paper we study a general class of stochastic algebraic Riccati equations (SARE) arising from the indefinite linear quadratic control and stochastic H problems. Using the Brouwer fixed point theorem, we provide sufficient conditions for the existence of a stabilizing solution of the perturbed SARE. We obtain a theoretical perturbation bound for measuring accurately the relative error in the exact solution of the SARE. Moreover, we slightly modify the condition theory developed by Rice and provide explicit expressions of the condition number with respect to the stabilizing solution of the SARE. A numerical example is applied to illustrate the sharpness of the perturbation bound and its correspondence with the condition number. MSC: Primary 15A24; 65F35; secondary 47H10; 47H14 Keywords: Brouwer fixed-point theorem; perturbation bound; stochastic algebraic Riccati equations; condition number 1 Introduction In this paper we consider a general class of continuous-time stochastic algebraic Riccati equations A X + XA + C XC ( XB + C XD + S )( R + D XD ) ( B X + D XC + S ) + H = , (a) R + D XD , (b) where A R n×n , C R n×n , B R n×m , D R n×m , S R n×m , respectively. Moreover, H R n×n and R R m×m are symmetric matrices. Here we denote M (respectively, M ) if M is symmetric positive definite (respectively, positive semidefinite). The unknown X R n×n is a symmetric solution to SARE (a)-(b). Let S n be the set of all symmetric n × n real matrices. For any X, Y S n , we write X Y if X Y . In essence, SARE (a)-(b) is a rational Riccati-type matrix equation associated with the operator R : dom R S n R(X)= P (X)– S (X)Q(X) S (X) , where the affine linear operators P : S n S n , Q : S n S m , S : S n R n×m , and dom R are defined by P (X)= A X + XA + C XC + H, ©2013 Chiang et al.; licensee Springer. This is an Open Access article distributed under the terms of the Creative Commons At- tribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Upload: others

Post on 23-May-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580http://www.journalofinequalitiesandapplications.com/content/2013/1/580

R E S E A R C H Open Access

Perturbation analysis of the stochasticalgebraic Riccati equationChun-Yueh Chiang1, Hung-Yuan Fan2, Matthew M Lin3* and Hsin-An Chen3

*Correspondence:[email protected] of Mathematics,National Chung Cheng University,Chia-Yi 621, TaiwanFull list of author information isavailable at the end of the article

AbstractIn this paper we study a general class of stochastic algebraic Riccati equations (SARE)arising from the indefinite linear quadratic control and stochastic H∞ problems. Usingthe Brouwer fixed point theorem, we provide sufficient conditions for the existence ofa stabilizing solution of the perturbed SARE. We obtain a theoretical perturbationbound for measuring accurately the relative error in the exact solution of the SARE.Moreover, we slightly modify the condition theory developed by Rice and provideexplicit expressions of the condition number with respect to the stabilizing solutionof the SARE. A numerical example is applied to illustrate the sharpness of theperturbation bound and its correspondence with the condition number.MSC: Primary 15A24; 65F35; secondary 47H10; 47H14

Keywords: Brouwer fixed-point theorem; perturbation bound; stochastic algebraicRiccati equations; condition number

1 IntroductionIn this paper we consider a general class of continuous-time stochastic algebraic Riccatiequations

A�X + XA + C�XC –(XB + C�XD + S

)(R + D�XD

)–(B�X + D�XC + S�)+ H = , (a)

R + D�XD � , (b)

where A ∈ Rn×n, C ∈ R

n×n, B ∈ Rn×m, D ∈ R

n×m, S ∈ Rn×m, respectively. Moreover, H ∈

Rn×n and R ∈R

m×m are symmetric matrices. Here we denote M � (respectively, M � )if M is symmetric positive definite (respectively, positive semidefinite). The unknown X ∈R

n×n is a symmetric solution to SARE (a)-(b). Let Sn be the set of all symmetric n × nreal matrices. For any X, Y ∈ Sn, we write X � Y if X – Y � .

In essence, SARE (a)-(b) is a rational Riccati-type matrix equation associated with theoperator R : domR→ Sn

R(X) = P(X) – S(X)Q(X)–S(X)�,

where the affine linear operators P : Sn → Sn, Q : Sn → Sm, S : Sn → Rn×m, and domR

are defined by

P(X) = A�X + XA + C�XC + H ,

©2013 Chiang et al.; licensee Springer. This is an Open Access article distributed under the terms of the Creative Commons At-tribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction inany medium, provided the original work is properly cited.

Page 2: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 2 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

Q(X) = R + D�XD,

S(X) = XB + C�XD + S,

domR ={

X ∈ Sn |Q(X) � }

.

We say that X is the maximal solution (or the greatest solution) of SARE (a)-(b) if itsatisfies (a)-(b) and X � P for any P ∈ Sn satisfying R(P) ≥ and (b), i.e., X is themaximal solution of R(X) ≥ with the constraint (b). Furthermore, it is easily seen thatSARE (a)-(b) also contains the continuous-time algebraic Riccati equation (CARE)

A�X + XA – XBR–B�X + H = ()

with R � , C = , D = and S = , and the discrete-time algebraic Riccati equation(DARE)

X – C�XC +(C�XD + S

)(R + D�XD

)–(D�XC + S�)– H = ()

with A = –I and B = , as special cases.

Matrix equations of the type (a)-(b) are encountered in the indefinite linear quadratic(LQ) control problem [], and the disturbance attenuation problem, which is in deter-ministic case the H∞ control theory, for linear stochastic systems with both state- andinput-dependent white noise. For example, see [–]. For simplicity, we only considerone-dimensional Wiener process of white noise in this paper; it is straightforward buttedious to extend all perturbation results presented in this paper for multi-dimensionalcases. In the aforementioned applications of linear stochastic systems, a symmetric solu-tion X, called a stabilizing solution, to SARE (a)-(b) ought to be determined for the designof optimal controllers. This stabilizing solution plays a very important role in many ap-plications of linear system control theory. The definition of a stabilizing solution to SARE(a)-(b) is given as follows. (See also [, Definition .].)

Definition . Let X ∈ Sn be a solution to SARE (a)-(b), � = A + BF and � = C + DF ,where F = –Q(X)–S(X)�. The matrix X is called a stabilizing solution for R if the spec-trum of the associated operator Lc with respect to X defined by

Lc(W ) = ��W + W� + ��W� , W ∈ Sn, ()

is contained in the open left half plane, i.e., σ (Lc) ⊂C–.

Note that if C = D = in (a)-(b), then it is easily seen from Definition . that thematrix X ∈ Sn is a stabilizing solution to SARE (a)-(b) or, equivalently, CARE () if andonly if σ (�) ⊂C–. Therefore, Definition . is a natural generalization of the definition ofa stabilizing solution to CARE () in classical linear control theory. Moreover, a necessaryand sufficient condition for the existence of the stabilizing solution to a more general SAREis derived in Theorem . of []. See also [, Theorem ]. In this case, it is also shown thatif SARE (a)-(b) has a stabilizing solution X ∈ dom(R), then it is necessarily a maximalsolution and thus unique [, ].

Page 3: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 3 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

The standard CARE () and DARE () are widely studied and play very important rolesin both classical LQ and H∞ control problems for deterministic linear systems [–]. Inthe past four decades, an extensive amount of numerical methods were studied and de-veloped for solving the CARE and DARE (see [–] and the references therein). Thereare two major methodologies among these numerical methods or algorithms. One is theso-called Schur method or invariant subspace method, which was first proposed by Laub[]. According to this methodology, the unique and non-negative definite stabilizing so-lution of the CARE (or DARE) can be obtained by computing the stable invariant subspace(or deflating subspace) of the associated Hamiltonian matrix (or symplectic matrix pen-cil). Some variants of the invariant subspace method, which preserve the structure of theHamiltonian matrix (or symplectic matrix pencil) by special orthogonal transformationsin the whole computational process, are considered by Mehrmann and his coauthors [–]. The other methodology comes from the iterative method, for example, it is referred toas Newton’s method [], matrix sign function method [], disk function method [], andstructured doubling algorithms [, ] and references therein. So far there has been nosources in applying the invariant subspace methods for solving SARE (a)-(b), since thestructures of associated Hamiltonian matrix or symplectic matrix pencil are not available.Only the iterative methods, e.g., Newton’s method [] and the interior-point algorithmpresented in [], can be applied to computing the numerical solutions of SARE (a)-(b).Recently, normwise residual bounds were proposed for assessing the accuracy of a com-puted solution to SARE (a)-(b) [].

Due to the effect of roundoff errors or the measurement errors of experimental data,small perturbations are often incorporated in the coefficient matrices of SARE (a)-(b),and hence we obtain the perturbed SARE

A�X + XA + C�XC –(XB + C�XD + S

)(R + D�XD

)–(B�X + D�XC + S�)+ H = , (a)

R + D�XD � , (b)

where A, B, C, D, H , R and S are perturbed coefficient matrices of compatible sizes. Themain question is under what conditions perturbed SARE (a)-(b) still has a stabilizingsolution X ∈ Sn. Moreover, how sensitive is the stabilizing solution X ∈ dom(R) of originalSARE (a)-(b) with respect to small changes in the coefficient matrices? This is related tothe conditioning of SARE (a)-(b). Therefore, we will try to answer these questions forSARE (a)-(b) in this paper. For CARE () and DARE (), the normwise non-local andlocal perturbation bounds have been widely studied in the literature. See, e.g., [–].Also, computable residual bounds were derived for measuring the accuracy of a computedsolution to CARE () and DARE (), respectively [, ]. To our best knowledge, theseissues have not been taken into account for constrained SARE (a)-(b) in the literature.

To facilitate our discussion, we use ‖·‖F to denote the Frobenius norm and ‖·‖ to denotethe operator norm induced by the Frobenius norm. For A = (A, . . . , An) = (aij) ∈ Rm×n andB ∈R

p×q, the Kronecker product of A and B is defined by A ⊗ B = (aijB) ∈Rmp×nq, and the

operator vec(A) is denoted by vec(A) = (A� , . . . , A�

n )�. It is known that

vec(ABC) =(C� ⊗ A

)vec(B), vec

(A�)

= Pn,m vec(A),

Page 4: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 4 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

where A ∈ Rn×m, B ∈ R

m×�, C ∈ R�×k , and Pn,m is the Kronecker permutation matrix

which maps vec(A) into vec(A�) for a rectangle matrix A, i.e.,

Pn,m =n,m∑i,j=

Ei,j,n×m ⊗ Ej,i,m×n,

where the n × m matrix Ei,j,n×m has as its (i, j) entry and ’s elsewhere.This paper is organized as follows. In Section , a perturbation equation is derived from

SAREs (a)-(b) and (a)-(b) without dropping any higher-order terms. By using Brouwerfixed point theorem, we obtain a perturbation bound for the stabilizing solution of SARE(a)-(b) in Section . In order to guarantee the existence of the stabilizing solution ofperturbed SARE (a)-(b), some stability analysis of the operator Lc is established in Sec-tion . A theoretical formula of the normwise condition number of the stabilizing solutionto SARE (a)-(b) is derived in Section . Finally, in Section , a numerical example is givento illustrate the sharpness and tightness of our perturbation bounds, and Section con-cludes the paper.

2 Perturbation equationAssume that X ∈ Sn is the unique stabilizing solution to SARE (a)-(b) and X ∈ Sn is asymmetric solution of perturbed SARE (a)-(b), that is,

R(X) := A�X + XA + C�XC – �(X) + H = , ()

R(X) := A�X + XA + C�XC – �(X) + H = , ()

where the two operator � : Sn → Sn and � : Sn → Sn are given by

�(X) = S(X)Q(X)–S(X)�,

�(X) = S(X)Q(X)–S(X)�,()

and two affine linear operators S : Sn → Sn, Q : Sn → Sm are defined by

S(X) = XB + C�XD + S,

Q(X) = R + D�XD

for all X ∈ Sn. Let

�X = X – X.

The purpose of this section is to derive a perturbation equation of �X from SAREs (a)-(b) and (a)-(b). For the sake of perturbation analysis, we adopt the following notations:

�A = A – A, �B = B – B, �C = C – C, �D = D – D,

�S = S – S, �R = R – R, �H = H – H()

and

δQ = �R + D�X�D + �D�XD + �D�X�D,

δS = �S + C�X�D + �C�XD + �C�X�D + X�B.()

Page 5: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 5 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

Moreover, let

F = –Q(X)–S(X)�, F = –Q(X)–S(X)�,

� = C + DF , � = C + DF , ()

� = A + BF , � = A + BF ,

and by the definition of � , we define

K := X� . ()

Note that S(X) = S(X) + δS and Q(X) = Q(X) + δQ. Substituting () into (), we observethat

�(X) = –S(X)F ,

�(X) =(S(X) + �XB + C��XD

)(Q(X) + D��XD

)– ()

× (S(X) + �XB + C��XD

)�.

Thus far, we have not specified the relation between R(X) and R(X). Such a tedious taskcan be turned into a breeze by repeatedly applying the matrix identities []

(I + U)– = I – U(I + U)–, V (I + UV )– = (I + VU)–V . ()

To begin with, assume that �R and �D are sufficiently small so that Q(X) is invertible.We see that the product

(S(X) + �XB + C��XD

)(Q(X) + D��XD

)–

=(S(X) + �XB + C��XD

)× [

I – Q(X)–D��XD(I + Q(X)–D��XD

)–]Q(X)–

= –F� + �XB(I + Q(X)–D��XD

)–Q(X)–

+ ���XD(I + Q(X)–D��XD

)–Q(X)–.

It follows that

�(X) = –S(X )F – F�B��X – F�D��XC – ���XDF – �XBF

+(���XD + �XB

)(I + Q(X)–D��XD

)–Q(X)–

× (D��X� + B��X

)since F�S(X)� = S(X )F . Next, from () we can see that

���X + �X� = A��X + �XA + F�B��X + �XBF ,

���X� = C��XC + F�D��XC + ���XDF .()

Page 6: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 6 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

Applying (), we obtain the linear equation

R(X) – R(X) = ���X + �X� + ���X� – E – h(�X) = , ()

where

E := –(�A�X + X�A + C�XC – C�XC + S(X )F – S(X)F + �H

),

h(�X) := ���XD(I + Q(X)–D��XD

)–Q(X)–D��X�

+ �XB(I + Q(X)–D��XD

)–Q(X)–B��X

+ �XB(I + Q(X)–D��XD

)–Q(X)–D��X�

+ ���XD(I + Q(X)–D��XD

)–Q(X)–B��X.

It follows from () that

���X + �X� + ���X� = E + h(�X). ()

Equipped with this fact, we now are going to derive a perturbation equation in terms of�X by using �A, �B, �C, �D, �S, �R, δS, and δQ. It should be noted that

� = (�C + C) – (�D + D)(Q(X) + δQ

)–(S(X) + δS)�

= � + �� ,

with

�� := �C – �DQ(X)–S(X)� – �DQ(X)–δS� – DQ(X)–δS�

+ (�D + D)Q(X)–δQQ(X)–(I + Q(X)–δQ)–(S(X)� + δS�)

()

and

� = (�A + A) – (�B + B)(Q(X) + δQ

)–(S(X) + δS)�

= � + ��,

with

�� := �A – �BQ(X)–S(X)� – �BQ(X)–δS� – BQ(X)–δS�

+ (�B + B)Q(X)–δQQ(X)–(I + Q(X)–δQ)–(S(X)� + δS�)

. ()

It then is natural to express the left-hand side of () by �� and �� such that

���X + �X� + ���X� = ���X + �X� + ���X� – h(�X),

with

h(�X) := –(����X + �X�� + ���X�� + ����X� + ����X��

).

Page 7: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 7 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

Observe further that

C�XC – C�XC = C�X�C + �C�XC + �C�X�C,

S(X )F – S(X)F = –(S(X) + δS

)(Q(X) + δQ

)–(S(X) + δS)�

+ S(X)Q(X)–S(X)�

= F�δS� + δSF – δS(I + Q(X)–δQ

)–Q(X)–δS�

– F�δQ(I + Q(X)–δQ

)–Q(X)–δS�

– δSQ(X)–δQ(I + Q(X)–δQ

)–F

+ F�δQF – F�δQ(I + Q(X)–δQ

)–Q(X)–δQF .

Upon substituting () into δSF and F�δQF , we have

δSF = �SF + C�X�DF + �C�XDF + �C�X�DF + X�BF ,

F�δQF = F��RF + F�D�X�DF + F��D�XDF + F��D�X�DF ,

so that the structure of E in () can be partitioned into linear equations

E := –(K��DF + F��D�K + K��C + �C�K + F��RF + F��S� + �SF + �H

),

E := –[�A�X + X�A + �C�X�C + F��B�X + X�BF

+ F��D�X�C + �C�X�DF + F��D�X�DF

–(F�δQ + δS

)(I + Q(X)–δR

)–Q(X)–(δQF + δS�)],

that is, E = E + E.

Lemma . Let X be the stabilizing solution of SARE (a)-(b) and X be a symmetric solu-tion of perturbed SARE (a)-(b). If �X = X – X, then �X satisfies the equation

���X + �X� + ���X� = E + E + h(�X) + h(�X), ()

where

E = –(K��DF + F��D�K + K��C + �C�K + F��RF

+ F��S� + �SF + �H), (a)

E = –[�A�X + X�A + �C�X�C + F��B�X + X�BF

+ F��D�X�C + �C�X�DF + F��D�X�DF

–(F�δQ + δS

)(I + Q(X)–δR

)–Q(X)–(δQF + δS�)],

h(�X) = –(����X + �X�� + ���X�� + ����X� + ����X��

), (b)

h(�X) = ���XDD��X� + �XBB��X

+ �XBD��X� + ���XDB��X, (c)

Page 8: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 8 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

where = (I + Q(X)–D��XD)–Q(X)–, the matrices �A, �B, and so on are given by()-().

Note that E and E are not dependent on �X, h(�X) is a linear function of �X, andh(�X) is a function of �X with degree at most . Assume that the linear operator Lc of() is invertible. It is easy to see that the perturbed equation () is true if and only if

�X = L–c E + L–

c E + L–c h(�X) + L–

c h(�X). ()

Thus far, we have not specified the condition for the existence of the solution �X in ().In the subsequent discussion, we shall limit our attention to identifying the condition ofthe existence of a fixed point of (), that is, to determine an upper bound on the sizeof �X.

3 Perturbation boundsLet f : Sn → Sn be a continuous mapping defined by

f (Y ) = L–c E + L–

c E + L–c h(Y ) + L–

c h(Y ) for Y ∈ Sn. ()

We see that any fixed point of the mapping f is a solution to the perturbed equation ().Our approach in this section is to present an upper bound for the existence of some fixedpoints �X. It starts with the discussion that the mapping f given by () satisfies

∥∥f (�X)∥∥

F ≤ ∥∥L–c E

∥∥F +

∥∥L–c E

∥∥F +

∥∥L–c h(�X)

∥∥F +

∥∥L–c h(�X)

∥∥F .

Define linear operatorsM : Rn×n → Sn,N : Rn×m → Sn, T : Sm → Sn andH : Rn×m →Sn by

M�C = L–c

(K��C + �C�K

), (a)

N�D = L–c

(K��DF + F��D�K

), (b)

T �R = L–c

(F��RF

), (c)

H�S = L–c

(F��S� + �SF

), (d)

and the scalars ω, μ, ν , τ , η by

ω =∥∥L–

c∥∥, μ = ‖M‖, ν = ‖N ‖, τ = ‖T ‖, η = ‖H‖. ()

From (a) we then have

∥∥L–c E

∥∥F ≤ μ‖�C‖F + ν‖�D‖F + τ‖�S‖F + η‖�R‖F + ω‖�H‖ ≡ ε. ()

We now move into more specific details pertaining to the discussion of the fixed pointof the continuous mapping f . Before doing so, we need to describe an important prop-erty of the norm of the product of two matrices and repeatedly employ it in the followingdiscussion. For the proof, the reader is referred to [, Theorem .].

Page 9: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 9 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

Lemma . Let A and B be two matrices in Rn×n. Then ‖AB‖F ≤ ‖A‖‖B‖F and ‖AB‖F ≤

‖A‖F‖B‖.

It immediately follows that the matrices δQ and δS, defined by (), satisfy

‖δQ‖F ≤ ‖�R‖F + ‖XD‖‖�D‖F + ‖X‖‖�D‖F ≡ δr ,

‖δS‖F ≤ ‖�S‖F + ‖XC‖‖�D‖F + ‖XD‖‖�C‖F ()

+ ‖X‖‖�D‖F‖�C‖F + ‖X‖‖�B‖F ≡ δs.

Assume that the scalar δr satisfies

–∥∥Q(X)–∥∥

δr > . ()

Then ‖L–c E‖F is bounded by∥∥L–

c E∥∥

F ≤ ω‖X‖(‖�A‖F + ‖F‖‖�B‖F

)+ ω‖X‖

(‖�C‖F + ‖F‖‖�D‖F)

+ω‖Q(X)–‖(‖F‖δr + δs)

– ‖Q(X)–‖δr≡ ε. ()

From (b) we see that∥∥h(�X)∥∥

F ≤ (‖��‖F + ‖�‖‖��‖F + ‖��‖

F)‖�X‖F ,

and also from () and () we have

‖��‖F ≤ ‖�A‖F + ‖F‖‖�B‖F +(∥∥BQ(X)–∥∥

+∥∥Q(X)–∥∥

‖�B‖F)δs

+‖Q(X)–‖(‖B‖ + ‖�B‖F )(‖F‖ + ‖Q(X)–‖δs)δr

– ‖Q(X)–‖δr≡ δ�, ()

‖��‖F ≤ ‖�C‖F + ‖F‖‖�D‖F +(∥∥DQ(X)–∥∥

+∥∥Q(X)–∥∥

‖�D‖F)δs

+‖Q(X)–‖(‖D‖ + ‖�D‖F )(‖F‖ + ‖Q(X)–‖δs)δr

– ‖Q(X)–‖δr≡ δ� . ()

It follows that∥∥L–c h(�X)

∥∥F ≤ ωδ‖�X‖F , ()

where the positive scalar δ is defined by

δ = δ� + ψδ� + δ� with ‖�‖ = ψ . ()

Also, from () and Lemma . we know that Q(X) = Q(X) + δQ = Q(X)–(I + Q(X)–δQ)and ‖Q(X)–δQ‖F ≤ ‖Q(X)–‖δr < . This implies that Q(X) is nonsingular,

‖B‖∥∥Q(X)–∥∥

= ‖B + �B‖∥∥(Q(X) + δQ

)–∥∥

= ‖B + �B‖∥∥(

I + Q(X)–δQ)–Q(X)–∥∥

≤ ‖Q(X)–‖(‖B‖ + ‖�B‖F )

– ‖Q(X)–‖δr≡ γB. ()

Page 10: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 10 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

Similarly, we have

‖D‖∥∥Q(X)–∥∥

≤ ‖Q(X)–‖(‖D‖ + ‖�D‖F )

– ‖Q(X)–‖δr≡ γD. ()

Assume that

– γD‖�X‖F > . ()

It then follows from Lemma . and (c) that

∥∥h(�X)∥∥

F ≤ (‖�‖‖D‖ + ‖B‖)‖Q(X)–‖‖�X‖F

– ‖D‖‖Q(X)–‖‖�X‖F

, ()

and from (), () and () that

‖B‖ ≤ ‖B‖ + ‖�B‖ ≡ αB,

‖D‖ ≤ ‖D‖ + ‖�D‖ ≡ αD, ()

‖�‖ ≤ ‖�‖ + ‖��‖F ≤ ‖�‖ + δ� ≡ ψ .

Upon substituting (), () and () into (), we see that

∥∥L–c h(�X)

∥∥F ≤ ω(ψγD + ψαBαD + γB)‖�X‖

F – γD‖�X‖F

.

Finally, by (), () and (), we arrive at the statement

∥∥f (�X)∥∥

F ≤ ε + ωδ‖�X‖F +ωα‖�X‖

F – γD‖�X‖F

, ()

where

α ≡ ψγD + ψαBαD + γB, ε ≡ ε + ε. ()

Consider the quadratic equation

(γD – ωδγD + ωα)ξ – ( – ωδ + εγD)ξ + ε = . ()

It is true that if

δ <ω

, (a)

ε ≤ ( – ωδ)

γD – ωδγD + ωα +√

(γD – ωδγD + ωα) – γ D( – ωδ)

, (b)

then the positive scalar ξ∗ denoted by

ξ∗ =ε

( – ωδ + εγD) +√

( – ωδ + εγD) – (γD – ωδγD + ωα)ε()

Page 11: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 11 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

is a solution to (). Let Snξ∗ be a compact subset of Sn given by

Snξ∗ =

{�X ∈ Sn : ‖�X‖F ≤ ξ∗

}.

It can be seen that in ()

∥∥f (�X)∥∥

F ≤ ξ∗ if �X ∈ Snξ∗ .

It then follows from the Brouwer fixed-point theorem (see []) that the continuous map-ping f has a fixed point �X∗ ∈ Sn

ξ∗ , that is, condition () automatically holds.Observe also that if �X ∈ Sn

ξ∗ , then

– γD‖�X‖F ≥ ( – γDξ∗) (by ())

≥ –εγD

– ωδ + εγD=

– ωδ – εγD

– ωδ + εγD(by (b))

≥ – ωδ – (–ωδ)γDγD–ωδγD+ωα

– ωδ + εγD(by (a))

=( – ωδ)ωα

( – ωδ + εγD)(γD – ωδγD + ωα)≥ .

This implies that assumption () is true, if assumption (a)-(b) is true.

4 Stability analysisWe have shown that the mapping f given by () has a Hermitian fixed point �X∗. Thisfurther implies that perturbed SARE (a)-(b) has a Hermitian solution X = X + �X∗. Inthis section, we want to discuss the stability of the solution X , i.e., show that the solution Xis the unique maximal solution to SARE (a)-(b). Let ϒ and � be two operators definedby

ϒ(W ) = ��W + W�, �(W ) = ��W� , W ∈ Sn,

with the notations � and � given in Definition .. It follows that the operator Lc definedby () can also be written as

Lc(W ) = ϒ(W ) + �(W ), W ∈ Sn. ()

We then have the following important result addressing the condition for a linear operatorto be stable. To see a few necessary and sufficient conditions on the stability, we refer tothe results and proofs given in [].

Theorem . The linear operator Lc = ϒ + � given by () is stable, i.e., σ (Lc) ⊂ C–, ifand only if σ (�) ⊂C– and det(ϒ + τ�) �= for all τ ∈ [, ].

When small perturbations Z, Z ∈ Rn×n are taken into consideration, the perturbed

operator of Lc can be expressed by

Lc(W ) = ϒ(W ) + �(W ), ()

Page 12: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 12 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

where ϒ(W ) = (� + Z)�W + W (� + Z) and �(W ) = (� + Z)�W (� + Z) for all W ∈ Sn.Define the quantity

�(θ ) =∥∥(ϒ + θ�)–∥∥

for θ ∈ [, ] and

β(Lc) = min(Z,Z)∈Z

max{‖Z‖,‖Z‖

},

where the set Z = {(Z, Z) ∈Rn×n ×R

n×n | det(ϒ + θ�) = for some θ ∈ [, ]}. It shouldbe noted that if σ (�) ⊂C–, � = and Z = , then

β(Lc) ≤ β(�), ()

where the value β(�) is defined by []

β(�) = min{‖Z‖

∣∣ max≤j≤n

Reλj(� + Z) = , Z ∈Rn×n

}. ()

Here, λj(� + Z) (j = , . . . , n) denote the eigenvalues of � + Z.The connection between β(Lc) and the maximum of the scalar function �(θ ) on [, ]

can be established in the following form.

Theorem . [] Suppose that the linear operator Lc given by () is stable, and let

�c = maxθ∈[,]

�(θ ), ψ = ‖�‖. ()

Then

β(Lc) ≥ �–c

(ψ + ) +√

(ψ + ) + �–c

.

We now apply Theorem . to () and obtain that

β(�) ≥ β(Lc) ≥ �–c

(ψ + ) +√

(ψ + ) + �–c

.

Hence, if a perturbation matrix Z ∈Rn×n satisfies

‖Z‖ <�–

c

(ψ + ) +√

(ψ + ) + �–c

,

then () implies that the matrix � + Z must be c-stable.We now turn to a key stability test of the operator Lc, the striking tool of our stability

analysis.

Theorem . [] Suppose that the linear operator Lc is stable, and let the scalars �c andψ be defined as in (). If the perturbation matrices Z, Z ∈R

n×n satisfy

max{‖Z‖,‖Z‖

}<

�–c

(ψ + ) +√

(ψ + ) + �–c

, ()

Page 13: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 13 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

then � + Z is c-stable and the perturbed linear operator Lc defined by () is also stable,i.e., σ (Lc) ⊂C–.

Upon substituting X for X in S(X) and Q(X) of (), we shall have

Q(X) = R + D�XD = Q(X) + δQ + D��XD ≡Q(X) + �R, ()

S(X) = S + C�XD + XB = S(X) + δS + �XB + C��XD

≡ S(X) + �S. ()

Also, corresponding to X , the perturbed �X and �X of � and �, respectively, can beexpressed in terms of the formulae

�X = (�A + A) – (�B + B)(� + �S)–(� + �R)� := � + ��,

�X = (�C + C) – (�D + D)(� + �R)–(� + �S)� := � + �� ,()

with

�� := �A – �BQ(X)–S(X)� – �BQ(X)–�S� – BQ(X)–�S�

+ (�B + B)Q(X)–�RQ(X)–(I + Q(X)–�R)–(S(X)� + �S�)

,

�� := �C – �DQ(X)–S(X)� – �DQ(X)–�S� – DQ(X)–�S�

+ (�D + D)Q(X)–�RQ(X)–(I + Q(X)–�R)–(S(X)� + �S�)

.

Let αC := ‖C‖ + ‖�C‖. Since ‖�X‖F ≤ ξ∗, it follows from (), () and () that

‖�R‖F ≤ δr + αDξ∗ := cr ,

‖�S‖F ≤ δs + (αB + αCαD)ξ∗ := cs.

Thus ‖��‖F and ‖��‖F are bounded by the inequalities

‖��‖F ≤ ‖�A‖F + ‖F‖‖�B‖F +αB‖Q(X)–‖(cs + cr‖F‖)

– cr‖Q(X)–‖,

‖��‖F ≤ ‖�C‖F + ‖F‖‖�D‖F +αD‖Q(X)–‖(cs + cr‖F‖)

– cr‖Q(X)–‖.

Here, the above upper bounds are obtained by simplifying those given by () and ().Let

f = ‖F‖, γ =∥∥Q(X)–∥∥

,

αC = ‖C‖ + ‖�C‖F , ζ = max{‖�A‖F ,‖�C‖F

}, ()

ζ = max{‖�B‖F ,‖�D‖F

}, ζ = max{αB,αD},

where αB and αD are defined by () and � is defined to be the right-hand side of (),that is,

� =�–

c

(ψ + ) +√

(ψ + ) + �–c

. ()

Page 14: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 14 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

We then have

max{‖��‖F ,‖��‖F

} ≤ ζ + f ζ +γ ζ(δs + δrf ) + γ ζ(αB + αCαD + α

Df )ξ∗ – δrf – α

Df ξ∗.

It follows that if the condition

ζ + f ζ +γ ζ(δs + δrf ) + γ ζ(αB + αCαD + α

Df )ξ∗ – δrf – α

Df ξ∗< �

or, equivalently,

ξ∗ <(� – ζ – f ζ)( – δrf ) – γ ζ(δs + δrf )

(� – ζ – f ζ)αDf + γ ζ(αB + αCαD + α

Df )

holds, then corresponding to Theorem ., the perturbed linear operator Lc with respectto X is stable. In other words, the matrix X ∈ Sn must be the unique stabilizing (and max-imal) solution to perturbed SARE (a)-(b).

We now have all the materials needed for the existence of a stabilizing solution of (a)-(b).

Theorem . (Perturbation bound) Let X be the stabilizing solution of (a)-(b). Let ω, δr ,δs, δ, γD, αB, αD, α, ε, f , γ , αC , ζ, ζ, ζ, � be the scalars defined by (), (), (), (),(), (), () and (), respectively. Define

ξ∗ =ε

( – ωδ + εγD) +√

( – ωδ + εγD) – (γD – ωδγD + ωα)ε.

If the perturbed quantities of the coefficients of (a)-(b) are sufficiently small, for example,ε � , such that

–∥∥Q(X)–∥∥

δr > ,

– ωδ > ,

( – ωδ)

γD – ωδγD + ωα +√

(γD – ωδγD + ωα) – γ D( – ωδ)

– ε ≥ ,

(� – ζ – f ζ)( – δrf ) – γ ζ(δs + δrf )(� – ζ – f ζ)α

Df + γ ζ(αB + αCαD + αDf )

– ξ∗ > ,

then perturbed SARE (a)-(b) has the unique stabilizing solution X, and

‖X – X‖F

‖X‖F≤ ξ∗

‖X‖F. ()

5 Condition number of the SAREIn the study of a computational problem, a fundamental issue is to determine the conditionnumber of a problem to be the ratio of the relative change in the solution to the relative

Page 15: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 15 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

change in the argument. Applying the theory of condition number given by Rice [], wedefine the condition number c(X) of the stabilizing solution X of SARE (a)-(b) by

c(X) = limδ→+

supδ

‖�X‖F

κδ, ()

where the set of perturbed matrices δ is defined by

δ = δ(κA,κB,κC ,κD,κS,κR,κH )

={

(�A,�B,�C,�D,�S) ∈Rn×n ×R

n×m ×Rn×n ×R

n×m ×Rn×m,

(�R,�H) ∈ Sn × Sm | < δp ≤ δ}

, ()

with

δp =∥∥∥∥(

�AκA

,�BκB

,�CκC

,�DκD

,�SκS

,�RκR

,�HκH

)∥∥∥∥F

,

and κA, κB, κC , κD, κS , κR, κH , κ are positive parameters. Then () gives the absolutecondition number cabs(X) if

(κA,κB,κC ,κD,κS,κR,κH ,κ) = (, , , , , , , )

and gives the relative condition number crel(X) if

(κA,κB,κC ,κD,κS,κR,κH ,κ) =(‖A‖F ,‖B‖F ,‖C‖F ,‖D‖F ,‖S‖F ,‖R‖F ,‖H‖F ,‖X‖F

).

It follows from () and (a)-(d) that

�X = P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H + O

p),

where the linear operators P : Rn×n → Sn and Q : Rn×m → Sn are defined by

P�A = L–c

(X�A + �A�X

), (a)

Q�B = L–c

(X�BF + F��B�X

). (b)

In order to derive the explicit expression for the condition number c(X) of the stabilizingsolution X of (a)-(b), we require a theorem concerning the form of the optimal solution.This theorem can be regarded as a theoretical extension of the results discussed in [,]. Most strategies have been established earlier by using much heavier machinery. Sincethis theorem is most relevant to our stability analysis, we briefly outline a direct proof withideas from [] to make this presentation more self-contained.

Theorem . Let L : Rn×n ×Rm×m →R

k×k be a linear operator and

L(Z, Z)� = L(Z, Z�

)

()

Page 16: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 16 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

for all Z ∈ Rn×n and Z ∈R

m×m. Then the optimal solution (Z�, Z�) to the problem

max‖(Z,Z)‖F =

∥∥L(Z, Z)∥∥

F ()

exists for some Z� ∈ Rn×n and Z� = ±Z�� ∈ Rm×m. Furthermore, if the linear operator

L(, Z) is a positive operator with respect to any Z ∈Cm×m, that is, for all Z ∈C

m×m, wehave

L(, Z) � if Z � .

Then there exists an optimal solution (Z�, Z�) ∈ Rn×n ×R

m×m to problem () such thatZ� is symmetric.

Proof Since L is a linear operator on a finite dimensional space, it is clear that the optimalsolution of () exists. Assume that (Z�, Z�) solves this optimization problem. Let σmax =‖L(Z�, Z�)‖F and L ∈R

k×(n+m) be the matrix representation of the operator L such that

vec(L(Z, Z)

)= L

[vec(Z)vec(Z)

]. ()

By () and (), we have

[vec(Z�)vec(Z�)

]�L�L

[vec(Z�)vec(Z�)

]=

[vec(Z�)vec(Z�

�)

]�L�L

[vec(Z�)vec(Z�

�)

]= σ

max. ()

Note that

� := ‖Z�‖F +

∥∥∥∥ (Z� + Z�

)∥∥∥∥

F= , only if Z� = , Z� = –Z�

�.

It follows that if Z�� �= –Z�, by (), we see that √

�(Z�,

(Z� + Z��)) is another optimal

solution for (). This proves the first part of the theorem.For the second part, if there exists a symmetric optimal solution, then it completes

the proof. Otherwise, from the first part, we know that there exists an optimal solution(Z�, Z�) with Z� = , Z� = –Z�

� ∈ Rm×m and ‖(Z�, Z�)‖F = to (). Let i =

√–. We

have the following matrix decomposition:

Z� = Q� diag

([ –ω

ω

], . . . ,

[ –ωk

ωk

], r

)Q,

where r is a zero matrix with size r × r, Q is an m × m orthogonal matrix, and ωj > for ≤ j ≤ k. Let

Z� := Q� diag

([ω ω

], . . . ,

[ωk ωk

], r

)Q

Page 17: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 17 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

be a real symmetric matrix. Since –Z� � iZ� � Z�, it is true that

L(, Z�) �L(, iZ�) �L(, –Z�) = –L(, Z�).

Using the fact that ‖iZ�‖F = ‖Z�‖F , we see that ‖(, Z�)‖F = ‖(, iZ�)‖F = ‖(, Z�)‖F = and

∥∥L(, Z�)∥∥

F ≥ ∥∥L(, iZ�)∥∥

F =∥∥L(, Z�)

∥∥F .

If W � W � –W, then ‖W‖F ≥ ‖W‖F , which implies that (, Z�) is a symmetric opti-mal solution to () (see [, Lemma A.]). This completes the proof. �

With the existence theory established above, it is interesting to note that the conditionnumber c(X) defined by () can be written as

c(X) =κ

limδ→+

supδ

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δ

maxδp>

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δp. ()

Note that the second equality in () is only an application of linearity of the norm. (Forthe proof, see Lemma A..) Observe further that the inverse operator L–

c of () satisfies

[L–

c (W )]� = L–

c(W �)

since [Lc(W )]� = Lc(W �) for all W ∈ Cn×n. It follows that

[T �R]� = T �R�, [P�A]� = P�A, [Q�B]� = Q�B,

[M�C]� = M�C, [N�D]� = N�D, [H�S]� = H�S.

Also, it is known that the inverse operator L–c is positive [, Corollary .]. It follows that

T is also a positive operator. Now, applying Theorem . to the operator P�A + Q�B +M�C + N�D + H�S + T �R + L–

c �H in (), we obtain the equality

c(X)

max

‖κAP�A + κBQ�B + κCM�C + κDN�D + κSH�S + κRT �R + κHL–c �H‖F

‖(�A,�B,�C,�D,�S,�R,�H)‖F,

where the extended set is defined by

={

(�A,�B,�C,�D,�S,�R,�H) ∈Rn×n ×R

n×m ×Rn×n ×R

n×m

×Rn×n ×R

n×n ×Rm×m | ∥∥(�A,�B,�C,�D,�S,�R,�H)

∥∥F >

}.

On the other hand, observe that the matrix representation of the operation Lc in () canbe written in terms of Lc = I ⊗ � + �� ⊗ I + �� ⊗ � . Corresponding to (a)-(d) and

Page 18: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 18 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

(a)-(b), we let

Mc = L–c

(I ⊗ K� +

(K� ⊗ I

)Pn,n

),

Nc = L–c

(F� ⊗ K� +

(K� ⊗ F�)

Pm,n),

Tc = L–c

(F� ⊗ F�)

,

Hc = L–c

(F� ⊗ I +

(I ⊗ F�)

Pm,n),

Pc = L–c

(I ⊗ X + (X ⊗ I)Pn,n

),

Qc = L–c

(F� ⊗ X +

(X ⊗ F�)

Pm,n)

and

U =(κAPc,κBQc,κCMc,κDNc,κSHc,κRTc,κHL–

c).

It follows that

c(X) =κ

maxV∈

‖U vec(V )‖

‖vec(V )‖=

‖U‖

κ.

Based on the above discussion, we have the following result.

Theorem . The condition number c(X) given by () has the explicit expression ‖U‖κ

.In particular, we have the relative condition number

crel(X) =‖(‖A‖F Pc,‖B‖F Qc,‖C‖F Mc,‖D‖F Nc,‖S‖F Hc,‖R‖F Tc,‖H‖F L–

c )‖

‖X‖F. ()

6 Numerical experimentIn this section we want to demonstrate the sharpness of perturbation bound () andits relationship with the relative condition number (). Based on Newton’s iteration [],a numerical example, done with × coefficient matrices, is illustrated. The numerical al-gorithm is described in Algorithm . The corresponding stopping criterion is determinedwhen the value of the Normalized Residual (NRes)

NRes =‖P(X) – S(X)Q(X)–S(X)�‖

‖P(X)‖ + ‖S(X)‖‖Q(X)–‖‖S(X)�‖is less than or equal to a prescribed tolerance.

Example Given a parameter r = –m, for some m > , let the matrices A, B, C, D bedefined by

A =

[– –

], B =

[ –

√r

], C = I, D =

[

],

and the matrices S, R, H be defined by

S =

[

], R =

[ r

], H =

[–/

].

Page 19: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 19 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

Algorithm : SARE [X] = Sare(A, B, C, D, R, S, H)Input: Matrices A, C ∈R

n×n, B, D, S ∈ Rn×m, H ∈ Sn, R ∈ Sm

Output: Matrix X ∈ Sn

beginChoose X ∈R

n×n;for i ← , . . . do

Fk = –(R + D�XkD)–(B�XkC + S�); �k = A + BFk ; �k = C + DFk ;R(Xk) as defined by ();Updating Xk+ by solving��

k Xk+ + Xk+�k + ��k Xk+�k = ��

k Xk + Xk�k + ��k Xk�k – R(Xk);

endend

Table 1 Relative errors and perturbation bounds

j Relative error ξ∗‖X‖F

5 6.28× 10–5 5.74× 10–4

6 6.34× 10–6 5.22× 10–5

7 1.14× 10–6 8.42× 10–6

8 1.61× 10–7 7.42× 10–7

9 1.05× 10–8 5.58× 10–8

It is easily seen that the unique stabilizing and maximal solution is

X =

[– (

√ – )/

].

Let the perturbed coefficient matrices �A, �B, �C, �D, �S, �R and �H be generated us-ing the MATLAB command randn with the weighted coefficient –j. That is, the matrices�A, �B, �C, �D, �S, �R and �H are generated in forms of randn()×–j, respectively.Since �R and �H are required to be symmetric, we need to fine-tune the perturbed ma-trices �R and �H by redefining �R and �H as �R + �R� and �H + �H�, respectively.Now, let (A, B, C, D, S, R, H) = (A + �A, B + �B, C + �C, D + �D, S + �S, R + �R, H + �H),which are coefficient matrices of SARE (a)-(b).

Firstly, we would like to evaluate the accuracy of the perturbation bound with the fixedparameter r = –, i.e., m = , and different weighted coefficients, –j, for j = , . . . , . Itcan be seen from Table that the values of the relative errors are closely bounded by ourperturbation bounds of (). In other words, () does provide a sharp upper bound ofthe relative errors of the stabilizing solution X.

Secondly, we want to investigate how ill-conditioned matrices affect the quantities ofperturbation bounds. In this sense, the weighted coefficients are fixed to be –, i.e.,j = . The relationships among relative errors, perturbation bounds, and relative con-dition numbers are shown in Table . Due to the singularity of the matrix R caused byparameter r, the accuracy of the perturbation bounds is highly affected by the singularity.When the value of m increases, the perturbation bound is still tight to the relative error.Also, it can be seen that the number of accurate digits of the perturbation bounds is re-

Page 20: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 20 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

Table 2 Relative errors, perturbation bounds and relative condition numbers

m Relative error ξ∗‖X‖Fcrel(X)

1 5.94× 10–15 1.64× 10–14 5.84× 101

2 5.28× 10–14 9.47× 10–14 5.56× 102

3 4.75× 10–13 7.85× 10–13 5.56× 103

4 4.58× 10–12 7.40× 10–12 5.56× 104

5 4.85× 10–11 7.26× 10–11 5.56× 105

6 4.56× 10–10 7.22× 10–10 5.57× 106

7 4.69× 10–9 8.61× 10–9 5.57× 107

duced proportionally to the increase of the quantities of the relative condition numbers.In other words, if the accurate digits of the perturbation bound are added to the digitsin the relative condition numbers, this number is almost equal to . (While using IEEEdouble-precision, the machine precision is around . × –.) This implies that the de-rived perturbation bound of () is fairly sharp.

7 ConclusionWhile doing numerical computation, it is important in practice to have an accuratemethod for estimating the relative error and the condition number of the given problems.In this paper, we focus on providing a tight perturbation bound of the stabilizing solu-tion to SARE (a)-(b) under small changes in the coefficient matrices. Also, some suffi-cient conditions are presented for the existence of the stabilizing solution to the perturbedSARE. The corresponding condition number of the stabilizing solution is provided in thiswork. We highlight and compare the practical performance of the derived perturbationbound and condition number through a numerical example. Numerical results show thatour perturbation bound is very sensitive to the condition number of the stabilizing solu-tion. As a consequence, they provide good measurement tools for the sensitivity analysisof SARE (a)-(b).

AppendixWe provide here a proof of the condition given by ().

Lemma A. Let P , Q, M, N , H, T , L–c be the operators defined by (a)-(b), (a)-

(d) and (), and let δ , δp be defined by (). Then the following equality holds:

limδ→+

supδ

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δ

= maxδp>

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δp. ()

Proof For any δ > , < δp ≤ δ, we see that

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δ

=δp

δ

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δp

≤ maxδp>

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δp,

Page 21: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 21 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

where δp = ‖( �AκA

, �BκB

, �CκC

, �DκD

, �SκS

, �RκR

, �HκH

)‖F . It follows that

limδ→+

supδ

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δ

≤ maxδp>

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δp. ()

On the other hand, for any fixed δ > , choose any perturbation matrices

(�A,�B,�C,�D,�S,�R,�H) ∈ δ

and therefore∥∥∥∥(�A

κA,�B

κB,�C

κC,�D

κD,�S

κS,�R

κR,�H

κH

)∥∥∥∥F

= δp ≤ δ.

It is true that ( δδp

�A, δδp

�B, δδp

�C, δδp

�D, δδp

�S, δδp

�R, δδp

�H) ∈ δ and thisgives the fact that

limδ→+

supδ

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δ

≥‖P δ

δp�A + Q δ

δp�B + M δ

δp�C + N δ

δp�D + H δ

δp�S + T δ

δp�R + L–

δp�H‖F

δ

=‖P�A + Q�B + M�C + N�D + H�S + T �R + L–

c �H)‖F

δp.

Hence

limδ→+

supδ

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δ

≥ maxδp>

‖P�A + Q�B + M�C + N�D + H�S + T �R + L–c �H‖F

δp. ()

Comparison of () and () gives (). �

Competing interestsThe authors declare that there is no conflict of interests regarding the publication of this article.

Authors’ contributionsAll authors contributed equally and significantly in writing this paper. All authors read and approved the final manuscript.

Author details1Center for General Education, National Formosa University, Huwei 632, Taiwan. 2Department of Mathematics, NationalTaiwan Normal University, Taipei 116, Taiwan. 3Department of Mathematics, National Chung Cheng University, Chia-Yi621, Taiwan.

AcknowledgementsThe authors wish to thank the editor and two anonymous referees for many interesting and valuable suggestions on themanuscript. This research work is partially supported by the National Science Council and the National Center forTheoretical Sciences in Taiwan. The first author was supported by the National Science Council of Taiwan under GrantNSC 102-2115-M-150-002. The second author was supported by the National Science Council of Taiwan under Grant NSC102-2115-M-003-009. The third author was supported by the National Science Council of Taiwan under Grant NSC101-2115-M-194-007-MY3.

Received: 16 September 2013 Accepted: 14 November 2013 Published: 11 Dec 2013

Page 22: rportal.lib.ntnu.edu.twrportal.lib.ntnu.edu.tw/bitstream/20.500.12235/... · Chiangetal.JournalofInequalitiesandApplications2013, 2013:580  RESEARCH OpenAccess

Chiang et al. Journal of Inequalities and Applications 2013, 2013:580 Page 22 of 22http://www.journalofinequalitiesandapplications.com/content/2013/1/580

References1. Rami, MA, Zhou, XY: Linear matrix inequalities, Riccati equations, and indefinite stochastic linear quadratic controls.

IEEE Trans. Autom. Control 45(6), 1131-1143 (2000). doi:10.1109/9.8635972. El Bouhtouri, A, Hinrichsen, D, Pritchard, AJ: On the disturbance attenuation problem for a wide class of time invariant

linear stochastic systems. Stoch. Stoch. Rep. 65(3-4), 255-297 (1999)3. Damm, T, Hinrichsen, D: Newton’s method for a rational matrix equation occurring in stochastic control. In:

Proceedings of the Eighth Conference of the International Linear Algebra Society (Barcelona, 1999), vol. 332/334,pp. 81-109 (2001)

4. Hinrichsen, D, Pritchard, AJ: Stochastic H∞ . SIAM J. Control Optim. 36, 1504-1538 (1998)5. Lancaster, P, Rodman, L: Algebraic Riccati Equations. Oxford Science Publications. Clarendon, New York (1995)6. Mehrmann, VL: The Autonomous Linear Quadratic Control Problem: Theory and Numerical Solution. Lecture Notes in

Control and Information Sciences, vol. 163. Springer, Berlin (1991). doi:10.1007/BFb00394437. Zhou, K, Doyle, JC, Glover, K: Robust and Optimal Control. Prentice Hall, Upper Saddle River (1996)8. Benner, P, Laub, AJ, Mehrmann, V: A collection of benchmark examples for the numerical solution of algebraic Riccati

equations I: continuous-time case. Technical Report SPC 95_22, Fakultät für Mathematik, TU Chemnitz-Zwickau,09107 Chemnitz, FRG. http://www.tu-chemnitz.de/sfb393/spc95pr.html (1995)

9. Benner, P, Laub, AJ, Mehrmann, V: A collection of benchmark examples for the numerical solution of algebraic Riccatiequations II: discrete-time case. Technical Report SPC 95_23, Fakultät für Mathematik, TU Chemnitz-Zwickau, 09107Chemnitz, FRG. http://www.tu-chemnitz.de/sfb393/spc95pr.html (1995)

10. Sima, V: Algorithms for Linear-Quadratic Optimization. Monographs and Textbooks in Pure and Applied Mathematics,vol. 200. Dekker, New York (1996)

11. Laub, AJ: A Schur method for solving algebraic Riccati equations. IEEE Trans. Autom. Control 24(6), 913-921 (1979)12. Ammar, G, Benner, P, Mehrmann, V: A multishift algorithm for the numerical solution of algebraic Riccati equations.

Electron. Trans. Numer. Anal. 1, 33-48 (1993)13. Ammar, G, Mehrmann, V: On Hamiltonian and symplectic Hessenberg forms. Linear Algebra Appl. 149, 55-72 (1991)14. Benner, P, Mehrmann, V, Xu, H: A new method for computing the stable invariant subspace of a real Hamiltonian

matrix. J. Comput. Appl. Math. 86, 17-43 (1997)15. Benner, P, Mehrmann, V, Xu, H: A numerically stable, structure preserving method for computing the eigenvalues of

real Hamiltonian or symplectic pencils. Numer. Math. 78(3), 329-358 (1998)16. Bunse-Gerstner, A, Byers, R, Mehrmann, V: A chart of numerical methods for structured eigenvalue problems. SIAM J.

Matrix Anal. Appl. 13, 419-453 (1992)17. Bunse-Gerstner, A, Mehrmann, V: A symplectic QR like algorithm for the solution of the real algebraic Riccati

equation. IEEE Trans. Autom. Control 31(12), 1104-1113 (1986)18. Mehrmann, V: A step toward a unified treatment of continuous and discrete time control problems. Linear Algebra

Appl. 241-243, 749-779 (1996)19. Byers, R: Solving the algebraic Riccati equation with the matrix sign function. Linear Algebra Appl. 85, 267-279 (1987)20. Benner, P: Contributions to the numerical solutions of algebraic Riccati equations and related eigenvalue problems.

PhD thesis, Fakultät für Mathematik, TU Chemnitz-Zwickau, Chemnitz, Germany (1997)21. Chu, EK-W, Fan, H-Y, Lin, W-W: A structure-preserving doubling algorithm for continuous-time algebraic Riccati

equations. Linear Algebra Appl. 396, 55-80 (2005)22. Chu, EK-W, Fan, H-Y, Lin, W-W, Wang, C-S: Structure-preserving algorithms for periodic discrete-time algebraic Riccati

equations. Int. J. Control 77(8), 767-788 (2004)23. Chiang, C-Y, Fan, H-Y: Residual bounds of the stochastic algebraic Riccati equation. Appl. Numer. Math. 63, 78-87

(2013)24. Konstantinov, M, Gu, D-W, Mehrmann, V, Petkov, P: Perturbation Theory for Matrix Equations. Studies in

Computational Mathematics, vol. 9. North-Holland, Amsterdam (2003)25. Sun, J-G: Perturbation theory for algebraic Riccati equations. SIAM J. Matrix Anal. Appl. 19(1), 39-65 (1998).

doi:10.1137/S089547989529130326. Sun, J-g: Sensitivity analysis of the discrete-time algebraic Riccati equation. In: Proceedings of the Sixth Conference of

the International Linear Algebra Society (Chemnitz, 1996), vol. 275/276, pp. 595-615 (1998).doi:10.1016/S0024-3795(97)10017-9

27. Sun, J-g: Residual bounds of approximate solutions of the algebraic Riccati equation. Numer. Math. 76(2), 249-263(1997). doi:10.1007/s002110050262

28. Sun, J-g: Residual bounds of approximate solutions of the discrete-time algebraic Riccati equation. Numer. Math.78(3), 463-478 (1998). doi:10.1007/s002110050321

29. Riedel, KS: A Sherman-Morrison-Woodbury identity for rank augmenting matrices with application to centering.SIAM J. Matrix Anal. Appl. 13(2), 659-662 (1992). doi:10.1137/0613040

30. Stewart, GW, Sun, JG: Matrix Perturbation Theory. Computer Science and Scientific Computing. Academic Press,Boston (1990)

31. Ortega, JM, Rheinboldt, WC: Iterative Solution of Nonlinear Equations in Several Variables. Classics in AppliedMathematics, vol. 30. SIAM, Philadelphia (2000). Reprint of the 1970 original

32. Rice, JR: A theory of condition. SIAM J. Numer. Anal. 3, 287-310 (1966)33. Sun, J-g: Condition numbers of algebraic Riccati equations in the Frobenius norm. Linear Algebra Appl. 350, 237-261

(2002). doi:10.1016/S0024-3795(02)00294-X34. Xu, S: Matrix Computation in Control Theory. Higher Education Press, Beijing (2010) (In Chinese)

10.1186/1029-242X-2013-580Cite this article as: Chiang et al.: Perturbation analysis of the stochastic algebraic Riccati equation. Journal ofInequalities and Applications 2013, 2013:580