Riccati Equation Example 174258

Partial capture of text on file.
                    Iterative Solution of Algebraic Riccati Equations for Damped Systems
                                                          Kirsten Morris and Carmeliza Navasca
                  Abstract—Algebraic Riccati equations (ARE) of large dimen-        common iterative methods are Chandrasekhar [3], [9] and
               sion arise when using approximations to design controllers for       Newton-Kleinman iterations [14], [17], [28]. Here we use a
               systems modelled by partial differential equations. We use a         modiﬁcation to the Newton-Kleinman method ﬁrst proposed
               modiﬁed Newton method to solve the ARE. Since the modiﬁed            by by Banks and Ito [3] as a reﬁnement for a partial solution
               Newton method leads to a right-hand side of rank equal to            to the Chandraskehar equation.
               the number of inputs, regardless of the weights, the resulting
               Lyapunov equation can be more efﬁciently solved. A low-rank             Solution of the Lyapunov equation is a key step in im-
               Cholesky-ADI algorithm is used to solve the Lyapunov equation        plementing either modiﬁed or standard Newton-Kleinman.
               resulting at each step. The algorithm is straightforward to code.    The Lyapunov equations arising in the Newton-Kleinman
               Performance is illustrated with an example of a beam, with           method have several special features: (1) the model order
               different levels of damping. Results indicate that for weakly        n is generally much larger than the number of controls m
               damped problems a low rank solution to the ARE may not
               exist. Further analysis supports this point.                         or number of observations p and (2) the matrices are often
                                       INTRODUCTION                                 sparse. We use a recently developed method [20], [26] that
                                                                                    uses these features, leading to an efﬁcient algorithm. Use
                  Aclassical controller design objective is to ﬁnd a control        of this Lyapunov solver with a Newton-Kleinman method is
               u(t) so that the objective function                                  described in [27], [6], [7], [8].
                             ! ∞!x(t),Qx(t)"+u∗(t)Ru(t)dt                    (1)       In the next section we describe the algorithm to solve the
                                                                                    Lyapunov equations. We use this Lyapunov solver with both
                              0                                                     standard and modiﬁed Newton-Kleinman to solve a number
               is minimized where R is a symmetric positive deﬁnite matrix          of standard control examples, including one with several
               and Q is symmetric positive semi-deﬁnite. It is well-known           space variables. Our results indicate that modiﬁed Newton-
               that the solution to this problem is found by solving an             Kleinman achieves considerable savings in computation time
               algebraic Riccati equation                                           over standard Newton-Kleinman. We also found that using
                         A∗P +PA−PBR−1B∗P =−Q.                               (2)    the solution from a lower-order approximation as an ansatz
                                                                                    for a higher-order approximation signiﬁcantly reduced the
               for a feedback operator                                              computation time.
                                    K=−R−1B′P.                               (3)                  I. DESCRIPTION OF ALGORITHM
               If the system is modelled by partial differential or delay              One approach to solving large Riccati equations is the
               differential equations then the state x(t) lies in an inﬁnite-       Newton-Kleinman method [17]. The Riccati equation (2) can
               dimensional space. The theoretical solution to this problem          be rewritten as
               for many inﬁnite-dimensional systems parallels the theory for                        ∗                                 ∗
               ﬁnite-dimensional systems [11], [12], [18], [19]. In practice,           (A−BK) P +P(A−BK)=−Q−K RK.                                (4)
               the control is calculated through approximation. The matrices        We say a matrix A is Hurwitz if σ(A ) ⊂ C .
               A, B, and C arise in a ﬁnite dimensional approximation of                                 o                    o       −
                                                                                       Theorem 1.1: [17] Consider a stabilizable pair (A,B)
               the inﬁnite dimensional system. Let n indicate the order of          with a feedback K0 so that A − BK0 is Hurwitz. Deﬁne
               the approximation, m the number of control inputs and p              S =A−BK,andsolve the Lyapunov equation
                                                                                      i             i
               the number of observations. Thus, A ∈ Rn×n, B ∈ Rn×m                              S∗P +P S =−Q−K∗RK                                (5)
               and C ∈ Rp×n. There have been many papers written                                   i  i     i i             i     i
               describing conditions under which approximations lead to             for P and then update the feedback as K          =−R−1B∗P.
               approximating controls that converge to the control for the               i                                      i+1                 i
                                                                                    Then
               original inﬁnite-dimensional system [4], [12], [15], [18],                                     lim P = P
               [19]. In this paper we will assume that an approximation                                       i→∞ i
               has been chosen so that a solution to the Riccati equation (2)       with quadratic convergence.
               exists for sufﬁciently large n and also that the approximating          The key assumption in the above theorem is that an initial
               feedback operators converge. Unfortunately, particularly in          estimate, or ansatz, K , such that A − BK            is Hurwitz
                                                                                                             0                         0
               partial differential equation models with more than one space        is available. For an arbitrary large Riccati equation, this
               dimension, many inﬁnite-dimensional control problems lead            condition may be difﬁcult to satisfy. However, this condition
               to Riccati equations of large order. A survey of currently           is not restrictive for Riccati equations arising in control of
               used methods to solve large ARE can be found in [8]. Two             inﬁnite-dimensional systems. First, many of these systems
               are stable even when uncontrolled and so the initial iterate                                     TABLE I
               K may be chosen as zero. Second, if the approximation                             COMPLEXITY OF CF-ADI AND ADI [20]
                 0
               procedure is valid then convergence of the feedback gains is
               obtained with increasing model order. Thus, a gain obtained                                     CF-ADI      ADI
                                                                                                                                 2
                                                                                                    Sparse A   O(Jrn)      O(Jrn )
               from a lower order approximation, perhaps using a direct                                               2          3
               method, may be used as an ansatz for a higher order                                  Full A     O(Jrn )     O(Jrn )
               approximation. This technique was used successfully in [14],
               [24], [28].                                                                                     TABLE II
                 Amodiﬁcation to this method was proposed by Banks and                                  CHOLESKY-ADI METHOD
               Ito [3]. In that paper, they partially solve the Chandrasekhar            Given A and D
               equations and then use the resulting feedback K as a                             o
                                                                                         Choose ADI parameters {p ,...,p } with ℜ(p ) < 0
                                                                                                    √           1       J          i
                                                                                                              ∗       −1
               stabilizing initial guess. Instead of the standard Newton-                Deﬁne z =    −2p (A +p I) D
                                                                                                1         1   o    1
                                                                                         and Z =[z ]
               Kleinman iteration (5) above, Banks and Ito rewrote the                        1     1
                                                                                         For i = 2,...,J
               Riccati equation in the form                                                           √−2pi+1                     ∗         −1
                                                                                         Deﬁne W =( √          )[I − (p   −p)(A +p        I)  ]
                                                                                                 i       −2pi         i+1    i    o    i+1
                                ∗                                 ∗                      (1) z = W z
                   (A−BK) X +X(A−BK) = −D RD,                                                i     i i−1
                              i    i     i           i            i     i                (2) If $z$ > tol
                            Di = Ki −Ki−1, Ki+1          = Ki−B∗Xi.                        Z =[Z       z ]
                                                                                             i     i−1  i
                 Solution of a Lyapunov equation is a key step in each                    Else, stop.
               iteration of the Newton-Kleinman method, both standard and
               modiﬁed. Consider the Lyapunov equation
                                  XAo+A∗X=−DD∗                               (6)   symmetric A the optimal parameters can be easily calculated.
                                             o                                     A heuristic procedure to calculate the parameters in the
               where Ao ∈ Rn×n is Hurwitz and D ∈ Rn×r. Factor                     general case is in [26]. If the spectrum of A is complex,
               Q into C∗C and let nQ indicate the rank of C. In the                we estimated the complex ADI parameters as in [10]. As
               case of standard Newton-Kleinman, r = m + nQ while for              the spectrum of A ﬂattens to the real axis the ADI parameters
               modiﬁed Newton-Kleinman, r = m. If Ao is Hurwitz, then              are closer to optimal.
               the Lyapunov equation has a symmetric positive semideﬁnite             If A is sparse, then sparse arithmetic can be used in
               solution X. As for the Riccati equation, direct methods such                o
               as Bartels-Stewart [5] are only appropriate for low model           calculation of Xi. However, full calculation of the dense
               order and do not take advantage of sparsity in the matrices.        iterates Xi is required at each step. By setting X0 = 0,
                 In [3] Smith’s method was used to solve the Lyapunov              it can be easily shown that Xi is symmetric and positive
                                                                             −1    semideﬁnite for all i, and so we can write X = ZZ∗
               equations. For p < 0, deﬁne U = (A − pI)(A + pI)
                                                         o          o              where Z is a Cholesky factor of X [20], [26]. (A Cholesky
                                 ∗       −1     ∗           −1
               and V = −2p(Ao+pI) DD (Ao+pI) . The Lyapunov                        factor does not need to be square or be lower triangular.)
               equation (6) can be rewritten as                                    Substituting Z Z∗ for X , setting X          = 0, and deﬁning
                                                                                                   i  i        i             0
                                              ∗                                              ∗        −1
                                      X=U XU+V.                                    M =(A +pI) weobtain the following iterates
                                                                                      i      o     i
               In Smith’s method [30], the solution X is found by using                    Z     = "−2p M D
               successive substitutions: X = limi→∞Xi where                                  1        " 1 1
                                                                                            Z = [ −2pMD, M(A∗−pI)Z ].
                                 Xi = U∗Xi−1U +V                             (7)             i                i  i       i   o    i    i−1
               with X0 = 0. Convergence of the iterations can be improved          Note that Z ∈ Rn×r, Z ∈ Rn×2r, and Z ∈ Rn×ir.
               by careful choice of the parameter p e.g. [29, pg. 297].                         1             2                   i
               This method of successive substitition is unconditionally              The algorithm is stopped when the Cholesky iterations
               convergent, but has only linear convergence.                        converge within some tolerance. In [20] these iterates are
                 The ADI method [21], [32] improves Smith’s method by              reformulated in a more efﬁcient form, using the observation
               using a different parameter p at each step. Two alternating         that the order in which the ADI parameters are used is
                                              i                                    irrelevant. This leads to the algorithm shown in Table II.
               linear systems,                                                        This solution method results in considerable savings in
                    (A∗ +piI)Xi−1        = −DD∗−Xi−1(Ao−piI)                       computation time and memory. In calculation of the feedback
                       o             2
                         ∗           ∗              ∗      ∗                       K, the full solution X never needs to be constructed. Also,
                      (A +piI)X          = −DD −X 1(Ao−piI)
                         o           i                     i−2                     the complexity of this method (CF-ADI) is also considerably
               are solved recursively starting with X0 = 0 ∈ Rn×n and              less than that of standard ADI as shown in Table I. Recall
               parameters pi < 0. If all parameters pi = p then the above          that for standard Newton-Kleinman, r is the sum of the rank
               equations reduce to Smith’s method. If the ADI parameters           of the state weight nQ and controls m. For modiﬁed Newton-
               pi are chosen appropriately, then convergence is obtained           Kleinman r is equal to m. The reduction in complexity
               in J iterations where J ≪ n. The parameter selection                of CF-ADI method over ADI is marked for the modiﬁed
               problem has been studied extensively [10], [21], [31]. For          Newton-Kleinman method.
                                                 10      2                                           2             2
                                   E    2.68 ×10    N/m                           where M = EI d2φ+C I d2ψ ∈H2(0,1).
                                   I     1.64 ×10−9 m4                                             bdr        d bdr
                                   b                                                We use R = 1 and deﬁne C by the tip position:
                                   ρ       1.02087 kg/m
                                                     2
                                  Cv         2 Ns/m
                                  Cd     2.5 ×108 Ns/m2                                           w(1,t) = C[w(x,t) w˙(x,t)].
                                   L           1 m
                                  I       121.9748 kg m2                          Then Q = C∗C. We also solved the control problem with
                                   h
                                   d        .041 kg−1                             Q=I.
                                          TABLE III                                 Let H2N ⊂ H be a sequence of ﬁnite-dimensional
                               TABLE OF PHYSICAL PARAMETERS.                      subspaces spanned by the standard cubic B-splines with a
                                                                                  uniform partition of [0,1] into N subintervals. This yields
                                                                                  an approximation in H2N × H2N [16, e.g.] of dimension
                      II. CONTROL OF EULER-BERNOULLI BEAM                         n = 4N. This approximation method yields a sequence
                                                                                  of solutions to the algebraic Riccati equation that converge
                 In this section, we consider a Euler-Bernoulli beam              strongly to the solution to the inﬁnite-dimensonal Riccati
               clamped at one end (r = 0) and free to vibrate at the other        equation corresponding to the original partial differential
               end (r = 1). Let w(r,t) denote the deﬂection of the beam           equation description [4], [22].
               from its rigid body motion at time t and position r. The
               deﬂection is controlled by applying a torque u(t) at the                III. LOW RANK APPROXIMATIONS TO LYAPUNOV
               clamped end (r = 0). We assume that the hub inertia I                                        FUNCTION
                                                                    ¨        h
               is much larger than the beam inertia I so that I θ ≈ u(t).
                                                        b          h                Tables IV,V show the effect of varying C on the number
               Thepartial differential equation model with Kelvin-Voigt and                                                     d
                                                                                  of iterations required for convergence. Larger values of C
               viscous damping is                                                                                                               d
                                                                                  (i.e. smaller values of α) leads to a decreasing number of
                                                                                  iterations. Small values of C     lead to a large number of
                                       ρw (r,t)+C w (r,t)+...                                                     d
                                           tt         v t                         required iterations in each solution of a Lyapunov equation.
                   ∂2                                        ρr
                       [C I w     (x,t) +EI w (r,t)] =          u(t),               Recall that the CF-ADI algorithm used here starts with a
                   ∂r2    d b rrt             b  rr          I
                                                              h                   rank 1 initial estimate of the Cholesky factor and the rank of
               with boundary conditions                                           the solution is increased at each step. The efﬁciency of the
                                                       w(0,t)    = 0              Cholesky-ADI method relies on the existence of a low-rank
                                                      w (1,t)    = 0.             approximation of the solution X to the Lyapunov equation.
                                                        r                         This is true of many other iterative algorithms to solve large
                              EI w (1,t)+C I w (1,t) = 0
                                 b  rr           d b rrt                          Lyapunov equations.
                      ∂ [EI w (r,t)+C I w           (r,t)]       = 0.               Theorem 3.1: For any symmetric, positive semi-deﬁnite
                      ∂r     b  rr          d b rrt       r=1                     matrix X of rank n let µ ≥ µ ... ≥ µ        be its eigenvalues.
                                                                                                             1     2        n
               The values of the physical parameters in Table II are as in        For any rank k < n matrix Xk,
               [2].                                                                                   *X−X *           µ
                 Let                                                                                           k 2 ≥ k+1.
                             #                                      $                                    *X*            µ
                                                        dw                                                    2           1
                       H= w∈H2(0,1):w(0)=                   (0) = 0               Proof: See, for example, [13, Thm. 2.5.3].                   !
                                                         dr                         Thus, the relative size of the largest and smallest eigen-
               be the closed linear subspace of the Sobolev space H2(0,1)         values determines the existence of a low rank approximation
                                                            2                     Xk that is close to X, regardless of how this approximation
               and deﬁne the state-space to be X = H×L (0,1) with state           is obtained.
               z(t) = (w(·,t), ∂ w(·,t)). A state-space formulation of the
                                ∂t                                                  The ratio µ      /µ is plotted for several values of C in
               above partial differential equation problem is                                    k+1    1                                    d
                                                                                  Figure 1. For larger values of C the solution X is closer to
                                                                                                                    d
                                 d                                                a low rank matrix than it is for smaller values of C .
                                   x(t) = Ax(t)+Bu(t),                                                                                   d
                                 dt                                                 A bound on the rank of the solution to a Lyapunov
               where                                                              equation where A is symmetric is given in [25]. A tighter
                               0              I                                 bound on the error in a low-rank approximation has been
                        A=                                                      obtained [1] in terms of the eigenvalues and eigenvectors of
                                   EI   4        C I   4      C                   A. This bound is applicable to all diagonalizable matrices
                                 − bd          − d b d      − v
                                    ρ   dr4        ρ   dr4     ρ                  A. The bound for the case where the right-hand-side D has
               and                            0                                 rank one is as follows.
                                       B=                                         Theorem 3.2: [1, Thm 3.1] Let V be the matrix of right
                                                 r                                eigenvectors of A, and denote its condition number by κ(X).
                                                Ih                                Denote the eigenvalues of A by λ1,...λn. There is a rank
               with domain                                                        k < n approximation to X satisfying
                                                                d                                                  2                   2
                dom(A)={(φ,ψ)∈X :ψ∈H, M(L)=                      M(L)=0}                   *X−X * ≤(n−k) δ                (κ(V)*D* )          (8)
                                                               dr                                   k 2              k+1             2
                                                                                                                                                           TABLE IV
                    where                                                                                            BEAM: : EFFECT OF CHANGING C (STANDARD NEWTON-KLEINMAN)
                                                                          *                 *                                                                   d
                                                                      k                      2
                                                       −1            )*λk+1−λj*
                                   δk+1 =                                 *   ∗             * .                        Cv        Cd           α         N.-K. It’s      Lyapunov Itn’s         CPU time
                                               2Real(λk+1) j=1*λk+1 +λj*                                                 2      1E04       1.5699            –                   –                   –
                                                                                                                                3E05       1.5661            –                   –                   –
                    The eigenvalues λi are ordered so that δi are decreasing.                                                   4E05       1.5654            3          1620;1620;1620            63.14
                        In order to calculate this bound, all the eigenvalues and                                               1E07       1.5370            3          1316;1316;1316            42.91
                    eigenvectors of A are needed.                                                                               1E08       1.4852            3            744;744;744             18.01
                        If the condition number κ(V ) is not too large, for instance                                            5E08       1.3102            3            301;301;301              5.32
                    if A is normal, then δk+1/δ1 gives a relative error estimate of
                    the error µ            /µ in the approximation X . This estimate                                                                       TABLE V
                                     k+1       1                                      k                               BEAM: EFFECT OF CHANGING C (MODIFIED NEWTON-KLEINMAN)
                    is observed in [1] to estimate the actual decay rate of the                                                                                d
                    eigenvalues of X quite well, even in cases where the decay                                           C         C            α        N.-K. It’s      Lyapunov It’s        CPU time
                    is very slow.                                                                                          v         d
                                                                                                                          2      1E04        1.5699           2                  –                 –
                        Consider a parabolic partial differential equation, with A-                                              3E05        1.5661           2                  –                 –
                    matrix of the approximation is symmetric and negative deﬁ-                                                   4E05        1.5654           2              1620;1              24.83
                    nite. Then all the eigenvalues are real. A routine calculation                                               1E07        1.5370           2              1316;1              16.79
                    yields that                                                                                                  1E08        1.4852           2               744;1              7.49
                                                                                                                                 5E08        1.3102           2               301;1              2.32
                                                         δk+1 ≈ λ1
                                                           δ1         λk
                    and so the rate of growth of the eigenvalues determines the                                     [23]. A simple calculation yields that for all k,
                    accuracy of the low rank approximation. The accuracy of
                    the low-rank approximant can be quite different if A is non-                                                                    k−1
                    symmetric. The solution X to the Lyapunov equation could                                                                δk = )                  1 2          ≈1.
                                                                                                                                                                      4c
                                                                                                                                            δ1             1+            v   2
                    be, for instance, the identity matrix [1], [25] in which case                                                                    j=1          (wk−wj)
                    the eigenvalues of X do not decay.                                                              This indicates that the eigenvalues of the Lyapunov solution
                        It is observed through several numerical examples in [1]                                    do not decay.
                    that it is not just complex eigenvalues that cause the decay
                    rate to slow down, but the dominance of the imaginary parts                                                                    IV. CONCLUSIONS
                    of the eigenvalues as compared to the real parts, together with                                     The results indicate a problem with solving the algebraic
                    absence of clustering in the spectrum of A. This effect was                                     Riccati equation for systems with light damping where the
                    observed in the beam equation. Figure 2 shows the change                                        eigenvalues are not decaying rapidly. Although better choice
                    in the spectrum of A as C is varied. Essentially, increasing
                                                            d                                                       of the ADI parameters might help convergence, the fact
                    C increases the angle that the spectrum makes with the
                       d                                                                                            that the low rank approximations to the solution converge
                    imaginary axis. Note that the eigenvalues do not cluster. As                                    very slowly when the damping is small limits the achievable
                    the damping is decreased, the dominance of the imaginary                                        improvement. This may have consequences for control of
                    parts of the eigenvalues of A increases and the decay rate of                                   coupled acoustic-structure problems where the spectra are
                    the eigenvalues of X slows. The Cholesky-ADI calculations                                       closer to those of hyperbolic systems than those of parabolic
                    for the beam with C = 0 did not terminate on a solution
                                                    d                                                               systems.
                    Xk with k < n.
                        These observations are supported by results in the theory
                    of control of partial differential equations. If C                            > 0 the
                                                                                               d                                                  -./0123450+637.8+98:+;<=)+>.7?+23:@.1/+A
                                                                                                                                                                                   B
                                                                                                                                !                                                                  +
                    semigroup for the original partial differential equation is                                               #!
                    parabolic and the solution to the Riccati equation converges                                                !#!
                    uniformly in operator norm [18, chap.4]. However, if C = 0,                                               #!
                                                                                                     d
                                                                                                                                !$!
                    the partial differential equation is hyperbolic and only strong                                           #!
                    convergence of the solution is obtained [19]. Thus, one might                                               !%!
                                                                                                                              #!
                    expect a greater number of iterations in the Lyapunov loop                                               #
                                                                                                                            !   !&!
                                                                                                                            +,+*#!
                    to be required as C               is decreased. Consider the bound (8)                                  !
                                                   d
                                                                                                                                !"!
                    for the beam with C = 0, and an eigenfunction basis for                                                   #!
                                                    d
                    the approximation so that the eigenvalues of A are the exact                                                !)!
                                                                         2                                                    #!
                    eigenvalues. Deﬁning c = C /ρ, c = EI /ρandindicating                                                                      '
                                                       v        v                   b                                                      "C#!
                                                                                                                                !(!            '
                    the roots of 1 + cos(a)cosh(a) by a , the eigenvalues are                                                 #!           #C#!
                                                                            k
                                                                                                                                               (
                                                                                                                                           #C#!
                                                                                                                                !'!+
                                                   λ =−c /2±iω                                                                #!
                                                     k           v            k                                                   !       "       #!      #"      $!      $"      %!      %"      &!
                                                                                                                                                                  *
                    where                                   +
                                                                   2       2                                            Fig. 1.    Eigenvalue ratio for solution to beam ARE for varying C
                                                   ωk =         ca −c /4                                                                                                                                d
                                                                   k       v
The words contained in this file might help you see if this file matches what you are looking for:

...Iterative solution of algebraic riccati equations for damped systems kirsten morris and carmeliza navasca abstract are large dimen common methods chandrasekhar sion arise when using approximations to design controllers newton kleinman iterations here we use a modelled by partial differential modication the method rst proposed modied solve since banks ito as renement leads right hand side rank equal chandraskehar equation number inputs regardless weights resulting lyapunov can be more efciently solved low is key step in im cholesky adi algorithm used plementing either or standard at each straightforward code arising performance illustrated with an example beam have several special features model order different levels damping results indicate that weakly n generally much larger than controls m problems may not exist further analysis supports this point observations p matrices often introduction sparse recently developed uses these leading efcient aclassical controller objective nd contr...
Related files

Share

Help

Related files

Share

Share to social media

Help

Login Area