268x Filetype PDF File size 0.62 MB Source: etna.math.kent.edu
Electronic Transactions on Numerical Analysis. ETNA
Volume54,pp.68–88,2021. KentStateUniversity and
Copyright © 2021, Kent State University. JohannRadonInstitute(RICAM)
ISSN1068–9613.
DOI:10.1553/etna_vol54s68
ONTHESOLUTIONOFTHENONSYMMETRICT-RICCATIEQUATION∗
† †
PETER BENNER AND DAVIDE PALITTA
Abstract. The nonsymmetric T-Riccati equation is a quadratic matrix equation where the linear part corresponds
to the so-called T-Sylvester or T-Lyapunov operator that has previously been studied in the literature. It has applications
in macroeconomics and policy dynamics. So far, it presents an unexplored problem in numerical analysis, and both
theoretical results and computational methods are lacking in the literature. In this paper we provide some sufficient
conditions for the existence and uniqueness of a nonnegative minimal solution, namely the solution with component-
wise minimal entries. Moreover, the efficient computation of such a solution is analyzed. Both the small-scale
and large-scale settings are addressed, and Newton-Kleinman-like methods are derived. The convergence of these
procedures to the minimal solution is proven, and several numerical results illustrate the computational efficiency of
the proposed methods.
Keywords. T-Riccati equation, M-matrices, minimal nonnegative solution, Newton-Kleinman method
AMSsubjectclassifications. 65F30, 15A24, 49M15, 39B42, 40C05
1. Introduction. In this paper, we consider the nonsymmetric T-Riccati operator
n×n n×n T T
R :R →R , R (X):=DX+X A−X BX+C,
T T
where A,B,C,D ∈ Rn×n,andweprovidesufficientconditions for the existence and unique-
ness of a minimal solution X ∈Rn×nto
min
(1.1) RT(X)=0.
ThesolutionofthenonsymmetricT-Riccatiequation(1.1)playsaroleinsolvingdynamics
generalized equilibrium (DSGE) problems [9, 22, 25]. “DSGE modeling is a method in
macroeconomics that attempts to explain economic phenomena, such as economic growth and
1
business cycles, and the effects of economic policy” . Equations of the form (1.1) appear in
certain procedures for solving DSGE models using perturbation-based methods [9, 25].
Taking inspiration from the (inexact) Newton-Kleinman method for standard algebraic
Riccati equations, we illustrate efficient numerical procedures for solving (1.1). Both the
small-scale and large-scale settings are addressed. In particular, in the latter framework, we
assume the matrices A and D to be such that the matrix-vector products Av and Dw require
O(n)floating point operations (flops) for any v,w ∈ Rn. This is the case, for instance, when
AandDaresparse. Moreover, we suppose B and C to be of low rank. These hypotheses
mimictheusualassumptions adopted when dealing with large-scale standard algebraic Riccati
equations; see, e.g., [2, 5, 7, 10, 13, 17, 18, 19, 23, 24, 21] and the recent survey paper [3].
The following is a synopsis of the paper. In Section 2 we present the result about the
existence and uniqueness of a minimal solution X to (1.1), namely the nonnegative solution
min
with component-wise minimal entries. A Newton-Kleinman method for the computation of
suchaX is derived in Section 3, and its convergence features are proven in Section 3.1. The
min
large-scale setting is addressed in Section 3.2, where the convergence of an inexact Newton-
Kleinman method equipped with a specific line search is illustrated. Some implementation
details of the latter procedure are discussed in Section 3.3. Several numerical results showing
∗Received July 3, 2020. Accepted September 20, 2020. Published online on November 20, 2020. Recom-
mended by Dario Bini. This work was supported by the Australian Research Council (ARC) Discovery Grant
No. DP1801038707.
†Department Computational Methods in Systems and Control Theory (CSC), Max Planck Institute for Dynamics
of Complex Technical Systems, Magdeburg, Germany ({benner,palitta}@mpi-magdeburg.mpg.de).
1https://en.wikipedia.org/wiki/Dynamic_stochastic_general_equilibrium
68
ETNA
KentStateUniversity and
JohannRadonInstitute(RICAM)
ONTHESOLUTIONOFTHENONSYMMETRICT-RICCATIEQUATION 69
the effectiveness of the proposed approaches are reported in Section 4, while our conclusions
are given in Section 5.
Throughoutthepaperweadoptthefollowingnotation: Thematrixinnerproductisdefined
as hX,Yi := trace(YTX) so that the induced norm is kXk2 = hX,Xi . I denotes the
F F n
identity matrix of order n, and the subscript is omitted whenever the dimension of I is clear
from the context. The brackets [·] are used to concatenate matrices of conforming dimensions.
In particular, a MATLAB-like notation is adopted, where [M,N] denotes the matrix obtained
by augmenting M with N. A ≥ 0 (A > 0) indicates a nonnegative (positive) matrix, that
is, a matrix whose entries are all nonnegative (positive). Clearly, A ≤ 0 (A < 0) if −A ≥ 0
(−A>0),andA≥BifA−B≥0.Moreover,werecallthatamatrixAisaZ-matrixifall
its off-diagonal entries are nonpositive. It is easy to show that a Z-matrix can be written in the
form A = sI −N, where s ∈ R and N ≥ 0. If s ≥ ρ(N), where ρ(·) denotes the spectral
radius, then A is called M-matrix.
Furthermore, we will always suppose that the following assumption holds.
ASSUMPTION 1.1. Weassumethat
• B is nonnegative (B ≥ 0) and C is nonpositive (C ≤ 0).
• I⊗D+(AT⊗I)ΠisanonsingularM-matrixwhere⊗denotestheKroneckerproduct
2 2 P P
n ×n n n
while Π ∈ R is a permutation matrix given by Π := i=1 j=1Ei,j ⊗Ej,i.
Thematrix E ∈Rn×ninAssumption1.1isthematrixwhose(i,j)-thentry is 1 while
i,j
all the others are zero.
Notice that I ⊗ D + (AT ⊗ I)Π being a nonsingular M-matrix implies that the T-
n×n n×n T
Sylvester operator S : R →R ,S (X):=DX+X A,hasanonnegativeinverse,
T T
i.e., S−1(X) ≥ 0 for X ≥ 0. For the standard Sylvester operator S : Rn×n → Rn×n,
T
S(X) := DX +XA,thisis guaranteed by assuming A, D to be nonsingular M-matrices;
see, e.g., [10, Theorem A.20]. Another important consequence of Assumption 1.1 is the
monotonicity of ST, i.e., ST(X) ≥ 0 implies X ≥ 0; see, e.g., [8].
It is not easy to analyze the impact that Assumption 1.1 has on the coefficient matrices A
andD. Nevertheless, a careful inspection of the ordering of the entries of I ⊗D+(AT ⊗I)Π
shows that if the latter is a singular M-matrix, then A ≤ 0. Indeed, every entry of A appears,
T
at least once, as an off-diagonal entry in I ⊗ D + (A ⊗I)Π, and since the off-diagonal
components of an M-matrix are nonpositive, it must hold that A ≤ 0.
2. Existence and uniqueness of a minimal solution. In this section we provide suffi-
cient conditions for the existence and uniqueness of a minimal solution X of (1.1). Our
min
result relies on the following fixed-point iteration:
X =0,
(2.1) 0
DX +XT A=XTBX −C, k≥0.
k+1 k+1 k k
THEOREM2.1. Theiterates computed by the fixed-point iteration (2.1) are such that
X ≥X , k≥0,
k+1 k
and, if there exists a nonnegative matrix Y such that R (Y ) ≥ 0, then X ≤ Y for any
T k
k ≥ 0. Moreover, under this assumption, the sequence {X } converges, and its limit is the
minimal nonnegative solution X to (1.1). k k≥0
min
ETNA
KentStateUniversity and
JohannRadonInstitute(RICAM)
70 P. BENNERANDD.PALITTA
Proof. We first show that X ≥X foranyk ≥0byinductiononk. Fork = 0,we
k+1 k
have X = S−1(−C) ≥ 0 = X asC ≤ 0. WenowassumethatX¯ ≥ X¯ for a certain
1 T 0 k k−1
¯
k > 0, and we show that X¯ ≥X¯.Wehave
k+1 k
X¯ =S−1(XTBX¯−C)
k+1 T ¯ k
k
=S−1(XTBX¯)+S−1(−C)=S−1(XTBX¯)+X +X¯−X¯
T ¯ k T T ¯ k 1 k k
k k
=S−1(XTBX¯)+X +X¯−S−1(XT BX¯ −C)
T ¯ k 1 k T ¯ k−1
k k−1
=S−1(XTBX¯−XT BX¯ )+X¯.
T ¯ k ¯ k−1 k
k k−1
Clearly, XT ≥ XT , as X¯ ≥ X¯ bytheinduction hypothesis. Therefore, recalling that
¯ ¯ k k−1
k k−1
B≥0,wehave
XTBX¯−XT BX¯ ≥0,
¯ k ¯ k−1
k k−1
so that X¯ ≥X¯.
k+1 k
Wenowsupposethatthereexistsanonnegative Y such that RT(Y) ≥ 0, and we show
that X ≤ Y for any k ≥ 0 by induction on k once again. The result is straightforward for
k
¯
k = 0 as X = 0. We now assume that X¯ ≤ Y for a certain k > 0, and we show that
0 k
X¯ ≤Y.SinceX¯ ≤Y andB ≥0,XTBX¯ ≤YTBY sothat−XTBX¯ ≥−YTBY.
k+1 k ¯ k ¯ k
Thus, we can write k k
0 ≤ DY +YTA−YTBY +C ≤DY +YTA−XTBX¯+C,
¯ k
k
and since by definition −XTBX¯ +C = −DX¯ −XT A,weget
¯ k k+1 ¯
k k+1
0 ≤ DY +YTA−DX¯ −XT A.
k+1 ¯
k+1
This means that ST(Y −X¯ ) ≥ 0, which implies Y ≥ X¯ thanks to the monotonicity
k+1 k+1
of S .
T
In conclusion, {X } is a nondecreasing, nonnegative sequence bounded from above,
k k≥0
therefore it has a finite limit lim X = X ≥ 0. Taking the limit on both sides
k→+∞ k min
of (2.1) shows that X is a solution of the equation R (X) = 0. Moreover, X is the
min T min
minimal nonnegative solution, as we have proven that X ≤Y foranynonnegative Y such
min
that R (Y) ≥ 0.
T
Asimilar result has been shown in [15, Theorem 2.3] for the (standard) nonsymmetric
Riccati equation.
3. The(inexact)Newton-Kleinmanmethod. Duetoitspossibleslowconvergencerate,
the fixed-point iteration (2.1) may not be attractive for the actual computation of the minimal
solution X , and a Newton-Kleinman-like method can be more effective for this task.
min
Thek-th iteration of the Newton method is defined as
′
R [X](X −X )=−R (X ),
T k+1 k T k
′
where R [X] denotes the Fréchet derivative of R at X. For the nonsymmetric T-Riccati
T T
operator, we have
′ T T T T T
R [X](Y)=DY +Y A−Y BX−X BY =(D−X B)Y +Y (A−BX),
T
and therefore the (k + 1)-st iterate of the Newton method is the solution of the T-Sylvester
equation
(3.1) (D−XTB)X +XT (A−BX )=−XTBX −C.
k k+1 k+1 k k k
ETNA
KentStateUniversity and
JohannRadonInstitute(RICAM)
ONTHESOLUTIONOFTHENONSYMMETRICT-RICCATIEQUATION 71
Depending on the problem size n, different state-of-the-art methods can be employed for the
solution of equations (3.1); see, e.g., [11, 12].
3
If n is moderate, say n ≤ O(10 ), then dense methods based on some decomposition
of the coefficient matrices can be employed to solve the T-Sylvester equations in (3.1). For
instance, in [11, Section 3] an algorithm based on the generalized Schur decomposition
of the pair (D,AT) is presented for efficiently solving a T-Sylvester equation of the form
DX+XTA=C.
If the problem dimension does not allow for dense matrix operations, then equations (3.1)
must be solved iteratively. The iterative solution of the T-Sylvester equations may introduce
someinexactness in the Newton scheme leading to the so-called inexact Newton-Kleinman
method and affecting the convergence features of the latter. By using tools similar to the ones
presented in [5], in Section 3.2 we show how a specific line search guarantees the convergence
of the inexact Newton method.
However, we first need to guarantee that the sequence {X } generated by (3.1) is
k k≥0
well-defined and that it converges to X ; this is the topic of the next section.
min
3.1. A convergence result. In this section we prove the convergence properties of the
Newton-Kleinmanmethod(3.1). To this end, we first recall a couple of classic results about
M-matrices; see, e.g., [8, Chapter 6].
LEMMA3.1. LetAbeaZ-matrix. ThenAisanonsingularM-matrixifandonlyifthere
exists a nonnegative vector v such that Av > 0. Moreover, if A is a nonsingular M-matrix and
B≥AisaZ-matrix,thenB isalsoanonsingularM-matrix.
Toproveconvergence of the Newton method to the minimal nonnegative solution X
min
to (1.1), we also need the following lemma.
LEMMA3.2. Assumethatthereexists a matrix Y such that RT(Y) > 0. Then it follows
that I ⊗ (D −XT B)+((A−BXmin)T ⊗I)ΠisanonsingularM-matrix.
min
Proof. Since Assumption 1.1 holds, we have that I ⊗D +(AT ⊗I)Π = rI 2 −N, with
n
N≥0,r>ρ(N),andwecanwrite
I ⊗(D−XT B)+((A−BX )T ⊗I)Π
min min
=I⊗D+(AT⊗I)Π−(I⊗XT B+(BX )T⊗I)Π)
min min
=rI−(N+(I⊗XT B+(BX )T⊗I)Π),
| min {z min }
≥0
as B, X ≥ 0. Notice that X ≥ 0 exists since the hypotheses of Theorem 2.1 are
min min
fulfilled. Therefore, I ⊗ (D − XT B)+((A−BX )T ⊗I)ΠisaZ-matrix.
min min
Moreover,
T T
(D−X B)(Y −X )+(Y −X ) (A−BX )
min min min min
=DY −XT BY −DX +XT BX
min min min min
+YTA−YTBX −XT A+XT BX .
min min min min
Since R (X ) = 0, it follows that −DX −XT A+XT BX =C. Moreover,
T min min min min min
adding and subtracting Y TBY , we get
T T
(D−X B)(Y −X )+(Y −X ) (A−BX )
min min min min
=R (Y)+(Y −X )TB(Y −X ).
T min min
To conclude, we notice that Y − X ≥0, as X is the minimal solution to (1.1) and
min min
RT(Y)>0.Therefore,
(D−XT B)(Y −X )+(Y −X )T(A−BX )≥R (Y)>0.
min min min min T
no reviews yet
Please Login to review.