查看“︁矩阵差分方程”︁的源代码

'''矩阵差分方程'''是一种[[差分方程]]，其中某时刻的变量向量（或矩阵）与之前时刻的值通过矩阵相关。<ref>{{cite book|last1=Cull|first1=Paul|author2-link=Mary Flahive|last2=Flahive|first2=Mary|last3=Robson|first3=Robbie|title=Difference Equations: From Rabbits to Chaos|title-link= Difference Equations: From Rabbits to Chaos |publisher=Springer|date=2005|at=ch. 7|isbn=0-387-23234-6}}</ref><ref>{{cite book|last=Chiang|first=Alpha C.|title=Fundamental Methods of Mathematical Economics|url=https://archive.org/details/fundamentalmetho0000chia_h4v2|url-access=registration|edition=3rd|publisher=McGraw-Hill|date=1984|pages=[https://archive.org/details/fundamentalmetho0000chia_h4v2/page/608 608–612]|isbn=9780070107809 }}</ref>方程的'''阶'''是变量向量任意两个指示值之间的最大时差。例如

:<math>\mathbf x_t = \mathbf{Ax}_{t-1} + \mathbf{Bx}_{t-2}</math>

是二阶矩阵差分方程，其中{{math|'''x'''}}是{{math|''n'' × 1}}变量向量，{{math|'''A'''}}、{{math|'''B'''}}是{{math|''n'' × ''n''}}矩阵。该方程齐次，因为方程末尾没有常数项向量。同一个方程也可写成

:<math>\mathbf x_{t+2} = \mathbf{Ax}_{t+1} + \mathbf{Bx}_{t}</math>

或

:<math>\mathbf x_n = \mathbf{Ax}_{n-1} + \mathbf{Bx}_{n-2}</math>

最常见的矩阵差分方程都是一阶的。

==非齐次一阶情形及稳态==

非齐次一阶矩阵差分方程如：
:<math>\mathbf x_t = \mathbf{Ax}_{t-1} + \mathbf{b} </math>

与一个加性常向量 {{math|'''b'''}}。该系统的稳态是{{math|'''x'''}}向量的值{{math|'''x'''*}}，一旦达到就不会偏离。{{math|'''x'''*}}可通过置{{math|'''x'''<sub>''t''</sub> {{=}} '''x'''<sub>''t''−1</sub> {{=}} '''x'''*}}、解{{math|'''x'''*}}以得

:<math> \mathbf{x}^* = [\mathbf{I}-\mathbf{A}]^{-1}\mathbf{b} </math>

其中{{math|'''I'''}}是{{math|''n'' × ''n''}}[[单位矩阵]]，假定{{math|['''I''' − '''A''']}}可逆。非齐次方程可用偏离稳态的齐次方程重写：

:<math> \left[\mathbf{x}_t - \mathbf{x}^*\right] = \mathbf{A}\left[\mathbf{x}_{t-1}-\mathbf{x}^*\right] </math>

==一阶情形的稳定性==

一阶矩阵差分方程{{math|['''x'''<sub>''t''</sub> − '''x'''*] {{=}} '''A'''['''x'''<sub>''t''−1</sub> − '''x'''*]}}是[[稳定性理论|稳定]]的，即当且仅当转移矩阵{{math|'''A'''}}的所有特征值（无论实复）绝对值都小于1时，{{math|'''x'''<sub>''t''</sub>}}才逐渐收敛到稳态{{math|'''x'''*}}。

==解一阶情形==

假定方程齐次形式为{{math|'''y'''<sub>t</sub> {{=}} '''Ay'''<sub>''t''−1</sub>}}，然后可从[[初始条件]]{{math|'''y'''<sub>0</sub>}}开始迭代。{{math|'''y'''<sub>0</sub>}}是{{math|'''y'''}}的初值，必须得知才能求解：

:<math>\begin{align}
\mathbf y_1 &= \mathbf{Ay}_0 \\
\mathbf y_2 &= \mathbf{Ay}_1=\mathbf A^2 \mathbf y_0 \\
\mathbf y_3 &= \mathbf{Ay}_2=\mathbf A^3 \mathbf y_0
\end{align}</math>

以此类推，由[[数学归纳法]]，用{{mvar|t}}表示的解为

:<math>\mathbf y_t=\mathbf A^t \mathbf y_0</math>

此外，若{{math|'''A'''}}可对角化，就可用它的特征值和特征向量重写{{math|'''A'''}}，得到解

:<math>\mathbf y_t = \mathbf{PD}^{t}\mathbf{P}^{-1} \mathbf y_0,</math>

其中{{math|'''P'''}}是{{math|''n'' × ''n''}}矩阵，列是{{math|'''A'''}}的特征向量（假设特征值互异）；{{math|'''D'''}}是{{math|''n'' × ''n''}}[[对角矩阵]]，对角元是{{math|'''A'''}}的特征值。这个解就是上述稳定性结果的依据：当且仅当{{mvar|A}}的特征值绝对值都小于1，{{math|'''A'''<sup>''t''</sup>}}才会随时间收缩到零矩阵。

==从一阶矩阵系统中提取单一标量变量的动力特性==

从{{math|''n''}}维系统{{math|'''y'''<sub>''t''</sub> {{=}} '''Ay'''<sub>''t''−1</sub>}}开始，可以提取其中一个状态变量（如{{math|''y''<sub>1</sub>}}）的动态变化。上述{{mvar|y<sub>t</sub>}}的求解方程表明，{{math|''y''<sub>1,''t''</sub>}}的解是根据{{math|'''A'''}}的{{mvar|n}}个特征值求得的。因此，描述{{math|''y''<sub>1</sub>}}变化的方程本身必须有涉及特征值的解。这种描述直观地产生了{{math|''y''<sub>1</sub>}}的演化方程，即

:<math> y_{1,t} = a_1 y_{1,t-1} + a_2 y_{1,t-2} + \dots + a_n y_{1,t-n}</math>

其中参数{{mvar|a<sub>i</sub>}}来自{{math|'''A'''}}的[[特征方程式]]：

:<math>\lambda^{n} - a_1 \lambda^{n-1} - a_2 \lambda^{n-2} - \dots - a_n \lambda^{0} = 0.</math>

因此，{{mvar|n}}维一阶线性系统中的每个标量变量都根据一元{{mvar|n}}阶差分方程演化，与矩阵差分防尘具有相同的稳定性。

==高阶情形的解与稳定性==

可用分块矩阵将高阶矩阵差分方程转换到一阶，可以求解时滞超过一个周期的高阶方程，并分析其稳定性。例如，假设有二阶方程

:<math>\mathbf x_t = \mathbf{Ax}_{t-1} + \mathbf{Bx}_{t-2}</math>

变量向量{{math|'''x'''}}尺寸为{{math|''n'' × 1}}，{{math|'''A'''}}、{{math|'''B'''}}尺寸为{{math|''n'' × ''n''}}。则可以叠加为下列形式

:<math>\begin{bmatrix}\mathbf{x}_t \\ \mathbf{x}_{t-1} \\ \end{bmatrix} = \begin{bmatrix} \mathbf{A} & \mathbf{B} \\ \mathbf{I} & \mathbf{0} \\ \end{bmatrix} \begin{bmatrix} \mathbf{x}_{t-1} \\ \mathbf{x}_{t-2} \end{bmatrix} </math>

其中{{math|'''I'''}}是{{math|''n'' × ''n''}}[[单位矩阵]]，{{math|'''0'''}}是{{math|''n'' × ''n''}}[[零矩阵]]。然后将当前变量和一度滞后变量的{{math|2''n'' × 1}}叠加向量表示为{{math|'''z'''<sub>''t''</sub>}}，将{{math|2''n'' × 2''n''}}分块矩阵表示为{{math|'''L'''}}，就得到了之前的解

:<math>\mathbf z_t = \mathbf L^t \mathbf z_0 </math>

与之前一样，当且仅当矩阵{{math|'''L'''}} 的所有特征值的绝对值都小于1时，叠加方程与原二阶方程才稳定。

==非线性矩阵差分方程：黎卡提方程==

在[[LQG控制]]中，会出现一个当前和未来成本矩阵反向演化的非线性矩阵方程，下面用{{math|'''H'''}}表示。这个方程也被称为离散动力[[黎卡提方程]]，当据线性矩阵差分方程演化的变量向量受外源向量的控制，以优化[[二次函数|二次]][[多对象优化|损失函数]]时，就会产生这个方程。黎卡提方程形式如下：

: <math> \mathbf{H}_{t-1} = \mathbf{K} +\mathbf{A}'\mathbf{H}_t\mathbf{A} - \mathbf{A}'\mathbf{H}_t\mathbf{C}\left[\mathbf{C}'\mathbf{H}_t\mathbf{C}+\mathbf{R}\right]^{-1}\mathbf{C}'\mathbf{H}_t\mathbf{A} </math>

其中{{math|'''H'''}}、{{math|'''K'''}}、{{math|'''A'''}}尺寸为{{math|''n'' × ''n''}}；{{math|'''C'''}}尺寸为{{math|''n'' × ''k''}}；{{math|'''R'''}}尺寸为{{math|''k'' × ''k''}}，{{math|''n''}}是受控向量元素数，{{math|''k''}}是控制向量元素数。参数矩阵{{math|'''A'''}}、{{math|'''C'''}}来自线性方程，参数矩阵{{math|'''K'''}}、{{math|'''R'''}}来自二次损失函数。详见[[代數Riccati方程#離散時間的代數Riccati方程|此处]]。

一般来说，该方程无法根据{{mvar|t}}分析求解{{math|'''H'''<sub>''t''</sub>}}，而是通过迭代黎卡提方程，求出{{math|'''H'''<sub>''t''</sub>}}的值序列。不过，已经证明<ref>{{cite journal|last1=Balvers|first1=Ronald J.|last2=Mitchell|first2=Douglas W.|date=2007|title=Reducing the dimensionality of linear quadratic control problems|journal=Journal of Economic Dynamics and Control|volume=31|issue=1|pages=141–159|doi=10.1016/j.jedc.2005.09.013|url=https://papers.tinbergen.nl/01043.pdf|access-date=2023-10-15|archive-date=2022-01-18|archive-url=https://web.archive.org/web/20220118212230/https://papers.tinbergen.nl/01043.pdf|dead-url=no}}</ref>，若{{math|'''R''' {{=}} [[Zero matrix|'''0''']]}}、{{math|''n'' {{=}} ''k'' + 1}}，则可将黎卡提方程简化为标量[[有理差分方程]]分析求解；对任意{{mvar|k}}、{{mvar|n}}，若转移矩阵{{math|'''A'''}}可逆，则黎卡提方程就可根据矩阵特征值进行分析求解，尽管特征值可能要用数值计算才能找到。<ref>{{cite journal|last=Vaughan|first=D. R.|date=1970|title=A nonrecursive algebraic solution for the discrete Riccati equation|journal=IEEE Transactions on Automatic Control|volume=15|issue=5|pages=597–599|doi=10.1109/TAC.1970.1099549}}</ref> 

在大多数情况下，{{math|'''H'''}}随时间的演化是稳定的，也就是说{{math|'''H'''}}会收敛到特定的常矩阵{{math|'''H'''*}}，其他矩阵都有理时也可能是无理的。参见[[隨機控制#離散時間系統]]。

相关的黎卡提方程<ref>{{cite book|last1=Martin|first1=C. F.|last2=Ammar|first2=G.|contribution=The geometry of the matrix Riccati equation and associated eigenvalue method|editor1-last=Bittani|editor2-last=Laub|editor3-last=Willems|date=1991|title=The Riccati Equation|publisher=Springer-Verlag|isbn=978-3-642-63508-3|doi=10.1007/978-3-642-58223-3_5}}</ref>是

:<math>\mathbf{X}_{t+1} = -\left[\mathbf{E}+\mathbf{B}\mathbf{X}_t\right]\left[\mathbf{C}+\mathbf{A}\mathbf{X}_t\right]^{-1}</math>

其中{{math|'''X''', '''A''', '''B''', '''C''', '''E'''}}全都是{{math|''n'' × ''n''}}方阵。这个方程可以显式求解。假设<math>\mathbf X_t = \mathbf N_t \mathbf D_t^{-1}</math>，在{{math|''t'' {{=}} 0}}时{{math|'''N'''<sub>0</sub> {{=}} '''X'''<sub>0</sub>}}、{{math|'''D'''<sub>0</sub> {{=}} [[identity matrix|'''I''']]}}显然成立。然后将其用于差分方程，得出

:<math>\begin{align}
\mathbf{X}_{t+1}&=-\left[\mathbf{E}+\mathbf{BN}_t\mathbf{D}_t^{-1}\right]\mathbf{D}_t\mathbf{D}_t^{-1}\left[\mathbf{C}+\mathbf{AN}_t\mathbf{D}_t^{-1}\right]^{-1}\\
&=-\left[\mathbf{ED}_t+\mathbf{BN}_t\right]\left[\left[\mathbf{C}+\mathbf{AN}_t \mathbf{D}_t^{-1}\right]\mathbf{D}_t\right]^{-1}\\
&=-\left[\mathbf{ED}_t+\mathbf{BN}_t\right]\left[\mathbf{CD}_t+\mathbf{AN}_t\right]^{-1}\\
&=\mathbf{N}_{t+1}\mathbf{D}_{t+1}^{-1}
\end{align}</math>

因此通过归纳法，形式<math>\mathbf X_t = \mathbf N_t \mathbf D_t^{-1}</math>对所有{{mvar|t}}都成立。那么{{math|'''N'''}}、{{math|'''D'''}}的演化可写为

:<math>\begin{bmatrix} \mathbf{N}_{t+1} \\ \mathbf{D}_{t+1} \end{bmatrix} = \begin{bmatrix} -\mathbf{B} & -\mathbf{E} \\ \mathbf{A} & \mathbf{C} \end{bmatrix} \begin{bmatrix} \mathbf{N}_t \\ \mathbf{D}_t \end{bmatrix} \equiv \mathbf{J} \begin{bmatrix}\mathbf{N}_t \\ \mathbf{D}_t \end{bmatrix}</math>

因此可归纳

:<math>\begin{bmatrix} \mathbf{N}_t \\ \mathbf{D}_t \end{bmatrix} = \mathbf{J}^{t} \begin{bmatrix} \mathbf{N}_0 \\ \mathbf{D}_0 \end{bmatrix}</math>

==另见==
*[[矩阵微分方程]]
*[[差分方程]]
*[[线性差分方程]]
*[[动力系统]]
*[[代数Riccati方程]]

==参考文献==
{{reflist}}

[[Category:线性代数]]
[[Category:递回关系]]
[[Category:动力系统]]