【MPC学习笔记】02：MPC详细简介（Lecture 1_1 Unconstrained MPC）

本笔记来自北航诸兵老师的课程
课程地址：模型预测控制（2022春）lecture 1-1 Unconstrained MPC
接上一篇：【MPC学习笔记】01：MPC简介（Lecture 1_1 Unconstrained MPC）

文章目录

1 详细介绍

1 详细介绍

1.1 状态方程

对 LTI 离散系统：
$Bu(k)\quad x\in\R^n, u\in R^p$

对传统控制系统，连续系统是好处理的，离散系统是要额外考虑其他因素的
对MPC，则是反过来，离散系统是好处理的，连续系统是要额外考虑其他因素的

假设：

$A, B$ 可控（Stablizable）
状态和控制输入不存在约束（本节讨论无约束MPC）

Define：
$x (i ∣ k), u (i ∣ k)$ : Prediction of $i$ steps ahead from time $k$ (比如，在时刻 $k$ 预测下一时刻的状态，记为 $x (1∣ k)$ ，当前时刻的输入，记为 $u (0∣ k)$ )
预测：
$\begin{aligned} x(1|k) &= Ax(0|k) + Bu(0|k) \\ x(2|k) &= Ax(1|k) + Bu(1|k)=A[Ax(0|k) + Bu(0|k)] + Bu(1|k) \\ &=A^2x(0|k) + ABu(0|k) + Bu(1|k) \\ \quad&\quad \quad \quad \quad \quad \vdots\quad \quad \quad \vdots\quad \quad \quad \vdots \\ x(i|k) &= A^ix(0|k) + A^{i-1}Bu(0|k) + A^{i-2}B^2u(1|k) + \cdots + Bu(i-1|k) \end{aligned}$

In compact form:
$\Phi U(k)$
$X(k)\triangleq \begin{bmatrix} x(1|k)\\ x(2|k)\\ \vdots\\ x(N|k) \end{bmatrix} \quad\quad U(k)\triangleq \begin{bmatrix} u(0|k)\\ u(1|k)\\ \vdots\\ u(N-1|k) \end{bmatrix}$

$X (k)$ 式中的 $x (k)$ 也即 $x (0∣ k)$
$\triangleq$ : 表示定义为
$N$ : Control/Predictive horizon，实际上二者有区别，但这里不做区分

1.2 Cost Function

这里cost function 的控制/预测时域是一个有限的数
$\begin{aligned} J(k) &= \sum^{N}_{i=1}||x(i|k)||_Q^2 + ||u(i-1|k)||_R^2 \\ &= X^T(k)\mathcal{Q}X(k) + U^T(k)\mathcal{R}U(k) \end{aligned}$
假设 $Q$ 和 $R$ 是正定的，是权重
$\mathcal{Q} = \begin{bmatrix} Q\\ &Q\\ &&\ddots\\ &&&Q \end{bmatrix} \quad\quad \mathcal{R} = \begin{bmatrix} R\\ &R\\ &&\ddots\\ &&&R \end{bmatrix}$
将 $\Phi U(k)$ 代入 $J (k)$ 有
$\begin{equation*} \begin{aligned} J(k) &= (Fx(k) + \Phi U(k))^{T} \mathcal{Q} (Fx(k) + \Phi U(k)) + U^T(k)\mathcal{R}U(k) \\ &= (x(k)^TF^T + U(k)^T\Phi^T)(\mathcal{Q}Fx(k) + \mathcal{Q}\Phi U(k))+ U^T(k)\mathcal{R}U(k) \\ &= \textcolor{green}{x(k)^TF^T\mathcal{Q}Fx(k)}+ \textcolor{red}{U(k)^T\Phi^T\mathcal{Q}Fx(k) + x(k)^TF^T\mathcal{Q}\Phi U(k)} + \textcolor{blue}{U(k)^T\Phi^T\mathcal{Q}\Phi U(k)+ U^T(k)\mathcal{R}U(k)} \\ &=\textcolor{green}{x(k)^TF^T\mathcal{Q}Fx(k)}+ \textcolor{red}{2x(k)^TF^T\mathcal{Q}\Phi U(k)} +\textcolor{blue}{U(k)^T(\Phi^T\mathcal{Q}\Phi+\mathcal{R})U(k)} \end{aligned} \end{equation*}$

$F x (k)$ 和 $\Phi U(k)$ 维数相同（是系统状态的维数*N），而 $\mathcal{Q}$ 是一个对角方阵，故 ${U(k)^T\Phi^T\mathcal{Q}Fx(k) = x(k)^TF^T\mathcal{Q}\Phi U(k)}=一个标量$ ，故红色部分相加相当于其中一个乘2

1.3 状态变量 $u (k)$ 的求解

Minimize the control function by predictive control series:
（可以不严谨地理解为：让 $J$ 最小，相当于求 $J$ 在 $J$ 对 $U$ 导数为0 点的值）
$\nabla_U{|}_{U=U^*}=\frac{\partial{J}}{\partial U}{|}_{U=U^*}=0$
$\frac{\partial{J}}{\partial U}=\textcolor{green}{0} +\textcolor{red}{2x(k)^TF^T\mathcal{Q}\Phi}+\textcolor{blue}{2U(k)^T(\Phi^T\mathcal{Q}\Phi+\mathcal{R})}$
令 $\frac{\partial{J}}{\partial U}=0$ ，可得：
$\begin{equation*} \begin{aligned} \textcolor{red}{x(k)^T F^T \mathcal{Q} \Phi}+\textcolor{blue}{U(k)^T (\Phi^T\mathcal{Q}\Phi+\mathcal{R})}= 0 \\ (\textcolor{red}{x(k)^TF^T\mathcal{Q}\Phi}+\textcolor{blue}{U(k)^T(\Phi^T\mathcal{Q}\Phi+\mathcal{R})})^T=0 \\ \textcolor{red}{(x(k)^TF^T\mathcal{Q}\Phi)^T}+\textcolor{blue}{(\Phi^T\mathcal{Q}\Phi+\mathcal{R})^TU(k)}=0 \\ \textcolor{red}{\Phi^T \mathcal{Q} Fx(k)} + \textcolor{blue} {(\Phi^T\mathcal{Q}\Phi+\mathcal{R})U(k)}=0 \\ \textcolor{blue}{(\Phi^T\mathcal{Q}\Phi+\mathcal{R})U(k)}=-\textcolor{red}{\Phi^T\mathcal{Q}Fx(k)} \\ \textcolor{blue}{U(k)}=-\textcolor{blue}{(\Phi^T\mathcal{Q}\Phi+\mathcal{R})^{-1}}\textcolor{red}{\Phi^T\mathcal{Q}Fx(k)} \end{aligned} \end{equation*}$
即：
$\begin{equation*} \begin{aligned} \textcolor{blue}{U(k)}=- \textcolor{blue}{(\Phi^T\mathcal{Q}\Phi+\mathcal{R})^{-1}} \textcolor{red}{\Phi^T\mathcal{Q}Fx(k)}\\ (R>0,Q\ge0;or R\ge0,Q\gt0, and\ \Phi\ is\ fully\ ranked) \end{aligned} \end{equation*}$

满足括号里的条件， $(\Phi^T\mathcal{Q}\Phi+\mathcal{R})$ 才可逆

$\begin{equation*} \begin{aligned} \begin{aligned} \textcolor{blue}{u^*(k)}&=- \begin{bmatrix} I_{p\times p}&0&\cdots 0 \end{bmatrix} \textcolor{blue}{(\Phi^T\mathcal{Q}\Phi+\mathcal{R})^{-1}} \textcolor{red}{\Phi^T\mathcal{Q}Fx(k)} \\ &=-K_{mpc}x(k) \end{aligned} \end{aligned} \end{equation*}$
取 $u^*(k)$ ，则在 $U (k)$ 前乘一个分块矩阵，对角线上的第一个分块是一个单位阵，维度为控制输入的维度 $p$ ,如果不是多输入而是单输入，则 $p = 1$ 。
$u^*(k)$ 最后化简为一个常数矩阵 $K_{mpc}$ （因为 $\Phi$ , $\mathcal{Q}$ , $\mathcal{R}$ , $F$ 这些全部是已知量）乘上 $k$ 时刻的状态变量，从形式上看是状态反馈。
因此，无约束线性MPC实际上是一个线性反馈控制。

1.4 举例

在这里插入图片描述
写一段matlab程序，即可求解 $F$ ， $\Phi$ 和 $u^*(k)$ 。但问题是，求解出来的 $u^*(k)$ 是否能保证系统是稳定的？
对于有稳定性的判定，有李雅普诺夫直接法和李雅普诺夫间接法，见下：

$> 0 和 < 0$ 分别指的是正定和负定
验证稳定性的前提是 $K_{mpc}$ 存在
可优化性并不决定可稳定性（两种可能的原因见下a和b），所以这里验证稳定性的操作是必要的。
优化是在一段时间上进行的，在这段时间内， $x$ 不一定由大变小，也可能先变小再变大，从而不收敛
对于非最小相位系统，系统响应方向可能相反，N取得不够大时，预测不能反映真实运动趋势，那么优化的不是系统真正的性能，导致不稳定的情况发生

本例采用李雅普诺夫间接法在离散时间系统下的判据
当 $N$ 的取值越来越小，如下面这个例子所示，特征值超出了单位圆，从而不稳定。
在这里插入图片描述
知道如何判断稳定性后，现在的问题变成了：每一次优化后，都要去验证一下系统的稳定性呢？有没有一种机制，保证每一次算出来的 $K$ 都保证系统的稳定性？