Model Predictive Control (1): What and Why MPC

Model predictive control is like playing chess: at each time step, you choose the best strategy to win, and you make that choice by predicting how the game will unfold from the current situation.

Model predictive control (MPC), also known as dynamic matrix control (DMC), generalized predictive control (GPC), or receding horizon control (RHC), is an online control algorithm that numerically solves an optimization problem at each step. This article summarizes the model predictive control strategy. It is mainly a summary of reference ².

Why MPC

Intuitive concept, easy to understand and implement
Systematic handling of constraints
Handles MIMO systems and dead time without modification
Feed-forward makes good use of future target information
Handles challenging dynamics (unlike PID)

MPC is widely used in industry because superior control has been shown to improve profits:

If one is confident that the variance of the output can be reduced, one can safely operate closer to a constraint and increase the quantity produced.
The ability to incorporate constraints explicitly enables ‘optimum’ constrained performance.

What is MPC⁴

Main components (Important)

Prediction
Receding horizon (sliding window)
Modelling
Performance index
Degree of freedom
Constraint handling
Multivariable (MIMO)

We will discuss them one by one.

Prediction

When we talk about prediction, we need to answer the following questions:

Why is prediction important?

Think three times before you act (a Chinese proverb).

Before planning an activity, always think through all the likely consequences, or there will be disasters!

How should we predict, how far, and how accurately?
- How far
The prediction horizon is often mistreated as a tuning parameter in MPC. However, in terms of normal human behavior, we all know how far we need to predict. For example:

Q: When driving a car, what is the prediction horizon?

A: You should predict beyond the safe braking distance, or you will die! For example, when driving at 70 mph you need to look at least 100 m ahead, while at 20 mph, 20 m is OK.

Prediction horizon > settling time!

Summary: always look beyond the key dynamics of a process

Consequence of not predicting: Low performance, maybe disasters.
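As a rough illustration of the rule of thumb above, the sketch below measures the settling time of a simple first-order model and then picks a horizon beyond it. The model $x_{k+1}=a x_k+(1-a)u$, the 2% settling band, the 20% margin, and the function names are all assumptions made for this example.

```python
def settling_steps(a, tol=0.02, max_steps=10_000):
    """Steps until the unit-step response of x[k+1] = a*x[k] + (1-a)*u is within tol of 1."""
    x = 0.0
    for k in range(1, max_steps + 1):
        x = a * x + (1.0 - a) * 1.0   # unit step input u = 1
        if abs(1.0 - x) <= tol:       # for 0 < a < 1 the error only shrinks from here
            return k
    raise ValueError("response did not settle; is 0 < a < 1?")


def suggest_horizon(a, margin=1.2):
    """Pick a horizon some margin beyond the settling time (the margin is an arbitrary choice)."""
    return int(settling_steps(a) * margin) + 1


if __name__ == "__main__":
    for a in (0.5, 0.9):
        print(a, settling_steps(a), suggest_horizon(a))
```

A slower process (larger `a`) settles later, so the suggested horizon grows with it, which is the "look beyond the key dynamics" idea in numbers.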

Receding horizon

It means that we continually update our predictions and decision making to take account of the most recent target and measurement data.

One effect is that the prediction horizon is always relative to the current position and recedes away from the viewer as the viewer moves forward.

Feedback from MPC

In MPC, continually updating the predictions and decisions based on measurement data introduces feedback. (Measurement introduces feedback.)

Measurement is the first core part of a feedback loop
Decisions based on measurements are the second core part

Predictive control incorporates both.
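To make the role of feedback concrete, here is a small sketch comparing a plan computed once and applied blindly with a plan that is recomputed from every new measurement. The scalar plant, the deliberate model mismatch (`a_model` versus `a_true`), and the very simple "reach the target in one step, then hold" planner are assumptions for illustration; the planner is a stand-in, not an optimizer.

```python
def plan(x, r, a, b, horizon):
    """Model-based plan: reach r in one step, then hold it (according to the model)."""
    first = (r - a * x) / b
    hold = (r - a * r) / b
    return [first] + [hold] * (horizon - 1)


def simulate(receding, steps=200, a_model=0.9, a_true=0.95, b=0.5, r=1.0, horizon=5):
    """Run the true plant x[k+1] = a_true*x[k] + b*u[k] and return the final state."""
    x = 0.0
    open_loop_moves = plan(x, r, a_model, b, steps)  # computed once, never updated
    for k in range(steps):
        if receding:
            u = plan(x, r, a_model, b, horizon)[0]   # re-plan from the newest measurement
        else:
            u = open_loop_moves[k]
        x = a_true * x + b * u
    return x


if __name__ == "__main__":
    print("open loop ends at       ", simulate(receding=False))
    print("receding horizon ends at", simulate(receding=True))
```

With these numbers the open-loop plan drifts to about twice the target, while re-planning from measurements keeps the output within roughly 5% of it, even though both use the same imperfect model.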

Modelling

Modelling is a core part of predictive control. People predict based on experience; in the same way, a controller can only make predictions if the behavior of the system is understood, so a model is required.

However, here comes the problem: what is an appropriate prediction model?

Modelling requirements

Easy to form predictions (ideally linear)
Easy to identify model parameters
Accurate predictions: steady state, fast transients, mid-response, ...

The simplest model that gives accurate predictions is usually the best.

But it’s OK if the model is not that accurate, because feedback deals with small errors.

Performance index

We can describe in words what constitutes good or bad performance. For example, we use SLOW, OSCILLATORY, UNSTABLE, and IDEAL to describe a step response.

[Figure: step responses labelled SLOW, OSCILLATORY, UNSTABLE, and IDEAL]

When we talk about performance index, we need to ask the following questions:

1. What is the performance index for?

To decide which input trajectory to apply, we need a precise numeric definition of ‘best’; the performance index is that definition. However, it should be noted that the performance index can still conflict with the real situation in some respects.

2. How should the performance index be designed?

The index should be simple; increase its complexity only where the benefit is clear. For example:

As people gain experience, they can carry out more complex activities and set more challenging performance indices, because experience improves their internal model.

Typically a quadratic performance index is used, because:

It gives a well-conditioned optimization
It has a unique minimum
It gives smooth behaviour (unlike the 1-norm or inf-norm)
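As a minimal sketch of what such an index looks like in practice, the snippet below scores two made-up responses to a unit step target with a weighted sum of squared tracking errors and squared inputs. The weights `q` and `r` and both trajectories are invented for illustration.

```python
def quadratic_cost(outputs, inputs, target, q=1.0, r=0.1):
    """Sum of weighted squared tracking errors plus weighted squared inputs."""
    tracking = sum(q * (y - target) ** 2 for y in outputs)
    effort = sum(r * u ** 2 for u in inputs)
    return tracking + effort


# Two made-up responses to a unit step target
slow = quadratic_cost([0.2, 0.4, 0.6, 0.8, 1.0], [1, 1, 1, 1, 1], target=1.0)
oscillatory = quadratic_cost([1.5, 0.6, 1.3, 0.8, 1.1], [3, -2, 2, -1, 1], target=1.0)
print(slow, oscillatory)
```

With these particular weights the oscillatory response scores worse, mostly because of its aggressive inputs; changing `q` and `r` shifts the balance between tracking and input effort.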

3. How should we make trade offs between optimal and safe/robust performance?

[Figure: trade-off between optimal and safe/robust performance]

Degree of freedom (DOF)

DOF describes the complexity of the input predictions and is closely linked to the performance index. There is no point in using a high DOF with a highly demanding performance index if the model is poor; it’s just like asking a beginner to play like a master!

The useful number of DOF is related to the prediction accuracy.

An ill-posed performance index is shown below:

[Figure: fitting a second-order curve with one degree of freedom]

Trying to fit a second-order curve with one DOF is BAD! One degree of freedom can only be applied to a simple target.

In MPC, an ill-posed performance index means using a prediction horizon that is short compared with the system dynamics (so that further events cannot be predicted) together with numerous DOF to optimize tracking within that horizon. With a short prediction horizon, one cannot fully anticipate the consequences of one's actions, so the planned actions may be poor.

Constraint handling

One major advantage of predictive control is that it embeds constraints into the strategy. This is critical to getting effective and robust closed-loop behavior. We should know that:

The proposed input trajectory is optimal only if it satisfies the constraints
More typical control strategies (like PID) treat constraints as an afterthought

Example: overshooting

In some situations, overshooting is a disaster. In chemical tank level control, for example, an overshoot spills chemicals everywhere; it means we have lost control of the system and may cause an explosion.

In MPC, the constraints (flow, power, or speed limitations) are embedded, which means it will not propose input flows that allow overshooting. The response may become slower, but it is much safer. In predictive control, the input is limited to 100%, and earlier input choices that would make the system unstable are not allowed.

Multivariable (MIMO)

In a MIMO system, changing one input often changes all the outputs, so we need a control law that considers all the inputs and outputs together. One advantage of MPC is that it takes this interaction into account, although the algorithm must then be expressed in mathematical form.

How MPC works

Mathematical form about MPC

Basic of MPC

The first thing we need to know is that MPC is usually implemented in discrete time¹. Meanwhile, due to the limitations of the physical world, there are constraints on the $m$ inputs and $n$ states:

$\begin{equation} \begin{array}{ll}{U_{\min } \leq u_{i}(k) \leq U_{\max }} & {\text { for all } i=1,2, \ldots, m \text { and } k=0,1,2, \ldots, \infty} \\ {X_{\min } \leq x_{i}(k) \leq X_{\max }} & {\text { for all } i=1,2, \ldots, n \text { and } k=0,1,2, \ldots, \infty} \end{array} \end{equation}$

So in a practical control system we need to solve a constrained LQR problem. Now let’s convert the infinite-time horizon to an $N$-step finite-time horizon. Our optimization problem then has a finite number of variables, ${\mathrm{x}(1), \mathrm{x}(2), \ldots, \mathrm{x}(\mathrm{N}) ; \mathrm{u}(0), \mathrm{u}(1), \ldots, \mathrm{u}(\mathrm{N}-1)}$, linked by the linear model equations, with initial state $x(0)=x_0$. Now we state the goal of MPC.

Goal: find the best control sequence over a future horizon of $N$ steps

$\begin{equation} \begin{aligned} \min &\sum^{N-1}_{k=0}\left \| W^y(y_k-r(t)) \right \|^2_2+\left \| W^u(u_k-u_r(t)) \right \|^2_2\\ s.t.\quad &\mathrm{prediction}\\ &x_{k+1}=f(x_k,u_k)\\ &y_k=g(x_k,u_k)\\ &\mathrm{constraints}\\ &u_{\min}\leq u_k \leq u_{\max}\\ &y_{\min}\leq y_k \leq y_{\max}\\ &\mathrm{state\ feedback}\\ &x_0=x(t) \end{aligned} \end{equation}$

Linear MPC (unconstrained case = linear state-feedback!)

Unconstrained case

Let’s consider our linear prediction model:

$\begin{equation} \left\{\begin{aligned} x_{k+1} &=A x_{k}+B u_{k} \\ y_{k} &=C x_{k} \end{aligned}\right. \quad\begin{array}{l}{x \in \mathbb{R}^{n}} \\ {u \in \mathbb{R}^{m}} \\ {y \in \mathbb{R}^{p}}\end{array} \end{equation}$

The relation between input and states is:

$\begin{equation} x_{k}=A^{k} x_{0}+\sum_{j=0}^{k-1} A^{j} B u_{k-1-j} \end{equation}$

and the performance index (cost function) is:

$\begin{equation} J\left(z, x_{0}\right)=x_{N}^{\prime} P x_{N}+\sum_{k=0}^{N-1} x_{k}^{\prime} Q x_{k}+u_{k}^{\prime} R u_{k} \quad \begin{aligned} R &=R^{\prime} \succ 0 \\ Q &=Q^{\prime} \succeq 0 \\ P &=P^{\prime} \succeq 0 \end{aligned} \quad z=\left[\begin{array}{c}{u_{0}} \\ {u_{1}} \\ {\vdots} \\ {u_{N-1}}\end{array}\right] \end{equation}$

Goal: find the sequence $z^*$ that minimizes the cost function $J$. See LQG regulator.

Stacking the states and inputs over the horizon, the cost can be written as:

$\begin{equation} \begin{aligned} J\left(z, x_{0}\right)=x_{0}^{\prime} Q x_{0} &+\left[\begin{array}{c}{x_{1}} \\ {x_{2}} \\ {\vdots} \\ {x_{N-1}} \\ {x_{N}}\end{array}\right]^{\prime} \underbrace{\left[\begin{array}{ccccc}{Q} & {0} & {0} & {\cdots} & {0} \\ {0} & {Q} & {0} & {\cdots} & {0} \\ {\vdots} & {\vdots} & {\ddots} & {\vdots} & {\vdots} \\ {0} & {\cdots} & {0} & {Q} & {0} \\ {0} & {0} & {\cdots} & {0} & {P}\end{array}\right]}_{\bar{Q}}\left[\begin{array}{c}{x_{1}} \\ {x_{2}} \\ {\vdots} \\ {x_{N-1}} \\ {x_{N}}\end{array}\right] \\ &+\left[\begin{array}{c}{u_{0}} \\ {u_{1}} \\ {\vdots} \\ {u_{N-1}}\end{array}\right]^{\prime} \underbrace{\left[\begin{array}{cccc}{R} & {0} & {\cdots} & {0} \\ {0} & {R} & {\cdots} & {0} \\ {\vdots} & {\vdots} & {\ddots} & {\vdots} \\ {0} & {\cdots} & {0} & {R}\end{array}\right]}_{\bar{R}}\left[\begin{array}{c}{u_{0}} \\ {u_{1}} \\ {\vdots} \\ {u_{N-1}}\end{array}\right] \end{aligned} \end{equation}$

The stacked states follow from the prediction model:

$\begin{equation} \left[\begin{array}{c}{x_{1}} \\ {x_{2}} \\ {\vdots} \\ {x_{N}}\end{array}\right]=\underbrace{\left[\begin{array}{cccc}{B} & {0} & {\cdots} & {0} \\ {A B} & {B} & {\cdots} & {0} \\ {\vdots} & {\vdots} & {\ddots} & {\vdots} \\ {A^{N-1} B} & {A^{N-2} B} & {\ldots} & {B}\end{array}\right]}_{\bar{S}}\left[\begin{array}{c}{u_{0}} \\ {u_{1}} \\ {\vdots} \\ {u_{N-1}}\end{array}\right]+\underbrace{\left[\begin{array}{c}{A} \\ {A^{2}} \\ {\vdots} \\ {A^{N}}\end{array}\right]}_{\bar{T}} x_{0} \end{equation}$

Substituting the stacked states $\bar{S} z+\bar{T} x_{0}$ into the cost gives:

$\begin{equation} \begin{aligned} J\left(z, x_{0}\right) &=\left(\bar{S} z+\bar{T} x_{0}\right)^{\prime} \bar{Q}\left(\bar{S} z+\bar{T} x_{0}\right)+z^{\prime} \bar{R} z+x_{0}^{\prime} Q x_{0} \\ &=\frac{1}{2} z^{\prime} \underbrace{2\left(\bar{R}+\bar{S}^{\prime} \bar{Q} \bar{S}\right)}_{H} z+x_{0}^{\prime} \underbrace{2 \bar{T}^{\prime} \bar{Q} \bar{S}}_{F^{\prime}} z+\frac{1}{2} x_{0}^{\prime} \underbrace{2\left(Q+\bar{T}^{\prime} \bar{Q} \bar{T}\right)}_{Y} x_{0} \end{aligned} \end{equation}$

Now let’s take a look at the cost function

$\begin{equation} J\left(z, x_{0}\right)=\frac{1}{2} z^{\prime} H z+x_{0}^{\prime} F^{\prime} z+\frac{1}{2} x_{0}^{\prime} Y x_{0} \end{equation}$

the optimum is obtained by zeroing the gradient

$\begin{equation} \nabla_{z} J\left(z, x_{0}\right)=H z+F x_{0}=0 \end{equation}$

and hence the solution is:

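$\begin{equation} z^{*}=\left[\begin{array}{c}{u_{0}^{*}} \\ {u_{1}^{*}} \\ {\vdots} \\ {u_{N-1}^{*}}\end{array}\right]=-H^{-1} F x_{0} \end{equation}$

With $x_0=x(t)$, only the first move $u_0^*$ is applied, so the control law is:

$\begin{equation} u(t)=u_{0}^{*}=-[I\ 0\ \ldots\ 0]\, H^{-1} F x(t)=K x(t) \end{equation}$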

Unconstrained linear MPC = linear state-feedback!
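The derivation above can be checked numerically. The following is a sketch using NumPy, assuming a double integrator sampled at 0.1 s with state [position, velocity] and arbitrarily chosen weights $Q$, $R$, $P$; the function name `condensed_matrices` and the numbers are hypothetical. It builds the stacked prediction matrices, forms $H$, $F$, $Y$, and extracts the state-feedback gain $K$ from the first block row of $-H^{-1}F$.

```python
import numpy as np


def condensed_matrices(A, B, Q, R, P, N):
    """Build the stacked prediction matrices and the condensed cost terms H, F, Y."""
    n, m = B.shape
    S_bar = np.zeros((N * n, N * m))
    T_bar = np.zeros((N * n, n))
    for i in range(N):  # block row i predicts x_{i+1}
        T_bar[i*n:(i+1)*n, :] = np.linalg.matrix_power(A, i + 1)
        for j in range(i + 1):
            S_bar[i*n:(i+1)*n, j*m:(j+1)*m] = np.linalg.matrix_power(A, i - j) @ B
    Q_bar = np.kron(np.eye(N), Q)
    Q_bar[-n:, -n:] = P                      # terminal weight on x_N
    R_bar = np.kron(np.eye(N), R)
    H = 2 * (R_bar + S_bar.T @ Q_bar @ S_bar)
    F = 2 * (S_bar.T @ Q_bar @ T_bar)
    Y = 2 * (Q + T_bar.T @ Q_bar @ T_bar)
    return S_bar, T_bar, H, F, Y


# Assumed example: double integrator sampled at 0.1 s, state = [position, velocity]
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.005], [0.1]])
Q, R, P, N = np.eye(2), 0.1 * np.eye(1), np.eye(2), 10

S_bar, T_bar, H, F, Y = condensed_matrices(A, B, Q, R, P, N)
x0 = np.array([1.0, 0.0])
z_star = -np.linalg.solve(H, F @ x0)         # zero of the gradient H z + F x0
K = -np.linalg.solve(H, F)[:B.shape[1], :]   # first block row: u(t) = K x(t)
print("K =", K)
```

Because $K$ does not depend on $x_0$, it can be computed once offline; online, the unconstrained controller is just the matrix product $Kx(t)$.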

Constrained case

We keep the prediction model above, but this time we add input and output constraints to enforce. Our control problem becomes

$\begin{equation} \begin{array}{cl}{\min _{z}} & {x_{N}^{\prime} P x_{N}+\sum^{N-1}_{k=0} x_{k}^{\prime} Q x_{k}+u_{k}^{\prime} R u_{k}} \\ {\text { s.t. }} & {u_{\min } \leq u_{k} \leq u_{\max }, k=0, \ldots, N-1} \\ {} & {y_{\min } \leq y_{k} \leq y_{\max }, k=1, \ldots, N}\end{array} \end{equation}$

Written in condensed form:

$\begin{equation} \begin{aligned} V\left(x_{0}\right)=\frac{1}{2} x_{0}^{\prime} Y x_{0}+& \min _{z} \frac{1}{2} z^{\prime} H z+x_{0}^{\prime} F^{\prime} z \\ \text { s.t. } & G z \leq W+S x_{0} \end{aligned} \end{equation}$

Input constraints $u_{\min } \leq u_{k} \leq u_{\max }$

$\begin{equation} \left[\begin{array}{cccc}{1} & {0} & {\cdots} & {0} \\ {0} & {1} & {\cdots} & {0} \\ {\vdots} & {} & {\ddots} & {\vdots} \\ {0} & {\cdots} & {0} & {1} \\ {-1} & {0} & {\cdots} & {0} \\ {0} & {-1} & {\cdots} & {0} \\ {\vdots} & {} & {\ddots} & {\vdots} \\ {0} & {\cdots} & {0} & {-1}\end{array}\right] z \leq\left[\begin{array}{c}{u_{\max }} \\ {u_{\max }} \\ {\vdots} \\ {u_{\max }} \\ {-u_{\min }} \\ {-u_{\min }} \\ {\vdots} \\ {-u_{\min }}\end{array}\right] \quad z=\left[\begin{array}{c}{u_{0}} \\ {u_{1}} \\ {\vdots} \\ {u_{N-1}}\end{array}\right] \end{equation}$

Output constraints $y_{k}=C A^{k} x_{0}+\sum_{i=0}^{k-1} C A^{i} B u_{k-1-i} \leq y_{\max }, k=1, \ldots, N$

$\begin{equation} \left[\begin{array}{cccc}{C B} & {0} & {\cdots} & {0} \\ {C A B} & {C B} & {\cdots} & {0} \\ {\vdots} & {} & {} & {\vdots} \\ {C A^{N-1} B} & {\cdots} & {C A B} & {C B}\end{array}\right] z \leq\left[\begin{array}{c}{y_{\max }} \\ {y_{\max }} \\ {\vdots} \\ {y_{\max }}\end{array}\right]-\left[\begin{array}{c}{C A} \\ {C A^{2}} \\ {\vdots} \\ {C A^{N}}\end{array}\right] x_{0} \end{equation}$
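As a sketch of how these blocks can be assembled into $G z \leq W+S x_{0}$, the NumPy code below stacks the input bounds and the output bounds. It also adds the lower output bound $y_{\min}$ by negating the upper-bound rows, following the same pattern as the input constraints. The double-integrator matrices, the scalar bounds applied to every component, and the function names are assumptions made for this example.

```python
import numpy as np


def prediction_matrices(A, B, N):
    """Stacked prediction [x_1; ...; x_N] = S_bar z + T_bar x0."""
    n, m = B.shape
    S_bar = np.zeros((N * n, N * m))
    T_bar = np.zeros((N * n, n))
    for i in range(N):
        T_bar[i*n:(i+1)*n, :] = np.linalg.matrix_power(A, i + 1)
        for j in range(i + 1):
            S_bar[i*n:(i+1)*n, j*m:(j+1)*m] = np.linalg.matrix_power(A, i - j) @ B
    return S_bar, T_bar


def constraint_matrices(A, B, C, N, u_min, u_max, y_min, y_max):
    """Return G, W, S such that all bounds hold exactly when G z <= W + S x0."""
    n, m = B.shape
    p = C.shape[0]
    S_bar, T_bar = prediction_matrices(A, B, N)
    C_bar = np.kron(np.eye(N), C)             # applies C to every predicted state
    Y_z, Y_x = C_bar @ S_bar, C_bar @ T_bar   # stacked outputs = Y_z z + Y_x x0
    I = np.eye(N * m)
    G = np.vstack([I, -I, Y_z, -Y_z])
    W = np.concatenate([np.full(N * m, u_max), np.full(N * m, -u_min),
                        np.full(N * p, y_max), np.full(N * p, -y_min)])
    S = np.vstack([np.zeros((2 * N * m, n)), -Y_x, Y_x])
    return G, W, S


def is_feasible(G, W, S, z, x0):
    """Check the stacked inequality with a small numerical tolerance."""
    return bool(np.all(G @ z <= W + S @ x0 + 1e-12))


# Assumed example: double integrator (Ts = 0.1), position measured, all bounds +/- 1
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.005], [0.1]])
C = np.array([[1.0, 0.0]])
G, W, S = constraint_matrices(A, B, C, N=10, u_min=-1.0, u_max=1.0, y_min=-1.0, y_max=1.0)
print(is_feasible(G, W, S, np.zeros(10), np.array([0.5, 0.0])))
```

Note that only the right-hand side depends on $x_0$, so $G$, $W$, and $S$ can be built once and reused at every sampling time.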

Linear MPC algorithm

At each sampling time $t$:

Measure (or estimate) the current state $x(t)$
Get the solution $z^{*}=\left[\begin{array}{c}{u_{0}^{*}} \\ {u_{1}^{*}} \\ {\vdots} \\ {u_{N-1}^{*}}\end{array}\right]$ of the QP

$\begin{equation} \left\{\begin{array}{cc}{\min _{z}} & {\frac{1}{2} z^{\prime} H z+x^{\prime}(t) F^{\prime} z} \\ {\text { s.t. }} & {G z \leq W+Sx(t)}\end{array}\right. \end{equation}$

Apply only $u(t)=u^{*}_0$, discarding the remaining optimal inputs $u^{*}_1,\ldots,u^{*}_{N-1}$
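The three steps can be sketched as a closed loop in NumPy. To keep the example self-contained, it handles only input bounds and solves the QP with a plain projected-gradient iteration, which is a swapped-in stand-in for a real QP solver; output constraints would need a general-purpose solver. The double-integrator plant, the weights, the horizon, and all function names are assumptions, and the simulated plant is taken to be identical to the model.

```python
import numpy as np


def condensed_cost(A, B, Q, R, P, N):
    """H and F of the condensed cost 0.5 z'Hz + x0'F'z."""
    n, m = B.shape
    S_bar = np.zeros((N * n, N * m))
    T_bar = np.zeros((N * n, n))
    for i in range(N):
        T_bar[i*n:(i+1)*n, :] = np.linalg.matrix_power(A, i + 1)
        for j in range(i + 1):
            S_bar[i*n:(i+1)*n, j*m:(j+1)*m] = np.linalg.matrix_power(A, i - j) @ B
    Q_bar = np.kron(np.eye(N), Q)
    Q_bar[-n:, -n:] = P
    R_bar = np.kron(np.eye(N), R)
    H = 2 * (R_bar + S_bar.T @ Q_bar @ S_bar)
    F = 2 * (S_bar.T @ Q_bar @ T_bar)
    return H, F


def solve_box_qp(H, f, lo, hi, z, step, iters=300):
    """Projected gradient for min 0.5 z'Hz + f'z subject to lo <= z <= hi."""
    for _ in range(iters):
        z = np.clip(z - step * (H @ z + f), lo, hi)
    return z


def run_mpc(x0, steps=300, N=20, u_min=-1.0, u_max=1.0):
    A = np.array([[1.0, 0.1], [0.0, 1.0]])    # assumed double integrator, Ts = 0.1
    B = np.array([[0.005], [0.1]])
    m = B.shape[1]
    H, F = condensed_cost(A, B, np.eye(2), 0.1 * np.eye(m), np.eye(2), N)
    step = 1.0 / np.linalg.eigvalsh(H).max()  # gradient step 1/L, L = largest eigenvalue of H
    x, z = np.asarray(x0, dtype=float), np.zeros(N * m)
    states, inputs = [x], []
    for _ in range(steps):
        z = solve_box_qp(H, F @ x, u_min, u_max, z, step)  # QP at the measured state
        u = z[:m]                                          # apply only the first move
        x = A @ x + B @ u                                  # plant update (plant = model here)
        states.append(x)
        inputs.append(u)
    return np.array(states), np.array(inputs)


states, inputs = run_mpc([2.0, 0.0])
print("final state:", states[-1])
```

Each pass through the loop discards the tail of the previous solution except as a warm start for the next solve, which is the receding-horizon idea in code.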

Application

Explicit MPC Control of a Single-Input-Single-Output Plant³

This subsection is mainly a summary of reference ³.

Plant design

The linear open-loop dynamic model is a double integrator.

plant = tf(1,[1 0 0]);   % double integrator, 1/s^2

MPC design

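Ts = 0.1;                        % sample time
p = 10;                          % prediction horizon (steps)
m = 3;                           % control horizon (steps)
mpcobj = mpc(plant, Ts, p, m);   % create the MPC controller object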

`p` sets the prediction horizon (PredictionHorizon), specified as a positive integer number of steps. The product of PredictionHorizon and Ts is the prediction time; that is, how far the controller looks into the future.
`m` sets the control horizon (ControlHorizon), specified as one of the following:
- Positive integer, m, between 1 and p, inclusive, where p is equal to PredictionHorizon. In this case, the controller computes m free control moves occurring at times k through k+m-1, and holds the controller output constant for the remaining prediction horizon steps from k+m through k+p-1. Here, k is the current control interval.
- Vector of positive integers [m1, m2, …], specifying the lengths of blocking intervals. By default the controller computes M blocks of free moves, where M is the number of blocking intervals. The first free move applies to times k through k+m1-1, the second free move applies from time k+m1 through k+m1+m2-1, and so on. Using block moves can improve the robustness of your controller. The sum of the values in ControlHorizon must match the prediction horizon p. If you specify a vector whose sum is:
  - Less than the prediction horizon, then the controller adds a blocking interval. The length of this interval is such that the sum of the interval lengths is p. For example, if p=10 and you specify a control horizon of ControlHorizon=[1 2 3], then the controller uses four intervals with lengths [1 2 3 4].
  - Greater than the prediction horizon, then the intervals are truncated until the sum of the interval lengths is equal to p. For example, if p=10 and you specify a control horizon of ControlHorizon= [1 2 3 6 7], then the controller uses four intervals with lengths [1 2 3 4].
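The padding and truncation rules described above can be mimicked in a few lines. This is only a sketch of the rules as stated in the text, not the toolbox's implementation; the function names are hypothetical, and the scalar case reflects my reading that m free moves with the last one held is equivalent to m-1 blocks of length 1 followed by one long block.

```python
def normalize_blocking(intervals, p):
    """Pad or truncate blocking intervals so that their lengths sum to p."""
    result = []
    remaining = p
    for length in intervals:
        if remaining <= 0:
            break                        # everything after the horizon is dropped
        step = min(length, remaining)    # truncate the interval that crosses the horizon
        result.append(step)
        remaining -= step
    if remaining > 0:
        result.append(remaining)         # add one interval to fill up the horizon
    return result


def scalar_to_blocking(m, p):
    """Scalar control horizon m: m free moves, the last one held until the horizon ends."""
    if not 1 <= m <= p:
        raise ValueError("m must be between 1 and p")
    return [1] * (m - 1) + [p - m + 1]


if __name__ == "__main__":
    print(normalize_blocking([1, 2, 3], 10))        # the padding example from the text
    print(normalize_blocking([1, 2, 3, 6, 7], 10))  # the truncation example from the text
    print(scalar_to_blocking(3, 10))
```

Both examples from the text end up as four intervals with lengths [1 2 3 4], matching the behaviour described.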

Specify actuator saturation limits as MV constraints.

mpcobj.MV = struct('Min',-1,'Max',1);
