An Overview of Systems-Theoretic Guarantees in Data-Driven Model Predictive Control
An Overview of Systems-Theoretic Guarantees in Data-Driven Model Predictive Control
Systems-Theoretic
Guarantees in Data-Driven
Model Predictive Control
Julian Berberich and Frank Allgöwer
arXiv:2406.04130v1 [eess.SY] 6 Jun 2024
Abstract
1
1. INTRODUCTION
Determining models that are both accurate and tractable, as commonly required for con-
troller design, is challenging. On the other hand, the increasing availability of data provides
unprecedented opportunities for enhancing controllers with additional information. System
identification provides one possibility for extracting information from data by estimating a
model of the underlying dynamical system (1), which can then be employed for control. Ob-
taining theoretical guarantees for this indirect data-driven control strategy requires rigorous
bounds on the estimation error, whose derivation is an active field of research (2).
As an alternative to indirect approaches, designing controllers directly from data with-
out intermediate system identification has received increasing attention in the recent litera-
ture (3). The Fundamental Lemma by Willems et al. (4) provides a unifying framework for
direct data-driven control. It states that one persistently exciting trajectory of an unknown
linear time-invariant (LTI) system can be used to parametrize all other trajectories. This
allows to address various analysis and controller design problems, that are traditionally
solved using model knowledge, based purely on data, see (5) for a recent survey. One of
the most promising applications of the Fundamental Lemma is the design of data-driven
model predictive control (MPC) schemes. MPC is a powerful modern control technique
which can handle constraints, performance criteria, and nonlinear dynamics (6). It relies
on a receding-horizon principle: At each time step, an open-loop optimal control problem
is solved, the first component of the optimal input sequence is applied to the plant, and
the process is repeated at the next time step using a new measurement. While classi-
cal MPC schemes assume precise model knowledge, they can be enhanced via data using
learning-based (7) and adaptive (8, 9, 10) approaches.
This review provides an overview of recent research on direct data-driven MPC based on
the Fundamental Lemma with a focus on systems-theoretic guarantees for the closed-loop
system. It was first shown in (11, 12) that the Fundamental Lemma can be used to set
up data-driven MPC schemes, which compute control inputs directly from data without in-
termediate system identification. Since then, data-driven MPC has been successfully used
in a large number of applications, including a quadcopter (13), a four-tank system (14),
synchronous motor drives (15), a quadrupedal robot (16), a soft robot (17), green house
automation (18), and traffic control (19, 20, 21, 22). Further, various successful applications
have been demonstrated in the domain of energy systems, including grid-connected power
converters (23, 24), power system oscillation damping (25), building control (26, 27, 28, 29),
battery charging (30), a combined-cycle power plant (31), energy networks (32, 33, 34), and
a fuel cell system (35). These remarkable empirical demonstrations have motivated nu-
merous theoretical contributions with the goal of understanding and improving data-driven
MPC. When using data-driven MPC in real-world applications, especially for complex and
safety-critical systems, it is desirable to provide a priori guarantees on reliable closed-loop
operation. Mathematically, the goal is to ensure systems-theoretic properties such as sta-
bility, robustness, and constraint satisfaction for the controlled system.
In this review, we provide an overview of recent progress on systems-theoretic guaran-
tees in data-driven MPC. We discuss appropriate modifications that allow to prove rigorous
guarantees for the closed-loop system, covering different setups including linear and non-
linear systems as well as noise-free and noisy data. While we mainly focus on theoretical
aspects of data-driven MPC, in particular closed-loop guarantees, we refer to (5, 36, 37) for
further details, e.g., on implementations.
This review is structured as follows. In Section 2, we introduce preliminaries on the
For
√ a vector x, we denote the p-norm by ∥x∥p and we define the weighted norm ∥x∥P =
x⊤ Px for some matrix P = P⊤ .
2. Preliminaries
In this section, we introduce the Fundamental Lemma (Section 2.1) as well as model-based
MPC (Section 2.2).
The following result from (4), commonly referred to as the Fundamental Lemma, forms
the theoretical basis of data-driven MPC.
−1
Theorem 2.1. Suppose {udk }N L−1
k=0 is PE of order L + n. Then, {ūk , ȳk }k=0 is a trajectory
N −L+1
of the system in Equation 1 if and only if there exists α ∈ R such that
" # " #
HL (ud ) ū
α= . 2.
HL (yd ) ȳ
www.annualreviews.org • 3
The Fundamental Lemma (Theorem 2.1) provides a parametrization of all possible
input-output trajectories of the system in Equation 1 based on only one input-output tra-
−1
jectory {udk , ykd }N
k=0 . More precisely, the image of the Hankel matrices in Equation 2 is
equal to the set of all system trajectories (recall the notation ū = ū[0,L−1] and ȳ = ȳ[0,L−1] ).
While the result was originally formulated and proven in behavioral systems theory (4), al-
ternative proofs were provided more recently in the classical state-space framework (38, 39).
Further, the result has found increasing usage in the recent literature for developing direct
data-driven analysis and control methods. In the remainder of the review, we focus on data-
driven MPC based on the Fundamental Lemma with closed-loop guarantees. Beyond MPC,
the result can be used, e.g., for data-driven simulation (40), dissipativity analysis (41, 42),
and state-feedback design (43, 44, 45), see (5, 46, 47, 48) for recent overview articles.
s.t. x̄k+1 (t) = Ax̄k (t) + Būk (t), k ∈ I≥0 , x̄0 (t) = xt , 3b.
ūk (t) ∈ U, x̄k (t) ∈ X, k ∈ I[0,L−1] . 3c.
Here, ū(t) and x̄(t) denote the predicted input and state trajectory at time t, e.g., x̄k (t) is
the k-th step of the state trajectory predicted at time t. According to the constraint 3b,
{ūk (t), x̄k (t)}L−1
k=0 is a trajectory of the system 1 which is initialized at the current state xt .
In the cost 3a, the distance of this trajectory to the setpoint is penalized over the prediction
horizon L with positive definite cost matrices Qx , R. Further, Equation 3c enforces the
constraints on the predicted trajectory. We denote the optimal solution of the optimization
problem 3 by ū∗ (t), x̄∗ (t). It can be used to synthesize a feedback controller in a receding-
horizon fashion as commonly done in MPC: At time t, the first component ū∗0 (t) of the
optimal input trajectory is applied to the system and a new optimal input is computed at
the next time step based on a new state measurement, see Algorithm 2.1.
When applying Algorithm 2.1 without further modifications, the closed-loop system
need not be stable, see (49) for an experimental example. Guarantees on closed-loop sta-
bility and constraint satisfaction can be ensured either via a sufficiently long prediction
horizon L (50) or by adding terminal ingredients to the optimization problem 3, e.g., a
terminal cost and a terminal region constraint on the final state x̄L (t) (6).
Both the design and the implementation of standard MPC schemes as in Algorithm 2.1
require model knowledge. For example, for solving the optimization problem 3, the matrices
(A, B, C, D) need to be available. In this review, we cover an alternative MPC approach
which can control unknown systems only from input-output data while maintaining systems-
theoretic guarantees.
tasks in the ideal case of noise-free data (Section 3.1) and in the more realistic scenario of
noisy data (Section 3.2). In Section 3.3, we discuss more advanced MPC approaches, e.g.,
addressing control objectives beyond stabilization.
The input-output trajectory predicted at time t is denoted by ū(t) = ū[−n,L−1] (t) ∈ UL+n ,
ȳ(t) = ȳ[−n,L−1] (t) ∈ YL+n . The constraint in Equation 4b is based on the Fundamental
Lemma and ensures that {ūk (t), ȳk (t)}L−1
k=−n is a trajectory of the LTI system. The cost
penalizes the difference of the predicted trajectory w.r.t. the setpoint (us , ys ) for user-
specified positive definite matrices Q, R. Note that the predicted trajectory is of length
L + n. In the constraint 4c, the first n components {ūk (t), ȳk (t)}−1
k=−n are used to initialize
the prediction based on the most recent n input-output measurements {uk , yk }t−1 k=t−n . This
initialization over n time steps is required when using an input-output prediction model to
ensure that the internal states of the prediction and the plant coincide, i.e., to implicitly
enforce a constraint analogous to the initial condition in Equation 3b. Instead of the
system order n, any upper bound can be used (to be precise, an upper bound on the lag
is sufficient). Further, Equation 4d contains the input-output constraints. Note that, for
convex polytopic constraint sets U, Y, the optimization problem 4 is a convex quadratic
program. We denote the optimal solution of the optimization problem 4 by α∗ (t), ū∗ (t),
ȳ∗ (t). It can be used to set up a data-driven MPC scheme in a receding-horizon fashion
analogous to the model-based case in Section 2.2, see Algorithm 3.1.
www.annualreviews.org • 5
Algorithm 3.1. Data-driven MPC
Offline: Choose upper bound on system order n, prediction horizon L, positive definite
−1
cost matrices Q, R, constraint sets U, Y, setpoint (us , ys ), and generate data {udk , ykd }N
k=0 .
Online:
Algorithm 3.1 allows to control unknown LTI systems based only on input-output data
and without explicit model knowledge. The main difference to model-based MPC as in
Section 2.2 is the usage of data-dependent Hankel matrices for prediction, instead of a
state-space model. If the data are PE, the optimization problem 4 returns the same optimal
input as using a state-space model. Since the Fundamental Lemma directly parametrizes
input-output trajectories, Algorithm 3.1 inherently is an output-feedback MPC scheme. In
contrast, model-based output-feedback MPC schemes typically involve an observer whose
estimation error needs to be accounted for.
Analogous to the model-based case, data-driven MPC may lead to an unstable closed
loop and can even destabilize an open-loop stable system if L is too short (51). Closed-loop
stability of data-driven MPC can be ensured by modifying the optimization problem 4.
The arguably simplest approach is to add terminal equality constraints, i.e., to restrict the
optimal solution to be equal to the setpoint (us , ys ) over n steps at the end of the prediction
horizon. Mathematically, the following constraint is added to the optimization problem
" # " #
ūk (t) us
= , k ∈ I[L−n,L−1] . 5.
ȳk (t) ys
This implicitly ensures that the internal state corresponding to the input-output trajectory
ū(t), ȳ(t) is equal to the steady-state xs corresponding to (us , ys ) at time L. With this
modification, it can be shown under mild assumptions (e.g., PE) that, if the optimization
problem is feasible at initial time t = 0, then it is recursively feasible for all t ∈ I≥0
and Algorithm 3.1 exponentially stabilizes (us , ys ) in closed loop (51). While terminal
equality constraints are simple to implement and to study theoretically, they have significant
drawbacks in terms of robustness and the size of the region of attraction. In particular,
the latter only includes points from which the optimization problem is initially feasible, i.e.,
from which the setpoint can be reached within L time steps while satisfying the constraints.
As an alternative, one can design more general terminal ingredients, i.e., a terminal
cost function and a terminal region constraint for an extended state vector consisting of
sequential input-output values (52). To be precise, an alternative,
" non-minimal
# state of
u[t−n,t−1]
the LTI system in Equation 1 can be obtained as ξt = , compare (53, 54).
y[t−n,t−1]
Adapting data-driven output-feedback design methods from (55), it is possible to design
a terminal cost function Vf (ξ¯L (t)) = ∥ξ¯L (t)∥2P as well as a corresponding terminal region
constraint ξ¯L (t) ∈ Ξf for a sublevel set Ξf of Vf . Replacing the terminal equality constraint
in Equation 5 by these two components leads to closed-loop stability guarantees with a
significantly larger region of attraction (52). However, this approach faces an important
limitation: The design of the terminal ingredients from (52) is only applicable when the
compare Figure 1. Further, the noise is bounded by some ε̄ > 0, i.e., ∥εdk ∥∞ ≤ ε̄ for
k ∈ I[0,N −1] and ∥εt ∥∞ ≤ ε̄ for t ∈ I≥0 . The following optimization problem provides the
www.annualreviews.org • 7
SUBSPACE PREDICTIVE CONTROL
Subspace predictive control (SPC) involves a two-step procedure consisting of 1) identification of a multi-
step predictor from data and 2) using this predictor as a model for MPC (64, 65). To introduce SPC, we
partition the Hankel matrices used for prediction in Equation 4b as follows
" # Hn (ud[0,N −L−1] ) Up
H (ud
HL+n (ud ) L [n,N −1] ) Uf
= =: . 6.
HL+n (yd ) d
Hn (y[0,N −L−1] ) Yp
d
HL (y[n,N −1] ) Yf
Here, the data matrices Up , Yp correspond to the first n components of the input-output predictions, i.e.,
to the initial conditions in Equation 4c (hence, the index ’p’ for past). On the other hand, the data matrices
Uf , Yf correspond to future predictions (hence, the index ’f’). In SPC, the data are used to identify
a multi-step predictor which predicts future output values based on a future input trajectory and initial
conditions. To be precise, the predictor M is determined via least-squares estimation and takes the form
†
Up
M = Yf Uf , where A† denotes the Moore-Penrose inverse of a matrix A. Based on this predictor, SPC
Yp
solves the following optimal control problem at time t.
L−1
X ū[−n,−1] (t)
min ∥ūk (t) − us ∥2R + ∥ȳk (t) − ys ∥2Q s.t. ȳ[0,L−1] (t) = M ū[0,L−1] (t) . 7.
ū(t),ȳ(t)
k=0 ȳ[−n,−1] (t)
Thus, in contrast to direct data-driven MPC, SPC involves an a priori estimation step and, hence, it is
an indirect data-driven MPC method. In Section 5, we discuss differences between direct and indirect
data-driven MPC in more detail.
basis for controlling unknown linear systems based on noisy input-output data.
L−1
X
min ∥ūk (t) − us ∥2R + ∥ȳk (t) − ys ∥2Q + λα ∥α(t)∥22 + λσ ∥σ(t)∥22 8a.
α(t),σ(t),ū(t),ȳ(t)
k=0
" # " #
ū(t) HL+n (ud )
s.t. = α(t), 8b.
ȳ(t) + σ(t) HL+n (ỹd )
" # " #
ū[−n,−1] (t) u[t−n,t−1]
= , 8c.
ȳ[−n,−1] (t) ỹ[t−n,t−1]
ūk (t) ∈ U, k ∈ I[0,L−1] . 8d.
In contrast to data-driven MPC for noise-free data as in Section 3.1, the exact output
values are replaced by their noisy versions. Further, the scheme contains an additional op-
timization variable σ(t) ∈ Rp(L+n) , the slack variable, which ensures that the constraint 8b
remains feasible despite the noise. In order to reduce the prediction error, the slack vari-
able is penalized in the cost with regularization parameter λσ > 0. Moreover, the norm
Figure 1
Setup considered for robust data-driven MPC in Section 3.2. The figure displays a generic system
generating the data which are affected by output measurement noise ỹk = yk + εk .
of the variable α(t) is penalized with regularization parameter λα > 0 (see Section 3.2.2
for details). Finally, we note that the optimization problem 8 does not contain output
constraints, which would require a robust constraint tightening due to the noise, see (66)
for details. The optimization problem 8 can be used to set up a robust data-driven MPC
scheme in a standard receding-horizon fashion, see Algorithm 3.2.
In the following, we discuss several key issues in robust data-driven MPC: closed-loop
guarantees for bounded noise (Section 3.2.1), the role of the regularization of α(t) (Sec-
tion 3.2.2), and closed-loop guarantees for stochastic noise (Section 3.2.3).
3.2.1. Closed-loop guarantees for bounded noise. Similar to Section 3.1, Algorithm 3.2
does not guarantee closed-loop stability unless the prediction horizon is sufficiently long
or terminal ingredients are added. It was shown in (51) that stability and robustness can
be ensured by adding terminal equality constraints as in Equation 5 to the optimization
problem 8. In this case, due to a technical controllability argument, proving theoretical
guarantees requires the application in an n-step fashion (67). This means that, in each
MPC iteration, the first n optimal input components ū∗[0,n−1] (t) are applied to the system
before repeating the optimization. With these modifications, robust data-driven MPC with
terminal equality constraints guarantees that the internal state xt converges exponentially
to a neighborhood of the steady-state xs corresponding to (us , ys ), where the size of the
neighborhood depends on the noise level. Mathematically, there exist 0 < c < 1 as well as
a continuous and strictly increasing function β with β(0) = 0 such that
compare (51, Theorem 3). For ε̄ = 0, the closed loop converges to the desired setpoint.
These theoretical guarantees require analogous assumptions as in the noise-free case (e.g.,
PE data, n is an upper bound on the system order, compare Section 3.1) and additionally
that the noise level ε̄ is sufficiently small and that the regularization parameters scale as
www.annualreviews.org • 9
λα ∼ ε̄, λσ ∼ 1ε̄ . The latter conditions ensure that, in the noise-free case ε̄ = 0, robust data-
driven MPC (Algorithm 3.2) reduces to the nominal scheme (Algorithm 3.1) since λα = 0
and λσ → ∞ enforces σ(t) = 0. Beyond this stability result, the theoretical analysis reveals
direct connections between the data quality and the closed-loop performance. In particular,
the asymptotic tracking error decreases and the region of attraction increases if the minimum
singular value of the input Hankel matrix HL+2n (u) increases, which corresponds to a
quantitative PE condition (39). The possiblity to establish such end-to-end insights is a
valuable feature of direct data-driven control methods, which is often harder to achieve
in indirect approaches. While the original paper (51) required an additional non-convex
constraint ∥σ(t)∥∞ ≤ ε̄(1 + ∥α(t)∥1 ) to bound the slack variable, it was later shown in (58)
that this constraint can be dropped when λσ ∼ 1ε̄ , see also (68, Theorem 4.1).
As in the noise-free case, the occurrence of terminal equality constraints poses practical
limitations, e.g., a small region of attraction, and alternative means of guaranteeing closed-
loop stability are available. Roughly speaking, any model-based MPC scheme which admits
inherent robustness properties against disturbances can be transformed into a data-driven
MPC scheme with closed-loop guarantees in the presence of noise (69). Beyond this general
principle, the literature contains various concrete results on closed-loop guarantees in robust
data-driven MPC. For example, in the noisy data case, terminal ingredients can be designed
based on robust data-driven output-feedback design (55). By combining continuity of data-
driven MPC w.r.t. noise (70) and inherent robustness properties (69, 71), one can show that
the resulting data-driven MPC scheme admits closed-loop guarantees also in the presence of
noise, see (68, Section 4.5.2) for a discussion. Likewise, data-driven MPC without terminal
ingredients practically exponentially stabilizes the closed-loop system, assuming that the
prediction horizon L is sufficiently long (58). Notably, in both cases (terminal ingredients
and L sufficiently long), stability and robustness can be proven even for a 1-step MPC
scheme as in Algorithm 3.2. Alternatively, one can optimize over an artifical equilibrium
online which provides powerful practical features (no knowledge required if (us , ys ) is a
feasible equilibrium, improved robustness, increased region of attraction, handling online
setpoint changes) and also admits closed-loop guarantees in the presence of noise (70).
Due to the inaccurate predictions of the Fundamental Lemma with noisy data, handling
output constraints is non-trivial. By using a bound on the prediction error from (51), a
constraint tightening can be constructed which guarantees closed-loop output constraint
satisfaction (66). The conservatism of this tightening can be reduced, e.g., via a pre-
stabilizing feedback, and analogous results can be derived when the system is affected by
bounded disturbances (72).
In summary, the Fundamental Lemma can be used to design robust data-driven MPC
schemes for unknown LTI system based on noisy data. With appropriate modifications,
these schemes admit rigorous closed-loop guarantees on recursive feasibility, constraint sat-
isfaction, and practical exponential stability.
3.2.2. Regularization in data-driven MPC. While robust data-driven MPC (Algorithm 3.2)
differs from the nominal approach (Algorithm 3.1) in multiple aspects, the most crucial
modification is the regularization of α(t) in the cost. Indeed, satisfactory practical results
can often be obtained even without the slack variable σ(t). On the contrary, the absence
of the regularization of α(t) can pose difficulties already for tiny noise levels or even in
the complete absence of noise due to numerical inaccuracies. Intuitively, this regularization
can be understood as reducing the complexity of the employed prediction model based
www.annualreviews.org • 11
results (51, 58, 66, 69, 70, 72), and designing new regularization strategies via a closed-loop
perspective provides a promising future research direction.
3.2.3. Closed-loop guarantees for stochastic noise. While Section 3.2.1 has focused on
bounded noise, we now turn our attention to data-driven MPC for systems affected by
stochastic disturbances. One recent stream of works has employed Polynomial Chaos Ex-
pansions (PCE) for designing stochastic data-driven MPC schemes with closed-loop guar-
antees. Stochastic system trajectories are random variables which can be expressed (ap-
proximately) in a polynomial basis – the PCE basis. By relating the images of Hankel
matrices filled by three different quantities (random variables, PCE coefficients, and sam-
ples), a stochastic version of the Fundamental Lemma can be derived (89), see (90) for a
recent tutorial and overview of behavioral data-driven control for stochastic systems.
The stochastic Fundamental Lemma can be used to construct a data-driven MPC
scheme for unknown stochastic systems subject to chance constraints. In (91), it is shown
that this scheme admits closed-loop guarantees on recursive feasibility and practical stabil-
ity as well as closed-loop performance bound. Here, closed-loop guarantees are ensured via
terminal ingredients, i.e., a terminal cost function and a terminal region constraint, which
can be constructed based only on data. Subsequent works have extended the result in (91),
which considers state measurements and a binary selection of the initial condition as either
the measured state or a backup in case of infeasibility (the prediction from the previous
time step). In particular, (92) provides an MPC scheme with analogous guarantees for
input-output measurements and (93) optimizes the initial condition, interpolating between
the measured state and the backup. Since stochastic data-driven MPC based on PCE ad-
mits an increased computational cost, a tailored multiple shooting approach for complexity
reduction was derived in (94).
As an alternative to the PCE approach, closed-loop guarantees of data-driven MPC
with stochastic noise can be ensured by resorting to techniques from tube-based robust
MPC, i.e., computing a constraint tightening offline such that closed-loop guarantees can
be established (95). All previously mentioned approaches for stochastic data-driven MPC
require access to past disturbance realizations and, therefore, they derive data-driven es-
timation procedures. As an alternative, the recent work (96) constructs an explicit data-
driven parametrization of consistent disturbance realizations from which new samples can
be drawn, allowing to set up a sampling-based stochastic data-driven MPC scheme which
guarantees closed-loop properties via a robust first-step constraint.
www.annualreviews.org • 13
4.1. Exploiting global linearity via a nonlinear Fundamental Lemma
The recent literature contains various extensions of the Fundamental Lemma beyond LTI
systems. Tailored formulations have been derived for Hammerstein and Wiener sys-
tems (115), second-order Volterra systems (116), linear parameter-varying (LPV) sys-
tems (117), affine systems (70, 118), nonlinear autoregressive exogenous systems (119),
bilinear systems (120), linear time-periodic systems (121), and feedback linearizable sys-
tems (122). Further, nonlinear variations of the Fundamental Lemma have been developed
by resorting to multi-step predictors which are linear in basis functions (123), the Koopman
operator (124), and kernel methods (125, 126).
Each of these nonlinear versions of the Fundamental Lemma allows to design a data-
driven MPC scheme for systems in the associated system class. In the following, we show
how this can be done for Hammerstein systems via the corresponding Fundamental Lemma
from (115). Analogous MPC schemes can be designed for other nonlinear system classes
when using the corresponding version of the Fundamental Lemma, cf. above. A Hammer-
stein system is a series interconnection of a static nonlinearity and an LTI system, i.e.,
for k ∈ I≥0 , compare Figure 2. Here, ψ : Rm → Rnψ is a static nonlinear function which is
assumed to be composed of known basis functions with unknown coefficients, i.e., ψ(u) =
Pq m
i=1 ai ψi (u) for known functions ψi : R → Rnψ and unknown scalars ai ∈ R.
u ψ(u) y
Static nonlinearity LTI System
Figure 2
Block diagram illustration of a Hammerstein system, which consists of a series interconnection of
a static nonlinearity and an LTI system, compare Equation 10.
−1
For a given data trajectory {udk , ykd }N
k=0 and under suitable PE conditions, an arbitrary
L−1
input-output sequence {ūk , ȳk }k=0 is a trajectory of the system in Equation 10 if and only
if there exists α ∈ RN −L+1 such that
" # " #
HL (vd ) v̄
α= 11.
HL (yd ) ȳ
compare (115, Proposition 5). This result exploits the linearity of the system which maps
the auxiliary input v, whose components involve the actual input u and the known basis
functions ψi , to the output y. Since Equation 11 allows to accurately predict input-output
trajectories of the given Hammerstein system, it can be used to set up a data-driven MPC
scheme precisely as in the linear noise-free case considered in Section 3.1. Again, closed-
loop guarantees can be established via appropriate modifications of the basic optimization
with state xk ∈ Rn , input uk ∈ Rm , output yk ∈ Rp , and time k ∈ I≥0 . The key idea is
to use online data updates in order to parametrize trajectories of the affine linearization at
the current state, i.e., of
www.annualreviews.org • 15
with Jacobians and remainder terms
∂f ∂h
Axt = , Cxt = , ext = f (xt ) − Axt xt , rxt = h(xt ) − Cxt xt . 15.
∂x xt ∂x xt
Before introducing the data-driven MPC scheme, we take a slight detour to model-based
MPC. The work (131) proposes an MPC scheme to control nonlinear systems based on the
linearized dynamics in Equation 14, i.e., an MPC scheme based on precise model knowledge.
The approach includes a terminal equality constraint w.r.t. an artificial equilibrium xs (t)
for the linearized dynamics, i.e., a vector xs (t) satisfying
xs (t) = Axt xs (t) + Bus (t) + ext , ys (t) = Cxt xs (t) + Dus (t) + rxt 16.
for some equilibrium input us (t) and output ys (t), compare (61). The distance of the arti-
ficial equilibrium xs (t) to the actual setpoint is penalized in the cost. If the corresponding
cost weight is sufficiently small, then xs (t) is encouraged to remain close to the initial state
xt . If xt is, in turn, close to the steady-state manifold of the nonlinear system, then the full
predicted trajectory x̄(t) remains in a small region around xt , i.e., in a region where the
linearized dynamics in Equation 14 provide an accurate prediction. Based on this idea, it
can be shown that the MPC scheme based on the linearized dynamics indeed exponentially
stabilizes the setpoint in closed loop. Notably, this guarantee does not only hold locally
around the setpoint, but the region of attraction is a neighborhood of the entire steady-
state manifold, which can be significantly larger. Roughly speaking, e.g., in an autonomous
driving application, successful tracking of a position can be guaranteed when starting from
any initial position, under the restriction that the car does not drive too fast, i.e., it remains
close to the steady-state manifold.
Let us now return to a data-driven MPC setup, where the quantities f , h, B, D in Equa-
tion 13 are unknown. If, at time t, a PE input-output trajectory of the dynamics linearized
at xt was available, then the Fundamental Lemma for affine systems (70, 118) would yield
equivalent predictions to a state-space model. Thus, the results in (131) on model-based
MPC could be used to guarantee closed-loop exponential stability for the corresponding
data-driven MPC scheme. However, instead of data of the linearized dynamics, in practice
one only has access to measurements from the nonlinear system. The key insight to bridge
this gap lies in using, at time t, the most recent input-output measurements {uk , yk }t−1 k=t−N
for prediction, i.e., the predicted trajectory ū(t), ȳ(t) is parametrized via
" # " #
ū(t) HL+n (u[t−N,t−1] ) X
= α(t), αi (t) = 1. 17.
ȳ(t) HL+n (y[t−N,t−1] ) i
The first equation is analogous to the linear optimization problem 4 with the main differ-
ence of using online trajectories for prediction in the Hankel matrix rather than an offline
trajectory. The second equation ensures that the vector α(t) sums up to 1, which is required
due to the affine Fundamental Lemma (70, 118) since the linearized dynamics 14 are affine.
By using an MPC scheme with artificial equilibrium as in (131), it can be enforced
that the system does not change too rapidly in closed loop and, therefore, the trajectory
{uk , yk }t−1
k=t−N of the nonlinear system is close to a trajectory from the dynamics linearized
at xt . More precisely, if a certain cost matrix in the MPC optimization problem is chosen
small enough, then the closed-loop trajectory moves slowly such that Equation 17 provides
locally accurate predictions for the nonlinear system. Combining these ideas in a rigorous
5. DISCUSSION
When applying data-driven control to real-world systems, especially in complex and safety-
critical applications, it is desirable to provide rigorous systems-theoretic guarantees for
the closed-loop operation. In this review, we discussed direct data-driven MPC methods
based on the Fundamental Lemma which do provide such guarantees, e.g., on stability, ro-
bustness, and constraint satisfaction. The presented schemes can control unknown systems
based only on input-output data in various scenarios including linear and nonlinear systems
as well as noise-free and noisy data. As in the model-based case, closed-loop stability of
data-driven MPC is not necessarily guaranteed but needs to be ensured via suitable terminal
ingredients or a sufficiently long prediction horizon. When controlling linear systems based
on noise-free data, techniques from model-based MPC can be borrowed, but the technical
analysis needs to additionally cope with input-output cost functions. In the more realistic
scenario of noisy data (bounded or stochastic), desirable guarantees can be provided under
appropriate modifications of the MPC scheme, e.g., adding a regularization. For nonlinear
systems, we discussed two alternatives based on either global or local linearity. The former
www.annualreviews.org • 17
approach exploits that systems from specific classes can be represented in linear coordinates
using knowledge of suitable basis functions, allowing to formulate nonlinear versions of the
Fundamental Lemma that can be used for MPC. Alternatively, nonlinear systems can be
controlled by updating the data used for prediction via the Fundamental Lemma online,
exploiting local linearity of (smooth) nonlinear dynamics. In guaranteeing closed-loop prop-
erties for the presented approaches, the key challenge is to combine established concepts
from model-based MPC with unique challenges when using the Fundamental Lemma for
prediction, in particular inaccuracies due to noise or nonlinearities.
A natural question in the context of data-driven MPC is the connection between direct
approaches based on the Fundamental Lemma and indirect schemes such as SPC (132).
For linear systems and noise-free data, basic formulations of SPC and data-driven MPC are
equivalent since both employ exact predictions (133, 134). This equivalence carries over to
more advanced formulations, e.g., including regularization (compare Section 3.2.2), and the
synergy between both approaches has led to the development of new data-driven control
methods (77, 60, 135). In the presence of inaccuracies, e.g., noisy data or nonlinearities,
the predictions in direct and indirect data-driven MPC are no longer equivalent and study-
ing performance gaps becomes relevant. The literature contains various empirical studies,
see (5, 36, 37) and the references therein, but also theoretical results indicating that the
open-loop performance of either direct or indirect approaches can be superior depending on
different factors including the data length (136). Deriving formal closed-loop results on this
gap, e.g., considering stability or closed-loop performance guarantees but also conservatism
of constraint tightenings, is an open research problem. To the best of the authors’ knowl-
edge, the existing SPC literature does not contain closed-loop guarantees under assumptions
comparable to the direct data-driven MPC results covered in this review, in particular in
the presence of noise or nonlinearities. Deriving such guarantees may provide a meaningful
step towards understanding the interplay between SPC and direct data-driven MPC.
In terms of complexity, indirect approaches have the advantage of being less com-
putationally demanding, whereas the number of decision variables in direct approaches
grows with the data length, compare the optimization problems 4 and 8. This has mo-
tivated contributions on complexity reduction in direct data-driven MPC, e.g., based on
wavelets (137), trajectory segmentation (138), singular value decomposition (139), alter-
native data-driven system representations (140), iterative solvers (141), and differentiable
convex programming (142). Finally, another noteworthy distinction arises when designing
indirect data-driven MPC schemes via identified state-space models, which admit several
inherent differences to the multi-step predictor based on the Fundamental Lemma (143).
Data-driven MPC is a promising modern control approach with remarkable empirical
performance and a solid theoretical foundation. We conclude the review by discussing open
research directions that may further enhance the theoretical understanding and reliability
of data-driven MPC, thereby simplifying its application in challenging control problems.
FUTURE ISSUES
1. Data-driven MPC for nonlinear systems: Data-driven methods are most useful in
cases where first-principles models are hard to obtain, e.g., for complex systems
with nonlinear dynamics. Current nonlinear data-driven MPC approaches are either
tailored to specific system classes, which limits their practicality, or employ online
data updates, which requires PE closed-loop trajectories. Developing a unifying
DISCLOSURE STATEMENT
The authors are not aware of any affiliations, memberships, funding, or financial holdings
that might be perceived as affecting the objectivity of this review.
ACKNOWLEDGMENTS
The authors are thankful to Tim Martin for helpful comments. F. Allgöwer is thankful that
this work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research
Foundation) under Germany’s Excellence Strategy - EXC 2075 - 390740016 and within
grant AL 316/15-1 - 468094890. The authors would like to thank the Stuttgart Center for
Simulation Science (SimTech) for the support.
LITERATURE CITED
1. Ljung L. 1987. System Identification: Theory for the User. Prentice-Hall, Englewood Cliffs,
NJ
2. Tsiamis A, Ziemann I, Matni N, Pappas GJ. 2023. Statistical learning theory for control: a
finite-sample perspective. IEEE Control Systems Magazine 43(6):67–97
3. Hou ZS, Wang Z. 2013. From model-based control to data-driven control: Survey, classification
and perspective. Information Sciences 235:3–35
4. Willems JC, Rapisarda P, Markovsky I, De Moor B. 2005. A note on persistency of excitation.
Syst. Contr. Lett. 54:325–329
5. Markovsky I, Dörfler F. 2021. Behavioral systems theory in data-driven analysis, signal pro-
cessing, and control. Annual Reviews in Control 52:42–64
www.annualreviews.org • 19
6. Rawlings JB, Mayne DQ, Diehl MM. 2020. Model Predictive Control: Theory, Computation,
and Design. Nob Hill Pub. 3rd printing
7. Hewing L, Wabersich KP, Menner M, Zeilinger MN. 2020. Learning-based model predictive
control: Toward safe learning in control. Annual Review of Control, Robotics, and Autonomous
Systems 3:269–296
8. Adetola V, Guay M. 2011. Robust adaptive MPC for constrained uncertain nonlinear systems.
Int. J. Adaptive Control and Signal Processing 25(2):155–167
9. Tanaskovic M, Fagiano L, Smith R, Morari M. 2014. Adaptive receding horizon control for
constrained MIMO systems. Automatica 50(12):3019–3029
10. Lu X, Cannon M, Koksal-Rivet D. 2021. Robust adaptive model predictive control: perfor-
mance and parameter estimation. Int. J. Robust and Nonlinear Control 31(18):8703–8724
11. Yang H, Li S. 2015. A data-driven predictive controller design based on reduced hankel matrix.
In Proc. Asian Control Conference, pp. 1–7
12. Coulson J, Lygeros J, Dörfler F. 2019. Data-enabled predictive control: in the shallows of the
DeePC. In Proc. European Control Conf. (ECC), pp. 307–312
13. Elokda E, Coulson J, Lygeros J, Dörfler F. 2021. Data-enabled predictive control for quad-
copters. Int. J. Robust and Nonlinear Control 31(18):8916–8936
14. Berberich J, Köhler J, Müller MA, Allgöwer F. 2021. Data-driven model predictive control:
closed-loop guarantees and experimental results. at-Automatisierungstechnik 69(7):608–618
15. Carlet PG, Favato A, Bolognani S, Dörfler F. 2022. Data-driven continuous-set predictive
current control for synchronous motor drives. IEEE Trans. Power Electronics 37(6):6637–
6646
16. Fawcett RT, Afsari K, Ames AD, Hamed KA. 2022. Toward a data-driven template model for
quadrupedal locomotion. IEEE Robotics and Automation Letters 7(3):7636–7643
17. Müller D, Feilhauer J, Wickert J, Berberich J, Allgöwer F, Sawodny O. 2022. Data-driven
predictive disturbance observer for quasi continuum manipulators. In Proc. 61st IEEE Conf.
Decision and Control (CDC), pp. 1816–1822
18. Hemming S, de Zwart F, Elings E, Petropoulou A, Righini I. 2020. Cherry tomato production
in intelligent greenhouses - sensors and AI for control of climate, irrigation, crop yield, and
quality. Sensors 20(22):6430
19. Wang J, Lian Y, Jiang Y, Xu Q, Li K, Jones CN. 2023. Distributed data-driven predictive con-
trol for cooperatively smoothing mixed traffic flow. Transportation Research Part C: Emerging
Technologies 155:104274
20. Wang J, Zheng Y, Li K, Xu Q. 2023. DeeP-LCC: data-enabled predictive leading
cruise control in mixed traffic flow. IEEE Trans. Control Systems Technology Doi:
10.1109/TCST.2023.3288636
21. Shang X, Wang J, Zheng Y. 2023. Smoothing mixed traffic with robust data-driven predictive
control for connected and autonomous vehicles. arXiv:2310.00509
22. Rimoldi A, Cenedese C, Padoan A, Dörfler F, Lygeros J. 2023. Urban traffic congestion control:
a DeePC change. arXiv:2311.09851
23. Huang L, Coulson J, Lygeros J, Dörfler F. 2019. Data-enabled predictive control for grid-
connected power converters. In Proc. 58th IEEE Conf. Decision and Control (CDC), pp.
8130–8135
24. Huang L, Zhen J, Lygeros J, Dörfler F. 2021. Quadratic regularization of data-enabled pre-
dictive control: theory and application to power converter experiments. IFAC-PapersOnLine
54(7):192–197
25. Huang L, Coulson J, Lygeros J, Dörfler F. 2021. Decentralized data-enabled predictive con-
trol for power system oscillation damping. IEEE Trans. Control Systems Technology Doi:
10.1109/TCST.2021.3088638
26. Lian Y, Shi J, Koch MP, Jones CN. 2021. Adaptive robust data-driven building control via
bi-level reformulation: an experimental result. arXiv:2106.05740
www.annualreviews.org • 21
to data-driven analysis and control. IEEE Control Systems Magazine 43(6):32–66
49. Raff T, Huber S, Nagy ZK, Allgöwer F. 2006. Nonlinear model predictive control of a four
tank system: An experimental stability study. In Proc. IEEE Int. Conf. Control Applications
(CCA), pp. 237–242
50. Grüne L. 2012. NMPC without terminal constraints. In Proc. IFAC Conf. Nonlinear Model
Predictive Control, pp. 1–13
51. Berberich J, Köhler J, Müller MA, Allgöwer F. 2021. Data-driven model predictive control
with stability and robustness guarantees. IEEE Trans. Automat. Control 66(4):1702–1717
52. Berberich J, Köhler J, Müller MA, Allgöwer F. 2021. On the design of terminal ingredients
for data-driven MPC. IFAC-PapersOnLine 54(6):257–263
53. Goodwin GC, Sin KS. 2014. Adaptive filtering prediction and control. Courier Corporation
54. Koch A, Berberich J, Allgöwer F. 2022. Provably robust verification of dissipativity properties
from data. IEEE Trans. Automat. Control 67(8):4248–4255
55. Berberich J, Scherer CW, Allgöwer F. 2023. Combining prior knowledge and data for robust
controller design. IEEE Trans. Automat. Control 68(8):4618–4633
56. van Waarde HJ, Eising J, Camlibel MK, Trentelman HL. 2024. A behavioral approach to data-
driven control with noisy input-output data. IEEE Trans. Automat. Control 69(2):813–827
57. Alsalti M, Lopez VG, Müller MA. 2023. Notes on data-driven output-feedback control of linear
MIMO systems. arXiv:2311.17484
58. Bongard J, Berberich J, Köhler J, Allgöwer F. 2023. Robust stability analysis of a simple data-
driven model predictive control approach. IEEE Trans. Automat. Control 68(5):2625–2637
59. Lazar M. 2021. A dissipativity-based framework for analyzing stability of predictive controllers.
IFAC-PapersOnLine 54(6):159–165
60. Lazar M, Verheijen PCN. 2023. Generalized data-driven predictive control: merging subspace
and Hankel predictors. Mathematics 11(9):2216
61. Limón D, Alvarado I, Alamo T, Camacho EF. 2008. MPC for tracking piecewise constant
references for constrained linear systems. Automatica 44(9):2382–2387
62. Berberich J, Köhler J, Müller MA, Allgöwer F. 2020. Data-driven tracking MPC for changing
setpoints. IFAC-PapersOnLine 53(2):6923–6930
63. Cai C, Teel AR. 2008. Input–output-to-state stability for discrete-time systems. Automatica
44(2):326–336
64. Favoreel W, Moor BD, Gevers M. 1999. SPC: Subspace predictive control. IFAC Proceedings
Volumes 32(2):4004–4009
65. Huang B, Kadali R. 2008. Dynamic modeling, predictive control and performance monitoring:
a data-driven subspace approach. Springer
66. Berberich J, Köhler J, Müller MA, Allgöwer F. 2020. Robust constraint satisfaction in data-
driven MPC. In Proc. 59th IEEE Conf. Decision and Control (CDC), pp. 1260–1267
67. Grüne L, Palma VG. 2015. Robustness of performance and stability for multistep and updated
multistep MPC schemes. Discrete and Continuous Dynamical Systems 35(9):4385–4414
68. Berberich J. 2022. Stability and robustness in data-driven predictive control. Ph.D. thesis,
University of Stuttgart
69. Berberich J, Köhler J, Müller MA, Allgöwer F. 2022. Stability in data-driven MPC: an inherent
robustness perspective. In Proc. 61st IEEE Conf. Decision and Control (CDC), pp. 1105–1110
70. Berberich J, Köhler J, Müller MA, Allgöwer F. 2022. Linear tracking MPC for nonlinear sys-
tems part II: the data-driven case. IEEE Trans. Automat. Control 67(9):4406–4421Extended
version on arXiv:2105.08567
71. Yu S, Reble M, Chen H, Allgöwer F. 2014. Inherent robustness properties of quasi-infinite
horizon nonlinear model predictive control. Automatica 50(9):2269–2280
72. Klöppelt C, Berberich J, Allgöwer F, Müller MA. 2022. A novel constraint tightening ap-
proach for robust data-driven predictive control. Int. J. Robust and Nonlinear Control Doi:
10.1002/rnc.6532
www.annualreviews.org • 23
97. Allibhoy A, Cortés J. 2021. Data-based receding horizon control of linear network systems.
IEEE Control Systems Lett. 5(4):1207–1212
98. Alonso CA, Yang F, Matni N. 2022. Data-driven distributed and localized model predictive
control. IEEE Open Journal of Control Systems 1:29–40
99. Köhler M, Berberich J, Müller MA, Allgöwer F. 2022. Data-driven distributed MPC of dy-
namically coupled linear systems. IFAC-PapersOnLine 55(30):365–370
100. Breschi V, Sassella A, Formentin S. 2023. On the design of regularized explicit predictive
controllers from input-output data. IEEE Trans. Automat. Control 68(8):4977–4983
101. Xie Y, Berberich J, Allgöwer F. 2023. Linear data-driven economic MPC with generalized
terminal constraint. IFAC-PapersOnLine 56(2):5512–5517
102. Liu W, Sun J, Wang G, Bullo F, Chen J. 2023. Data-driven resilient predictive control under
denial-of-service. IEEE Trans. Automat. Control 68(8):4722–4737
103. Liu W, Sun J, Wang G, Bullo F, Chen J. 2023. Data-driven self-triggered control via trajectory
prediction. IEEE Trans. Automat. Control 68(11):6951–6958
104. Deng L, Shu Z, Chen T. 2024. Event-triggered robust MPC with terminal in-
equality constraints: a data-driven approach. IEEE Trans. Automat. Control Doi:
10.1109/TAC.2024.3357417
105. Bajelani M, van Heusden K. 2023. Data-driven safety filter: An input-output perspective.
arXiv:2309.00189
106. Bajelani M, van Heusden K. 2024. From raw data to safety: reducing conservatism by set
expansion. arXiv:2403.15883
107. Schmitz P, Lanza L, Worthmann K. 2023. Safe data-driven reference tracking with prescribed
performance. In Proc. 27th Int. Conf. on System Theory, Control and Computing (ICSTCC),
pp. 454–460
108. Khaledi M, Tooranjipour P, Kiumarsi B. 2023. Data-driven safety-certified predictive control
for linear systems. IEEE Control Systems Lett. 7:3687–3692
109. van Waarde HJ, Camlibel MK, Mesbahi M. 2022. From noisy data to feedback controllers:
non-conservative design via a matrix S-lemma. IEEE Trans. Automat. Control 67(1):162–175
110. Nguyen HH, Friedel M, Findeisen R. 2023. LMI-based data-driven robust model predictive
control. IFAC-PapersOnLine 56(2):4783–4788
111. Xie Y, Berberich J, Allgöwer F. 2024. Data-driven min-max MPC for linear systems: robust-
ness and adaptation. arXiv:2404.19096
112. Schmitz P, Faulwasser T, Worthmann K. 2022. Willems’ fundamental lemma for linear de-
scriptor systems and its use for data-driven output-feedback MPC. IEEE Control Systems
Lett. 6:2443–2448
113. Zhang K, Zuliani R, Balta EC, Lygeros J. 2024. Data-enabled predictive iterative control.
IEEE Control Systems Lett. Doi: 10.1109/LCSYS.2024.3408073
114. Wolff TM, Lopez VG, Müller MA. 2022. Robust data-driven moving horizon estimation for
linear discrete-time systems. arXiv:2210.09017
115. Berberich J, Allgöwer F. 2020. A trajectory-based framework for data-driven system analysis
and control. In Proc. European Control Conf. (ECC), pp. 1365–1370
116. Rueda-Escobedo JG, Schiffer J. 2020. Data-driven internal model control of second-order dis-
crete Volterra systems. In Proc. 59th IEEE Conf. Decision and Control (CDC), pp. 4572–4579
117. Verhoek C, Tóth R, Haesart S, Koch A. 2021. Fundamental Lemma for data-driven analysis
of linear parameter-varying systems. In Proc. 60th IEEE Conf. Decision and Control (CDC),
pp. 5040–5046
118. Martinelli A, Gargiani M, Draskovic M, Lygeros J. 2022. Data-driven optimal control of affine
systems: a linear programming perspective. arXiv:2203.12044
119. Mishra VK, Markovsky I, Fazzi A, Dreesen P. 2021. Data-driven simulation for NARX sys-
tems. In Proc. European Signal Processing Conference, pp. 1055–1059
120. Yuan Z, Cortés J. 2022. Data-driven optimal control of bilinear systems. IEEE Control Systems
www.annualreviews.org • 25