
Model-Based Output-Difference Feedback Optimal Control

1 Introduction
This document investigates a model-based method to design the optimal Output-Difference Feedback Controller (ODFC). We begin by assuming the presence of an observer that provides an unbiased estimate of the state, represented mathematically as:

x̂_k = x_k + ϵ_k,    ϵ_k ∼ N(0, Σ_ϵ)

2 Theorem 3.1: Optimal Control Problem


Consider the optimal control problem defined by equations (2)–(5). The optimal state-feedback controller gain K^* is given by:

K^* = (R̄ + B^T P^* B)^{-1} (B^T P^* A + N̄^T)

where P^* > 0 is the solution to the Algebraic Riccati Equation (ARE):

A^T P^* A − P^* − (A^T P^* B + N̄)(R̄ + B^T P^* B)^{-1} (B^T P^* A + N̄^T) + Q̄ = 0

Here, Q̄ = Ā^T Q_x Ā, R̄ = B^T Q_x B + R, N̄ = Ā^T Q_x B, and Ā = A − I.
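To make the construction concrete, the sketch below forms Q̄, R̄, N̄ for a hypothetical plant and solves the ARE numerically. It relies on scipy.linalg.solve_discrete_are, whose optional s argument accepts exactly the cross term N̄ appearing above; all system matrices and weights here are illustrative placeholders, not values from this document.

```python
import numpy as np
from scipy.linalg import solve_discrete_are

# Illustrative placeholders for the plant and weights (not from the paper).
A = np.array([[0.95, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [0.1]])
Qx = np.eye(2)          # state weight Q_x
R = np.array([[0.5]])   # input weight R

# Transformed matrices from Theorem 3.1.
Abar = A - np.eye(A.shape[0])   # Abar = A - I
Qbar = Abar.T @ Qx @ Abar       # Qbar = Abar' Qx Abar
Rbar = B.T @ Qx @ B + R         # Rbar = B' Qx B + R
Nbar = Abar.T @ Qx @ B          # Nbar = Abar' Qx B

# scipy's DARE solver accepts the cross term through `s`:
#   A'PA - P - (A'PB + s)(r + B'PB)^{-1}(B'PA + s') + q = 0
P_star = solve_discrete_are(A, B, Qbar, Rbar, s=Nbar)

# Optimal gain K* = (Rbar + B'P*B)^{-1}(B'P*A + Nbar').
K_star = np.linalg.solve(Rbar + B.T @ P_star @ B, B.T @ P_star @ A + Nbar.T)
print(K_star)
```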

3 Average Cost
The average cost associated with K^* is given by:

λ_{K^*} = Tr(A_eff^T Q_x A_eff Σ_ϵ) + Tr(Q_x W_w) + 2 Tr(Q_y W_v) + Tr(K^{*T} B^T P^* K^* W_v) + Tr(P^* (W_w + Σ_ϵ)) − Tr((A − BK^*)^T P^* (A − BK^*) Σ_ϵ)

3.1 Deriving Each Term

1. State cost: Tr(A_eff^T Q_x A_eff Σ_ϵ) captures the cost associated with the state estimation error.

2. Process-noise cost: Tr(Q_x W_w) reflects the cost of the process noise affecting the state.

3. Output-noise cost: 2 Tr(Q_y W_v) represents the cost linked to the output noise.

4. Feedback-gain cost: Tr(K^{*T} B^T P^* K^* W_v) captures the cost incurred by the control action based on the feedback gain K^*.

5. Covariance cost: Tr(P^* (W_w + Σ_ϵ)) accounts for the combined effect of the process-noise covariance and the estimation-error covariance.

6. Adjustment for feedback: −Tr((A − BK^*)^T P^* (A − BK^*) Σ_ϵ) adjusts for the effect of the feedback control on the state dynamics.
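For readers who want to evaluate these terms numerically, the following sketch simply transcribes the trace formula. Every input (the gain, the Riccati solution, the effective matrix A_eff, and the covariances Σ_ϵ, W_w, W_v) is assumed to be precomputed and dimensionally conformable exactly as the formula is written above.

```python
import numpy as np

def average_cost(A, B, K, P, Qx, Qy, A_eff, Sigma_eps, Ww, Wv):
    """Transcription of the average-cost formula for a given gain K.

    All arguments are assumed precomputed and conformable as written in
    the formula above (e.g., a square system, so every product is defined).
    """
    Acl = A - B @ K  # closed-loop matrix A - B K
    return (
        np.trace(A_eff.T @ Qx @ A_eff @ Sigma_eps)  # state estimation error
        + np.trace(Qx @ Ww)                         # process noise
        + 2.0 * np.trace(Qy @ Wv)                   # output noise
        + np.trace(K.T @ B.T @ P @ K @ Wv)          # feedback-gain term
        + np.trace(P @ (Ww + Sigma_eps))            # covariance term
        - np.trace(Acl.T @ P @ Acl @ Sigma_eps)     # feedback adjustment
    )
```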

4 Proof Overview
The proof resembles results for linear stochastic systems with state-dependent quadratic costs, following procedures similar to those found in [?]. The optimal feedback gain K^* is derived by minimizing the Bellman equation, leading to the satisfaction of equations (9) and (10).

5 Theorem 3.2: Iterative Algorithm


Let K_0 be any stabilizing state-feedback controller gain and let P_i > 0 be the solution of the Lyapunov equation:

A_i^T P_i A_i − P_i + Q̄ + K_i^T R̄ K_i − K_i^T N̄^T − N̄ K_i = 0

where i = 0, 1, 2, … and A_i = A − BK_i. For K_{i+1} calculated as:

K_{i+1} = (R̄ + B^T P_i B)^{-1} (B^T P_i A + N̄^T)

the following holds:

• A − BK_{i+1} is Schur.

• P^* ≤ P_{i+1} ≤ P_i.

• lim_{i→∞} P_i = P^* and lim_{i→∞} K_i = K^*.
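The iteration translates directly into code. Below is a minimal sketch, assuming the transformed matrices Q̄, R̄, N̄ from Theorem 3.1 are already available and the supplied K0 is stabilizing; each policy-evaluation step is handled by scipy's discrete Lyapunov solver.

```python
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

def policy_iteration(A, B, Qbar, Rbar, Nbar, K0, tol=1e-10, max_iter=500):
    """Iterates the Lyapunov/gain updates of Theorem 3.2 until P converges."""
    K, P_prev = K0, None
    for _ in range(max_iter):
        Ai = A - B @ K
        # Policy evaluation: Ai'P Ai - P + Qbar + K'Rbar K - K'Nbar' - Nbar K = 0.
        Qi = Qbar + K.T @ Rbar @ K - K.T @ Nbar.T - Nbar @ K
        P = solve_discrete_lyapunov(Ai.T, Qi)
        # Policy improvement: K_{i+1} = (Rbar + B'P B)^{-1}(B'P A + Nbar').
        K = np.linalg.solve(Rbar + B.T @ P @ B, B.T @ P @ A + Nbar.T)
        if P_prev is not None and np.linalg.norm(P - P_prev, "fro") < tol:
            break
        P_prev = P
    return K, P
```

Since the theorem guarantees P^* ≤ P_{i+1} ≤ P_i, the iterates decrease monotonically and the Frobenius-norm stopping rule is safe once they stabilize.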

6 Proof Overview
The proof follows arguments similar to those in [?] (Theorem 3.1) and is therefore omitted here.

7 Theorem 3.3: Parameterized Observer
A parameterized observer is introduced to estimate the system state x_k from the output-difference measurements. The observer can be combined with (8) to provide a solution for the optimal control problem.
The state parametrization is given as:

x̄_k = Γ_u α_k + Γ_y β_k

For an observable system, x̄_k converges exponentially in mean to the state x_k as k → ∞. The estimation error is given by:

x̃_k ≡ x_k − x̄_k ∼ N(0, Σ_ϵ)

where Σ_ϵ is a bounded error covariance matrix.

7.1 Matrices and Updates


The matrices Γ_u and Γ_y contain system-dependent transfer-function coefficients. The updates for α_k and β_k are defined as follows:

α_{k+1}^i = A α_k^i + B u_k^i,    ∀i = 1, 2, …, m

β_k^i = C σ_k^i + D (y_k^i − y_{k−1}^i),    ∀i = 1, 2, …, p

where u^i and y^i are the i-th input and output, respectively.

7.2 Existence of the Observer


The existence of the parametrization is equivalent to the existence of the difference-feedback state observer:

x̄_{k+1} = (A − LCA + LC) x̄_k + (B − LCB) u_k + L (y_{k+1} − y_k)

where L is the observer gain. The mean and covariance of the estimation error can be determined from this formulation.
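A single update of this observer is easy to implement. The sketch below transcribes the recursion; the gain L is a hypothetical placeholder that would in practice be designed so that the error dynamics are stable.

```python
import numpy as np

def observer_step(x_bar, u, y, y_next, A, B, C, L):
    """One step of the difference-feedback observer:
    x_bar_{k+1} = (A - LCA + LC) x_bar_k + (B - LCB) u_k + L (y_{k+1} - y_k)
    """
    A_obs = A - L @ C @ A + L @ C   # observer state matrix
    B_obs = B - L @ C @ B           # observer input matrix
    return A_obs @ x_bar + B_obs @ u + L @ (y_next - y)
```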

8 Derivation of Discrete-Time ARE


The Algebraic Riccati Equation (ARE) is a fundamental equation in optimal
control theory, particularly for discrete-time linear systems. Below, we derive
the discrete-time ARE from the principles of optimal control.

8.1 Discrete-Time Linear System
Consider a discrete-time linear system described by:

x_{k+1} = A x_k + B u_k

where:

• x_k is the state vector at time k,

• u_k is the control input,

• A is the state transition matrix,

• B is the input matrix.

8.2 Cost Function


We want to minimize a quadratic cost function of the form:

J = Σ_{k=0}^{∞} ( x_k^T Q x_k + u_k^T R u_k + 2 x_k^T N u_k )

where:

• Q is a positive semi-definite matrix,


• R is a positive definite matrix,
• N is a matrix that captures the coupling between the state and control
inputs.
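As a sanity check on the derivation that follows, one can simulate the closed loop under a linear policy u_k = −K x_k and compare the accumulated cost with the quadratic value x_0^T P x_0 predicted by the Bellman analysis below. A minimal sketch with hypothetical matrices:

```python
import numpy as np
from scipy.linalg import solve_discrete_are

# Hypothetical system and weights, purely for illustration.
A = np.array([[0.9, 0.2], [0.0, 0.7]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
N = np.zeros((2, 1))  # no state-input coupling in this toy example

P = solve_discrete_are(A, B, Q, R, s=N)
K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A + N.T)

# Accumulate the stage costs along a long closed-loop trajectory.
x0 = np.array([1.0, -1.0])
x, J = x0.copy(), 0.0
for _ in range(2000):
    u = -K @ x
    J += x @ Q @ x + u @ R @ u + 2.0 * (x @ N @ u)
    x = A @ x + B @ u

print(J, x0 @ P @ x0)  # the two numbers should agree closely
```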

8.3 Bellman Equation


The optimal control problem can be formulated using the Bellman equation. The value function V(x) represents the minimum cost-to-go from state x:

V(x) = min_u { x^T Q x + u^T R u + 2 x^T N u + V(Ax + Bu) }

Assuming a quadratic form for the value function:

V(x) = x^T P x

where P is a positive semi-definite matrix, we can write:

V(Ax + Bu) = (Ax + Bu)^T P (Ax + Bu)

8.4 Substituting into the Bellman Equation
Substituting back into the Bellman equation, we have:

V(x) = min_u { x^T Q x + u^T R u + 2 x^T N u + x^T A^T P A x + x^T A^T P B u + u^T B^T P A x + u^T B^T P B u }

Grouping terms, we get:

V(x) = min_u { x^T (Q + A^T P A) x + u^T (R + B^T P B) u + 2 x^T (A^T P B + N) u }

8.5 Minimizing the Cost Function


To minimize this quadratic expression with respect to u, we take the derivative and set it to zero:

∂V/∂u = 2 (R + B^T P B) u + 2 (B^T P A + N^T) x = 0

Solving for u gives:

u^* = −(R + B^T P B)^{-1} (B^T P A + N^T) x
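To double-check this step numerically, one can compare u^* against a general-purpose minimizer applied to the bracketed quadratic. The matrices below are arbitrary placeholders; scipy.optimize.minimize serves only as an independent reference.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)

# Arbitrary placeholder matrices (P symmetric PSD, R PD).
A = rng.standard_normal((3, 3)) * 0.5
B = rng.standard_normal((3, 2))
M = rng.standard_normal((3, 3))
P = M @ M.T                      # symmetric positive semi-definite
Q = np.eye(3)
R = 2.0 * np.eye(2)
N = rng.standard_normal((3, 2)) * 0.1
x = rng.standard_normal(3)

def bellman_rhs(u):
    """The expression minimized over u in the Bellman equation."""
    z = A @ x + B @ u
    return x @ Q @ x + u @ R @ u + 2.0 * (x @ N @ u) + z @ P @ z

# Closed-form minimizer derived above.
u_star = -np.linalg.solve(R + B.T @ P @ B, (B.T @ P @ A + N.T) @ x)

# Independent numerical check.
u_num = minimize(bellman_rhs, np.zeros(2)).x
print(np.allclose(u_star, u_num, atol=1e-5))  # expect True
```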


8.6 Substituting Back into the Cost Function


Substituting u^* back into the Bellman equation yields:

V(x) = x^T (Q + A^T P A) x − x^T (A^T P B + N) (R + B^T P B)^{-1} (B^T P A + N^T) x

which can be written as:

V(x) = x^T ( Q + A^T P A − (A^T P B + N) (R + B^T P B)^{-1} (B^T P A + N^T) ) x


Since V(x) = x^T P x must hold for every x, the matrix in parentheses must equal P:

Q + A^T P A − (A^T P B + N) (R + B^T P B)^{-1} (B^T P A + N^T) = P

8.7 Algebraic Riccati Equation


Rearranging gives us the discrete-time Algebraic Riccati Equation (ARE):

A^T P A − P − (A^T P B + N) (R + B^T P B)^{-1} (B^T P A + N^T) + Q = 0

9 Conclusion
The discrete-time ARE is a key result in optimal control, allowing us to compute the optimal feedback gain matrix K^* using:

K^* = (R + B^T P B)^{-1} (B^T P A + N^T)


The solution P can be found using various numerical methods, such as iterative algorithms or matrix factorizations.
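As a closing sanity check, the snippet below (with hypothetical matrices) confirms that a numerically computed P satisfies the ARE and that the resulting gain renders A − BK Schur:

```python
import numpy as np
from scipy.linalg import solve_discrete_are

# Hypothetical matrices, for illustration only.
A = np.array([[1.1, 0.3], [0.0, 0.9]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
N = np.array([[0.1], [0.0]])

P = solve_discrete_are(A, B, Q, R, s=N)
K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A + N.T)

# ARE residual should vanish; A - BK should be Schur (all |eigenvalues| < 1).
residual = A.T @ P @ A - P - (A.T @ P @ B + N) @ K + Q
print(np.max(np.abs(residual)))                      # ~ 0
print(np.max(np.abs(np.linalg.eigvals(A - B @ K))))  # < 1
```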
