











































Studia grazie alle numerose risorse presenti su Docsity
Guadagna punti aiutando altri studenti oppure acquistali con un piano Premium
Prepara i tuoi esami
Studia grazie alle numerose risorse presenti su Docsity
Prepara i tuoi esami con i documenti condivisi da studenti come te su Docsity
Trova i documenti specifici per gli esami della tua università
Preparati con lezioni e prove svolte basate sui programmi universitari!
Rispondi a reali domande d’esame e scopri la tua preparazione
Riassumi i tuoi documenti, fagli domande, convertili in quiz e mappe concettuali
Studia con prove svolte, tesine e consigli utili
Togliti ogni dubbio leggendo le risposte alle domande fatte da altri studenti come te
Esplora i documenti più scaricati per gli argomenti di studio più popolari
Ottieni i punti per scaricare
Guadagna punti aiutando altri studenti oppure acquistali con un piano Premium
Appunti completi e riordinati del corso "Data Driven" tenuto dal professor Garatti presso il Politecnico di Milano. Il corso introduce l'idea dell'adaptive control e implementa 3 metodi diversi: self-tuning, VRFT e Q-learning. Gli appunti in alcune parti ripetono passaggi simili ma sono estremamente utili per comprendere meglio quello che succede. Spero possano tornare utili a chi li acquisterà!
Tipologia: Appunti
1 / 51
Questa pagina non è visibile nell’anteprima
Non perderti parti importanti!












































The aim of the course is to learn some methods to use data to design a controller able to work in uncertain, time-varying
environments robustly.
We start with an example: Given a tank, we will study its behaviour in case of uncertainty of parameters.
௧
is the volume of water at time 𝑡
𝑢(𝑡) is the incoming flow of water
𝑝(𝑡) is the outgoing flow of water
௧
Equation of the system:
௧ାଵ
௧
௧
We can rewrite it substituting 𝑉
௧
and
The control goal is to follow a given setpoint 𝑟
optimal control law: 𝑢(𝑡) = −𝑎 ∙ 𝑥(𝑡) + 𝑟(𝑡)In this way: 𝑥(𝑡 + 1) = 𝑟(𝑡)
What does it change if a is not well-known?
In this case 𝑎 ∈ [𝛽, 𝛾]
To obtain an optimal control law we need the exact value of 𝑎 but this is not possible. We can use a guess of 𝑎:
The system becomes:
At equilibrium:
Based on the value of 𝑎
we can have an overshoot (𝑎
< 𝑎) or an undershoot (𝑎
To add robustness at the control law we add an integrator
But it has not a good transient. Whatever 𝑘 we will choose the system always has a pole with magnitude
Integrator control scheme has always a slow dynamic irrespective of 𝑘.
With this example we demonstrate that linear robust control can work but is very conservative. It can work only in the cases
where we do not have strictly requirements.
௧
𝑝(𝑡)+leakage
The system 𝑆 is a mathematical description of a part of reality. It links some quantities through some relations.
These quantities are called:
Dependent: their values depend on other quantities (e.g. state and output)
o Observable
o Non-observable (hidden)
Independent: their values do not depend on other quantities (e.g. input)
o Tuneable (as input)
o Exogenous (ad noise or disturbance)
Known
Unknown
Uncertain system
An uncertain system needs a robust control.
The control problem is to decide the values of the tuneable quantities based on the observations of the observable
dependent quantities so that their behaviour meets some design goals irrespective of the values taken by the independent
uncertain quantities.
𝑠 represents an instance of the system for a given value of the uncertain quantities.
𝑠 ∈ 𝑆 →Class of systems for different instances of the uncertain quantities.
Same for controller: 𝑐 ∈ 𝐶 (𝑐 instance of the policy and 𝐶 class of controllers, for example PID)
Index of performances:
It evaluates the behaviour of the dependent quantities when 𝑠 is operating according to 𝑐.
Example:
max|𝑒𝑖𝑔|
We can redefine the control problem as:
Find 𝑐 ∈ 𝐶: 𝐽(𝑐, 𝑠) ≤ 𝑘 ∀ 𝑠 ∈ 𝑆
Find a controller that premises to achieve all the control goals
Two cases:
This is a lucky situation
In this case a classic control is sufficient (no need of adaptation or data-driven)
Design goals not too strict and small uncertainties (additive and parametric uncertainties)
None of the admissible policies is a solution for the robust problem
We must change our admissible policy: 𝐶 → 𝐶
ᇱ
→ Enlarged class of policies for which a solution can be
found.
Tricky, 𝐶′ cannot be too big otherwise we are not able to operate with it.
Adaptation
It is a general method to find a proper enlarged policy class 𝐶′ so that a solution to the robust problem can be found.
To work we need that:
At least a control capable to solve the system
Adaptation rule: we implement a policy 𝑐 ∈ 𝐶 that is not fixed. An additional layer, the tuner, modifies the control law based
on the information that is gathered from the system.
The pro is that follows a rule that is built on our knowledge.
The con is that works as second feedback𝐶′ is more complex and difficult to study (𝐶′ policy is to use policy 𝐶 but
premising the switching between control laws)
Remark:
Adaptation and data-driven CSD (Control System Design) is not related only to t-varying systems
Obviously, it helps a lot for t-varying systems, but it is not used only for this.
A t-varying system does not necessarily need an adaptive control
o Example
𝑎 = 0.9 + 0.05 ∙ sin
The system is time-varying but well-known
Optimal control law: 𝑢(𝑡) = −𝑎 ∙ 𝑥(𝑡) + 𝑟(𝑡)
Adaptive control is used in case of:
t-invariant systems with uncertainties (if strictly requirements)
t-varying systems with uncertainties (in 99% of the cases. Difficult to achieve robust control with traditional method)
Exploitation and exploration trade-off
The additional feedback leads, in adaptive control schemes, a trade-off between exploitation and exploration.
The exploitation is called also primal effect and indicates that the behaviour of 𝑆 depends on the tuneable parameter (input).
The exploration is called also dual effect and indicates that the choice of the input influences the estimated model 𝑆
. More
the system is excited the better the model 𝑆
. (But the aim of the controller is not to excite the system but to keep it at
steady-state).
Therefore, exploitation and exploration are:
Opposing:
o ExploitationKeep 𝑆 at steady state
o ExplorationExcite 𝑆 to improve the model 𝑆
Intertwined:
o Input impacts on 𝑆
and 𝑆
on input.
An adaptive scheme automatically determines a trade-off between them. The optimal trade-off is not obvious.
Taxonomy
Offline Online
Indirect Basic identification control design Self-Tuning
Direct VRFT
Reinforcement learning
Q-learning
Tuner
The model 𝑆 is not completely trusted. 𝑢(𝑡) is decided accounting for an estimation of 𝑆 − 𝑆
(model
mismatch)
we remain cautious and avoid trusting 𝑆
too much so that the real system 𝑆 is not brought in operating
conditions which are more difficult to recover when we recognize that the system has not behaved as
expected.
In indirect control approach we develop the controller based on 𝑆
This controller is good if 𝐽(𝐶, 𝑆
) ≈ 𝐽(𝐶, 𝑆) (Indicators of performance are equals)
But instilling this requirement is very difficult.
Direct approach jumps the identification of the system 𝑆
and goes directly to an indicator of performance starting from data
We use the data to find an estimate of 𝐽(𝐶, 𝑆) directly.
Example
𝑓(𝑥) is an unknown real function.
ௗ
ௗ
The objective is to evaluate ∫ 𝑓
∙
I measure {𝑥 ଵ
ଶ
ே
Indirect approach:
(𝑥) = min
ఏ
ଵ
ே
ఏ
ଶ
ே
ୀଵ
∙
is the evaluation for ∫
∙
Analysing the difference:
∙
∙
The error remains big for large 𝑁 when 𝑑 is high
Direct approach:
ଵ
ே
ே
ୀଵ
Analysing the difference:
ே
ୀଵ
∙
The error is independent to 𝑑.
To have an error smaller than 𝜖 (e.g. 𝜖 = 0.1 and 𝑑 = 6) we need:
st
case:
ଵ
√
ே
ଵ
ఢ
nd
case:
ଵ
√
ே
ଵ
ఢ
మ
Online indirect method with sub-tuner that follows the CEP logic.
The main elements are:
Uncertain system 𝑆
Identification algorithm ID
The controller 𝐶
ଵ
It is a linear time-varying stochastic (SISO) system
𝑦(𝑡): outputdependent observable variable
𝑢(𝑡): inputIndependent tuneable variable
𝑒(𝑡): additive disturbance 𝑒(𝑡)~𝑊𝑁(0, 𝜆
ଶ
)uncertain exogenous independent quantity
𝑦( 0 ), … , 𝑦(−𝑚 + 1): initial state uncertain exogenous independent quantity
Robustness to these uncertainties is achievable also by means of standard control methodsAdaptation not
necessary
ଵ
: parameters uncertain t-varying exogenous independent quantity
Difficult to address with standard control techniquesCalled for adaptation
We have that 𝑏
(𝑡) ≠ 0 ∀ 𝑡 and 𝑑 ≥ 1 (𝑑 is the delay, strictly proper system)
Parameter vector:
ଵ
்
Vector of regression variables:
்
We can rewrite the system in compact form:
்
்
ି ଵ
ି ଵ
(For hypothesis 𝐴(𝑧
ି ଵ
(𝑡)) and 𝐵(𝑧
ି ଵ
(𝑡)) are coprimeNo cancellation between numerator and denominator, in
this way we avoid non-observable or non-reachable parts, and the control problem is well-defined)
Identification algorithm ID
𝑆 is a t-varying ARX (Autoregressive with exogenous input).
We can use a PEM (Prediction Error Minimization) for the identification of the model. One possible implementation is the
Least Square Identification. (It fits very well ARX models)
Therefore, we search the model to fit 𝑆 in the ARX model class.
ଵ
(same model as before)
We assume to know the order of 𝑆𝑛, 𝑚 and 𝑑 known.
We assume that 𝜃 is constant
We can rewrite the model in compact form:
்
At time 𝑡 − 1, 𝜓(𝑡 − 1)
்
is known
A prediction of 𝑦(𝑡) given the information at 𝑡 − 1 is:
்
ே
= argmin
ఏ
்
ଶ
்
ே
ୀଵ
ே
்
்
ே
ୀଵ
்
்
்
ே
ୀଵ
்
்
ே ்
ୀଵ
்
ே
ୀଵ
்
்
ே ்
ୀଵ
்
்
ே
ୀଵ
ି ଵ
்
்
ே
ୀଵ
ே
ே ்
ୀଵ
ି ଵ
ே
ୀଵ
This is the identification block in self-tuning.
This implementation of the LS (least square) is trivial:
Computationally expensiveInverse matrix
Numerical instability
A lot of memory is allocated (𝑢( 1 ), … , 𝑢(𝑡), 𝑦( 1 ), … , 𝑦(𝑡))
Two states:
௧
௧ ்
ୀଵ
ି ଵ
௧
ୀଵ
Information matrix: 𝑆(𝑡) = 𝑆
௧ ்
ୀଵ
௧
ି ଵ
௧
ୀଵ
𝑆(𝑡) = 𝑆
̅
்
௧
ୀଵ
= 𝑆
̅
்
்
=
௧ିଵ
ୀଵ
𝑆(𝑡 − 1) + 𝜓(𝑡 − 1) ∙ 𝜓(𝑡 − 1)
்
𝑆(𝑡 + 1) = 𝑆(𝑡) + 𝜓(𝑡) ∙ 𝜓(𝑡)
்
𝑆
( 𝑡 + 1
) ∙ 𝜃
𝑡+
( 𝑡 + 1
) ∙ 𝑆
( 𝑡 + 1
)
−
ഥ
ഥ
( 𝑖 − 1
) ∙ 𝑦
( 𝑖
)
𝑡+
𝑖=
ቍ
𝑡+
ഥ
ഥ
𝑡+
𝑖=
ഥ
ഥ
𝑡
𝑖=
𝑡
௧ାଵ
−
௧
௧ାଵ
−
𝑇
௧
௧ାଵ
௧
−
𝑇
௧
Where:
𝑆(𝑡 + 1)
ିଵ
்
𝑡
൯ is the innovation to be added to 𝜃
௧
𝑺(𝒕 + 𝟏)
ି 𝟏
∙ 𝝍(𝒕): is the weight of the information at 𝑡 + 1
𝑻
𝒕
: is the prediction error for 𝑦(𝑡 + 1) made at time 𝑡 with the
best current estimate of the parameter vector.
Initialization: 𝜃
௧ାଵ
௧
𝑡 + 1
−
∙ 𝜓
𝑡
𝑡
𝑇
௧
𝑇
Pro:
𝜃
௧
returned by recursive least squareExact least square without approximation
Con:
Not practical:
o Computational demandinginverse of a matrix: 𝑆(𝑡 + 1)
ିଵ
o Numerical unstable 𝑆(𝑡 + 1) = 𝑆(𝑡) + 𝜓(𝑡) ∙ 𝜓(𝑡)
்
diverge asymptotically
Let 𝐹, 𝐺, 𝐻 and 𝐾 be matrices with suitable dimensions s.t.:
𝐹, 𝐻 and 𝐹 + 𝐺 ∙ 𝐻 ∙ 𝐾 are invertible
Then:
ି ଵ
ି ଵ
ି ଵ
ି ଵ
ି ଵ
ି ଵ
ି ଵ
It cures both the problems.
The idea is that instead of computing the inverse of 𝑆(𝑡 + 1)
ିଵ
we can deal directly with the inverse:
ି ଵ
And we can find a formula to compute 𝑉(𝑡 + 1) from 𝑉(𝑡) and 𝜓(𝑡)This is not computational demanding.
ି ଵ
𝑇
ି ଵ
And using the lemma we obtain:
−
−
𝑇
−
ଵି
𝑇
−
௧ାଵ
௧
𝑇
௧
𝑉
𝑡
∙ 𝜓
𝑡
𝑡
𝑇
𝑡
𝑇
ି ଵ
Computationally inexpensive (inverse of a scalar ൫1 + 𝜓(𝑡)
𝑇
−
ିଵ
(The only hard inverse is 𝑆
ି ଵ
Numerical stable 𝑉(𝑡) does not diverge
For simplicity we use the I form:
௧ାଵ
௧
𝑡 + 1
−
∙ 𝜓
𝑡
𝑡
𝑇
௧
𝑇
𝜓(𝑡) ∙ 𝜓(𝑡)
்
≥ 0 → 𝑆(𝑡 + 1) = 𝑆(𝑡) + 𝜓(𝑡) ∙ 𝜓(𝑡)
்
diverges
Since this 𝜃
௧ାଵ
௧
−
𝑇
௧
௧
Also the prediction of the parameters convergeIf 𝜃 changes the algorithm is not able to update the value of 𝜃
௧
The identification is done through:
ே
ଶ
ே
ୀଵ
்
Every prediction error has always the same weight 𝑆
Issue for time-varying parameter
௧
is built on data that refers to too much old information
௧ାଵ
௧
ି ଵ
்
௧ାଵ
௧
ି ଵ
்
௧
்
With 𝜇 < 1, 𝑆(𝑡) evolves as an asymptotic stable 1
st
order system (it does not diverge) even if 𝜓(𝑡) ∙ 𝜓(𝑡)
்
is strictly positive.
ି ଵ
does not go to zero and 𝜃
௧
is always reactive to new data.
We can have a singularity issue known with the name of blow-up phenomenon.
With 𝜇 < 1:
்
and if 𝜓(𝑡) is not informative (e.g. 𝜓(𝑡) = [0 0 ⋯ 0]) then 𝑆(𝑡 + 1) → 0
This initially is not a big problem since the prediction vector is:
௧ାଵ
௧
ି ଵ
்
௧
And the product 𝑆(𝑡 + 1)
ି ଵ
∙ 𝜓(𝑡) is:
Big ∙ 0 = 0𝜃
௧ାଵ
௧
The problem arises when the system returns informative 𝜓(𝑡) ≠ 0 and the product 𝑆(𝑡 + 1)
ି ଵ
∙ 𝜓(𝑡) is:
Big ∙ (≠ 0) ≠ 0 and its value is very high𝑆(𝑡 + 1)
ି ଵ
௧ାଵ
changes suddenly and strongly.
Example
This behaviour is not good when identification is used in a control scheme.
I said that this can occur every time the 𝜓(𝑡) becomes not informative, not necessarily equal to 0 as in the example.
If 𝜓(𝑡) = [1 1 ⋯ 1]𝐷𝑒𝑡൫𝑆(𝑡)൯ = 0 approaches singularity and 𝑆(𝑡)
ି ଵ
divergesBlow up phenomenon
We can prevent it looking if 𝑆(𝑡) is approaching singularity (𝑆(𝑡) is observable). And if it is near a singularity, we change the
adopted strategy for the forgetting factor.
We will look to the value of 𝑟𝑐𝑜𝑛𝑑൫𝑆(𝑡)൯ (condition number:
௦
ି ଵ
௧
Burst
In this way 𝑆(𝑡)
ି ଵ
doesn’t diverge and we avoid the burst.
Characterization of the performances
Given a general self-tuning scheme:
௧
൯ is a basic controller𝐶൫𝜃
௧
𝐶(𝜃) is any linear controller whose parameters are functions of 𝜃:
𝑢(𝑡) = 𝛼 ଵ
(𝜃) ∙ 𝑢(𝑡 − 1) + ⋯ + 𝛼
ഀ
(𝜃) ∙ 𝑢(𝑡 − 𝑛 ఈ
) + 𝛽
(𝜃) ∙ 𝑦(𝑡) + ⋯ + 𝛽
ഁ
(𝜃) ∙ 𝑦൫𝑡 − 𝑛 ఉ
൯ + 𝛾
(𝜃) ∙ 𝑟(𝑡) + ⋯ + 𝛾
ം
(𝜃) ∙ 𝑟൫𝑡 − 𝑛 ఊ
൯
In compact form:
The specific parameters 𝛼
(𝜃) and 𝛾
(𝜃) depend on the chosen control technique, which can be chosen depending
on the application.
The analysis of the self-tuning will be conditional to the chosen control technique (=self-tuning can be as good as what can
be achieved by the chosen control scheme (PID, PI, etc…)).
is a PIDSelf-tuning performance will be as that of a PID
𝐶(𝜃) is an LQGSelf-tuning performance will be comparable to the one of a LQG scheme
There is no need to specify the controller class completely since these results are general.
Given a scheme as:
We call Σ(𝜃, 𝜃) the set of standard linear control schemes in which we connect the controller 𝐶(𝜃) with the model 𝑀(𝜃)
Where satisfactory means that with the chosen controller 𝐶(𝜃) the control specifications are met (design goals).
If 𝐶(𝜃) is badly chosen with respect to design goals, then the set Ξ will be small.
Obviously Ξ cannot be as large as the domain since there exist intrinsic limitations as observability and reachability.
ି ଵ
௧
௧
Irrespective of the experiments, 𝜃
(𝑡) converges to some finite vector 𝜃
ஶ
Always ∃ 𝜃
ஶ
ஶ
௧ାଵ
௧
ି ଵ
்
௧
௧ାଵ
௧
ି ଵ
்
்
௧
Defining:
௧
௧
: LS estimation error
௧ାଵ
௧
ି ଵ
்
௧
Multiplying for 𝑆(𝑡 + 1):
௧ାଵ
௧
்
௧
்
௧
And since:
்
௧ାଵ
்
௧
௧
This is an invariant of the LS algorithm:
௧ାଵ
௧
௧
ି ଵ
Convergence: 𝜃
ஶ
௧
ஶ
ି ଵ
But we yet know that 𝑆
ି ଵ
converges (since 𝑆
்
diverges)
ିଵ
Positive definite
“A square and symmetric matric Γ is said positive definite if 𝑋
்
Given two symmetric, squared matrices 𝑀 and 𝑁 with same dimensionality, we say that 𝑀 ≥ 𝑁 if 𝑀 − 𝑁 ≥ 0
(and 𝑋
்
்
்
ି ଵ
ି ଵ
ି ଵ
Decreasing function
𝑆(𝑡)
ିଵ
≥ 𝑆(𝑡 + 1)
ିଵ
↔ 𝑋
்
∙ 𝑆(𝑡)
ିଵ
∙ 𝑋 ≥ 𝑋
்
∙ 𝑆(𝑡 + 1)
ିଵ
∙ 𝑋
𝑋
்
∙ 𝑆
̅
ି ଵ
∙ 𝑋 ≥ ⋯ ≥ 𝑋
்
∙ 𝑆(𝑡 + 1)
ିଵ
∙ 𝑋 ≥ 0
Therefore 𝑋
்
ି ଵ
∙ 𝑋 converges asymptotically
ି ଵ
ଵଵ
ଵ
ଵ
Choosing 𝑋 = [1 0 … 0]: 𝑋
்
ି ଵ
ଵଵ
ଵଵ
ஶ
Choosing 𝑋 = [0 1 … 0]: 𝑋
்
ି ଵ
ଶଶ
ଶଶ
ஶ
Each 𝑉
ஶ
Then choosing 𝑋 =
்
ି ଵ
ଵଵ
ଶଶ
ଵଶ
ଵଵ
ଵଵ
ஶ
ଶଶ
ଶଶ
ஶ
ଵଶ
ଵଶ
ஶ
Each 𝑉
ஶ
ି ଵ
ஶ
ஶ
௧
௧
ஶ
ஶ
ஶ
௧
ି ଵ
ஶ
௧
ஶ
ି ଵ
ஶ
In adaptive control 𝑽 ஶ
= 𝟎 cannot be guaranteed!
We would like now to prove that Σ൫𝜃
௧
൯ behaves like a linear control scheme with no uncertainty Σ(𝜃, 𝜃) irrespective of
the fact that 𝑆 is unknown.
Naïve expectation:
Self-tuning behaves as Σ(𝜃
o Best possible resultSelf-tuning behaves as control scheme we would design if system was known
o Impossible result in general
Since 𝜃
௧
ஶ
we may instead expect that self-tuning behaves like Σ൫𝜃
ஶ
ஶ
I/O of self-tuning:
௧
௧
௧
௧ାଵ
௧
ି ଵ
்
௧
்
Input: 𝑟(𝑡)
Output: 𝑧(𝑡) =
௧
We have yet seen that 𝜃
௧
and 𝑆(𝑡)
ି ଵ
converge. Then we are interested in 𝑧(𝑡).
௧
௧
௧
is the estimate obtained in the self-tuning scheme when
RLS is fed by 𝑦(𝑡) and 𝑢(𝑡) and NOT 𝑢
௧
is seen as an exogenous input in this scheme and does
not depend on 𝑢
(𝑡) and 𝑢(𝑡) ≠ 𝑢
௧
௧
൯ is a linear time varying systemቊ
௧
௧
௧
௧
௧
This system does not exist in practise. We can simulate it and use it as comparing term for self-tuning.
௧
௧
There are 2 reasons why 𝜃
ஶ
∈ Ξ robustly:
ஶ
depends on:
a. 𝜃
b. Dataset 𝑢(𝑖), 𝑦(𝑖)
c. 𝜃
and 𝑆
initialization of RLS
For those basic controllers for which Ξ
େ
is small enough (e.g. in pole placement Ξ
େ
is a linear subspace so its
dimensionality is lower than the 𝜃 domain), if 𝜃
is chosen with a perturbation, then 𝑃൫𝜃
ஶ
If we find a 𝜃
which leads to Ξ
େ
→ then we can perturb a bit one
of the dimensions of 𝜃
and 𝜃
ஶ
will NOT tend Ξ
େ
to edit the RLS algorithm so that as soon as we see that 𝜃
௧
is approaching Ξ
େ
we can drift the estimate away from
େ
without altering the properties of the algorithm. Therefore 𝜃
ஶ
∈ Ξ is always guaranteed.
ஶ
௧
Before starting we do some observations/simplifications:
∗
→A.S. is always a requirement
ஶ
∈ Ξ, therefore, there is an instant 𝑡
after which 𝜃
௧
௧
∈ Ξ ∀t ≥ t
To simplify the discussion, we assume that 𝜃
௧
∈ Ξ ∀t
→No conceptual changes.
௧
௧
௧
௧
௧
ଵ
ି ଵ
ି
ଵ
ି ଵ
ି
்
The system 𝑆 is: 𝑦
்
்
௧
௧
்
௧
்
௧
ଵ,௧
,௧
,௧
,௧
்
௧
௧
௧
௧
௧
௧
௧
େ
ஶ
ஶ
ஶ
େ
ஶ
େ
ஶ
Imaginary control scheme
௧
௧
௧
௧
௧
௧
௧
Additional input 𝑒(𝑡):
Non-linear feedback
்
௧
௧
௧
The whole proof of the theorem can be now split in three steps:
௧
௧
൯ is asymptotically stable also if is t-varying
The system can be written as:
௧
௧
௧
௧
: state
𝑟(𝑡): input
(𝑡): output
The system is A.S. if ‖𝜉(𝑡)‖ → 0 ↔ 𝐹 ൫
௧
is Hurwitz stable ቚ
௧
We cannot say that if 𝜃
௧
∗
௧
௧
This is not valid because we are changing the parameters over time.
Example:
Both the systems are A.S.
But if we combine the dynamics:
The overall system is not A.S. 𝐹(𝑡) Hurwitz ∀𝑡 is not enough to have A.S.
We can instead evaluate Σ൫𝜃
ஶ
ஶ
൯: In this case, since the system is LTI, for A.S. is sufficient that 𝐹൫𝜃
ஶ
൯ is
Hurwitz stable
If 𝜃
ஶ
∗
ஶ
ஶ
௧
௧
is asymptotically stable if:
௧
∗
௧
ஶ
∗
Intuitively this is because from a certain moment Σ൫𝜃
௧
௧
൯ is not switching anymore
We will see that 𝑒(𝑡) → 0 always. It is asymptotically vanishing irrespective of the operating conditions.
்
௧
௧
ି ଵ
ି ଵ
By analysing our feedback, we can prove that 𝜓(𝑡) is bounded ∀𝑡
𝜉
ଵ
ଶ
ଵ