Professional Documents
Culture Documents
Kersting Et Buss - 2017 - Direct and Indirect Model Reference Adaptive Contr
Kersting Et Buss - 2017 - Direct and Indirect Model Reference Adaptive Contr
Abstract—This article proposes direct and indirect model class of linear switched systems in which the switching signal is
reference adaptive control strategies for multivariable state-dependent instead of exogenous. Recent examples for the
piecewise affine systems, which constitute a popular tool application of PWA systems can be found in switching power
to model hybrid systems and approximate nonlinear sys-
tems. A chosen reference model, which can be linear or also converters [2], biosystems [3], pneumatic systems [4], or cruise
piecewise affine, describes the desired closed-loop system control systems [5]. While this list is by no means complete, it
behavior that is to be achieved by the adaptive controllers gives a good understanding of the ubiquitous presence of PWA
for unknown system dynamics. Each subsystem acquires systems due to their universal approximation capabilities for
its own set of control gains, which is tuned under care-
nonlinear systems.
ful consideration of the switching behavior. In the indirect
approach, the use of dynamic gain adjustment avoids sin- Switching generally complicates the analysis of a dynami-
gularities in the certainty equivalence principle. It is shown cal system and the controller design. The research on hybrid
for both algorithms that the state of the reference model systems extended the linear system theory and developed new
is tracked asymptotically given a common Lyapunov func- stability concepts [6]–[8], definitions for observability and con-
tion for the switched reference model is available. Further- trollability [9] as well as observer [10] and controller designs
more, parameter convergence in both the direct and indirect
approach is proven for sufficiently rich reference signals. [11]. In the next step, it is to be noted that hybrid systems may
Finally, both algorithms are evaluated in numerical simu- contain unknown or time-varying parameters, which sets the
lations and their advantages and disadvantages are dis- stage for adaptive control of hybrid systems. Model reference
cussed. adaptive control (MRAC) forces the unknown system to track
Index Terms—Model reference adaptive control (MRAC), a specified reference system. By choosing a suitable reference
piecewise affine (PWA) systems, switching. system, this technique may, thus, remove undesired switching
and nonlinearities from the open-loop hybrid systems, resulting
I. INTRODUCTION in a simple linear closed-loop system. Existing approaches to-
NCREASING complexity in technical systems lead to a wards this goal can be divided into MRAC for general switched
I growing interest in hybrid systems to efficiently handle
switching behavior or approximate nonlinearities. Hybrid sys-
systems and MRAC for PWA systems.
For the general class of switched systems various adaptive
tems consist of continuous dynamics and a switching mecha- controllers have been proposed in recent years [12]–[15]. A
nism [1]. Different subclasses for hybrid systems exist, such model reference robust adaptive control algorithm for uncertain
as mixed logical dynamical systems, linear complementarity switched linear systems is proposed in [12]. The proof of uni-
systems, and piecewise affine (PWA) systems. In PWA systems, form boundedness under arbitrary switching is based on multiple
the state-input space is partitioned into convex polytopes and the Lyapunov functions constructed from a sufficient LMI condi-
system dynamics are governed by different linear subsystems tion. Furthermore, a dead band is introduced in order to sup-
in each polytope. Hence, PWA systems also constitute a sub- press the bursting phenomenon in adaptive systems. A critical
assumption in MRAC for switched systems is exact knowledge
of the switching signal. The case in which the reference sys-
Manuscript received June 3, 2016; revised September 23, 2016 and
March 3, 2017; accepted March 21, 2017. Date of publication March tem and controller do not switch synchronously with the plant
30, 2017; date of current version October 25, 2017. This work was is investigated in [13]. Dwell-time constraints are derived that
supported in part by the European Research Council under the Euro- ensure practical global stability of the switched error system.
pean Union’s Seventh Framework Programme (FP/2007-2013)/ERC un-
der Grant 267877 and in part by the Technische Universität München— While work in [13] is restricted to switched linear systems, [14]
Institute for Advanced Study (www.tum-ias.de), funded by the German extends the proposed framework to nonlinear switched systems
Excellence Initiative. Recommended by Associate Editor D. Dochain. with nonlinear reference systems. This extension enables more
(Corresponding author: Stefan Kersting.)
The authors are with the Chair of Automatic Control Engineering, TUM sophisticated performance requirements as shown by the exam-
Institute for Advanced Study, Technische Universität München, München ple of a highly maneuverable aircraft. Wang and Zhao point
80333, Germany (e-mail: stefan.kersting@tum.de; mb@tum.de). out that deriving dwell-time constraints for switched systems
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org. may be infeasible without knowledge of the actual subsystem
Digital Object Identifier 10.1109/TAC.2017.2690060 parameters [15]. Instead, their adaptive state tracking algorithm
This work is licensed under a Creative Commons Attribution 3.0 License. For more information, see http://creativecommons.org/licenses/by/3.0/
KERSTING AND BUSS: DIRECT AND INDIRECT MODEL REFERENCE ADAPTIVE CONTROL FOR MULTIVARIABLE PIECEWISE AFfiNE SYSTEMS 5635
is based on a hyper-stability criterion which relates to passivity. the state of the art. It will be shown that working with affine
The proposed control, however, assumes that also the switching subsystems requires less previous knowledge about the system.
signal of the system can be controlled. This is unsuitable for As opposed to [21]–[24], this enables application of the algo-
PWA systems where the switching is always state-dependent. rithm to systems with uncertain or unknown operating points.
This brings us to the class of PWA systems, for which there Then, we derive an indirect MRAC algorithm which is based
are various MRAC approaches available in the literature [16]– on our previous work on parameter identifiers [26]. Parame-
[24]. Early work by di Bernardo et al. extends the minimal ter identifiers generate estimates of the subsystem parameters
control synthesis algorithm to bimodal PWA systems [16]–[18]. and the control gains are dynamically adjusted based on these
Later, di Bernardo et al. proposed a hybrid MRAC strategy for estimates. The unique advantage of this approach is that esti-
multimodal PWA systems [19] in which the state-input space mates of all system parameters are obtained in parallel to the
partitioning of the controlled system may be different form the control task. For both the direct and the indirect approach, the
reference system. Excluding sliding modes, stability in [19] stability proofs for asymptotic state tracking rely on Lyapunov
was proven by passivity-based arguments as well as Lyapunov theory and dwell-time assumptions. Moreover, parameter con-
theory. For the Lyapunov-based proof, a common Lyapunov- vergence is guaranteed under PE. Overall, this paper yields the
function of the reference system’s subsystems is required. The first MRAC laws for multivariable PWA systems, which over-
extension of this hybrid MRAC algorithm in [20] is concerned comes restrictions to PWL systems, single-input systems or
with potential instabilities of the original algorithms due to dis- systems in control canonical form, postulated by state of the art
turbances at the input in form of the affine terms of the plant and algorithms [16]–[24].
reference system. The solution is based on a common Lyapunov The remainder of this paper is structured as follows. In
function and can guarantee convergence of the error system even Section II, we provide required preliminaries such as the defini-
when the system enters a sliding mode. Note that the referenced tion of PWA systems and the control structure. Also the formal
work by di Bernardo et al. focuses on single input systems in MRAC problem is stated. The direct and indirect MRAC al-
control canonical form which restricts the class of applicable gorithms are presented in Sections III and IV, respectively. In
systems. both cases, we consider reference system tracking separately
Work by Sang and Tao focuses on MRAC for piecewise lin- from the additional requirement of parameter convergence. The
ear (PWL) systems which approximate nonlinear systems by application of direct and indirect MRAC is demonstrated in
linearization at multiple operating points [21]–[24]. The state Section V by means of numerical examples. Finally, Section VI
tracking algorithm proposed in [21] and [22] ensures bounded concludes the paper.
signals and asymptotic tracking for arbitrary fast switching in
case a common quadratic Lyapunov function (CQLF) exists. In
case no such Lyapunov function exists, signals are shown to be II. PRELIMINARIES AND PROBLEM FORMULATION
bounded, and an upper bound on the tracking error is given.
This section defines PWA systems and discusses briefly how
Asymptotic tracking can be ensured through parameter projec-
they naturally arise from the linearization of a nonlinear system
tion combined with PE and slow switching. A more detailed
at multiple operating points. Furthermore, a motivation is given,
analysis of the proposed algorithm is given in [23] with careful
why PWA systems are favorable for MRAC compared to PWL
treatment of various special cases related to the existence of
systems. Thereafter, the reference system is introduced and the
a common Lyapunov function or the level of excitation in the
MRAC problem is formulated. The section concludes with some
reference signal. A multivariable extension of the algorithm is
preliminaries on signal properties and the Filippov concept for
proposed in [24].
sliding mode solutions.
Closely related to adaptive control of PWA systems is their
recursive identification. In particular, our previous works [25],
[26] provide algorithms for the subsystem identification of
A. PWA Systems
continuous-time PWL and PWA systems in state-space form.
Furthermore, the achievable convergence rates were improved The considered PWA system consists of a total of s ∈ N
by using concurrent learning [27]. A recent survey of PWA affine subsystems with dynamics
system identification was given in [28].
Despite the revised list of publications on adaptive control
ẋ(t) = Ai x(t) + Bi u(t) + fi , i = 1, . . . , s (1)
of switched systems, there still is a lack of MRAC algorithms
for multivariable PWA systems [29]. Our contribution in this
paper is to fill this gap by means of two approaches: One di- where x ∈ Rn is the state vector, u ∈ Rp is the control input,
rect and one indirect MRAC algorithm. First, we build upon the and Ai ∈ Rn ×n , Bi ∈ Rn ×p , and fi ∈ Rn are the unknown,
direct approach by Sang and Tao [23] to define direct MRAC constant system parameters. The pairs (Ai , Bi ) are assumed to
for PWA systems. Especially for parameter convergence, the be controllable and each Bi be of full column rank. For the
additional affine terms require further analysis. By adding one common case p ≤ n, the full column rank is usually satisfied by
additional assumption and revising the proof for persistence of physical systems.
excitation, all gains are guaranteed to converge to their nominal The state-input space [x , u ] ∈ Rn +p is partitioned into s
values. This extension provides great advantages compared to polyhedral regions Ωi . The i-th polyhedral region is defined by
5636 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 62, NO. 11, NOVEMBER 2017
a set of μi ∈ N linear inequalities each region is characterized by a minimal error with respect to
⎧ ⎡ ⎤ ⎫
⎪ h1,i ⎡ ⎤ ⎪
the original nonlinear system. Considering only the deviations
⎨ x ⎬ Δxi = x − x∗i and Δui = u − u∗i , and assuming additionally
x ⎢ ⎥
Ωi = ∈ Rn +p ⎣ ... ⎦ ⎣ u ⎦ [i] 0 (2) equilibrium operating points with g(x∗i , u∗i ) = 0, the same con-
⎪
⎩ u ⎪
⎭
hμ ,i 1 siderations yield a PWL model with linear subsystems
i
where [i] represents an element-wise list of operators < and ẋ ≈ Ai Δxi + Bi Δui . (7)
≤. Each row vector h(·,·) defines a hyperplane which divides
the state-input space into two half spaces and is of the form As both the affine subsystems (6) and the linear subsystems
h = [α1 , . . . , αn , β1 , . . . , βp , γ] ∈ R1×(n +p+1) . Hence, Ωi is (7) constitute first order approximations of the nonlinear dy-
the intersection of μi half spaces. The discrimination between namics g, they share the same approximation capabilities and
< and ≤ specifies whether the half space is open or closed and result in the same partitioning of the state-input space. The cen-
hence whether the hyperplane does not or does belong to the tral difference is that PWA models replace the local deviations
partition, respectively. Each hyperplane must belong to either from the operating points, i.e., Δxi , Δui , by the globally valid
one of the two neighboring partitions. This is necessary to ob- measurements x and u at the expense of the additional uncertain
tain a complete partition of the state space without overlapping terms fi . That means, working with PWA models, as opposed to
regions, i.e., Ωi ∩ Ωj = ∅, ∀i = j. working with PWL models, requires no knowledge of the oper-
Each of the subsystems in (1) belongs to one of the regions Ωi . ating points x∗i and u∗i as well as the values of g(x∗i , u∗i ). Hence,
Thus, the region which contains the current state-input vector PWA models are more favorable in the design and application
determines which subsystem is active. This is expressed with of adaptive algorithms as less knowledge about the system has
the indicator functions to be introduced a priori.
1, if (x(t), u(t)) ∈ Ωi
χi (t) = (3) B. Reference System Modeling
0, otherwise.
The aim of this paper is to design MRAC laws for PWA sys-
As the regions do not overlap, it follows that si=1 χi (t) = 1 tems. In MRAC, a reference system, which defines the desired
and χi (t)χj (t) = 0, i = j. The dynamics of the PWA system behavior of the controlled system, is chosen by the designer.
are then given by The adaptive control law causes the controlled system to track
ẋ(t) = A(t)x(t) + B(t)u(t) + f (t) (4) the reference system despite parameter uncertainties in the con-
s s trolled system.
A(t) = i=1 Ai χi (t), B(t) = i=1 Bi χi (t), and f (t)
where The reference system in this paper is chosen to be PWA in
= si=1 fi χi (t). Hence, in the following, a matrix without in- order to give enough flexibility to the designer. Let the reference
dex refers to the currently active matrix, i.e., A → Ai with system and the controlled PWA system share the same state-
χi = 1. space partitions Ωi . Hence, we consider the reference system
The adaptive control algorithms derived in this paper are
based on the PWA system representation (4). In order to demon- ẋm = Am xm + Bm r + fm (8)
strate the benefit of PWA systems compared to PWL systems,
a brief revision of how they are obtained by linearizing a non- where xm ∈ Rn and r ∈ Rp are the state of the reference sys-
linear system around multiple operating points is appropriate. tem and the reference signals, respectively. The parameters
Consider the nonlinear system of the reference system switch according to the same indi-
s functions χi in(3)
cator
s
and are therefore given by Am =
s
ẋ(t) = g (x(t), u(t)) (5) A
i=1 m i i χ , Bm = B
i=1 m i i χ , and fm = i=1 fm i χi with
n ×n n ×p
where g : Rn +p → Rn is a smooth (continuously differentiable) Am i ∈ R , Bm i ∈ R , and fm i ∈ R , ∀i = 1, . . . , s.
n
nonlinear vector function. Note that, throughout the paper, we Note that simply applying the partitions of the controlled
omit the dependency on time t as long as it is clear from the PWA system also for the reference system would unnecessarily
context. limit the choice of reference systems. Take for example the case
Neglecting higher order terms in the linearization of g around in which the designer wants to introduce certain thresholds into
an operating point (x∗i , u∗i ) yields the affine model the behavior of the closed-loop system at which the dynamics
are supposed to change. In that case, the reference system may
ẋ ≈ g(x∗i , u∗i ) + Ai (x − x∗i ) + Bi (u − u∗i ) need to feature a switch at states where the controlled system
is continuous. Hence, to gain additional freedom, an original
= Ai x + Bi u + g(x∗i , u∗i ) − Ai x∗i − Bi u∗i
partition Ωi of the controlled PWA system may be divided into
= Ai x + Bi u + fi (6) convex subsets Ωk and Ωl with Ωi = Ωk ∪ Ωl . While the pa-
)
)
rameters of the controlled PWA system remain the same in both
where Ai = ∂ g ∂(x,u
x (x ∗i ,u ∗i )
, Bi = ∂ g ∂(x,u
u (x ∗i ,u ∗i )
, and fi = g regions, e.g., Ak = Al = Ai , the reference system parameters
∗ ∗ ∗ ∗
(xi , ui ) − Ai xi − Bi ui . A PWA model is obtained by lin- can be chosen differently in both sets, i.e., Am k = Am l . In prac-
earizing around various operating points {(x∗i , u∗i )}i=1,...,s and tice, this enables to switch the reference system independently
partitioning the state space such that the active subsystem in from the controlled system.
KERSTING AND BUSS: DIRECT AND INDIRECT MODEL REFERENCE ADAPTIVE CONTROL FOR MULTIVARIABLE PIECEWISE AFfiNE SYSTEMS 5637
Stability of the reference system (8) is essential for the track- takes the form
ing problem. Assuming that each subsystem Am i is stable, it
follows that there exists a symmetric, positive definite Lyapunov u(t) = Kx (t)x(t) + Kr (t)r(t) + kf (t) (11)
matrix Pi ∈ Rn ×n for every symmetric, positive definite matrix
with estimated control gains Kx ∈ Rp×n , Kr ∈ Rp×p , and
Qm i ∈ Rn ×n such that
kf ∈ Rp . Note that there is a different set of control gains
A
m i Pi + Pi Am i = −Qm i ∀i = 1, . . . , s. (9) {Kxi , Kr i , kf i } for each subsystem i and the control gains
need to switch in synchrony with the system parameters.
Hence, Vi = x m Pi xm is a quadratic Lyapunov function for the Hence, the control gain switching relies on the same indi-
i-th subsystem of the reference system. For each subsystem cator functions χi as the
i, there also exist constants am i , λm i > 0 such that eA m i t ≤ s controlled system sand the refer-
ence system,
i.e., K x = K χ
xi i , K r = i=1 Kr i χi , and
am i e−λm i t . If all subsystems satisfy (9) for a common matrix P ,
i=1
kf = si=1 kf i χi . Therefore, as common in related works [16]–
i.e., Pi = P, ∀i, then V = xm P xm is referred to as a CQLF . For [27], Ωi and χi are assumed to be known throughout the paper.
a CQLF, the reference system (8) is stable, even for arbitrary fast Despite this technical assumption, the later simulation studies
switching. If, however, no CQLF exists, stability of the reference will suggest some degree of robustness against asynchronous
system (8) can be ensured through dwell-time considerations. In switching. This also implies that, given the partitions of the
other words, the reference system must reside in one mode for a PWA system are unknown, one may define a sufficiently large
sufficiently long (dwell) time T0 . The dwell-time constraint for number of fictitious partitions, which deliver a satisfactory ap-
the switched reference system (8) is captured in the following proximation of the actual switching behavior. In combination
Lemma, where λm in (Pi ) and λm ax (Pi ) refer to the minimum with the robustness to asynchronous switching this can yield an
and maximum eigenvalue of Pi . acceptable control performance.
Lemma 1 (see[23]): “The switched system ẋ(t) = Am (t) Controlling the PWA system (4) with the state feedback con-
x(t) is exponentially stable with decay rate σ ∈ (0, 1/2α) if troller (11) yields the closed-loop system
T0 satisfies
s
α a2
T0 ≥ Tm = ln(1 + μΔA m ) μ = m max Pi ẋ = χi (Ai + Bi Kxi )x + Bi Kr i r + Bi kf i + fi (12)
1 − 2σα λm β i i=1
where ΔA m = maxi,j Am i − Am j , α = maxi λm ax (Pi ), which in order to solve Problem 1 should equal (8) for suit-
β = mini λm in (Pi ), am = maxi am i , and λm = maxi λm i .” ably chosen control gains. Therefore, to ensure feasibility of the
Proof: The detailed proof of Lemma 1 can be found in [23]. tracking problem, it is assumed, as usual, that there exist nom-
∗
Here, it suffices to point outthat the proof is based on the inal control gains Kxi , Kr∗i , and kf∗ i which fulfill the matching
Lyapunov function V = x ( si Pi χi ) x. The indicator func- conditions
tions χi cause discontinuities in the Lyapunov function at every ∗
switching time tk . For later analysis in this paper, the following Am i = Ai + Bi Kxi
approximation of the potential increase in V associated with Bm i = Bi Kr∗i ∀i = 1, . . . , s
each switch is of greater importance:
fm i = fi + Bi kf∗ i (13)
V (tk ) ≤ (1 + μΔA m )V (t−
k) (10)
One practical implication of these matching conditions is that
where V (t−k ) and V (tk ) are the values of the Lyapunov function the structure of the controlled system needs to be known to some
immediately before and after the switch, respectively.
extend. Otherwise the designer cannot guarantee that the chosen
Finally, given that the autonomous switched system ẋ =
reference systems yield feasible matching conditions.
Am (t)x(t) is exponentially stable for a dwell-time T0 , it can be
The proposed direct and indirect adaptive control laws pro-
concluded, with the same dwell-time constraint, that the state of
duce and update estimates of the nominal control gains in dif-
the switched reference system (8) remains bounded for bounded
ferent ways, but both solve Problem 1. A well-known advantage
reference signals [30].
of MRAC is that solutions to Problem 1 need not converge
to the nominal control gains. However, parameter convergence
C. Problem Formulation and Controller Design
improves robustness and transient performance in adaptive sys-
The formal problem statement addressed by this paper reads tems [31], [32]. Hence, in cases where the reference signal is
as follows. not periodic or where it is desirable to monitor the nominal pa-
Problem 1 (State tracking): Given a PWA system (4) with rameters, the convergence of estimated parameters according to
known regions Ωi and unknown subsystem parameters Ai , Bi , the following problem is required.
and fi , design a state feedback control law u(t) which stabilizes Problem 2 (Convergence of parameter estimates): Besides
the PWA system and forces the state x(t) to asymptotically track the tracking according to Problem 1, ensure that all estimated
the state xm (t) generated by the reference system (8). parameters converge to their nominal values.
In the following two sections, we propose a direct and an The proposed algorithms comply with Problem 2 under the
indirect approach to Problem 1. In both cases, the control law assumption of suitable reference signals.
5638 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 62, NO. 11, NOVEMBER 2017
D. Signal Properties knowledge about the system. Hence, this section builds upon
the framework in [23] and derives a direct MRAC algorithm for
In order to characterize suitable reference signals, we need
to define some signal properties. First, persistence of excitation PWA systems. This generalization modifies the PE conditions
in [23] and, hence, demands for an extended proof of parameter
plays a fundamental role in the convergence of adaptive systems.
convergence.
The general idea is that if some internal signals are rich enough in
their frequency content they are able to excite all system modes. We begin by defining errors between the current control gains
∗
and the nominal values: K̃xi = Kxi − Kxi , K̃r i = Kr i − Kr∗i ,
We consider the standard notation presented in [33, Def. 4.3.1] ∗
and k̃f i = kf i − kf i . Let us rewrite the dynamics of the closed-
and refer to a piecewise continuous signal vector z : R+ → Rn
as persistently exciting (PE) in Rn with a level of excitation loop system in terms of the reference system and the gain errors
∗
by adding and subtracting the expression Bi Kxi x + Bi Kr∗i r +
α0 > 0 if there exist constants α1 , T0 > 0 such that α1 In ≥
t+T 0 ∗
Bi kf i in (12), which yields
1
T0 t z(τ )z (τ )dτ ≥ α0 In , ∀t ≥ 0. While, we rely on this
PE definition in combination with dwell-time assumptions,
s
∗
a first notion of PE for switched systems was introduced ẋ = χi (Ai + Bi Kxi )x + Bi Kr∗i r + fi + Bi kf∗ i
in [34]. i=1
A common approach to ensure PE of a signal vector is to ∗
relate it to the spectral properties of the reference signals. As + Bi (Kxi − Kxi )x + (Kr i − Kr∗i )r + (kf i − kf∗ i )
will be shown throughout the paper, the sufficiently rich property
s
given in [33, Def. 5.2.1] serves well for this purpose. A signal = Am x + Bm r + fm + χi Bi K̃xi x + K̃r i r + k̃f i .
r : R+ → R is referred to as sufficiently rich of order 2n, if it i=1
consists of at least n distinct frequencies. Therefore, a signal (14)
with n sinusoidal components, for instance, is sufficiently rich
of order 2n. Finally, a frequently applied relationship is that a Following from (14) and (8), the dynamics of the tracking
reference signal sufficiently rich of order n + 1 ensures PE of a error e(t) = x(t) − xm (t) are
controllable linear state-space system with state dimension n.
s
While the above properties can characterize the amount of in- ė = Am e + χi Bi K̃xi x + K̃r i r + k̃f i . (15)
formation contained in signals, additional definitions are needed i=1
to analyze the input-output properties of a system. For this pur-
pose, it is conventional to use Lp spaces defined for instance in If the control gains Kxi , Kr i , and kf i take the nominal values
[35, Section 2.7]. As z ∈ L∞ implies boundedness of z(t) and defined in (13), then the error dynamics in (15) reduce to ė =
vice versa, we use the two statements synonymously. Am e, which indicates that the tracking error e converges to zero
exponentially for a stable reference system.
Consider now, however, the case in which the system pa-
E. Sliding Mode Solutions rameters Ai , Bi , and fi are unknown, which at the same time
One important aspect in the problem formulation has not been implies unknown nominal control gains. We make the usual as-
considered so far. That is, if the vector fields of two neighboring sumption in multivariable MRAC [29] that there exist known
regions both point towards the switching hyperplane, the system quadratic matrices Si ∈ Rp×p for which Kr∗i Si is symmetric
trajectories cannot simply cross from one region to the other. and positive definite: Kr∗i Si = (Kr∗i Si ) 0. Then, initial es-
Instead, the state of the system is unable to leave the hyperplane timates Kxi (0) = Kxi0 , Kr i (0) = Kr i0 , and kf i (0) = kf i0 are
and an additional mode—the sliding mode—is created. As the introduced and updated over time. Lyapunov analysis—which
solutions of such sliding modes may evolve in a substantially will be conducted shortly—suggests the following update laws
different manner from the individual subsystems, they must be for the estimated control gains
carefully examined.
In the Filippov concept [36], the sliding mode solution is K̇xi = −Γxi Si Bm
i Pi ex χi
described by a unique convex combination of the contributing K̇r i = −Γr i Si Bm
i Pi er χi
vector fields. In order to obtain such a convex combination in
the presented framework, replace all indicator functions χi by k̇f i = −Γf i Si Bm
i Pi e χi (16)
χ̄i with the modified property that χ̄i now takes values in the
interval [0, 1]. That means the property χi χj = where Γxi , Γr i , Γf i ∈ R+ are positive scaling constants, and
0 for i = j does where Pi ∈ Rn ×n is the symmetric, positive definite Lya-
not hold for sliding modes. But for convexity si=1 χ̄i = 1 still
holds. In the later proofs of Theorems 1 and 3, the Filippov punov matrix of the i-th reference system satisfying A m i Pi +
concept is applied to show that the presented results also hold Pi Am i = −Qm i for some symmetric, positive definite matrix
in the presence of sliding modes. Qm i ∈ Rn ×n . The tracking abilities under the adaptation (16)
are summarized in the following theorem.
Theorem 1 (Direct MRAC for PWA systems with CQLF):
III. DIRECT MRAC
Consider a reference system (8) for which a CQLF with
As pointed out in the previous section, the advantage of PWA P = Pi , ∀i is known. Let the PWA system (4) with known
systems over PWL systems for MRAC is the reduced a priori regions Ωi be controlled by the state feedback (11) with gains
KERSTING AND BUSS: DIRECT AND INDIRECT MODEL REFERENCE ADAPTIVE CONTROL FOR MULTIVARIABLE PIECEWISE AFfiNE SYSTEMS 5639
updated according to (16). Then, the state of the PWA system convex combinations of the vector fields contributing to the slid-
asymptotically tracks the state of the reference system. ing modes are analyzed by replacing χi ∈ {0, 1} with χ̄i ∈ [0, 1]
Proof: Consider the following candidate Lyapunov function, in the above derivation. As in [20], doing so confirms nega-
which is quadratic in the tracking error e and the deviations K̃xi , tive semidefiniteness of V as V̇ = −e 12 si=1 Qm i χ̄i e ≤ 0.
K̃r i , k̃f i from the nominal control gains Note that this derivation shows stability of the sliding mode so-
lution without determining the uniquely defined Fillipov vector
1
s
1 field of the actual sliding mode.
V = e Pe + tr K̃xi Msi K̃xi
2 2 i=1 In summary, the error system is shown to be stable under ar-
bitrary fast switching and sliding modes, and the errors e, K̃xi ,
+ tr K̃ri Msi K̃r i + k̃fi Msi k̃f i (17) K̃r i , and k̃f i are bounded (∈ L∞ ). Boundedness of e, together
with a stable reference system (i.e., xm ∈ L∞ ), yields x ∈ L∞ .
where Msi = (Kr∗i Si )−1 ∈ Rp×p . As mentioned before, the Moreover, from (15) it follows that ė ∈ L∞ . This permits appli-
variables Si are chosen such that the inverse exists and is positive cation of Barbalat’s Lemma on (20) and yields limt→∞ e(t) = 0,
definite. The time derivative of (17) along (15) is which completes the proof.
s In most cases, it is desirable to choose an LTI reference sys-
1
V̇ = e χi (A
m i P + P Am i ) e
tem as it defines the same reference system for all subsystems
2 i=1 or equivalently over the entire state-input space. This choice
s causes the nonlinear system (4) to behave like a linear system
+ χi e P Bi K̃xi x + K̃r i r + k̃f i and naturally satisfies the assumption of a CQLF in Theorem 1.
i=1 Depending on the PWA system, however, the matching condi-
s tions (13) might not allow for an LTI reference system. In such
˙
+ tr K̃xi
Msi K̃˙ xi + tr K̃ri Msi K̃˙ r i + k̃fi Msi k̃f i . cases, a PWA reference system must be employed, for which a
i=1 CQLF might not exist. In this case, the following theorem con-
(18) siders a PWA reference system and shows that under a certain
dwell-time assumption and under sufficiently rich reference sig-
At this point, the rationale behind the update laws in (16) can
nals, all tracking errors and parameter errors converge to zero.
be seen as a suitable choice to cancel all terms in the last two lines
Theorem 2 (Direct MRAC and parameter convergence for
of (18). Note that, for simplicity and without loss of generality,
PWA systems without CQLF): Consider the reference system
the scaling constants Γxi , Γr i , Γf i are neglected throughout the
(8) without CQLF and let the PWA system (4) with known
proof. Inserting the update laws (16) with Pi = P in (18) yields
regions Ωi be controlled by the state feedback (11) with gains
s
1 updated according to (16). Let the reference signals in r
V̇ = −e Qm i χi e be sufficiently rich of order n + 1 with distinct frequencies.
2 i=1
Furthermore, let the resulting switching signal be sufficiently
s slow with dwell time Tdwell and cause repeated activation of
+ χi e P Bi K̃xi x − tr K̃xi
Msi Si Bm
i P ex
all subsystems. If the input matrices Bi have full column rank,
i=1 if the system matrices Am i are invertible, and if the the pairs
(Am i , Bm i ) are controllable, then all errors e, K̃xi , K̃r i , and
+ e P Bi K̃r i r − tr K̃ri Msi Si Bm
i P er
k̃f i asymptotically converge to zero for t → ∞.
Proof: The proof of Theorem 2 begins with analyzing the
+ e P Bi k̃f i − k̃fi Msi Si Bm
iP e (19) convergence of the active subsystem parameters. Consider a
for which the terms under the second summation cancel time interval t ∈ [tk −1 , tk ) and let i be such that χi (t) = 1.
out. To see this, note that Msi Si Bm ∗ Thus, the error (15) reads as
i = Msi Si (Kr i ) Bi =
−1 ⎡ ⎤
Msi Msi Bi = Bi simplifies the first trace to tr(K̃xi Bi P ex ).
As tr(X) = tr(X ), this is equivalent to tr(xe P Bi K̃xi ), x
which can be rewritten as tr(e P Bi K̃xi x) due to the property ė = Am i e + Bi K̃xi K̃r i k̃f i ⎣ r ⎦ . (21)
tr(XYZ) = tr(YZX). Next, note that the obtained expression 1
inside the trace operator is a scalar and cancels the correspond-
We make use of the Kronecker product ⊗ and define
ing term in (19). The same steps can be performed to cancel the
other terms in (19). ⎡ ⎤
x
It follows that the derivative of V is negative semidefinite: ⎣ r ⎦ ⊗ In ϑ̃B
s Ψ r := i := vec Bi K̃xi K̃r i k̃f i (22)
1 1
V̇ = −e Qm i χi e ≤ 0. (20)
2 i=1
where the operator vec(·) concatenates the columns of a matrix
Considering additionally the case of potential sliding modes, (·) in a single column vector. With (22), rewrite (21) as
one must furthermore evaluate the derivative of V along the slid-
ing mode solution. Hence, with the Filippov solution concept all ė = Am i e + Ψ B
r ϑ̃i . (23)
5640 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 62, NO. 11, NOVEMBER 2017
First, Di Bernardo et al. define the state-space partitions for an essential element in the update law. The parameter update
the controlled PWA system and the reference system separately. laws proposed in [26] are
Therefore, their control gains consist of two parts, one switching
˙ ˙ ˙
according to the controlled system and one switching accord- Âi = −χi Pi x̃x B̂i = −χi Pi x̃u fˆi = −χi Pi x̃
ing to the reference system. Second, in the MCS framework, the
control gains depend on the tracking error through a proportional where the indicator functions χi ensure that only the currently
and an integral term. Due to the differential equations in (16), active subsystem parameters are updated. Let Ãi = Âi − Ai ,
the algorithm presented here only contains the integral term. As B̃i = B̂i − Bi , and f˜i = fˆi − fi be the parameter errors. Then,
the proportional term has a beneficial effect on the convergence the derivative of the prediction error in terms of these parameter
of the tracking error, it is interesting to investigate in future work errors is
whether adding such a term in the presented adaptation schemes s
can further improve the control performance. x̃˙ = Am x̃ + Ãi x + B̃i u + f˜i χi . (34)
This section introduced MRAC with direct gain adaptation i=1
according to (16). Compared to its PWL counterpart in [23], the Following the certainty equivalence principle, one can rear-
consideration of affine terms enlarges the class of applicable sys- range the matching conditions (13) and obtain the following
tems. Also, it is not restricted to single input system in canonical algebraic equations for the controller gains:
form as [19], [20]. It was determined in Theorem 1 that the state
tracking task can be achieved without imposing additional re- Kxi = B̂i+ (Am i − Âi )
quirements on the excitation of the reference system. Theorem 2
on the other side suggests excitation with sufficiently rich refer- Kr i = B̂i+ Bm i
ence signals in case convergence to the nominal control gains is kf i = B̂i+ (fm i − fˆi )
required. This is for instance the case if a nonperiodic reference
signal needs to be tracked precisely or the nominal gains are where B̂i+ represents the Moore–Penrose pseudoinverse of B̂i .
to be monitored. For monitoring purposes it might, however, This pseudo inverse constitutes the main obstacle in this ap-
be more interesting to analyze the system parameters Ai , Bi , proach as it introduces singularities into the system. In case of
and fi instead of the control gains. This is the motivation for singularities, the control gains take unbounded values which
the indirect MRAC algorithm derived in the next section, which violates the stability requirement. Hence, additional measures
has the advantage that estimates of the system parameters are need to be taken to ensure stability. One approach is the use of
generated in parallel to the control task. projection operators to avoid singularities [33, Sect. 8.5.5]. In
order to use projection operators, however, additional knowl-
IV. INDIRECT MRAC edge about the system (more precisely Kr∗i ) as compared to the
In the direct MRAC approach, the control gains are directly direct adaptive control algorithm in Section III would be needed.
tuned based on the tracking error. In the indirect case, which is Instead of introducing projection operators, we follow ideas
discussed next, the adaptation of control gains is based on time- previously discussed in [35], [40], and define dynamic update
varying estimates Âi (t), B̂i (t), and fˆi (t) of the system param- laws for the control parameters (Kxi , Kr i , kf i ) which are based
eters Ai , Bi , and fi . Under the certainty equivalence principle, on the current parameter estimates (Âi , B̂i , fˆi ). This preserves
the estimates are handled as if they are the true parameters at all the indirect nature of the adaptation algorithm (hence provides
times [29]. Based on the estimates, the control gains Kxi , Kr i , estimates of the system parameters) and at the same time relies
and kf i are derived from the matching conditions (13). on the same previous knowledge about the system as the direct
The estimates Âi , B̂i , and fˆi are determined by parameter approach. Our contribution in this context is the extension of
identifiers, which are a standard tool for the adaptive identi- [35, Section 3.3] and [40] to switched affine systems. We also
fication of LTI systems, see [33, chapter 5]. Recently in [25] present a thorough proof of parameter convergence, which is
and [26], parameter identifiers were extended to the setting of not given in [35] and [40].
switched linear and switched affine systems. Parameter identi- We begin by defining the following closed-loop estimation
fiers in [26] predict the state x in terms of the parameter esti- errors, which can be interpreted as errors in the matching con-
mates (Âi , B̂i , fˆi ) of the currently active subsystem (χi = 1). ditions (13):
The dynamics of the predicted state x̂ ∈ Rn are εA i = Âi + B̂Kxi − Am i
s
x̂˙ = Am x̂ + (Âi − Am i )x + B̂i u + fˆi χi (33) εB i = B̂i Kr i − Bm i
i=1
εf i = fˆi + B̂kf i − fm i . (35)
where Am = si=1 Am i χi is the switched system matrix of the
stable reference system (8). Let x̃ = x̂ − x be the prediction The closed-loop estimation errors (35) characterize the mis-
error between the predictor (33) and the controlled system (4). match between the desired system dynamics and the estimated
This prediction error contains information about the mismatch closed-loop system dynamics, which are obtained for the esti-
between parameter estimates and true parameters and is, hence, mated system parameters and control gains.
5642 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 62, NO. 11, NOVEMBER 2017
Lyapunov-based stability analysis (which will be presented K̃r i , k̃f i . To see this, insert the update laws (37) with Pi = P ,
shortly) suggests the next steps. Let the closed-loop estimation which yields
errors drive the adaptation of the control gains as follows:
s &
1
s
K̇xi = −Si Bm
i εA i V̇ = − x̃
Qm i χi x̃ + − tr à ε
i Ai
2 i=1
K̇r i = −Si Bm
i εB i
i=1
k̇f i = −Si Bm
i εf i (36) − tr B̃i εA i Kxi
+ εB i Kri + εf i kfi − f˜i εf i
Proof: The proof is based on the candidate Lyapunov func- = Msi Msi−1 Bi = Bi
tion
1 and grouping traces of matrices with equal dimension, results
s
1
V = x̃ P x̃ + tr Ãi Ãi + tr B̃i B̃i + f˜i f˜i in
2 2 i=1
s
1
+ tr K̃xi Msi K̃xi + tr K̃ri Msi K̃r i + k̃fi Msi k̃f i , V̇ = − x̃ Qm i χi x̃
2 i=1
whose time derivative along x̃˙ in (34) is s &
s
1 + − tr à ˜
i εA i + K̃xi Bi εA i − fi εf i − k̃f i Bi εf i
V̇ = x̃ (A P + P Am i )χi x̃ i=1
2 i=1 m i
'
s &
− tr B̃i (εA i Kxi
+ εB i Kri + εf i kfi ) + K̃ri Bi εB i .
+ x̃ P Ãi x + B̃i u + f˜i χi
i=1 (38)
+ tr à ˙ ˙ ˜ ˜˙
i Ãi + tr B̃i B̃i + fi fi Inside the trace operators, one can justify the transformations
'
˙ ˙ ˙ ∗
+ tr K̃xi Msi K̃xi + tr K̃r i Msi K̃r i + k̃f i Msi k̃f i . Ã
i + K̃xi Bi εA i = Â
i − Ai + Kxi Bi − Kxi Bi εA i
At this point, with equalities of the form x̃ P Ãi x = = Â
i − Am i + Kxi Bi εA i
tr(Ã
i P x̃x ), the first elements in the update laws (37) for Â,
B̂, and f prove suitable to cancel out x̃ P (Ãi x + B̃i u + f˜i )χi .
ˆ = Â
i − A
mi + K
B̂
xi i − K
B̃
xi i εA i
The elements containing εA i , εB i , εf i on the other side will turn
out to cancel those terms associated with the control gains K̃xi , = εA i − Kxi B̃i εA i
KERSTING AND BUSS: DIRECT AND INDIRECT MODEL REFERENCE ADAPTIVE CONTROL FOR MULTIVARIABLE PIECEWISE AFfiNE SYSTEMS 5643
= Am x̂ + (Â − Am )x + B̂(Kx x + Kr r + kf ) + fˆ
= B̃i εA i Kxi
+ B̃i εf i kfi + B̂i εB i Kri
= Am x̂ + εA x + B̂(Kr r + kf ) + fˆ
− Bi εB i Kri + Kri Bi εB i − Kr∗i Bi εB i
= (Am + εA )x̂ − εA x̃ + (Bm − εB )r + fm − εf (39)
= B̃i εA i Kxi
+ B̃i εf i kfi + B̂i εB i Kri − Kr∗i Bi εB i
where εA = si=1 εA i χi , εB = si=1 εB i χi , and εf = si=1
= B̃i εA i Kxi
+ B̃i εf i kfi + B̂i εB i Kri − Bm
i εB i εf i χi . By design, reference signals are assumed to be bounded:
= B̃i εA i Kxi
+ B̃i εf i kfi + ε r ∈ L∞ . With stable Am , boundedness of (Bm − εB )r + fm −
B i εB i .
εf , and εA ∈ L2 , it can be concluded from (39) that x̂ ∈ L∞ .
Since x = x̂ − x̃, and since both x̂ and x̃ are bounded, also the
Hence, together with
system state is bounded (x ∈ L∞ ). This implies boundedness
of u due to (11). Finally, bounded x and u imply boundedness
f˜i + k̃fi Bi εf i = fˆi − fi + kfi Bi − kf∗ i Bi εf i ˙ ε̇A , ε̇B , and ε̇f due to x̃˙ = Am x̃ + Ãx + B̃u + f˜ and
of x̃,
(35). It follows that V̈ ∈ L∞ and V̇ ∈ L2 ∩ L∞ which makes
= fˆi − fmi + kfi Bi εf i Barbalat’s Lemma applicable and, therefore limt→∞ V̇ (t) = 0
[35, Corollary 2.9]. Hence, x̃, εA , εB , εf → 0 as t → ∞.
= fˆi − fmi + kfi B̂i − kfi B̃i εf i Inspecting the dynamics of x̂ in (39), we conclude that
˙
= ε
f i εf i − kf i B̃i εf i
lim x̂(t)
t→∞
= Am x̂ + Bm r + fm
the PWA system asymptotically tracks the state of the reference orem 1 guarantees asymptotic state tracking without imposing
system and the estimated parameters Âi , B̂i , and fˆi as well as additional assumptions on the reference signals. If, however, the
the estimated gains Kxi , Kr i , and kf i converge to their nominal choice of reference systems is restricted and no CQLF is found,
values as t → ∞. one must resort to Theorem 2 and impose additional require-
Proof: The proof of Theorem 3 showed that the adaptation ments on the system as well as the reference signals in order
is stable and that state tracking is achieved. For the proof of to achieve asymptotic state tracking. The dominant requirement
Theorem 4, it is left to show that the parameter estimates of are sufficiently rich reference signals to ensure PE and in turn
the active subsystem converge asymptotically to the true pa- guarantee convergence of the control gains to the nominal gains.
rameters. During this proof, we thus neglect the index i with As converged gains imply perfect reference system matching,
the understanding that all steps shown here apply for the active direct MRAC according to Theorem 2 should also be applied if
subsystem. Hence, if all subsystems are activated in a persistent precise tracking under nonperiodic reference signals is required.
manner, all parameter estimates converge. Independent of the existence of a CQLF this task is only achiev-
The proof consists of three steps. First, the convergence of able for arbitrary reference signals if the nominal control gains
estimated subsystem parameters is tied to a PE condition of are applied.
internal signals. Then, Lemma 2 is applied to verify this PE The ideal use case for the indirect MRAC algorithm presented
requirement and, thus, show convergence of the estimated sub- in Section IV is when monitoring of subsystem parameters be-
system parameters. In the final step, it is shown that converging comes the main goal besides reference system tracking alone.
parameter estimates yield converging control gains. Applications can be the early detection of parameter drifts or ag-
Using a similar notations as in (22), begin by introducing ing components in the system. From this point of view, indirect
⎡ ⎤ MRAC according to Theorem 3 has limited practical relevance
x as it only achieves the asymptotic state tracking of a reference
Ψu := ⎣ u ⎦ ⊗ In θ̃ := vec à B̃ f˜ . (40) system for which a CQLF is known. No guarantees are given for
1 the convergence of parameter estimates. Hence, in this case the
Then, from (34) and (37), likewise (26), the closed-loop system benefit of a simpler structure in the direct MRAC law is more
for both the prediction error x̃ and parameter errors θ̃ can be convincing. Yet, the advantage of the indirect MRAC strategy
written as is that control gains are updated indirectly over the interme-
$ % diate closed-loop estimation errors, such as εA . Therefore, the
x̃˙ Am Ψu (t) x̃ 0 control gains are updated less aggressively which can reduce
˙ = −Ψu (t)P 0 θ̃
+
ε
(41)
the wear associated with strongly oscillating control gains. The
θ̃
convincing full potential of indirect MRAC is given in form
where ε = −vec([εA , εA Kx + εB Kr + εf kf , εf ]). Note that of Theorem 4. That is, by exciting the system with sufficiently
from the proof of Theorem 3 it can be concluded that ε ∈ L2 . rich reference signals, it is guaranteed that the estimated sub-
Hence, convergence is derived from the analysis of the first system parameters converge to the true values, which is ideal
summand in (41). for monitoring purposes. Unfortunately the application of indi-
For (41) with ε = 0, exponential stability of the equilib- rect MRAC is restricted to reference systems with CQLF as the
rium x̃ = 0 and θ̃ = 0 holds if zu = [x , u , 1] is PE [33, convergence of subsystem parameters is asymptotic. Therefore,
Lemma 5.6.3]. As Theorem 3 showed e → 0, it follows that zu is a stability proof as carried out for direct MRAC is infeasi-
PE if zm = [x
m , um , 1] is PE, where um can be thought of as an ble as the discontinuities in the Lyapunov function can not be
internal signal in the reference system if it was realized in terms compensated by a combination of exponential convergence and
of the matching conditions, i.e., um = Kx∗ xm + Kr∗ r + kf∗ . Un- dwell-time constraints.
der the given assumptions, Lemma 2 guarantees PE of zm .
Hence, the PE condition on zu is also satisfied, which—
according to [33, Lemma 5.6.3] —implies asymptotic conver- V. NUMERICAL VALIDATION
gence of θ̃ to zero and thus  → A, B̂ → B and fˆ → f as We consider the mass-spring-damper system in Fig. 1(a)
t → ∞. Note that the exponential convergence stated in [33, for numerical validation of the proposed adaptation schemes.
Lemma 5.6.3] does not hold here due to the presence of ε in The system consists of two masses m1 = 5 kg and m2 = 1 kg
(41). connected to each other and the environment by springs and
Finally, combine the convergence of Â, B̂, and fˆ with the dampers. All dampers have fixed damping coefficients d =
convergence of εA , εB , and εf shown in the proof of Theorem 3. 1 Ns/m. The left cart is connected to the environment by a
For the assumed full column rank of B, it can then be concluded constant spring with stiffness c0 = 1 N/m. Let the stiffness of
that also the control gains converge to their nominal values: the spring between the two masses be PWA with characteris-
Kx → Kx∗ , Kr → Kr∗ , and kf → kf∗ as t → ∞. tic Fc (p1 , p2 ) shown in Fig. 1(b), which suggest a PWA system
After deriving direct and indirect MRAC control algorithms with three subsystems. The characteristic is assumed to be linear
for PWA systems, let us briefly revise their preferred use cases with slope c1 = 10 N/m for |p2 − p1 | < γ = 1 m. If the spring
as well as advantages and disadvantages. The direct MRAC is extended beyond γ, the slope of the spring characteristic de-
algorithm presented in Section III is ideally suited for refer- creases to c2 = 1 N/m < c1 . For strong spring compression, i.e.,
ence system tracking alone. In case a linear reference system is p2 − p1 < −γ, the slope of the spring characteristic increases to
chosen, or a CQLF for the reference subsystems is known, The- c3 = 100 N/m > c1 . As Fig. 1(b) indicates, the change in slope
KERSTING AND BUSS: DIRECT AND INDIRECT MODEL REFERENCE ADAPTIVE CONTROL FOR MULTIVARIABLE PIECEWISE AFfiNE SYSTEMS 5645
APPENDIX A
PROOF OF LEMMA 2
Proof: The proof of Lemma 2 follows ideas presented in [33,
pp. 306–308] and relates the PE condition of zm to the spectral
properties of r. Consider the two representations
ẋm = Am xm + Bm rm + fm and
ẋm = Axm + Bum + f
um = Kx∗ xm + Kr∗ r + kf∗ .
Note that the affine terms fm and kf∗ can be understood to
relate xm and um to a constant input of 1. Therefore, let zm =
[x
m , um , 1] be related to the constant input via the transfer
function Hf (s). Furthermore, let the transfer function Hk (s)
relate zm to the k-th element of the reference signal vector
r = [r1 , . . . , rp ] . Hence, zm (s) is given by
⎡ ⎤ ⎡ ⎤
xm (s) (sI − Am )−1 fm
⎢ ⎥ ⎢ ⎥
zm (s) = ⎣ um (s) ⎦ = ⎣ kf∗ ⎦ 1(s)
Fig. 6. Parameter convergence under indirect MRAC for subsystem 1. 1(s) 1
Solid lines represent parameters during the active periods of subsys- !" #
tem 1 (χ 1 = 1) and dotted lines visualize the inactive periods of subsys-
=:H f (s)
tem 1 (χ 1 = 0). The ideal control gains are given by dashed lines.
⎡ ⎤
(sI − Am )−1 bm ,k
p
⎢ ∗ ⎥
similar and therefore omitted). Besides the control gains Kx1 , + ⎣ kr,k + Kx∗ (sI − Am )−1 bm ,k ⎦ rk (s)
Kr 1 , and kf 1 , the figure also visualizes the norm of the error k =1 0
in the estimated parameters: θ̃1 = vec([Ã1 , B̃1 , f˜1 ]). As !" #
=:H k (s)
derived in Theorem 4, all errors tend to zero asymptotically
despite switching. ∗
where I is the identity matrix, and bm ,k and kr,k denote the k-th
One difference between direct and indirect MRAC is the column of Bm and Kr∗ , respectively.
strength with which control gains are adjusted. Especially kf 1 Under the assumption of distinct frequencies in the reference
is adjusted less aggressively in the indirect case (note that the signals, the auto covariance of zm is given by
scales differ by a factor of ten). ) ∞
1
Rz m (0) = Hf (−jω)S1 (ω)Hf (jω)dω
2π −∞
VI. CONCLUSION p )
1 ∞
This paper proposes a direct and an indirect algorithm for + Hk (−jω)Sr k (ω)Hk (jω)dω (42)
2π −∞
MRAC of multivariable PWA systems. In both cases, asymp- k =1
totic tracking of the reference system is shown for arbitrary fast where S1 (ω) and Sr k (ω) are the spectral distributions of the
switching in case a common Lyapunov function for the refer- constant input 1 and the reference signals rk , respectively. Since
ence system is known. Without a common Lyapunov function, rk is sufficiently rich of order n + 1, its spectrum has n + 1
a dwell-time constraint on the switching signal is given for the distinct peaks Fr k (ωk l ) at frequencies ωk l , l = 1, . . . , n + 1.
direct approach. Furthermore, convergence of all parameter er- The constant input 1(s) results in a single spectral peak at zero.
rors and tracking errors to zero is guaranteed for sufficiently rich The spectral distributions are
reference signals. The limitations of the proposed algorithms are
twofold. First, the full state feedback restricts the class of ap-
n +1
plicable systems and limits the choice of reference systems. S1 (ω) = δ(ω) Sr k (ω) = Fr k (ωk l )δ(ω − ωk l ) (43)
This limitation can be overcome by output feedback. The sec- l=1
ond limitation of the proposed algorithms is the assumption of where δ(ω = 0) = 1 and δ(ω = 0) = 0. With (43), the integrals
full state measurement and known state-space partitions. While in (42) are replaced by summations
the conducted simulation studies show some degree of robust-
1
ness against asynchronous switching, the proposed algorithms Rz m (0) = Hf (−j0)Hf (j0)
become inapplicable for completely unknown state-space parti- 2π
1
tions. Therefore, future work focuses on extending the presented p n +1
ideas to uncertain switching hyperplanes. From a practical point + Fr k (ωk l )Hk (−jωk l )Hk (jωk l ).
2π
of view, also the reduction of discontinuities in the control inputs k =1 l=1
According to [33, Lemma 5.6.1], the vector signal zm is PE [vn +1 , . . . , vn +p ] ∈ Rp and Z = vn +p+1 ∈ R, which leads to
if Rz m (0) is positive definite, i.e., the quadratic equation
(adj(sI − Am )Bm ) X
v Rz m (0)v = 0, v ∈ Rn +p+1 (45)
+ (a(s)Kr∗ + Kx∗ adj(sI − Am )Bm ) Y + 0p Z = 0p . (51)
can only have a single solution at v = 0n +p+1 . Note that the
outer products Hk (−jωk l )Hk (jωk l ) of the column vectors The following expressions for adj(sI − A)B and a(s) are
Hk (jωk l ) are positive semidefinite. Therefore, each summand applied to (51)
in (44) is positive semidefinite and (45) is equivalent to adj(sI − A)B = Bsn −1 + (AB + an −1 B)sn −2
v Rz m (0)v = v Rz 1 (0)v + v Rz 2 (0)v = 0 (46) + (A2 B + an −1 AB + an −2 B)sn −3 +, . . . ,
!" # !" #
≥0 ≥0
+ (An −1 B + an −1 An −2 B+, . . . , +a1 B)
where
a(s) = sn + an −1 sn −1 +, . . . , +a1 s + a0
1
p n +1
Rz 1 (0) := Fr k (ωk l )Hk (−jωk l )HkT (jωk l ) which results in a polynomial of order n. All coefficients of the
2π
k =1 l=1 resulting polynomial must be zero in order for the polynomial to
1 be zero independent of s. For the coefficient of sn , one obtains
Rz 2 (0) := Hf (−j0)HfT (j0).
2π Y Kr∗ = 0 (52)
First, v Rz 1 (0)v = 0 is analyzed. Since all summands in
which by multiplication of SMs from the right yields Y = 0p .
Rz 1 (0) are positive semidefinite, the equality holds, if and only
Note that the assumption of full column ranks in B and Bm
if
ensures that both S and Ms have empty null spaces such that
∀k = 1, . . . , p their multiplication does not affect the solution of (52).
v Hk (−jωk l )Hk (jωk l )v = 0 For Y = 0p , the n equalities related to the coefficients from
∀l = 1, . . . , n + 1
s0 to sn −1 can be combined in the compact form
or equivalently
Bm Am Bm . . . Anm−1 Bm X = 0n p
∀k = 1, . . . , p
Hk (jωk l )v = 0 (47) which due to controllable (Am , Bm ) leads to X = 0n .
∀l = 1, . . . , n + 1. This shows that (51) holds for X = 0n , Y = 0p , and an ar-
Next, rewrite the transfer function Hk (s) with a(s) = det(sI − bitrary Z ∈ R. Consequently, Rz 1 (0) is only positive semidef-
Am ) as inite. Therefore, Rz 2 (0) in (46) must restrict the choice of Z
⎡ ⎤ further. In other words, for X = 0n and Y = 0p the equality
adj(sI − Am )bm ,k v Rz 2 (0)v = 0 must only hold for Z = 0. The same arguments
1 ⎢ ∗ ⎥
Hk (s) = ⎣ a(s)kr,k + Kx∗ adj(sI − Am )bm ,k ⎦ (48) as above lead to
a(s) ⎡ ⎤
0 adj(−Am )fm
!" # ⎢ ⎥
=: H̄ k (s) H̄f (j0)v = ⎣ a(j0)kf∗ ⎦ v = 0
and note that the equalities (47) are also equivalent to a(j0)
[5] D. Corona and B. De Schutter, “Adaptive cruise control for a SMART car: [30] G. Michaletzky and L. Gerencser, “BIBO stability of linear switching
A comparison benchmark for MPC-PWA control methods,” IEEE Trans. systems,” IEEE Trans. Automat. Control, vol. 47, no. 11, pp. 1895–1898,
Control Syst. Technol., vol. 16, no. 2, pp. 365–372, Mar. 2008. Nov. 2002.
[6] L. Hai and P. Antsaklis, “Stability and stabilizability of switched lin- [31] M. Duarte and K. Narendra, “Combined direct and indirect approach
ear systems: A survey of recent results,” IEEE Trans. Automat. Control, to adaptive control,” IEEE Trans. Automat. Control, vol. 34, no. 10,
vol. 54, no. 2, pp. 308–322, Feb. 2009. pp. 1071–1075, Oct. 1989.
[7] R. DeCarlo, M. S. Branicky, S. Pettersson, and B. Lennartson, “Perspec- [32] E. Lavretsky, “Combined/composite model reference adaptive control,”
tives and results on the stability and stabilizability of hybrid systems,” IEEE Trans. Automat. Control, vol. 54, no. 11, pp. 2692–2697, Nov. 2009.
Proc. IEEE, vol. 88, no. 7, pp. 1069–1082, Jul. 2000. [33] P. Ioannou and J. Sun, Robust Adaptive Control. Upper Saddle River, NJ,
[8] R. Shorten, F. Wirth, O. Mason, K. Wulff, and C. King, “Stability criteria USA: Prentice-Hall, 1996.
for switched and hybrid systems,” SIAM Rev., vol. 49, no. 4, pp. 545–592, [34] M. Petreczky and L. Bako, “On the notion of persistence of excitation for
2007. linear switched systems,” in Proc. 50th IEEE Conf. DecisionControl Eur.
[9] A. Bemporad, G. Ferrari-Trecate, and M. Morari, “Observability and con- Control Conf., 2011, pp. 1840–1847.
trollability of piecewise affine and hybrid systems,” IEEE Trans. Automat. [35] K. Narendra and A. M. Annaswamy, Stable Adaptive Systems. New York,
Control, vol. 45, no. 10, pp. 1864–1876, Oct. 2000. NY, USA: Dover, 2005.
[10] A. Tanwani, H. Shim, and D. Liberzon, “Observability for switched linear [36] A. F. Filippov, Differential Equations with Discontinuous Righthand Sides,
systems: Characterization and observer design,” IEEE Trans. Automat. vol. 18. Dordrecht, The Netherlands: Springer, 1988.
Control, vol. 58, no. 4, pp. 891–904, Apr. 2013. [37] S. Boyd and S. Sastry, “On parameter convergence in adaptive control,”
[11] B. Passenberg, P. E. Caines, M. Leibold, O. Stursberg, and M. Buss, Syst. Control Lett., vol. 3, pp. 311–319, 1983.
“Optimal control for hybrid systems with partitioned state space,” IEEE [38] M. S. Branicky, “Multiple lyapunovfunctions and other analysis tools for
Trans. Automat. Control, vol. 58, no. 8, pp. 2131–2136, Aug. 2013. switched and hybrid systems,” IEEE Trans. Automat. Control, vol. 43,
[12] Q. Wang, Y. Hou, and C. Dong, “Model reference robust adaptive control no. 4, pp. 475–482, Apr. 1998.
for a class of uncertain switched linear systems,” Int. J. Robust Nonlinear [39] G. Tao, Adaptive Control Design and Analysis (Adaptive and learning sys-
Control, vol. 22, no. 9, pp. 1019–1035, 2012. tems for signal processing, communications, and control Series). Hobo-
[13] C. Wu, J. Zhao, and X.-M. Sun, “Adaptive tracking control for uncertain ken, NJ, USA: Wiley, 2003.
switched systems under asynchronous switching,” Int. J. Robust Nonlinear [40] M. Duarte and K. Narendra, “Indirect model reference adaptive control
Control, vol. 25, no. 17, pp. 3457–3477, 2015. with dynamic adjustment of parameters,” Int. J. Adapt. Control Signal
[14] J. Xie and J. Zhao, “Model reference adaptive control for nonlinear Process., vol. 10, pp. 603–621, 1996.
switched systems under asynchronous switching,” Int. J. Adapt. Control
Signal Process., vol. 31, pp. 3–22, 2016.
[15] X. Wang and J. Zhao, “Adaptive state tracking of switched systems based
on a hyperstability criterion,” Int. J. Adapt. Control Signal Process., Stefan Kersting (S’12) received the Dipl.Ing.
vol. 28, no. 1, pp. 28–39, 2014. degree in electrical engineering from Technical
[16] M. Di Bernardo, U. Montanaro, and S. Santini, “Minimal control synthesis University Darmstadt, Darmstadt, Germany, in
adaptive control of continuous bimodal piecewise affine systems,” in Proc. 2011. He is currently working toward the Ph.D.
51st IEEE Annu. Conf. Decision Control, 2012, pp. 2637–2642. degree at the Technical University of Munich,
[17] M. Di Bernardo, U. Montanaro, and S. Santini, “Minimal control synthesis Munich, Germany.
adaptive control of continuous bimodal piecewise affine systems,” SIAM He is currently a Research Associated in the
J. Control Optim., vol. 48, no. 7, pp. 4242–4261, 2010. Chair of Automatic Control Engineering, Techni-
[18] M. Di Bernardo, C. I. H. Velasco, U. Montanaro, and S. Santini, “Experi- cal University of Munich. His research interests
mental implementation and validation of a novel minimal control synthe- include adaptive identification and control of hy-
sis adaptive controller for continuous bimodal piecewise affine systems,” brid systems and robust switching control.
Control Eng. Practice, vol. 20, pp. 269–281, 2012.
[19] M. Di Bernardo, U. Montanaro, and S. Santini, “Hybrid model refer-
ence adaptive control of piecewise affine systems,” IEEE Trans. Automat.
Control, vol. 58, no. 2, pp. 304–316, Feb. 2013. Martin Buss (F’14) received the Diploma
[20] M. Di Bernardo, U. Montanaro, R. Ortega, and S. Santini, “Extended Engineering degree in electrical engineering
hybrid model reference adaptive control of piecewise affine systems,” from Technical University Darmstadt, Darm-
Nonlinear Anal., Hybrid Syst., vol. 21, pp. 11–21, 2016. stadt, Germany, in 1990, the Doctor of Engineer-
[21] Q. Sang and G. Tao, “Adaptive control of piecewise linear systems: The ing degree in electrical engineering from the Uni-
state tracking case,” in Proc. Amer. Control Conf., 2010, pp. 4040–4045. versity of Tokyo, Tokyo, Japan, in 1994, and the
[22] Q. Sang and G. Tao, “Adaptive control of piecewise linear systems Habilitation degree from Technical University of
with applications to NASA GTM,” in Proc. Amer. Control Conf., 2011, Munich, Munich, Germany, in 2000.
pp. 1157–1162. In 1988, he was a research student for one
[23] Q. Sang and G. Tao, “Adaptive control of piecewise linear systems: year with the Science University of Tokyo. From
The state tracking case,” IEEE Trans. Automat. Control, vol. 57, no. 2, 1994 to 1995, he was a Postdoctoral Researcher
pp. 522–528, Feb. 2012. in the Department of Systems Engineering, Australian National Univer-
[24] Q. Sang, G. Tao, and J. Guo, “Multivariable state feedback for output sity, Canberra, ACT, Australia. From 1995 to 2000, he was a Senior
tracking MRAC for piecewise linear systems with relaxed design condi- Research Assistant and a Lecturer with the Chair of Automatic Con-
tions,” in Proc. Amer. Control Conf., 2014, pp. 703–708. trol Engineering, Department of Electrical Engineering and Information
[25] S. Kersting and M. Buss, “Adaptive identification of continuous-time Technology, Technichal University of Munich. From 2000 to 2003, he was
switched linear and piecewise linear systems,” in Proc. Eur. Control Conf., a Full Professor, Head of the Control Systems Group, and the Deputy
2014, pp. 31–36. Director of the Institute of Energy and Automation Technology, Faculty
[26] S. Kersting and M. Buss, “Online identification of piecewise affine sys- IV, Electrical Engineering and Computer Science, Technical University
tems,” in Proc. UKACC 10th Int. Conf. Control, 2014, pp. 90–95. Berlin, Berlin, Germany. Since 2003, he has been a Full Professor (Chair)
[27] S. Kersting and M. Buss, “Concurrent learning adaptive identification of with the Chair of Automatic Control Engineering, Faculty of Electrical En-
piecewise affine systems,” in Proc. 53rd IEEE Conf. Decision Control, gineering and Information Technology, Technical University of Munich,
2014, pp. 3930–3935. where he has been with the Medical Faculty since 2008. Since 2006,
[28] A. Garulli, S. Paoletti, and A. Vicino, “A survey on switched and piecewise he has also been the Coordinator of the Deutsche Forschungsgemein-
affine system identification,” in Proc. 16th IFAC Symp. Syst. Identif., 2012, schaft Excellence Research Cluster Cognition for Technical Systems
pp. 344–355. with CoTeSys. His research interests include automatic control, mecha-
[29] G. Tao, “Multivariable adaptive control: A survey,” Automatica, vol. 50, tronics, multimodal human system interfaces, optimization, nonlinear,
no. 11, pp. 2737–2764, 2014. and hybrid discrete-continuous systems.