Professional Documents
Culture Documents
2024 Parameter Refinement ReferenceTracking LinearParameter Varying
2024 Parameter Refinement ReferenceTracking LinearParameter Varying
Abstract—In this study, we implement a control method for The dynamic model structure is well-known for robotic
stabilizing a ballbot that simultaneously follows a reference. A systems based on physical laws. The identification of the
ballbot is a robot balancing on a spherical wheel where the parameters can be obtained through classical mechanics [5]
single point of contact with the ground makes it omnidirectional
and highly maneuverable but with inherent instability. After and enhanced white modeling, but parameters that describe
introducing the scheduling parameters, we start the analysis friction can be determined only experimentally. This study will
by embedding the nonlinear dynamic model derived from first provide a method that refines the parameters that define the
principles to a linear parameter-varying (LPV) formulation. nonlinear ballbot model from the measurements derived in [1].
Continuously, and as an extension of a past study, we refine For controlling the ballbot, often, PID controllers considered
the parameters of the nonlinear model that enhance significantly
its accuracy. The crucial advantages of the LPV formulation [1], [6] or with double-loop approaches [3]. One of the
are that it consists of a nonlinear predictor that can be used limitations of the aforementioned methods is the consideration
in model predictive control (MPC) by retaining the convexity of input/state constraints.
of the quadratic optimization problem with linear constraints Model predictive control (MPC) has been widely applied
and further evades computational burdens that appear in other
for reference tracking with constraints [4]. It can generate
nonlinear MPC methods with only a slight loss in performance.
The LPVMPC control method can be solved efficiently as a optimal trajectories that steer the robot in a constrained space.
quadratic program (QP) that provides timing that supports real- Since autonomously moving robots are safety-critical systems,
time implementation. Finally, to illustrate the method, we test nonlinear MPC (NMPC) is gaining popularity due to its
the control designs on a two-set point 1D non-smooth reference ability to utilize high-fidelity nonlinear models, enabling more
with sudden changes, to a 2D nonstationary smooth reference
accurate and precise control actions. Another advantage of
known as Lissajous curves, and to a single-set point 1D non-
smooth reference where for this case theoretical guarantees such MPC is that it can be leveraged with planning algorithms,
as stability and recursive feasibility are provided. which is crucial for such autonomously moving objects. The
Index Terms—Identification, Control, Ballbot, Model Predic- drawback of NMPC is the computational complexity that
tive Control, Linear Parameter-Varying, Quadratic Program makes online minimization of the underlying objective func-
tion over a nonconvex manifold cumbersome. Therefore, a
I. I NTRODUCTION good alternative is to utilize the LPV formulation that can
embed the nonlinear model equivalently and introduce the
A ballbot Fig. 1 is an omnidirectional mobile robot that
LPVMPC framework that can be solved efficiently as QP,
balances on a single spherical wheel [1]–[3]. The single point
with nearly the same computational burden as in linear MPC,
of contact with the ground makes this under-actuated system
enabling real-time implementation.
agile but challenging to control. The use cases of a ballbot
include healthcare, virtual assistants, etc., as service robots, θy
where maneuverability in busy environments while maintain- z rw Virtual
ing stability is important. A ballbot system’s stability implies ϕy
balancing and trajectory tracking through a predetermined Wheel
1
divergent when random initialization is considered, or a so- 20
0
lution can be algebraically correct without explaining the 0
-1
physical law. To tackle such a problem, initialization of the -20
-2
b-parameters should be done by respecting the engineering 0 1 2 3 4 5 0 1 2 3 4 5
regularity of the problem. Thus, some parameters bi can Fig. 3: Models simulations; the linearized model given in
be initialized with physical meaning, e.g., b1 , b2 , b3 being dashed blue line; non-linear model simulation given in black;
positive to explain the high-fidelity knowledge of the physical LPV in dashed red lines; all done using ode45 on MATLAB.
plant [1] where b4 as concerns friction can remain arbitrary Finally, with green is the RK4 method utilizing (7). The input
but also with a reasonable small value. τy is a multiharmonic signal.
In Fig. 2, the iterative scheme (13) converged after five
steps and gave the optimal solution with a residual error that B. LPVMPC for Reference Tracking
stagnated to the value 0.4330. This error cannot be improved Reference tracking for a ballbot refers to the stabilized
further as the analysis relies on the identified parameters p motion by the LPVMPC method that can maneuver over
from the study [1], where the dynamics are corrupted with any given state reference xref . We will consider three study
noise and/or slight nonlinear behavior from the physical plant. scenarios. The 1st scenario will be a 1D reference tracking of
Continuously, in Fig. 3, the simulation indicates the ex- a nonsmooth discontinuous trajectory that will ”shock” the
pected performance after substituting the obtained parameters optimization problem due to the sudden changes. The 2nd
b in Table III and applying a multi-harmonic input to the scenario will be a 2D smooth reference as Lissajous curves,
stabilized plant. In particular, in Fig. 3, the nonlinear model assuming the ballbot can be driven independently in the xz and
compared to the linearized model stays close to the dynamical yz planes. Finally, a 3rd scenario will illustrate the reference
evolution for all states when the dynamics are close to the lin- tracking of a single set point with a scheme of the LPVMPC
earization operation point xe = 0 as expected. The discrepancy that can guarantee the stability of the closed-loop system and
increases for larger deviations in ϕ from the origin where the the recursive feasibility of its optimization problem. These
linearized cannot be trusted. Finally, in Fig. 3, the comparison properties are explained below. Tuning of the prediction hori-
between the LPV and the nonlinear model certifies that the zon depends on the physical system’s operational bandwidth
LPV is an equivalent embedding of the nonlinear system where and can vary significantly across applications. The ballbot is
differences are buried in machine precision error. Finally, the a robotic system that operates around 1(Hz); thus, a horizon
discrete-time implementation with the RK4 in (7) with a ZOH of about 1 (s) is reasonable.
also certifies a good accuracy compared to the ode45. 1) 1st Scenario: Nonsmooth 1D reference: In Fig. 4, we
consider the following 2-set point nonsmooth reference. The
TABLE III: The refined b parameters of the nonlinear model. ballbot should start from the origin and at t = 1 s should
b∗ b1 b2 b3 b4
roll for one circle 2π rad, then stay there for 2 s and for the
Optimal 0.002483 0.059325 0.143093 -0.07436
remaining 1 s to return to the origin; thus, it will roll a total
distance of approximately (2 · 2π · rb ≈ 1.5 m) within 4 s.
At the beginning and near the origin, positive virtual torque
applies on the ballbot to decrease the tilt angle θ that instantly
will move the ballbot to the negative ϕ, where immediately
switching direction (normal behavior of such non-minimal
phase system, similar to a bike where to turn, first a small
maneuver to the opposite direction is needed). Continuously
the ballbot increases speed in the ϕ positive direction while
approaching the target 2π with satisfying accuracy and faster
than the linear one after ≈ 1.3 s, then stays there and at
around 2+ s where the prediction horizon of time length
N · ts = 20 · 0.05 = 1 s receives the information of the sudden
change in reference from 2π to 0 rad, the ballbot slightly Fig. 4: Ballbot reference tracking with the angular displace-
overshoots in ϕ so to change the tilt angle again and starts to ment ϕ traveling from the origin to 2π rad and back. A
accelerate in the opposite direction. After less than a second, comparison between the linear and LPV MPC frameworks is
the ballbot has reached its origin without overshooting, which illustrated. θ measuring balancing of ballbot, ϕ̇, θ̇ measuring
outlines the good performance seen mainly in nonlinear MPC the angular velocities, τy is the virtual torque, J is the energy
frameworks. cost function.
In Fig. 4, we compare the MPC performances between the to global coordinates as: X ref = ϕx rb = 0.12 · 2π sin(0.3t)
LPV and the linearized model. As expected, the LPVMPC is and Y ref = ϕy rb = 0.12 · 2π sin(0.4t), under the constraints
generally faster at reaching the reference with almost similar in Table IV. In that case, the reference is smoother without
computational complexity as the linear MPC and the potential sudden changes. Therefore, we can increase the weight for
to handle better strong maneuvering as the adaptive scheduling matching the ϕ without inheriting infeasibility problems; thus,
variable ρi|k provides linearizations over a trajectory of the the quadratic cost for the 2D reference tracking in each
nonlinear ballbot model instead of a single fixed point for direction can be set as Q = diag([1000, 1, 0.1, 0.1]). All
the linearized model. The quadratic costs for the 1st scenario the other quadratic weights and parameter specifications are
are defined as Q = diag([200, 1, 0.1, 0.1]), R = 1000, and considered the same as in the 1st scenario. In Fig. 5, the
the terminal cost P is the Riccati solution from the LQR LPVMPC control strategy accurately drives the ballbot to the
on the linearized model from [1] with the same tuning of reference within an area of around 2 m2 and for 70 s.
Q and R that will penalize the ”tail” of the horizon. The
sampling time is ts = 0.05 (s), with horizon length N = 20, 0.8
and the MPC solution is obtained under the input/state hard
0.6
constraints introduced in Table IV and highlighted with the
0.4
gray background in Fig. 4. In the 1D reference tracking, as
0.2
4
R EFERENCES
[1] M. Studt, I. Zhavzharov, and H. S. Abbas, “Parameter identification and
3
LQR/MPC balancing control of a ballbot,” in 2022 European Control
2 Conference (ECC), 2022, pp. 1315–1321.
[2] P. Fankhauser and C. Gwerder, “Modeling and control of a ballbot,”
1 2010. [Online]. Available: https://api.semanticscholar.org/CorpusID:
58555644
0
[3] D. B. Pham, H. Kim, J. Kim, and S.-G. Lee, “Balancing and transferring
-1 control of a ball segway using a double-loop approach [applications of
control],” IEEE Control Systems Magazine, vol. 38, no. 2, pp. 15–37,
-2
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
2018.
[4] M. Nezami, D. S. Karachalios, G. Schildbach, and H. S. Abbas, “On
Fig. 6: Single-set point reference i.e., π, for the LPVMPC the design of nonlinear MPC and LPVMPC for obstacle avoidance in
autonomous driving*,” in 2023 9th International Conference on Control,
framework with and without theoretical guarantees. Decision and Information Technologies (CoDIT), 2023, pp. 1–6.
[5] B. Siciliano, L. Sciavicco, L. Villani, and G. Oriolo, Robotics: Mod-
elling, Planning and Control, 1st ed. Springer Publishing Company,
TABLE V: Comparison of computation times for a single QP Incorporated, 2008.
[6] S. Puychaison and B. S., “Mouse type ballbot identification and control
using YALMIP with LPVMPC. The simulations are performed using a convex-concave optimization,” in In Proceedings of the Inter-
on a Dell Latitude 5590 laptop with an Intel(R) Core(TM) national Automatic Control Conference (CACS 2019), National Taiwan
i7-8650U CPU and 16 GB of RAM. The scenarios are im- Ocean University, Keelung,Taiwan, Nov. 2019, pp. 1–6.
[7] P. S. G. Cisneros, S. Voss, and H. Werner, “Efficient nonlinear model
plemented in MATLAB, utilizing the YALMIP toolbox [11], predictive control via quasi-LPV representation,” in 2016 IEEE 55th
with an optimality tolerance of 10−8 . Conference on Decision and Control (CDC), 2016, pp. 3216–3221.
LPVMPC with ts = 0.05 s Mean Standard deviation [8] M. M. Morato, J. E. Normey-Rico, and O. Sename, “Model predictive
control design for linear parameter varying systems: A survey,” Annual
1st Scenario 0.0086 s 0.0104 s
Reviews in Control, vol. 49, pp. 64–80, 2020.
2nd Scenario 0.0087 s 0.0132 s
[9] H. S. Abbas, R. Tóth, N. Meskin, J. Mohammadpour, and J. Hanema,
3rd Scenario 0.0058 s 0.0025 s
“A robust MPC for input-output LPV models,” IEEE Transactions on
Automatic Control, vol. 61, no. 12, pp. 4183–4188, 2016.
[10] H. S. Abbas, “Linear parameter-varying model predictive control for
IV. C ONCLUSION nonlinear systems using general polytopic tubes,” Automatica, vol. 160,
p. 111432, 2024. [Online]. Available: https://www.sciencedirect.com/
In this study, we were concerned with the reference tracking science/article/pii/S000510982300599X
of a ballbot by utilizing the LPV embedding within the MPC [11] J. Löfberg, “YALMIP : A toolbox for modeling and optimization in
framework. An important step was the refinement of the matlab,” in In Proceedings of the CACSD Conference, Taipei, Taiwan,
2004.
parameters of the ballbot system from measurements of a past [12] C. Verhoek, J. Berberich, S. Haesaert, R. Tóth, and H. S. Abbas, “A
study. Thus, having the nonlinear model allowed the nonlinear linear parameter-varying approach to data predictive control,” 2023.
embedding of the LPV form. An improved discretization [Online]. Available: https://arxiv.org/abs/2311.07140