Professional Documents
Culture Documents
Motion Planning For Quadrupedal Locomotion: Coupled Planning, Terrain Mapping and Whole-Body Control
Motion Planning For Quadrupedal Locomotion: Coupled Planning, Terrain Mapping and Whole-Body Control
Motion Planning For Quadrupedal Locomotion: Coupled Planning, Terrain Mapping and Whole-Body Control
Abstract—Planning whole-body motions while taking into ac- motions while considering the terrain conditions. Due to this
count the terrain conditions is a challenging problem for legged complexity, many legged locomotion approaches focus on
arXiv:2003.05481v4 [cs.RO] 27 Jun 2020
robots since the terrain model might produce many local minima. terrain-blind methods with instantaneous actions [e.g. 1, 2, 3].
Our coupled planning method uses stochastic and derivatives-free
search to plan both foothold locations and horizontal motions due These heuristic approaches assume that reactive actions are
to the local minima produced by the terrain model. It jointly enough to ensure the robot stability under unperceived terrain
optimizes body motion, step duration and foothold selection, and conditions. Unfortunately, these approaches cannot tackle all
it models the terrain as a cost-map. Due to the novel attitude types of terrain, in particular terrains with big discontinuities.
planning method, the horizontal motion plans can be applied Such difficulties have limited the use of legged systems to
to various terrain conditions. The attitude planner ensures the
robot stability by imposing limits to the angular acceleration. specific terrain topologies.
Our whole-body controller tracks compliantly trunk motions Trajectory optimization with contacts has gained attention in
while avoiding slippage, as well as kinematic and torque limits. the legged robotics community [4, 5, 6]. It aims to overcome
Despite the use of a simplified model, which is restricted to flat the drawbacks of terrain-blind approaches by considering a
terrain, our approach shows remarkable capability to deal with horizon of future events (e.g. body movements and foothold
a wide range of non-coplanar terrains. The results are validated
by experimental trials and comparative evaluations in a series of locations). It could potentially improve the robot stability
terrains of progressively increasing complexity. given a certain terrain. However, in spite of the promising
benefits, most of the works are focused on flat conditions
Index Terms—legged locomotion, trajectory optimization, chal-
lenging terrain, whole-body control and terrain mapping or on simulation. For instance, these trajectory optimization
methods do not incorporate any terrain-risk model. This model
serves to quantify the footstep difficulty and uncertainty.
I. I NTRODUCTION Nonetheless, it is not yet clear how to properly incorporate
The remainder of the paper is structured as follows: after we briefly describe our previous decoupled planning method,
discussing previous research in the field of dynamic whole- which will be used as baseline to compare against our new
body locomotion (Section II) we briefly describe our decou- coupled planning method.
pled planner method, which we use for comparison. Next,
we introduce our locomotion framework in Section III. We A. Decoupled planning
describe our coupled planning method (Section IV) and how
In our previous decoupled planning locomotion frame-
the terrain model is formulated in our trajectory optimization.
work [10, 11], the sequence of footholds was selected by
Section V briefly describes a controller designed for dynamic
computing an approximate body path. It builds body-state
motions. This controller improves the tracking performance
graph that quantifies the cost given a set of primitive actions
and the robustness of the locomotion by passivity-based con-
towards a goal. Then, it chooses locally the locations of the
trol paradigm. In Section VI, VII we evaluate the performance
footholds. Finally, it generates a body trajectory that ensures
of our locomotion framework, and provide comparison with
dynamic stability and achieves the planned foothold sequence.
our decoupled planner, in real-world experimental trials and
For that, we used two fifth-order polynomials to describe
simulations. Finally, Section VIII summarizes this work.
the horizontal Center of Mass (CoM) motion. The stable
horizontal motion is computed using a cart-table model. For
II. R ELATED WORK more details the reader can refer to [10, 11].
In environments where smooth and continuous support is
available (floors, fields, roads, etc.), exact foot placement III. L OCOMOTION F RAMEWORK
is not crucial in the locomotion process. Typically, legged In this section, we give an overview of the main components
robots are free to move with a gaited strategy, which only of our locomotion framework (Section III-B), after a quick
considers the balancing problem. The early work of Marc description of the HyQ robot (Section III-A).
Raibert [12] crystallized these principles of dynamic loco-
motion and balancing. Going beyond the flat terrain, the A. The HyQ robot
Spot and SpotMini quadrupeds are a recent extension of this
HyQ is a 85 kg hydraulically actuated quadruped robot [30].
work. While SpotMini is able to traverse irregular terrain
It is fully torque-controlled and equipped with precision joint
using a reactive controller, we believe that (as there is no
encoders, a depth camera (Asus Xtion), a MultiSense SL
published work) the footholds are not planned in advance.
sensor and an Inertial Measurement Unit (MicroStrain). HyQ
Similar performance can be seen on the Hydraulically actuated
measures approx. 1.0 m×0.5 m×0.98 m (length × width ×
Quadruped (HyQ) robot, that is able to overcome obstacles
height). The leg extension length ranges from 0.339-0.789 m
with reactive controllers [2, 13] and/or step reflexes [14, 15].
and the hip-to-hip distance is 0.75 m (in the sagittal plane).
The main limitation of those gaited approaches is that
It has two onboard computers: a Intel i5 processor with Real
they quickly reach the robot limits (e.g. torque limits) in
Time (RT) Linux (Xenomai) patch, and a Intel i5 processor
environments with complex geometry: large gaps, stairs or
with Linux. The Xenomai PC handles the low-level control
rubble, etc. Furthermore, in these environments, the robot often
(hydraulic-actuator control) at 1 kHz and communicates with
can afford only few possible discrete footholds. Reason why it
the proprioceptive sensors through EtherCAT boards. Addi-
is important to carefully select footholds that do not impose a
tionally, this PC runs the high-level (whole-body) controller at
particular gaited strategy. Towards this direction, the DARPA
250 Hz. Both RT threads (i.e. low- and high- level controllers)
Learning Locomotion Challenge stimulated the development
communicate through shared memory. On the other hand, the
of strategies that handle a variety of terrain conditions. It
non-RT PC processes the exteroceptive sensors to generate the
resulted in a number of successful control architectures [7, 8]
terrain map and then compute the plans. These motion plans
that plan [16, 17, 18] and execute footsteps [19] in a pre-
are sent to the whole-body controller (i.e. the RT PC) through
defined set of challenging terrains. Roughly speaking, these
a RT-friendly communication.
approaches are able to compute foothold locations by using
tree-search algorithms, and to learn the terrain cost-map from
user demonstrations [20]. B. Framework components
Legged locomotion can also be formulated as an optimal Our locomotion framework is composed by three main
control problem. However, most works do not consider the modules: motion planning, whole-body control, and mapping
contact location and timings [e.g. 21, 22, 23] due to the and estimation (Fig. 1). The horizon optimization computes
requirement of having a smooth formulation. The contact the CoM motion and footholds to satisfy robot stability and to
location and timings are often planned using heuristic rules deal with terrain conditions (Section IV-C). To handle terrain
with partial guarantee of dynamic feasibility [24, 25]. Using heights, our attitude planner adapts the trunk orientation as
these rules, it is possible to avoid the combinatorial complexity described in Section IV-A1b. We build onboard a terrain cost-
and the excessive computation time of more formal approaches map and height-map which are used by the horizontal and
([e.g. 26, 27, 28, 29]). Even though recent works have reduced attitude planners, respectively (Section IV-B).
the computation time by a few orders of magnitude [e.g. The whole-body controller has been designed to compliantly
5, 6], they are still limited to offline planning and they require track motion plans (Section V). It consists of a virtual model,
a convex model of the terrain. In the following subsection, a joint impedance controller and a whole-body optimization.
MASTALLI et al.: MOTION PLANNING FOR QUADRUPEDAL LOCOMOTION: COUPLED PLANNING, TERRAIN MAPPING AND WHOLE-BODY CONTROL 3
Horizontal Joint
Motion IK
Impedance
optimization
Planning
CoM
motion
Whole-body Torque Torque
Optimization Mapping + Control
Virtual
Foot
Locations Model +
terrain
heigh-map Gravity Whole-Body Mapping and Leg
H Compensation Control Estimation Odometry
Attitude
Terrain
Planning mapping
terrain Visual
cost-map odometry
Fig. 1: Overview of the locomotion framework. Horizontal optimization computes simultaneously the CoM and footholds
(xdcom , ẋdcom , xdsw , ẋdsw ) given a terrain cost-map T. The attitude planner adapts the trunk orientation (Rd , ω d , ω̇ d ) given a
terrain height-map H. This results in a stable motion that is tracked by the whole-body controller. The virtual model allows
us to compliantly track the desired CoM motion, and the controller computes an instantaneous whole-body motion (q̈∗ , λ∗ )
that satisfies all robot constraints. The optimized motion is then mapped into desired feed-forward torques τ d . To address
unpredictable events, we use a joint impedance controller with low stiffness which tracks the desired joint commands (qdj , q̇dj ).
Finally, a state estimator provides an estimation of the trunk pose and twist (xcom , ẋcom , ω, ω̇). It uses IMU and kinematics
(leg odometry), and stereo vision (visual odometry).
is computed through piece-wise functions that resemble the C. Horizontal trajectory optimization
log-barrier constraint as described in the equations of Sec- The trajectory optimization computes an optimal sequence
tion IV-B2, IV-B3. With this, we have extended the off-the- of control parameters U∗ used for the generation of the hor-
shelf solver to address constrained problems (see Section IV-C izontal trajectories for the CoM (Section IV-A). We compute
for more details). Below the log-barrier function for each the entire plan by solving a finite-horizon trajectory optimiza-
feature is described. tion problem for each phase (similarly to a receding horizon
1) Height deviation: The height deviation cost penalizes strategy). The horizon is described by a predefined number
footholds close to large drop-offs; for instance, this cost of preview schedules N with n phases (e.g. our locomotion
is important for crossing gaps or stepping stones. In fact, cycle (schedule) has 6 phases). Our method presents several
staying far away from large drop-offs is beneficial because advantages to address challenging terrain locomotion. It en-
inaccuracies in the execution of footsteps can cause the robot ables the robot to generate desired behaviors that anticipate
to step into gaps or banned areas. The height deviation feature future terrain conditions, which results in smoother transitions
fh is computed using the standard deviation around a defined between phases. Note that the optimal solution U∗ is defined
neighborhood. as explained in Section IV-A2.
2) Slope: The slope reflects the local surface normal in a Compared with [34], our trajectory optimization method 1)
neighborhood around the cell. The normals are computed using uses a terrain cost-map model for foothold selection, 2) defines
Principal Component Analysis (PCA) on the set of nearest non-linear inequalities constraints for the CoP position, and
neighbors. A high slope value will increase the chance of 3) guarantees the robot stability against changes in the terrain
slipping even in cases where the friction cone is considered, elevation. Additionally, in contrast to [34], we have defined
e.g. due to inaccuracies in the friction coefficient or estimated a single cost function that tracks desired walking velocities,
surface normal. Slope cost increases for larger slope values, without enforcing the tracking of a specific step time and
while small slopes have zero cost as they are approximately distance.
flat. We consider the worst possible slope smax occurs when 1) Problem formulation: Given an initial state s0 , we
the terrain is very steep (approximately 70◦ )4 . optimize a sequence of control parameters inside a predefined
We map the height deviation and slope features f into cost horizon, and apply only the optimal control of the current
values through the following piece-wise function: phase. Given the desired user commands (trunk velocities), a
sequence of control parameters U∗ are computed solving an
0 f ≤ ff lat unconstrained optimization problem:
f (x,y)
Tf (x, y) = − ln 1 − fmax −ff lat ff lat < f < fmax U∗ = argmin
X
ωj gj (s0 , U, r). (9)
f ≥ fmax U
T
max j
when dealing with dynamic motions (as shown in Section VI). we demonstrate the capabilities of our complete locomotion
In these cases, the effect of the leg dynamics is no longer framework (i.e. coupled planner, whole-body controller, terrain
negligible and must be considered to achieve good tracking. mapping and state estimation) by crossing terrains with various
With this whole-body controller, the robot achieves faster slopes and obstacles. All the experimental results are in the
dynamic motions in real-time, see [42], when compared with accompanying video or in Youtube5 .
our previous quasi-static controller [15]. The block diagram
of the trunk controller is shown in Fig. 1. A virtual model
A. Motion planning: decoupled vs coupled approach
generates the reference centroidal wrench Wimp necessary
to track the reference trajectories. The problem is formu- 1) Decoupled planner setup: The swing and stance du-
lated as a Quadratic Programming (QP) with the generalized ration are predefined since they cannot be optimized. The
accelerations and contact forces as decision variables, i.e. footstep planner explores partially a set of candidate footholds
x = [q̈T , λT ]T ∈ R6+n+3nl where nl is the number of end- using the terrain-aware heuristic function [11]. These duration
effectors in contact: are tuned for every terrain and, range from 0.5 to 0.7 s
and from 0.05 to 1.4 s for the swing and stance6 phases,
x∗ = argmin kWcom − Wcom
d
k2Q + kxk2R
x=(q̈,λ) respectively. However, it is not always possible to compute a
set of polynomial’s coefficients (CoM trajectory) that satisfies
s.t. Mq̈ + h = Sτ + JTc λ, (dynamics)
the dynamic stability for some footstep sequences. Thus,
Jc q̈ + J̇c q̇ = 0, (stance) (15) unfortunately, step and swing duration need to be hand-tuned
Rλ ≤ r, (friction) depending on the footstep sequence itself.
¯ ≤ q̈ ≤ q̈,
q̈ (kinematics) 2) Coupled planner setup: The same weight values for the
cost functions are used for all the results presented in Table I
τ̄ ≤ Mq̈ + h − JTc λ ≤ τ , (torque)
(i.e. 300, 30 and 10 for the human velocity commands, terrain,
where M is the joint-space inertial matrix, Jc is the contact Ja- and energy, respectively). We did not re-tune these weights
cobian, h is the force vector that accounts Coriolis, centrifugal, for a different experiment, as it was sometimes necessary
and gravitational forces, (R, r) describe the linearized friction with the decoupled planner. This shows a greater generality
¯ q̈) are acceleration bounds defined given the current
cone, (q̈, with respect to the decoupled planner. We add a quadratic
robot configuration [42], and (τ̄ , τ ) describe the torque limits. penalization, when the terrain cost T(x, y) is higher then 0.8
The first term of the cost function (15) penalizes the (i.e. 80% of its maximum value).
tracking error at the wrench level, while the second one is a For this kind of problems, it is not trivial to define a
regularization factor to keep the solution bounded or to pursue good initialization trajectory (i.e. to warm-start the optimizer).
additional criteria. Both costs are quadratic-weighted terms. However, since our solver uses stochastic search, this is not
All the constraints are linear: the equality constraints encode so critical and we decided not to do it. We used the same
dynamic consistency, the stance condition and the swing task. stability margin and angular acceleration (as in Section VI-B)
While the inequality constraints encode friction, torque, and for the trunk attitude planner, and the horizon is N = 1, i.e.
kinematic limits. Then the optimal acceleration and forces x∗ 1 locomotion cycle or 4 steps7 .
are mapped into desired feed-forward joint torques τfdf ∈ Rn 3) Increment of the success rate: The foothold error is on
using the actuated part of the full dynamics. Finally, the feed- average around 2 cm, half than in the decoupled planner case.
forward torques τfdf are summed with the joint PD torques (i.e. Note that these results are obtained with the state estimation
feedback torques τf b ) to form the desired torque command τ d , algorithm proposed in [33]. The coupled planner dramatically
which is sent to a low-level joint-torque controller. increases the success of the stepping stones trials to 90%;
A terrain mapping module provides, as inputs to the whole- up over 30% with respect to the decoupled planner [10]. We
body controller, an estimate of the friction coefficient µ and define as success when the robot crosses the terrain, e.g. it does
of the normal to the terrain n̂t at each contact location [45]. not make a step in the gap, and does not reach its torque and
Finally a state estimation module fuses inertial, visual and kinematic limits. In Table I we report the number of footholds,
odometry information to get the current floating-base position the average trunk speed, and the ELC for simulations made
and velocity w.r.t. the inertial frame [33]. with our coupled and decoupled planners in different challeng-
ing terrains. The coupled planner also increases the walking
VI. E XPERIMENTAL R ESULTS velocity of least 14% and up to 63%, while also modulating
To understand the advantages of our locomotion framework, the trunk attitude. The number of footholds is also reduced of
we first compare the decoupled and coupled approaches in 14% on average. Jointly optimizing the motion and footholds
different challenging terrains (e.g. stepping stones, pallet, reduces the number of steps because it considers the robot
stairs and gap). We use as test-cases for the comparison the dynamics for the foothold selection. Note that the trunk speed
decoupled planner presented in [10]. After that, we show how and the success rate increased even with terrain elevation
the modulation of the trunk attitude handles heights variations changes (e.g. gap and stepping stones). The ELC is higher
in the terrain. Subsequently, we analyze the effect of the 5 https://youtu.be/KI9x1GZWRwE
terrain cost-map in our coupled planner. We study how dif- 6 In this work, with stance phase, we refer to the case when the robot has
ferent weighting choices result in different behaviors without all the feet on the ground.
affecting significantly the robot stability and the ELC. Finally 7 As mentioned early, we define 6 phases which 2 of them are stance ones.
MASTALLI et al.: MOTION PLANNING FOR QUADRUPEDAL LOCOMOTION: COUPLED PLANNING, TERRAIN MAPPING AND WHOLE-BODY CONTROL 9
TABLE I: Number of footholds, average walking speed and normalized ELC for different challenging terrains without changes
in the elevation for our coupled (Coup.) and decoupled (Dec.) planners. We normalize the ELC with respect to the walking
velocity to easily compare results across different motion speeds. All the results are computed from simulations.
Fig. 6: Snapshots of experimental trials used to evaluate the performance of our trajectory optimization framework. (a) crossing
a gap of 25 cm length while climbing up 6 cm. (b) crossing a gap of 25 cm length while climbing down 12 cm. (c) crossing a
set of 7 stepping stones. (d) crossing a sparse set of stepping stones with different elevations (6 cm). To watch the video, click
the figure.
for our coupled planner; however, this is an effect of higher 5) Crossing challenging terrains: Trunk attitude adaptation
walking velocities and of the tuning of the cost function. This tends to overextend the legs, especially in challenging terrains,
is expected even if we normalized the ELC with respect to as bigger motions are required. To avoid kinematic limits, we
the walking velocity. Note that as velocity increases the kinetic define a foot search region. This ensures kinematic feasibility
energy rises quadratically with a consequent affect on the ELC. for terrain height difference of up to 12 cm (coupled planner),
We also found that the tuning of the ELC cost does not affect in Fig. 6a, b. Note that in the decoupled case we had to define
the stability and the foothold selection. a more conservative foot search region (i.e. in the footstep
planner) than in the coupled one, making very challenging to
4) Computation time: An important drawback of including cross gaps or stepping stones with height variations. Indeed,
the terrain cost-map is that it increases substantially the com- crossing the terrain in Fig. 6a-d is only possible using the
putation time. In fact for our planners, this increases from 2- coupled planner since we managed to increase the foothold
3 s to 10-15 min, for more details about the computation region from (20 cm×23.5 cm) to (34 cm×28 cm). Note that the
time of the decoupled planner see [11]. The main reason is decoupled planning requires smaller foothold regions due to
that we use a stochastic search which estimates the gradient the fact that only considers the robot’s kinematics.
(see [41]). Instead, for the decoupled planner, we use a tree-
search algorithm (i.e. Anytime Repairing A* (ARA*)) with a
heuristic function that guides the solution towards a shortest
path, not the safest one, which allows us to formulate the CoM For all our optimizations, we define a stability margin of
motion planning through a QP program. For more details about r =0.1 m (introduced in Section IV-A1b) which is a good
the footstep planner, used in the decoupled planning approach, trade-off between modeling error and allowed trunk attitude
see [11]. adjustment.
10 IEEE TRANSACTIONS ON ROBOTICS. PREPRINT VERSION. ACCEPTED JUNE, 2020
Fig. 10: Crossing a terrain that combines elements of the previous cases; first a ramp of 10 degree, then a gap of 15 cm and
finally a step with 15 cm height change. Execution of the planned motion with the HyQ robot (top). Visualization of the terrain
cost-map, friction cone and Ground Reaction Forces (GRFs) (bottom). The color for the friction cone and GRFs are magenta
and purple, respectively. To watch the video, click the figure.
0.3
actual
0.2 desired
y (m)
0.1
0.0
0.1
0.2
0.0 0.5 1.0 1.5 2.0 2.5
x (m)
150 limits
100 command
RF HFE (Nm)
50
0
50
100
150
0 5 10 15 20 25 30
t (sec)
Fig. 12: Optimizing 208 steps from 52 trajectory optimization
Fig. 11: HyQ crossing a terrain that combines elements of problems. In each optimization problem the step timing,
all previous cases. (Top): CoM tracking performance, desired foothold locations and CoM trajectory for 4 steps in advance
(blue) and executed (black) motions. (Bottom): applied torque with different velocity commands are computed. The color
command along the course of the motion. At t =14 sec, the describes different step phases of the planned motion.
planned motion produced a movement that reached the torque
limits; however, the controller applies a torque command
inside the robot’s limits. In fact, the tracking error increases
at approximately x =1.25 m, and is reduced in the next steps. C. The effect of the terrain cost-map
D. Considering terrain with slopes controller to avoid slippage on the trialled terrain surfaces.
Higher walking speed increases the probability of foot- We presented results, validated by experimental trials and
slippage. When one or more of the feet slip backwards, comparative evaluations, in a series of terrains of progressively
or when a foot is only slightly loaded, might result in a increasing complexity.
poor tracking. Both events are more likely to happen in a
R EFERENCES
terrain with different elevations due to errors in the state
estimation or noise in the exteroceptive sensors. Including [1] P. M. Wensing and D. E. Orin, “Generation of dy-
friction-cone and foot unloading / loading constraints, in the namic humanoid behaviors through task-space control
whole-body optimization, has been shown to help mitigate the with conic optimization,” in IEEE Int. Conf. Rob. Autom.
poor tracking. We demonstrated experimentally that is possible (ICRA), 2013.
to navigate a wide range of terrain slopes without considering [2] V. Barasuol, J. Buchli, C. Semini, M. Frigerio, E. R.
the friction cone stability in the planning level (only at the De Pieri, and D. G. Caldwell, “A Reactive Controller
controller level). Framework for Quadrupedal Locomotion on Challenging
Terrain,” in IEEE Int. Conf. Rob. Autom. (ICRA), 2013.
[3] C. Dario Bellicoso, F. Jenelten, P. Fankhauser,
E. Terrain mapping and state estimation
C. Gehring, J. Hwangbo, and M. Hutter, “Dynamic
Estimating the state of the robot with a level of accuracy locomotion and whole-body control for quadrupedal
suitable for planned motions is a challenging task. Reliable robots,” in IEEE/RSJ Int. Conf. Intell. Rob. Sys. (IROS),
state estimation is crucial, as accurate foot placement directly 2017.
depends on the robot’s base pose estimate. The estimate is also [4] H. Dai and R. Tedrake, “Planning Robust Walking Mo-
used to compute the desired torque commands through a vir- tion on Uneven Terrain via Convex Optimization,” in
tual model. The major sources of error for inertial-legged state IEEE Int. Conf. Hum. Rob. (ICHR), 2016.
estimation are IMU gyro bias and foot slippage. These produce [5] B. Aceituno-Cabezas, C. Mastalli, H. Dai, M. Focchi,
a pose estimate drift, which cannot be completely eliminated A. Radulescu, D. G. Caldwell, J. Cappelletto, J. C.
by the contact state estimate [32] (or with proprioceptive Grieco, G. Fernandez-Lopez, and C. Semini, “Simulta-
sensing). Pose drift particularly affects the desired torques neous Contact, Gait and Motion Planning for Robust
computed from the whole-body optimization. To eliminate Multi-Legged Locomotion via Mixed-Integer Convex
it, we fused high frequency (1 kHz) proprioceptive sources Optimization,” IEEE Robot. Automat. Lett. (RA-L), 2017.
(inertial and leg odometry) with low frequency exteroceptive [6] A. W. Winkler, D. C. Bellicoso, M. Hutter, and
updates (0.5 Hz for LiDAR registration, 10 Hz for optical J. Buchli, “Gait and trajectory optimization for legged
flow) in a combined Extended Kalman Filter [33]. We noticed systems through phase-based end-effector parameteriza-
that the drift accumulated, in between the high frequency tion,” IEEE Robot. Automat. Lett. (RA-L), vol. 3, 2018.
proprioceptive updates and the low frequency exteroceptive [7] J. Z. Kolter, M. P. Rodgers, and A. Y. Ng, “A control ar-
updates, affected the experimental performance. In practice, chitecture for quadruped locomotion over rough terrain,”
to cope with this problem, we reduced the compliance of our in IEEE Int. Conf. Rob. Autom. (ICRA), 2008.
whole-body controller (by increasing the proportional gains). [8] M. Kalakrishnan, J. Buchli, P. Pastor, M. Mistry,
and S. Schaal, “Learning, planning, and control for
VIII. C ONCLUSION quadruped locomotion over challenging terrain,” The Int.
In this paper, we presented a new framework for dynamic J. of Rob. Res. (IJRR), vol. 30, 2010.
whole-body locomotion on challenging terrain. We extended [9] C. Mastalli, M. Focchi, I. Havoutis, A. Radulescu,
our previous planning approach from [9] by modeling the S. Calinon, J. Buchli, D. G. Caldwell, and C. Sem-
terrain through log-barrier functions in the numerical optimiza- ini, “Trajectory and Foothold Optimization using Low-
tion. In addition, we proposed a novel robot attitude planning Dimensional Models for Rough Terrain Locomotion,” in
algorithm. Using this, we could optimize both the CoM motion IEEE Int. Conf. Rob. Autom. (ICRA), 2017, pp. 236–258.
and footholds in the horizontal frame, and allow the robot to [10] A. Winkler, C. Mastalli, I. Havoutis, M. Focchi, D. G.
adapt its trunk orientation. We demonstrated in experimental Caldwell, and C. Semini, “Planning and Execution
trials and simulations that the assumptions on the attitude of Dynamic Whole-Body Locomotion for a Hydraulic
planner avoided instability under significant terrain elevation Quadruped on Challenging Terrain,” in IEEE Int. Conf.
changes (up to 12 cm). We compared coupled and decoupled Rob. Autom. (ICRA), 2015.
planning and highlighted the advantages and disadvantages of [11] C. Mastalli, A. Winkler, I. Havoutis, D. G. Caldwell,
them. In our test-case planners, we used the same method and C. Semini, “On-line and On-board Planning and
for quantifying the terrain difficulty (i.e. terrain cost-map). Perception for Quadrupedal Locomotion,” in IEEE Conf.
We showed that reduced models for motion planning (such on Techn. for Pract. Rob. Apps. (TEPRA), 2015.
as cart-table with flywheel) together with whole-body control [12] M. H. Raibert, Legged robots that balance. MIT press
are still suitable for a wide range of challenging scenarios. Cambridge, MA, 1986, vol. 3.
We used the full dynamic model only in our real-time whole- [13] I. Havoutis, J. Ortiz, S. Bazeille, V. Barasuol, C. Semini,
body controller to avoid slippage, and hitting torque and and D. G. Caldwell, “Onboard Perception-Based Trot-
kinematic limits. The online terrain mapping allowed our ting and Crawling with the Hydraulic Quadruped Robot
14 IEEE TRANSACTIONS ON ROBOTICS. PREPRINT VERSION. ACCEPTED JUNE, 2020
contact balancing of humanoid robots in confined spaces: Michele Focchi received the B.Sc. and M.Sc. de-
Utilizing knee contacts,” in IEEE/RSJ Int. Conf. Intell. grees in control system engineering from Politec-
nico di Milano, Milan, Italy, in 2004 and 2007,
Rob. Sys. (IROS), 2017. respectively, and the Ph.D. degree in robotics in
[45] C. Mastalli, I. Havoutis, M. Focchi, and C. Semini, 2013.
“Terrain mapping for legged locomotion over challenging He is currently a Researcher at the Advanced
Robotics department, Istituto Italiano di Tecnologia.
terrain,” Istituto Italiano di Tecnologia (IIT), Tech. Rep., He received both the B.Sc. and the M.Sc. in Control
2019. System Engineering from Politecnico di Milano in
[46] G. Nava, F. Romano, F. Nori, and D. Pucci, “Stability 2004 and 2007, respectively. In 2009 he started to
develop a novel concept of air-pressure driven micro-
analysis and design of momentum-based controllers for turbine for power generation in which he obtained an international patent.
humanoid robots,” in IEEE/RSJ Int. Conf. Intell. Rob. In 2013, he got a Ph.D. degree in Robotics, where he developed low-level
Sys. (IROS), 2016. controllers for the Hydraulically Actuated Quadruped (HyQ) robot. Currently
his research interests are dynamic planning, optimization, locomotion in
[47] P.-B. Wieber, “Holonomy and nonholonomy in the dy- unstructured/cluttered environments, stair climbing, model identification and
namics of articulated motion,” in Proc. on Fast Mot. in whole-body control.
Bio. Rob., 2005. Darwin G. Caldwell (Senior Member, IEEE) re-
[48] A. Herzog, S. Schaal, and L. Righetti, “Structured contact ceived the B.Sc. and Ph.D. degrees in robotics from
force optimization for kino-dynamic motion generation,” University of Hull, Hull, U.K., in 1986 and 1990,
respectively, and the M.Sc. degree in management
in IEEE/RSJ Int. Conf. Intell. Rob. Sys. (IROS), 2016. from University of Salford, Salford, U.K., in 1996.
[49] C. Mastalli, R. Budhiraja, W. Merkt, G. Saurel, B. Ham- He is a founding Director at the Istituto Italiano
moud, M. Naveau, J. Carpentier, L. Righetti, S. Vijayaku- di Tecnologia in Genoa, Italy, and a Honorary Pro-
fessor at the Universities of Sheffield, Manchester,
mar, and N. Mansard, “Crocoddyl: An Efficient and Bangor, Kings College, London and Tianjin Univer-
Versatile Framework for Multi-Contact Optimal Control,” sity China. His research interests include innovative
in IEEE Int. Conf. Rob. Autom. (ICRA), 2020. actuators, humanoid and quadrupedal robotics and
locomotion (iCub, HyQ and COMAN), haptic feedback, force augmentation
exoskeletons, dexterous manipulators, biomimetic systems, rehabilitation and
surgical robotics, telepresence and teleoperation procedures. He is the author
or co-author of over 450 academic papers, and 17 patents and has received
awards and nominations from several international journals and conferences.