Professional Documents
Culture Documents
NMBU Atas Finished
NMBU Atas Finished
Traversability-guided Planning
for Efficient Robot Navigation
in Uneven Terrains
Fetullah Atas
Traversability-guided Planning for Efficient
Robot Navigation in Uneven Terrains
Fetullah Atas
Ås 2024
iii
Sammendrag
Denne doktorgradsavhandlingen presenterer flere nye bidrag til robotnavigasjon
i ujevnt terreng, og legger vekt på utviklingen av innovative tilnærminger
innen bevegelsesplanlegging, miljørepresentasjon, estimering av kryssbarhet
og kinodynamisk bevegelsesplanlegging. Forskningen fokuserer på den kritiske
viktigheten av å effektivt navigere i ujevnt terreng, en utfordring spesielt relevant
for utendørsroboter som opererer under strenge dynamiske begrensninger som
styrings-, hastighets- og akselerasjonsgrenser.
Et bemerkelsesverdig bidrag fra denne avhandlingen er utnyttelse av
terrenginformasjon for optimal bevegelsesplanlegging i utfordrende miljøer.
Arbeidet bryter ny mark utover tradisjonell geometrisk bevegelsesplanlegging,
og går dypt inn i kinodynamisk bevegelsesplanlegging, og adresserer den
reduserte manøvrerbarheten forårsaket av dynamiske begrensninger (f.eks.
hastighet eller dreiemoment) og forbedrer sikkerheten og effektiviteten til
utendørs robotsystemer. En annen nøkkelinnovasjon i denne oppgaven, er
utviklingen av en dynamisk traversabilitetsstimeringsmetode basert på nevrale
nettverk. Denne tilnærmingen er skreddersydd til robotens unike evner og
orientering, og utnytter dens navigasjonsopplevelser for læring. Utnyttelse av
disse traversibilitetsestimatene resulterer i mer presise og pålitelige kollisjonsfrie
bevegelsesplaner og effektive kontrollmekanismer, noe som betydelig forbedrer
sikkerheten og effektiviteten til komplekse navigasjonsoppgaver.
Videre introduserer oppgaven et omfattende navigasjonsrammeverk med et
standardisert, plugin-basert grensesnitt. Dette rammeverket letter integreringen
av nye algoritmer og sikrer sømløs interoperabilitet mellom forskjellige kom-
ponenter, for eksempel kontrollere, bevegelsesplanleggere og kollisjonssjekkere.
Denne helhetlige tilnærmingen validerer de foreslåtte metodikkenes effektivitet
i scenarier i den virkelige verden og legger grunnlaget for fremtidig forskning
innen robotnavigasjon gjennom komplekse og ujevnt terreng.
Samlet sett bidrar denne avhandlingen til feltet robotnavigasjon ved å tilby
ny innsikt, metoder og et allsidig rammeverk som forbedrer tilpasningsevne,
presisjon og respons. Eksperimentene og valideringen av algoritmene presentert
i denne oppgaven ble utført ved hjelp av to distinkte robotplattformer. Disse
plattformene ble testet på tvers av et mangfold av miljøer, og sikret en omfattende
evaluering av algoritmenes ytelse og anvendelighet i ulike scenarier i den virkelige
verden.
v
Preface
This thesis is hereby submitted in partial fulfillment of the requirements for
the degree of Philosophiae Doctor (Ph.D.) at the Norwegian University of
Life Sciences (NMBU). The research presented was undertaken at the NMBU
Robotics and Control Lab, a constituent of the Faculty of Science and Technology
(REALTEK). Spanning from December 2020 to December 2023, this research
period was marked by studies and experimental work, all aimed at contributing
to the advancement of robot navigation techniques in uneven terrains.
This thesis is systematically organized into several chapters. It starts with
an introductory chapter that sets the stage for the research, followed by a
background chapter that lays the context of the thesis. The subsequent sections
are dedicated to explaining the methodologies employed throughout the research.
This is followed by a summary of six research papers that constitute the core of
the thesis. The majority of these papers have undergone a stringent peer-review
process and have been published, with two papers currently under review as of the
writing of this thesis. In the discussion chapter, there is a critical evaluation of
the research findings, where limitations are acknowledged, and potential avenues
for future research are identified. The final chapter encapsulates concluding
remarks and outlines promising future work, building upon the methodologies
and discoveries presented in this thesis.
vii
Acknowledgements
I extend my profound gratitude to my supervisors, Grzegorz Cielniak and
Lars Grimstad, for their invaluable guidance and support. Their expertise and
insightful perspectives have been pivotal in shaping the contributions of this
thesis. The enlightening discussions and productive meetings we shared over the
past three years have not only inspired me but also played a crucial role in the
fruition of my research. Their mentorship has been a significant factor in my
academic journey, for which I am deeply thankful. A heartfelt thanks to both of
you.
I wish to express my deepest gratitude to my family, whose unwavering belief
in me was a beacon of hope, especially during moments when challenges seemed
insurmountable. Their enduring support and faith have been fundamental to
my perseverance and success.
Additionally, I extend my sincere thanks to all my colleagues at NMBU
Robotics Lab. for their assistance and camaraderie throughout this journey.
Their contributions, both big and small, have been invaluable in facilitating my
research and enriching my academic experience.
Last but not least, I wish to express my profound love for the mountains
of Kurdistan, the place where I grew up and to which I am deeply connected.
These majestic landscapes have taught me the essence of resilience and courage,
shaping the person I have become. Their enduring strength and beauty have
been a constant source of inspiration and guidance throughout my journey.
Fetullah Atas
February, 2024
ix
List of Papers
Paper I
Fetullah Atas, Grzegorz Cielniak, Lars Grimstad. “Benchmark of Sampling-Based
Optimizing Planners for Outdoor Robot Navigation”. In: Intelligent Autonomous
Systems 17 (IAS). Vol. 577, June 2022, DOI: 10.1007/978-3-031-22216-0_16.
Paper II
Fetullah Atas, Grzegorz Cielniak, Lars Grimstad “Elevation State-Space: Surfel-
Based Navigation in Uneven Environments for Mobile Robots” In: IEEE/RSJ
International Conference on Intelligent Robots and Systems (IROS), 2022, DOI:
10.1109/IROS47612.2022.9981647
Paper III
Fetullah Atas, Grzegorz Cielniak, Lars Grimstad “Navigating in 3D Uneven
Environments Through Supervoxels and Nonlinear MPC” In: European
Conference on Mobile Robots (ECMR) DOI: 10.1109/ECMR59166.2023.10256342
Paper IV
Fetullah Atas, Grzegorz Cielniak, Lars Grimstad “CostTrust: A Fast-Exploring,
Iteratively Expanding Frontier-Based Kinodynamic Motion Planner” Submitted
for Journal of Modeling, Identification and Control (MIC), 2024.
Paper V
Fetullah Atas, Grzegorz Cielniak, Lars Grimstad “From Simulation to Field:
Learning Terrain Traversability for Real-World Deployment” Submitted for
Robotics and Autonomous System (RAS) Journal, 2023.
Paper VI
Fetullah Atas, Grzegorz Cielniak, Lars Grimstad “Enabling Robot Autonomy
through a Modular Software Framework” In: ICRA Workshop on Robotic
Software Architectures, 2023.
xi
Contents
Abstract iii
Sammendrag v
Preface vii
Acknowledgements ix
List of Papers xi
Contents xiii
1 Introduction 1
1.1 Graphical Abstract . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Research Questions . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.5 The Papers and Thesis Outline . . . . . . . . . . . . . . . 6
2 Background 9
2.1 Motion Planning . . . . . . . . . . . . . . . . . . . . . . . . 9
2.2 Environmental Representation . . . . . . . . . . . . . . . . 14
2.3 Traversablity Estimation . . . . . . . . . . . . . . . . . . . 16
2.4 Emerging Approaches in Uneven Terrain Navigation . . . . 18
3 Methodological Framework 21
3.1 Explanation of Research Design . . . . . . . . . . . . . . . 21
3.2 Robot Platforms and Data . . . . . . . . . . . . . . . . . . 21
3.3 Environments and Simulations . . . . . . . . . . . . . . . . 23
3.4 Application Scenarios . . . . . . . . . . . . . . . . . . . . . 24
3.4.1 Precision Agriculture on Hilly Farms . . . . . . . 24
3.4.2 Dense Forest Exploration . . . . . . . . . . . . . 24
3.4.3 Delivery Robots in Scarcely Populated Towns . . 24
3.5 Software Tools . . . . . . . . . . . . . . . . . . . . . . . . . 25
3.6 Limitations . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4 Paper Summaries 27
xiii
Contents
5 Discussion 33
5.1 The Motion Planning Problem in Outdoor Uneven Terrains 33
5.2 The Need for Exploitation of Terrain . . . . . . . . . . . . 33
5.3 The Adaptability of Different Robot Platforms to Uneven
Terrains . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
5.4 Kinodynamic Motion Planning for Uneven Terrains . . . . 34
5.5 Going Beyond Geometrical or Visual-based Traversability
Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
5.6 A Holistic Approach to Uneven Terrain Navigation . . . . 35
Papers 42
xiv
Contents
Bibliography 141
xv
List of Figures
xvii
List of Figures
3.1 The figure displays two robotic platforms: the Thorvald II robot
on the left and the Robotnik platform on the right. Both
platforms have a consistent sensor suite comprising LIDAR,
Inertial Measurement Units (IMUs), wheel encoders, and GNSS
receivers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
3.2 Sensors mounted on a robot are depicted on the left. The robot’s
pose is determined by combining multiple sensory data using an
Extended Kalman Filter (EKF). The dark red points are from the
live LIDAR sensor that precisely overlaps the point cloud map,
indicating precise localization. . . . . . . . . . . . . . . . . . . . 22
3.3 This figure presents a collection of satellite imagery and first-
person photographs of the testing sites. . . . . . . . . . . . . . . 23
xviii
Chapter 1
Introduction
Autonomous robot navigation across uneven terrains holds significant importance,
with wide-ranging applications including exploration, mining, search and rescue,
agriculture, and forestry. While autonomous navigation within two-dimensional
environments is well-established with standard techniques based on 2D occupancy
grids, navigating outdoor uneven terrains introduces a spectrum of complex
challenges that necessitate further research. These challenges encompass the
requirement for the nontrivial robust terrain traversability analysis, which is
essential to prevent stability issues like tipping over or getting stuck. Additionally,
efficient motion planning is crucial to avoid collisions, and adaptability to dynamic
and changing environments is imperative. A further complication arises from the
fact that different robotic platforms possess varying traversability capabilities,
influenced by their specific design and locomotion mechanisms. Addressing these
multifaceted challenges is critical for advancing the field of robotic navigation in
complex outdoor environments.
This thesis presents a series of contributions designed to address a range of
problems pertinent to enhancing the autonomy of robots operating in uneven
terrains. The scope of our research encompasses various key areas, including
motion planning, terrain traversability analysis, and efficient environmental
representation techniques. Each of these areas is crucial for developing more
advanced and autonomous robotic systems capable of effectively navigating and
performing tasks in complex, uneven landscapes. Through our comprehensive
approach, we aim to significantly advance state-of-the-art robotic navigation
and control in challenging outdoor environments.
1
1. Introduction
2
Motivation
1.2 Motivation
This thesis addresses the complex challenges faced by mobile robots when
navigating autonomously in outdoor environments. While most of these
environments are not entirely flat, the variations in elevation are often not
drastic, allowing the assumption of a 2D environmental representation to suffice
in many scenarios. For instance, the 2D-based approaches are commonly applied
in the context of autonomous vehicles and delivery robots.
Figure 1.2: The figure illustrates the Thorvald II [1] robot’s capability to operate
in various rough uneven terrains. To fully utilize the robot’s strengths in these
terrains, it is crucial to have a reliable navigation strategy.
3
1. Introduction
one side of a building to the other. A 3D motion plan could illogically propose
a trajectory that involves jumping over the building, which is unfeasible for
ground-based robots.
Therefore, to address these realistic constraints, the majority of the
contributions in this thesis adopt a 2.5D environmental representation. This
approach acknowledges the three-dimensional nature of the terrain while
remaining grounded in the practical limitations of ground robot navigation.
The 2.5D model thus facilitates more feasible and effective motion planning in
complex outdoor environments.
Given the aforementioned challenges, it is imperative to move beyond the
oversimplified assumption that the ground is always effectively 2D. Acknowledging
this, this thesis presents several methodologies that more accurately perceive
the environment as it exists, minimizing assumptions like the world being 2D.
Discarding this 2D assumption introduces complexities in planning and
environmental representation. This transition inevitably leads to increased
demands on time, memory, and implementation, resulting in inefficiencies in
classical planning and navigation algorithms. These challenges necessitate the
development of more sophisticated and efficient approaches to ensure effective
navigation in more complex, three-dimensional terrains.
4
Contributions
outdoor and all robots under such dynamic constraints? Specifically, how can
these new approaches address the unique constraints of velocity, acceleration, and
torque to enhance the performance and efficiency of outdoor robotic systems?
RQ3: Current approaches to identifying traversable regions in uneven
outdoor terrains predominantly rely on general terrain analysis, focusing on
geometrical and visual characteristics. This approach often overlooks pivotal
factors such as the robot’s heading direction and the varied traversability
capabilities of different robotic systems, revealing a notable gap in the
literature and a shortcoming in traditional offline computation methods. Given
these limitations, is it possible to refine traversability estimation with more
comprehensive and nuanced methods? Specifically, how can these advanced
methods be designed to incorporate often-neglected elements, such as the robot’s
directional orientation and the distinct traversability profiles of various robotic
platforms, thereby enhancing the accuracy and practical relevance of terrain
analysis in robotic navigation?
The answers to these research questions are elaborated upon in a series of
conference and journal papers, the details of which will be comprehensively
provided in the subsequent sections of this thesis. These papers present our
findings and methodologies and contribute significantly to the broader discourse
in the field of autonomous outdoor terrain navigation.
1.4 Contributions
This thesis comprises six comprehensive papers, each constituting a self-contained
contribution to uneven outdoor terrain navigation. While the introduction section
of each paper explicitly outlines its contributions, in this segment of the thesis,
we aim to contextualize these contributions precisely. Our focus here is to
summarize the most notable novel contributions brought by the papers to the
field. The bullet points emphasizing the novel contribution are as follows:
5
1. Introduction
• We introduced the first neural network designed to process raw LIDAR and
IMU data for real-time, direction-aware traversability assessments using
spatiotemporal point cloud maps, negating the need for real-world data.
Additionally, we developed an efficient simulation pipeline for autonomous
data generation and labeling, leveraging robot experiences across varied
terrains to enhance neural network training with minimal human input.
6
The Papers and Thesis Outline
Figure 1.3: The figure provides a systematic enumeration of the papers, detailing
the specific research questions they address, and outlining the precise context
and focus of each paper.
7
1. Introduction
8
Chapter 2
Background
In this chapter, we aim to lay the foundational background and effectively
contextualize the scope of this thesis. It is essential to clarify that this chapter
is not intended to be a comprehensive review. Instead, it serves as a concise
overview of the literature and a refresher on key fields pertinent to our research.
Central among these areas are motion planning, environmental representations,
and traversability estimation, all of which are integral to the focus of this thesis.
By providing this foundational knowledge, we aim to equip readers with the
necessary context and understanding required to engage fully with the detailed
discussions and novel contributions that follow in the subsequent chapters of
this thesis.
Figure 2.1: The figure illustrates the working principles of the Probabilistic
RoadMap (PRM) [2], a widely recognized sampling-based motion planning
approach.
9
2. Background
10
Motion Planning
Figure 2.2: This figure illustrates the Rapidly-Exploring Random Tree (RRT)
algorithm, which employs a tree structure to systematically search the state
space and establish a path to the desired goal state. The inherent structure of
RRT, based on tree construction, renders it less suitable for re-planning scenarios,
as a new tree must be generated for each new set of start and goal pairs.
11
2. Background
Figure 2.3: This figure depicts kinodynamic motion planning using a modified
RRT algorithm. Each new sample must conform to the system’s motion
constraints in this process, often achieved through steering functions or
random forward propagation with varied control inputs. While this restricts
maneuverability, it’s crucial for ensuring smooth and feasible motion plans. In the
realm of kinodynamic planning, the paramount concern is the strict adherence
to both physical constraints and control input limitations.
12
Motion Planning
13
2. Background
14
Environmental Representation
The Navigation2 framework [25], along with its predecessor, the original ROS
navigation framework, is known for its widespread adoption in both academic
and industrial contexts [5]. These frameworks predominantly rely on a 2D
occupancy grid for environmental representation, which has proven practical
and widely utilized in various research projects. However, their reliance on 2D
occupancy grids limits their applicability in navigating uneven terrains.
Digital Elevation Maps (DEMs) represent an alternative approach to
environmental mapping. A DEM is essentially a grid map where each grid cell
contains elevation data for specific x and y coordinates, expressed as a function
z = f (x, y). Wermelinger et al. [26] expanded on this concept by incorporating
a traversability measure within each cell of the DEM. This adaptation provides
elevation information and evaluates the terrain’s navigability, enhancing the
utility of DEMs in complex environments. However, DEMs have limitations
in accurately capturing environments with layered structures. They fall short
in scenarios involving underpasses, such as beneath bridges, under dense tree
canopies, or overhanging structures, which are common in outdoor terrains.
This shortcoming is due to the inherent nature of DEMs, which typically
represent terrain as a single-layered surface, thus failing to account for the
vertical complexity found in such layered environments.
An advanced OcTree-based 3D occupancy method, OctoMap, was developed
by Hornung et al.[27]. OctoMap distinguishes itself by utilizing a probabilistic
approach to determine the occupancy values of geometric voxels. This method
is specifically designed to account for sensor noise and uncertainties prevalent
in real-world scenarios. Building on OctoMap, Wang et al.[28] implemented
a framework that leveraged OctoMap for navigating indoor mobile robots in
uneven environments. Their approach showcases the adaptability of OctoMap in
complex, real-world applications beyond its initial intended use cases. However,
the suitability of OctoMap for navigating uneven terrains raises certain concerns.
A key issue is its tendency to retain numerous cells that may not be relevant
to the navigation task. This retention leads to a suboptimal memory footprint,
unnecessarily increasing data storage. Furthermore, this excess can result
in heightened computational costs during collision-checking processes. Such
inefficiencies are particularly problematic in complex terrains, where optimal
memory usage and efficient collision detection are crucial for effective navigation.
As discussed in [29, 30], other methodologies have employed 3D meshes
derived from point cloud maps to represent uneven terrains. These approaches
leverage the edge connectivity of 3D meshes to devise navigation strategies
suitable for outdoor environments with complex structures. For instance, the
authors in [29] utilized a 3D mesh to construct the edge connectivity of such
environments. Subsequently, graph-based search algorithms are applied to
this connectivity framework to generate feasible navigation paths. Ruetz et
al. [30] also explored a 3D mesh-based framework in their research. While mesh
connectivity methods are designed to aim for robot independence, they primarily
focus on the geometrical aspects of environmental representation. This approach,
however, does not explicitly accommodate the kinematic and dynamic constraints
inherent to specific types of robots, particularly those equipped with Ackerman
15
2. Background
steering systems. Therefore, the purely geometrical nature of these methods may
limit their applicability in scenarios where such constraints play a critical role in
the robot’s navigation and maneuverability capabilities. The primary distinction
of this thesis’s contribution to environmental representation, particularly for
optimized navigation across uneven terrains, is its consideration of kinematic
and dynamic constraints. This focus sets it apart from previously proposed
mesh-based representations, which typically do not account for these crucial
factors as the mesh-based motion plans are geometric but not kinodynamic.
Previous methods in the field have often lacked a focused emphasis on the
interplay between environmental representation and motion planning, leading to
incompatible and redundant implementations of path-planning algorithms across
different environmental representations. In stark contrast, this thesis significantly
emphasizes the synergy between environmental representation and planning. We
argue that integrating these two components is essential for effective planning
and navigation in challenging uneven terrains.
16
Traversablity Estimation
Figure 2.5: Diverse traversability estimation methods for varied environments are
depicted. (a) Bayesian Generalized Kernel-based traversability estimates applied
to Digital Elevation Maps (DEM)[3], (b) image-based terrain classification[4],
(c) obstacle proximity-based 2D costmaps[5], and (d) our contribution of a deep
neural network-based traversability estimation on 3D point clouds.
17
2. Background
18
Emerging Approaches in Uneven Terrain Navigation
19
Chapter 3
Methodological Framework
3.1 Explanation of Research Design
This thesis employs a hybrid methodology, combining both quantitative and
qualitative research methods, to analyze the obtained results effectively. A
significant portion of the problems addressed in this thesis are quantifiable. For
instance, in our aim to optimize the efficiency and quality of motion plans across
uneven terrains, we evaluated various path characteristics, including path length,
curvature, and accumulative control duration.
However, there are scenarios where quantitative evaluation alone is insufficient.
A notable example is the estimation of traversability for uneven terrains.
Currently, there is no established dataset or benchmark for non-binary
traversability estimation, making quantitative comparisons with existing methods
challenging. This difficulty is compounded by the influence of a robot’s
locomotion capabilities on the results. To address this limitation, this thesis also
incorporates qualitative evaluations in the contributed papers. We conducted
numerous field experiments, both in simulations and in real-world scenarios, to
assess the effectiveness of our approaches in practical settings.
21
3. Methodological Framework
Figure 3.1: The figure displays two robotic platforms: the Thorvald II robot
on the left and the Robotnik platform on the right. Both platforms have a
consistent sensor suite comprising LIDAR, Inertial Measurement Units (IMUs),
wheel encoders, and GNSS receivers.
Figure 3.2: Sensors mounted on a robot are depicted on the left. The robot’s pose
is determined by combining multiple sensory data using an Extended Kalman
Filter (EKF). The dark red points are from the live LIDAR sensor that precisely
overlaps the point cloud map, indicating precise localization.
22
Environments and Simulations
the methods employed for its localization. The integration of GNSS data into
our localization framework is dependent on weather conditions. Primarily,
localization is achieved through the fusion of data from the Inertial Measurement
Unit (IMU), LIDAR odometry, and wheel odometry using an Extended Kalman
Filter (EKF). The inclusion of GNSS data is optional and is utilized based on
its feasibility under varying weather scenarios.
Figure 3.3: This figure presents a collection of satellite imagery and first-person
photographs of the testing sites.
The experiments conducted as part of this research were carried out across
various seasons, encompassing a range of weather conditions. To visually
represent the testing environments, Figure 3.3 includes satellite imagery from
Google Maps and first-person view photographs.
In the course of this thesis, we extensively utilized simulation tools for the
development and testing of our algorithms. The primary simulators employed
were the Gazebo simulator [54] and a Unity game engine-based simulator. The
Gazebo simulator is particularly beneficial due to its seamless integration with
ROS [55] and its capability to simulate various sensor plugins, which facilitates
an efficient transition from simulation to real-world application. However, when
high-fidelity simulation was required, we found that the Unity game engine-based
23
3. Methodological Framework
24
Software Tools
3.6 Limitations
Despite the significant contributions of this thesis to the field of uneven terrain
navigation, it is important to acknowledge certain limitations. The most notable
of these is the lack of consideration for robot-terrain interactions. Ideally, this
aspect could be addressed through the estimation of soil parameters using a
range of sensors, including RGB cameras. However, our primary reliance on
LIDAR, which provides solely geometric measurements, precluded the reliable
prediction of soil parameters. This limitation is particularly relevant in scenarios
involving soft soil, where the absence of soil characteristic data could increase
the risk of the robot becoming stuck.
Another limitation in our research pertains to the initial reliance on pre-built
maps. Although the necessity of moving beyond the constraint of pre-existing
maps became apparent, this realization did not emerge until the later stages of
our research. The initial dependence on pre-built maps potentially restricted the
versatility and adaptability of our navigation algorithms in dynamically changing
or previously unmapped environments.
The absence of additional sensors, such as RGB cameras, in our research
constitutes another limitation. The integration of such sensors could have
significantly enhanced the estimation of soil parameters and the accuracy of
traversability assessments, especially in challenging environments like tall grass
areas. While the incorporation of these sensors would have potentially improved
25
3. Methodological Framework
26
Chapter 4
Paper Summaries
27
4. Paper Summaries
28
Paper IV: CostTrust: A Fast-Exploring, Iteratively Expanding Frontier-Based
Kinodynamic Motion Planner
improved to an extent. The motivation is to go beyond the improvements of
Paper II. Paper III introduces a novel approach combining supervoxel-based
point cloud representations for terrain mapping and a Nonlinear Model Predictive
Controller (NMPC) for motion planning. This method aims to provide more
precise, adaptable, and efficient navigation in varied and challenging terrains by
bypassing conventional sampling-based motion planners.
The approach used in this paper is distinct from existing works in its
integration of supervoxel-based point cloud representations and Nonlinear Model
Predictive Control (NMPC). Supervoxels represent an efficient method for
clustering point clouds while respecting object boundaries. This characteristic
renders them particularly suitable for uneven terrain navigation. The key
advantage of super voxels lies in their ability to construct a detailed connectivity
mesh over extensive point clouds efficiently. This mesh is a foundational structure
that various graph algorithms can leverage to determine optimal navigation paths.
Using this mesh, planners can effectively create courses from a start supervoxel
to a goal supervoxel, navigating through complex terrains with a higher level of
granularity and precision. The utilization of supervoxels, therefore, significantly
enhances the capability of navigation systems to process and interpret large-
scale spatial data in uneven terrains. NMPC, used for obstacle avoidance and
motion plan tracking, offers kinodynamic feasibility in resulting movements, a
notable advancement over simpler, geometry-based planners. This combination
allows for more accurate, dynamic, and efficient navigation in uneven outdoor
environments, setting it apart from existing methodologies.
The approach was evaluated through extensive simulations and real-world
tests. These evaluations focused on the method’s performance in accurately
and efficiently navigating uneven terrains. The results demonstrated a notable
improvement over existing navigation methods, particularly regarding path
efficiency and execution time. The conclusions underscore the effectiveness
of the combined use of supervoxel-based terrain representation and NMPC for
autonomous navigation in challenging 3D environments, highlighting its potential
to enhance robotic mobility in real-world applications.
The research detailed in this paper garnered attention when presented at
a conference, leading to an invitation from the conference chairs to develop
an extended version for publication in a reputable journal. This invitation
reflects the recognized potential and impact of the research within the academic
community, underscoring its contribution to the field.
29
4. Paper Summaries
30
Paper VI: Enabling Robot Autonomy through a Modular Software Framework
which utilizes data generated from the robot’s movement across high-fidelity
simulations. This process is supported by a novel pipeline for data generation and
labeling, enhancing the quality and relevance of the training data. Once trained,
the neural network, combined with the sensory input, provides continuous and
reliable assessments of terrain traversability. This synergistic use of advanced
neural network techniques and simulated training data effectively bridges the
gap between simulated environments and the complexities of real-world terrain
navigation, offering a significant advancement in the field.
Figure 4.1: This figure presents the components constructing the approach in
Paper V.
31
4. Paper Summaries
32
Chapter 5
Discussion
In this chapter, we articulate several critical conclusions derived from the
research conducted within this thesis. These conclusions are informed by a
thorough interpretation of our findings, substantiated by the experimentation
and validation of the methods we proposed. The core focus areas of our
research—motion planning, environmental representations, and traversability
estimation—have been instrumental in facilitating robust navigation over uneven
terrains. These fundamental themes have not only deepened our understanding
of the inherent challenges in autonomous outdoor navigation but have also
guided us toward innovative solutions. The insights gained have led to a series
of focused discussions on a variety of relevant topics, which we will explore in
the subsequent sections here.
33
5. Discussion
34
Going Beyond Geometrical or Visual-based Traversability Estimation
35
5. Discussion
representations. To mitigate this gap and enhance the practicality and reusability
of the algorithms presented in this thesis, we have developed ’vox_nav.’ This
uneven terrain navigation framework is designed with a plugin-based architecture,
accommodating a wide range of motion planners, controllers, and environmental
representations. ’vox_nav’ is versatile, supporting motion plans in state spaces
such as SE3, SE2, Dubins, Reeds-Sheep, and Elevation (ours). The framework
includes 2 map plugins, 5 motion planner plugins, and 3 controller plugins,
and is compatible with the majority of state-of-the-art motion planners from
OMPL. While ’vox_nav’ has been rigorously tested on the robot platforms
and environments depicted in Figure 3.3, this presents both a strength and a
limitation. Its effectiveness has been validated on real systems, yet for it to
be established as a standard in the field, more extensive testing across various
robotic platforms and in a wider array of environments is necessary. This
broad-based validation is essential to confirm its adaptability and effectiveness
in diverse navigation scenarios.
36
Chapter 6
37
6. Conclusion and Future Work
38
Future Directions
39
Papers
I
Paper I
Benchmark of Sampling-Based
Optimizing Planners for Outdoor
Robot Navigation
43
Benchmark of Sampling-Based Optimizing
Planners for Outdoor Robot Navigation
1 Introduction
45
Fig. 1. Figure illustrates representative planning results from 16 distinct SBO planners
in a real-world setting for a randomly generated planning problem. The dark blue
points indicate various obstacles such as buildings, the colored paths generated by
each planner are displayed with sequential arrows.
46
– We use an automated planning problem generation method that can better
capture the behavior of probabilistic SBO planners by observing results from
many different planning problems.
– We present a performance analysis of state-of-the-art SBO planners with
realistic planning times in real and simulated 3D environments.
2 Related Work
The number of SBO planners has increased steadily in the past decade. As this
is the case, several benchmarking works have been proposed to investigate the
performances of each planner in-depth for specific scenarios. Meijer et al. [19],
investigated various OMPL planners for grasping execution tasks on different
robotic arms. Based on critics such as; the number of exactly solved runs, path
length, and computation times, recommendations were made for high-performing
planners depending on the type of grasp execution, e.g., pick or place. Jedrze-
jczyk et al. [13] presented an empirical work focusing on benchmarking of vari-
ous planners within MoveIt! framework [3]. The particular focus of the planning
problem was related to a robotic harvester arm. The task was picking toma-
toes in a simulated greenhouse environment. The authors used time of planning,
time of generated plan simplification, and time plan execution as metrics for
performance evaluation. The results for metrics are provided as average however
it is not clear how many runs were performed to obtain the average values for
metrics. Magalhaes et al. [18] benchmarked OMPL planners on a six-degree of
freedom manipulator in a simulated vineyard for pruning tasks. The authors
use the success rate, path length, planning time, and path clearance as their
metrics. The authors conclude that most of the evaluated planners in OMPL
perform poorly for demanding pruning tasks in agricultural setups.
Commonly, the above works focused on robotic manipulation scenarios; they
used path length, success rate, and execution time as evaluation metrics. In our
benchmark, we exclude the execution time from the metrics. We solely bench-
mark optimizing planners as we believe that the quality of generated path for an
outdoor robot is crucial. The optimizing planners utilize all allowed time to get
the most optimal path. Therefore time consumed by each planner is the same.
We further discuss on this in Sec. 3.1.
More recent work by Heiden et al. [10] introduced a benchmarking tool for
path planning of mobile robots. The authors present the Bench-MR platform for
convenient performance evaluation of different sampling-based and grid-based
path planning algorithms. To showcase the features of the proposed tool, the
authors provide sample benchmarks from several planning algorithms. However,
there is no large-scale comparison of different planners.
Moll et al. introduced a generic software infrastructure wrapping around the
OMPL interface in [20], which allows straightforward comparison of OMPL plan-
ners considering different performance metrics. The tool has introduced many
performance metrics making it convenient to evaluate performance for additional
requirements. We have used this benchmarking tool to carry out experiments.
47
As can be seen from highlighted summaries above, the existing works on
sampling-based planning benchmarking have focused on benchmarking planners
for robotic manipulation. We also realize a lack of methodology that can re-
veal more profound performance differences of probabilistic planners that are
being evaluated. We deal with these shortcomings by introducing a method (see
Sec. 3.2). We also use an automated problem generation on the real and simu-
lated environment that can generate a large number of valid planning problems.
Both of these allow us to gain a clear picture of SBO planners’ performance in
the setting of unstructured outdoor path planning.
3 Problem Formulation
θ(0) = Ss , θ(1) = Sg ,
(1)
θ(x) ∈ Sf ree , ∀x ∈ [0, 1]
where c(θ∗) is an optimized path W.R.T given cost function c, which assigns
a strictly positive cost to collision-free paths.
In our benchmark, planners will minimize a cost function c, where the cost is
determined by the path’s length and obstacle clearance properties. The obstacle
clearance objective aims to steer the path away from obstacles. A weighted
combination of these two objectives is used for determining cost c. The weights
were set to 10 and 1 for path length and clearance objectives, respectively.
48
Planner Name Short Description
PRM* [14] Number of nodes to be connected are not fixed but is determined auto-
matically based on coverage of the state space.
LazyPRM* [2] As in PRM*, the number of nodes to be connected is not fixed but is
determined automatically based on coverage of the state space, differently,
the edges of nodes are not immediately checked for the collision but are
checked only when a better path to the goal is found.
RRT* [14] An asymptotically-optimal version of RRT that is guaranteed to converge
to an optimal solution if one exists. The optimality refers to a specified
cost function, commonly a function that determines path length.
RRTsharp [1] Bases on Rapidly-exploring Random Graphs(RRG), it improves the con-
vergence rate by keeping stationary information of sampled vertices; by
doing this, RRTsharp can achieve better estimations of potential optimal
paths.
RRTX [22] A variant of RRT*, the nodes in the tree are rewired only if the cost is
being reduced by a specified margin(known as epsilon), this improves the
convergence of solution.
InformedRRT* [8] Narrow downs tree exploration space to boundaries of a prolate hyper-
spheroid, a prolate hyperspheroid is n-dimensional symmetric ellipse that
is constructed around current optimal plan. As the search space is reduced
to region around optimal plan, the converge rate improves significantly.
BIT* [7] Uses a unified structure that is composed of both graph and trees, this
leads to a few advantages such as being able to use heuristics like one in
A* [9] to prioritize tree expansions towards the goal with high-quality
paths.
ABIT* [27] An extension of BIT*, Advanced BIT*(ABIT*), combines an advanced
graph-based search with an RRT* like planner for faster convergence rate.
AIT* [26] Another variant of BIT*, uses an asymmetric bidirectional search where
both of the searches continuously inform each other, this leads to quicker
initial solutions.
LBTRRT [24] Allows for a continuous interpolation between RRT* and RRG algorithms
for high-quality solutions (comparable to RRG and RRT*) with little
running-time.
SST [16] An enhancement of RRT* which tries to address optimal kinodynamic
planning when there is no access to two-point boundary value prob-
lem(BVP) solver, BVP is required to get the desired property for systems
with dynamics.
SPARS [5] Built on top of PRM*, suggests finite-sized nodes within the roadmap
which might provide near-optimality properties.
SPARStwo [4] Extends on SPARS. Sparser graphs can provide solutions meeting near-
optimality requirements. Reduced resources are used to solve the plan-
ning problem, SPARStwo also decreases the rate of node addition into the
roadmap.
FMT* [12] Utilizes recursive dynamic programming to form a tree from probabilisti-
cally sampled states.
CFOREST [21] Wraps a parallelized framework around single-query SBO planners, e.g.,
RRT*. Separate trees are grown from different threads while these threads
communicate with each other every time a thread finds a better path.
Other threads are also informed, leading to a significant speedup of con-
vergence. The result paths are shorter and smoother.
AnytimePartShortening(APS) [17] A generic method that wraps around one or more planner. The method
applies an iterative path simplification, shortening process. It is capable
of generating a hybrid path if they are to yield shorter solutions.
There are various metrics to assess the quality of resulting paths, such as path
length, total execution time, availability of exact solution, etc. SBO planner’s
execution time is quite ambiguous and needs an explanation here. SBO planers
use a timeout parameter; initially, some of the time is used to construct an
49
initial valid plan (if one exists). After a valid plan is found, the remaining time
is utilized for getting lower-cost paths, at the end, all given time is used. Since
in this work we are interested in optimal paths, we cannot set execution time
as a metric since SBO planners utilize all given time to construct the optimal
plan. In experiments, all planner’s timeout were set to 5 seconds. Metrics of
evaluation and their short descriptions are given as follows.
– Average Solve Rate: Number of exact solutions divided by the total number
of problems.
– Solution Length: Real length of the resulting solution path.
– Solution Smoothness: The smoothness of the path is calculated by the sum
of angles between consecutive path segments. Assume a path as sequence
of segments where three states within the path form two segments. Using
Pythagoras generalized theorem, one can calculate angles between consec-
utive segments for all intermediate states. Each angle is normalized by the
summed lengths of two segments. Summation of all normalized angles gives
final smoothness value for the path. The lower this value the smoother the
path is. For instance, a perfectly straight line will result in a smoothness
value of zero.
The planners subject to evaluation are not deterministic but probabilistic. There-
fore from a single run, it is not possible to measure the performance of a planner.
Hence we run planners on a high number of randomly generated planning prob-
lems, we then use normal distributions of results from each run to determine
and rank the performances of planners.
Two-Sample Z-Test is a statistical method to evaluate the differences between
two normal distributions. We use a Two-Sample Z-Test to find out whether the
performance of a planner differs significantly compared to other planners, this
helps us to obtain the proportion of difference from two normal distribution
metrics. Additionally, we can visualize the performance of each planner against
every other planner, as in Fig. 2. The formal equations of the Two-Sample Z-Test
are written as follows;
s
σ12 σ2
s= + 2,
N1 N2
µ1 − µ2
z= ,
s
x − µ1 (3)
1 0.5( )
f (x) = √ e σ1 ,
σ1 2π
Z z
1
P (X ≤ z) = √ f (x), dx
σ1 2π −∞
50
Where σ1 and σ2 are standard deviations, N1 and N2 are sample sizes and
µ1 and µ2 are mean values of normal distributions for compared planners. Fi-
nally, P (X ≤ z), also named the p-value, determines the area under the normal
distribution and assesses the significance of change between the two compared
distributions. We perform the procedure of Z-Test for each planner against every
other planner(including the planner itself); if the p-value for this comparison is
below a threshold a, the result is 1 and -1 if otherwise. This results in a 2D skew-
symmetric matrix with the size of the number of planners. The values are in the
range of [−1, 1], the column-wise summation of positive values in this matrix
tells how many planners have been outperformed by the planner in each col-
umn(see Fig. 2). We acquire this matrix for both smoothness, and path lengths
metrics since results from these two are presented with distributions. Consider-
ing multiple non-deterministic metric values, it is difficult to withdraw an overall
performance score. Therefore we consider weighting the metrics and summing
them respectively.
4 Experiments
51
collision check status between a 3D box representing the robot’s body and Oc-
toMap at start and goal poses. To make sure that an actual valid plan exists
between randomly generated start and goal poses, a planning request was made
from start to goal pose with a sufficient amount of timeout (10 seconds). The
actual benchmarking was initialized only if there existed a valid plan from ran-
dom start to random goal pose, we repeat this process until we achieve 500 valid
pairs of start and goal poses that define 500 unique planning problems.
The planners are evaluated in Dubin [6] state space to meet the kinematic
structure of the Ackerman steering ground robot. Dubin’s state-space particu-
larly suits forward-only motions, adding more challenge to the planning problem.
Dubin’s state space ensures kinematic constraints are complied with by providing
path segments that lay on curves in between start and goal poses. We must note
that each planner’s performance will be reasonable and similar if given enough
time. Therefore we used a strict timeout of 5 seconds to challenge and reveal
performance differences among SBO planners. We generated 500 random plan-
ning problems with the minimum Euclidean distance between start and goal pose
being 30 meters. To reduce bias related to the probabilistic nature of planners,
we requested each planner to solve each problem twice, making the total number
of runs 1000 for each planner. Since we evaluated 16 SBO planners, 16000 runs
were performed, each taking 5 seconds, resulting in a total computation time of
22.22 hours on a laptop with Intel Core i7-6700HQ 2600 MHz CPU with eight
cores and 16GB of memory. All numbers reported in the following sections are
result of 1000 runs.
Fig. 2. A matrix depicting whether one can state that, a planner’s performance in a
column produced significantly shorter paths than a planner in a row with the confidence
of 100 ∗ (1 − a)%. The significance threshold a is set to 0.05. The normalized column-
wise summation of matrix values is obtained on the vertical axis for each planner. In
this case, the APS planner achieves the best score (1.0).
52
In the following, we will interpret results metric-wise.
53
is more significant for a mobile robot, still, it is possible to use different weights
and recalculate final scores with given information in Table 2 and Table 3. As
seen in the vertical label of Fig. 2, the score for each planner has been calculated
by normalized column-wise summation, these accumulative scores are created
for path length and smoothness metrics, which are denoted with Zlength and
Zsmooth respectively.
Table 2. Means and deviations of acquired path length and smoothness metric from
1000 runs. A lower mean value indicates that planners achieve better performance for
both metrics.
Table 3. A table presenting the average solve rate and overall performance scores of
planners calculated by the proposed method in Sec. 3.2. The planners are ranked with
highest final score values.
54
With acquired Zlength and Zsmooth , Using Eq. 4, we achieve the final per-
formance scores presented in Table 3. Note that the values present in Table 3
are achieved by comparing planners with each other, a low value does not nec-
essarily mean a planner has failed to produce the desired result, but it instead
means that, the planner has performed worse than other planners with greater
final score values. Our primary focus in this paper was on the problem of mobile
robot path planning with kinematic constraints. Our recommendation for users
is to consult to the final scores of planners given in Table 3. Based on these
results, one can judge the most suitable planners for mobile robot navigation on
outdoor terrains. We were not able to acquire any exact solution from FMT*
planner. It is unclear what might have caused this. More investigation for this
particular planner is needed.
5 Conclusion
We evaluated all SBO planners in the OMPL with a systematic approach. We run
each planner on 500 different planning problems twice, resulting in 1000 results
that reveal each SBO planner’s behavior. The evaluation setup was mainly aimed
at the planning of outdoor mobile robots. The proposed evaluation methodology
identifies the most suitable planners for planning in unstructured terrains. With
provided probability distributions of metrics and their combinations, the method
revealed performance differences among planners numerically. The APS planner
was found to be the most consistent planner along with all metrics. Therefore we
recommend the readers’ APS planner, as it is the best performing SBO planner
for mobile robot path planning. This paper does not propose a methodology to
find optimal configuration parameters for each evaluated planner automatically.
There are hundreds of parameters for all planners being assessed, making it
challenging to find optimal values for each parameter. Therefore all planners
were set for experiments with their default parameters. We plan to investigate
parameter tuning for each planner methodologically in future works.
References
55
6. L. E. Dubins. On curves of minimal length with a constraint on average curvature,
and with prescribed initial and terminal positions and tangents. American Journal
of Mathematics, 79(3):497–516, 1957.
7. J. D. Gammell, T. D. Barfoot, and S. S. Srinivasa. Batch informed trees (bit*):
Informed asymptotically optimal anytime search. The International Journal of
Robotics Research, 39(5):543–567, 2020.
8. J. D. Gammell, S. S. Srinivasa, and T. D. Barfoot. Informed rrt*: Optimal incre-
mental path planning focused through an admissible ellipsoidal heuristic. CoRR,
abs/1404.2334, 2014.
9. P. Hart, N. Nilsson, and B. Raphael. A formal basis for the heuristic determination
of minimum cost paths. IEEE Transactions on Systems Science and Cybernetics,
4(2):100–107, 1968.
10. E. Heiden, L. Palmieri, L. Bruns, K. O. Arras, G. S. Sukhatme, and S. Koenig.
Bench-mr: A motion planning benchmark for wheeled mobile robots. IEEE
Robotics and Automation Letters, 6(3):4536–4543, 2021.
11. A. Hornung, K. M. Wurm, M. Bennewitz, C. Stachniss, and W. Burgard. Octomap:
An efficient probabilistic 3d mapping framework based on octrees. Autonomous
Robots, 2013.
12. L. Janson and M. Pavone. Fast marching trees: a fast marching sampling-based
method for optimal motion planning in many dimensions - extended version.
CoRR, abs/1306.3532, 2013.
13. F. Jedrzejczyk, J. Bajer, G. Gawdzik, J. Glówka, and A. Sprońska. Benchmark
and analysis of path planning algorithms of “ros moveit!” for pick and place task
in tomato harvesting. In R. Szewczyk, C. Zieliński, and M. Kaliczyńska, editors,
Automation 2021: Recent Achievements in Automation, Robotics and Measurement
Techniques, pages 272–284, Cham, 2021. Springer International Publishing.
14. S. Karaman and E. Frazzoli. Sampling-based algorithms for optimal motion plan-
ning. CoRR, abs/1105.1186, 2011.
15. N. Koenig and A. Howard. Design and use paradigms for gazebo, an open-source
multi-robot simulator. In 2004 IEEE/RSJ International Conference on Intelligent
Robots and Systems (IROS) (IEEE Cat. No.04CH37566), volume 3, pages 2149–
2154 vol.3, 2004.
16. Y. Li, Z. Littlefield, and K. E. Bekris. Asymptotically optimal sampling-based
kinodynamic planning, 2016.
17. R. Luna, I. A. Şucan, M. Moll, and L. E. Kavraki. Anytime solution optimization
for sampling-based motion planning. In 2013 IEEE International Conference on
Robotics and Automation, pages 5068–5074, 2013.
18. S. Magalhães, F. Neves Dos Santos, R. Martins, L. Rocha, and J. Brito. Path
Planning Algorithms Benchmarking for Grapevines Pruning and Monitoring, pages
295–306. 08 2019.
19. J. Meijer, Q. Lei, and M. Wisse. Performance study of single-query motion plan-
ning for grasp execution using various manipulators. In 2017 18th International
Conference on Advanced Robotics (ICAR), pages 450–457, 2017.
20. M. Moll, I. A. Şucan, and L. E. Kavraki. Benchmarking motion planning algo-
rithms: An extensible infrastructure for analysis and visualization. IEEE Robotics
& Automation Magazine, 22(3):96–102, September 2015.
21. M. Otte and N. Correll. C-forest: Parallel shortest path planning with superlinear
speedup. IEEE Transactions on Robotics, 29(3):798–806, 2013.
22. M. Otte and E. Frazzoli. Rrtx: Real-time motion planning/replanning for environ-
ments with unpredictable obstacles. In WAFR, 2014.
56
23. J. Pan, S. Chitta, and D. Manocha. Fcl: A general purpose library for collision
and proximity queries. In 2012 IEEE International Conference on Robotics and
Automation, pages 3859–3866, 2012.
24. O. Salzman and D. Halperin. Asymptotically near-optimal RRT for fast, high-
quality, motion planning. CoRR, abs/1308.0189, 2013.
25. T. Shan, B. Englot, D. Meyers, W. Wang, C. Ratti, and R. Daniela. Lio-sam:
Tightly-coupled lidar inertial odometry via smoothing and mapping. In IEEE/RSJ
International Conference on Intelligent Robots and Systems (IROS), pages 5135–
5142. IEEE, 2020.
26. M. P. Strub and J. D. Gammell. Adaptively Informed Trees (AIT*): Fast asymp-
totically optimal path planning through adaptive heuristics. In Proceedings of
the IEEE International Conference on Robotics and Automation (ICRA), pages
3191–3198, 31 May – 31 Aug. 2020.
27. M. P. Strub and J. D. Gammell. Advanced bit* (abit*): Sampling-based planning
with advanced graph-search techniques. 2020 IEEE International Conference on
Robotics and Automation (ICRA), May 2020.
57
Paper II
Elevation State-Space:
Surfel-Based Navigation in Uneven
Environments for Mobile Robots II
Fetullah Atas, Grzegorz Cielniak, Lars Grimstad
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
10.1109/IROS47612.2022.9981647
59
Elevation State-Space: Surfel-Based Navigation in Uneven
Environments for Mobile Robots
Fetullah Atas 1 Grzegorz Cielniak 2 Lars Grimstad 1
Abstract—This paper introduces a new method for robot representations with improvements over existing methods. In
motion planning and navigation in uneven environments through earlier approaches, such as in [11], [12], the standard planning
a surfel representation of underlying point clouds. The proposed algorithms were tailored specifically to work with a particular
method addresses the shortcomings of state-of-the-art navigation
methods by incorporating both kinematic and physical con- environment representation requiring additional integration
straints of a robot with standard motion planning algorithms effort and limited portability between the representations. This
(e.g., those from the Open Motion Planning Library), thus caused a considerable effort to reimplement or adapt differ-
enabling efficient sampling-based planners for challenging uneven ent planning algorithms to work with proposed environment
terrain navigation on raw point cloud maps. Unlike techniques representations. This work proposes a new state-space design
based on Digital Elevation Maps (DEMs), our novel surfel-based
state-space formulation and implementation are based on raw and implementation to address these limitations. We built
point cloud maps, allowing for the modeling of overlapping our method on top of raw point cloud maps. Point clouds
surfaces such as bridges, piers, and tunnels. Experimental results are directly derived from distance measurements provided by
demonstrate the robustness of the proposed method for robot various sensors (e.g., RGBD cameras, LIDARs, etc.). Point
navigation in real and simulated unstructured environments. The clouds are not volumetric structures that could convey the
proposed approach also optimizes planners’ performances by
boosting their success rates up to 5x for challenging unstructured occupancy information of the environment. To convey the
terrain planning and navigation, thanks to our surfel-based occupancy information of the environment, we create an
approach’s robot constraint-aware sampling strategy. Finally, we occupancy volume through surfels, which are introduced as
provide an open-source implementation of the proposed method point primitives to simplify the representation of a dense point
to benefit the robotics community. cloud in computer graphics [10]. The surfels hold a cost
Index Terms—Unstructured Terrain Navigation, Outdoor
Robotics, Planning. property which indicates a traversability measure of the region
they represent within the dense point cloud. The cost property
I. I NTRODUCTION of surfels is also projected to occupancy volume, see Fig. 2.
The occupancy volume is built where it matters: the traversable
There is an increasing demand for robots in uneven en- regions on the surface of environments, which is advantageous
vironments such as construction, rural, disaster, or planetary since a ground robot’s area of interest is primarily the ground’s
discovery sites. Current methods for autonomous navigation in surface. We limit the occupancy volume used for planning
these environments use various environment representations, to only relevant locations (ground’s surface) without loss
including 2D grid maps [7], [8], Digital Elevation Maps of ability to model complex structures such as tunnels or
(DEMs) [2], [20], 3D occupancy grids , [19] and meshes [12], piers, which DEM-based methods cannot model, see Fig. 1.
[14]. Finally, together with the cost property, this volume is called
Despite the variety of proposed methods, some limitations elevation state-space, a dedicated state-space for planning and
call for further improvements. A few examples in this regard navigation in unstructured terrains. We run experiments in real
are limitations from the environment representation (as in and simulated unstructured environments to validate our ap-
DEMs for not being able to model some structures such as proach. Through our surfel-based state-space implementation,
tunnels, overhangs, etc.) and the curse of dimensionality in we enable seamless integration with the standard planning
full 3D occupancy grids. There is also a lack of convergence techniques such as those from the OMPL [18] library for
towards a standard environment representation for 3D naviga- unstructured terrain planning and navigation.
tion. 2D grid maps are widely adopted for structured terrain
In summary, our key contributions are:
planning and navigation in academia and industry. As we
explore in Sec. II, there is no widely adopted representation for • A configurable implementation interface that can account
navigation in uneven terrains, leading to a need for alternative for different robot types with various kinematic and
1 Norwegian University of Life Sciences {fetullah.atas, physical constraints, we run experiments on Ackerman
lars.grimstad} @nmbu.no and omnidirectional robot types.
2 University of Lincoln gcielniak@lincoln.ac.uk • We demonstrate that both in real and simulated uneven
61
values of geometric voxels are determined by a probabilistic
approach to account for sensor noise and uncertainties. Wang
et al. [19] presented a framework that indirectly used OctoMap
to navigate an indoor mobile robot in an unstructured envi-
ronment. First, an OctoMap of the domain is created. The
generated OctoMap is then projected to several layers with
various resolutions, and differences of the gained layers are
used to determine the traversability of the environment map.
The planning and navigation are performed on top of this
2D traversability map. The method was developed and tested
only for indoor unstructured environments; hence there was
no incorporation of terrain roughness or ground clearance of
the robot chassis.
Some other methods, such as , [12], [14] have used 3D
meshes generated from 3D point cloud maps of the environ-
ment to represent unstructured terrains. They used acquired
3D mesh edge connectivity to develop a navigation strategy
for unstructured outdoor environments. Authors in [12] use
such a 3D mesh to build the edge connectivity of a 3D envi-
Fig. 1: An example use-case for navigation in unstructured ronment. Graph-based search algorithms are used on this edge
terrain. PRM* planner [5], operating in the proposed state- connectivity to create feasible paths for navigation. Although
space, returns a valid path (in cyan color) by passing under the method is aimed to be robot-independent, there was no
the pier. direct indication for consideration of kinematic constraints,
such as Ackerman-type. Ruetz et al. [14] also presented a 3D
mesh-based framework. However, the acquired 3D mesh was
terrains, the proposed approach leads to better results than only based on visible point clouds from the live LIDAR sensor,
standard navigation frameworks (i.e., Navigation2 [7]). limiting the method for global navigation in unstructured
• An open-source implementation of our software 1 which environments.
enables the use of standard motion planning libraries (i.e., Although initially not intended for unstructured terrain
OMPL) for unstructured terrain planning and navigation, navigation, Navigation2 [7] and its predecessor, the original
removing the need for additional efforts to integrate ROS navigation framework is worth mentioning due to its
representation-dependant path planners. popularity in academia and industry [8]. These methods, which
II. R ELATED W ORK rely on a 2D occupancy grid for environment representation,
are practical and have been used by numerous researchers.
The previous work in mobile robot navigation in unstruc-
However, because these approaches are based on a 2D occu-
tured environments used a variety of map representations,
pancy grid for environment representation, their use is limited
including digital elevation maps (DEMs), 2D-3D occupancy
for unstructured terrain.
grids, and meshes. We take a brief look into recent methods
There was less emphasis on the relationship between envi-
and explain their characteristics in the following.
A digital elevation map (DEM) is a generic grid map where ronment representation and planning in the existing methods,
each cell within the grid holds the elevation information of resulting in incompatible, duplicate implementations of path
corresponding x and y locations with a function like z = planning algorithms among used environment representations.
f (x, y). Wermelinger et al. [20] used a special type of DEM In contrast, our approach places a greater emphasis on the
where, besides elevation, a traversability measure of terrain relationship between environment representation and planning.
was also present within each cell. The authors referred to We argue that seamless integration between the environment
the new form of this map as a traversability map. They used representation and planning is crucial for challenging unstruc-
this map for planning paths for a legged robot in unstructured tured terrain planning and navigation. Our approach integrates
terrain. Similarly, Fan et al. [2] used a layered grid map where environment representation and planning tightly through our
the elevation and risk factors associated with each position surfel-based elevation state-space, eliminating the need to
were present. They then used this layered 2D grid map for risk- tailor planners to a specific environment representation. As
aware planning and navigation in unstructured environments. we show in Sec. IV, this helps us navigate unstructured
These methods rely on an elevation map with limitations such environments effortlessly as we enable access to dozens of
as not modeling overhangs, tunnels, etc. efficient sampling-based planners (e.g., from OMPL).
An OcTree-based 3D occupancy method, OctoMap, was III. A PPROACH
introduced by Hornung et al. [4]. In OctoMap, occupancy
Surfel is an abbreviation for “surface element” [10], which
1 https://github.com/NMBURobotics/vox nav is introduced as point primitives to simplify the rendering or
62
shading of an object represented by a dense point cloud. For following cost critics are calculated for each surfel in the surfel
a ground robot navigating in an unstructured terrain where the set SO .
environment is represented by such a dense point cloud, the
area of interest is the surface of the point cloud map. We use ts = max(| arctan(nsx , nsz )|, | arctan(nsy , nsz )|), (2)
these point primitives (surfels) to simplify the representation
of terrain features derived from points, see Sec. III-A. In where arctan(nsx , nsz ) and arctan(nsy , nsz ) define the roll
the following, we describe the extraction of traversability and pitch of the surfel respectively.
information and embedding of this information into elevation j
X APx + BPy + CPz + D
state-space. pds = 1/j | √ |, (3)
An environment is represented with point clouds consisting k=1
A2 + B 2 + C 2
of n points. We refer to this point set as the original point
set PO . We perform a uniform sampling to discretize this The average point deviation from the surfel plane pds is
environment and distribute the cost values associated with determined by the sum of all point distances to the surfel
local patches [15] operation on the original point cloud set plane divided by the number of points within the surfel.
PO . Uniform sampling is performed with a voxel size of ds ,
resulting in a point cloud subset of j points. We refer to hds = max(Pz1 , .., Pzj ) − min(Pz1 , ..., Pzj ), (4)
this point set as a sampled point set PS . We extract local
The height of a point is its value along the z-axis, with Pzj
terrain features such as roughness, tilt, the maximum height
being the z value of jth point. hds is the maximum height
of sampled points, and their geometrical relationship with
difference among points of the surfel.
neighboring points from the original point’s set PO . These
features are used for deriving a traversability measure of the APx + BPy + CPz + D
map, see the itemized cost critics in Sec. III-A. gcs = ∀k ∈ {1, .., j}, max( √ ). (5)
A2 + B 2 + C 2
We construct the surfels with sampled points and their
neighbors within radius rs , from the original point set PO . Where gcs is the distance of the farthest point from the surfel
Let us refer to this acquired surfel set as the original surfel plane (along the normal vector), the gcs should be less than
set SO , each surfel in this set is defined by a 3D position the distance of the robot chassis to the ground to ensure safe
vs ∈ R3 , a 3D normal vector ns ∈ R3 and a radius rs ∈ R1 . ground clearance.
The radius of surfel rs is approximated to ds /2.0 to minimize The individual critics are then linearly weighted by
overlapping where ds is the voxel size. Positions of surfels are
directly acquired from the sampled point’s positions, while the Js = wt ts + wpd pds + whd hds + wgc gcs , (6)
normal vectors are determined by the RANSAC plane fitting
to the neighboring points of each sampled point. Plane fitting where wt , wpd , whd and wgc are adjustable parameters to
for points within surfel boundaries is expressed in the plane weigh each critic according to the desired behavior. In the
equation given in Eq. 1. Then the normal vector for surfel ns default configuration, the values for each weight are set to
becomes the coefficients of the plane equation: 0.25, meaning they equally contribute to determining the
overall traversability cost of a region represented by a surfel.
Ax + By + Cz + D = 0, Js becomes the cost associated with surfel Ss . Elevating each
(1)
ns = (nsx , nsy , nsz ) = (A, B, C) surfel along their normal vector ns by de (see Eq. 7), we
A, B, C, D are coefficients of scalar equation of a plane. ns achieve a thin layer that follows the structure of terrain with
becomes normal vector for the plane. a distance. de is configuration parameter that can be adjusted
according to physical properties of robot, distance from robot’s
A. Regressing Costs on Local Patches of Terrain center of gravity to the ground is a good default parameter for
We regress the cost values for each surfel in the original the de . We refer to this elevated surfel set as SE .
surfel set SO , and this cost defines a traversability measure for
each surfel. We select four critics to determine a cost value v s = v s + de ns , (7)
associated with each surfel:
• Tilt of the slope within a surfel, ts . At this stage, the elevated surfel set SE resembles a thin
• Average point deviation from the plane of a surfel, pds , layer (consisting of surfels) on top of the terrain underneath.
is used to determine roughness. We add a volume to this thin layer by stacking several surfels
• Max height difference of points within the surfel, hds . together (along with their normal vectors) with a step size
• Ground Clarence to the robot’s bottom chassis, gcs . ssize . An illustration of this process is depicted in Fig. 2.
These cost critics aim to determine cost values for terrain The result of this stacking looks like a “waffle” that can be
patches considering the robot’s physical constraints, e.g., roll approximated to a cylinder if ssize is chosen small enough.
or pitch angle, terrain roughness, ground clearance, etc. This volume is then used for planning and navigation on
The following equations show how these cost critics are unstructured terrains. We refer to this volume as elevation
computed with the acquired surfel properties. Note that the state-space.
63
Fig. 4: Elevation state-space is composed of SE2 and z
components. We account for the kinematic constraints of an
Ackerman type by considering Dubins or Reeds-Sheep space
for the SE2 element.
Fig. 2: Waffle surfels on top of an unstructured environment
patch. Stacked surfels that construct one waffle have identical
normal vectors, radius, and cost values. We achieve a con- of SE2 members are used to account for kinematic constraints
strained state-space suitable for unstructured terrain planning related to the maneuvers. The constraint of the z component,
and navigation with the side-by-side collection of surfel waf- which enforces constant contact of robot base and terrain, is
fles. handled by the elevated surfels introduced in Sec. III-A
C. Implementation Details
We implement the new state-space into the OMPL frame-
work. The implementation is done in the C++ programming
language. A few substantial C++ objects that construct the
Fig. 3: Cost values are encoded with RGB channels to the body of our state-space implementation are Elevation State-
terrain point cloud map. Space, Surfel-based Valid State Sampler (SVSS), and Surfel
Cost Optimization Objective (SCOO). We can distinguish
between traversable and non-traversable regions with the cost
B. Integrating the Physical and Kinematic Constraints regression explained in Sec. III-A. SVSS exploits this fact
A robot navigating in unstructured terrain is subject to the and only generates samples within traversable areas. It is also
limitations of maneuvers due to its physical and kinematic possible to enable discrete probability distribution to give a
constraints. higher probability of drawing samples from lower-cost regions.
Physical constraints are robot-specific parameters. For in- The samples are drawn directly from the elevated surfel set
stance, the maximum tilt angle that a robot can bear while SE , in Eq. 8, we weigh the likelihood of a sample being drawn
navigating is a robot-specific physical limitation. Another according to their costs. The higher a surfel cost is, the less
robot-specific limitation is ground clearance [9] which ensures likely SVSS will mark it as a sample for OMPL planners, the
contact-free robot chassis-terrain interaction. We address phys- probability is calculated with Equation 8.
ical constraints by regulating our cost critics (see Sec. III-A) P (i|(M/J0 ), (M/J1 ), ..., (M/Jn−1 )) =
to a configurable range. The range for each cost critic is an
Ji (8)
adjustable parameter and can be adapted according to the Pn−1 ,0 ≤ i < n
robot type and desired behavior. For instance, setting a tight k=0 M/Jk
range for cost critic ts (tilt of the terrain) from Sec. III-A Where M is the max cost threshold, we assume 255. P
will lead to a more inclination-sensitive traversability map; describes the probability of surfel Si being drawn from surfel
thus, the robot will not navigate with such settings through set SE as a sample, and Ji is the cost associated with surfel
highly inclined regions, see Fig. 9. In Fig. 9, the red-colored Si .
patches are marked as non-traversable, resulting in cost critics Some OMPL planners, such as RRT* and PRM* [5] re-
being out of defined ranges. Kinematic constraints, e.g., for ferred to as optimizing planners, can use an objective function
the Ackerman type of robot, are considered in the planning to optimize a path according to a given objective. The opti-
stage. Typically, Dubins [1] or Reeds-Sheep [13] shortest mizing planners use the objective function’s value to judge the
curves handle these constraints. When the planning problem quality of the path to improve the quality of the path based on
is aimed at unstructured terrains, the situation gets more this. SCOO sums up costs given in the Eq. 6, SCOO can be
complicated as traditionally, Dubins and Reeds-Sheep are used with other optimization objectives such as shortest length
computed on a 2D plane, but an additional dimension (z) obstacle clearance objectives already present in OMPL.
exists for unstructured terrains. Our method deals with this
situation by decomposing SE2 and z as separate components IV. E XPERIMENTS AND VALIDATION
(see Fig. 4) of elevation state-space. SE2 state-space is used We use data from a real environment consisting of unstruc-
for robots operating in a flat environment with three degrees tured terrain with various objects to evaluate our method. We
of freedom (x, y, heading). Dubins or Reeds Sheep variants use a LIDAR-based SLAM approach to acquire the natural
64
environment’s point cloud map seen in Fig. 7. Recent SLAM edges formed by piles of sand and gravel resulting from
approaches [16], [17] and [6] can produce dense and accurate some building works. This limits feasible regions for the safe
point cloud maps using a LIDAR, IMU, and optionally GNSS. navigation of the robot. Repetitive planning and navigation
LIO-SAM by Shan et al. [17] proved to produce reliable maps queries from the different start positions to the goal inside the
of these methods. Therefore, we run our tests on the maps excavation resulted in successful navigation.
produced by LIO-SAM. In addition to actual data from the real
C. Comparison of Planners Performance in Elevation State-
world, we use point cloud maps generated from the Gazebo
space
simulator. As our robot platform, Thorvald II by Grimstad et
al. [3], Thorvald II is a modular agricultural robot suitable for The selected edge cases are insufficient to draw general
unstructured environment navigation. conclusions on planners’ performance in the proposed state-
space. We perform benchmarking for the success rates of
A. Comparison to Standard ROS Navigation Framework RRT* and PRM* planners from the OMPL on a randomly
Two approaches, Mesh Nav [12] and Navigation2 [7], have generated problem set to draw a general conclusion on plan-
organized software implementations that we may compare our ners’ performance in the elevation state-space. The randomly
approach as baselines. Navigation2 is a widely used navigation generated set consists of 100 planning problems in the real or
framework that is built on top of a well-matured software simulated map (randomly chosen for each problem) used in
stack [8]. The primary use of this framework is in indoor the edge-case tests. All randomly generated problems satisfy
scenes; however, outdoor scenes are also possible. We realize the following conditions:
that, for even terrain patches, both our method and Navigation2 • The Euclidean distance between the randomly generated
perform as expected. The performance difference reveals itself start and goal pose is from 40 meters up to 100 meters.
when the terrain is uneven (e.g., steep inclinations). Our • There exists at least one exact solution for the generated
approach correctly models the unstructured nature of outdoor problem.
scenes, leading to superior performance over Navigation2. An We verify that at least one solution exists for the generated
example case is depicted in Fig. 6 where a robot (Ackerman problem by setting the timeout parameter to a very long wait
type) is given an identical navigation task over an unstructured time, 5 min. If the planner found an exact solution within that
terrain. Thanks to the terrain feature-aware nature of elevation time interval, we add the generated problem for benchmarking;
state-space, the planning, and navigation tasks are effortlessly if the planner found no solution for the generated problem,
handled, whereas Navigation2 fails to address the tasks due we skip it and generate another one until 100 problems are
to a lack of accurate representation of uneven terrain since it available for benchmarking. A solution is considered exact if
bases on a 2D occupancy grid. the path has been connected to the goal state with less than 0.2
meters gap and approximate if this gap is less than 10 meters.
B. Edge-Case Tests in Real Environment
The success rate results from 100 runs are given in Fig. 8.
The edge cases are aimed to qualitatively assess the planning 1) SCOO assessment: We introduced SCOO in Sec. III-C.
and navigation performance in challenging parts of an actual It is worth noting how this optimization objective affects the
environment map. In the first case, the robot must navigate behavior of planners. Planners tend to avoid higher-cost areas
from various poses to one goal pose in the map depicted in with SCOO, including steep inclinations and rougher terrain
Fig. 7. patches. In the figure, we compare results from two instances
1) Navigating through an unstructured inclination: In this of PRM* where one instance is configured with SCOO, and
case, the robot is expected to navigate through an unstructured the other instance is configured with the length optimization
inclination; see Fig. 7. Although we assume a robot model objective. The length optimization objective tries to maintain
that can take on unstructured terrain, the robot is expected the shortest path possible. In contrast, the SCOO considers the
to follow a smoother and less inclined path to minimize integrated cost values along this path, thanks to our state-space
energy consumption and risk factors. For this case, there are representation’s available traversability measures(surfel costs).
approximately 2 meters of elevation variance from start to In this case, the SCOO gets safer and less energy-consuming
goal, with the total distance to the goal being around 65 meters. paths. See an example in Fig. 9, where the SCOO steers the
The planning and consecutive navigation for this case path towards greener areas, indicating lower-cost zones.
consistently resulted in success. See Fig. 7 Fig. 5 for instance.
The planned paths are steered towards feasible regions, mainly D. Discussion
when the inclination raises. Fig. 7 and Fig. 5 belong to the We provided results from randomly generated problems in
same scene, but experiments in Fig. 5 were performed during Sec. IV-C to evaluate the use and benefits of the proposed
winter. method based on the robot model on hand. Point clouds
2) Navigating through the excavation of the building site: are accurate approximations of the actual 3D world and are
In this case, the robot tries to navigate through an excavation the most common output format from 3D SLAM methods.
residual of a building in the real environment. The challenge Therefore, we argue that it is beneficial to base robot naviga-
of this case was the height difference of the excavation area tion directly on raw point clouds. We used the intermediate
from the rest of the ground plane and its non-traversable surfel representation since we are interested in leveraging
65
Fig. 5: A sequence of images showing auto navigation of Thorvald II robot in a real unstructured snowy terrain with the help
of elevation state-space. In sequences, at the top-left third-person view of the robot, at the bottom-left robot’s POV, and on the
right side, the RVIZ view of the map is depicted. In the RVIZ view, the planned path also can be seen(yellow-colored line).
(a)
(b)
Fig. 6: The robot effortlessly goes through an inclined hill and reaches the desired goal in elevation state-space (a). In the
classical navigation framework, as the robot gets closer to the inclined terrain, it perceives the hill as an obstacle. While in
reality, it is traversable for the given robot model. This leads to confusion on the planner and controller, ultimately causing
navigation failure (b).
the abstract features of point cloud surfaces. Concepts that Our method’s concept of incorporating traversability costs to
proposed method introduced, i.e., SVSS and SCOO proved to planning through SCOO becomes helpful in defining what
compliment the performance of the OMPL planners in the should be the characteristics of an optimal path because it
context of unstructured terrain planning see Fig. 8. is possible to weigh the cost critics.
With the existing state-space implementations in OMPL,
such as SE3 SE2, it was impossible to achieve feasible path V. C ONCLUSION
plans in unstructured terrains. The simple reason for this is This paper used the surfel representation of underlying
that these state-spaces are not aware of constraints(kinematic point clouds to introduce a constraint-aware state-space. The
or physical) that the planner must respect. In contrast, these outcomes of the experiments show that the method can be
constraints are considered and respected in the proposed used to navigate in difficult, uneven surroundings. Within
elevation state-space. With the introduced cost critics, we can the suggested method, it is also possible to account for the
readjust the behaviors of planners according to the desired kinematic and physical constraints of different robot kinds.
result. For instance, increasing the weight of average point The inefficiency of our approach when dealing with kinematic
deviation from the surfel plane critic (pds in Sec. III-A) forced constraints is one of its flaws. When employing Dubins or
plans to be generated in smoother areas rather than short- Reeds Sheep state-spaces, it takes a long time for planners
cutting towards the goal. In unstructured terrains, defining the to converge to near-optimal paths. In the future, we’d like to
characteristics of an optimal path is challenging. For instance, expand on this work by looking at efficiency difficulties while
which path is the most optimal and desired path with minor accounting for kinematic constraints and standardizing motion
energy consumption, the shortest length, or the least risk of planning and navigation for unstructured terrains using point
rollover?. We argue that the answer is application-specific. clouds.
66
Fig. 9: Compared to the path length optimization objective,
SCOO maintains approximately the same path length with
lower traversability cost overall, improving the safety and
energy consumption during navigation. Refer to Fig. 3 for the
relation between RGB color and cost values. SCOO can also be
used with other optimization objectives to achieve composite
behavior.
Fig. 7: Example result of navigating through an unstructured
inclination mentioned in Sec. IV-B1. The robot start position [7] S. Macenski, F. Martin, R. White, and J. Ginés Clavero. The marathon
is marked with white circle and the white circle with the flag 2: A navigation system. In 2020 IEEE/RSJ International Conference on
is the goal position, in the upper side, a drone image of the Intelligent Robots and Systems (IROS), 2020.
[8] E. Marder-Eppstein, E. Berger, T. Foote, B. Gerkey, and K. Konolige.
case is presented. At the bottom, a cost regressed map of the The office marathon: Robust navigation in an indoor office environment.
environment and produced path (in cyan) are visualized. In International Conference on Robotics and Automation, 2010.
[9] P. Papadakis. Terrain traversability analysis methods for unmanned
ground vehicles: A survey. Engineering Applications of Artificial
Intelligence, 26(4):1373–1385, 2013.
[10] H. Pfister, M. Zwicker, J. Baar, and M. Gross. Surfels: Surface
elements as rendering primitives. Proceedings of the ACM SIGGRAPH
Conference on Computer Graphics, 05 2000.
[11] A. Pfrunder, P. V. K. Borges, A. R. Romero, G. Catt, and A. Elfes.
Real-time autonomous ground vehicle navigation in heterogeneous en-
vironments using a 3d lidar. In 2017 IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS), pages 2601–2608, 2017.
[12] S. Pütz, T. Wiemann, J. Sprickerhof, and J. Hertzberg. 3d navi-
gation mesh generation for path planning in uneven terrain. IFAC-
Fig. 8: Success rates from OMPL planners for randomly PapersOnLine, 49(15):212–217, 2016. 9th IFAC Symposium on In-
generated 100 planning problems on an unstructured terrain. telligent Autonomous Vehicles IAV 2016.
[13] J. A. Reeds and L. A. Shepp. Optimal paths for a car that goes both
Without exploiting surface information of elevation, planners forwards and backwards. Pacific Journal of Mathematics, 145(2):367 –
suffer as most sampled states are generated in undesired areas. 393, 1990.
However, guided by SVSS, the first instance of PRM* solves [14] F. Ruetz, E. Hernández, M. Pfeiffer, H. Oleynikova, M. Cox, T. Lowe,
and P. Borges. Ovpc mesh: 3d free-space representation for local ground
all planning problems. vehicle navigation. In 2019 International Conference on Robotics and
Automation (ICRA), pages 8648–8654, 2019.
[15] R. B. Rusu and S. Cousins. 3d is here: Point cloud library (pcl). In 2011
IEEE International Conference on Robotics and Automation, pages 1–4,
R EFERENCES 2011.
[16] T. Shan and B. Englot. Lego-loam: Lightweight and ground-optimized
[1] L. E. Dubins. On curves of minimal length with a constraint on average lidar odometry and mapping on variable terrain. In IEEE/RSJ Inter-
curvature, and with prescribed initial and terminal positions and tangents. national Conference on Intelligent Robots and Systems (IROS), pages
American Journal of Mathematics, 79(3):497–516, 1957. 4758–4765. IEEE, 2018.
[2] D. D. Fan, K. Otsu, Y. Kubo, A. Dixit, J. Burdick, and A. Agha- [17] T. Shan, B. Englot, D. Meyers, W. Wang, C. Ratti, and R. Daniela.
Mohammadi. STEP: stochastic traversability evaluation and planning Lio-sam: Tightly-coupled lidar inertial odometry via smoothing and
for safe off-road navigation. CoRR, abs/2103.02828, 2021. mapping. In IEEE/RSJ International Conference on Intelligent Robots
[3] L. Grimstad and P. J. From. Thorvald ii - a modular and re-configurable and Systems (IROS), pages 5135–5142. IEEE, 2020.
agricultural robot. IFAC-PapersOnLine, 50(1):4588–4593, 2017. 20th [18] I. A. Sucan, M. Moll, and L. E. Kavraki. The open motion planning
IFAC World Congress. library. IEEE Robotics Automation Magazine, 19(4):72–82, 2012.
[4] A. Hornung, K. M. Wurm, M. Bennewitz, C. Stachniss, and W. Burgard. [19] C. Wang, L. Meng, S. She, I. M. Mitchell, T. Li, F. Tung, W. Wan, M. Q.-
OctoMap: An efficient probabilistic 3D mapping framework based on H. Meng, and C. W. de Silva. Autonomous mobile robot navigation
octrees. Autonomous Robots, 2013. Software available at http://octomap. in uneven and unstructured indoor environments. In 2017 IEEE/RSJ
github.com. International Conference on Intelligent Robots and Systems (IROS),
[5] S. Karaman and E. Frazzoli. Sampling-based algorithms for optimal pages 109–116, 2017.
motion planning. CoRR, abs/1105.1186, 2011. [20] M. Wermelinger, P. Fankhauser, R. Diethelm, P. Krüsi, R. Siegwart, and
[6] S.-W. Lee, C.-M. Hsu, M.-C. Lee, Y.-T. Fu, F. Atas, and A. Tsai. Fast M. Hutter. Navigation planning for legged robots in challenging terrain.
point cloud feature extraction for real-time slam. In 2019 International pages 1184–1189, 10 2016.
Automatic Control Conference (CACS), pages 1–6, 2019.
67
Paper III
Navigating in 3D Uneven
Environments Through
Supervoxels and Nonlinear MPC
III
69
Navigating in 3D Uneven Environments through Supervoxels and
Nonlinear MPC
Fetullah Atas 1 Grzegorz Cielniak 2 Lars Grimstad 1
71
based framework for uneven terrain navigation; however, the • We introduce a new navigation strategy that utilizes
work did not deal with dynamic obstacles and assumed static supervoxels and NMPC, which have been shown to be
environments. effective in real uneven terrain navigation.
2) Geometric Motion Planning: Sampling-based motion • Through a series of experiments in simulated and real-
planning has been demonstrated as an effective strategy to deal world uneven terrains, we demonstrate that the proposed
with planning for robots in uneven environments in various method’s planning module outperforms existing state-of-
works such as in Jian et al. [8] and Atas et al. [2]. In the the-art sampling-based planners in terms of efficiency and
following, we provide a brief review on sampling-based mo- flexibility.
tion planning. The PRM Kavraki et al. [10] and RRT Lavalle • We provide an open-source implementation of our ap-
and Kuffner [12] algorithms and their dozens of variants proach.
have paved the way for sampling-based motion planning. The
authors in Karaman and Frazzoli [9] investigated sampling- II. A PPROACH
based planners’ completeness and optimality properties to A. Problem Statement
understand formal guarantees further. Some sampling-based
planners were later improved to account for non-holonomic A successful navigation task will produce a discrete motion
constraints Palmieri et al. [18]. A library called Open Motion plan defined by state s̃ and control ũ sets at the final stage t.
Planning Library (OMPL) Sucan et al. [26] contains imple- The state and control sets are as follows:
mentations of many sampling-based planners. The majority
of these planners are based on RRT* and PRM* Karaman s̃ = (s1 , s2 , ..., st ), ũ = (u1 , u2 , ..., ut ). (1)
and Frazzoli [9] planners (e.g., RRTX Otte and Frazzoli [17], Based on Eq. 1, we define the following components for
LazyPRM* Bohlin and Kavraki [3], InformedRRT* Gammell feasible navigation:
et al. [5], and so on), but a newer collection of planners
uses a structure that contains both graphs and trees (e.g., 1) A finite state space X.
BIT* Gammell et al. [6], ABIT* Strub and Gammell [24], 2) A control space U (s) for each state s ∈ X.
AIT* Strub and Gammell [25]). Some methods utilize par- 3) A state transition function f that produces next state
allelized approaches (e.g., CFOREST Otte and Correll [16], f (s, u) ∈ X.
AnytimePartShortening (APS) Luna et al. [13]), in which 4) A set of stages denoted by t that begins at t = 0 and
many planners execute concurrently on different threads while continues indefinitely, and a goal set XG ∈ X.
planners inform each other of milestones reached, leading to A successful navigation task is achieved by the function Υ
better performance overall. that maps every state to control; X −→ U .
3) NMPC for Collision-Free Control: Numerous studies
B. General Overview
have established the effectiveness of Nonlinear Model Predic-
tive Control (NMPC) in achieving optimal control of diverse Our approach is divided into three stages. First, we develop
robotic systems including mobile robots Salimi Lafmejani and a method for evaluating the traversability of a point cloud
Berman [22], unmanned aerial vehicles (UAVs) Mansouri et al. map by analyzing the geometric characteristics of local ter-
[15], and autonomous vehicles Yu et al. [28]. These systems rain patches. The second stage involves the construction of
share a common trait, which is the presence of multiple supervoxels on point cloud maps with quantified traversability
constraints that must be accounted for in each control cycle. values, the supervoxels are then used to calculate minimal cost
These constraints may include limitations on state dynamics, paths either with the Dijkstra or A* graph search algorithms.
non-collision requirements, and limits on state variables such The Final stage utilizes the calculated geometrical path to
as velocity or acceleration, among others. The approach we determine the optimal and kinodynamically feasible control
propose shares similarities with the work of Yu et al. [28], commands until a termination condition is met (e.g. goal
who also present an NMPC scheme for obstacle-aware uneven reached, collision detected, etc.). In the following sub-sections,
terrain navigation. However, our method offers two distinct we will delve into the specifics of each stage of our approach.
advantages over their work. Firstly, we validate our approach
through experiments on an actual robot, in addition to simu- C. Traversability Assessment via Local Geometric Features
lations. Secondly, our method demonstrates a faster run cycle Suppose that an environment is represented with a point
of approximately 20Hz, whereas their reported control loop cloud map. We discretize this point cloud map through uniform
runs at approximately 3Hz. The NMPC scheme proposed in sampling with a voxel size of ds , resulting in a subset of
PUTN Jian et al. [8] also holds relevance. However, the authors j points referred to as the sampled point set PS . From
of this scheme have used a 2D kinematic model and the speed this sampled point set, we extract local terrain features such
of the control loop has not been explicitly reported. as roughness, tilt, and maximum height by analyzing the
geometrical relationship between the sampled points and their
B. Contributions neighboring points from the original point cloud map.
The specific contributions of our work include the follow- To obtain geometric features, we search for the radial
ing: nearest neighbors of each sampled point in the original point
72
Fig. 2: A comprehensive overview of our approach is presented. The first step involves the calculation of the traversability of a given point
cloud map, taking into account the specific capabilities of the robot (e.g. max tilt angle, min distance for ground clearance, etc.). Secondly,
utilizing the obtained traversability map, the traversable regions are segmented into compact supervoxels. Finally, the NMPC optimization
process is employed for path tracking by producing optimal control values that are kinodynamically feasible.
Ax + By + Cz + D = 0,
(2)
ns = (nsx , nsy , nsz ) = (A, B, C)
The coefficients A, B, C, D represent the scalar equation of
a plane. The normal vector for this plane (of a disk) is given
by ns .
We use the following cost critics to assign a traversability
cost to each disk.
• Tilt of the slope within the disk, represented by td .
• Average deviation of points from the plane of the disk,
represented by pdd , to measure roughness. Fig. 4: The impact of varying parameters on the traversability map
• Maximum height difference of points within the disk,
is shown. In (a), (b), and (c), the effect of using different disk
radii is depicted. In (d), (e), and (f), the effect of choosing different
represented by hdd . max ranges for the tilt cost is depicted. Just as the tilt cost critic
• Ground clearance for the bottom chassis of the robot, can be adjusted, the range and weight of other cost critics can
represented by gcd . also be customized, providing flexibility to accommodate robots
These criteria aim to evaluate the terrain patches based on the with different traversability capabilities. The RGB values encode
traversability cost values. See Fig. 3.
robot’s physical limitations, such as the maximum tilt (roll or
pitch) angle, the roughness of the terrain, ground clearance,
etc. The resulting cost values will reflect these constraints and here, arctan(ndx , ndz ) and arctan(ndy , ndz ) represent the roll
indicate the disk’s suitability for traversal. The cost values for and pitch of the disk, respectively.
each disk are computed using the following equations, which Xn
APx + BPy + CPz + D
are based on the geometrical properties of the points within pdd = 1/n | √ | (4)
disks. The cost values are then distributed to all points in the k=1
A2 + B 2 + C 2
global point cloud map.
where n denotes the number of points within the disk. The
roughness of a disk is determined by the average point
td = arg max(| arctan(ndx , ndz )|, | arctan(ndy , ndz )|) (3) deviation from the disk plane, pdd . This value is calculated
73
by summing up the distances from each point within the disk
to the disk plane and then dividing the result by the number
of points in the disk.
hdd = arg max(Pz1 , .., Pzn ) − arg min(Pz1 , ..., Pzn ), (5)
0:n 0:n
74
The NMPC optimization is formulated as;
u∗0:h−1 = arg min(Jt (s0:h , u0:h−1 ) + Jir (u0:h−1 )
0:h−1 (10)
+Jobs (s0:h , O))
subject to,
sk+1 = f (sk , uk ),
B(sk ) ∩ O = ∅, (11)
Umin ≤ U0:h−1 ≤ Umax
• The cost term Jt (s0:h , u0:h−1 ) is used to ensure that the
system follows the global path as closely as possible to
the Euclidean path between supervoxels. This cost term
also guarantees that the system’s kinematic constraints are
met since sk+1 = f (sk , uk ) is a hard equality constraint. Fig. 6: The figure presents the sample navigation scenes in a
• The term Jir (u0:h−1 ) represents the input rate cost, which simulated environment with uneven terrain and obstacles. The starting
aims to minimize the jerky movements of the control position of the robot is indicated by a checkered flag.
outputs.
• The term Jobs (s0:h , O) takes into account the cost associ- III. E XPERIMENTS
ated with the proximity of the system to obstacles, which
are represented as time-varying ellipsoids. Collision-free We conduct experiments in 3D uneven terrains with varied
control policies are guaranteed with B(sk ) ∩ O = ∅. slopes and roughness in both simulated and real environments.
Buildings, poles, trees, and other items are examples of objects
Specifically, the tracking cost Jt and the input rate cost Jir
in the environment. The maps are around 300x300 meters
are defined as follows:
h−1
in size. The simulated environment consists of more steep
X hills and more significant variance in terms of inclination. Our
Jt (s0:h ) = (si − sref ref T
i )Q(si − si ) ,
method relies on point cloud maps for the occupancy informa-
i=0
h−1
(12) tion of environments. Hence we construct point cloud maps
X
(ui − udv dv T of real environments with a SLAM method named LIO-SAM
Jir (u0:h ) = i )R(ui − ui )
i=0 by Shan et al. [23]. For building maps in real environments,
Q and R diagonal matrices are used to penalize specific we use the Ouster OS1-64 LiDAR and MTi-30-2A8G4 Xsens
state errors and input jerks. IMU. In the simulation, we rely on the Gazebo simulator
Finally, the obstacle costs are defined as follows: and simulated sensors and extract point cloud maps from the
simulated environment through a software plugin Koenig and
h−1 n Howard [11]. The onboard localization was performed through
XX
Dobs (s0:h ) = (xi − xobs 2 obs 2 a combination of ICP-based LIDAR, wheel odometry, and
j ) /(aj /2.0 + r) ,
i=0 j=0 IMU. The autonomy system is run on a ZOTAC ZBOX with
h−1
XX n (13) an Intel Core i7 CPU and an Nvidia RTX 2060.
+ (yi − yjobs )2 /(bobs 2
j /2.0 + r) ,
For the robot platform, the Thorvald II modular robot was
i=0 j=0 used Grimstad and From [7]. The software specifically de-
Jobs (s0:h ) = e(1.0/Dobs (s0:h )) signed for Thorvald runs on an Intel NUC computer equipped
with a Core i7 CPU, and the communication between all
As shown in Eq. 13, the cost is inversely proportional to components is managed by ROS 2 Macenski et al. [14].
the distance between the robot and obstacles. The variable n In evaluating the motion plan generated, two metrics are
represents the number of obstacles in O, h is the time horizon considered: the length of the path and the computation time.
considered in the NMPC setup and r is the robot radius. Each Tab. I is created based on these two evaluation metrics.
obstacle Oj is represented by a vector [xobs obs obs obs
j , yj , aj , bj ],
which are used in Eq. 14 to define the ellipse, see Fig. 2. A. Qualitative Performance Evaluation
(xobs )2 (y obs )2 In Fig. 8a, the robot arrives at the destination successfully;
+ obs 2 = 1 (14)
(aobs )2 (b ) the path depicted in blue is the resultant feedback plan, the
We implement the NMPC using CasADi Andersson et al. start pose is marked with a checkered flag. Similarly, in Fig. 8a
[1], a software framework that specializes in nonlinear opti- the robot navigates from a similar start pose (as in Fig. 8b) but
mization and optimal control. Although the NMPC algorithm to a closer goal pose, resulting in a near-optimal path depicted
can handle an arbitrary number of obstacles, to improve the in blue color. For Fig. 8a and Fig. 8b, we refer to optimality
speed of the control loop, we limit the number of obstacles in sense of Euclidean distance. However, with the availability
considered by the algorithm to those that are within 20 meters of traversability costs (represented with RGB colors in point
of the robot’s current position. cloud maps, seeFig. 3) we can re-adjust the sense of optimality
75
to refer to the traversability cost in point cloud maps. Hence of planning problems consisting of 20 unique planning prob-
the method can be configured such that it can navigate the lems in the map depicted atFig. 7. Each planner is requested
robot with one of the following properties: to solve 20 different planning problems five times, resulting
• least inclined path. in a total of 100 runs. The results of 100 runs are presented
• shortest path. in Tab. I.
• least rough path. Our method produced shorter paths in significantly less
In Fig. 7 (a), we configure the cost term such that it time. Optimal sampling-based planners require a timeout pa-
accounts for traversability costs. The resulting motion plans rameter where they use all allowed time to construct a valid
avoid higher-cost regions, resulting in an optimal motion plan path with minimal cost, in this case, the shortest length. To
where less-inclined, less-rough terrain segments are preferred observe the behavior of sampling-based planners with different
rather than a short distance. Hence the resultant motion plans time constraints, we set the timeout to 10 and 20 seconds
tend to navigate through greener regions. This allows the robot consecutively. The planners can improve their performance
to navigate through inclined terrains safely by avoiding steep due to the available time in the 20-second setup. To establish a
hills and rough terrain patches. baseline, we run the APS planner for 60 seconds; our method
almost achieves APS’s performance in less than one second.
B. Tuning the Method Based on the results in Tab. I, our technique distinguishes
itself from sampling-based planners by its short execution
The tuning of our algorithm is straightforward as it involves
time and deterministic nature. Compared to sampling-based
adjusting only a small number of configuration parameters.
planners, our method has the lowest final plan length deviation
Specifically, two parameters, known as seed resolution (psr )
in both setups (10 and 20 seconds), indicating consistency in
and resolution (pr ), are used in the construction of supervoxels.
the obtained results.
The seed resolution (psr ) is responsible for identifying the
radius of nearest neighbors that are used to compute the
Planners Length Mean (20 sec.) Length Mean (10 sec.)
supervoxel boundaries, while the resolution (pr ) determines
RRT* 65.2 ± 9.6 75.7 ± 13.9
the actual size of the supervoxel. Smaller values of pr result PRM* 51.2 ± 2.2 67.0 ± 12.0
in finer resolution of the global path, but we recommend using AIT* 52.0 ± 3.7 58.8 ± 8.3
values in the range of [0.1, 0.25] for large-scale outdoor robot CFOREST 51.6 ± 3.2 58.3 ± 6.9
APS 50.7 ± 1.8 54.1 ± 6.1
navigation. In our reported results, we set psr = 1.0 and APS (60 sec.) 48.7 ± 0.5
pr = 0.2. The general rule is that pr should be large enough Ours (0.65 sec.) 49.5 ± 1.3
to cover a number of points.
TABLE I: Means of acquired path lengths with 10 sec—timeout
Q = Diag(10, 10, 10, 0.1, 0, 0, 0.1), from 100 runs. A lower mean value indicates that planners achieve
(15) better performance for both metrics (shorter path in a shorter time).
R = Diag(10, 100)
The results in Tab. I show that the proposed approach is
In Eq. 12, we introduced tracking cost and input rate able to produce shorter paths in a significantly shorter amount
gain matrices Q and R. In this paper, we do not propose a of time compared to other planners. Specifically, the approach
methodological approach to tune these gain matrices optimally. generated 8.56% shorter paths than the next best planner (APS)
However, from the experimental observation, we realize that while only taking 6.68% of the time consumed by APS. It
the angular rate needs to be penalized higher as otherwise, was also observed that the best-performing baseline planner
the robot oscillates. From the optimality perspective, it is best (APS) required more than 40 seconds to surpass the proposed
to control the robot as close as possible to the coarse plan method. Additionally, it should be noted that the proposed
since the optimal property is explicitly contained within graph method is an anytime planner that re-computes supervoxels
traversal by Dijkstra or A*. Therefore in Q, we penalize posi- efficiently for each planning request and the reported times
tional errors on x, y, z heavier than the controllable heading ψ also include the time spent for supervoxel creation.
and velocity v. In the presented results, Q and R were chosen Our method is able to generate plans at a significantly
as in Eq. 15. faster rate than sampling-based planners. This is because
We also provide customizable objective selection in our our method leverages the underlying terrain information in
software implementation to optimize ensuing feedback plans, an active manner, whereas sampling-based planners do not
allowing us to achieve plans with the various characteristics take advantage of this information by default. As a result,
itemized in Subsec. III-A. generating a feasible plan for uneven terrain (e.g. no jump-
overs above buildings) for a ground robot in uneven terrain can
C. Comparison to Sampling-based planners
be a time-consuming process using sampling-based planners.
In this subsection, we evaluate the geometric plan produced This highlights the efficiency of the proposed method.
by our approach. As baselines, we choose some of the state-
of-the-art sampling-based planners. We compare the proposed IV. C ONCLUSION
method to various planners such as AnytimePathShortening In this paper, we presented a novel approach for navigating
(APS), CFOREST, AIT*, and others. We create a benchmark uneven terrains based on geometric traversability analysis,
76
Fig. 7: A comparison of the performance of the proposed approach with an objective set to the inclination cost (a) and Euclidean distance
(b), to that of the APS planner (c). The APS planner provides a longer path than the proposed approach despite taking more than ten times
longer to compute.
R EFERENCES
[1] Joel A E Andersson, Joris Gillis, Greg Horn, James B
Rawlings, and Moritz Diehl. CasADi – A software
framework for nonlinear optimization and optimal con-
trol. Mathematical Programming Computation, 11(1):
1–36, 2019. doi: 10.1007/s12532-018-0139-4.
[2] Fetullah Atas, Grzegorz Cielniak, and Lars Grimstad.
Elevation state-space: Surfel-based navigation in uneven
(a)
environments for mobile robots. In 2022 IEEE/RSJ
International Conference on Intelligent Robots and Sys-
tems (IROS), pages 5715–5721, 2022. doi: 10.1109/
IROS47612.2022.9981647.
[3] Robert Bohlin and Lydia Kavraki. Path planning us-
ing lazy prm. volume 1, pages 521 – 528 vol.1, 02
2000. ISBN 0-7803-5886-4. doi: 10.1109/ROBOT.2000.
844107.
[4] David D. Fan, Kyohei Otsu, Yuki Kubo, Anushri Dixit,
Joel Burdick, and Ali-Akbar Agha-Mohammadi. STEP:
stochastic traversability evaluation and planning for safe
(b)
off-road navigation. CoRR, abs/2103.02828, 2021. URL
Fig. 8: Example optimal motion plans resulting from the proposed the https://arxiv.org/abs/2103.02828.
approach in a real uneven environment. The chequered flag indicates [5] Jonathan D. Gammell, Siddhartha S. Srinivasa, and Tim-
the robot’s start pose, while the robot’s icon indicates the robot’s final
pose.
othy D. Barfoot. Informed rrt*: Optimal incremental
path planning focused through an admissible ellipsoidal
heuristic. CoRR, abs/1404.2334, 2014. URL http://arxiv.
supervoxels, and NMPC. Our experimental results on a real org/abs/1404.2334.
robot operating in a 3D environment demonstrate the method’s [6] Jonathan D Gammell, Timothy D Barfoot, and Sid-
efficiency. Specifically, our approach generated better-quality dhartha S Srinivasa. Batch informed trees (bit*): In-
geometric plans in less time than state-of-the-art sampling- formed asymptotically optimal anytime search. The
based planners on a benchmark. Additionally, the method International Journal of Robotics Research, 39(5):543–
enables a robot to navigate uneven 3D terrains with various 567, 2020. doi: 10.1177/0278364919890396. URL
objectives, such as finding the shortest or least-inclined path https://doi.org/10.1177/0278364919890396.
while dynamically avoiding obstacles. [7] Lars Grimstad and Pål Johan From. Thorvald ii
One potential limitation of our proposed method is that it - a modular and re-configurable agricultural robot.
does not explicitly model uncertainty that may arise from the IFAC-PapersOnLine, 50(1):4588–4593, 2017. ISSN
interaction between the robot and the uneven terrain. This 2405-8963. doi: https://doi.org/10.1016/j.ifacol.2017.
is due to the lack of sensors on the robot platform that 08.1005. URL https://www.sciencedirect.com/science/
can perceive terrain parameters, such as soil compaction and article/pii/S2405896317314830. 20th IFAC World
slippage estimation. In future work, we plan to extend our Congress.
approach to address this limitation and handle uncertainties [8] Zhuozhu Jian, Zihong Lu, Xiao Zhou, Bin Lan, Anxing
77
Xiao, Xueqian Wang, and Bin Liang. Putn: A plane- and Joachim Hertzberg. 3d navigation mesh gen-
fitting based uneven terrain navigation framework, 2022. eration for path planning in uneven terrain. IFAC-
[9] Sertac Karaman and Emilio Frazzoli. Sampling- PapersOnLine, 49(15):212–217, 2016. ISSN 2405-
based algorithms for optimal motion planning. CoRR, 8963. doi: https://doi.org/10.1016/j.ifacol.2016.07.
abs/1105.1186, 2011. URL http://arxiv.org/abs/1105. 734. URL https://www.sciencedirect.com/science/article/
1186. pii/S2405896316310102. 9th IFAC Symposium on Intel-
[10] L.E. Kavraki, P. Svestka, J.-C. Latombe, and M.H. Over- ligent Autonomous Vehicles IAV 2016.
mars. Probabilistic roadmaps for path planning in high- [21] Sebastian Pütz, Jorge Santos Simón, and Joachim
dimensional configuration spaces. IEEE Transactions on Hertzberg. Move base flex a highly flexible navigation
Robotics and Automation, 12(4):566–580, 1996. doi: framework for mobile robots. In 2018 IEEE/RSJ Inter-
10.1109/70.508439. national Conference on Intelligent Robots and Systems
[11] N. Koenig and A. Howard. Design and use paradigms (IROS), pages 3416–3421, 2018. doi: 10.1109/IROS.
for gazebo, an open-source multi-robot simulator. In 2018.8593829.
2004 IEEE/RSJ International Conference on Intelligent [22] Amir Salimi Lafmejani and Spring Berman. Nonlin-
Robots and Systems (IROS) (IEEE Cat. No.04CH37566), ear mpc for collision-free and deadlock-free navigation
volume 3, pages 2149–2154 vol.3, 2004. doi: 10.1109/ of multiple nonholonomic mobile robots. Robotics
IROS.2004.1389727. and Autonomous Systems, 141:103774, 2021. ISSN
[12] Steven Lavalle and James Kuffner. Rapidly-exploring 0921-8890. doi: https://doi.org/10.1016/j.robot.2021.
random trees: Progress and prospects. Algorithmic and 103774. URL https://www.sciencedirect.com/science/
computational robotics: New directions, 01 2000. article/pii/S0921889021000592.
[13] R. Luna, I. A. Şucan, M. Moll, and L. E. Kavraki. Any- [23] Tixiao Shan, Brendan Englot, Drew Meyers, Wei Wang,
time solution optimization for sampling-based motion Carlo Ratti, and Rus Daniela. Lio-sam: Tightly-coupled
planning. In 2013 IEEE International Conference on lidar inertial odometry via smoothing and mapping. In
Robotics and Automation, pages 5068–5074, 2013. doi: IEEE/RSJ International Conference on Intelligent Robots
10.1109/ICRA.2013.6631301. and Systems (IROS), pages 5135–5142. IEEE, 2020.
[14] Steven Macenski, Tully Foote, Brian Gerkey, Chris [24] Marlin P. Strub and Jonathan D. Gammell. Advanced bit*
Lalancette, and William Woodall. Robot operating (abit*): Sampling-based planning with advanced graph-
system 2: Design, architecture, and uses in the wild. search techniques. 2020 IEEE International Confer-
Science Robotics, 7(66):eabm6074, 2022. doi: 10.1126/ ence on Robotics and Automation (ICRA), May 2020.
scirobotics.abm6074. URL https://www.science.org/doi/ doi: 10.1109/icra40945.2020.9196580. URL http://dx.
abs/10.1126/scirobotics.abm6074. doi.org/10.1109/ICRA40945.2020.9196580.
[15] Sina Sharif Mansouri, Christoforos Kanellakis, Björn [25] Marlin P Strub and Jonathan D Gammell. Adaptively
Lindqvist, Farhad Pourkamali-Anaraki, Ali-akbar Agha- Informed Trees (AIT*): Fast asymptotically optimal path
mohammadi, Joel Burdick, and George Nikolakopoulos. planning through adaptive heuristics. In Proceedings
A unified nmpc scheme for mavs navigation with 3d of the IEEE International Conference on Robotics and
collision avoidance under position uncertainty. IEEE Automation (ICRA), pages 3191–3198, 31 May – 31
Robotics and Automation Letters, 5(4):5740–5747, 2020. August 2020. doi: 10.1109/ICRA40945.2020.9197338.
doi: 10.1109/LRA.2020.3010485. [26] I. A. Sucan, M. Moll, and L. E. Kavraki. The open
[16] M. Otte and N. Correll. C-forest: Parallel shortest path motion planning library. IEEE Robotics Automation
planning with superlinear speedup. IEEE Transactions Magazine, 19(4):72–82, 2012. doi: 10.1109/MRA.2012.
on Robotics, 29(3):798–806, 2013. doi: 10.1109/TRO. 2205651.
2013.2240176. [27] Chaoqun Wang, Lili Meng, Sizhen She, Ian M. Mitchell,
[17] M. Otte and Emilio Frazzoli. Rrtx: Real-time motion Teng Li, Frederick Tung, Weiwei Wan, Max. Q.-H.
planning/replanning for environments with unpredictable Meng, and Clarence W. de Silva. Autonomous mobile
obstacles. In WAFR, 2014. robot navigation in uneven and unstructured indoor en-
[18] Luigi Palmieri, Sven Koenig, and Kai O. Arras. Rrt- vironments. In 2017 IEEE/RSJ International Conference
based nonholonomic motion planning using any-angle on Intelligent Robots and Systems (IROS), pages 109–
path biasing. In 2016 IEEE International Conference 116, 2017. doi: 10.1109/IROS.2017.8202145.
on Robotics and Automation (ICRA), pages 2775–2781, [28] Siyuan Yu, Congkai Shen, and Tulga Ersal. Nonlin-
2016. doi: 10.1109/ICRA.2016.7487439. ear model predictive planning and control for high-
[19] Jeremie Papon, Alexey Abramov, Markus Schoeler, and speed autonomous vehicles on 3d terrains. IFAC-
Florentin Wörgötter. Voxel cloud connectivity segmen- PapersOnLine, 54(20):412–417, 2021. ISSN 2405-
tation - supervoxels for point clouds. In 2013 IEEE 8963. doi: https://doi.org/10.1016/j.ifacol.2021.11.
Conference on Computer Vision and Pattern Recognition, 208. URL https://www.sciencedirect.com/science/article/
pages 2027–2034, 2013. doi: 10.1109/CVPR.2013.264. pii/S2405896321022527. Modeling, Estimation and Con-
[20] Sebastian Pütz, Thomas Wiemann, Jochen Sprickerhof, trol Conference MECC 2021.
78
Paper IV
CostTrust: A Fast-Exploring,
Iteratively Expanding
Frontier-Based Kinodynamic
Motion Planner
IV
79
Modeling, Identification and Control, Vol. 44, No. 4, 2023, pp. 1–14, ISSN 1890–1328
1
Faculty of Science and Technology (REALTEK), Norwegian University of Life Sciences, N-7491 Aas, Norway.
E-mail: {fetullah.atas,lars.grimstad}@nmbu.no
2
Lincoln Centre for Autonomous Systems (LCAS), University of Lincoln, Lincoln, UK. E-mail: gciel-
niak@lincoln.ac.uk
Abstract
Sampling-based motion planning has recently experienced considerable advancements, particularly in the
domain of geometric motion planning for diverse robotic systems. Nonetheless, kinodynamic motion
planning, which additionally considers a robot’s kinematics and dynamics to generate a motion plan,
remains an open challenge, necessitating further research. Kinodynamic planning, inherently more complex
than geometric planning, mandates that the planner not only adhere to motion constraints but also account
for system dynamics, including limitations in velocity and acceleration.
Furthermore, kinodynamic planning often requires the navigation of extensive state and control spaces,
rendering the process both computationally demanding and time-consuming. To effectively tackle kino-
dynamic motion planning, our proposed approach introduces a dynamic balance between exploration and
exploitation, continuously adjusted throughout the execution. Our bi-directional and multi-threaded al-
gorithm is specifically tailored to fulfill the efficiency requisites of kinodynamic motion planning. Our
comprehensive benchmarks, conducted on an Ackermann-steered robot and a dynamic quadrotor, demon-
strate that our method notably outperforms state-of-the-art baselines in terms of solution rate percentage
and path cost.
To facilitate accessibility and further research within the community, we have made the implementa-
tion of our method available ∗. It is integrated with the Open Motion Planning Library (OMPL), a widely
utilized resource in the field, enhancing our approach’s practical applicability †
robot. Conversely, kinodynamic planning delves into tion, demonstrating effective exploitation.
optimizing a robot’s motion by meticulously consider- Our research introduces several significant advance-
ing its kinematics and dynamics, such as limitations on ments in kinodynamic planning:
acceleration, velocity, and specific motion constraints.
• Development of a heuristic-like function that effec-
Kinodynamic planning finds its application in
tively balances rapid pre-solution exploration with
various domains, including robotic manipulation,
post-solution exploitation, enhancing solution re-
aerospace systems, unmanned aerial vehicles, and au-
finement—a capability not prevalent in existing
tonomous vehicles. The dynamics of these systems typ-
state-of-the-art planners.
ically involve constraints on velocity, acceleration, and
other parameters, which inherently limit their maneu- • Implementation of a multi-threaded, bi-directional
verability. This limitation presents a significant chal- tree search method, significantly surpassing the
lenge for kinodynamic planners, necessitating a keen capabilities of current state-of-the-art kinody-
focus on the system’s maneuverability during method namic planners in terms of efficiency.
development. Consequently, this elevates the complex-
ity and intricacy of kinodynamic planning problems. • Integration of our planner into the widely-used
Thanks to their efficiency, sampling-based methods Open Motion Planning Library (OMPL), enhanc-
have been the mainstay of kinodynamic planning. The ing its usability and practical application, and pro-
basic idea behind these methods is to explore the state vision of benchmarking code to facilitate further
space by generating many samples and then use the research and comparisons.
information gathered to guide the search for a feasi-
ble trajectory. These methods benefit systems with
high-dimensional state spaces and complex constraints,
2 Related Work
as they can efficiently explore the state space without For the last two decades, sampling-based motion plan-
needing an explicit representation of the entire state ning algorithms have gained popularity in robotic mo-
space. tion planning due to their effectiveness and flexibility in
Our method represents a progressive stride in kin- handling complex systems and environments. Among
odynamic motion planning, focusing on a strategy the most well-known and widely-used algorithms in
that harmoniously balances exploration and exploita- this category are the Probabilistic RoadMap (PRM)
tion. Recent advancements in geometric motion plan- algorithm Kavraki et al. (1996) and the Rapidly-
ning, exemplified by contributions such as ABIT* exploring Random Tree (RRT) algorithm Lavalle and
and BIT* Strub and Gammell (2020b); Gammell Kuffner (2000), along with a plethora of their vari-
et al. (2020), emphasize exploiting established solu- ants. To gain a deeper understanding of the optimal-
tions through informed sampling. This technique se- ity properties of sampling-based planners, the authors
lectively samples regions with the potential to enhance in Karaman and Frazzoli (2011) conducted a compre-
the current solution, incorporating these samples into hensive examination of their completeness and guar-
the underlying data structure, typically a tree or graph. antees. Several libraries and frameworks have been
However, adapting this strategy to kinodynamic mo- developed to facilitate the use, and implementation
tion planning presents unique challenges, primarily due of sampling-based motion planning algorithms, such
to the added complexity of motion constraints inher- as the Open Motion Planning Library (OMPL) Su-
ent in dynamic systems. For instance, establishing a can et al. (2012). This library contains a wide range
connection between two states in such systems is not of sampling-based planners, including RRT* Karaman
straightforward, as it requires a steering function ca- and Frazzoli (2011) and PRM Karaman and Fraz-
pable of solving a two-point boundary value problem zoli (2011), as well as more recent algorithms that
(BVP). Addressing these limitations, we introduce a incorporate both graph and tree structures, such as
tree-search methodology that leverages random for- BIT* Gammell et al. (2020), ABIT Strub and Gammell
ward propagation of system states while conforming (2020b) and AIT* Strub and Gammell (2020a). Addi-
to the kinematic and dynamic constraints of the sys- tionally, some planners utilize parallelized approaches,
tem. A notable advantage of our approach is its inde- such as CFOREST Otte and Correll (2013) and Any-
pendence from boundary value problems (BVPs) that timePartShortening (APS) Luna et al. (2013), which
necessitate solving complex differential equations for allow for concurrent execution of multiple planners on
diverse dynamic systems. Furthermore, our method different threads, resulting in improved performance
excels in rapid exploration, attributed to its selective overall.
mechanism for choosing frontiers. Simultaneously, it To extend the applicability of sampling-based plan-
can continuously refine and improve the current solu- ners to systems with dynamics (e.g., velocities, ac-
2
82
Atas et.al., “CostTrust: A Kinodynamic Motion Planner”
Figure 1: A series of snapshots illustrating the progression of our planner’s execution is presented. In snapshot
a., we observe the initial phase where multiple trees start their bi-directional search to connect the
start and goal vertices. Upon finding an initial solution, as shown in snapshot c., the focus of the
trees shifts towards exploitation of the current solution. This phase involves continuous optimization
of the path. Snapshot e. displays the culmination of this process, showcasing the optimized motion
plan. Throughout this sequence, the planner demonstrates its capability to dynamically transition
from exploration to exploitation, effectively refining the path to achieve an optimal solution.
celerations, etc.), i.e., kinodynamic planning, sev- some systems (e.g., a quadrotor with 15 DOF). Re-
eral variants of RRT algorithm have been proposed. cently, there has been an intense effort to remove the
Two notable examples include the RRT-Extend algo- need for the steering function in the motion planning
rithm LaValle and James J. Kuffner (2001) and the Ex- community. In Li et al. (2014), a kinodynamic asymp-
pansive Space Trees (EST) algorithm Hsu et al. (1997). totically optimal planner named as SST was proposed
These methods focus on the forward propagation of based on a simplified random forward search tree that
dynamics and aim to efficiently and evenly explore the does not rely on a steering function. A theoretical
state space, regardless of obstacles. Additionally sev- analysis of the proposed algorithm was provided, which
eral works extended on Karaman and Frazzoli (2011) to showed that as long as an adequate number of Monte
benefit from asymptotic optimality, e.g. Karaman and Carlo propagations (random controls and duration) are
Frazzoli (2013); Perez et al. (2012); Xie et al. (2015). added to the search tree, the algorithm ensures asymp-
However, these works relied on the availability of a totic optimality. A meta-algorithm that produces an
steering function. A steering function computes the asymptotically optimal kinodynamic planner was pre-
optimal path between two states in a scenario with sented in Hauser and Zhou (2016), given any feasi-
no obstacles. Implementing a steering function is of- ble kinodynamic planner as a subroutine without using
ten associated with solving a two-point boundary value steering functions or numerical boundary-value prob-
problem (BVP). This problem involves solving a dif- lem solvers. The increase in the use of heuristics has
ferential equation while adhering to specific boundary also become prevalent in kinodynamic planners Little-
conditions. It is widely acknowledged in the field that field and Bekris (2018). Authors in Kleinbort et al.
this can be challenging, considering the complexity of (2018, 2019) have led to a deeper understanding of
3
83
Modeling, Identification and Control
the properties of asymptotically optimal kinodynamic The system is assumed to be Lipschitz continu-
planning with milder assumptions, which the proposed ous for both x and u such that; ∃Kx , Ku > 0,
method also takes advantage of. Most recently, authors ∀x0 , x1 ∈ X, u0 , u1 ∈ U ;
of Shome and Kavraki (2021) proposed a new approach
for kinodynamic motion planning using sampling tech-
niques to estimate the connectivity of high-dimensional ||f (x0 , u0 ) − f (x0 , u1 )|| ≤ Ku ||u0 − u1 ||
(2)
configuration spaces. The approach uses bundles of ||f (x0 , u0 ) − f (x0 , u1 )|| ≤ Kx ||x0 − x1 ||
kinodynamic edges to cover the state space before a
query arrives and is shown to be asymptotically opti- Definition 1. A valid kinodynamic trajectory π is
mal and to find high-quality solutions quickly in exper- produced by propagating system forwards (see Eq. 1)
imental validation. Upon analyzing the existing liter- starting from π(0) by applying control function Υ :
ature in sampling-based motion planning, a noticeable [0, tπ ] ⊂ U resulting in π : [0, tπ ] ⊂ F .
trend emerges: the increasing adoption of heuristics in Definition 2. A piecewise constant control func-
the most effective geometric motion planners. Yet, this tion Υ is composed of multiple constant control func-
trend has not been as prevalent in kinodynamic motion tions Υi , each defined over a specific interval of time
planning, likely due to its added complexity. Our work [0, ∆t], where each constant control function Υi maps
distinguishes itself from existing kinodynamic planners to a specific control value ui that belongs to set U . The
by placing a significant emphasis on the use of heuris- number of constant control functions is represented by
tics. This strategy is employed to rapidly explore the a natural number k ∈ N .
extensive state space. Once an initial solution is found, In optimal kinodynamic motion planning, the goal is
our approach shifts towards exploitation, which can to identify a control function Υ and a trajectory π that
also be construed as a heuristic-driven strategy. This lives in subset F . The planner should also minimize the
dual focus on exploration followed by exploitation, un- overall cost of the trajectory, which is calculated as the
derpinned by heuristic principles, sets our approach integral of a cost function g(π(t), Υ(t)) over the entire
apart in kinodynamic motion planning. duration of the trajectory, from time 0 to tπ . More
compactly the cost of π, g is denoted as COST (π).
3 Problem Setup
4 Approach
In the present work, we delve into the domain of kin-
Our algorithm is designed to identify an optimal mo-
odynamic motion planning, a detailed exposition of
tion plan for a specified start-goal pose, utilizing ran-
which is provided in Sec. 4. Kinodynamic motion plan-
dom forward propagation of the system’s states. To
ning is a specialized subset of motion planning that
achieve this, the algorithm accommodates input con-
intricately incorporates the dynamics of a system into
straints of the system, such as limits on velocity and ac-
the planning process. This includes considering con-
celeration. A prerequisite for the algorithm’s effective
straints imposed by the system’s velocity and acceler-
operation is the prior knowledge of the system model,
ation, as well as its equations of motion while charting
which enables the calculation of subsequent system
a trajectory from an initial state to a designated goal
states based on the current state and control inputs.
state. The primary objective of kinodynamic planning
This approach often necessitates navigating extensive
is to identify a path that is not only feasible but also
state spaces, presenting a significant challenge to the
optimal, adhering to both geometric and dynamic con-
algorithm’s time efficiency. In our novel approach, the
straints.
algorithm concurrently grows 2N trees, with N trees
We assume a d-dimensional smooth manifold for the
originating from the start pose and the remaining N
state space X. The goal region is assumed to be in
from the goal pose. Here, N represents the number of
the free region of state space XF such that XF ⊂ X,
threads, a configurable parameter in our method. This
xgoal ∈ XF , δgoal > 0 and Xgoal = Bδgoal (xgoal ). δgoal
design inherently renders our planner multi-threaded, a
denotes obstacle clearance and Bδgoal (xgoal ) is ball (Eu-
distinction that, to the best of our knowledge, is unique
clidean) centered at xgoal ∈ Rd with radius r. Since we in this field of study. The rationale for allocating an
consider a dynamic system, a control space is denoted equal number of trees to both the start and goal poses
with U ⊆ RD . The considered system evolves from the is to enhance the likelihood of discovering an initial
current state with given control inputs in the following solution faster. This strategy addresses the inherently
manner; stochastic nature of the process, stemming from the
non-deterministic control sampling, which significantly
˙ = f (x(t), u(t)), x(t) ∈ X, u(t) ∈ U
x(t) (1) influences the initial solution discovery. Upon finding
4
84
Atas et.al., “CostTrust: A Kinodynamic Motion Planner”
Figure 2: On the left side, we visualize the exploratory phase of the algorithm. Before finding an initial solution,
the algorithm employs a predominantly explorative heuristic alongside multiple bi-directional trees.
On the right side, following the discovery of an initial solution, all trees adapt to focus on the current
solution. They aim to enhance the solution’s cost-effectiveness, guided by the direction of the tree
that first identified the initial solution.
the initial solution, the algorithm enters a phase of con- ’start-goal’ are not used interchangeably in our ap-
tinuous exploitation and optimization of this solution. proach. Our bi-directional strategy means that for
This optimization process persists until a predefined some trees, the ’target’ vertex may actually be the
termination condition is satisfied. In the subsequent ’start’ vertex. Similarly, the ’root’ vertex could be syn-
subsections, we will elaborate on the specific subcom- onymous with the ’goal’ vertex in certain contexts. In
ponents of our planner, which we have named ’Cost- any given problem setup, while the ’start’ and ’goal’
Trust’. vertices remain consistent, the trees, depending on
their direction, may treat either of these vertices as
the ’root’ or ’target’.
4.1 Definitions
A tree structure is initiated from a root vertex and sys- 4.2 Selecting Frontier Vertexes
tematically extends outward. This expansion aims to
establish a linkage between the root and the designated After initializing a tree with the root vertex, we cre-
target vertex. Subsequently, we explain the concept of ate branches by applying random control samples to
a vertex, elaborating on its intrinsic properties and sig- the root vertex’s states. As the algorithm progresses,
nificance. the vertex count escalates, making selecting vertices
Vertex in Motion Planning: In the context of for further expansion increasingly critical. Our objec-
our motion planning algorithm, a vertex is conceptual- tive is to identify and select vertices most beneficial to
ized through a structure encapsulating key attributes rapid exploration initially. In this context, such ver-
essential to the planning process: tices are termed ’frontiers.’ Random control sampling
Our motion planning algorithm utilizes this struc- is applied to these frontiers, a process we will present
tured vertex representation for the search space; see in the following subsection.
Tab. 1. Furthermore, these vertices are organized In Fig. 3, we illustrate the opposing trees utilized
within a nearest-neighbor structure, facilitating effi- in our approach. The criteria for selecting frontier
cient query processing Sucan et al. (2012). vertices are based on several key properties: firstly,
It is crucial to note that the terms ’root-target’ and the number of branches in a vertex nb ; secondly, the
5
85
Modeling, Identification and Control
6
86
Atas et.al., “CostTrust: A Kinodynamic Motion Planner”
Figure 4: Initially, our algorithm initiates tree growth from both the start and goal vertices, as illustrated in
panels a., b., and c. Following discovering a solution, all trees and their respective threads shift focus
to exploit this initial solution. The objective is to enhance the solution, aligning with the direction
established by the tree that first identified the solution. For instance, as shown in panel f., a path
with a more favorable cost is achieved by strategically exploiting the current optimal solution. The
solution exploitation can run indefinitely.
Consequently, this status leads to its exclusion from Xs , while the blue tree originates from Xg , the goal
being selected in the subsequent iteration of frontier pose. It is important to note that, although these trees
identification. are independent, the multi-threaded architecture of our
After the successful expansion of a frontier, an ad- method enables their simultaneous growth, increasing
ditional critical evaluation is conducted. This assess- the efficiency dramatically as evidenced by findings in
ment determines whether the newly added vertex can Sec. 5. Upon the identification of an initial solution, all
be connected to the goal at a lower cost than that of trees are reinitialized to commence from the same root
the existing solution if one exists. If this condition is vertex as the tree that initially identified the solution.
met, the trees are reset, and the current best solution A detailed illustration of the planner’s progression is
is updated accordingly. provided in Fig. 4.
7
87
Modeling, Identification and Control
4.5 Algorithms
In this subsection, we delve into the core algorithms of
our approach, accompanied by a detailed discussion of Algorithm 2 Select Explorative Frontiers
the corresponding pseudocode. Require: max number, nn structure
Ensure: Selected set of frontier nodes
Algorithm 1 Solve Planning Problem 1: f rontier nodes ← listNodes(nn structure)
Require: ptc {Planner Termination Condition} 2: Remove blacklisted nodes from f rontier nodes
Ensure: PlannerStatus ∈ 3: for each node ∈ f rontier nodes do
{EXACT, APPROXIMATE, INVALID} 4: score[node] ← computeScore(node)
1: Initialize start state and goal state 5: end for
2: if isValid(start state) = FALSE ∨ 6: Sort f rontier nodes based on score
isValid(goal state) = FALSE then 7: f rontier nodes ←
3: return INVALID clip(f rontier nodes, max number)
4: end if 8: for each node ∈ f rontier nodes do
5: for t = 1 to numThreads do 9: density[node] ← computeDensity(node)
6: direction ← t%2 = 0 ? start to goal : 10: end for
goal to start 11: Sort f rontier nodes based on density
7: CALL Algorithm 2 12: f rontier nodes ←
8: CALL Algorithm 3 clip(f rontier nodes, max number)
9: end for
10: Wait for all threads to complete
11: bestP ath ← findBestPath()
12: if bestP ath ̸= NULL then
13: Update bestControlP ath and controlT rees
14: VisualizePaths Algorithm 3 Expand Frontiers
15: PlannerStatus ← EXACT or APPROXIMATE Require: f rontier nodes, num branch to extend,
16: end if nn structure, ptc, target property, path,
17: Clear data structures control paths vertices, exact solution f ound,
18: return PlannerStatus should stop exploration, current best path
Ensure: Expanded set of frontier nodes
In the proposed multi-threaded control planning 1: for each node ∈ f rontier nodes do
framework, three core algorithms interact to refine the 2: for i = 1 to num branch to extend do
solution path in a kinodynamic planning problem it- 3: if ptc = TRUE then
eratively. The primary algorithm, Solve Planning 4: return
Problem (Algorithm 1), orchestrates the overall plan- 5: end if
ning process. It initializes the planning scenario, in- 6: (new state, valid) ←
cluding the start and goal states, and then iteratively propagate(node, randomControl())
invokes parallel threads. Each thread independently 7: if valid then
executes a cycle of explorative frontier selection and 8: vertex ← createVertex(new state)
expansion, managed by Algorithms 2 and 3, respec- 9: vertex.cost ←
tively. computeCost(node, new state)
Select Explorative Frontiers (Algorithm 2) plays 10: if isCloseToGoal(vertex, target property)
a critical role in guiding the search process. Within then
each thread cycle, this algorithm selects a subset of 11: path ← updatePath(vertex)
frontier nodes from the rapidly-exploring tree. The se- 12: end if
lection is based on a scoring mechanism that considers 13: Add vertex to nn structure
both the cost associated with each node and the num- 14: end if
ber of existing branches, thereby balancing exploration 15: end for
and exploitation. The algorithm prioritizes nodes that 16: end for
present a beneficial trade-off between these two fac- 17: EvaluateBranches(f rontier nodes,
tors, ensuring an efficient yet thorough exploration of 18: current best path, should stop exploration)
the search space.
Expand Frontiers (Algorithm 3) is then called to
expand the selected frontiers. This algorithm gener-
8
88
Atas et.al., “CostTrust: A Kinodynamic Motion Planner”
ates new states by applying random controls to the This implies that these planners are designed to uti-
chosen frontier nodes, effectively growing the search lize the entirety of their allotted time to enhance their
tree. It ensures that new states are valid and checks solutions, provided they achieve a solution within this
for their proximity to the goal, updating the solution timeframe. To ensure a fair comparison, all planners
path whenever a more cost-effective route is discovered. are assigned a fixed time duration for operation. How-
Extending frontiers is pivotal in exploring unseen ar- ever, due to the significantly larger state space encoun-
eas of the state space and getting closer to an optimal tered in dynamic quadrotor planning, these planners
solution. are allocated four times the amount of time given for
Together, these algorithms enable a dynamic and planning with the Ackermann-steered robot. This ad-
adaptive approach to kinodynamic planning. By utiliz- justment acknowledges the increased complexity and
ing multiple threads, the Solve Planning Problem computational demands of the quadrotor’s state space.
algorithm ensures diverse search directions and a ro-
bust state space exploration. The interplay between 5.1 Ackermann-steered vehicle-like robot
Select Explorative Frontiers and Expand Fron-
tiers algorithms allows for a guided yet flexible search
strategy, continually adapting to the evolving topol-
ogy of the tree. This synergy effectively balances ex-
ploration and exploitation, significantly enhancing the
planner’s ability to find optimal or near-optimal solu-
tions for systems with complex kinodynamics.
5 Experimental Results
Our evaluation methodology encompassed two distinct
dynamic systems: an Ackermann-steered vehicle-like
Figure 5: The figure illustrates a sample from our
robot and a dynamic quadrotor with 15 state dimen-
benchmark results for kinodynamic motion
sions. Both systems included state and control com-
planning of an Ackermann-steered robot.
ponents, with the controls for each system being sam-
The motion plans generated by the various
pled within a range specified by the user. To assess
contestant planners are depicted using dis-
the efficacy of our proposed method in addressing kin-
tinct colors for clarity. Alongside each plan-
odynamic planning problems, we employed two maps
ner’s name, the corresponding path costs are
featuring a variety of objects. The map illustrated in
enumerated. Notably, our planner, Cost-
Fig. 2, for instance, spans an area of 18x6 meters.
Trust, achieves the lowest cost path, which
Our study included all prominent kinodynamic plan-
is marked with a green color.
ners available in the Open Motion Planning Library
(OMPL) to ensure a comprehensive evaluation. These
planners were compared against our proposed planner. The Ackermann steering system, often used in vehicle-
The ensuing sections detail the results, generated on a like robots, is characterized by a unique geometry that
laptop equipped with an Intel Core i7-6700HQ CPU, allows the front wheels to turn at different angles.
operating at 2600 MHz with eight cores, and supple- This system is ideal for ensuring that all wheels fol-
mented by 16GB of memory. low their respective circular paths during a turn, min-
Our primary metric for evaluation is the resulting imizing slippage. The typical equations governing an
path cost, quantified by the accumulative control du- Ackermann-steered vehicle are:
ration of the solutions generated by the planners. The
exact solution rate is also presented as a metric in the xnext = x + ∆t · v · cos(ψ + β)
benchmarks. Each planner is allowed an identical time ynext = y + ∆t · v · sin(ψ + β)
frame to produce its results. Given the larger state v
ψnext = ψ + ∆t · · sin(β)
and control dimensions of the Quadrotor compared to Lb (4)
the Ackerman robot, a different amount of time allo- vnext = v + ∆t · a
cation is employed for these two benchmarks. Specifi-
La
cally, planners are provided 10 seconds for the Acker- β = arctan · tan(df )
La + Lb
man robot and 40 seconds for the Quadrotor.
A majority of the baseline planners, including our In these equations: ∆t is the time step. x, y, ψ, v
own, fall under the category of optimizing planners. are the current state variables representing position,
9
89
Modeling, Identification and Control
The start and goal states are placed such that the
algorithm is forced to explore the state space since the
only connection point where the planners can connect
start-goal states is a gate at the bottom right of the
map. For the start-goal states depicted in Fig. 1 with (b)
3D axes, a benchmark is run 100 times. The results,
presented in Fig. 6, indicate that the proposed plan- Figure 6: In (a), we present the exact solve rate per-
ner’s solve rate for 100 problems is comparable to that centages for kinodynamic motion planning
of RRT*, both showing a solve rate of 89 % while of an Ackermann-steered vehicle-like robot.
outperforming all other baselines. However, the pro- In (b), we focus on the resulting path costs,
posed method’s efficiency lowers the resulting path cost taking into account only those paths derived
within the given time, depicted in the bottom image of from exact solutions. Our planner, Cost-
Fig. 6. Regarding the solve rate, the proposed planner Trust, demonstrates superior performance
failed to find a solution in 11 cases. In rare instances, over baseline planners in terms of path cost
the planner may struggle to establish a connection as efficiency. It achieves a comparable solve rate
the number of nodes in the tree increases, leading to to RRT*, registering an 89% success rate
longer execution times for each iteration. We believe in finding solutions while outperforming the
the cause of the missed cases could be attributed to the rest of the baselines.
dense tree structure utilized in the proposed planner.
Specifically, maintaining a dense tree structure may in-
crease the likelihood of prolonged exploration of some
narrow passages, potentially increasing the number of
missed cases. To address this issue, we have modi-
fied the implementation of our approach such that the
planner can be configured to maintain a sparse tree
structure similar to that of SST, which may reduce the
occurrence of missed cases.
10
90
Atas et.al., “CostTrust: A Kinodynamic Motion Planner”
5.2 Quadrotor
Figure 8: In (a), we present the exact solve rate per-
Sampling-based motion planning demonstrates marked centages, and in (b), resulting path costs for
effectiveness in higher-dimensional state spaces, where kinodynamic motion planning of a quadro-
discretization-based methods become computationally tor. Our planner, CostTrust, surpasses other
infeasible due to exponential computation cost escala- planners significantly in terms of the number
tion. To showcase the efficacy of our kinodynamic mo- of exact solutions achieved.
tion planner, we extended our benchmarking to include
kinodynamic planning for a quadrotor characterized by
15 state dimensions and six control dimensions. Con-
trol sampling for the quadrotor was confined within T = m · (g + az + Kp · (zd − z) + Kd · (żd − ż)),
a user-defined range, conforming to the design limita- (ax · sin(ψ) − ay · cos(ψ))
τϕ = Kϕ · −ϕ ,
tions of quadrotors. The forward propagation model g
employed for the quadrotor is detailed in Eq. 5.
(ax · cos(ψ) − ay · sin(ψ))
τθ = Kθ · −θ ,
In Eq. 5; The system’s state vector comprises 15 el- g
ements. The positions are denoted [x, y, z]. T is the τψ = Kψ · (ψd − ψ),
thrust. m is the mass. g is the gravitational accel-
eration. ax , ay , az are the accelerations in respective ϕ̇+ = τϕ · ∆t,
axes. Kp , Kd , Kϕ , Kθ , Kψ are proportional gains for z, θ̇+ = τθ · ∆t,
roll, pitch, and yaw. zd , ψd are the desired z position (5)
ψ̇+ = τψ · ∆t,
and yaw angle. z, ϕ, θ, ψ are the current z position
and orientation angles. R is the rotation matrix. ⃗a R = RotationMatrix(ϕ, θ, ψ),
is the acceleration vector. ⃗r¨, ⃗r˙, ⃗r are the acceleration, 0 0
velocity, and position vectors. ∆t is the time step. In ⃗a = R · 0 − 0 /m,
comparison, the control vector comprises six elements T m·g
denoted by [zd , żd , ψd , ax , ay , az ]. These control inputs ¨
⃗r = ⃗a,
correspond to the desired z position, z velocity, yaw
angle, and x, y, z accelerations, respectively. As the ⃗r˙ + = ⃗r¨ · ∆t,
system’s complexity increases, the performance dispar- ⃗r+ = ⃗r˙ · ∆t
ity between our proposed planner and other planners
becomes more pronounced, as illustrated in Fig. 8. The results unequivocally indicate that our pro-
11
91
Modeling, Identification and Control
posed planner achieves a considerably higher solve threaded bi-directional trees not only facilitates a
rate compared to baseline planners. In this analy- higher solution rate but also contributes to more nu-
sis, we exclusively consider exact solutions, disregard- anced and effective navigation through challenging
ing any approximate results produced by the plan- state spaces.
ners. Specifically, the solve rate achieved by our plan- In comparing the benchmark results for the two sys-
ner is 78%, markedly outperforming the next best- tems, namely the Ackermann-steered robot and the dy-
performing planner, RRT*, which registers a solve rate namic quadrotor, a notable observation emerges. De-
of 17%. The resulting path costs for all the plan- spite allocating four times more time for the quadro-
ners that are able to provide an exact solution seem tor, the solution rate for all planners, except ours, sig-
reasonably similar. Based on these findings, we draw nificantly diminishes. This phenomenon underscores
that our planner is particularly advantageous for high- the non-linear correlation between the state dimen-
dimensional systems requiring extensive sampling and sion complexity and the time required to find a solu-
rapid state space exploration to obtain a solution. For tion. The exceptional performance of our planner can
example, as illustrated in Fig. 8, existing planners ex- be attributed to its inherent exploratory instinct and
cept RRT* and SST cannot navigate the sole pas- the efficiency of its multi-threaded, bi-directional tree
sage (lower left) to the goal. As the number of sam- search approach. These characteristics are pivotal in
ples increases, it becomes increasingly time-consuming effectively navigating the complex state space of the
to identify the passage due to the growing number quadrotor, thereby facilitating a higher solution rate
of nodes in the tree structure. Our method’s bi- compared to other planners.
directional approach offers benefits in navigating sharp
While our proposed motion planner exhibits notable
turns, preventing tree growth from becoming trapped,
strengths, it is not without limitations, as reflected in
and prolonging exploration. The proposed planner
the solution rate results for both the Ackerman robot
achieves a threefold improvement in mean trajectory
and the quadrotor. In certain cases, the planner fails to
cost compared to the next best-performing planner,
derive an exact solution. This limitation is suspected
SST.
to stem from the planner’s inherent tendency towards
extensive exploration. Before identifying an initial so-
5.3 Discussion lution, the planner prefers areas with fewer branches
and lower vertex density. Consequently, in scenarios
The results distinctly demonstrate the advantages of
where numerous vertices are proximal to the goal yet
our newly introduced motion planner over current
remain unconnected to the tree, the planner may in-
state-of-the-art techniques, particularly in scenarios in-
advertently prioritize frontiers distant from the goal
volving a dynamic quadrotor characterized by high-
region. Over time, this reduces the likelihood of estab-
dimensional state and control spaces. The effectiveness
lishing a connection to the goal, as vertices near the
of our proposed planner can be attributed to several
goal are less frequently selected as frontiers.
pivotal characteristics:
Dynamic Balance between Exploration and Ex- However, this limitation could be mitigated by im-
ploitation: The planner adeptly navigates between ex- plementing goal biasing. Such an approach would in-
ploring new areas of the state space and exploiting volve incorporating a proportion of the nearest neigh-
known paths. This is achieved through the strategic bors of the goal at each iteration of frontier selection.
selection of frontier vertices within the tree, which are This strategy could enhance the planner’s ability to
instrumental in guiding the exploration process, cou- identify and prioritize goal-proximate vertices, thereby
pled with the continuous refinement of existing solu- improving the likelihood of establishing a connection
tions. to the goal.
Multi-threaded Bi-directional Tree Approach: A sig- We have made the source code for our proposed mo-
nificant strength of our planner is its implementation tion planner, as well as the benchmarking code, pub-
of multi-threaded, bi-directional trees. This architec- licly available as open-source resources. This initia-
ture has proven particularly beneficial in enhancing tive aims to facilitate further experimentation by re-
the solution rate for the dynamic quadrotor, outper- searchers and allows for the comparison of newly de-
forming traditional single-threaded or uni-directional veloped planners against our proposed method. From
approaches in terms of both efficiency and effective- a practical standpoint, users intending to deploy our
ness. motion planner (or other baseline methods) must pos-
These attributes underscore our planner’s robustness sess an understanding of the system model. Crucially,
and adaptability in complex motion planning scenar- they need to define a propagate function, which esti-
ios. The combination of an intelligent exploration- mates the next state of the system based on current
exploitation balance and the innovative use of multi- state parameters and control inputs. To assist users,
12
92
Atas et.al., “CostTrust: A Kinodynamic Motion Planner”
we provide exemplar codes for two system models: an Karaman, S. and Frazzoli, E. Sampling-based al-
Ackermann-steered robot and a quadrotor. Addition- gorithms for optimal motion planning. CoRR,
ally, for effective utilization of the sampling-based mo- 2011. abs/1105.1186. URL http://arxiv.org/abs/
tion planner, a collision-check function is required to 1105.1186.
determine whether a given state is in a collision or is
collision-free. Karaman, S. and Frazzoli, E. Sampling-based optimal
motion planning for non-holonomic dynamical sys-
tems. In 2013 IEEE International Conference on
6 Conclusion Robotics and Automation. pages 5041–5047, 2013.
doi:10.1109/ICRA.2013.6631297.
This paper introduces a novel approach to kinody-
Kavraki, L., Svestka, P., Latombe, J.-C., and Over-
namic planning, showcasing enhanced performance
mars, M. Probabilistic roadmaps for path plan-
compared to existing state-of-the-art methods, partic-
ning in high-dimensional configuration spaces. IEEE
ularly in high-dimensional contexts such as dynamic
Transactions on Robotics and Automation, 1996.
quadrotors. Distinctive in its strategy, our method
12(4):566–580. doi:10.1109/70.508439.
employs a balanced interplay between exploration and
exploitation, complemented by a multi-threaded, bi- Kleinbort, M., Solovey, K., Bonalli, R., Bekris, K. E.,
directional tree search process. This dual-pronged ap- and Halperin, D. RRT2.0 for fast and optimal kin-
proach yields significant performance improvements, as odynamic sampling-based motion planning. CoRR,
evidenced by our results in two benchmarks. The po- 2019. abs/1909.05569. URL http://arxiv.org/
tential of our method to address real-world problems abs/1909.05569.
in autonomous robotics is evident from the results. In
certain instances, our planner may not yield an exact Kleinbort, M., Solovey, K., Littlefield, Z., Bekris,
solution, a limitation primarily attributed to its de- K., and Halperin, D. Probabilistic com-
votion to fast exploration of unknown spaces initially. pleteness of rrt for geometric and kinodynamic
To address this issue, future iterations of our planner planning with forward propagation. IEEE
will explore the incorporation of goal-biasing sampling. Robotics and Automation Letters, 2018. PP:1–1.
This enhancement would adjust the frontier selection doi:10.1109/LRA.2018.2888947.
criteria to balance the exploration with a more delib-
erate effort to connect to the goal in each iteration. Lavalle, S. and Kuffner, J. Rapidly-exploring random
Such a modification aims to refine the planner’s effec- trees: Progress and prospects. Algorithmic and com-
tiveness, ensuring a more targeted approach towards putational robotics: New directions, 2000.
achieving the desired goal state.
LaValle, S. M. and James J. Kuffner, J. Randomized
kinodynamic planning. The International Journal
of Robotics Research, 2001. 20(5):378–400. URL
References https://doi.org/10.1177/02783640122067453,
doi:10.1177/02783640122067453.
Gammell, J. D., Barfoot, T. D., and Srinivasa,
S. S. Batch informed trees (bit*): Informed Li, Y., Littlefield, Z., and Bekris, K. E. Asymp-
asymptotically optimal anytime search. The In- totically optimal sampling-based kinodynamic plan-
ternational Journal of Robotics Research, 2020. ning. CoRR, 2014. abs/1407.2896. URL http:
39(5):543–567. URL https://doi.org/10.1177/ //arxiv.org/abs/1407.2896.
0278364919890396, doi:10.1177/0278364919890396.
Littlefield, Z. and Bekris, K. E. Efficient and
Hauser, K. and Zhou, Y. Asymptotically op- asymptotically optimal kinodynamic motion plan-
timal planning by feasible kinodynamic plan- ning via dominance-informed regions. In 2018
ning in a state–cost space. IEEE Trans- IEEE/RSJ International Conference on Intelligent
actions on Robotics, 2016. 32(6):1431–1443. Robots and Systems (IROS). pages 1–9, 2018.
doi:10.1109/TRO.2016.2602363. doi:10.1109/IROS.2018.8593672.
Hsu, D., Latombe, J.-C., and Motwani, R. Path plan- Luna, R., Şucan, I. A., Moll, M., and Kavraki, L. E.
ning in expansive configuration spaces. In Proceed- Anytime solution optimization for sampling-based
ings of International Conference on Robotics and motion planning. In 2013 IEEE International Con-
Automation, volume 3. pages 2719–2726 vol.3, 1997. ference on Robotics and Automation. pages 5068–
doi:10.1109/ROBOT.1997.619371. 5074, 2013. doi:10.1109/ICRA.2013.6631301.
13
93
Modeling, Identification and Control
Otte, M. and Correll, N. C-forest: Parallel short- Robotics and Automation (ICRA). pages 3191–3198,
est path planning with superlinear speedup. IEEE 2020a. doi:10.1109/ICRA40945.2020.9197338.
Transactions on Robotics, 2013. 29(3):798–806.
Strub, M. P. and Gammell, J. D. Advanced
doi:10.1109/TRO.2013.2240176.
bit* (abit*): Sampling-based planning with ad-
Perez, A., Platt, R., Konidaris, G., Kaelbling, L., and vanced graph-search techniques. 2020 IEEE
Lozano-Perez, T. Lqr-rrt*: Optimal sampling-based International Conference on Robotics and Au-
motion planning with automatically derived exten- tomation (ICRA), 2020b. URL http://dx.
sion heuristics. In 2012 IEEE International Confer- doi.org/10.1109/ICRA40945.2020.9196580,
ence on Robotics and Automation. pages 2537–2542, doi:10.1109/icra40945.2020.9196580.
2012. doi:10.1109/ICRA.2012.6225177. Sucan, I. A., Moll, M., and Kavraki, L. E.
The open motion planning library. IEEE
Shome, R. and Kavraki, L. E. Asymptotically opti- Robotics Automation Magazine, 2012. 19(4):72–82.
mal kinodynamic planning using bundles of edges. doi:10.1109/MRA.2012.2205651.
In 2021 IEEE International Conference on Robotics
and Automation (ICRA). pages 9988–9994, 2021. Xie, C., van den Berg, J., Patil, S., and
doi:10.1109/ICRA48506.2021.9560836. Abbeel, P. Toward asymptotically optimal mo-
tion planning for kinodynamic systems using a
Strub, M. P. and Gammell, J. D. Adaptively In- two-point boundary value problem solver. In
formed Trees (AIT*): Fast asymptotically optimal 2015 IEEE International Conference on Robotics
path planning through adaptive heuristics. In Pro- and Automation (ICRA). pages 4187–4194, 2015.
ceedings of the IEEE International Conference on doi:10.1109/ICRA.2015.7139776.
14
94
Paper V
95
Highlights
From Simulation to Field: Learning Terrain Traversability for Real-
World Deployment
Fetullah Atas, Grzegorz Cielniak, Lars Grimstad
97
From Simulation to Field: Learning Terrain
Traversability for Real-World Deployment
Fetullah Atasa , Grzegorz Cielniakb , Lars Grimstada
a
Faculty of Science and Technology, Ås, 1433, Norway
b
Lincoln Centre for Centre for Autonomous Systems, Isaac Newton
Building, Lincoln, LN6 7TS, Lincolnshire, UK
Abstract
The challenge of traversability estimation is a crucial aspect of autonomous
navigation in unstructured outdoor environments such as forests. It involves
determining whether certain areas are passable or risky for robots, taking
into account factors like terrain irregularities, slopes, and potential obsta-
cles. The majority of current methods for traversability estimation operate
on the assumption of an offline computation, overlooking the significant in-
fluence of the robot’s heading direction on accurate traversability estimates.
In this work, we introduce a deep neural network that uses detailed geometric
environmental data together with the robot’s recent movement characteris-
tics. This fusion enables the generation of robot direction awareness and
continuous traversability estimates, essential for enhancing robot autonomy
in challenging terrains like dense forests. To fulfill the data needs of the pro-
posed deep neural network, we present an innovative pipeline that utilizes
high-fidelity simulations for the automated generation and labeling of data,
dramatically reducing the need for exhaustive human involvement in data
collection and labeling.
The efficacy and significance of our approach are underscored by experi-
ments conducted on both simulated and real robotic platforms in various en-
vironments, yielding quantitatively superior performance results compared to
existing methods. Moreover, we demonstrate that our method, trained exclu-
sively in a high-fidelity simulated setting, can accurately predict traversabil-
98
ity in real-world applications without any real data collection. Our experi-
ments showcase the advantages of our method for optimizing path-planning
and exploration tasks within difficult outdoor environments, underscoring its
practicality for effective, real-world robotic navigation. In the spirit of col-
laborative advancement, we have made the code implementation available to
the public.
Keywords: Traversablity Estimation, Outdoor Robotics, Learning for
Navigation
1. Introduction
Field robotics often confronts challenges when operating within unstruc-
tured terrains as robustly determining the safe regions in such diverse fields
is a non-trivial problem. Such challenges arise in various applications, span-
ning mining, exploration, defense, agriculture, search and rescue, and beyond.
The essence of traversability analysis is to determine safe regions within the
environment robustly. Traversability assessment is a critical element for sev-
eral key operations of autonomous robots, such as formulation and refine-
ment of navigation plans. This study focuses on the robust traversability
assessment for robots in challenging outdoor settings, like forests or lightly
structured urban areas.
The task of evaluating traversability is inherently complex due to its de-
pendency on various factors, such as the robot’s design, the robot’s direction
(which continuously changes), and the characteristics of the terrain. Conse-
quently, a data-driven approach to traversability estimation requires careful
consideration. For example, accurate estimations for one set of robot spec-
ifications may become obsolete if there are alterations in the robot’s tire
dimensions or wheelbase. Acknowledging these dynamics, the methodology
for data-driven traversability must be adaptable rather than fixed to account
for potential changes in a robot’s capabilities.
Historically, researchers have used sensor modalities ranging from cameras
and LIDARs to proprioceptive sensing (e.g., IMU) to discriminate between
traversable and non-traversable regions. Each sensing technology presents
its own set of strengths and limitations. Our study uses a blend of LI-
DAR and IMU data to achieve an accurate, continuous value for deter-
mining traversability costs rather than a simple binary classification. This
blend, combined with an automated data generation and labeling strategy
99
that exploits the robot’s locomotion experience over the terrains, enables
our proposed neural network to generalize well to output traversability es-
timates that are aware of the robot’s angle of approach, which we term as
directionality-aware.
Wallin et al. [1] recently proposed a traversability estimation technique
using simulated interactions between robots and terrain to produce train-
ing data and labels autonomously. Their method employs a neural network
trained on a Digital Elevation Map (DEM), enabling robots to navigate a
simulated environment, collect data, and refine their learning—a process ver-
ified in real-world conditions.
Building upon this foundation, our research advances the state-of-the-art
by introducing a data-efficient neural network that directly processes dense
point cloud maps and IMU data. The advantage of using point clouds lies
in their ability to accurately represent any real-world structure, unlike the
DEM that the previous method [1] depended on. This enhancement paves
the way for more precise and versatile traversability assessments in robotic
applications, which we demonstrate in the Sec. 4
Our approach enhances the concept of automated data generation and
labeling by utilizing raw sensory data from LIDAR and IMU, facilitating a
seamless transition from simulated environments to real-world terrain. This
direct sensor data utilization addresses the constraints of DEM-based meth-
ods, which can be limited in overlapping surfaces. Due to their single-height
limitation per location, DEMs often overlook potentially navigable spaces be-
neath tree canopies in scenarios like dense forests. Our method, by contrast,
presents a more comprehensive solution for autonomous navigation by ac-
counting for the full spectrum of environmental features that may otherwise
be concealed or inaccurately represented in DEMs.
The concern regarding LIDAR’s sparse output (compared to an RGB
image) is a significant one, as it traditionally hinders the ability to gener-
ate granular traversability assessments. Addressing this issue, our technique
synthesizes a densified, precise local point cloud map through the integration
of an Extended Kalman Filter (EKF) state estimate and a GPU-powered It-
erative Closest Point (ICP) algorithm [2, 3]. This enriched, robot-centric
map is critical in our methodology’s training and inference phases, ensuring
that our system can make fine-grained, accurate traversability estimations
beyond mere binary classifications.
Our research introduces the following core contributions in the realm of
traversability estimation for autonomous robot navigation through challeng-
100
ing environments:
2. Related Work
This section provides an overview of seminal contributions on traversabil-
ity estimation, categorized according to their methodological taxonomy.
1
https://github.com/jediofgever/vox nav slam
2
https://github.com/jediofgever/traversablity estimation net
101
Prior to the advent of deep learning, traditional machine learning paradigms
were at the forefront of research for traversability estimation. Notable con-
tributions include the work of Angelova et al. [6], who introduced a decision
tree classifier that was informed by features extracted from RGB images.
This model utilized Speeded-Up Robust Features (SURF) descriptors and
incorporated an analysis of various color variations to enhance its prediction
performance. In a related development, Bajracharya et al. [7] presented a
hybrid approach that combined the strengths of Support Vector Machines
(SVM) and naive Bayesian classifiers. This combination sought to lever-
age the robustness of SVMs and the probabilistic insights offered by naive
Bayesian methods, thereby enhancing the traversability classifier’s perfor-
mance in complex environments.
Traditional classifiers predominantly depended on hand-crafted features
and descriptors. This reliance posed challenges in terms of adaptability.
However, their minimal data requirement was a silver lining compared to
deep learning approaches. Kelly [8] compared a feature-driven SVM classifier
against a Convolutional Neural Network (CNN) for terrain classification.
Their results underscored the superior efficacy of the CNN-based approach.
Recent research has increasingly embraced the advantages of deep learn-
ing. For instance, Leung et al. [9] employed a deep CNN for the semantic
segmentation of images, enhancing the analysis of terrain traversability. They
also utilized data from Digital Elevation Maps (DEM) to identify traversable
regions. In a related endeavor, Hosseinpoor et al. [10] developed a deep se-
mantic segmentation model that processed aerial RGB images emphasizing
varying robot modalities. However, the authors did not conduct experiments
using either simulated or actual robots.
Bekthi et al. [11] presented a 2D imagery technique for terrain uniformity
evaluation utilized for end-stage traversability assessment. Their methodol-
ogy integrated motion indicators from inertial sensors with textures captured
from RGB images. The combined features derived from both sources were
input into a Gaussian process predictor. However, the authors highlighted
the method’s constraints in vast non-uniform environments, primarily due to
significant acceleration readings from the inertial sensor.
Chavez-Garcia et al. [12] explored terrain traversability through a heightmap-
based classification approach. They sourced their training data from terrains
that were procedurally generated using Perlin noise. Although the method
presented several strengths, it predominantly categorized terrains as simply
traversable or non-traversable. This constraint rendered the approach less
102
proficient in recognizing terrains with specific nuanced features, like ramps
that are steep yet passable.
103
with our method, we incorporate inertial sensing and harness dense spatio-
temporal point cloud data processed by deep neural networks.
104
Guan et al. [22] introduced a method for traversability mapping utilizing
LIDAR point clouds combined with RGB images. This technique integrated
3D volumetric traversability metrics to develop a global map, primarily to
aid in the planning and navigation of an excavator. However, the authors
reported limited system testing due to safety concerns. It remains unclear
whether the method is apt for long-range navigation, as the paper does not
provide explicit details in this regard.
Readers are encouraged to consult the comprehensive survey paper by
Borges et al. [23]. This work provides an in-depth examination of various
traversability analysis methods, categorizing them based on their sensing
modalities and taxonomies.
Recent advancements in traversability estimation research increasingly
favor deep learning over traditional learning methods. While often requiring
less data, traditional techniques face challenges in terrains lacking distinct
features, as Kelly [8] points out. In response, our study introduces a deep
neural network architecture designed to determine continuous traversabil-
ity values across varied terrains. We utilize an automated data generation
method to address deep learning’s data inefficiency based on high-fidelity
simulation environments. Additionally, our approach uniquely integrates in-
ertial sensing with spatiotemporal LIDAR point clouds, generating a detailed
point cloud map to infer traversability estimates around the robot. These
traversability estimation maps enhance local motion planning and control,
ensuring safer navigation.
3. Approach
In this section, we provide an in-depth explanation of our methodology.
Our approach harnesses data from LIDAR and IMU sensors to generate
detailed, robot-centric traversability point cloud maps.
A framework of the proposed method is depicted in Fig. 1. Our sys-
tem encompasses several key elements: the TraverseNet, a sophisticated
LIDAR-Inertial neural network architecture, the auto-generation of a dataset
using high-fidelity simulations, and the construction of robot-centric point
cloud maps. The following sections will elaborate on each component of our
methodology, coupled with a precise problem statement to ensure clarity.
105
Figure 1: Comprehensive Traversability Estimation Pipeline: Utilizing sensor inputs and
high-fidelity simulations, data is processed via the EKF State Estimator and fed into Tra-
verseNet. This deep neural network, trained on autonomously generated datasets, outputs
precise traversability estimates vital for outdoor autonomous navigation components such
as planning, control, and costmap.
106
proposed neural network fθ , which aims to approximate the traversability
function T : fθ (S, C) ≈ T (S, C) The network’s parameters, θ, are optimized
to minimize the discrepancy between the network’s output and the actual
traversability scores derived from simulations. This optimization enhances
the robot’s ability to interpret its environment and make informed navigation
decisions respecting the terrain traversability.
10
107
uphill versus downhill, often simplifying traversability into a static measure.
This approach neglects the reality that climbing usually requires more power.
Our model advances beyond this limitation by incorporating the robot’s
angle of approach into traversability calculations, enabling a dynamic eval-
uation that aligns with the practical challenges encountered across diverse
landscapes. This enhancement is achieved through the data labeling tech-
nique rooted in the robot’s extensive locomotion experience, reflecting the
varied demands of different terrain types. The subsequent subsection delves
into the details of our locomotion-based data labeling method, shedding light
on the way a robot’s interaction with its surroundings shapes the nuanced
traversability estimates our model provides.
Figure 2: Fundamental TraverseNet Design: This illustration presents the core architec-
ture, utilizing point coordinates data for feature extraction through convolutional and fully
connected layers, subsequently integrated with a 13-dimensional IMU feature vector.
11
108
Figure 3: Optimal TraverseNet Configuration: Detailed in Section Sec. 4, the architecture
utilizing point coordinates (x, y, z) and point curvature inputs exhibits notable perfor-
mance. Unlike design with direct IMU fusion depicted in Fig. 2, this approach processes
IMU data through additional feature extraction before concatenating it with LIDAR fea-
tures in the final layer.
12
109
ology, we treat terrain traversability as a regression problem. Therefore, for
each data sample, the neural network requires both the sample and its asso-
ciated traversability cost label. We utilize the robot’s experience across the
terrain to fulfill this demand.
With specific physical attributes such as wheel radius and wheelbase,
and a defined fixed speed, one can anticipate the nominal distance a robot is
expected to traverse within a set time frame. Yet, challenging terrains, like
steep slopes or obstacles, can cause deviations in the actual traveled distance
from this estimation. Using the difference between the expected and actual
distances within a windowed time period, we can obtain a robust, continuous
traversability cost label. For example, while descending a slope, gravity might
enable the robot to cover more distance than predicted. Conversely, if the
robot encounters an obstacle like a rock and gets stuck, the traversability
cost at that region might peak at 1.0, its maximum value. The illustration
in Fig. 4 provides a visual representation of this concept, highlighting the
disparity between predicted and actual distances.
Additionally, our approach factors in past IMU readings, specifically ex-
tracting the acceleration covariance, which serves as neural network input.
The nominal distance dn is computed as:
13
110
integrates IMU data, we include the acceleration covariance from the preced-
ing t time and roll, pitch, and yaw angles in the inertial frame. To simplify
input for the neural network, orientation is represented using quaternions.
Our IMU feature vector is 13-dimensional; 9 elements originate from the flat-
tened acceleration covariance matrix Q, while the remaining four belong to
the quaternion vector H.
14
111
construct this environment in the Unity Editor using C# scripts, leveraging
elevation data, road structures, and other map objects from the Open Street
Map (OSM) dataset. It, too covers a 1km2 area.
These detailed environments ensure we gather comprehensive data for
both urban and outdoor settings.
The actual data and label collection proceeds in two distinct phases:
First, we deploy multiple robots for efficiency. Each robot is equipped
with LIDAR, IMU, Camera, and GNSS sensors, with their data relayed to
Robot Operating System (ROS) [27] topics.
Leveraging the gathered sensory data, we implement an Extended Kalman
Filter (EKF) as a state estimator, enabling us to generate a robot-centric lo-
cal point cloud map accurately. This process is elaborated in the subsequent
subsection. Using the robot’s tracked locomotion quality as a basis for the
traversability indicator, we determine the traversability cost label crop points
directly below the robot and ascertain the IMU covariance matrix. In the
procedure, every sample is stored on the disk with a timestamp, embedding
the traversability cost label with three-decimal precision in addition to saving
the disk text file including a 13-dimensional IMU feature vector.
Operating multiple robots simultaneously is critical in our data collection
methodology. Our experimental setup employs a gaming computer with 32
GB RAM and an NVIDIA 2060 4GB GPU. This configuration permits the
concurrent operation of up to 10 robots without any detrimental impact on
the frame rate. It is important to recognize that this figure is inherently tied
to the computational capabilities of our specific setup, particularly regarding
the demands of simulating a 64-ring LIDAR sensor for each robot.
The automated data generation steps are vividly depicted in Fig. 5, where
the anticipatory behaviors of the robots within the simulation are illustrated.
Furthermore, Fig. 6 expands on this by showcasing the multi-robot data col-
lection process in a life-like simulated environment. These visual represen-
tations provide a clear understanding of the sophisticated data generation
pipeline that underpins our research.
15
112
Figure 6: The image depicts three robots (highlighted within yellow circles) navigating
through a simulated high-fidelity forest environment and collecting data alone. The LIDAR
point clouds of each robot are also depicted in distinct colors. It is important to note that
these robots operate concurrently and can physically interact, with potential collisions and
the possibility of one robot’s body being detected by the sensors of another. The robots
are spawned at random locations with random orientations but move with a fixed linear
speed of 1 meter per second throughout each episode.
cost of an area with a continuous value, a single scan may fall short. That’s
why we use point clouds that accumulate data over a windowed time, moving
along with the robot, making them robot-centric.
We employ the Iterative Closest Point (ICP) algorithm for generating
dense point clouds, benefiting from its efficient GPU performance [2]. This
method not only improves the accuracy of the robot’s state estimation but
also effectively tracks movement on steep slopes, creating detailed local point
cloud maps. Our approach is also compatible with established SLAM tech-
niques like LIO-SAM (Shan et al. [28]). However, while these techniques are
reliable in many scenarios, they face challenges on highly uneven terrains.
For instance, the reliance on IMU pre-integration can lead to misinterpre-
tation of gravitational forces as movement when a robot is stationary on a
slope, which is why we did not prioritize this approach.
Loop closures, which help correct the robot’s trajectory and avoid drift in
traditional SLAM systems, are not crucial for our purpose. This is because
our robots generally move in a consistent direction until an obstacle stops
them or a set time limit is reached (in the data collection phase). For data
collection, we typically let each robot episode run up to a maximum of three
minutes to ensure a balanced dataset.
16
113
4. Experiments
4.1. Platform, Sensors, and Setup
The real robot platform we use for real-world experiments is a mid-sized
AGV from Robotnik with active-front-and-rear-steering, see Fig. 7. To facil-
itate data collection, we constructed a simulated skid-steered replica of this
robot within the Unity game engine. Details of our simulated robot are as
follows:
The replica model ”Dobbie” is designed to navigate uneven terrains. It is
equipped with a 64-ring Ouster LIDAR and an XSENS IMU. The platforms
feature additional sensors, including cameras and a GNSS system, though
these were not leveraged in developing our traversability estimation frame-
work. While the steering kinematics are not identical, they possess same
sensor suite with comparable geometric dimensions and ground clearance.
Figure 7: Robotic platforms used for experimentation. The red-bordered images depict the
simulated robot model, whereas the green-bordered images provide multiple perspectives
of the actual Robotnik AGV platform. For our method and experiments, the primary
sensors employed are 3D LIDAR and IMU; other sensors on the platform were not directly
utilized for traversability estimation.
17
114
4.2. The Dataset and Data Augmentation
Our dataset was generated through an automated approach; we created
a series of C# scripts to achieve this. We first position robots at random
coordinates within a 1km x 1km environment in Unity. Each robot was
oriented randomly and maintained a fixed linear speed of 1 meter per second.
The movement continued for either a maximum duration of three minutes
or until the robot became immobilized due to obstacles or tipping incidents.
We achieved these programmatically by observing the ground truth pose of
robots accessible within the simulation.
Throughout the robot’s trajectory, our scripts captured snapshots. These
comprised a cropped point cloud segment, highlighting the robot’s expanded
footprint (1.0, 0.67, 1.0) along the x, y, and z dimensions. Accompanying this
was the IMU acceleration covariance based on the previous three seconds and
the most recent IMU orientation within the inertial frame. Thus, each sample
incorporated a point cloud segment of the robot’s footprint, the IMU’s 3 x 3
covariance matrix, and a quaternion representation of the inertial orientation.
The IMU data was simulated based on the rigid body acceleration of the
robot.
To achieve a comprehensive representation of various terrain situations,
especially those involving non-traversable regions that are inherently difficult
to access, we employed manual teleoperation. This approach facilitated the
navigation of robots into such challenging areas. Subsequently, a ROS service
was utilized to initiate the capture of samples in these specific locations,
ensuring a balanced dataset.
To augment the diversity of our dataset, we employed two techniques
specifically for augmenting point cloud samples while leaving the IMU data
untouched. First, we downsampled the point cloud using a random voxel size
selected from the range of 0.05 to 0.25. Second, we introduced zero-mean
Gaussian noise with a standard deviation of 0.05 to the point cloud sam-
ples. We randomly selected augmentation samples, constituting half of the
entire training set. Consequently, post-augmentation, the dataset expanded
by a factor of 1.5X. All point cloud samples were subsequently normalized
to fit within a metric sphere, while the IMU data remained without any nor-
malization as the 13-dimensional IMU features vector ended up having an
intrinsically structured range.
18
115
Features Final Loss Train MAE Test MAE
XYZ 4.1 0.05 0.030
XYZ + N. 4.0 0.05 0.035
XYZ + C. 4.3 0.07 0.033
D-F IMU + XYZ + N. 4.2 0.08 0.032
M-F IMU + XYZ 3.7 0.05 0.032
M-F IMU + XYZ + C. 3.8 0.07 0.024
M-F IMU + XYZ + N. 3.5 0.06 0.034
Table 1: Training results on automatically generated traversability dataset, the results for
different feature combinations and IMU data fusion strategies are presented. The train
MAE numbers are peak values during the training. The Test MAE values were retrieved
after 300 epochs of training.
19
116
Mid-Fusion (M-F) (see, Fig. 3), we first extract features from the IMU data
using fully connected layers, then fuse these processed features for subsequent
prediction tasks.
Discussion: Analyzing the presented results, it is evident that the model
”M-F IMU + XYZ + C” stands out with the final loss of 3.8 and the lowest
end MAE of 0.024, making it the most accurate by the conclusion of the
training. Interestingly, its performance surpasses even its more feature-rich
counterparts, such as ”M-F IMU + XYZ + N.” This suggests that integrating
additional features did not always equate to enhanced performance. For
instance, incorporating normals (3-dimensional feature) to the base ”XYZ”
model results in only marginal improvements or, in some cases, even slight
regressions in metrics. The optimal balance between model complexity and
performance appears to be achieved with the ”M-F IMU + XYZ + C” model,
making it a promising choice for the rest of the experiments. Furthermore,
the application of Mid-Fusion—entailing additional feature extraction from
IMU data prior to concatenation with LIDAR features—appears to enhance
the model’s performance.
We experimented with L1 and L2 loss functions during the neural network
training phase. We observed that L1 loss facilitated better convergence and
exhibited greater robustness to outliers within the dataset. For optimization,
we utilized the ADAM [29] algorithm, with a learning rate of 0.001.
20
117
trained with footprint dimensions of (1.0, 0.67, 1.0) for the x, y, and z axes,
respectively. Even though optimal results are anticipated when using this
specific footprint size, our method remains adept at predicting traversability
estimates with changing footprint sizes (given that the ratio of dimensions
stays similar). This robustness is attributed to normalizing points within the
robot’s footprint to a unit sphere during the training and inference stages.
In Fig. 10(a), we demonstrate the ability to retrieve traversability estimates
even with a reduced footprint size of (0.25, 0.16, 0.25). This capability is at-
tributed to our method of aggregating the spatiotemporal LIDAR scan into
a dense, robot-centric map that rolls with the robot.
21
118
(a)
(b)
(c)
Figure 8: The figure illustrates traversability estimations in two distinct environments: a
bridge and a random forest patch. In part (a), the robot’s local vicinity is discretized to fit
within its footprint. In part (b), while the environment is again discretized based on the
robot’s footprint, the inference is conducted on a finer grid resolution. This approach fuses
multiple traversability estimations, yielding results that are more accurate and smoother
in their presentation. In (c), blue indicates areas that are easily traversable, green suggests
moderate challenges, and red denotes difficult or impassable regions, the range within (0,
1).
In Fig. 12, we detail the outcomes from the neural network using a fine-
grained step size configuration, as depicted on the right side of Fig. 9. The
results for varying robot footprint sizes reveal that smaller footprints make
the method less assertive in pinpointing high-cost areas, possibly due to the
limited point information and variation in these smaller regions. As the
22
119
footprint size expands, the method exhibits heightened confidence in distin-
guishing clearly non-traversable zones, particularly evident in Fig. 12(d).
Figure 9: Two distinct strategies were employed for inferring traversability estimates using
the neural network. In the left setup, the environment is discretized based on the robot’s
footprint size. Conversely, in the right setup, the environment is segmented using the
robot’s footprint but with a notably smaller step size, resulting in overlapping regions.
These overlaps allow for multiple predictions at the same location, enhancing the accuracy
and smoothness of the estimates, albeit at the cost of increased computation time.
23
120
(a) (b)
(c) (d)
Figure 10: Figures display traversability estimates for robot footprints with a consistent
x:y:z ratio as the original footprint of (1.0, 0.67, 1.0). The x dimensions are 0.25 in (a),
0.5 in (b), 0.75 in (c), and 1.0 in (d).
121
(a) (b)
(c) (d)
Figure 12: Figures show the effect of robot footprint dimensions on traversability, with x
values of 0.25, 0.5, 0.75, and 1.0 in (a) through (d) and a consistent x:y:z ratio. Differing
from Fig. 10, an overlapping box approach is used, refining the estimate precision.
25
122
Method Data Input Formulation Metric Performance Directionality
Our P. Clouds Regression MAE 0.024-0.035 Y
Wallin et al. [1] DEM Regression MAE ≈0.2 Y
Xue et al. [18] P. Clouds Classification Precision 97-99% N
Agishev et al. [13] P. Clouds Classification - - N
26
123
Figure 13: The figures showcase the efficacy of our traversability estimation method when
applied to an external forest dataset provided by Agishev et al. [13]. Our approach yields
enhanced accuracy even without IMU data integration. This improvement is especially
evident in resolving coarse binary classifications of objects like tree trunks and pedestrians,
where our method offers more accurate traversability assessments. Visual comparisons are
provided by highlighting our method’s estimates in blue boxes, contrasting with the orange
boxes used for Agishev et al.’s estimates.
27
124
worthy, it is important to highlight that their method views traversability in
simpler terms: it is either traversable or not. Moreover, they do not take into
account the robot’s direction. This distinction is vital; for example, the chal-
lenge of moving down a slope with a 10% grade is different from moving up
the same slope. The reported performance values are derived directly from
the authors’ conclusions as stated in their respective sections. Each method
underwent evaluation on its own distinct dataset; the absence of accompa-
nying implementations and different sensor modalities constituted a barrier
to the replication of the reported performance outcomes.
Upon analyzing the results from the external dataset by [13], our method
demonstrates a more cautious and detailed performance. For example, re-
gions at the base of tree trunks and portions of the pedestrian point clouds,
which were often marked as traversable by [13], were correctly identified as
non-traversable by our method. It is noteworthy, however, that our method’s
conservative bias may lead to mistakenly categorizing steep slopes as non-
traversable, as depicted by the images in Fig. 13.
28
125
Figure 14: The traversability estimates are merged with the point cloud map generated
using LIO-SAM [28], a 3D SLAM approach. The method accurately differentiates between
solid obstacles like trees and buildings. Additionally, it designates higher traversability
costs to areas with inclines or gravel, as indicated by the green patches). The integrated
traversability map is overlayed onto Google Earth’s satellite imagery for reference.
Figure 16: Snapshots from various moments during an outdoor experiment. As in Fig. 15,
the image-point pairs are provided where RGB colors of points indicate traversability
costs. The figure also demonstrates the method’s directionality awareness, assigning higher
traversability costs and a larger area of concern when approaching an uphill sidewalk bump
(b), as opposed to a less conservative estimate when descending from the same sidewalk
(c). The relevant regions have been marked with yellow contours in images (b) and (c) for
clarity.
29
126
Figure 15: A sequence showcasing the method applied to a real scene, featuring both
image-point cloud views. The top images overlay the traversability point cloud map on
actual images for visual validation. The bottom images present 3D visualizations of the
traversability estimation within the point cloud, where traversability costs are embedded
in RGB values.
30
127
4.8. Path Planning and Navigation Experiments
Figure 17: Snapshots capture two instances where the point cloud-based path planner
by Atas et al. [30] utilizes our proposed traversability mapping to generate collision-free,
cost-effective paths. The traversability map with the optimal path in yellow is shown on
the left. At the same time, the right showcases the high-fidelity simulated environment
and the corresponding robot positions for the scene.
31
128
Figure 18: Similar to Fig. 17, the left side displays the traversability cost map with
the optimal path highlighted in yellow, and the right depicts the high-fidelity simulated
environment and the robot.
5. Concluding Remarks
This paper introduces a novel neural network architecture to predict con-
tinuous traversability values using high-resolution, robot-centric point cloud
32
129
maps. Leveraging deep neural networks, our method circumvents the tedious
process of manual data labeling, as both data and labels are autonomously
generated within a high-fidelity simulation environment.
Our extensive experiments validate the robust performance of our pro-
posed method, exhibiting highly accurate predictions on our collected valida-
tion and third-party datasets. Furthermore, the pre-trained neural network
model was directly deployed on an actual robot platform, providing satis-
factory traversability estimates even without integrating any real-world data
samples.
The practicality of our approach is further demonstrated through its ap-
plication in 3D path planning algorithms that operate directly on point
clouds. In such applications, our method consistently enabled planners to
produce collision-free paths, proving critical for safe navigation and explo-
ration within complex outdoor environments.
We believe our work pioneers a new approach to handling the challenges
of data generation through realistic simulations and employing a deep neural
network tailored to interpret dense point cloud maps directly. While our
method adeptly utilizes spatiotemporal robot-centric point cloud maps, it
has limitations. Dynamic objects like pedestrians may become embedded
within the map, leading to potential prediction inaccuracies.
Looking ahead, we aim to refine our system to produce point-wise con-
tinuous traversability estimates rather than relying on a discretized robot
footprint box. This enhancement will help to preserve information integrity
and provide even more precise traversability estimates for autonomous robot
navigation.
33
130
Statement: During the parts of the preparation of this work, the au-
thor(s) used ChatGPT and Grammarly in order to improve the language
and readability. After using this tool/service, the author(s) reviewed and
edited the content as needed and take(s) full responsibility for the content of
the publication.
References
[1] E. Wallin, V. Wiberg, F. Vesterlund, J. Holmgren, H. J.
Persson, M. Servin, Learning multiobjective rough terrain
traversability, Journal of Terramechanics 102 (2022) 17–26.
doi:https://doi.org/10.1016/j.jterra.2022.04.002.
URL https://www.sciencedirect.com/science/article/pii/
S0022489822000313
[2] K. Koide, M. Yokozuka, S. Oishi, A. Banno, Voxelized gicp for fast and
accurate 3d point cloud registration, in: 2021 IEEE International Con-
ference on Robotics and Automation (ICRA), 2021, pp. 11054–11059.
doi:10.1109/ICRA48506.2021.9560835.
34
131
2007 IEEE Conference on Computer Vision and Pattern Recognition,
2007, pp. 1–8. doi:10.1109/CVPR.2007.383024.
[7] M. Bajracharya, B. Tang, A. Howard, M. Turmon, L. Matthies, Learning
long-range terrain classification for autonomous navigation, in: 2008
IEEE International Conference on Robotics and Automation, 2008, pp.
4018–4024. doi:10.1109/ROBOT.2008.4543828.
[8] K. Shen, M. F. Kelly, Terrain classification for off-road driving cs-229
final report, 2017.
URL https://api.semanticscholar.org/CorpusID:30413317
[9] T. H. Y. Leung, D. Ignatyev, A. Zolotas, Hybrid terrain traversability
analysis in off-road environments, in: 2022 8th International Conference
on Automation, Robotics and Applications (ICARA), 2022, pp. 50–56.
doi:10.1109/ICARA55094.2022.9738557.
[10] S. Hosseinpoor, J. Torresen, M. Mantelli, D. Pitto, M. Kolberg, R. Maf-
fei, E. Prestes, Traversability analysis by semantic terrain segmenta-
tion for mobile robots, in: 2021 IEEE 17th International Conference
on Automation Science and Engineering (CASE), 2021, pp. 1407–1413.
doi:10.1109/CASE49439.2021.9551629.
[11] M. A. Bekhti, Y. Kobayashi, Regressed terrain traversability cost for
autonomous navigation based on image textures, Applied Sciences 10 (4)
(2020). doi:10.3390/app10041195.
URL https://www.mdpi.com/2076-3417/10/4/1195
[12] R. O. Chavez-Garcia, J. Guzzi, L. M. Gambardella, A. Giusti, Learning
ground traversability from simulations, CoRR abs/1709.05368 (2017).
arXiv:1709.05368.
URL http://arxiv.org/abs/1709.05368
[13] R. Agishev, T. Petřı́ček, K. Zimmermann, Trajectory optimization us-
ing learned robot-terrain interaction model in exploration of large sub-
terranean environments, IEEE Robotics and Automation Letters 7 (2)
(2022) 3365–3371. doi:10.1109/LRA.2022.3147332.
[14] G. G. Waibel, T. Löw, M. Nass, D. Howard, T. Bandyopadhyay,
P. V. K. Borges, How rough is the path? terrain traversability es-
timation for local and global path planning, IEEE Transactions on
35
132
Intelligent Transportation Systems 23 (9) (2022) 16462–16473. doi:
10.1109/TITS.2022.3150328.
36
133
URL https://www.frontiersin.org/articles/10.3389/frobt.
2022.887910
[22] T. Guan, Z. He, D. Manocha, L. Zhang, Ttm: Terrain traversability
mapping for autonomous excavator navigation in unstructured environ-
ments, ArXiv abs/2109.06250 (2021).
URL https://api.semanticscholar.org/CorpusID:237503221
[23] P. Borges, T. Peynot, S. Liang, B. Arain, M. Wildie, M. Minareci,
S. Lichman, G. Samvedi, I. Sa, N. Hudson, et al., A survey on ter-
rain traversability analysis for autonomous ground vehicles: Methods,
sensors, and challenges, Field Robot 2 (1) (2022) 1567–1627.
[24] C. R. Qi, H. Su, K. Mo, L. J. Guibas, Pointnet: Deep learning
on point sets for 3d classification and segmentation, arXiv preprint
arXiv:1612.00593 (2016).
[25] S. Shi, X. Wang, H. Li, Pointrcnn: 3d object proposal generation and
detection from point cloud, in: The IEEE Conference on Computer
Vision and Pattern Recognition (CVPR), 2019.
[26] C. R. Qi, L. Yi, H. Su, L. J. Guibas, Pointnet++: Deep hierarchi-
cal feature learning on point sets in a metric space, arXiv preprint
arXiv:1706.02413 (2017).
[27] M. Quigley, K. Conley, B. Gerkey, J. Faust, T. Foote, J. Leibs,
R. Wheeler, A. Ng, Ros: an open-source robot operating system, Vol. 3,
2009.
[28] T. Shan, B. Englot, D. Meyers, W. Wang, C. Ratti, R. Daniela, Lio-sam:
Tightly-coupled lidar inertial odometry via smoothing and mapping, in:
IEEE/RSJ International Conference on Intelligent Robots and Systems
(IROS), IEEE, 2020, pp. 5135–5142.
[29] D. P. Kingma, J. Ba, Adam: A method for stochastic optimization,
CoRR abs/1412.6980 (2014).
URL https://api.semanticscholar.org/CorpusID:6628106
[30] F. Atas, G. Cielniak, L. Grimstad, Navigating in 3d uneven environ-
ments through supervoxels and nonlinear mpc, in: 2023 European Con-
ference on Mobile Robots (ECMR), 2023, pp. 1–8. doi:10.1109/
ECMR59166.2023.10256342.
37
134
Paper VI
VI
135
Enabling Robot Autonomy through a Modular Software Framework
Fetullah Atas 1 Grzegorz Cielniak 2 Lars Grimstad 1
137
II. A PPROACH
The essential elements of the proposed method are presented
in Fig. 2. The fundamental external libraries employed in
our framework include OMPL [13], robot localization [8],
LIO-SAM [12], and Casadi [1]. The three server nodes,
namely map server, planner server, and control server, offer
a plugin interface. This strategy facilitates the integration
of numerous planning and control plugins into the system, Fig. 2: The figure showcases the range of terrains on which
without necessitating any code modification. Instead, a YAML the framework was evaluated. Specifically, the left image
configuration file can be employed for different plugin depicts a robot operating on snowy, uneven terrain, the middle
integration. image shows a robot operating in an open field, and the right
image demonstrates a robot autonomously navigating in the
A. Plugin Specifications strawberry polytunnel.
At the time of writing, the system incorporates 2 map
plugins, 3 planner plugins, and 3 controller plugins. To
illustrate how our approach can be tailored to specific instance, the Acado implementation boasts an exceptionally
requirements, we will focus on one plugin from each of the fast control step with kHz frequency updates. In contrast, the
three categories. Casadi implementation generates smoother trajectories at low
1) Point Cloud Map Plugin: For an outdoor robot operating speeds, demonstrating the flexibility of the approach.
in uneven terrains, a 3D point cloud map serves as an effective
tool for representing the environment [2]. The objective of this III. E XPERIMENTS
plugin is to furnish a map that incorporates semantic data, such
The proposed framework was evaluated in both simulated
as whether a particular region of the environment is traversable
and real-world settings, using an agricultural robot platform
or not. The plugin offers the map and semantic data to theknown as Thorvald II [3]. In comparison to Navigation
planner or controller plugins through ROS servers and clients.
2 [6], a widely used navigation framework that is built
2) Planner Plugin: The planner plugins are classified based
on top of a well-established ROS 1 navigation stack [7],
on the state space in which the robot operates, such as SE2
our approach demonstrates superior performance in modeling
or SE3. Various parameters can be specified for the plugins,
the unstructured nature of outdoor scenes, especially in
including the collision body dimensions of the robot, state
terrains with steep inclinations. While Navigation2 is primarily
boundaries, and actual planning algorithms. For example, adesigned for indoor environments, it can also be applied to
configuration for a mobile robot operating in flat terrain may
outdoor scenes. Notably, both frameworks exhibit satisfactory
include the following specifications: performance in even terrain patches, but our framework
planner server node: outperforms Navigation2 in uneven terrains.
planner plugin: "SE2Planner" # or "SE3Planner", etc.
expected planner frequency: 1.0
The present approach was also tested for polytunnel
planning algorithm: "RRTStar" # or other OMPL planners navigation for an agricultural robot. Navigation within
planner timeout: 35.0 polytunnels poses a considerable challenge due to the absence
robot body dimens:
x: 1.2
of GPS signals, necessitating precise localization accuracy to
y: 1.2 prevent the robot from colliding with fruit rows. With the
z: 0.8 aid of a point cloud map, the proposed framework permits
SE2Planner:
plugin: "vox_nav_planning::SE2Planner"
LIDAR-based localizers to employ the Iterative Closest Point
se2 space: "REEDS" # or "DUBINS" (ICP) and Normal Distribution Transform (NDT) point cloud
rho: 2.5 # Curve radius for reeds and dubins spaces matching algorithms to enhance localization accuracy.
state space boundries:
minx: -100.0
maxx: 100.0 IV. C ONCLUSION
miny: -100.0
maxy: 100.0 We have presented a new software framework for robots
minyaw: -3.14 that can easily adapt to various environments and robot
maxyaw: 3.14
types. Unlike existing frameworks, our approach allows for
modifying the environment representation without requiring
3) Controller Plugin: The proposed framework is designed any adjustments to motion planners. Therefore, the proposed
specifically for mobile robots, and thus, the available controller framework is suitable for mobile robots, UAVs, and other
plugins are model-based controllers, such as MPC, tailored robots that operate under diverse kinematic constraints.
for mobile robot control. To ensure versatility, we have Our framework has been evaluated in both simulated and
incorporated two different MPC controller implementations real-world settings. However, further research is needed
using distinct optimization libraries, namely Casadi [1] and to enhance its reliability, particularly in dynamic urban
Acado [4]. Each implementation offers unique advantages. For environments.
138
R EFERENCES
[1] J. A. E. Andersson, J. Gillis, G. Horn, J. B. Rawlings, and M. Diehl.
CasADi – A software framework for nonlinear optimization and optimal
control. Mathematical Programming Computation, 11(1):1–36, 2019.
[2] F. Atas, G. Cielniak, and L. Grimstad. Elevation state-space:
Surfel-based navigation in uneven environments for mobile robots. In
To appear in 2022 IEEE/RSJ International Conference on Intelligent
Robots and Systems (IROS), 2022.
[3] L. Grimstad and P. J. From. Thorvald ii - a modular and re-configurable
agricultural robot. IFAC-PapersOnLine, 50(1):4588–4593, 2017. 20th
IFAC World Congress.
[4] B. Houska, H. Ferreau, and M. Diehl. An Auto-Generated Real-Time
Iteration Algorithm for Nonlinear MPC in the Microsecond Range.
Automatica, 47(10):2279–2285, 2011.
[5] S. Macenski, T. Foote, B. Gerkey, C. Lalancette, and W. Woodall. Robot
operating system 2: Design, architecture, and uses in the wild. Science
Robotics, 7(66):eabm6074, 2022.
[6] S. Macenski, F. Martin, R. White, and J. Ginés Clavero. The marathon
2: A navigation system. In 2020 IEEE/RSJ International Conference on
Intelligent Robots and Systems (IROS), 2020.
[7] E. Marder-Eppstein, E. Berger, T. Foote, B. Gerkey, and K. Konolige.
The office marathon: Robust navigation in an indoor office environment.
In International Conference on Robotics and Automation, 2010.
[8] T. Moore and D. Stouch. A generalized extended kalman filter
implementation for the robot operating system. In E. Menegatti,
N. Michael, K. Berns, and H. Yamaguchi, editors, Intelligent
Autonomous Systems 13, pages 335–348, Cham, 2016. Springer
International Publishing.
[9] S. Pütz, T. Wiemann, J. Sprickerhof, and J. Hertzberg. 3d
navigation mesh generation for path planning in uneven terrain.
IFAC-PapersOnLine, 49(15):212–217, 2016. 9th IFAC Symposium on
Intelligent Autonomous Vehicles IAV 2016.
[10] M. Quigley, K. Conley, B. Gerkey, J. Faust, T. Foote, J. Leibs,
R. Wheeler, and A. Ng. Ros: an open-source robot operating system.
volume 3, 01 2009.
[11] F. Ruetz, E. Hernández, M. Pfeiffer, H. Oleynikova, M. Cox, T. Lowe,
and P. Borges. Ovpc mesh: 3d free-space representation for local ground
vehicle navigation. In 2019 International Conference on Robotics and
Automation (ICRA), pages 8648–8654, 2019.
[12] T. Shan, B. Englot, D. Meyers, W. Wang, C. Ratti, and R. Daniela.
Lio-sam: Tightly-coupled lidar inertial odometry via smoothing and
mapping. In IEEE/RSJ International Conference on Intelligent Robots
and Systems (IROS), pages 5135–5142. IEEE, 2020.
[13] I. A. Sucan, M. Moll, and L. E. Kavraki. The open motion planning
library. IEEE Robotics Automation Magazine, 19(4):72–82, 2012.
139
Bibliography
[1] L. Grimstad and P. J. From, “Thorvald ii - a modular and re-
configurable agricultural robot,” IFAC-PapersOnLine, vol. 50, no. 1,
pp. 4588–4593, 2017, 20th IFAC World Congress. [Online]. Available:
https://www.sciencedirect.com/science/article/pii/S2405896317314830
[7] J. A. Reeds and L. A. Shepp, “Optimal paths for a car that goes both
forwards and backwards.” Pacific Journal of Mathematics, vol. 145, no. 2,
pp. 367 – 393, 1990. [Online]. Available: https://doi.org/
141
Bibliography
142
Bibliography
143
Bibliography
144
Bibliography
[49] X. Tian, S. Yang, K. Yu, M. Fu, and W. Song, “An efficient planning
framework for ground vehicles navigating on uneven terrain,” in 2023 IEEE
International Conference on Unmanned Systems (ICUS), 2023, pp. 1493–
1498.
145
Bibliography
[52] X. Xiao, B. Liu, G. Warnell, and P. Stone, “Motion control for mobile robot
navigation using machine learning: a survey,” CoRR, vol. abs/2011.13112,
2020. [Online]. Available: https://arxiv.org/abs/2011.13112
[53] M. Bakken, “Explainable and data-efficient learning for visual guidance of
autonomous agri-robots,” PhD thesis, Norwegian University of Life Sciences,
As, Norway, July 2021, available at https://hdl.handle.net/11250/2788278.
[54] N. Koenig and A. Howard, “Design and use paradigms for gazebo, an open-
source multi-robot simulator,” in 2004 IEEE/RSJ International Conference
on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566),
vol. 3, 2004, pp. 2149–2154 vol.3.
[55] M. Quigley, K. Conley, B. Gerkey, J. Faust, T. Foote, J. Leibs, R. Wheeler,
and A. Ng, “Ros: an open-source robot operating system,” vol. 3, 01 2009.
[56] M. Aicardi, G. Casalino, A. Bicchi, and A. Balestrino, “Closed loop steering
of unicycle like vehicles via lyapunov techniques,” IEEE Robotics Automation
Magazine, vol. 2, no. 1, pp. 27–35, 1995.
146
ISBN: 978-82-575-2145-5
ISSN: 1894-6402
120393 / MEDIEHUSET-ANDVORD.NO
Postboks 5003
NO-1432 Ås, Norway
+47 67 23 00 00
www.nmbu.no