Professional Documents
Culture Documents
BIT : Batch Informed Trees For Optimal Sampling-Based Planning Via Dynamic Programming On Implicit Random Geometric Graphs
BIT : Batch Informed Trees For Optimal Sampling-Based Planning Via Dynamic Programming On Implicit Random Geometric Graphs
of discretization by representing the planning domain as a set on random worlds in R2 (Figs. 4, 5) and R8 (Figs. 6, 7). BIT*
of samples drawn randomly from a continuous state space. more reliably finds solutions and finds solutions of equivalent
This allows them to scale effectively to high-dimensional cost faster than RRT*, informed RRT*, and FMT*.
configuration spaces. They are probabilistically complete, The remainder of this paper is organized as follows.
having a probability of finding a solution, if one exists, that Section II presents the general BRT class of planners while
goes to unity as the number of samples goes to infinity. Section III presents the specific BIT* algorithm. Section IV
Optimal variants, such as RRT*, PRM* [27] and FMT* [28], presents initial analysis of the algorithm. Section V presents
are also asymptotically optimal, having the probability that the experimental results in more detail and Sections VI and
the found solution is optimal, if one exists, also going to VII provide a discussion of the algorithm and ongoing work
unity as the number of samples goes to infinity. To date, and a conclusion.
very little has been said about the efficiency with which
these algorithms find the optimal solution; however, informed II. B ELLMAN R ANDOM T REES (BRT S )
RRT* [22] showed that a heuristic can be used to improve Karaman and Frazzoli [27] recognized that the random
the rate at which a solution converges towards the optimum. samples from the state space in a planning algorithm can
In this paper, we argue that optimal sampling-based plan- be modelled as an RGG. RGGs are a probabilistic structure
ning algorithms that generate trees (e.g., RRT*, FMT*) can with a number of important statistical properties depending
be unified as a class of algorithms we refer to as Bellman on their specific type. For planning, the type of RGG depends
random trees (BRTs). This class of planners uses (exact or on the connection scheme used by the planner, but common
approximate) dynamic programming to build an explicit tree types include r-disc graphs [29], [30], k-nearest neighbours
from the implicit random geometric graph (RGG) defined by graphs [31], [32], and online graphs [33].
the random samples. We unify these observations with dy- Karaman and Frazzoli [27] provide excellent definitions
namic programming to develop an anytime optimal planning of these graphs as they relate to planning. An r-disc RGG,
algorithm, batch informed trees (BIT*) (Figs. 1, 2). G near (n, r), is defined by a set of n randomly distributed
BIT* is a BRT algorithm that combines the anytime nature vertices and a radius of connection r, with edges existing
of incremental BRTs (i.e., RRT*) and the efficiency of bulk between vertices if and only if the distance between them
BRTs (i.e., FMT*) with a heuristic to focus the search. This is less than r. A k-nearest neighbours graph, G near (n, k), is
not only prioritizes low-cost solutions, but also minimizes similarly defined by a set of n random vertices but with edges
the size of the explicit tree constructed from the implicit existing between vertices and their k-th nearest neighbours.
RGG. It does this by using dynamic programming to solve An online nearest-neighbour graph, G onn (n), is defined by
increasingly dense RGGs that are constructed in a principled a sequence of n random vertices with edges existing only
manner, allowing for both quick initial solutions as well between a vertex and the preceding vertex in the sequence
as a focused search for improvements that reuses existing that minimizes the edge length. These different types of
information. Its performance depends on the properties of RGGs will be necessary in the following sections to describe
dynamic programming and RGGs, making it simple to how the different algorithms work.
analyze. It is shown that it is both probabilistically complete Any set (or sequence) of samples from the state space,
and asymptotically optimal, and it is conjectured that under V = {x ∈ Xfree }, can by viewed as an implicit RGG with
some currently unknown probabilistic conditions, it may also a specified successor function that uncovers a given vertex’s
be optimally efficient. connections, τ : v ∈ V 7→ {(v, w ∈ V )}. For example, in
The performance of BIT* is demonstrated relative to the implicit r-disc graph this successor function would be a
existing optimal algorithms through a series of experiments nearest-neighbours search
Solution cost
1.44
1.42 1.6
1.4 1.5
1.38
1.4
1.36
1.3
0 5 10 15 0 5 10 15 20 25 30 35 40 45
Run time [s] Run time [s]
RRT* Informed RRT* BIT* FMT*
RRT* Informed RRT* BIT* FMT*
Fig. 4. The median solution cost versus run time for 20 different exper- Fig. 5. The median solution cost versus run time for 20 different
iments on a typical random world in R2 for RRT*, informed RRT*, BIT* experiments on a difficult random world in R2 (Fig. 1) for RRT*, in-
with a batch size of 1000, and FMT* with n = 1000, 5000, & 10000. formed RRT*, BIT* with a batch size of 1000, and FMT* with n =
The dashed line represents a median calculated from a number of solutions 1000, 5000, 10000, & 25000. The dashed line represents a median cal-
between 10 and 20 and the solid line the median once all 20 runs had culated from a number of solutions between 10 and 20 and the solid line
found a solution with error bars denoting a non-parametric 95% confidence the median once all 20 runs had found a solution with error bars denoting
interval. Note that BIT* outperforms all the other planners a non-parametric 95% confidence interval. Note that BIT* outperforms all
then by construction no other edge in Qrewire can and we the other planners
clear the rewiring queue (Alg. 5 Lines 9–10). If it could, we from the asymptotic optimality of PRM* (Theorem 34 in
evaluate the actual cost of the edge, c (vm , wm ), (i.e., check [27]) and the ability of BIT* to find the optimal solution in
the edge for collisions, Alg. 5 Line 5), and if it improves the a given graph. The proof that the exact version of BIT* finds
path to wm , replace the existing incoming edge to wm with the shortest-path tree in a graph follows from the theorems
(vm , wm ) (Alg. 5 Lines 6–7). of dynamic programming. The proof that the approximate
As discussed in Section VI, in practice we do not check version (i.e., without cascaded rewiring) of BIT* also finds
for resulting improvements in the graph, omitting Alg. 5 the shortest-path tree should follow from proofs in [27] and
Line 8. This makes the rewiring algorithm at each batch an is currently being investigated.
approximation as in RRT*, but does not appear to affect the Conjecture 1 (Probablisitc Optimal Efficiency): It is con-
final asymptotic optimality. jectured that for a given total number of batches, b, number
IV. A NALYSIS of samples per batch, n, a radius of connection in terms of the
total number of samples, r, and a heuristic, fb(·), the exact
Without prior knowledge of the planning problem to jus-
version of BIT* is the optimally efficient method to solve
tify a nonuniform sampling distribution, the best solution we
the planning problem under some probabilistic measure.
can probabilistically expect from a sampling-based planner
Proof: (Sketch) For the single batch case, this should
is bounded by the properties of RGGs. Similarly, given a
follow immediately from the proofs of A* and showing that
specific G ∈ G near (n, r), dynamic programming provides
BIT* is equivalent to A* on an implicit RGG. It is believed
methods to efficiently search the graph for the optimal path
that it may be possible to make a more general statement
from xs to xg . It naturally follows that the analysis of
for the general planning problem under some appropriate
BIT* will depend on the probabilistic properties of the RGG
probabilistic limitations.
representation and the method used to search the graphs.
Sketches of these proofs are presented below, it is expected V. E XPERIMENTAL R ESULTS
that they will be expanded before final submission.
BIT* was compared to RRT*, informed RRT* and FMT*
Theorem 1 (Probabilistic Completeness): The probability
with various batch sizes on a series of random worlds in
that BIT* returns a solution to a given planning problem, if
R2 and R8 (Fig. 2). The worlds were randomly populated
a solution exists, goes to unity as the number of iterations,
with obstacles (axis-aligned rectangles and hyperrectangles,
i, goes to infinity.
respectively) such that at most a quarter of the environment
Proof: (Sketch) For each batch, BIT* searches an r-disc
was obstructed (Fig. 1). In total, the planners were tested on
RGG that is equivalent to the graph defined by PRM* in [27]
12 random worlds (6 in R2 and 6 in R8 ), with each world
for the same samples. Thus, the proof of its probabilistic
being solved by the planners 20 different times with different
completeness follows from the probabilistic completeness of
pseudo-random seeds1 .
PRM* (Thereom 22 in [27]) and the completeness of the
In an attempt to minimize the effects of the RRT-steer
graph search performed on the RGG by BIT*.
parameter on RRT* and informed RRT*, we set it equal to
Theorem 2 (Asymptotic Optimality): The probability that
the radius given in [27] for asymptotic optimality at each
the solution returned by BIT* is the optimal solution to a
iteration. This is the same calculation used by FMT* and
given planning problem, if such a solution exists, goes to
BIT* to define nearest neighbours. We evaluated various
unity as the total number of iterations, i, goes to infinity.
samples sizes for FMT* and used batch sizes of n = 1000
Proof: (Sketch) As in the proof of Thereom 1, we
in R2 and n = 5000 in R8 for BIT*.
recognize that the r-disc RGG searched by BIT* at any batch
is equivalent to the graph defined by PRM* for the same 1 The experiments were run on a MacBook Pro running Ubuntu 12.04
samples. Thus, the proof of asymptotic convergence follows with a 4 GB of ram and a 2.67 GHz processor.
Number of solutions
Solution cost
3.4
15
3.2
10
3
5
2.8
0
0 10 20 30 40 50 60 70 80 90 0 10 20 30 40 50 60 70 80 90
Run time [s] Run time [s]
RRT* Informed RRT* BIT* FMT* RRT* Informed RRT* BIT*
Fig. 6. The median solution cost versus run time for 20 different Fig. 7. The number of solutions from 20 different experiments on a typical
experiments on a typical random world in R8 for RRT*, informed RRT*, random world in R8 versus run time for RRT*, informed RRT* and BIT*
BIT* with a batch size of 5000, and FMT* with n = 1000, 2500, & 5000. with a batch size of 5000. Note that BIT* reliably finds a solution quickly.
The dashed line represents a median calculated from a number of solutions
between 10 and 20 and the solid line the median once all 20 runs had algorithm should be modified to consider whether edges in
found a solution with error bars denoting a non-parametric 95% confidence Qedge could further improve a newly added vertex before
interval. Note that BIT* outperforms all the other planners. they are removed.
These experiments demonstrated that in low dimensions To exactly rewire the tree in Alg. 5, all descendents of
BIT* finds an initial solution in approximately the same an updated vertex must also be evaluated for the potential
time as other planners, but finds solutions of equivalent to further improve the tree as shown in Alg. 5, Line 8.
cost significantly faster (Fig. 4). This difference is even Cascading these rewirings can be expensive, and anecdo-
more pronounced in worlds with narrow passages (Fig. 5). tal evidence suggests that the resulting computational cost
In high dimensions, BIT* significantly outperformed the outweighs their benefit in iterative algorithms. There exists
existing algorithms in terms of time required to find an initial precedence in both dynamic programming techniques [5] and
solution, time required to find solutions of equivalent costs sampling-based planners [27] to limit this rewiring to only
(Fig. 6), and also in the probability of finding a solution by the edges from the new vertex, and as a result we omit Line 8
a specific time (Fig. 7). from Alg. 5 in our implementation of the BIT* algorithm.
The algorithm implementations were unoptimized and This makes the graph search performed during each batch
these results are preliminary; however, all algorithms share an approximate dynamic programming algorithm, though it
the same primitive functions which should permit rough does not appear to affect the final asymptotic optimality.
relative comparisons. While optimizations will probably have In practice, there are a number of possible implementation
a greater effect on the slower algorithms, since BIT* ulti- improvements. The heuristic can be used to remove both
mately performs the fewest operations, the relative order of samples from Qfree and vertices from V that cannot be part
performance should not change. It would be misleading to of a better solution. In our experiments, we pruned Qfree but
present performance versus iteration as each algorithm does not V . It may also be more efficient to check that the target
a significantly different amount of work per iteration. of a new edge (Alg. 1, Line 8) is not already in the tree than
to clear the queue of all such edges each time a new state is
VI. D ISCUSSION added (Alg. 2, Line 2).
The BIT* algorithm can be considered to be a search
VII. C ONCLUSIONS & F UTURE W ORK
of a series of increasingly-dense r-disc RGGs. Thus, after
updating the existing tree with rewiring, we generally assume In this paper we define a general class of planners, Bell-
that g (x) ≈ gT (x) holds for the entire processing of a single man random trees (BRTs), that generate an explicit shortest-
Qedge (i.e., a batch). This is equivalent to solving the current path tree from an implicit RGG. We use this classification
RGG while ignoring the future possibility of denser RGGs, to motivate the development of an optimal anytime planning
which helps limit the size of the queues. In practice, however, algorithm, batch informed trees (BIT*). BIT* combines the
we found two situations where the algorithm performed anytime advantages of RRT* with the ordered-processing
better when we considered the possibility of future rewiring. advantages of FMT*. It does this by iteratively processing
These are the addition of new edges to the edge queue batches of samples, effectively searching a series of increas-
(Alg. 2, Line 4) and the addition of edges to the explicit tree ingly dense RGGs embedded in the continuous configuration
(Alg. 1, Line 11). At this time, it is not clear if the resulting space. The sizes of these batches can be viewed as a choice
difference in performance is from a fundamental algorithmic between exploration (small batches) and exploitation (large
advantage or a limitation in our implementations. batches).
In Section III-A we stated that we assumed that our It uses a heuristic and dynamic programming to do this in
heuristic estimate of the edge-cost was always the true an efficient and principled manner that minimizes changes
edge cost, except for when the edge intersected obstacles, to the shortest-path tree. This allows it to both quickly find
c (·, ·) = c (·, ·) ∪ ∞. This is an acceptable assumption for
b initial solutions and rapidly converge towards the optimal so-
planning problems where the calculation of the edge cost lution. By using a heuristic, it focuses its search towards the
is not expensive. For problems where this is not true, the goal and minimizes the size of the explicit tree constructed