Professional Documents
Culture Documents
10 1 1 125 62
10 1 1 125 62
Outdoor Environments
Bernard Ghanem, Ali Fawaz, and Ghassan Karame
Faculty of Engineering and Architecture, American University of Beirut
3 Dag Hammarskjold Plaza, 8th floor, NY, New York 10017-2303, USA
E-mail {bsg00}@aub.edu.lb
Abstract - This paper introduces the design of an outdoor context of a real-time “kidnapped robot problem” [1],
navigation system based on range sensors and camera whereby the robot, equipped with a single camera, is inserted
information intended to perform incremental localization in a into a previously unknown outdoor environment and at an
previously unknown environment, while being able to navigate initially unknown location within this environment. A number
successfully from source to destination, thus, avoiding impeding
of identifiable landmarks will be placed in the robot’s
obstacles.
Prior studies in the field of Simultaneous Localization and surroundings, some of which are not within initial camera
Mapping (SLAM) have shown computer vision (CV) to be an range and their respective global coordinates will be made
influential detection and recognition tool for indoor settings [2]. known to the robot a priori. Initially, the robot will be required
However, the application of this technique in dynamic outdoor to wander about its immediate location for preliminary
environments, especially where GPS is no longer efficient, has yet localization based on landmark detection, while avoiding
to be conducted and tested. Consequently, our mobile robot dynamic obstacles that may be encountered. The relative
system will be capable of localizing itself with respect to distances between the robot and these landmarks will need to
predefined identifiable landmarks, which are visually “sensed” be accurately estimated in order to minimize the error incurred
and detected. These static landmarks will be used by the robot to
in approximating its global position. A deterministic
provide a better estimate of its location in an attempt to plan a
path from its unknown initial position to a chosen destination triangulation method will be used to reduce the uncertainty
within its environment, while bypassing dynamic obstructions in accumulation inherent in automated navigation [1]. This
its path. research proposes a technique for incorporating CV methods
as an accessory to GPS when the latter renders inadequate
Index Terms - Mobile Robot, SLAM, Computer Vision, performance among tall buildings, large structures, and
Landmark detection, Localization. overhead trees. In fact, even though GPS is considered the
preferred sensor in outdoor operation, its performance
I. INTRODUCTION
efficiency is limited because of the obstruction of the vivid
The issue of SLAM has been extensively researched and view of the positioning satellite. Our implemented system
resolved with accurate precision via the use of traditional makes use of the abundant linear features describing
range sensors [3]. However, due to its relative complexity and neighbouring buildings in urban situations, where GPS fails to
highly demanding computational requirement, SLAM, using provide a substantial contribution to the localization task.
CV as its main sensory device to imitate human capabilities
II. EXPERIMENTAL SETUP
and perception, has not been appropriately encompassed
especially in outdoor settings, yet its future promise to be This section describes the experimental setup established to
more significant in the coming years due to the wide variety of depict the performance and functionality of our research and
relevant applications it renders. The advantages of this its various implementations.
technique will allow the robot to acquire more useful A. Robot Navigation Specifications
information from the features it senses and to augment its According to established experimental setup, the robot
initially incomplete “insight” into the existent environment. In will be initialized at an unknown position in the environment
fact, cameras have numerous other desirable properties that at which it will render the origin of its local coordinate frame
can be exploited for the purpose of enhancing robot sensing that is used for navigation. First, it will start to systematically
and navigation. For instance, they are low-cost general “wander” about this initial position in order to detect the first
purpose sensors, which behave passively preventing vision- identifiable landmark. In fact, the robot starts capturing
based navigation systems from suffering possible interference images, which are then analysed using the designed vision-
from external and/or internal sources. based tool as discussed later. If it detects a landmark, the robot
Our primary objective is to develop robust, user friendly will have to recognize it (i.e. through its distinctive colour and
software to be run on a Pioneer 3-AT outdoor robot, which shape), move closer to it, and then repeat the previous process
performs navigation on the basis of accurate localization (i.e. detection and recognition) in order to provide an estimate
results supplied by CV principles. In this regards, it is planned for that landmark’s position with respect to the robot’s local
to implement an incremental localization technique within the coordinate frame. Afterwards, it will continue to search for
another landmark and locate it as well, in order to transform method, a distinctive shape or feature can be drawn on these
the robot’s local position into coordinates suitable for the landmarks to be distinguished by the application. Due to the
global coordinate frame as defined by the experimenter. variation in shape, size, and perspective view, landmark
However, in case a landmark has not been found in the detection becomes more involved; however, localization is
camera’s optical range, the robot will revolve about itself with possible with only one collected image.
relative increments of 90° (knowing that the robot’s optical The CV application commences operation by executing a
range is about 100°). This rotation is terminated in either of stage of illumination normalization, which maintains a mean
the two situations: illumination value of 100 for the “raw” image. This ensures
The robot subsequently detects, recognizes the landmark, that the natural lighting affects neither landmark detection nor
and estimates the latter’s local pose localization [4]. After illumination augmentation, this image is
A complete turn has been performed, so the robot will further processed for the specific colours of the given
proceed with the same rotational activity but at a radius of landmarks. This demands selective background elimination on
1 meter from the previous position. This radius is the basis of chromatic properties (i.e. all blobs incomparable
incremented every time a complete turn is finished in colour to the landmark are whitened). This process can be
without finding a recognizable landmark. done in numerous ways, which include the two developed so
This scenario realizes all the possible combinations that far. The first methodology requires that each pixel is
may ensue, when the robot is placed in a previously unknown compared to a “generic” range for landmark colour, which is
environment. In this case, no action is conducted without a distinctive of the landmarks themselves. The other approach
reasonable motive, thus emulating human capacity in makes use of the correlation between landmark pixels and
wandering through a previously unseen setting. their neighbours via a normalization correlation matrix, which
In this overall layout, the robot will be required to is excerpted from a typical subsection of the landmark image.
perform landmark recognition/detection and self-localization. Although simpler in form and implementation, the first
These processes will be described in the following sections. method requires pixel-to-pixel processing with no relation to
what the surrounding pixels may yield. On the other hand, the
B. Landmark Recognition and Detection
second method is burdened with a relatively longer processing
In the context of this real-time “kidnapped robot
time than the first, yet it renders more details about the
problem”, the robot is challenged by various environmental
landmark, while eliminating the contrast factor from the
and technical issues that have to be resolved before a suitable
images.
localization method is applied.
After chromatic background rejection, the “crude” image
The robot is equipped with a Canon VC4 coloured
is analysed by a dilating morphological filter, which tends to
camera with a natural resolution of 768x480 pixels and an
remove noisy factors evident in the residual image. Also, it
optical range of approximately 100°. On the basis of the
renders a slightly “smoother” contour of the landmark, if
available camera features, the CV application must be able to
present, since it deals with expanding the majority colour
distinguish a suitable landmark, if available, and locate it
within a sampling matrix traversing the whole image. In this
within the processed image in a real-time situation. The main
regards, sharp and/or jagged edges are removed, thus,
limitation is caused by variations in illumination, which
facilitating the corner/line detectors that are applied later.
require the robot to be insensitive to image fluctuations
Moreover, a simple median-smoothing filter is applied to
induced by temporal and environmental settings.
the pre-processed image to remove any other insignificant
At start up, the robot is designed to use its frame grabber
noise elements due to random fluctuations. As a consequence,
to periodically capture a medium resolution image of its
the resulting image after preliminary landmark recognition
surroundings to be stored as “raw” data. This is based on the
must be void of any colour range other than the one to be
assumption that the robot has prior knowledge of the specific
tested for (red, green, or blue). This colour analysis can be
colour and shape of the available landmarks, which in the
performed in a variety of methods depending on the preferred
initial stage are limited to three cardboard cylinders red, green
colour space to be used (i.e. Red Green Blue space [RGB],
and blue in colour. This choice of landmark type comes as a
Hue Saturation Illumination space [HIS], or Chrominance
compromise between complex detection and localization. In
Luminance space [ChLu]) [5]. The optimal colour space to be
the case of cylindrical landmarks, the robot can simply check
implemented should be able to perform image processing
for a rectangular shape of a certain colour to detect the
independently of the changing environmental factors. Despite
presence of the landmark; however, the issue of localization
the fact that most images are represented as RGB sequences,
becomes more elaborate due to the inability of the robot to
this colour space remains sensitive to lighting that may disturb
distinguish its bearing towards the landmark from a single
the exact detection of a certain colour. For example, a
frame image. The need for accurate localization in the
coloured picture of a red landmark under bright light will
presence of a single camera obliges the application to require
represent a totally different RGB range than that of the same
a pair of images of the same landmark at two distinct
landmark but in normal lighting. As a better approach, either
positions, which are somewhat far apart. The alternative
HSV or ChLu spaces can be used directly to eliminate lighting
approach to landmark selection is to facilitate landmark
localization, while complicating landmark detection. In this
sensitivity. The resultant analysed image can be compared to their respective directional vectors in the image plane to
the raw one taken in Figure 1 below. decide whether they are “parallel” up to a certain extent
defined by another user-defined threshold. This takes into
account the inaccuracies that may result from both the Canny
and Hough methods and the relative slanting of the landmark
itself with respect to the robot’s camera.
Distance (m)
Fig. 5 Graph showing the inverse relationship between landmark width and
distance from camera