Professional Documents
Culture Documents
Barnes 2008
Barnes 2008
2, JUNE 2008
limit [1]. There are many possible reasons that a driver may
I. I NTRODUCTION be speeding; one of these is failure to notice a speed sign.
In laboratory experiments, it has been shown that distractions
I MPROVING safety is a key goal in road vehicle technology.
Driver support systems aim to improve safety by helping
drivers react to changing road conditions. Although the full
result in a twofold increase in the failure to detect simulated
traffic signals [2].
automation of road vehicles remains a motivating goal, our Technology can assist by alerting drivers to speed limit
research focuses on systems that can immediately assist drivers. changes to which they do not appear to have reacted and
Rather than replacing the drivers, we aim to keep the drivers by maintaining a monitor of the current speed limit. Road
in the loop while supporting them in controlling the car. This sign recognition also has other possible applications, such as
is the approach that has been adopted by driver assistance building and maintaining maps of signs within an urban envi-
systems (DASs). ronment and automated inspection of signs for deterioration.
The risk of a crash causing death or injury rapidly increases The primary focus of the research program at the National
with increases in the speed over an appropriately set speed ICT Australia (NICTA) on sign detection is on road safety,
particularly supporting drivers in their tasks.
Presently, available techniques for the classification of can-
Manuscript received April 10, 2007; revised August 5, 2007, October 28,
didate signs show strong performance. The major shortcoming
2008, December 11, 2007, and December 24, 2007. The National ICT Australia of many systems is with high-speed detection, i.e., quickly
is funded by the Australian Government’s Backing Australia’s Ability initiative identifying likely candidates for classification in a way that
in part through the Australian Research Council. The Associate Editor for this
paper was S. Nedevschi.
is robust to the vast range of lighting conditions that occur
N. Barnes is with the National ICT Australia, Canberra, A.C.T. 2600, in road scenes. In this paper, we present the application of a
Australia, and also with the Department of Information Engineering, The shape detection algorithm, i.e., the radial symmetry detector, to
Australian National University, Canberra, A.C.T. 0200, Australia (e-mail:
nick.barnes@nicta.com.au).
speed sign detection. We present its integration into a system
A. Zelinsky is with the Commonwealth Scientific and Industrial Re- for speed sign recognition, showing real-time performance and
search Organization ICT Centre, Canberra, A.C.T. 0200, Australia (e-mail: robustness to a broad array of lighting conditions over image
alex.zelinsky@csiro.au).
L. S. Fletcher is with the Computer Science and Artificial Intelligence Labo- sequences. The detector system is also demonstrated running
ratory, Massachusetts Institute of Technology, Cambridge, MA 02139 USA real time in on-road trials at the Australian National Univer-
(e-mail: lukesf@mit.edu). sity (ANU)/NICTA/Commonwealth Scientific and Industrial
Color versions of one or more of the figures in this paper are available online
at http://ieeexplore.ieee.org. Research Organization (CSIRO) intelligent vehicle, as shown
Digital Object Identifier 10.1109/TITS.2008.922935 in Fig. 1.
1524-9050/$25.00 © 2008 IEEE
BARNES et al.: REAL-TIME SPEED SIGN DETECTION USING THE RADIAL SYMMETRY DETECTOR 323
II. B ACKGROUND GPS or a gyro and a speedometer may be useful to note things
such as T-intersections. In this paper, we focus on vision-based
Speed sign regulations vary worldwide. We consider the
detection.
Australian regulatory framework that offers a typical exam-
ple of conditions.1 Speed signs are one of several ways in
B. Vision-Based Sign Recognition Systems
which the current speed limit may be specified. A single type
of regulation sign indicates a change in the speed limit; the Research into vision-based road sign recognition began in
signs are rectangular with a red circle and black numbers or earnest in the mid-1980s. Brute force approaches may be
illuminated white numbers indicating the prescribed speed. used where classifiers, such as normalized correlation, are
A speed zone can end with another speed sign, a dead end, a applied at every image pixel. Such an approach is computa-
T-intersection, or a derestriction sign. Derestriction signs also tionally prohibitive; however, this can be eased somewhat by
have a circle on a rectangular background. Furthermore, some approaches such as simulated annealing [4] or coarse-to-fine
areas have geographic limits, such as a limit of 50 km/h on techniques in applying a hierarchy of templates [5]. However,
urban local streets, unless a sign indicates otherwise. However, these methods are still computationally expensive. To manage
noting a reduction of speed limit is clearly a critical aspect for computation while still maintaining classification quality, many
any complete system and would form a useful function on its approaches have introduced separate stages for the detection
own. For the purposes of this detector, Australian signs are and the classification of the sign type (e.g., [6]–[8]). A low-
largely consistent with many international road signs such as expense algorithm that produces few false negatives is used, and
the European Union, and therefore, the technology that has then, only a small fraction of the incoming image stream needs
been developed would be applicable more broadly. to be explored with more expensive and accurate classification
algorithms.
Fig. 3. Typical detected candidate sign (a) at input image size and (b) close D. Sign Recognition
up. The circle does not appear as a single color, the edges fall within pixels, and
the right edge touches the edge of the sign. As early as 1996, correct classification rate percentages of
more than 90% were reported using normalized cross cor-
the shape using edges [18] in an interesting method that applies relation [10]. More recently, Escalera and Radeva [18] have
a deformable model using genetic algorithms and simulated applied the normalized k nearest neighbor classification after
annealing. Real-time detection performance was not possible initial detection using Adaboost and orientation correction
at the time of publication. Recent work has combined the using algorithms such as the radial symmetry. They have re-
color with support vector machines (SVMs) for shape-based ported 98% correct classification. Another approach [23] also
detection as well as SVM-based classification [19]. The system applied Adaboost for detection but used Bayesian generative
is able to classify a broad array of signs at 1.7 frames/s using modeling for classification, reporting error rates of 6% over a
large images with high accuracy. Early work by Besserer et al. large data set. SVMs have shown impressive results for sign
[20] used evidential reasoning for shape-based detection of classification, with the correct class selected out of 35 general
traffic signs, finding circles, triangles, and polygons from sign types in over 97% of the cases and substantially improving
chain codes. for smaller sets of classes [24]. Recent work by Paclik et al.
The complexity of sign detection. Lighting conditions change [25] improves the performance over standard cross correlation
between direct sunlight, headlights at night, heavy cloud cover, with a trainable similarity measure. By incorporating the most
and smog, and a sign may be in complete shadow on a bright discriminative regions, the classifier is not sensitive to uninfor-
day.2 Such changes have an impact on lighting chromaticity mative pixels such as occlusions. Computational demands are
and, therefore, on the wavelength of the reflected light. Also, also reduced.
heavy rain may blur the sign appearance [see Fig. 2(b)]. Fur- Although commercial systems will require results of close to
thermore, for most cameras, the varying incident intensity and 100% recognition with few false positives, with the incorpora-
wavelength produce both nonlinear and interdependent effects tion of additional cues and further research, viable commercial
on the HSI in the resultant image [21]. Ideally, signs have clear classification may be possible in the near future. For example,
color contrast; however, over time, they may fade but are still the application of superresolution over multiple images [26] can
clear to drivers [see Fig. 14(d)]. improve classification performance. Alternatively, better classi-
Last, compared to ideogram-based signs, speed signs only fication may be achieved by the fusion of multiple classifiers,
have narrow bands of color, which are a few pixels wide unless such as Adaboost, if their problems are complementary.
some form of high-resolution active vision is employed. If more In general, sign classification is closer to commercial appli-
than one color falls on a charge-coupled device element, the cation than detection. Thus, in this paper, we focus on detection.
resulting color is not easily predicted (e.g., Fig. 3). We combine our detection algorithm with basic classification
In practice, as signs may appear in many image positions based on normalized cross correlation. Our aim here is not to
on highways, or over uneven and winding roads, a priori propose an ideal method for classification but to demonstrate
that the detection method can be effectively combined with
2 See Fig. 14 and the supporting video at http://cecs.anu.edu.au/~nmb/ classification to produce an effective complete sign-recognition
speedsigns/. system.
BARNES et al.: REAL-TIME SPEED SIGN DETECTION USING THE RADIAL SYMMETRY DETECTOR 325
E. Driver Feedback
Once a sign is detected, this information must be presented to
the driver. The most passive approach that is likely to have high
acceptance is to add a simple readout of the current speed limit,
perhaps by dynamically changing the speedometer to show the
current speed limit. However, a driver may also wish to be
alerted in the case that he or she has failed to notice the speed
sign. Speed monitors that sound an alert when a driver-specified
Fig. 4. Radial symmetry voting. (a) For a given edge point as shown, the
warning limit is exceeded are now common features in vehicles. centroids of a 1-D family of circles lie along the line that is orthogonal to
These could be set based on the currently detected speed, along the direction of the edge point. Note that if we have no prior expectation
with a driver-specified tolerance if they so choose. Alerting the of light–dark versus dark–light transitions on the circle, the family of circles
extends in both directions from the edge point. (b) From a continuous circle,
driver to a sign only when an adjustment is required, and he or every edge point votes along a line of possible radii; these lines will all intersect
she has not seen it, is easily implemented within a DAS, such at the center of the circle, resulting in a peak. A number of vote lines are
as monitoring if the driver has looked at the sign [26]. Heads- shown here.
up displays, e.g., from Volkswagen, could be used to draw the
driver’s attention when required.
point. The fast radial symmetry detector eliminates factor b by
directly taking the gradient of the edge point from the output
III. A PPROACH of the Sobel edge detector. Instead of voting in all directions, it
only votes in the direction that is orthogonal to the edge, which
We examine an efficient method for speed sign detection—
is where the center must lie. This way, the computation of the
the fast radial symmetry detector [27]. Many shape detectors
radial symmetry detector is reduced by a factor of b, and the
are nonrobust because they require closed shapes. Robust tech-
resulting circle map is simplified by a dimension. This makes it
niques such as Hough circle detection [28] are slow to compute
suitable for real-time use.
over large images. Fast radial symmetry can be run as a detector
The operation of the fast radial symmetry detector can be
at the frame rate. All Australian speed signs have a red circle
understood simply by considering all the possible circles of
with a contrasting background. We are able to eliminate the
which an image edge pixel could be part. If we have the
vast majority of false positives by considering only radially
direction of the edge pixel, then a family of circles lies along a
symmetric regions that are stable across several images and
line in the direction that is orthogonal to the edge, with varying
have a high count of pixels in proportion to the radius. This
radius [see Fig. 4(a)]. A single edge point such as this would
reduces the number of locations where signs can possibly occur.
vote for all circle centroids along this line.
Fast-radial-symmetry-based detection is compatible with all
In considering an edge image, for each edge pixel, we form
major forms of classification, as it identifies the center of the
a set of radial symmetry transform images for different radii.
sign according to maximum-likelihood criteria. The maximum-
Thus, when many edge points align around the boundary of
likelihood formulation follows from the maximum-likelihood
a circle, each of them will vote along their orthogonal lines.
basis of the regular polygon algorithm (see [29]). With the
This set of votes will peak at the circle center. Fig. 4(b) shows
center of the sign identified with the center of the circle,
sample vote lines that would emerge from a continuous set of
recognition only needs to be applied to a small number of
edge points corresponding to the circle shown. All of these lines
pixels. To demonstrate the compatibility of fast radial symmetry
intersect at the centroid, leading to a peak in the vote space.
detection with classification, we apply normalized cross cor-
In practice, this is performed on a discrete image, and the
relation to detected candidates. Typically, for recognition after
radius space is sampled with separate vote spaces for each
detection, an unknown scale requires further processing or, for
radius to simplify the interpretation. An image and samples of
cross correlation, multiple templates at all possible resolutions.
its radial symmetric vote space are shown in Fig. 5. The sign
However, from the radius that is returned by the fast radial
vote peaks in Fig. 5(c). It appears blurred in Fig. 5(b) due to
symmetry detector, we know the approximate scale of the
the detector radius being too small and in Fig. 5(d) due to the
candidate.
detector radius being too large.
For the sake of self-contained presentation, we include a
A. Sign Detection: Fast Radial Symmetry formal description of the detector largely taken from [27]. For a
given pixel p, the gradient g is calculated using an edge operator
The fast radial symmetry detector [27] is a variant on the that yields orientation. If this pixel lies on the arc of a circle,
circular Hough transform that executes in the order kp, where then its center would be in the direction of the gradient at the
p is the number of pixels, and k is the number of discrete radii size of the radius. To achieve lighting invariance, we apply the
that are searched. This is as opposed to the traditional circular discrete form of the detector. The location of a pixel that will
Hough transform that executes in the order kbp. For the circular gain a vote as a potential center is defined as follows:
Hough transform, each edge pixel votes on all circles over a
discrete set of radii k that could pass through that edge pixel.
Factor b comes from the discretization into a number of bins g(p)
p+ve = p + round n (1)
on the angle of circular tangents that could pass through this g(p)
326 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, VOL. 9, NO. 2, JUNE 2008
Fig. 6. Templates that are used for matching. From the largest shown [(a) and
(c)], we scale all sizes down to (b) and (d).
within the capture zone for the centroid. Thus, the sign should
have its full vote. As all edge points vote, the degradation is
graceful with larger angles.
B. Sign Classification
To show the efficacy of a detector, we must demonstrate that
it can be effectively combined with classification. We reviewed
the effective methods for classification above; however, for the
purposes of this paper, we combined the detector with simple
Fig. 5. Image including a road sign and sample radial symmetry images for standard classification. We applied normalized cross correlation
the three largest radii. Detection peaks in (c).
from the Intel Integrated Performance Primitives library. Cross
correlation is run based on the templates that come from the text
where n ∈ N is the radius, and N is the set of possible radii. of a single sign from close by. We scaled this down by linear
In application to sign detection, this is defined by expectations interpolation across all color channels to all expected radii N
about the apparent sign size. (A negative image is similarly to form a total of eight templates for each sign. Prescaling of
defined, facilitating constraining the operator to find only dark templates saves online computation. The base templates used
light circles or only light dark circles.) A vote image is incre- for 40 and 60 signs are shown in Fig. 6. For each candidate
mented for each pixel based on the orientation votes, i.e., found, we are able to run cross correlation on only a 5 × 5
pixel window around the center of the candidate. The algorithm
On (p+ve ) = On (p+ve ) + 1. (2) provides a good estimate of the radius, and therefore, we are
able to apply a narrow band of sizes of templates. Note that the
In our implementation, the vote image is defined as follows: radius is only as precise as the sampling of the radial symmetry
transform in the radius dimension. Furthermore, the circle on
Õn (p) the sign has a finite width. The radius returned is the sample
F̂n (p) = (3)
kn that has the most support. As the sign is assumed to be vertical
and parallel to the image plane, no scaling or rotation of the
where kn is a scaling factor that normalizes On across different template is necessary. The detector itself, however, is unaffected
radii. We do not use the other parameters from the original by text orientation within the plane; therefore, recognition could
paper, as we are only interested in the maxima of the radial be extended to recognize damaged signs that are rotated in the
symmetry. image plane. Given the simplified nature of the recognition
To obtain the radial symmetry image, F̂n is convolved with a problem as described, it takes less than 1 ms to classify a
Gaussian function. There are several images that are produced single candidate as 40 or 60. Note that this classification can
by the transform. Each radius of N votes into a separate image. be performed on a separate processor and does not have to
There is also a full transform image that averages all the be synchronized with detection; overall, it must keep up to
symmetry contributions over all the radii considered, i.e., minimize the delays in the display to the driver.
1
S= Sn . (4) IV. A DAPTATION OF THE R ADIAL S YMMETRY T RANSFORM
|N |
n∈N
TO S IGN D ETECTION
See [27] for full details. Detecting speed signs using the radial symmetry transform
Sensitivity to sign alignment with the image plane. The assumes that the sign is close to parallel to the image plane.
radial symmetry detector is derived for circles; however, in our However, road signs are designed to be visible to the driver. On
discrete implementation, we accept votes for a single circle over a straight road, the sign will always be oriented perpendicular
a capture zone of four pixels. Thus, a vote from an edge point to the motion of the car. On highly curved roads, motion will be
must miss the centroid by at least 1.5 pixels to not count. The slower, and the sign will appear perpendicular to the car motion
largest radius that is applied in this paper is 13. Thus, if we turn when it is most visible to the driver. This means that when the
a sign off the image plane by up to 25◦ in either direction, then, sign is most easily detectable, unless it is damaged, the circle
without noise, the votes of the outer edge pixels should still fall will appear approximately as a circle in the image.
BARNES et al.: REAL-TIME SPEED SIGN DETECTION USING THE RADIAL SYMMETRY DETECTOR 327
Fig. 11. Radial symmetry in the rain [see Fig. 9(d)]. (a) Detector vote image
with a threshold 10. (b) Detector image with a threshold 11 299.
Fig. 12. Images from the second sequence taken around the university.
Fig. 15. These sign images were too small to be detected but were detected in
subsequent images in Fig. 14(a) and (b), respectively.
Fig. 14. Signs in difficult images were generally correctly detected with a Fig. 16. Parameters used in the integrated system.
high threshold, even when difficult with the human eye. All these signs were
detected with the standard threshold. (a)–(c) Heavily shadowed scenes and
(d) a weathered sign.
D. Real-Time Performance
The detector returning candidates comfortably runs at the
across the sequence, we may estimate the total probability of
given frame rate (30 Hz) with four radii and a 320 × 240
not detecting a sign as 0.1894n/2 for missing each of the odd
image. Classification only slows this down when synchronously
and even frames based on this data set. If we assume that a
run. The implementation is in C++ with some mmx operations
sign is visible to the camera for half a second with a processing
running on an Intel Xeon 3.40-GHz machine mounted within
rate of 30 Hz (n = 15 frames, (n/2) = 7 and 8) within the
the car. A slightly reduced version of the detector and the
possible range of radii, the probability of a false negative is
recognition system comfortably runs on a standard Pentium 4
9 per 1 million. This is actually a poor case of an assumed
1.6-GHz laptop.
distribution. Most failed recognition cases are clustered toward
the start of the sequences, as when the sign is closer, it is easier
to detect; therefore, the true failure rate for these sequences E. Integration With Classification
is lower. These statistics are only taken across two sequences.
We combined the detector and the classifier to demonstrate
Clearly, for a thorough examination, a very large number of
the detection ability in real time over a large number of video
sequences are necessary; however, this would be a significant
sequences, and running online in the vehicle on public roads, as
exercise.
what appears on the Web site. The parameters used are shown
in Fig. 16. The sequence video screen classification display is
updated every time the system detects signs, and it has high
C. Difficult Cases
correlation for two successive frames. For most sequences, the
The system, including the detector, was run over sequences sign is classified once or more than once, and often, multiple
showing varying lighting conditions and sign quality. In the correct classifications occur, although, in a few instances, the
case of Fig. 14, the heavily weathered sign was not well best correlation is with the wrong speed. The last classification
detected until it was a little closer than is generally the case but remains on the screen, and, although a sign is visible, this may
was detected in the image shown. The ANU campus has many rapidly change. Stepping through the video frame by frame, it
shaded roads. Fig. 14 shows a number of images where the is possible to see the transitions.
scene is difficult for color-based methods, and the substantial Fig. 17 shows false positives versus false negatives for all of
lighting variation means that the sign has poor contrast. All the sequences of signs using the detector classifier combination.
signs in these images were correctly detected. Four candidates are taken for each radius of the detector per
Last, Fig. 15 shows images with signs that are too small frame, and zero to five repeated appearances are required.
to be recognized but were recognized in subsequent images. Fig. 17 shows that by requiring classification for two frames,
Note that the detector may not reliably detect small signs as we can have 93% successful detection with around 0.5 false
the number of possible pixels becomes small, and the high positive per sequence or 86% successful detection with a false
curvature of the circle may reduce the accuracy of the gradient positive for every five signs that appear.
direction estimation. With a faster processor, higher resolution Note that this classifier may report multiple different sign
images could be used. Multiple processors could also process classifications in a sequence rather than taking the most likely
parts of the image. over the whole sequence. As noted earlier, better methods
BARNES et al.: REAL-TIME SPEED SIGN DETECTION USING THE RADIAL SYMMETRY DETECTOR 331
Nick Barnes (M’92) received the B.Sc. degree Luke S. Fletcher (M’06) received the B.E. (Elec.)
(with Honors) and the Ph.D. degree in computer (Hons.) and B.Sc. (Comp. Sci.) degrees from The
vision for robot guidance from The University of University of Melbourne, Melbourne, Australia, in
Melbourne, Melbourne, Australia, in 1992 and 1999, 1996. He is currently working toward the Ph.D.
respectively. degree in engineering at The Australian National
From 1992 to 1994, he was a Consultant in the IT University, Canberra, Australia, specializing in
industry. In 1999, he was a Visiting Research Fellow driver monitoring integrated with environmental
with the LIRA-Lab, University of Genoa, Genoa, perception.
Italy, where he was supported by an Achiever Award He is currently a Postdoctoral Fellow with the
from the Queens’ Trust for Young Australians. From Computer Science and Artificial Intelligence Labo-
2000 to 2003, he lectured in computer science and ratory, Massachusetts Institute of Technology (MIT),
software engineering with The University of Melbourne. Since 2003, he has Cambridge. He recently competed in the Defense Advanced Research Projects
been with the Canberra Research Laboratory, National ICT Australia, Canberra, Agency Urban Challenge on the MIT team and is now working on related
Australia, where he is currently the Principle Researcher and Research Group technologies for driver support.
Manager in computer vision. His research interests include visual dynamic
scene analysis, computational models of biological vision, feature detection,
vision for vehicle guidance, and medical image analysis.