Professional Documents
Culture Documents
Last Vergen Reports
Last Vergen Reports
Last Vergen Reports
pages
Road Sign Detection and Recognition …………………............. 2
Real Time Road Sign Recognition System Using Artificial…….5
Neural Networks for Bengali Textual Information Box
1
Road Sign Detection and Recognition
Overview
The approach is proposed by Michael Shneier in June 2005. Road signs are detected
by means of rules that restrict color and shape and require signs to appear only in
limited regions. The candidate regions identified by color and pruned using shape
criteria. They are then recognized using a template matching method and tracked
through a sequence of images over time. The method is fast and can easily be
modified to include new classes of signs.
Approach
These constants are determined by sampling the red, green, and blue
values of images of typical signs.
2. The constraints are applied pixel by pixel to the image and result in a
binary image with 1’s where pixels are candidates for belonging to a sign.
3. Following the creation of the binary images, morphological erosion is
performed to get rid of single pixels and two or three dilations are done to
join parts of signs that may have become separated due to the presence of
writing or ideograms on the signs (Figure 1).
4. The next step is to find connected components in the binary images and
identify blobs that are likely to be signs.
1. Properties of each component are computed, including the centroid,
area, and bounding box.
2. They are used in a set of rules that accept or reject each blob as a sign
candidate:
a. The rules require the area of the blob to be greater than a
minimum and less than a maximum, the height to width ratio to
be in a specified range, and the centroid to be in a restricted
part of the image where signs can be expected to appear.
2
b. The ratio of the area of the blob to the area of the bounding box
is restricted to prevent blobs that are too thin from being
accepted.
3. Blobs that conform to the rules are considered to be candidate signs
and are tracked from image to image.
4. If a blob is seen in five successive frames, it is confirmed as a
candidate and goes on to the recognition phase of the algorithm.
2- Recognition:
The stored sign templates are taken from video sequences similar to those being
recognized.
Because there is a lot of variation in the signs, several stored templates may be
needed for each canonical sign.
3
Experiments & Results
The algorithm was tested on video sequences recorded with a camera at normal
driving speeds.
The results reported here are for a total of 23,637 frames containing 92 warning
and stop signs.
The algorithm runs at over 20 frames per second on a 1.6 MHz Intel Pentium
Mobile.
Table 1 shows the results for three individual runs and the combined totals, while
Table 2 shows the corresponding percentages.
4
5
Real Time Road Sign Recognition System Using Artificial
Neural Networks for Bengali Textual Information Box
Overview
Approach
6
3) Bangla Optical Character Recognition Using MLP:
5) Speech Synthesis:
7
Experimental Results and Discussion
The performance and accuracy rate of the proposed system are measured by
testing four major modules with a set of real time video frames, which contains
different types of Bangla road signs.
The modules of our proposed Bangla Road Sign Recognition System are
implemented using Visual Basic 6.0 and MATLAB 7.0.
The following definitions are applied:
Success Rate = (Total of Success/ Total Number of Input Sample) x100 %
Failure Rate=(Total Number of Failure/Total number of Input Sample) x100%
Efficiency= (100-Failure Rate) %
In this system, the total computing time required for recognizing a road sign after
capturing an image up to speech synthesis is 0.98s in non complied Matlab
environment.
That means that before capturing a new frame the previous frame completely pass
all of phases of the system. As a result, the time complexity of the system is quite
low.
After testing this system, the obtained accuracy rate was evaluated at 91.48%.
8
9
10
A First Approach to Learning a Model of Traffic Signs
using Connectionist and syntactic methods
Overview
The approach is proposed by miguel sainz and alberto sanfeliu. The main purpose of this
research is to develop a system for learning and recognizing traffic signs using neural
network and syntactic methods. This paper use tow level of learning, first:
segmentation learning based on neural network, second: model learning based on
grammatical inference. Recognition process uses the results of the learning process as
input for identifying the traffic sings. The recognition of traffic sings in scene is done
in tow steps, first: the sign is located in the scene by using a connectionist
segmentation method, second: the sign is coded and analyzed to determine which
traffic sign it is. The system has been tested successfully only for the first step but the
second step is currently under development.
Approach
1. The Learning Process:
1. The input of this step is color image obtained from TV camera on the
car.
2. The human Operator decides how many different labels the system will
consider (in the traffic sign recognition problem we have considered the
following 5 labels (road-road lines- sky -grass-traffic sign)).
3. then he marks and labels some areas in a set of images "Segmentation
areas "
4. Segmentation module consist of 3 layered neural net, this net is trained by
the back propagation method.
5. Once the net is trained we perform a validation test over a set of test
images to check the learning performances, In the case of low efficiency or
non-satisfactory we can modify the samples or the net parameters to
improve the learning of Segmentation module.
11
1.2 Model Learning
1. In this level the operator marks the areas on the scenes where the
model is locate these areas called model areas
2. It is necessary to preprocess the image before the learning process
starts .
i. the preprocessing has 3 parts :
ii. Optimization of the areas.
1.2.2.2 Normalization of the sizes (to get the same information
from any of the different images area, we should
normalize the size of the model areas).
1. 2.2.3 coding into symbols the content of the sample areas (we
code each pixel of the model area into one of the
following 4 symbols red(R),white(W),black(B) , the
remaining of colors ($). improve the shape of the traffic
sign (remove holes and smoothing the contour ) by
applied morphological process) .
1.2.3 Extract the primitive chains by reading the primitives of the coded
sample.
1.2.4 Validation test is applied to evaluate how good the system ,if it is not
good the operator may restart the model learning level .
2.1 Location Of The Traffic Sign By Using Segmentation Model see figure 3 .
2.2 Morphological Process Is Applied To Remove Noise And Fill Up Grapes.
2.3 The Third Step Is The Recognition Of Traffic Sign This Step Is Divided
Into Phases
12
2.3.1 Finding A Distance Measure Between The Extracted
Symbol Chain And Each Inferred Grammar Of Traffic Sign
Models –Error correcting parser is used.
2.3.2 Analyze The Symbol Inside The Sign.
Results:
13
we see 2 road scenes on the right side and 2 segment rode scenes ,as we see the
Segmentation process gives very good results
we see traffic sign from the scenes and the results of coding them into grammar
symbols
14
Signfinder: Using Color to detect, localize and identify
Informational signs
Overview :
The Approach Is Proposed By A.L. Yuille, D. Snow, And M. Nitzberg .
The Main Purpose Of This Search Paper Is Describe An Approach That Is Able To Detect And
Locate Certain Important Classes Of Signs. The Signs Are Then Automatically Transformed
To A Standard (Frontal) Viewpoint . Most Street Signs Obey The Following Assumptions,
First :The Signs Have Stereotypical Boundary Shapes, Second : The Writing On The Sign Has
One Uniform Color And The Rest Of The Sign Has A Second Uniform Color , Third : The
Text Font Is In A Standard Font Set. We Start By Selecting Simple Tests Which Can Be Run
In Parallel Over The Image. These Tests Locate Seed Positions For Hypotheses. The Seeds
Initiate Region Growing To Segment Two Color Regions (E.G. Signs). A Specialized Edge
Detector Is Used On The Segmented Regions To Determine The Precise Location Of The Sign
Boundaries And To Con_Rm (Or Deny) That The Region Is Really A Sign. From The
Boundaries We Can Calculate An Affine Transformation To Transform The Sign To A
Standard (Frontal) Viewpoint .
Approach :
These Steps Have Been Applied On 100 Red And White Stop Signs . Steps In General Are As
Follows:
1. Get A Database Of Color Images With Signs In Them In Different Situations Of Light ,
Shadow , Vision Of The Place And Distortions
2. In Order To Locate The Position Of A Sign Inside Image ,We Must
2.1. Identify The Seed Regions (Which There Are Two Color Peaks).
In Order To Determine Seed We Use Statistical Analysis Of The Colors In Order To Learn Adjacent
Sets Of Red And White Pixels With Unknown Illuminant (I.E. For Stop Signs). And Then Apply
Multiplicative Model To Deduce The Set Of Illuminant Colors.
2.3 Tested It (I.E. Hypothesis Regions Which Obtained By Applying The Growing
Algorithm In The Previous Step) .
By Detecting Straight Line Edges (To Find Boundary , This Paper Use Specially
Tailored Edge Detectors And A Variant Of The Hough Transform To Detect The
Boundaries And The Corners Of The Sign) In Order To Obtain Information On The
Geometric Shape Of The Sign , Which Allows Us To Know The Direction Of The
15
Sign Then Normalize The Sign To A Front Parallel Viewpoint At Fixed Scale (This
Makes Reading Directly).
Results :
The Algorithm Worked At Close To One Hundred Percent Effectiveness On Our
Dataset . This Section Shows The Final Results On Some Of The More Difficult
Images In Our Database. These Involve Partial Occlusion, Heavy Shadowing, And
Difficult Illuminant Colors And Pose.
1 . Sign With Partial Occlusion
a) The image containing the occluded sign. b) The final result of the algorithm
a) The image containing the shadowed sign. b) The final result of the algorithm.
16
a) The image containing the sign at difficult viewpoint and illuminant. b) The final result of the
algorithm.
Approach:
The algorithm is divided in two modules described as follow:
17
Step 2: The Optimal Corner Detector
18
Module 1: TRAFFIC SIGN CLASSIFICATION
By using multilayer perceptron NN, the size of the input layer corresponds to an
image of 30* 30 pixels, and the output layer is ten . Present the image as the input
pattern. Two neural networks were trained because the detection algorithm is different
according to the form of the sign. The studied NN's were three, the number and
dimension of their hidden layers being different.
19
20
Fig.9 Ideal signs and test images.
Our Approach:
Module 1: concept of neural network
1-Diffentions:
A neural network is an interconnected group of neurons. The prime examples
are biological neural networks, especially the human brain.
An artificial neural network is a mathematical or computational model for
information processing based on a connectionist approach to computation.
The original inspiration for the technique was from examination of
bioelectrical networks in the brain formed by neurons and their synapses. In
a neural network model, simple nodes (or "neurons", or "units") are
connected together to form a network of nodes — hence the term "neural
network”.
Composed of many “neurons” that co-operate to perform the desired
function.
21
Predict weather.
Identify people with cancer risk.
Extrapolation based on historical data.
Classification
Image processing.
Predict bankruptcy for credit card companies.
Risk assessment.
Feature extraction, image matching.
Recognition
Pattern recognition: SNOOPE (bomb detector in U.S. airports).
Character recognition.
Handwriting: processing checks.
Data association
Not only identify the characters that were scanned but identify when
the scanner is not working properly.
Data Conceptualization
Infer grouping relationships e.g. extract from a database the names of
those most likely to buy a particular product.
Data Filtering
E.g. take the noise out of a telephone signal, signal smoothing.
Planning
Unknown environments.
Sensor data is noisy.
Fairly new approach to planning.
Noise Reduction
Recognize patterns in the inputs and produce noiseless outputs.
3- What is neuron?
I. A simple neuron:
An artificial neuron is a device with many inputs and one output.
22
For example:
The network has 2 inputs, and one output(Fig1). All are binary.
The output is
o 1 if W0I0 + W1I1 + Wb > 0
o 0 if W0I0 + W1I1 + Wb ≤ 0
We want it to learn simple OR: output a 1 if either I0 or I1 is 1.
Multile-layers:
Multilayer networks are more powerful than single-layer networks(Fig.3). For
instance,
23
a two-layer network having a sigmoid first layer and a linear second layer can
be trained to approximate most functions arbitrarily well. Single-layer
networks cannot do this.
Fig.3 Three-
Layer Network.
Supervised Learning :
The learning rule is provided with a set of examples(the training set) of proper
network behavior:
where p is an input to the network and t is the corresponding correct(target) output.
As the inputs are applied to the network, the network outputs are compared to the
targets.
unsupervised learning
24
The weights and biases are modified in response to network inputs only. There
are no target outputs available. At first glance this might seem to be impractical.
How can you train a network if you don`t know what it is supposed to do? Most
of these algorithms perform some kind of clustering operation. They learn to
categorize the input patterns into a finite number of classes. This is especially
useful in such applications as vector quantization.
25
26