Professional Documents
Culture Documents
Automated Accident Detection System: Charles Harlow and Yu Wang
Automated Accident Detection System: Charles Harlow and Yu Wang
Automated Accident Detection System: Charles Harlow and Yu Wang
The development of a system for automatically detecting and reporting Consider the problem of determining vehicle accidents from
traffic accidents at intersections was considered. A system with these acoustic signals. The problem of identifying vehicle accidents from
properties would be beneficial in determining the cause of accidents and acoustic records may be stated generically as an object identifica-
could also be useful in determining the features of the intersection that tion problem. The objects to be recognized in this situation are
have an impact on safety. A complete system would automatically detect vehicle accidents that must be distinguished from the background
and record traffic conditions associated with accidents such as time of traffic and other environmental noise. The 1-D signal in this case
the accident, video of the accident, and the traffic light signal controller would be audio measurements of vehicular traffic. An important
parameters. The basic research required to develop the system is con- step in object identification is to determine measurements that can
sidered. This involves developing methods for processing acoustic sig- form the basis for a classification system. These measurements
nals and recognizing accident events from the background traffic form a feature vector that is the input to a classification system. It
events. A database of vehicle crash sounds, car braking sounds, con- is necessary for each object to be identified to have distinguishable
struction sounds, and traffic sounds was created. The mel-frequency feature vectors. In this application, it is necessary for accidents to be
cepstral coefficients were computed as a feature vector for input to the detected from the feature vector extracted from the acoustic signal.
classification system. A neural network was used to classify these fea- There are several factors that complicate the object recognition
tures into categories of crash and noncrash events. The classification process. One problem is the communication channel that will vary
testing results achieved 99 percent accuracy. even if the object is the same. That is, sensors placed in different
environments will behave somewhat differently. They will produce
different signals for the same vehicle crash. Different traffic inter-
In this research the authors consider the development of a system sections and different weather conditions will affect the signals and
for automatically detecting and reporting traffic accidents. Such a distort the feature vectors for the objects. Another complication is
device could be useful at high-traffic intersections where accidents that objects in the same class, for example, collisions of the same
are likely to occur. These intersections will usually have signal type, will produce somewhat different signals. Finally, noise and
lights for traffic control. A complete system would automatically multiple signals from different sources will distort the signals.
detect and record traffic conditions associated with accidents such There are many different sources of acoustic signals that may occur
as time of the accident, video of the accident, and the traffic light at different intersections. These include normal traffic sounds, large
signal controller parameters. A system with these properties would trucks, and other vehicles that produce loud sounds in normal opera-
be beneficial in determining the cause of accidents. The informa- tion. There may also be construction, maintenance, and industrial
tion could also be useful in determining features of the intersection
activity in the vicinity. In addition, various types of accidents may
that have an impact on safety.
occur. Vehicles may crash into other vehicles or stationary objects.
In this project the basic research required to develop a system to
Vehicles may have head-on, rear-end, or side collisions. There may
detect accidents was considered. This involved developing methods
be braking sounds, tires skidding, and horns.
for processing acoustic signals and recognizing accident events from
the background traffic events that occur at intersections. If methods
can be developed for reliably recognizing accident events at intersec-
tions, then a system can be developed with complete functionality. An BACKGROUND
acoustic system was chosen for its simplicity and cost-effectiveness.
It is also the case that accidents create distinctive sound events. Most of the work in classifying acoustic signals has been done in
At the present time a device to detect accidents and report traffic association with speech recognition (4, 5). A number of methods
signal conditions does not exist. Work has been conducted on video have been explored for extracting measurements from the acoustic
systems to monitor traffic conditions and provide input to signal signal. A common method used is the mel-warped cepstra trans-
controllers for traffic control (1, 2). Video systems have also been form. The mel-scale is based on the nonlinear human perception of
used for incident detection (3). Incidents are detected when a vehi- sound frequencies (6, 7). Cepstral coefficients are often the pre-
cle stops or dramatically reduces its speed. The system proposed in ferred measurements extracted from an acoustic signal for speech
this work will seek to determine accidents directly from the acoustic recognition (4).
signal of the accident. The system is relatively simple because an Acoustic data have been used in traffic monitoring (1). The goal was
acoustic signal rather than a video signal is being processed. The to count the vehicles in traffic patterns on roadways. The system was
accident detection system can operate in a stand-alone configuration found to perform better when located closer to the traffic and away
or, alternately, provide complementary information when other from echo-producing structures such as bridges. It was affected
sensors are available. somewhat by cold weather. High traffic volumes caused under-
counting. Other technologies such as passive infrared, radar, and
Department of Electrical and Computer Engineering, Louisiana State University, Doppler microwave had better performance at vehicle counting.
Baton Rouge, LA 70803. Although acoustic technology is not the best for counting vehicles, it
Harlow and Wang Paper No. 01-2453 91
has a role in detecting accidents. Accidents are more likely to have a Feature Extraction
distinctive acoustic signal than individual vehicles in a traffic stream.
Consider the problem of extracting features from the acoustic signal.
An audio signal can be characterized by the short-time Fourier spec-
trum of a signal (6, 10). The short-time Fourier transform (STFT) is
METHODOLOGY defined as
This section provides descriptions of the data used in the study and ∞
the manner in which they were acquired, the features that were
extracted from the acoustic samples, and the neural network classi-
sˆ( w ) = ∫ s(t )g(t ) exp(− j 2πtw)dt
−∞
fication approach.
where g(t) is a windowing function. The function g should have
support over a finite time interval so that it localizes the response
in time. One common form for g is the Gaussian function
Vehicle Accident Database Formation
(t − µ )
2
The authors developed an acoustic database of traffic sounds, con- g(t ) = exp −
struction sounds, and accident sounds. The information contained in 2σ 2
the database consists of accident location, acoustic records, a descrip-
tion of the event (type of accident) and vehicles involved, and envi- The Hamming window (11) where
ronmental information. The equipment used in the field for recording
2 πn
vehicle acoustic signals consisted of sound-recording equipment and g(n) = 0.54 − .46 cos 0 ≤ n < N
a form to manually enter information related to the event. The N − 1
acoustic equipment consisted of a Larson-Davis Model 712 sound
level meter, Electro-Voice RE55 microphones, and a Sony TCD-D8 is often used in acoustic processing.
DAT Walkman. The sound data were recorded on the DAT recorder A common method used for feature extraction is the mel-warped
at 44.1 kHz. cepstra transform. Let FFT be the fast Fourier transform and FFT −1
Acoustic data of accidents are difficult to obtain. Arrangements be its inverse. Let s be the signal over a frame, then the cepstra
were made with the Texas Transportation Institute (TTI) Crash Test transform is cep(s) = FFT −1(log|FFT(s)|). Since (log|FFT(s)|) is real
Facility located in College Station, Texas, to collect data of vehicle and symmetric, the FFT and FFT−1 are equal up to a multiplicative
crashes. The authors recorded traffic sounds at intersections and con- constant. The cepstrum is, therefore, the spectrum of the log of the
struction sounds from construction sites in Baton Rouge to complete spectrum. The mel-warped spectrum is obtained by attenuating
the database. The traffic sounds were recorded at an intersection with high frequencies in (log|FFT(s)|) before taking the inverse trans-
a high volume of traffic, including a substantial amount of large truck form. The mel-scale is based on the nonlinear human perception
traffic. The total amount of data available for analysis and testing con- of sound frequencies (6, 7 ). Generally, about 10 to 14 of the low
sisted of 28 crash sounds from vehicle crashes, 2 car braking sounds, cepstral coefficients are required for recognition. The use of the
8 sounds of a pile driver at a construction site, and 200 sound records cepstra has often proved an effective approach in audio object
of routine traffic at intersections. The crash sounds consisted of small recognition tasks (5).
cars, regular-size cars, pickups, and large delivery trucks hitting crash Figure 1 shows the sound prms signal from a vehicle crash. The
barriers of different types and, in some cases, a rear-end collision. data here have been significantly reduced from the raw sound data
Because the accident data were recorded at a crash test facility, the collected for plotting purposes. Figure 2 is a plot of a sound signal
word “crash” is often used to refer to an accident.
Sound data are often processed in the root mean square (rms) of the
sound signal pressure p(t). The form is
1
∫
T2
prms = p 2 (t )dt
T2 − T1 T1
p
Lp = 20 log10
p0
p
Leq = 20 log10 rms
p0
The features were extracted from the frames in the order that the Acoustic detectors were affected by snow, extreme cold, and heavy
frame appears in the sound record. Each set of features extracted from traffic and may be affected by location (1). The acoustic sensor must
a frame was fed into the network for classification. Weak sound sig- be located near the intersection. The acoustic sensor was located
nals were detected and ignored. Some of the frames of the crash sig- near the signal controller box. It would be useful to obtain more data
nal may have been mistakenly classified because the training samples under a wide variety of conditions. Building a prototype system and
do not represent all possible patterns of a crash. Thus, a method was installing it at intersections for further data collection and evalua-
employed to remove the misclassified parts of the signal and give a tion could accomplish this. The classification system has been con-
correct final decision. The total number of frames labeled a crash and structed so that it can be readily trained as new data are obtained.
the total number of frames labeled a noncrash were computed. If the Larger training sets would make the classification more reliable.
total number of frames labeled a crash sound were greater than the The system should be cost-effective. The cost of a system
number of frames labeled noncrash, then the signal for classification depends very much on the vendor and the size of the market. Any
purposes was considered a crash sound. Otherwise, it was considered modern PC or digital signal processing unit could meet the pro-
a noncrash sound. cessing requirements. The necessary computing and sound equip-
In the final experiment, the 100-ms frame length was used. The ment can be purchased for under $1,000, with the price decreasing
training samples were selected such that the number of training and every year.
crash sounds was the same. This experiment used 16 crash sounds,
16 traffic sounds, 1 braking sound, and 1 pile-driving sound for train-
ACKNOWLEDGMENTS
ing. Once the neural network was trained, the performance of the net-
work was tested by using the test data. The test data had 184 traffic This work was supported by the Louisiana Transportation Research
samples, 12 crash samples, and 8 other sounds. The classification Center. The Texas Transportation Institute Crash Test Facility located
errors are shown below: in College Station, Texas, assisted the authors in obtaining acoustic
Category Crash Sounds Traffic Sounds Other Sounds data of vehicle crashes. Dick Zimmer allowed the authors to collect
Crash Sounds 0 0 0
data at the facility, and Richard Badillo helped with the data collec-
Traffic Sounds 2 0 0 tion. James Manning, who works with the city of Durham, North
Other Sounds 0 0 0 Carolina, suggested the value of an acoustic system for detecting
traffic accidents.
The detection rate (true positives) was 100 percent. All the crash
sounds were correctly classified. The test of the traffic samples has
only two mistakenly classified samples. The other data sounds, REFERENCES
including car braking and pile-driving sounds, were classified with-
out any error. The false alarm rate (false positives) was 1 percent; 1. Kranig, J., E. Minge, and C. Jones. Field Testing of Urban Vehicle Oper-
two traffic sounds were classified as crash sounds. ations Using Non-Intrusive Technologies. Report FHWA-PL-97-018.
Minnesota Department of Transportation, St. Paul, Minn., 1997.
The authors processed data in 3-s intervals and classified the data 2. Coulombe, R. F. Trends and Applications in CCTV, Part One: The
as crash or noncrash events. The false alarm rate was low, but there USA. Traffic Technology International, Oct./Nov. 1999, pp. 29–33.
was nevertheless potential for generating a substantial number of 3. Ritchie, S. G., B. Abdulhai, A. E. Parkany, J.-B. Sheu, R. L. Cheu, and
S. I. Khan. A Comprehensive System for Incident Detection on Freeways
false alarms. For example, if one assumes that 30 percent of the sig-
and Arterials. Intelligent Transportation: Serving the User Through
nals have sufficient acoustic power to be classified, then one would Deployment. Proc., 1995 Annual Meeting of ITS America, Washington,
process 360 acoustic records in 1 h. With a false alarm rate of 1 per- D.C., 1995, pp. 617–622.
cent, this could generate four false alarms per hour. The classifica- 4. Mammone, R. J., X. Zhang, and R. Ramachandran. Robust Speaker
Recognition. IEEE Signal Processing Magazine, Sept. 1996, pp. 58–71.
tion of a sound as an accident would imply that relevant information 5. Gish, H., and M. Schmidt. Text-Independent Speaker Identification.
about the accident such as traffic signal settings would be recorded. IEEE Signal Processing Magazine, Oct. 1994, pp. 18–32.
If an actual accident had been reported, then the data would be avail- 6. Rabiner, L. R., and B. H. Juang. Fundamentals of Speech Recognition.
able for review. If no accident had been reported, this might still Prentice-Hall, Englewood Cliffs, N. J., 1993.
7. Stevens, S. S., and J. Volkmann. The Relation of Pitch of Frequency:
mean that an accident had occurred. For safety studies, one would A Revised Scale. American Journal of Psychology, Vol. 53, 1940,
need to review the data to determine the actual conditions that pp. 329–353.
caused the sound event. It is possible that the false alarm rate may 8. Couvreur, C. Environmental Sound Recognition: A Statistical Approach.
need to be further reduced. Experience with the system would be Ph.D. dissertation. Faculte Polytechnique de Mons, Mons, Belgium, 1997.
9. Harlow, C. A. Automated Accident Detection System. Final Report.
required to determine the required rate. Louisiana Transportation Research Center, Baton Rouge, La., July 1999.
10. Deller, J. R., J. G. Proakis, and J. H. L. Hansen. Discrete-Time Processing
of Speech Signals. Macmillan, New York, 1993.
CONCLUSIONS 11. Pratt, W. K. Digital Image Processing. John Wiley & Sons, New York,
1991.
This project has demonstrated a promising approach for developing 12. MathWorks. User’s Guide to Neural Network Toolbox. Math Works
an acoustic system for automatically detecting and reporting traffic Inc., Natick, Mass., 1991.
13. Fausett, L. Fundamentals of Neural Networks: Architectures, Algorithms,
accidents at intersections. The system needs to be further evaluated and Applications. Prentice-Hall, Englewood Cliffs, N.J., 1994.
in situations with normal traffic flow and accident occurrences. The
data were not obtained under a wide variety of environmental con- The contents of this paper reflect the views of the authors, who are responsible
ditions as would be encountered in practice because crash tests are for the facts and accuracy of the data presented. The contents do not necessarily
reflect the official views or policies of the Louisiana Department of Transportation
not normally conducted under adverse weather conditions. It has and Development or the Louisiana Transportation Research Center. This report
previously been observed in tests of equipment used for counting does not constitute a standard, specification, or regulation.
vehicles that the equipment is affected by environmental conditions, Publication of this paper sponsored by Committee on Safety Data, Analysis, and
traffic conditions, lighting conditions, and intersection geometry. Evaluation.