Professional Documents
Culture Documents
2007, Video Recog. Systems - Rail & Transit
2007, Video Recog. Systems - Rail & Transit
2007, Video Recog. Systems - Rail & Transit
“Computer Vision allows computers to see. Rail and Urban Transit Security Workshop
Perceptual Vision allows computers to understand what they see. ” Montreal
November 2007
Since 2004: ** USA DTO/VACE (Disruptive Technology Office/Video Analysis and Content Extraction)
** NRC/IIT/Video Recognition Systems
IEEE-archived Intern. Workshops on + NRC/Administrative Services and Property Management Branch / Security Operations
Video Processing and Recognition (VideoRec’08 - in * NRC/ Institute for Aerospace Research/Flight Research Laboratory
* CRC (Industry of Canada, Communications Research Centre)/Advanced Video Systems
Windsor, May 27-30, 2008) ** CRIM (Computer Research Institute of Montreal)
Goal: Focus academic effort on newly emerged area. + CBSA (Canada Border Services Agency)/Laboratory and Scientific Services Directorate
+* RCMP/ Surveillance Technology Section / Covert Video (CV), Remote Sensing
Technologies (RST) and Special Purpose Vehicle (SPV) units
+ RCMP/ Technical Security Branch
Ottawa, June 5, 2007: +* DRDC/Automated Intelligent Systems/UAV
First federal departments meeting on +* DRDC/Network Information Operations Section
+* DRDC/Centre for Operations Research & Analysis (CORA)
Deploying Video Technologies for National Security +* CPRC (Canadian Police Research Center)
(VT4NS’07) + Transport Canada / Security Technology / Security and Emergency Preparedness
+ Office of the Privacy Commissioner of Canada
+ DND/Forces (several depts.)
Goal: Discuss the ways to synchronize the effort in developing VT * VT developers
+ VT users
solutions and setting VT standards for the new century within GoC.
9. Video Recognition Systems (Dmitry Gorodnichy) 10. Video Recognition Systems (Dmitry Gorodnichy)
• Term “Motion-based” is coined to make people believe – Changing light / weather (esp. in 24/7 monitoring)
that video recognition is happening, which is not! • Wind, precipitations
– Against sun/light, out of focus, blurred, thru glass
• It’s actually illumination-change-based, as it uses • Reflections, diffraction, optical interferences
simple pixel brightness comparison: – Image transmission, compression losses
| Bij(t) – Bij(t-1) | > N for K pixels Î “alarm”
– Which often happens not because of motion!
• Light changes
• Noise
– Especially: Outdoors & in long-term monitoring
21. Video Recognition Systems (Dmitry Gorodnichy) 22. Video Recognition Systems (Dmitry Gorodnichy)
Example: ACE Surveillance™ technology - Based on recent advances in object detection / tracking.
- Replaces video clips with annotated JPG images
– Compresses 1 Gb of video into 2 Mb of easy to browse still images
(can hold several years of evidence on a single computer).
– Shown annotations: size, velocity, colour of detected objects.
- Enables new Zoom-on-Evidence™ browsing
23. Video Recognition Systems (Dmitry Gorodnichy) 24. Video Recognition Systems (Dmitry Gorodnichy)
Object Detection and Motion-based capture
Tracking results
Ready to
1. For existing CCTV systems
• Works with stationery cameras only
2. For security desks with a computer
• Upto 8 cameras on a single (3GHz / 2Gb RAM) pc
Zoom-on-the-evidence™ Demo
Browsing
Back Door Entry Delivery Entry
31. Video Recognition Systems (Dmitry Gorodnichy) 32. Video Recognition Systems (Dmitry Gorodnichy)
Future trends
60 And what?
By humans
40
Face Recognition Vendor Test (www.frvt.org) is still seen: “in
By computers
making the video data of better quality”
20
0
In In Approaching NEW problem with OLD tools ?
photos video
Instead of developing approaches which can deal with low-
While humans easily recognize a person in video (with faces < 40 pixels!), resolution data
computer performance on video is much worse than that on photos!
37. Video Recognition Systems (Dmitry Gorodnichy) 38. Video Recognition Systems (Dmitry Gorodnichy)
Photos: Video:
- High spatial resolution - Low spatial resolution
Photographic facial data and video-acquired facial - No temporal knowledge - High temporal resolution
data are two different image-based modalities ( Individual frames of poor quality)
E.g. faces:
– different nature of data 1. in controlled environment 1. in unconstrained environment
(similar to fingerprint (in a “hidden” camera setup)
– different biometrics
registration) 2. don’t look into camera, don’t
– different approaches 2. “nicely” forced-positioned even face camera
– different testing benchmarks 3. 60 pixels IOD 3. 10-20 pixels IOD
(IOD = intra-ocular distance)
Adopted by ICAO’02 for • Psychological study: people recognize faces starting from
passport-type documents IOD > 10 pixels
(used in Canada, USA, EU)
• Good news (2002): computers can also detect faces
– with i.o.d >= 10 pixels
• One picture per person
– in poor illumination,
• IOD=60 (Width=120 pixels)
– with different orientations: +/- 45o
– different facial expressions
Used
- To store faces in databases
- In recognition algorithms