9 Histogram of Oriented Gradients For Human Detection in Video
Abstract—Computer Vision (CV) is currently one of the most popular research topics in the world because it can support daily human life. Moreover, CV can be applied to various theories and research areas. Human detection is one of the most popular research topics in computer vision. In this paper, we present a study of a technique for human detection in video, the Histogram of Oriented Gradients (HOG), by developing an application that imports a video and detects the humans in it. We use the HOG algorithm to analyze every frame of the video to find and count people. After analyzing the video from start to end, the program generates a histogram showing the number of detected people versus the playing period of the video. The expected results were obtained, including the detection of people in the video and the generation of a histogram showing the appearance of the humans detected in the video file.

Keywords—Human Detection, Histogram of Oriented Gradients

I. INTRODUCTION

Object detection is the process of finding specific objects that humans encounter in daily life, for example whole human bodies, faces, houses, pets, motorcycles, or cars, in selected images or videos. Object detection algorithms use extracted features and learning algorithms to recognize instances of an object category. Object detection is now frequently used in applications such as security, surveillance, and advanced driver assistance systems (ADAS), and it has become a very important subject in computer vision, pattern recognition, and image analysis [1], [2], [3]. Moreover, the Internet of Things (IoT) is another popular research topic that can be applied to many research areas, especially computer vision. Consequently, research combining IoT and computer vision has increased dramatically over the past three years [4], [5], [6].

Human detection is a technology commonly used to detect people in images or videos. It is also a crucial step in video-based surveillance systems, where the aim is to identify and monitor humans for security purposes in crowded environments such as airports, bus terminals, or train stations. One example is video captured by CCTV, which is processed to detect and track the movements of humans, both the whole body and parts of it.

Another issue is road traffic accidents, which are one of the main concerns for people [4]. Driving assistance systems have been studied intensively since the late 1990s. Human detection is included as a main part of driving assistance systems and was eventually brought into automobile technology. Such a system alerts the driver in dangerous or sensitive situations, including the appearance of pedestrians on the street. For instance, the software called Mobileye, installed in the Volvo S60, is a human detection system that launched the first vision-based collision system. It works with full auto brake and pedestrian detection [8]. It can therefore reduce accidents caused by human error.

This project aims to distinguish videos in which people are present from those in which people are absent. The videos contain various combinations of people, animals, and objects. The program analyzes each video frame by frame to select only the people among the other objects. The appearance of people during playback is recorded and rendered as a histogram illustrating the relation between the playing period and the number of detected people. In addition, another video file is created showing frames that highlight the detected people, together with the live number of people detected by the program. We developed the experimental program using Python and OpenCV. We selected the Histogram of Oriented Gradients (HOG) technique because its working procedure focuses mainly on differentiating objects from the background [7].

II. RELATED WORK

Nguyen (2013) detected humans using contour-based local motion features. The method consists of two parts. The first is template generation, which uses the training data to generate the templates and the whole-human-body template, and finds the key points of each template by assigning a weight to each point. The second is a testing part, which uses a sliding window to extract candidate regions and applies Canny edge detection to find edges in those regions. Canny edge detection has four steps: noise reduction using a Gaussian filter, finding the intensity gradient of the image, non-maximum suppression, and hysteresis thresholding. The Gaussian filter blurs the image by applying a kernel. The classifier in that work is a binary Support Vector Machine (SVM) [8].

Zhou (2016) presented the four steps of spatio-temporal matching (STM) for tracking humans performing different activities in a video. First, trajectories are extracted from short video segments by densely sampling feature points in the first frame and tracking them using optical flow. Then, a set of 3D motion capture sequences is selected at random, and the motion capture segments are clustered into four temporal clusters [9].

Dalal (2006) developed a detector for standing and moving people in videos with possibly moving cameras and backgrounds. That paper combines a motion descriptor and an appearance descriptor. A
Authorized licensed use limited to: UNIVERSITY OF MANITOBA. Downloaded on June 13,2024 at 20:32:37 UTC from IEEE Xplore. Restrictions apply.
2018 5th International Conference on Business and Industrial Research (ICBIR), Bangkok, Thailand
Fig. 4. Unsuccessful result because the people are too small. No frame is shown, and the number is shown as zero.

2) Histogram Generation: The histogram generated by our program is shown in Fig. 5. It shows the number of detected people versus the playing period of the video. Moreover, this histogram also elucidates the efficiency of our detection program. According to Fig. 5, the histogram does not have a Gaussian distribution profile, that is, the shape of the probability density function of a normally distributed random variable. This suggests that our HOG program does not work properly. This obstacle was found earlier. To address it, we suggest using a simple moving average (SMA) running in parallel with our program.

[Figure 5: x-axis "period of time" (0 to 1750), y-axis "number of people" (0 to 3.0)]
Fig. 5. Histogram of the human detection via our program

B. Discussion

In the experiment, we used 10 videos for human detection with our program. The accuracy of the detection results is calculated by comparing the number reported by the program with the actual number of people in each frame. The accuracy for each of the 10 video files is expressed as an accuracy percentage, that is, the percentage of detections that are correct. The results show that the highest accuracy is 100 percent while the lowest is 60.50 percent. The average accuracy of the

[Figure 6: x-axis "Video file no." (0 to 9), y-axis "Accuracy/Percentage"; legend: Average, Standard deviation]
Fig. 6. Accuracy percentage of each video file. The average and SD are also included.

TABLE I. NUMERIC VALUE OF ACCURACY PERCENTAGE OF 10 VIDEO FILES

Video file no.    Accuracy Percentage
0                 83.00
1                 73.33
2                 91.33
3                 73.33
4                 82.17
5                 87.13
6                 100.00
7                 60.50
8                 76.83
9                 84.65

Min      Max       Average    SD
60.50    100.00    81.23      10.95

VI. CONCLUSION

We developed an application to detect people in video. The application consists of three main features: human detection, human counting, and histogram generation. For human detection, the application detects humans and shows the detection result by drawing a green frame around each detected person and reporting the number of people in each video frame. In our experiment, the average accuracy of the human detection results over the 10 videos is 81.23 percent, with an SD of 10.95 percent. The histogram is also helpful for showing how many people appear in each frame over the playing period of each video. However, the development of our program is still incomplete: its detection efficiency can be improved by using higher-resolution detection and by fitting the results with statistical theory.
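The accuracy figures and the suggested smoothing step can be illustrated with a short Python sketch. It is not the program used in this paper. It assumes that the accuracy percentage is the share of frames in which the detected count equals the actual count (one reading of the Discussion section), and it uses a trailing window for the simple moving average. The function names, the window length, and the example per-frame counts are hypothetical. Only the ten accuracy values are taken from Table I.

```python
from statistics import mean, stdev


def frame_accuracy(detected, actual):
    """Percentage of frames whose detected count equals the actual count.

    Assumption: this is our reading of the accuracy measure, not a
    definition confirmed by the paper.
    """
    if len(detected) != len(actual) or not detected:
        raise ValueError("need two non-empty sequences of equal length")
    hits = sum(d == a for d, a in zip(detected, actual))
    return 100.0 * hits / len(detected)


def simple_moving_average(counts, window):
    """Trailing SMA: each value averages up to the last `window` counts."""
    if window < 1:
        raise ValueError("window must be at least 1")
    smoothed = []
    running = 0.0
    for i, count in enumerate(counts):
        running += count
        if i >= window:
            # Drop the count that has just left the window.
            running -= counts[i - window]
        smoothed.append(running / min(i + 1, window))
    return smoothed


# Accuracy percentages of video files 0 to 9, copied from Table I.
TABLE_I = [83.00, 73.33, 91.33, 73.33, 82.17, 87.13, 100.00, 60.50, 76.83, 84.65]

# Hypothetical per-frame counts: one person in view, one missed detection.
detected = [1, 1, 1, 0, 1, 1]
actual = [1, 1, 1, 1, 1, 1]

print("accuracy (%):", round(frame_accuracy(detected, actual), 2))
print("smoothed counts:", simple_moving_average(detected, window=3))
print("Table I average:", round(mean(TABLE_I), 2))
print("Table I sample SD:", round(stdev(TABLE_I), 2))
```

The SD of 10.95 in Table I matches the sample standard deviation (divisor n - 1) of the ten listed values, which is why `stdev` rather than `pstdev` is used here. A trailing window keeps the smoothed count usable frame by frame while the video is still being processed, at the cost of a short lag.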
REFERENCES
[1] Y. Pang, K. Zhang, Y. Yuan, and K. Wang, “Distributed object
detection with linear SVMs,” IEEE Transactions on Cybernetics,
vol. 44, no. 11, pp. 2122–2133, 2014. [Online]. Available:
http://www.ncbi.nlm.nih.gov/pubmed/26020454
[2] H. Ren and Z. N. Li, “Object detection using edge histogram of oriented
gradient,” in 2014 IEEE International Conference on Image Processing
(ICIP), Oct 2014, pp. 4057–4061.
[3] C. Sun and P. Vallotton, “Fast linear feature detection using multiple
directional non-maximum suppression,” in 18th International Conference
on Pattern Recognition (ICPR’06), vol. 2, 2006, pp. 288–291.
[4] M. R. T. Hossai, M. A. Shahjalal, and N. F. Nuri, “Design of an IoT
based autonomous vehicle with the aid of computer vision,” in 2017
International Conference on Electrical, Computer and Communication
Engineering (ECCE), Feb 2017, pp. 752–756.
[5] Y. Lu, C. Lu, and C. K. Tang, “Online video object detection using
association LSTM,” in 2017 IEEE International Conference on Computer
Vision (ICCV), Oct 2017, pp. 2363–2371.
[6] D. Sangeetha and P. Deepa, “Efficient scale invariant human detection
using histogram of oriented gradients for IoT services,” in 2017 30th
International Conference on VLSI Design and 2017 16th International
Conference on Embedded Systems (VLSID), Jan 2017, pp. 61–66.
[7] A. Satpathy, X. Jiang, and H. L. Eng, “Human detection by quadratic
classification on subspace of extended histogram of gradients,” IEEE
Transactions on Image Processing, vol. 23, no. 1, pp. 287–297, 2014.
[Online]. Available: http://www.ncbi.nlm.nih.gov/pubmed/26020454
[8] D. T. Nguyen, W. Li, and P. O. Ogunbona, “Human detection
from images and videos: A survey,” Pattern Recognition, vol. 51,
no. Supplement C, pp. 148–175, 2016. [Online]. Available:
http://www.sciencedirect.com/science/article/pii/S0031320315003179
[9] F. Zhou and F. D. l. Torre, “Spatio-temporal matching for human
pose estimation in video,” IEEE Transactions on Pattern Analysis and
Machine Intelligence, vol. 38, no. 8, pp. 1492–1504, 2016. [Online].
Available: http://www.ncbi.nlm.nih.gov/pubmed/26020454
[10] N. Dalal, B. Triggs, and C. Schmid, Human Detection Using Oriented
Histograms of Flow and Appearance. Berlin, Heidelberg: Springer
Berlin Heidelberg, 2006, pp. 428–441.
[11] D. Hosotani, I. Yoda, and K. Sakaue, “Wheelchair recognition by
using stereo vision and histogram of oriented gradients (HOG) in real
environments,” in 2009 Workshop on Applications of Computer Vision
(WACV), Dec 2009, pp. 1–6.
[12] S. Guzman, A. Gomez, G. Diez, and D. S. Fernandez, “Car detection
methodology in outdoor environment based on histogram of oriented
gradient (HOG) and support vector machine (SVM),” in 6th Latin-American
Conference on Networked and Electronic Media (LACNEM 2015), Sept
2015, pp. 1–4.
[13] N. Dalal and B. Triggs, “Histograms of oriented gradients for human
detection,” in 2005 IEEE Computer Society Conference on Computer
Vision and Pattern Recognition (CVPR’05), vol. 1, June 2005, pp. 886–893.
[14] H. Bristow and S. Lucey, “Why do linear SVMs trained on HOG features
perform so well?” ArXiv e-prints, Jun. 2014.