Professional Documents
Culture Documents
Real-Time Eye Blink Detection Using Facial Landmarks
Real-Time Eye Blink Detection Using Facial Landmarks
80
occurance [%]
60
40
Chehra
20 Intraface
Chehra−small
Chehra Chehra
Intraface−small
Intraface Intraface 0
0 5 10 15 20 25
localization error [% of IOD]
Eye landmarks
100
80
occurance [%]
60
40
Figure 3: Example images from the 300-VW dataset
with landmarks obtained by Chehra [1] and In- Chehra
20 Intraface
traface [16]. Original images (left) with inter-ocular Chehra−small
distance (IOD) equal to 63 (top) and 53 (bottom) pix- Intraface−small
0
0 5 10 15 20 25
els. Images subsampled (right) to IOD equal to 6.3 localization error [% of IOD]
(top) and 17 (bottom).
Figure 4: Cumulative histogram of average localiza-
tion error of all 49 landmarks (top) and 12 landmarks
Quantitatively, the accuracy of the landmark de- of the eyes (bottom). The histograms are computed
tection for a face image is measured by the average for original resolution images (solid lines) and a sub-
relative landmark localization error, defined as usu- set of small images (IOD ≤ 50 px).
ally
N
100 X
= ||xi − x̂i ||2 , (2) the Intraface is always more precise than Chehra. As
κN
i=1 already mentioned, the Intraface is much more robust
where xi is the ground-truth location of landmark i to small images than Chehra. This behaviour is fur-
in the image, x̂i is an estimated landmark location by ther observed in the following experiment.
a detector, N is a number of landmarks and normal- Taking a set of all 15k images, we measured a
ization factor κ is the inter-ocular distance (IOD), i.e. mean localization error µ as a function of a face im-
Euclidean distance between eye centers in the image. age resolution P determined by the IOD. More pre-
1
First, a standard cumulative histogram of the aver- cisely, µ = |S| j∈S j , i.e. average error over set of
age relative landmark localization error was calcu- face images S having the IOD in a given range. Re-
lated, see Fig. 4, for a complete set of 49 landmarks sults are shown in Fig. 5. Plots have errorbars of stan-
and also for a subset of 12 landmarks of the eyes only, dard deviation. It is seen that Chehra fails quickly
since these landmarks are used in the proposed eye for images with IOD < 20 px. For larger faces, the
blink detector. The results are calculated for all the mean error is comparable, although slightly better for
original images that have average IOD around 80 px, Intraface for the eye landmarks.
and also for all “small” face images (including sub- The last test is directly related to the eye blink de-
sampled ones) having IOD ≤ 50 px. For all land- tector. We measured accuracy of EAR as a func-
marks, Chehra has more occurrences of very small tion of the IOD. Mean EAR error is defined as a
errors (up to 5 percent of the IOD), but Intraface is mean absolute difference between the true and the
more robust having more occurrences of errors be- estimated EAR. The plots are computed for two sub-
low 10 percent of the IOD. For eye landmarks only, sets: closed/closing (average true ratio 0.05 ± 0.05)
All landmarks Low opening ratio (ρ < 0.15)
50 0.4
Chehra Chehra
Intraface 0.35 Intraface
40
0.3
0.25
30
0.2
20 0.15
0.1
10
0.05
0
0 0 20 40 60 80 100
0 20 40 60 80 100 IOD [px]
IOD [px]
High opening ratio (ρ > 0.25)
Eye landmarks
0.4
50 Chehra
Chehra Intraface
0.35
Intraface
0.25
30
0.2
0.15
20
0.1
10 0.05
0
0 20 40 60 80 100
0 IOD [px]
0 20 40 60 80 100
IOD [px]
Figure 6: Accuracy of the eye-opening ratio as a
Figure 5: Landmark localization accuracy as a func- function of the face image resolution. Top: for
tion of the face image resolution computed for all images with small true ratio (mostly closing/closed
landmarks and eye landmarks only. eyes), and bottom: images with higher ratio (open
eyes).
90 A B 0.9
C
0.8
80
0.7
70
0.6
AUC
Precision [%]
60 0.5
50 0.4
0.3
40
0.2
30
0.1 Chehra SVM
Intraface SVM
20 0
57.38 51.6 45.9 40.2 34.4 28.7 23.0 17.2 11.5 5.7
EAR Thresholding IOD [px]
10
Chehra SVM
Intraface SVM
0 Figure 9: Accuracy of the eye blink detector (mea-
0 10 20 30 40 50 60 70 80 90 100
Recall [%] sured by AUC) as a function of the image resolution
(average IOD) when subsampling the ZJU dataset.
(a) ZJU
100
60
SVM. The algorithm runs in real-time, since the ad-
50
ditional computational costs for the eye blink detec-
40 tion are negligible besides the real-time landmark de-
tectors.
30
The proposed SVM method that uses a temporal
20 window of the eye aspect ratio (EAR), outperforms
EAR Thresholding the EAR thresholding. On the other hand, the thresh-
10
Chehra SVM
Intraface SVM olding is usable as a single image classifier to detect
0
0 10 20 30 40 50 60 70 80 90 100 the eye state, in case that a longer sequence is not
Recall [%]
available.
(b) Eyeblink8 We see a limitation that a fixed blink duration for
all subjects was assumed, although everyone’s blink
Figure 8: Precision-recall curves of the EAR thresh-
lasts differently. The results could be improved by an
olding and EAR SVM classifiers measured on (a) the
adaptive approach. Another limitation is in the eye
ZJU and (b) the Eyeblink8 databases. Published re-
opening estimate. While EAR is estimated from a 2D
sults of methods A - Drutarovsky and Fogelton [8], B
image, it is fairly insensitive to a head orientation, but
- Lee et al. [9], C - Danisman et al. [5] are depicted.
may lose discriminability for out of plane rotations.
A solution might be to define the EAR in 3D. There
4. Conclusion are landmark detectors that estimate a 3D pose (po-
sition and orientation) of a 3D model of landmarks,
A real-time eye blink detection algorithm was e.g. [1, 3].
presented. We quantitatively demonstrated that
regression-based facial landmark detectors are pre- Acknowledgment
cise enough to reliably estimate a level of eye open-
ness. While they are robust to low image quality (low The research was supported by CTU student grant
image resolution in a large extent) and in-the-wild SGS15/155/OHK3/2T/13.
References [16] X. Xiong and F. De la Torre. Supervised descent
methods and its applications to face alignment. In
[1] A. Asthana, S. Zafeoriou, S. Cheng, and M. Pantic.
Proc. CVPR, 2013. 2, 3, 4, 5
Incremental face alignment in the wild. In Confer-
ence on Computer Vision and Pattern Recognition, [17] Z. Yan, L. Hu, H. Chen, and F. Lu. Computer vision
2014. 1, 2, 3, 4, 5, 7 syndrome: A widely spreading but largely unknown
epidemic among computer users. Computers in Hu-
[2] L. M. Bergasa, J. Nuevo, M. A. Sotelo, and
man Behaviour, (24):2026–2042, 2008. 1
M. Vazquez. Real-time system for monitoring driver
vigilance. In IEEE Intelligent Vehicles Symposium, [18] F. Yang, X. Yu, J. Huang, P. Yang, and D. Metaxas.
2004. 1 Robust eyelid tracking for fatigue detection. In ICIP,
2012. 1
[3] J. Cech, V. Franc, and J. Matas. A 3D approach to
facial landmarks: Detection, refinement, and track- [19] S. Zafeiriou, G. Tzimiropoulos, and M. Pantic. The
ing. In Proc. International Conference on Pattern 300 videos in the wild (300-VW) facial landmark
Recognition, 2014. 7 tracking in-the-wild challenge. In ICCV Work-
shop, 2015. http://ibug.doc.ic.ac.uk/
[4] M. Chau and M. Betke. Real time eye tracking and
resources/300-VW/. 3
blink detection with USB cameras. Technical Report
2005-12, Boston University Computer Science, May
2005. 1
[5] T. Danisman, I. Bilasco, C. Djeraba, and N. Ihad-
dadene. Drowsy driver detection system using eye
blink patterns. In Machine and Web Intelligence
(ICMWI), Oct 2010. 1, 6, 7
[6] H. Dinh, E. Jovanov, and R. Adhami. Eye blink
detection using intensity vertical projection. In In-
ternational Multi-Conference on Engineering and
Technological Innovation, IMETI 2012. 1
[7] M. Divjak and H. Bischof. Eye blink based fa-
tigue detection for prevention of computer vision
syndrome. In IAPR Conference on Machine Vision
Applications, 2009. 1
[8] T. Drutarovsky and A. Fogelton. Eye blink detec-
tion using variance of motion vectors. In Computer
Vision - ECCV Workshops. 2014. 1, 2, 5, 6, 7
[9] W. H. Lee, E. C. Lee, and K. E. Park. Blink detec-
tion robust to various facial poses. Journal of Neu-
roscience Methods, Nov. 2010. 1, 3, 6, 7
[10] Medicton group. The system I4Control. http://
www.i4tracking.cz/. 1
[11] G. Pan, L. Sun, Z. Wu, and S. Lao. Eyeblink-based
anti-spoofing in face recognition from a generic we-
bcamera. In ICCV, 2007. 1, 2, 5
[12] S. Ren, X. Cao, Y. Wei, and J. Sun. Face alignment
at 3000 fps via regressing local binary features. In
Proc. CVPR, 2014. 2
[13] A. Sahayadhas, K. Sundaraj, and M. Murugappan.
Detecting driver drowsiness based on sensors: A re-
view. MDPI open access: sensors, 2012. 1
[14] F. M. Sukno, S.-K. Pavani, C. Butakoff, and A. F.
Frangi. Automatic assessment of eye blinking pat-
terns through statistical shape models. In ICVS,
2009. 1, 2
[15] D. Torricelli, M. Goffredo, S. Conforto, and
M. Schmid. An adaptive blink detector to initial-
ize and update a view-basedremote eye gaze track-
ing system in a natural scenario. Pattern Recogn.
Lett., 30(12):1144–1150, Sept. 2009. 1