Professional Documents
Culture Documents
Mask-Vision A Machine Vision-Based Inference System of Face Mask Detection For Monitoring Health Protocol Safety
Mask-Vision A Machine Vision-Based Inference System of Face Mask Detection For Monitoring Health Protocol Safety
Abstract— To avoid adversely affecting community health use disinfectants [3]. The question of whether or not masks
and the global economy, effective ways to limit the COVID-19 should be worn has sparked a lot of debate. The fact is that in
pandemic require constant attention. In the absence of efficient the context of appropriate infection management, no one
antivirals and insufficient medical resources, WHO preventive measure contains the golden key to disease
recommends several methods to minimize infection rates and
prevention [4]. Each act makes a major contribution to the
prevent depletion of scarce healthcare resources. One of the
non-pharmaceutical treatments that can be used to decrease the procedure and aids the others in restricting and inhibiting the
primary source of SARS-CoV2 droplets expelled by an infected spread of COVID-19 [5].
individual is to wear a mask. Irrespective of disagreements Face masks, on the other hand, are the single most
about medical resources and mask types, all governments effective preventative strategy in our pandemic-stricken
enforce the wearing of masks that cover the nose and mouth by society [6]. Even though public spaces have recently been
the general population. In the next years, the suggested mask opened for economic reasons, the number of incidents in each
detection models might be a valuable tool for ensuring that nation is quite significant. “Face masks can help protect
safety measures are followed correctly. The YOLOv3 model, a against a variety of respiratory diseases conveyed by droplets,
deep transfer learning object identification state-of-the-art
such as coronavirus and influenza.” according to a prominent
approach, is used to create a mask detection model in this
research article. The suggested model's exceptional scholar at the Johns Hopkins Center for Health Security and
performance makes it ideal for video surveillance equipment. an infectious disease expert [7]. Viruses like the coronavirus
The suggested approach focuses on creating an enhanced may be spread via the air by coughing and sneezing, or by
dataset from a 300-image dataset utilizing data augmentation touching a contaminated surface and then touching your
techniques such as image filtering. The Data augmentation- mouth, nose, or eyes before washing your hands, according to
based mask detection model's mean average precision was the expert [8]. Such droplets will not come into contact with
found to be 89.8% during training and 100% during overall one's face or mouth if one is wearing a face mask before
testing, with detection per frame accuracy ranging from 40.03% hitting the ground [9].
to 65.03%.
Despite the fact that masks are mandated to be worn in
Keywords— covid-19, face mask detection, deep learning, public locations such as malls and offices as a precaution,
object detection, yolov3. many people do not comply [10]. Most apartments and
businesses rely on guards to keep an eye on such individuals,
I. INTRODUCTION putting them at risk. The danger can be decreased by
The coronavirus disease in 2019, also known as COVID- automating the mask detection procedure. This may be
19, is caused by the severe acute respiratory syndrome accomplished by automating access to any facility for only
coronavirus 2, according to the World Health Organization those wearing masks using object detection algorithms [11].
(WHO) (SARS-CoV-2) [1]. This is an extremely contagious Object detection methods have been successfully used during
illness distributed mostly by small droplets created by the last ten years to identify a variety of items, such as military
sneezing, coughing, and speaking. Despite the fact that the guns, as well as medical applications such as malignant cell
droplets are not usually airborne, those who are in close identification and other applications [12]-[15].
proximity are more vulnerable to infection [2]. COVID-19 is The goal of this work is to create a model for mask
an extremely infectious illness that may be transmitted from identification that employs a deep learning-based object
person to person nearby. To prevent the transmission of this detection method. The suggested model is trained to utilize an
disease, keep a safe distance from others, wear a mask, and appropriate data augmentation-based pre-processing approach
II. METHODOLOGY
Object detection is a computational intelligence method
used in image processing and computer vision to detect the
presence of certain objects in a digital image or video, such as
a person, automobile, billboard, structure, and so on [16]. The
Fig. 2. Sample dataset of face mask mages by Prajna Bhandary [19].
area of object identification research in computer vision is
rapidly growing. Traditional computer vision, such as
template object matching, was used at first. Some recent
region-based algorithms include Faster Region-Based A. Dataset Gathering and Preparation
Convolutional Neural Networks (R–CNN), Fast R–CNN, R– The dataset utilized in this study as shown inf Fig. 2 was
CNN, and You Only Look Once (YOLO) [17]. The YOLO created by Prajna Bhandary [19]. The dataset contains 1,376
technique was selected for this study because it can inspect the images divided into two categories: masked images of 690 and
entire image during model evaluation and collect every detail non-masked images of 686. The proponents selected 300
about the full image and target at the same time. YOLO is also images from a collection of 690 masked images for training
considerably faster than Faster R–CNN since it is and validation images. Training and validation datasets were
programmed to do bounding box regression and classification created from the image datasets. The training dataset had 80%
at the same time, predicting localization boxes [18]. Fig. 1 of the images, whereas the validation dataset contained 20%
depicts the system workflow, which includes dataset of them.
preparation, training, validation, and testing.
B. Dataset Annotation
The researchers used software to annotate and label the
datasets, as shown in Fig. 3, to annotate and label the images.
A rectangular bounding box was constructed in the lower face
region of a person to annotate the datasets.
Fig. 4. The network architecture of YOLOv3 (Redmon et al., 2018) [18]. (1)
D. Evaluation
For the evaluation procedure, the study used the mAP
(mean Average Precision) to compare the trained model to
ensure the selection of the right one for the model inference
detection. It will generate files that will be used for Fig. 5. Model training chart.
Fig. 6. Model evaluation chart.