This document discusses a study that developed a deep learning model to detect monkeypox skin lesions. The researchers first created a dataset of monkeypox, chickenpox, and measles skin lesion images. They then trained several pre-trained convolutional neural network models on this dataset. The best performing model was ResNet50, which achieved an accuracy of 82.96% on the monkeypox detection task. The researchers also created a prototype web application using this model to potentially help with early detection and contact tracing of monkeypox cases. However, they note that a larger, more diverse dataset is still needed to improve the model's generalizability.
This document discusses a study that developed a deep learning model to detect monkeypox skin lesions. The researchers first created a dataset of monkeypox, chickenpox, and measles skin lesion images. They then trained several pre-trained convolutional neural network models on this dataset. The best performing model was ResNet50, which achieved an accuracy of 82.96% on the monkeypox detection task. The researchers also created a prototype web application using this model to potentially help with early detection and contact tracing of monkeypox cases. However, they note that a larger, more diverse dataset is still needed to improve the model's generalizability.
This document discusses a study that developed a deep learning model to detect monkeypox skin lesions. The researchers first created a dataset of monkeypox, chickenpox, and measles skin lesion images. They then trained several pre-trained convolutional neural network models on this dataset. The best performing model was ResNet50, which achieved an accuracy of 82.96% on the monkeypox detection task. The researchers also created a prototype web application using this model to potentially help with early detection and contact tracing of monkeypox cases. However, they note that a larger, more diverse dataset is still needed to improve the model's generalizability.
Shams Nafisa Ali1 , Md. Tazuddin Ahmed1 , Joydip Paul1 , Tasnim Jahan1 , S. M. Sakeef Sani1 , Nawsabah Noor2 , Taufiq Hasan1 , Senior Member, IEEE 1 mHealth Lab, Department of Biomedical Engineering, Bangladesh University of Engineering and Technology 2 Popular Medical College, Dhaka, Bangladesh. Email: taufiq@bme.buet.ac.bd
Abstract—The recent monkeypox outbreak has become a
public health concern due to its rapid spread in more than 40 countries outside Africa. Clinical diagnosis of monkeypox in an arXiv:2207.03342v1 [cs.CV] 6 Jul 2022
early stage is challenging due to its similarity with chickenpox
and measles. In cases where the confirmatory Polymerase Chain Reaction (PCR) tests are not readily available, computer-assisted detection of monkeypox lesions could be beneficial for surveil- lance and rapid identification of suspected cases. Deep learning methods have been found effective in the automated detection of skin lesions, provided that sufficient training examples are available. However, as of now, such datasets are not available for the monkeypox disease. In the current study, we first develop the “Monkeypox Skin Lesion Dataset (MSLD)” consisting skin lesion images of monkeypox, chickenpox, and measles. The images are mainly collected from websites, news portals, and publicly Fig. 1. A flow-diagram of the proposed monkeypox detection system. A accessible case reports. Data augmentation is used to increase the prototype web-app is developed that incorporates the developed deep learning sample size, and a 3-fold cross-validation experiment is set up. In models to detect monkeypox from skin lesions uploaded by users. the next step, several pre-trained deep learning models, namely, VGG-16, ResNet50, and InceptionV3 are employed to classify monkeypox and other diseases. An ensemble of the three models available. Although the case fatality ratio has been reported is also developed. ResNet50 achieves the best overall accuracy of to be 3–6% for the recent outbreak [2], early detection of 82.96(±4.57%), while VGG16 and the ensemble system achieved monkeypox, corresponding contact tracing, and isolation are accuracies of 81.48(±6.87%) and 79.26(±1.05%), respectively. A prototype web-application is also developed as an online essential to limit the community transmission of the virus. monkeypox screening tool. While the initial results on this limited In this scenario, AI-based automated computer-aided systems dataset are promising, a larger demographically diverse dataset is may substantially limit its global spread. required to further enhance the generalizability of these models. In recent years, the multi-faceted applications of deep learn- ing (DL), particularly the variations of Convolutional Neural Index Terms—Computer-aided diagnosis, Skin lesion detection, Monkeypox, Deep learning. Networks (CNNs), have revolutionized different domains of medical science due to their superior learning capability [3], [4]. When trained with a large number of data, these deep I. I NTRODUCTION networks can process images in different layers, automatically As the world recovers from the COVID-19 pandemic, the extracting salient features and learning to identify the optimal recent multi-country outbreak of monkeypox has raised con- representations for specified tasks [5]. However, the require- cerns in global communities. The World Health Organization ment for large amounts of data and time-consuming training (WHO) declared that the outbreak poses a moderate risk to with dedicated computational resources limits the applicability global public health and has stopped short of declaring it a of DL-based frameworks [6]. While using accelerators (e.g., public health emergency. However, healthcare organizations GPU, TPU) resolves the time and resource-related issues, such as World Health Network (WHN) expressed a heightened the dataset-related concerns persist due to the difficulty of concern [1] and highlighted the need for immediate and obtaining unbiased, homogeneous medical data. Data augmen- concerted global action against the disease. tation [7] is a well-known method of increasing the dataset size Monkeypox is a zoonotic disease from the genus Or- by generating additional samples through slight modifications thopoxvirus. It closely resembles chickenpox, measles, and of existing data. In case of scarcity of data, transfer learning [6] smallpox regarding clinical features [2]. The minor differences is also a commonly used technique. This method utilizes a in the skin rash of these diseases, coupled with the relative CNN model pre-trained on a large dataset (e.g., ImageNet) rarity of monkeypox have made the early diagnosis of this and transfers its knowledge for context-specific learning on a condition very challenging for healthcare professionals. On different, comparatively smaller dataset. the contrary, the confirmatory PCR test is also not widely Currently, there is no publicly available monkeypox skin TABLE I D ISTRIBUTION OF THE M ONKEYPOX S KIN L ESION DATASET (MSLD)
No. of Original No. of Unique No. of Augmented
Class label Images Patients Images Monkeypox 102 55 1428 Others 126 107 1764 Total 228 162 3192
non-endemic countries with no epidemiological connections
to endemic areas, the disease was declared a moderate global health risk as assessed by the WHO [2]. Fig. 2. The Monkeypox lesion through its various stages: (a) early vesicle, Monkeypox is usually a self-limited disease; the symptoms (b) small pustule ( 2mm), (c) umbilicated pustule ( 3-4mm), (d) ulcerated last 2 to 4 weeks. Children are mostly affected severely. lesion ( 5mm), (e) crusting of a mature lesion, (f) partially removed scab. The severity of the disease depends on the extent of virus exposure, the health status of the patient, and the complica- tions. The virus’s incubation period is in the range of 5 to lesion dataset for developing automated detection algorithms. 21 days [2]. The invasion period (0-5 days) is characterized There are impediments considering privacy and validity con- by the symptoms of fever, lymphadenopathy (swollen lymph cerns. Moreover, the high prevalence of monkeypox in the nodes), myalgia (muscle ache), asthenia (physical weakness), under-developed African regions may introduce a bias in the and severe headache. The rash begins within 1-3 days of fever dataset because of very high inter-class similarity and intra- onset and is usually noticed on the face, palms of the hands, class variability. In this paper, we first introduce the “Mon- and soles of the feet. However, it can also be found on the keypox Skin Lesion Dataset (MSLD)”1 , an openly accessible mouth, eyes, and genitals [2]. In the skin eruption phase (2-4 dataset containing web-scrapped images of different body weeks), the lesions follow the four-stage progression; macules parts (face, neck, hand, arm, leg) of patients with monkeypox (lesions with a flat base) to papules (raised firm painful lesions) and non-monkeypox (measles, chickenpox) cases. We also to vesicles (filled with clear fluid) to pustules (filled with pus) present a DL-based preliminary feasibility study leveraging followed by encrustation. transfer learning involving VGG16 [8], ResNet50 [9] and The gold standard diagnosis of monkeypox is done through InceptionV3 [10] architectures to explore the potential of deep histopathology and virus isolation. In addition, a PCR test learning models for the early detection of the monkeypox can help for the confirmation of the diagnosis [12]. However, disease. Additionally, we have soft-launched a web-app2 built when these diagnostic tools are not readily available, early on the open-source streamlit framework that is capable of diagnosis based on clinical examination of the skin lesions can analysing the uploaded image and predicting whether a subject be beneficial. With the ubiquitous availability of smartphones, is a potential monkeypox suspect and he/she should consult AI-based skin lesion detection systems can assist in diagnosis a physician on an urgent basis or not. Our current working and bridge the gap in the healthcare systems. pipeline is graphically represented in Fig. 1. III. DATASET P REPARATION II. BACKGROUND Our monkeypox skin lesion dataset is primarily compiled Monkeypox virus, an enveloped double-stranded DNA from publicly available case reports, news portals, and web- virus, is one of the members of the Poxviridae family, which is sites through extensive manual searching. We did not use closely related to the variola virus [2]. Various animal species, automatic web-scrappers. In this work, we mainly focus including squirrels, Gambian pouched rats, dormice, and non- on distinguishing the ‘Monkeypox’ cases from similar non- human primates, were first identified as the natural hosts of monkeypox cases [2]. Thus, we include skin lesion images of monkeypox virus [2]. The first human case of monkeypox was chickenpox and measles as the ‘Others’ class and prepare the confirmed in the Democratic Republic of Congo (DRC) in dataset for binary classification 1970, and subsequently, it has been endemic in the tropical All the skin lesion images were verified using Google’s rainforests of west and central Africa [2]. Reverse Image Search and cross-referenced with other sources. Since the beginning of 2022, monkeypox outbreaks have Through a 2-stage screening process, the out-of-focus, low- been reported in diverse countries across the globe. Currently, resolution, and low-quality images were discarded, and only 5135 laboratory-confirmed cases have been reported from 66 the unique images that satisfy the quality requirements were member states, including the Regions of America, Europe, selected. Next, the images were cropped to their region of Africa, Eastern Mediterranean, and Western Pacific (updated interest and resized to 224 × 224 pixels while maintaining the 30 June 2022) [11]. After the rapid spread of the disease in aspect ratio. We have collected a total number of 228 images, among 1 https://github.com/mHealthBuet/Monkeypox-Skin-Lesion-Dataset which 102 belongs to the ‘Monkeypox’ class and the re- 2 https://monkey-pox-detector-mhealthlab.herokuapp.com/ maining 126 represents the ‘Others’ class (i.e., chickenpox Fig. 3. Sample images from the collected dataset. Images from the ‘Mon- keypox’ class are shown on the left panel, while the ‘Others’ class (measles and chickenpox) images are shown on the right. Fig. 4. An original monkeypox skin lesion image and its 13 augmented versions used for training. These created as follows: (1) Random rotation by a multiple of 90 degrees, (2) Random rotation in the range of -45 to 45 and measles). A few representative samples are shown in degrees, (3) Translation, (4) Reflection, (5) Shear, (6) Hue jitter, (7) Saturation Fig. 3. Since this is the early phase of the monkeypox jitter, (8) Brightness jitter, (9) Contrast jitter, (10) Salt and Pepper Noise, (11) outbreak, reliable data are scarce. Hence, to aid the classi- Gaussian Noise, (12) Synthetic Blur and (13) Scaling. fication task, several data augmentation methods, including rotation, translation, reflection, shear, hue, saturation, contrast B. Implementation Details and brightness jitter, noise, and scaling, have been applied to increase the dataset size. Post-augmentation, the number Input images with dimensions (224, 224, 3) were fed into of images increased by approximately 14-folds, with the the selected pre-trained models. The fully connected layers classes ‘Monkeypox’ and ‘Others’ consisting of 1428 and 1764 were removed. Although the architectures have varying depths, images, respectively. The augmented images are also provided we experimented with keeping different numbers (4/6/8/12) in a separate folder in the dataset to ensure reproducibility. of bottom layers trainable to ensure homogeneity and better However, based on their requirement, the users may also apply generalization. After extensive experimentation, we decided to different types of image augmentor on the original images. A unfreeze the bottom eight layers. Next, we flatten the backbone detailed distribution of the dataset is provided in Table I. model’s output, followed by three blocks of fully connected (FC) layers, and dropout to the network. The FC layers had IV. E XPERIMENTS successively 4096, 1072 and 256 nodes while the dropout factors were respectively 0.3, 0.2 and 0.15. Finally, an FC A. Pre-trained Models layer with two nodes was employed with a softmax activation In this work, we select three well-known CNN architectures function for this binary classification task. namely, VGG16 [8], ResNet50 [9] and InceptionV3 [10], pre- The network architectures were implemented in Keras and trained on the ImageNet dataset. These models are chosen as were accelerated using Nvidia K80 GPUs provided by Kaggle they have demonstrated excellent classification performance notebooks. The batch size was set to 16. The adaptive learning across different fields of computer vision, including medical rate optimizer (Adam) with an initial learning rate of 10−5 and images through transfer learning. binary cross-entropy loss function was employed for training. VGG16 is one of the first networks to investigate CNN’s C. Experimental Design depth to improve accuracy in large-scale image recognition tasks [8]. It consists of 3 × 3 convolutional filters applied in A three-fold cross-validation experiment was conducted. several layers. The concept of utilizing Factorized Convolution The original images were split into training, validation, and enabled feature extraction from increased depth but simultane- test set(s) with the approximate proportion of 70:10:20 while ously ensured that the model did not overfit the training data. The ResNet model was formulated based on the concept of residual learning [9]. It consists of residual modules where convolution operations are followed by Batch Normalization and ReLU non-linearity. These blocks enable the inputs to forward propagate faster and extract features more efficiently. Because of the “stacked” Inception modules, despite having fewer parameters than VGG16, the Inception architecture can perform better back-propagation [10]. Since its first release, several changes have been made to the original architecture, Fig. 5. An overview of the train-test split used for 3-fold cross-validation. and improved versions have been released. Data augmentation is used for the training and validation set only. have used the pre-trained weights from the ImageNet dataset for transfer learning which does not contain any skin lesion images. Thus, the accuracy and generalizability of the system could be further improved by using a multi-source dermato- scopic image dataset for pre-training the models. Furthermore, the dataset is created primarily by web-scraping, which lacks relevant meta-data that is vital for diagnosis, e.g., patient’s clinical history, number of days since disease onset, and the disease stage. A more consorted effort and international collaboration is needed to collect a larger dataset that can provide generalizable results across different demographics. Fig. 6. The interface of the online monkeypox screening tool. VI. C ONCLUSION In this study, we have presented the open-source “Mon- maintaining patient independence. According to the commonly keypox Skin Lesion Dataset (MSLD)” for automatic de- perceived data preparation practice, the training and validation tection of monkeypox from skin lesions and performed an images were augmented while the test set contained only the initial feasibility study using state-of-the-art deep learning original images (see Fig. 5). Accuracy, precision, F1-score, architectures (VGG16, ResNet50, InceptionV3) leveraging the sensitivity are used as performance metrics. transfer learning approach. Despite being a small dataset, the promising results obtained after 3-fold cross-validation reveal D. Results the potential to use AI-assisted early diagnosis of this disease. We evaluated the performance of the selected pre-trained We are hopeful that this dataset will create new avenues for models for the 3-fold cross-validation experiment. The ob- the researchers in terms of developing remotely deployable tained results are summarized in Table II. While ResNet50 computer-aided diagnostic tools for wide-scale screening and yields the best accuracy (82.96±4.57%), VGG16 shows com- early detection of monkeypox, especially in cases where tradi- petitive performance (81.48 ± 6.87%). We also implement an tional testing methods are unavailable. We also believe that our ensemble of the 3 models by employing majority voting. Based soft-launched web-app prototype will assist the monkeypox on averaged performance across the 3-folds, the ensemble did suspects to conduct preliminary screening from the comforts not show superior results compared to the best-performing of home and enable them to take adequate action in the early ResNet50 model. However, the ensemble system shows the stages of the infection. lowest standard deviation of the accuracy metric, indicating R EFERENCES that its performance is more consistent across the 3 folds. [1] “The World Health Network Declares Monkeypox A Pandemic - Press The best performing model has been deployed in a prototype Release— June 22, 2022,” 2022, [Online]. Available: https://www. web-app. Users can upload a photo of their skin lesions and worldhealthnetwork.global/monkeypoxpressrelease. receive an initial assessment for monkeypox (see Fig. 6). [2] “WHO — Monkeypox Fact Sheet,” 2022, [Online]. Available: https: //www.who.int/news-room/fact-sheets/detail/monkeypox. [3] D. Ravı̀, C. Wong, F. Deligianni, M. Berthelot, J. Andreu-Perez, B. Lo, TABLE II and G.-Z. Yang, “Deep learning for health informatics,” IEEE JBHI, P ERFORMANCE C OMPARISON OF D IFFERENT D EEP L EARNING M ODELS vol. 21, no. 1, pp. 4–21, 2016. AND T HE P ROPOSED E NSEMBLE [4] F. Wang, L. P. Casalino, and D. Khullar, “Deep learning in medicine—promise, progress, and challenges,” JAMA Int. Med., vol. Network Accuracy (%) Precision Recall F1 score 179, no. 3, pp. 293–294, 2019. VGG16 81.48± 6.87 0.85 ± 0.08 0.81 ± 0.05 0.83 ± 0.06 [5] K. M. Hosny, M. A. Kassem, and M. M. Foaud, “Classification of skin ResNet50 82.96± 4.57 0.87 ± 0.07 0.83 ± 0.02 0.84 ± 0.03 lesions using transfer learning and augmentation with Alex-net,” PloS InceptionV3 74.07± 3.78 0.74 ± 0.02 0.81 ± 0.07 0.78 ± 0.04 one, vol. 14, no. 5, p. e0217293, 2019. Ensemble 79.26± 1.05 0.84 ± 0.05 0.79 ± 0.07 0.81 ± 0.02 [6] S. J. Pan and Q. Yang, “A survey on transfer learning,” IEEE Trans. knowl. data. en., vol. 22, no. 10, pp. 1345–1359, 2009. [7] C. Shorten and T. M. Khoshgoftaar, “A survey on image data augmen- V. D ISCUSSION tation for deep learning,” J. of Big Data, vol. 6, no. 1, pp. 1–48, 2019. [8] K. Simonyan and A. Zisserman, “Very deep convolutional networks for In this work, we present our preliminary findings of mon- large-scale image recognition,” arXiv preprint arXiv:1409.1556, 2014. [9] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image keypox skin lesion detection using well-known deep learn- recognition,” in Proc. IEEE CVPR, 2016, pp. 770–778. ing architectures. Although the classification performance ob- [10] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, “Rethinking tained is quite promising, certain constraints limit the wider the inception architecture for computer vision,” in Proc. IEEE CVPR, 2016, pp. 2818–2826. applicability of the results. Firstly, the current number of [11] “Monkeypoxmeter: Real time Monkeybox tracker,” 2022, [Online]. unique patients in the dataset is limited. This reduces the Available: https://www.monkeypoxmeter.com/. generalization capability of these models. If more samples [12] Centers for Disease Control and Prevention, “Monkeypox — poxvirus,” https://www.cdc.gov/poxvirus/monkeypox/index.html, (Ac- can be included with a better geographical, racial, and gender cessed on 06/30/2022). distribution, it is possible to achieve a consistent performance across such variabilities. Secondly, as a initial approach, we
Particle emission concept and probabilistic consideration of the development of infections in systems: Dynamics from logarithm and exponent in the infection process, percolation effects