Final Thesis
• Introduction
• Why VGG19
  - Analysis of DNN Model Architecture of VGG19
  - Analysis of DNN Model
  - Summary of VGG19
  - Summary of Dataset
• Feature Extraction
• Why only reuse Convolutional Base
• Extract Feature from fashion mnist
• Result and conclusion
• Challenges for future
INTRODUCTION
• Online retail stores are growing quickly and overtaking traditional physical stores. Consumers can
browse thousands of items at online retail stores, but finding the ideal item among the massive choice
offered by all these stores can be difficult. For instance, searching for the right piece of clothing is
time-consuming because there are too many options. Large industrial exporters also face a significant
problem with product classification and labeling.
• Fashion-MNIST is a dataset of Zalando's article images—consisting of a training set of 60,000 examples and a
test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a label from 10
classes. The dataset serves as a direct drop-in replacement for the original MNIST dataset for benchmarking
machine learning algorithms. It shares the same image size and structure of training and testing splits.
• Image recognition using CNNs is extensively applied in fashion domains such as clothes classification,
clothes retrieval and automatic clothes labeling.
• - I apply the VGG19 architecture to the Fashion-MNIST dataset. So far, LeNet-5 has given higher
performance than other existing models, so I compare the accuracy of VGG19 with that of LeNet-5.
• - I am also experimenting with an augmented dataset (Myntra) on the existing best-performing model,
with the aim of resolving the exporters' classification and labeling issue.
Why VGG19
Figure 4. The Summary of Dataset
VGG19 requires a minimum input image width and height of 48, but I'll resize my images from 28 x 28
to 150 x 150.
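The resizing step above can be sketched as follows. This is a minimal illustration, not necessarily the method used in the thesis. It assumes nearest-neighbour sampling and assumes the grayscale channel is copied three times, since VGG19 with pretrained ImageNet weights expects 3-channel input. The helper names `resize_nearest` and `to_vgg_input` are hypothetical, and a random array stands in for a real Fashion-MNIST image.

```python
import numpy as np

def resize_nearest(image, size=150):
    """Resize a 2-D grayscale image to (size, size) by nearest-neighbour sampling."""
    height, width = image.shape
    # Map every output row/column back to the source pixel it falls on.
    rows = np.arange(size) * height // size
    cols = np.arange(size) * width // size
    return image[rows[:, None], cols]

def to_vgg_input(image, size=150):
    """Resize a grayscale image and repeat it across 3 channels (assumed layout)."""
    resized = resize_nearest(image, size)
    return np.stack([resized] * 3, axis=-1)

# Stand-in for one 28 x 28 Fashion-MNIST image (random pixels, illustration only).
sample = np.random.default_rng(0).integers(0, 256, size=(28, 28), dtype=np.uint8)
vgg_ready = to_vgg_input(sample)
print(vgg_ready.shape)
```

A smoother interpolation (for example bilinear) may give better features; nearest-neighbour is used here only to keep the sketch dependency-free.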
• Feature Extraction
This solution is very fast and cheap to run, because it only requires
running the convolutional base once for every input image, and the
convolutional base is by far the most expensive part of the pipeline.
However, for the exact same reason, this technique would not allow me
to leverage data augmentation at all.
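A hedged sketch of this feature-extraction approach is shown below. It assumes TensorFlow/Keras is installed and that the convolutional base is loaded with ImageNet weights, which the slides do not state explicitly. The function names `build_vgg19_base` and `extract_features`, and the batch size, are illustrative choices rather than the thesis code.

```python
import numpy as np

def build_vgg19_base(input_shape=(150, 150, 3)):
    """Build the VGG19 convolutional base without its dense classifier.

    Assumption: TensorFlow/Keras is available; the ImageNet weights are
    downloaded the first time this function is called.
    """
    from tensorflow.keras.applications import VGG19
    return VGG19(weights="imagenet", include_top=False, input_shape=input_shape)

def extract_features(conv_base, images, batch_size=32):
    """Run each image through the frozen base exactly once and cache the result."""
    chunks = []
    for start in range(0, len(images), batch_size):
        batch = images[start:start + batch_size]
        chunks.append(np.asarray(conv_base.predict(batch)))
    features = np.concatenate(chunks, axis=0)
    # Flatten the feature maps so a small dense classifier can be trained on them.
    return features.reshape(len(images), -1)
```

With 150 x 150 inputs, the five pooling stages of VGG19 reduce each image to a 4 x 4 x 512 feature map, so each cached feature vector has 8192 values. Because the classifier is trained only on these stored vectors, the images never pass through the base again, which is why augmented variants cannot be generated during training.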
Categorical Crossentropy
• Categorical crossentropy is used as the loss function for multi-class classification models, where
there are two or more output labels. Each output label is given a one-hot category encoding: a vector
of 0s with a single 1 at the index of the true class. If the output labels are in integer form, they
are converted into this categorical encoding using the keras.utils to_categorical method.
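To make the encoding and the loss concrete, here is a small NumPy sketch. The `one_hot` helper is a hypothetical stand-in that mimics what `to_categorical` produces, and the labels and predicted probabilities are made-up illustration values, not results from the thesis.

```python
import numpy as np

def one_hot(labels, num_classes):
    """Convert integer class labels to one-hot rows (mimics to_categorical)."""
    labels = np.asarray(labels, dtype=int)
    encoded = np.zeros((labels.size, num_classes), dtype=np.float32)
    encoded[np.arange(labels.size), labels] = 1.0
    return encoded

def categorical_crossentropy(y_true, y_pred, eps=1e-7):
    """Mean over samples of -sum(y_true * log(y_pred))."""
    y_pred = np.clip(y_pred, eps, 1.0)  # avoid log(0)
    return float(np.mean(-np.sum(y_true * np.log(y_pred), axis=1)))

# Two samples, three classes (illustrative values only).
labels = [0, 2]
y_true = one_hot(labels, num_classes=3)
y_pred = np.array([[0.7, 0.2, 0.1],
                   [0.1, 0.1, 0.8]])
loss = categorical_crossentropy(y_true, y_pred)
print(y_true)
print(loss)
```

Only the predicted probability of the true class contributes to each sample's loss, because every other entry of the one-hot vector is 0.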
Figure 5. Training and Validation Accuracy
Challenges / Future Work