Professional Documents
Culture Documents
Assignment 3
Assignment 3
Assignment 3
Few-Shot Learning
Prepared By:
Member-1 Member-2
Name: SUMON AHMED Name: PRITOM PAUL
ID no: 210303020101 ID no: 200103020063
Semester: 9 (B)
th
Semester: 9 (B)
th
Member-3 Member-4
Name: ABDULLAH AL MAMUN Name: AKHI SARKER
ID no: 200103020010 ID no:
Semester: 9 (B)
th
Semester: 9 (B)
th
Member-5
Name: IQBAL AHMED
ID no:
Semester: 9 (B) th
Session: Fall-21
i
ABSTRACT
Overall, our research presents a promising avenue for addressing the challenges in
handwritten Bengali vowel recognition through the application of few-shot learning.
The proposed approach showcases the potential for accurately recognizing Bengali
vowels using limited training data, thus paving the way for practical implementations in
various domains such as optical character recognition, document analysis, and
automated handwriting recognition systems.
ii
TABLE OF CONTENTS
ABSTRACT ii
TABLE OF CONTENTS iv
LISTS OF FIGURES v
LIST OF ABBREVIATIONS v
1 INTRODUCTION 6
1.1 Background 6
1.2 Problem Formulation 6
1.3 Aim and Objectives 7
1.4 Project Scopes 7
1.5 Report Organization 8
2 Related review 9
2.1 Background 9
2.2 Paper-1 10
2.3 Paper-2 11
2.4 Paper-3 11
2.5 Paper-4 12
2.6 Paper-5 12
2.7 Comparisons 13
3 METHODOLOGY 14
3.1 Background 14
iii
3.2 Methodology Details 14
3.2.1 Flow chart of the work 15
3.2.2 Resources Used 15
3.2.3 Implementation/Simulation 15
3.2.3.1 Implementation Methodology 15
5 CONCLUSION 19
5.1 Conclusion Summary 19
5.2 Achievement/Contribution 20
5.3 Future Direction 20
5.4 Reflection or Lesson Learned 21
REFERENCES 22
APPENDICES A 23
iv
LIST OF FIGURES
Figure 3.1 Methodology 14
Figure A.1 Flow Chart 23
LIST OF ABBREVIATIONS
v
vi
CHAPTER 1
INTRODUCTION
1.1 Background:
7
1.2 Problem Formulation: Many problems occur while evaluating the system.
Some of those limitations and our proposed solutions are: The dataset of
handwritten Bengali vowels is very limited which means that we will need to
use a variety of techniques to augment the dataset. Also, Bengali vowels are
very similar to each other. This means that the system will need to be very
accurate in order to distinguish between them. Lastly, this system will need to
be able to manage a variety of different writing styles. This is because people
write Bengali vowels in a variety of different ways.
The aim of the project "Handwritten Bengali Vowel Recognition using Few-
Shot Learning" is to develop a system that can accurately recognize and classify
handwritten Bengali vowel characters, overcoming the limitations of limited
training data through the utilization of few-shot learning techniques.
Objectives for the project:
8
1.4 Project Scope:
9
CHAPTER 2
RELATED REVIEW
2.1 Background
10
2.2 Paper -1:
Samuel, M., Schmidt-Thieme, L., Sharma, D. P., Sinamo, A., & Bruck, A.
(2022).Offline Handwritten Amharic Character Recognition Using Few-shot
Learning. In PanAfriCon AI 2022. Retrieved from
https://arxiv.org/abs/2210.00275.
Snell, J., Swersky, K., & Zemel, R. S. (2017). Prototypical Networks for Few-
shot Learning. arXiv preprint arXiv:1703.05175v2.
https://arxiv.org/abs/1703.05175
11
2.4 Paper -3:
Chakrapani GV, A., Chanda, S., Pal, U., & Doermann, D. (2020). One-Shot
Learning-Based Handwritten Word Recognition. In: Document Analysis
Systems. DAS 2020. https://doi.org/10.1007/978-3-030-41299-9_17
12
2.6 Paper -5:
2.7 Comparisons:
13
A Data Augmentation Method for Few-Shot Learning of Handwritten
Characters[4]. This paper proposes a data augmentation method for few-shot
learning of handwritten characters. The method uses a variety of techniques to
augment the training dataset, including rotation, translation, and noise
addition. The paper presents experimental results showing that the proposed
method improves the performance of few-shot learning for handwritten
character recognition.
After comparing the five papers, Paper 1 seems to be the best choice for this
project. The paper proposes a unified few-shot learning framework that
achieves state-of-the-art results on a variety of handwritten character
recognition datasets. The framework is also relatively easy to implement,
which makes it a good choice for character recognition.
2.8 Summary
14
CHAPTER 3
METHODOLOGY
3.1 Background
Due to the limitations of deep learning, which requires huge amounts of labeled
data, few-shot learning has become an active area of research. Our methodology
includes the collecting dataset and the use of Prototypical Networks as a
baseline method.
- Number of Class = 11, Number of samples per classes = 5-8: Divide the
dataset into 11 classes with 5-8 samples per class.
15
- Resizing Normalized to: 200x200x1: Resize the images in the dataset to a
standardized size of 200x200 pixels with grayscale channel.
There will be only a few online materials, research papers, and a computer
involved. No external hardware or software will be required.
16
proven to be effective in handling small labeled datasets by learning a metric
space to classify new examples. This approach offers promising results for our
task of Bengali vowel recognition.
For the implementation of our experiments, we will utilize the PyTorch machine
learning library. PyTorch provides a flexible and efficient framework for
developing and training deep learning models. We will leverage its extensive
collection of pre-built modules and optimization algorithms, facilitating the
implementation of Prototypical Networks for few-shot learning.
In our simulation setup, we will divide the curated dataset into training,
validation, and testing subsets. We will ensure a balanced distribution of vowel
samples across these subsets to minimize bias and achieve reliable evaluation
results.
17
During the training phase, we will feed the Prototypical Networks model with a
limited number of labeled samples from the training set. The model will learn to
generate compact and discriminative representations of the vowel characters.
This training process will be guided by an optimization algorithm to minimize
the classification loss.
Following the training phase, we will evaluate the performance of the trained
model on the validation set to tune hyperparameters and assess generalization
capabilities. Finally, we will conduct a comprehensive evaluation of the testing
set to measure the effectiveness of our proposed approach in accurately
recognizing handwritten Bengali vowels under few-shot learning scenarios.
18
3.5 Team Performance
3.6 Summary
CHAPTER 4
4.1 Overview
19
academic research, and researchers in the field of Bengali language and
culture.
With limited training samples, the system will exhibit modest accuracy.
However, through successive iterations and the incorporation of techniques
like data augmentation, feature extraction, and model optimization, its
performance will steadily advance. As the system undergoes training, it will
learn to better capture variations in handwriting styles and generalize to
unseen data, resulting in higher accuracy rates.
4.3 Summary
20
CHAPTER 5
CONCLUSION
1. Achievements:
2. Contributions:
21
(ii) Developing a system that can accurately recognize handwritten
Bengali vowel characters, providing a practical and reliable tool for
language learners and image processing tasks.
22
REFERENCES
[1] Samuel, M., Schmidt-Thieme, L., Sharma, D. P., Sinamo, A., & Bruck, A.
(2022).Offline Handwritten Amharic Character Recognition Using Few-shot
Learning. In PanAfriCon AI 2022. Retrieved from
https://arxiv.org/abs/2210.00275.
[2] Snell, J., Swersky, K., & Zemel, R. S. (2017). Prototypical Networks for
Few-shot Learning. arXiv preprint arXiv:1703.05175v2.
https://arxiv.org/abs/1703.05175
[3] Shaffi, N., & Hajamohideen, F. (2021). Few-Shot Learning for Tamil
Handwritten Character Recognition Using Deep Siamese Convolutional Neural
Network. https://doi.org/10.1007/978-3-030-82269-9_16
[4] Chakrapani GV, A., Chanda, S., Pal, U., & Doermann, D. (2020). One-Shot
Learning-Based Handwritten Word Recognition. In: Document Analysis Systems.
DAS 2020. https://doi.org/10.1007/978-3-030-41299-9_17
abs/2008.12995
23
APPENDIX A
FLOWCHART
24