Professional Documents
Culture Documents
Internship Report
Internship Report
Host Company:
AMEN BANK
Created by:
Wissem Ellefi
Summer of 2022
Acknowledgement
Transferring scanned text from an image to a computer-readable form has always been
one of the attractive research challenges. The objective of text recognition is to develop
a robust and accurate system capable of achieving the level of human performance in
reading. Off-line text recognition applications can improve the input operation speed and
takes as input a raster image of a text captured by a scanner or from screen and then
involved in text recognition, including image and signal processing, pattern recognition,
researchers have been intensively investigated in the field, existing systems still did not
difficult due to the great variability of writing and printing styles. The inherently
difficulties are due to letters writing styles, where they presented in different sizes and
skews, and variation in width and shape of strokes. The research presented in this
The balance sheet is one of the three fundamental financial statements and is key to
both financial modeling and accounting. The balance sheet displays the company’s
total assets and how the assets are financed, either through debt or equity. It can also
be referred to as a statement of net worth or a statement of financial position.
And in this project, I’m going to deal with French versions of the Balance Sheets (Bilan
Financier).
As such, the balance sheet is divided into 3 Major Sections. The first section of the balance
sheet outlines all the company’s assets include all property and rights owned by the
business.
On the other section, the balance sheet outlines the company’s liabilities and shareholders’
equity. And finally, the income statement which is a key financial document of the business. It
shows what the business earns, what it spends, and whether it makes a profit over a given
period.
The Balance Sheet contains the Values of each component corresponding to the current year
and the previous year and we are interested in extracting the values of the current year.
1.2.3. Balance-Sheet Example
Amen Bank was founded in 1966, as a result of the independence from the “Crédit
Foncier d’Algérie et de Tunisie” (CFAT), a local branch of the French banking system
back in 1880 and headquartered in Algeria in 1966, it changed its name to Crédit
The CEO was Ismail Zouiten, yet all its shareholders were French citizens. In 1971, it
was bought by the “Banque Générale d'Investissement”, later known as PGI Holding,
and opened to Tunisian shareholders as Rachid Ben Yedder became the new CEO. In
1995, it changed its name again to Amen Bank. In 2009, Amen Bank launched Tunisia's
First Online Bank. In 2015, Amen Bank launched Tunisia's first online Direct Bank. Amen Bank
made a request to the Central Bank of Tunisia to create a subsidiary specialized in Islamic
· Chapter 3 discusses The Objective of the Study. In other words, we are going to
answer the question of what our research is trying to achieve, and a clear
explanation why we are pursuing it.
· Chapter 4, it presents Literature Review where you’ll find the previously published
works in the Object Detection and especially, in the Optical Character Recognition
domain.
· Chapter 6, where you will find the result and Analysis of the work which is our
accurate application and some screenshots using it.
· The last chapter is 6, it presents the conclusion, limitation, and future work. Where
Results are reviewed as well as the system limitation.
CAPTER 2: RESEARCH QUESTIONS
2.1. Introduction
• Proposed Solutions section describes the proposed idea to solve the problem described above.
• Solution Analysis discuss the requirements and the impact of the solution proposed in the
previous Section.
• Research Question where you’ll find the question our project sets to answer.
The bank agent who’s responsible for handling the balance-sheets must read them and then
manually input the extracted information to the database by typing each value apart (~50
values). In other words, this is a big loss of time and management.
Build a software for accurate extraction of balance bank-sheets information using optical
character recognition and successfully get rid of the manual waste of time.
To successfully build this software first we must Work cross-functionally (Cross multiple
departments within the BANK) to understand the business objective of the project, its
There are various tools and technologies that are available in the market that can be used to
perform this task. But after a discuss of the Bank to well-choose of the dependencies of this
project we end up by
Deciding to develop the AI system (which is responsible for the OCR task) using Python, the
Backend with JAVA specifically SPRING BOOT and the Frontend using ANGULAR.
· Python 3.10.5
· PyTorch with support of CUDA
· EasyOCR
· NumPy
· pdf2image
· OpenCV
· Flask
· JDK 11
· Oracle Database
· Spring Boot
· ANGULAR 13.3.9
Is this application capable of successfully completing these tasks? And effectively reduce the
loss of time?
CHAPTER 3: OBJECTIVE OF THE STUDY
The aim of this work is to develop sentence recognition systems inspired by the human reading
process. Cognitive studies observed that the human tended to read a word at a
time. He considers the global word shapes and uses contextual knowledge to infer and
discriminate a word among other possible words. The sentence recognition system is a fully
integrated system: a word level recognizer (baseline system) integrated with linguistic
recognition approach characterized as a probabilistic ranked task. The output of the system is
multiple recognition hypotheses (N-best word lattice). The basic unit is the word rather than
the character: it does rely on segmentation and requires baseline and object detection. So, The
Build a software for accurate extraction of balance bank-sheets information using optical
character recognition then injects the extracted information to the database automatically
This chapter is an overview of the previously published works on the Object Detection (SSD:
Algorithm) and Optical Character recognition fields.
We present a method for detecting objects in images using a single deep neural network. This
approach, named SSD, discretizes the output space of bounding boxes into a set of default
boxes over different aspect ratios and scales per feature map location. At prediction time, the
network generates scores for the presence of each object category in each default box and
produces adjustments to the box to better match the object shape. Additionally, the network
combines predictions from multiple feature maps with different resolutions to naturally handle
objects of various sizes. The SSD model is simple relative to methods that require object
resampling stage and encapsulates all computation in a single network. This makes SSD
easy to train and straightforward to integrate into systems that require a detection component.
Experimental results on the PASCAL VOC, MS COCO, and ILSVRC datasets confirm that SSD has
comparable accuracy to methods that utilize an additional object proposal step and is much
faster, while providing a unified framework for both training and inference.
The link below will redirect you to the official document of the research (SSD):
https://arxiv.org/pdf/1512.02325.pdf
4.2.2. SSD Architecture
Network (CNN) model. This OCR model utilizes embedded text recognition technology to
extract and process text found in images. What makes OCR an amazing technology is its
ability to recognize text from different images with the same font. This is where deep
To accomplish OCR machine learning, the process starts with a text localization followed
by character segmentation. Once all of this is done, the OCR performs character
This chapter describes the methodology or approach used (Cross-Industry Standard Process for
Data Mining (CRISP-DM))to solve the issue and successfully build an accurate solution.
5.2. CRISP-DM
Data Science is a complex process in which the projects involve a different variety of
stakeholders, data sources and goals. To maintain order, we used the CRISP-DM methodology
which is one of the most popular methods used to date. The methodology is split into 6
sections and is iterative which means that the steps can be repeated as often as necessary until
1. Business Understanding:
It is important at this point that both the Data Scientist and the stakeholders are on the
same page, in order to prevent a model that does not achieve its intended goal, and also so
that everyone understands and has the same expectations in regard to the outcome of the
2. Data Understanding:
Once we have a firm understanding of the objectives of our project ahead of us, it is only
natural that we must have the same understanding of the data that we have available to us.
It is important to know not only the information contained in our data but also where our
At this stage, we have enough of an understanding of our data that we can begin the
4. Modeling:
Once our data has been sufficiently cleaned and prepared, we are now at the step where
we can begin to create models with our data. At this point it is important to consider the
type of model we are hoping to build, guided by our understanding of our objectives and
5. Evaluating:
During this phase we take a step back and evaluate the models we have made! The
evaluation stage is important because it not only helps us evaluate the current progress of
our model and the success of our model but also provides us the opportunity to see if new
6. Deployment:
At this point the goal is to work with the stakeholders to ascertain how to put the model
into progress.
5.3. CRISP-DM Architecture
As Figure 1 shows, in this page you have to fill the form and choose the pdf file presenting the balance
sheet and when you click at ‘Traiter’ it will pass it to the OCR system to do his Job.
In addition to that we have also discuss the data science objective of the project including the
component of the balance sheet in order to have clear ideas about the work that must be done
in the earlier stages and the design where we have detailed the system architecture diagram
and described our software environment that will help us to achieve our goals in the end. Then
we showed the algorithms that we are working with and how they work . It was also essential
The project was technically beneficial as we were able to master the entire value chain of a
Data Science project using multiple tools and Technologies. Computer Vision a very broad and
very promising field with several fields of application like Optical Character Recognition and the
fact of being a specialist engineer in this field is to have the sense of analysis, design and
organization, and is also to have a sense of commitment to the community and to assume all
the responsibilities that have been assigned to it given the crucial importance of this discipline