VISION
MISSION
PROGRAM OUTCOMES (POs)
Students in the Computer Science and Engineering program should, at the time of their graduation, be in possession of:
PO1.Engineering Knowledge:
PO2.Problem Analysis:
PO8.Ethics:
PO10.Communication:
PO11.Life-long Learning:
Recognize the need for and have the preparation and ability to engage in
independent and life-long learning in the broadest context of technological change.
Ability to develop mobile & web applications using J2EE, Android, and J2ME.
PSO2.Cloud Services:
PROGRAM EDUCATIONAL OBJECTIVES (PEOs)
PEO1: Graduates will be able to analyze, design, and develop advanced computer
applications to provide solutions to complex problems.
PEO3: Graduates will have the technical and communication skills and the character that will prepare them for technical and leadership roles.
CO1. Develop problem-formulation and design skills for engineering and real-world problems.
CO2. Collect and generate ideas through literature surveys on current research areas, which helps to analyze and present knowledge in different fields.
CO3. Impart knowledge of software and hardware to meet industry needs and standards.
CO4. Create interest in researching innovative ideas as part of lifelong learning.
CO5. Ability to work in a team, and to enrich presentation and communication skills.
CO6. Create a platform that makes students employable.
EXPECTED OUTCOMES
SASI INSTITUTE OF TECHNOLOGY & ENGINEERING
(Approved by AICTE, New Delhi, Permanently Affiliated to JNTUK, Kakinada, and
SBTET-Hyderabad, accredited by NAAC with 'A' Grade and NBA, ranked as "A"
Grade by Govt. of A.P., Recognised by UGC 2(f) & 12(B))
Kadakatla, TADEPALLIGUDEM – 534 101
CERTIFICATE
This is to certify that the project work entitled "Handwritten Telugu
Achulu Character Recognition Using Convolutional Neural Networks" is being
submitted by R. GANGAPAVANIDEVI (19K61A05E3), V. PAVAN KUMAR
(19K61A05H2), P. SWETHA (19K61A05D4), and P. RAMYA (19K61A05C6) in partial
fulfilment of the requirements for the award of the degree of BACHELOR OF
TECHNOLOGY in Computer Science and Engineering at Jawaharlal Nehru
Technological University, Kakinada, during the academic year 2022 to 2023, and
is a record of bonafide work carried out by them under my/our guidance and
supervision. The results presented in this thesis have been verified and found to be
satisfactory. The results embodied in this thesis have not been submitted to any
other University or Institute for the award of any other degree or diploma.
External Examiner
DECLARATION BY THE CANDIDATES
Mohan, is submitted in partial fulfilment of the requirements for the award of the degree,
is carried out by us, and the results embodied in this project have not been reproduced
or copied from any source. The results embodied in this project report have not been
submitted to any other University or Institute for the award of any other degree or
diploma.
Project Associates
R. Gangapavanidevi 19K61A05E3
P. Swetha 19K61A05D4
P. Ramya 19K61A05C6
ABSTRACT
Handwritten character recognition plays a vital role in recognizing the characters of old
documents using Optical Character Recognition (OCR) and Convolutional Neural Networks
(CNNs). The main shortcoming of the existing Telugu recognition system is its inability to
recognize handwritten Telugu characters and to correct the slant of the handwritten text. To
overcome this problem, we use different Telugu character images as the dataset and a
convolutional neural network to obtain a high accuracy and recognition rate. A convolutional
neural network generally has different layers, such as convolutional layers, max-pooling
layers, fully connected layers, and a softmax layer. The input character image is passed
through these layers, and the recognized character is finally obtained as an output image from
the softmax layer. Handwritten recognition has long been a challenging task, whether in
English or other languages; Indian-language handwritten letters in particular have many
curves, and recognizing them has been an open challenge for a long time. With the advent of
machine learning, handwritten recognition has become easier. However, many challenges still
persist, because feature extraction is a difficult task given the large number of characters in
Indian languages. In this work we have chosen the Telugu achulu characters for recognition.
Classification is a challenging task: numerous machine learning techniques exist, but accuracy
remains the key challenge. We therefore deploy a deep learning technique to enhance the
accuracy, observing test and training accuracies as high as 79.61% and 96.13%, respectively.
We have built a machine learning model with a Convolutional Neural Network for
handwritten Telugu achulu.
ACKNOWLEDGEMENT
We pay obeisance to our dynamic Chairman, Sri B. Venu Gopala Krishna, Sasi
Educational Society, for his inspiring presence, which has always been the principal
driving force behind all our endeavors. First of all, we would like to extend special
thanks to Sri M. Narendra Krishna, Vice-Chairman, Sasi Institute of Technology and
Engineering, for his everlasting support.
It is a true pleasure for us to thank Dr. J. Anand Chandu Lal, Principal, Sasi
Institute of Technology and Engineering, who has been the driving force for us to see
this project through periodic audits.
We are very grateful to our supervisor, S. Jagan Mohan, Assistant
Professor, Sasi Institute of Technology and Engineering, for his constant encouragement,
monitoring, and guidance throughout the submission of the project. He
motivated us whenever we encountered an obstacle along the way.
We are very happy to thank our project coordinator, Dr. P. Siva Kumar, and all
the Project Evaluation Committee members, who provided valuable suggestions to fine-
tune our ideas and project. We are also thankful to all teaching and non-teaching staff
members who contributed to the successful completion of our project work.
With gratitude,
R. Gangapavanidevi (19K61A05E3)
V. Pavan Kumar (19K61A05H2)
P. Swetha (19K61A05D4)
P. Ramya (19K61A05C6)
TABLE OF CONTENTS
1.1 PREAMBLE 1
1.12 SUMMARY 12
2.7 SUMMARY 27
3.1 PREAMBLE 28
3.5.1 Processor 31
3.7.1 Python Libraries 32
3.8 SUMMARY 33
4.1 PREAMBLE 34
4.2.1 Dataset 36
4.4 SUMMARY 41
5.1 PREAMBLE 42
5.3.1 CNN 43
5.3.3 Resizing 44
5.5 SUMMARY 47
6.1 PREAMBLE 48
6.2 RESULTS 48
6.3 GRAPHS 49
6.5 SUMMARY 54
7.1 CONCLUSION 55
REFERENCES 56-57
LIST OF FIGURES
LIST OF TABLES
NOMENCLATURE
LIST OF ABBREVIATIONS
Handwritten Telugu Achulu Character Recognition Using Convolutional Neural Networks
CHAPTER 1
INTRODUCTION
1.1 PREAMBLE
Handwritten character recognition using CNNs plays a vital role. A handwritten
character depends on the user and on the position in which it is written within a word,
so it is crucial to detect it without violating the written form. Recognizing a character
written by hand is important because it helps to identify and correct slant. Most users
still cannot tell whether the character they have written is correct or not. CNN
algorithms are designed for the fast recognition of handwritten characters so that the
necessary corrective actions can be taken. In this work, our proposed methodology aims
to improve the accuracy of handwritten Telugu character recognition over currently
available detection systems. Our project aims not only to improve the efficiency and
accuracy of handwritten Telugu character recognition in the school sector, but also to
assist users whose characters are not covered by an automatic detection system. It
assists the user by displaying the slant correction and the recognized character, and it
alerts the user when a character is written incorrectly. Some users may not be aware of
all character forms and may make errors while writing; our model is intended to assist
these kinds of users too.
Handwriting recognition has been one of the challenging and active research
areas in the field of pattern recognition and image processing. A number of character
recognition systems have been developed and are used even for commercial purposes.
Several applications, including mail-reading aids for the blind, bank cheque
processing, sorting, document reading, and postal address recognition, require offline
handwriting systems. For machines it is tough to recognize a character; a machine
considers it a document of pixels. Handwritten recognition has many real-time
applications, such as sorting letters in post offices and processing handwritten
characters in banks. Hence it is an important application of pattern recognition. Optical
character recognition (OCR) is the automatic reading of optically sensed document text.
This chapter mainly describes the introduction to handwritten character
recognition using CNNs, the challenges that have been solved by other authors, the
background study of handwritten Telugu character recognition, the aim and objectives, a
brief overview of the proposed methodology, and the significance of the work.
Optical Character Recognition is used to convert printed text and image
documents into an editable text format, irrespective of the language. This OCR concept
can be used in many applications, such as education and administration. The main
challenge in the existing Telugu OCR system is its inability to recognize the
handwritten text of Telugu characters. To overcome this problem we use different
Telugu character images as the dataset and a convolutional neural network to obtain
high accuracy and recognition rates. A convolutional neural network generally has
different layers, such as convolutional layers, max-pooling layers, fully connected
layers, and a softmax layer. The input character image is passed through these layers,
and the recognized character is finally obtained as an output image from the softmax
layer. The future scope of our project is to obtain all the recognized characters in an
editable text format for future usage. In order to build this project, we referred to thirty
projects that have built similar models. Each model has its own algorithms,
methodologies, case studies, and building stages. To design and build a model, the
correctness of the characters needs to be checked, that is, whether the built model can
recognize a character accurately or not. Because this is a deep learning project, the
model should be able to learn from its previously performed activities. It should be
given a dataset from which it starts to recognize and match the characters present in
the dataset.
Convolutional neural networks are particularly well suited to processing 2D data, such
as images. CNNs eliminate the need for manual feature extraction, so you do not need
to identify the features used to classify images. The CNN works by extracting features
directly from images. The relevant features are not pretrained; they are learned while
the network trains on a collection of images. This automated feature extraction makes
deep learning models highly accurate for computer vision tasks such as object
classification.
There are several stages in character recognition. They are:
1. Capturing
2. Extraction
3. Detection
4. Classification
5. Recognition
Handwritten recognition has been a challenging task for a long time. With the advent
of machine learning, handwritten recognition has become easier. However, many
challenges still persist, because feature extraction is a difficult task given the large
number of characters in Indian languages. There are numerous machine learning
techniques to predict handwritten characters; however, accuracy is the key challenge
to achieve.
The model is integrated with computer vision and continually examines the
form of the character. When a written character is captured by the system, the model
identifies what kind of character has been captured and determines what output should
be given to correct slanted characters. Based on that response, the system decides what
correction has to be made and applies that decision to the written character.
➨ The processing of OCR information is fast; large quantities of text can be input
quickly.
➨ A paper-based form can be turned into an electronic form, which is easy to store
or send by mail.
➨ It is cheaper than paying someone to manually enter large amounts of text
data. Moreover, it takes less time to convert text into electronic form.
• Educational sector.
• Mail-reading aids for the blind.
• Bank cheque processing.
• Document reading and postal address recognition, which require offline
handwriting systems.
• To recognize and correct slant-written characters with better accuracy
and improved safety through quicker recognition.
Different methods are used for detecting and recognizing Telugu characters,
all developed to obtain accurate results for character detection. Character recognition
is a very difficult task, so an efficient technique and approach is needed to recognize
Telugu characters. Autonomous systems analyze the information supplied by various
signal images and take decisions by carefully drawing the targeted regions, going
through different images in the network. This is a time-consuming operation that
necessitates a great deal of expertise and training.
There are many existing methodologies that help in identifying and
correcting Telugu characters. From all of these existing methodologies, we considered
the following as the base methodology for this project. Here, the data were evaluated
and trained using 600 training images and 300 test images with a resolution of
1360 x 800 pixels.
The dataset is divided into three categories: characters, collision
characters, and slant characters. The STS dataset is also a large dataset, containing
2,000 images at a resolution of 1280 x 960 pixels, of which 20% are annotated. The
model identifies particular characters in the images. In addition, an anchor-box method
was used for object detection. The existing system improved the CNN network. The
steps in the algorithm are:
• Sorting and gathering the dataset according to character size and shape
in the set of exercises.
The proposed system is an enhancement of the existing system that uses image
processing techniques such as Optical Character Recognition (OCR), the mechanical
or electronic conversion of images of typewritten or printed text into machine-encoded
text. OCR technology provides fast automated data capture, which can save
considerable time; with it we can improve the speed and accuracy of previous models
of handwritten Telugu character recognition.
The proposed system will be trained on a dataset that contains a wide range of images
of Telugu characters. The characters will include all types, such as small, medium, and
large, and in different slants, so that the system is able to recognize and correct any
character.
In our new method, we modify the existing system to obtain better accuracy
for handwritten character recognition.
This helps to recognize characters in old documents and books written by elders, so
that the content of the characters can be reused.
OCR is often implemented with a Convolutional Neural Network (CNN), a
popular deep neural architecture. Standard CNN classifiers are capable of learning,
and classification is performed using a softmax layer. A convolutional neural network
(CNN or convnet) is a kind of machine learning model. It is one of the various styles
of artificial neural networks that are used for different applications and data types. A
CNN is a kind of architecture for deep learning algorithms and is specifically used for
image recognition and tasks that involve the processing of pixel data.
● This project recognizes a character only if its class is present in the dataset
on which the model is trained.
● This project mainly deals with slant correction of handwritten
characters, which may lead to higher accuracy.
● This project is not applicable to characters that are not in the required
dimensions.
1.12 SUMMARY
CHAPTER 2
LITERATURE SURVEY
2.1 PREAMBLE
The following section summarizes the work done previously, highlighting the
strengths and weaknesses of each method. Various proposed methods are reviewed
here.
• Convolution Neural Network
• Adaptive Zoning
• Optical Character Recognition(OCR)
• Recurrent Neural Network
• Recognition by other methods
In the literature survey, several methods have been proposed for
handwritten Telugu character recognition. Among the most recently published works
are those presented as follows:
automobile systems. For quick multi-scale object detection, a unified deep neural
network called the multi-scale CNN (MS-CNN) was suggested. A proposal sub-network
and a detection sub-network make up the MS-CNN.
Data gathered from various sources, as described, was divided into training and
testing groups: 47,428 images were used for training and 5,156 images for testing.
Nine networks were trained on the training set. The networks that performed best on
the test dataset and had the least correlation were integrated to build a classifier. Two
of the nine networks were chosen based on their performance on the test dataset and
their low mutual correlation. Using the two categories of voting mentioned above, the
performance of the new classifier was assessed on the test dataset. The new classifier
pair produced good results, with better accuracy than the individual classifiers.
The ensemble classifier for the same data outperformed the best individual
classifier trained on MNIST data, with test performances of 97.25% and 98.5%,
respectively. A number of models built on CNNs produced results comparable to
the best result ever published using MNIST data. Consonant and vowel-modifier
recognition produced results of 92.26% and 92%, respectively, for the proposed
approach.
The network has four layers, one with a 5 x 5 kernel while the others
have 3 x 3 kernels, ReLU and softmax activation functions, max pooling, and two dense
layers. The 168 outputs in the final layer correspond to the classes taken into account:
vowels and consonants. 45,133 images created by numerous Telugu writers were used
to train and evaluate the CNN, a thorough investigation of an intriguing dataset.
We had to create our own dataset because Telugu characters do not have any
publicly accessible training data. Initially, there is a collection of Telugu letters
written by individuals. Various formats of guninthalu are scanned into exact
symbols. Each of the 21 guninthalu contains 16 characters, for a total of 275,520
handwritten characters. The built dataset was used to develop the CNN. Testing was
conducted on a 64-bit operating system.
The objective of this research is to create a system that relies heavily on
classification to recognize Telugu handwritten characters. Numerous handwritten
Telugu guninthalu served as the input for the creation of the dataset that was the
foundation of the suggested model. In comparison, the RMSprop optimizer achieves
94.26%, performing better than the ADAM and SGD optimizers.
Mayur Bhargab et al. (2020) proposed handwritten character recognition from
images using CNN-ECOC. Handwritten character recognition is difficult, and
researchers have explored completely different techniques in the past few decades.
In order to find a suitable feature extractor, three standard CNN architectures were
explored, namely LeNet, AlexNet, and ZFNet. From the simulation results, it was
determined that LeNet provides a low accuracy rate; it was therefore modified by
adding a dropout layer and a ReLU layer after the first fully connected layer, which
resulted in a higher accuracy rate. The OCR can be implemented using a
Convolutional Neural Network (CNN), a standard deep neural architecture.
In this paper, they presented OCR by combining a CNN and an Error-Correcting
Output Code (ECOC) classifier. The CNN is employed for feature extraction and
the ECOC for classification. CNNs are feedforward networks and use the
back-propagation training algorithm.
The training is performed in two passes, forward and backward. In the
forward pass the network weights and biases are initialized with small random
numbers, and the network output is calculated using the training input. The error is
computed by comparing the network output with the desired training output. In the
backward pass the error propagates backward, and all the weights and biases are
adjusted to reduce the error. The process is repeated until the desired result is
obtained. Once the network is trained with a suitable dataset, it can be used for
solving a particular problem. Given any input image, the ECOC classifier works by
extracting the features of the input image and then feeding them to all the binary
learners.
The binary learners are trained using linear SVMs, where one
class is taken as positive and the others as negative, separated by a hyperplane.
Once a binary learner is trained, each learner produces a probability. All the
probabilities are collected into a string and converted into a codeword using a
suitable threshold. The codeword generated by the ECOC classifier is compared
with the class codewords, and the class corresponding to the closest codeword is
the predicted output. Different models and classifiers can be used.
model for predicting the handwriting sequence and recognizing the text. They
conducted two experiments using 10 Telugu characters, and the results show the
effectiveness of the model: handwriting-prediction-based character recognition using
a recurrent neural network. Humans are said to unconsciously trace handwriting
sequences in their brains, based on handwriting experience, when recognizing
handwriting.
In the experiment, they used a pen tablet (Intuos4 PTK-640, WACOM) for
acquiring the data. Ten Telugu characters were written using the pen tablet; each
character, its corresponding pronunciation, and an example of the actual handwritten
image are shown. Each character was written 10 times, to acquire a total of one
hundred sequences. Images were acquired at 20 images/sec, each with a resolution of
20 x 20, starting from the instant the pen made contact with the tablet and ending once
the character was finished being written. Five out of each of the 10 sequences were
used. The findings present the results of the experiment for the recognition of image
sequences. Because the recognition method derives the values representing each
sequence, the distribution of those values is evaluated. Principal Component Analysis
(PCA) was applied to the values for viewability. The first three principal components
were picked for analysis, because the cumulative contribution ratio of the third
component was 97.5%. In the distribution of the values, each mark represents the
value of the corresponding letter. The model was created using a self-organizing map
(SOM) for the word feature-extraction module. The limitation of this paper is that
large characters were not detected; this is to be checked in future improvements.
2.8 SUMMARY
In the above literature survey, a few papers on methods for handwritten Telugu
character recognition were compared, most of which used CNNs, adaptive zoning,
Optical Character Recognition algorithms, etc. The approaches have shown good
performance and precision in handwritten Telugu character recognition. In almost all
the papers, datasets covering all the letters of Telugu are not considered, and even
where they are considered, the developed models are not able to give exactly correct
output. In some of the papers a very small dataset is taken, and most of the dataset
images are of high resolution. Slant correction is an important factor in applications
requiring word recognition, and one of the major factors in developing better
handwritten recognition. So, in our work we want to increase the accuracy based on
slant correction by using a large dataset.
CHAPTER 3
SYSTEM REQUIREMENTS
3.1 PREAMBLE
● Python is Interpreted – Python is processed at runtime by the interpreter. You
do not need to compile your program before executing it. This is similar to
PERL and PHP.
● Python is Interactive – You can sit at a Python prompt and interact with the
interpreter directly to write your programs.
● The Python language is freely available on the official website,
www.python.org, and can be downloaded from there.
● It provides very high-level dynamic data types and supports dynamic type
checking.
● Portable – Python can run on a wide variety of hardware platforms and has the
same interface on all platforms.
● Scalable – Python provides better structure and support for large programs than
shell scripting.
The operating system used is Windows, including versions such as Windows 7 and
later versions such as Windows 10. It helps the user understand the basic tasks a
system can provide. It lets the user interact with the system and describes how data
will be processed, how to access data, at which location a file is saved, and how to
access it. All this information is present in the operating system.
3.5.1 Processor
● TensorFlow 1.14
● Keras 2.3.1
3.9 SUMMARY
This chapter introduced the software and hardware requirements of the system,
which are required for the successful implementation of the project. Python is the
programming language chosen for handwritten Telugu character recognition, for
better implementation and accurate results, mostly using Jupyter notebooks. A system
with a 64-bit Windows operating system and an Intel Core processor is used. The
hardware requirements are the basic ones required for the execution of any regular
Python script.
CHAPTER 4
SYSTEM DESIGN
4.1 PREAMBLE
The previous chapter describes the different requirements and tools used for
developing the proposed system. It also consists of the hardware requirements like
processor, memory, and software requirements like python and frontend and backend
tools. This chapter deals with the design of the proposed system. It includes the system
architecture, block diagram, and various diagrams to represent the design of the system.
Firstly, we have taken handwritten characters from different individuals, and
later we capture every individual character. Then we test the images. A common
technique is to divide the data into two groups, the training set and the testing set. The
training set forms the basis for estimating parameters, contrasting models, and
carrying out other operations required to produce a final model; it is used to generate
models and feature sets. The test set is used only to estimate a final, objective
evaluation of the model's performance once all of these procedures are complete.
The dataset for character recognition will be used to divide the data. The
classification of the characters will be aided by the further division of this dataset into
the two groups. The training data and the testing data are the two forms of splitting.
The new models will be built and the features placed in the training dataset, making
it possible to identify the parameters and predict them. Performance will be
determined after the models have been compared. The testing data is used to test the
data and provide the outcome. Once the dataset has been tested and trained, we create
a web page that displays the outcome of recognizing a character. After the creation of
the web page, a page is displayed with different options such as home, about project,
and user.
The first feature, home, gives information about the web page. The next
feature, about the project, gives complete information and details about the project.
The next feature is the user phase, where the user can register, or log in directly if
already registered. We then take a character as an input image and capture the image
into the system. Later we upload the images; after uploading the files, the image
undergoes the testing process, and the result is an output image.
4.2.1 Dataset
This project is developed with a dataset containing 2,000-plus images of
handwritten Telugu characters with different slants. The dataset is classified into two
types, training and testing, where 70% of the images are used to train and 30% to test;
they are further divided according to their character name. This is the overall dataset
explanation.
⮚ Resizing: The images are resized to a fixed size of 30x30 pixels using the resize()
method from the PIL.Image library. This is done to ensure that all images have the
same dimensions, which is a requirement for training most machine learning
models.
⮚ Converting to NumPy array: The images are converted to NumPy arrays using the
np.array() function. This allows the image data to be easily manipulated and used
as input for machine learning models.
⮚ Appending data to a list: The NumPy arrays representing the images are appended
to a list called data. This list will be used to create a NumPy array containing all the
image data.
How to use the available data is one of the first decisions to be made before beginning a modelling project. A common technique is to divide the data into two groups, the training set and the testing set. The training set forms the basis for estimating parameters, contrasting models, and carrying out the other operations required to produce a final model; it is used to generate models and feature sets. The test set is used only to produce a final, objective estimate of the model's performance once all of these procedures are complete. The handwritten character dataset is divided in the same way: the models are built and the features fitted on the training data, where the parameters are identified, and performance is determined after the models have been compared. The testing data is then used to test the model and provide the outcome.
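The 70/30 split described earlier can be sketched with scikit-learn's train_test_split (assuming scikit-learn is available; the arrays below are stand-ins for the real image data and labels):

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Stand-in arrays: 10 "images" of 30x30 pixels with integer class labels.
data = np.zeros((10, 30, 30), dtype=np.uint8)
labels = np.arange(10) % 2

# 70% of the samples train the model, 30% are held out for testing.
X_train, X_test, y_train, y_test = train_test_split(
    data, labels, test_size=0.30, random_state=42)
print(X_train.shape[0], X_test.shape[0])  # 7 3
```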
Training data is a set of samples, such as images with assigned labels or tags, used to train a computer vision algorithm or model to perform the desired function or make correct predictions. Training data goes by several other names, including learning set, training set, or training data set. It is what the machine learning model is created from and what defines the desired outcome; the model scrutinizes the dataset repeatedly to understand its traits and fine-tune itself for optimal performance.
In the same way that human beings learn better from examples, computers also need examples to begin noticing patterns and relationships in data. But unlike human beings, computers require plenty of examples, because they do not see the objects or characters in the images the way we do. Training a model to recognize different handwritten characters therefore takes a great deal of work and a large dataset.
The model will recognize the character in the image and produce the class to which it belongs. The results generated by the model are analyzed and compared with those of the existing methods, and a conclusion is drawn. The model consists of the following layers:
• Two Conv2D layers with 32 filters each and a kernel size of 5x5, followed by ReLU
activation functions.
• One MaxPool2D layer with a pool size of 2x2 to downsample the feature maps.
• Two more Conv2D layers with 64 filters each and a kernel size of 3x3, followed by
ReLU activation functions.
• Another MaxPool2D layer with a pool size of 2x2 to downsample the feature maps
again.
• Finally, a Dense layer with 43 units (one for each class in the dataset) and a softmax
activation function, which will output a probability distribution over the classes.
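The layer stack listed above can be sketched in Keras as follows. This is a minimal sketch, assuming a 30x30 RGB input and adding the Flatten step (described later under the fully connected layer) that is needed before the Dense layer; the report's full 12-layer model includes further layers not listed here.

```python
from tensorflow.keras import layers, models

# 30x30 RGB input is an assumption; Flatten is added before the classifier.
model = models.Sequential([
    layers.Conv2D(32, (5, 5), activation="relu", input_shape=(30, 30, 3)),
    layers.Conv2D(32, (5, 5), activation="relu"),
    layers.MaxPool2D((2, 2)),                # downsample the feature maps
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.Conv2D(64, (3, 3), activation="relu"),
    layers.MaxPool2D((2, 2)),                # downsample again
    layers.Flatten(),                        # flatten before the dense layer
    layers.Dense(43, activation="softmax"),  # one unit per class
])
print(model.output_shape)
```

The final output shape is (None, 43): one probability per class for each input image.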
The convolutional layer is the initial layer that extracts the different features from the input images. In this layer, the mathematical convolution operation is performed between the input image and a filter of a specific size MxM: the filter is slid across the input image, and the dot product between the filter and each MxM section of the image is taken. The result is the feature map, which contains information about the image such as its corners and edges. This feature map is then supplied to further layers, which learn a variety of other features from the input image.
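The sliding dot product can be illustrated with a small NumPy sketch (a toy 4x4 image and a 2x2 filter, not the project's actual filters):

```python
import numpy as np

def conv2d(image, kernel):
    """Valid convolution: slide an MxM filter over the image and take the
    dot product at each position, producing a feature map."""
    m = kernel.shape[0]
    h, w = image.shape
    out = np.zeros((h - m + 1, w - m + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + m, j:j + m] * kernel)
    return out

image = np.arange(16, dtype=float).reshape(4, 4)
edge_kernel = np.array([[1.0, -1.0], [1.0, -1.0]])  # toy vertical-edge filter
fmap = conv2d(image, edge_kernel)
print(fmap.shape)  # (3, 3)
```

A 4x4 image convolved with a 2x2 filter yields a 3x3 feature map, matching the (H - M + 1) rule.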
• ReLU Layer
The second part of this step involves the Rectified Linear Unit (ReLU). We will cover ReLU layers and explore how non-linearity works in the context of Convolutional Neural Networks. This is not strictly necessary for understanding CNNs, but a quick lesson does no harm and improves your skills.
The Fully Connected (FC) layer is made up of weights and biases along with neurons, and it connects the neurons between two layers. FC layers usually form the last few layers of a CNN architecture, placed before the output layer. In this step the output of the previous layers is flattened and supplied to the FC layer; the flattened vector is then sent through a few more FC layers, where the mathematical functions are normally performed. The classification procedure gets underway at this point.
4.4 SUMMARY
This chapter deals with the system design. System architecture, data flow
diagrams, use case diagrams, and sequence diagrams are all included. System
architecture is a conceptual framework that describes the organisation, behaviour, and
system parts that will cooperate to construct the whole system. Data movement from
one phase to another is shown using a data flow diagram. This project has only a system
architecture diagram. The next chapter describes the implementation details like
modules or steps in implementation and proposed approach, techniques, and equations.
CHAPTER 5
IMPLEMENTATION
5.1 PREAMBLE
The model is trained on handwritten Telugu character images using the CNN algorithm of deep learning. Once training is completed, OpenCV is used to capture the image, and the characters are recognized based on the extracted parameters. The proposed method provides better results than other methods in terms of recognition, scalability, and robustness to variations.
The proposed approach makes use of the following techniques:
1. CNN
2. Contour-based feature approach
3. Resizing
Here are the steps involved in feature extraction using CNNs for Telugu character recognition:
• Input image: The first step is to input the image of the telugu character into the
CNN.
• Convolutional layers: The input image is then passed through a series of
convolutional layers, which apply filters to the image to detect features such as
edges, corners, and blobs.
• Pooling layers: After each convolutional layer, a pooling layer is usually applied to
reduce the size of the feature maps and make the model more computationally
efficient.
• Fully connected layers: The output of the convolutional and pooling layers is then flattened and passed through one or more fully connected layers, which learn to classify the features into the different character classes.
• Soft-max activation: The final layer of the CNN typically uses a soft-max activation
function to generate a probability distribution over the different telugu character
classes.
• Prediction: The CNN predicts the most likely telugu character class based on the
highest probability value from the soft-max output.
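The last two steps, soft-max activation and prediction, can be sketched in NumPy. The scores below are hypothetical, standing in for the network's final-layer outputs:

```python
import numpy as np

def softmax(logits):
    """Turn raw class scores into a probability distribution."""
    e = np.exp(logits - logits.max())  # subtract max for numerical stability
    return e / e.sum()

# Hypothetical raw scores for four of the character classes.
logits = np.array([1.0, 3.0, 0.5, 2.0])
probs = softmax(logits)

# The prediction is the class with the highest probability.
predicted_class = int(np.argmax(probs))
print(predicted_class)  # 1
```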
The features learned by the CNN during this process can be used to distinguish between the different Telugu character classes and to detect and classify Telugu characters accurately in real time. In summary, feature extraction using CNNs for Telugu character recognition involves passing the input image through a series of convolutional, pooling, and fully connected layers to learn hierarchical representations of the image, which are then used to classify the character into its respective class.
Resizing standardizes the dimensions of the images so that they can be processed efficiently by the machine learning model. Telugu character images captured by cameras can vary widely in size and aspect ratio, so resizing them to a fixed size is necessary to ensure that the model can accurately learn the patterns and features of the characters.
The proposed work has five main modules: data collection, data pre-processing, data splitting, training the model, and classification. First, the Telugu Achulu character recognition dataset is collected from Kaggle, and a few additional images are added to it. The dataset contains both training and validation data. The modules are listed below and discussed further:
1. Data-collection
2. Data pre-processing
3. Data splitting
4. Training the model
5. Classification result
Once the data has been collected, it can be labelled with the corresponding Telugu character class, such as "aa" or "aww". This labelled data is then used to train a machine learning algorithm, such as a convolutional neural network (CNN), to recognize and classify Telugu characters in real time.
• Image Resizing: The images are resized to a fixed size of 30x30 pixels using the resize() method from the PIL.Image library. This is done to ensure that all images have the same dimensions, which is a requirement for training most machine learning models.
• Converting to NumPy array: The images are converted to NumPy arrays using
the np.array() function. This allows the image data to be easily manipulated and
used as input for machine learning models.
• Appending data to a list: The NumPy arrays representing the images are
appended to a list called data. This list will be used to create a NumPy array
containing all the image data.
• Labeling: A list called labels is used to store the class labels of the images. The
label for each image is determined by the directory it is stored in. The labels list
is also converted to a NumPy array.
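The labeling step above can be sketched as follows. The directory and file names here are hypothetical stand-ins for the dataset's class folders, from which each image's label is derived:

```python
import numpy as np

# In the project each class has its own directory, e.g. "dataset/aa/img1.png";
# here we mimic that layout with (directory, filename) pairs.
files = [("aa", "img1.png"), ("aa", "img2.png"), ("ee", "img1.png")]

# Map each directory (class) name to an integer index.
class_names = sorted({d for d, _ in files})
class_index = {name: i for i, name in enumerate(class_names)}

# The label of each image is determined by the directory it is stored in.
labels = np.array([class_index[d] for d, _ in files])
print(labels)  # [0 0 1]
```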
A common technique is to divide the data into two groups, the training set and the testing set. The training set forms the basis for estimating parameters, contrasting models, and carrying out the other operations required to produce a final model; it is used to generate models and feature sets. The test set is used only to produce a final, objective estimate of the model's performance once all of these procedures are complete. The handwritten character recognition dataset is divided in the same way: the models are built and the features fitted on the training data, where the parameters are identified, and performance is determined after the models have been compared. The testing data is then used to test the model and provide the outcome.
Training the machine learning model refers to the process of optimizing its parameters (also known as weights) based on a set of input data and their corresponding output labels. The goal of training is to find the set of parameters that minimizes the difference between the model's predicted output and the actual output labels of the training data. This is done by defining a loss function that measures the difference between the predicted and actual outputs, and then using an optimization algorithm to iteratively adjust the parameters of the model so as to minimize the loss. The dataset is trained using a CNN with 12 layers built on a sequential model: the network is constructed, configured for training, and fitted to the training dataset.
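The loss function mentioned above can be illustrated with a NumPy sketch of categorical cross-entropy, the loss used with soft-max outputs. The label and probability vectors below are hypothetical:

```python
import numpy as np

def categorical_crossentropy(y_true, y_pred, eps=1e-12):
    """Loss between one-hot labels and predicted class probabilities: the
    negative log of the probability assigned to the correct class."""
    y_pred = np.clip(y_pred, eps, 1.0)  # avoid log(0)
    return float(-np.sum(y_true * np.log(y_pred)))

y_true = np.array([0.0, 1.0, 0.0])    # the true class is index 1
good = np.array([0.05, 0.90, 0.05])   # confident, correct prediction
bad = np.array([0.70, 0.20, 0.10])    # confident, wrong prediction

# A correct, confident prediction gives a much lower loss.
print(categorical_crossentropy(y_true, good) <
      categorical_crossentropy(y_true, bad))  # True
```

Minimizing this loss over the training set is what drives the parameter updates during training.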
5.4 SUMMARY
This chapter covered the approaches and modules used in the proposed work in brief detail. As the project's primary framework, the implementation is also the main factor in determining how well the work turns out. Using the best algorithms produces the best outcomes, which is why we compare several algorithms to ensure the project's success.
CHAPTER 6
EXPERIMENTAL RESULTS
6.1 PREAMBLE
The previous chapter discussed the project's implementation and the choices made throughout it. This chapter discusses the measurements obtained from the developed model. The measurements of a few existing systems are also compared in order to determine the efficiency of the proposed system.
6.2 RESULTS
The models are pre-trained on the ImageNet dataset, and by applying the pre-trained models to the desired dataset we obtained the following results. The first step is data collection: images under varying illumination conditions and at various resolutions are collected. In the second step they are pre-processed by resizing the collected training images to fit the proposed model. In the third step, features are extracted for further predictions. In the next step, the developed model is trained with the training dataset, and images are given to the model for testing. The last step evaluates the trained model on the test dataset. As the size of the image increases, the time taken to complete each epoch increases compared with the model's predefined image size.
6.3 GRAPHS
The performance of the models, in terms of accuracy and loss, is shown in the graphs below.
Figure 6.1 below is an accuracy graph: the green line indicates training accuracy and the red line indicates validation accuracy.
Figure 6.2 below is a loss graph: the green line indicates validation loss.
We have collected the characters as a customized dataset. Each character in Telugu Achulu is very similar to the others and difficult to classify. The CNN architecture presented in this project gives better accuracy with a dataset containing samples written with different slants, and is able to recognize them.
Figure 6.5: Login Page
In the next stage the user has to log in; a new user must first register by entering details such as name, mail id, a password (which must be alphanumeric), phone number, and so on. After that, the user selects a handwritten character image from their laptop, and the web page returns the recognition result.
The comparison table lists several methods. The first CNN method has better accuracy; its dataset consists of images, but its ability to recognize the characters was low, and it could not recognize characters written without the proper slant.
The Optical Character Recognition (OCR) method has better accuracy; its dataset contains images of all sizes, and it recognizes characters only when the dataset holds a small number of characters, not a large number of samples. This method could not recognize characters based on their slant.
The Convolutional Neural Network (CNN) method has better accuracy; its dataset covers all image classes, with 270 samples of Telugu characters that it is able to recognize. This method could not recognize characters based on their slant.
The Adaptive Zoning method has better accuracy; its dataset contains 183 images that were well trained and tested, and zoning techniques are used to recognize the characters. This method could not recognize characters based on their slant.
The RNN method has better accuracy; a Self-Organizing Map (SOM) was used to categorize the related features and recognize the characters within the dataset. This method could not recognize characters based on their slant.
6.5 SUMMARY
This chapter summarized the results produced by the different methods. We also discussed the parameters and accuracies of the models, and a comparison table gives a better understanding of the results. The existing methods focused on recognizing characters without accounting for slant; most of them missed characters that were not written with the proper slant or with proper dimensions. The new project is therefore implemented in such a way that it can recognize Telugu characters that are blurred or noisy, and also those written with a slant.
CHAPTER 7
7.1 CONCLUSION
This project, "Handwritten Telugu character recognition using convolutional neural networks", is used for the recognition of handwritten Telugu characters. Handwriting recognition has been one of the challenging and active research areas in the fields of pattern recognition and image processing, and it is difficult to deal with a dataset that has multiple classes. In this project we chose Handwritten Telugu Achulu for recognition using a Convolutional Neural Network. We collected the characters as a customised dataset. Each character in Telugu Achulu is very similar to the others and difficult to classify. The CNN architecture presented in the project gives good accuracy with a dataset of many samples; a high number of samples was collected to improve the accuracy. Finally, our future work will focus on further improving the recognition of Telugu characters.
Our work can be further improved by extending the recognition of handwritten Telugu characters. In the future, these methods can also be implemented for different languages and for whole words. This project mainly focuses on the recognition of Telugu characters.
REFERENCES
[1] B. Soujanya, Suresh Chittineni, T. Sitamahalakshmi, "A CNN based Approach for Handwritten Character Identification of Telugu Guninthalu using Various Optimizers," International Journal of Advanced Computer Science and Applications (IJACSA), Vol. 13, No. 4, 2022.
[2] N. Sarika, N. Sirisala and M. S. Velpuru, "CNN based Optical Character Recognition and Applications," 2021 6th International Conference on Inventive Computation Technologies (ICICT), 2021, pp. 666-672, doi: 10.1109/ICICT50816.2021.9358735.
[3] Vijaya Krishna Sonthi, S. Nagarajan, N. Krishnara, "Automated Telugu Printed and Handwritten Character Recognition in Single Image using Aquila Optimizer based Deep Learning Model," International Journal of Advanced Computer Science and Applications (IJACSA), Vol. 12, No. 12, 2021.
[4] Mayur Bhargab Bora, Dinthisrang Daimary, Khwairakpam Amitab, Debdatta Kandar, "Handwritten Character Recognition from Images using CNN-ECOC," Procedia Computer Science, Volume 167, 2020, pp. 2403-2409.
[5] R. Parthiban, R. Ezhilarasi and D. Saravanan, "Optical Character Recognition for English Handwritten Text Using Recurrent Neural Network," 2020 International Conference on System, Computation, Automation and Networking (ICSCAN), 2020, pp. 1-5, doi: 10.1109/ICSCAN49426.2020.9262379.
[6] N. B. Muppalaneni, "Handwritten Telugu Compound Character Prediction using Convolutional Neural Network," 2020 International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), 2020, pp. 1-4, doi: 10.1109/ic-ETITE47903.2020.349.
[7] B. M. Cheekati and R. S. Rajeti, "Telugu handwritten character recognition using deep residual learning," 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), 2020, pp. 788-796, doi: 10.1109/I-SMAC49090.2020.9243348.
[8] P. Sujatha, D. Lalitha Bhaskari, "Telugu and Hindi Script Recognition using Deep learning Techniques," International Journal of Innovative Technology and Exploring Engineering (IJITEE), ISSN: 2278-3075, Volume-8, Issue-11, September 2019.
[9] D. Trivedi, S. Satra and P. M. Pimpale, "Handwritten Character Recognition Using Deep-Learning," 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT), 2018, pp. 772-775, doi: 10.1109/ICICCT.2018.8473291.
[10] L. D'souza and M. Mascarenhas, "Offline Handwritten Mathematical Expression Recognition using Convolutional Neural Network," 2018 International Conference on Information, Communication, Engineering and Technology (ICICET), 2018, pp. 1-3, doi: 10.1109/ICICET.2018.8533789.
[11] Inuganti S and Ramisetty RR, "Preprocessing of online handwritten Telugu character recognition," International Journal of Advanced and Applied Sciences, 4(7): 179-189, 2017.
[12] S. D. Prasad and Y. Kanduri, "Telugu handwritten character recognition using adaptive and static zoning methods," 2016 IEEE Students' Technology Symposium
APPENDIX A
SOURCE CODE
# Import required libraries
from flask import Flask, request, render_template, send_from_directory, session, flash
import pandas as pd
import string
import os
import smtplib
import mysql.connector
import numpy as np
import sys
import base64
import io
import re
# file paths
import PIL.Image
app = Flask(__name__)

@app.route("/")
def index():
    return render_template("index.html")

@app.route("/user")
def user():
    return render_template("user.html")

@app.route("/reg")
def reg():
    return render_template("ureg.html")
@app.route('/regback', methods=["POST"])
def regback():
    if request.method == 'POST':
        name = request.form['name']
        email = request.form['email']
        pwd = request.form['pwd']
        pno = request.form['pno']
        # email = request.form["email"]
        mydb = mysql.connector.connect(
            host="localhost",
            user="root",
            passwd="",
            port=3308,
            database="telugu_achulu")
        mycursor = mydb.cursor()
        # (the lines building `result`, `sql` and `val` are elided in the report)
        email1 = result['email'].values
        print(email1)
        if email in email1:
            mycursor.execute(sql, val)
            mydb.commit()
            print("Successfully Registered")
@app.route('/userlog', methods=['POST', 'GET'])
def userlog():
    global user
    if request.method == "POST":
        username = request.form['email']
        password1 = request.form['pwd']
        print('p')
        mydb = mysql.connector.connect(host="localhost", user="root",
                                       port=3308, passwd="",
                                       database="telugu_achulu")
        cursor = mydb.cursor()
        print('q')
        # (the `sql` query string is elided in the report)
        x = cursor.execute(sql)
        print(x)
        results = cursor.fetchall()
        print(results)
        if len(results) > 0:
            print('r')
            # session['user'] = username
            # session['id'] = results[0][0]
            # print(id)
            # print(session['id'])
        else:
            return render_template('user.html')
@app.route("/userhome")
def userhome():
    return render_template("userhome.html")

@app.route("/upload", methods=["POST", "GET"])
def upload():
    print('a')
    if request.method == 'POST':
        myfile = request.files['file']
        fn = myfile.filename
        mypath = os.path.join('images/', fn)
        myfile.save(mypath)
        # import tensorflow as tf
        import numpy as np
        # mypath = "images/Covid (5).png"
        new_model = load_model("models/MobileNet.h5")
        # (loading of `test_image` and the `classes` list is elided in the report)
        test_image = image.img_to_array(test_image)
        result = new_model.predict(test_image)
        prediction = classes[np.argmax(result)]
        return render_template("template.html", image_name=fn,
                               text=prediction)

@app.route('/upload/<filename>')
def send_image(filename):
    pass  # (function body elided in the report)

@app.route('/upload1')
def upload1():
    return render_template("upload.html")
model.compile(optimizer='Adam', loss='categorical_crossentropy',
              metrics=['accuracy'])
step_size_train = train_generator.n // train_generator.batch_size
history = model.fit_generator(generator=train_generator,
                              steps_per_epoch=step_size_train,
                              epochs=1)
model.save("models/MobileNet.h5")
plt.style.use("ggplot")
plt.figure()
plt.plot(history.history['accuracy'], label='Training accuracy',
         color='green')
# plt.plot(history.history['val_accuracy'], label='validation accuracy')
plt.xlabel('# Epochs')
plt.ylabel('Accuracy')
plt.legend()
plt.savefig("models/mobile.png")
plt.show()
acc = history.history['accuracy'][-1]
APPENDIX B
SCREENSHOTS
Figure B1 shows the outcome of importing the required libraries; it is the output screenshot of that input block.
The data in figure B2 shows the loss and accuracy percentages obtained on validation of the given 62734 training samples.
B.3 GRAPHS
B.3.1 Accuracy:
The graph above represents the accuracy level of the given training model.
The graph below shows the loss while training the data in the model (figure B3.2).
The accuracy obtained with the test data is shown in figure B3.3 above.
B.3.4 Result:
The image below is the input given after training the model, and the recognized Achulu character is the output of the model.
APPENDIX C
STUDENT CONTRIBUTION
2 Literature ✓ ✓ ✓ ✓
Survey
3 Problem ✓ ✓ ✓ ✓
Formulation
4 Requirement ✓ ✓
Gathering
5 Designing ✓ ✓ ✓
6 Implementation ✓ ✓ ✓ ✓
7 ✓ ✓
Documentation ✓ ✓ ✓
APPENDIX D
D1
Sasi Institute of Technology & Engineering
Handwritten Telugu Achulu Character Recognition Using Convolutional Neural Networks
D2
D3
D4