Professional Documents
Culture Documents
Data Extraction From Hand Filled Forms Using Ocr
Data Extraction From Hand Filled Forms Using Ocr
1
Contents
OCR
RPA OCR
Flowchart of Proposed work
Proposed Methodology
Limitations
Future work
References
A well-educated person can easily glance at a piece of paper and read its
contents, but having a computer do the same is far more difficult than most
people believe.
To identify each individual letter, one must first have a digital image of the
text, process it to remove extraneous information, and then use a computer to
locate and segment the characters.
Only then will it be able to generate a series of machine-readable characters
as an output[1].
This procedure is known as optical character recognition (OCR).
Step-1: Open UiPath studio and create a project with a name and
description.
Step-2: Click on open main workflow.
Step 3: Import both Uipath.documentunderstanding.ML.Activities and
Uipath.IntelligentOCR.Activities packages from manage packages.
Step 4: Drag and drop the Sequence and Load Taxonomy activities to the
main workflow window and create Taxonomy variable.
Figure 4. Workflow
[1] - K. A. Barchard and L. A. Pace, “Preventing human error: The impact of data entry
methods on data accuracy and statistical results,” Comput. Human Behav., vol. 27, no. 5, pp.
1834–1839, 2011, doi: 10.1016/j.chb.2011.04.004.
[2] - https://www.edureka.co/blog/what-is-robotic-process-automation/
[3] - https://www.nice.com/guide/rpa/rpa-ocr-elevating-process-automation
[4] - https://medium.com/@CereLabs/the-technology-that-is-better-than-ocr-354e989cb270
[5] - https://www.information-age.com/optical-character-recognition-tools-ocr-ai-123479324/