AAI Document Automation
March 1, 2023
Legal Notices
© 2023 Automation Anywhere, Inc. All Rights Reserved.
See the list of Automation Anywhere trademarks at https://www.automationanywhere.com/trademark.
All other customer or partner trademarks or registered trademarks are owned by those companies.
The information contained in this documentation is proprietary and confidential. Your use of this information
and Automation Anywhere Software products is subject to the terms and conditions of the applicable
End-User License Agreement and/or Nondisclosure Agreement and the proprietary and restricted rights
notices included therein.
You may print, copy, and use the information contained in this documentation for the internal needs of
your user base only. Unless otherwise agreed to by Automation Anywhere and you in writing, you may
not otherwise distribute this documentation or the information contained here outside of your organization
without obtaining Automation Anywhere’s prior written consent for each such distribution.
Examples and graphics are provided only as reference information and might not match your site.
Content
Document Automation
Document Automation architecture
Document Automation FAQ
Intelligent Document Processing solutions feature comparison matrix
Set up your Document Automation environment
Create a custom role for Document Automation
Document Automation users
Configure key for Google Document AI
Create a learning instance in Document Automation
Data extraction in Document Automation
About the AARI process in Document Automation
Process documents in Document Automation
Formula validation
Pattern validation
Connect learning instance
Extracting data from check boxes
Publish the learning instance
Build a bot to upload documents to Document Automation
Deploy the learning instance assets
Using the AARI Task Manager Validator for Document Automation
Validate documents in Document Automation
Export/Import learning instances
Using Document Automation for Standard Forms
Evaluate your use case
Graphical demonstration of configuring Standard Forms in Document Automation
Set up extraction service
Configure Document Automation with Microsoft Azure Form Recognizer
Create standard form in Document Automation
Create a learning instance for standard forms
Transfer standard forms learning instance
Document Automation
Document Automation is the new Cloud-native intelligent document processing solution that business users
can set up to automatically read and process documents quickly using pretrained models and validation
feedback.
Document Automation is fully integrated into Automation 360: Document Automation is installed as part of
the Control Room, RPA bots are used to extract semi-structured data to automate document-centric
business processes, and Automation Anywhere Robotic Interface manages the end-to-end extraction
process and validation tasks.
The Document Automation workflow enables users to scale their document processing operation. Users
create learning instances that use Automation Anywhere or Google Document AI pre-trained models to
process invoices, utility bills, and receipts. Once a learning instance is running in production, it automatically
improves extraction accuracy based on feedback from manual validation.
To compare Document Automation features side by side with Automation 360 IQ Bot, see Intelligent
Document Processing solutions feature comparison matrix.
Document Automation is installed simultaneously with the Control Room and shares the Control Room
database. There are no additional installation tasks for Control Room Cloud customers.
To get started using Document Automation, you must first configure users, roles, and devices, and connect
the Control Room with Automation Anywhere Robotic Interface:
Workflow map: Click the following schematic image to view the Document Automation workflow in an
interactive visual format:
The following is an overview of the end-to-end process to create, configure, and publish a learning instance
in Document Automation:
Related concepts
Document Automation users
Related tasks
Set up your Document Automation environment
Related reference
Document Automation FAQ
Related information
Document Automation Developer Training (A-People login required)
The following architecture graphic demonstrates the flow of data in the intelligent document processing
workflow for the two possible deployment scenarios:
• The Learning instance creator uploads documents to the Document Automation interface, for example
to test the extraction capabilities of the learning instance.
• An Unattended Bot Runner deploys the Scheduler Bot to upload and process the documents.
The graphic also shows the scenarios where data is sent to third-party products, such as for Google
Document AI or Standard Forms extraction.
For customers who configured Document Automation with Google Document AI licenses purchased
from Google: Documents uploaded to Document Automation for processing in Google Document AI are
sent to the Apigee gateway that you configured when you created the processor in Google Document AI.
For customers who purchased Google Document AI product licenses from Automation Anywhere:
Google Document AI uses the Apigee API gateway to manage the document processing APIs. When a
document uploaded to Document Automation is sent to Google Document AI, data is sent to the Apigee
gateway that corresponds with your AWS or Google Cloud Platform region.
• US-East
• US-Central
US • US-West
• CAN
• LATAM
• EU-West1
• EU-West4
• ME
• S-Africa
Europe
• AP-Aus
• AP-Singapore
• AP-JP
• India
Related concepts
Data extraction in Document Automation
About Automation 360
Related reference
About the AARI process in Document Automation
Product FAQ
Does Document Automation have feature parity with Automation 360 IQ Bot?
Document Automation supports the following features from Automation 360 IQ Bot:
• Data extraction from text, number, date time, and check box fields
• Validation rules based on patterns, formulas, lists, and statements
Note: If you use Document Automation in Firefox, the user interface might not behave
as expected.
Note: Extraction for French, German, and Spanish language documents is currently in
preview. Extraction results will improve in future releases.
You can process documents in other languages by creating and training a learning instance
in Automation 360 IQ Bot and connecting it with Document Automation to process documents using
Document Automation technology and workflow. For more information, see Connect learning
instance.
What file formats are supported?
Document Automation can process documents in the following file types:
• PDF
• JPG
• JPEG
• PNG
• TIF
• TIFF
Note: To configure the Control Room with your Google Document AI license key, you
must do these steps: Configure key for Google Document AI. These steps are not necessary
if you purchased Doc AI Document licenses through Automation Anywhere.
We provide a recommendation on configuring users with the minimum necessary roles and device
licenses to perform specific tasks in Document Automation users.
Can learning instances be migrated from Automation 360 IQ Bot to Document Automation?
Users can connect their learning instances from Automation 360 IQ Bot to their Document
Automation environment, which allows them to process documents using the Document Automation
technology without having to rebuild the learning instances. For more information, see Connect
learning instance.
Are customers required to maintain their Automation 360 IQ Bot environments?
If you have connected a learning instance from Automation 360 IQ Bot to Document Automation, do
not delete the learning instance from Automation 360 IQ Bot since it stores the training data.
We recommend you maintain Automation 360 IQ Bot in a version compatible with the associated
Control Room.
What learning instance fields are supported out of the box?
Document Automation offers over 35 of the most common invoice fields, with the option to configure
custom fields. To see all the possible fields when you are creating a learning instance, click Show
unused fields.
Extraction capabilities
Handling document variations
• Automation 360 IQ Bot: Classifier relies on document layout and training. Documents uploaded with a new layout are sent to a new document group, which requires training. Some docs could not be classified.
• Document Automation: Layouts are identified using an object detection model to find key-value pairs. It is not necessary to configure all the fields in a learning instance before uploading documents.
End-to-end automation
Customizable document processing workflow
• Automation 360 IQ Bot: No
• Document Automation: Yes
Security
Procedure
2. Upload the Document Extraction package to the Control Room: Upload the IQ Bot packages to the
Control Room
3. Create a custom role that allows users to check in and check out bots, manage packages, and validate
documents: Create a custom role for Document Automation
4. Create the users necessary to complete the end-to-end process of creating and publishing a learning
instance: Document Automation users
Note: When creating the Unattended Bot Runner user, provide the device
credentials. You will connect the Unattended Bot Runner to the device in the next step.
5. Configure the unattended Bot Runner user device and device pool to allow the learning instance to
function in public mode:
a. Using Remote Desktop Protocol (RDP), open the virtual machine where the unattended Bot
Runner will run.
b. Log in to the Control Room as the Document Automation Admin user you created
previously.
c. Navigate to Manage > Devices and click Connect local device. Follow the steps in the wizard to
connect the Control Room to your device.
d. Click Refresh to verify that your device is connected.
e. Click Create device pool and perform the following steps:
i. Enter a name for the device pool.
ii. Click Devices and then the arrow to move the device to the Selected column.
iii. Click Consumers and then the arrow to move aari-document-processor to the
Selected column.
iv. Click Create device pool.
You can perform the following steps in the virtual machine or you can return to your device. If you perform
the following steps on your device, ensure that you log in to the Control Room as the Document
Automation Admin user.
6. Open Automation Anywhere Robotic Interface to connect it with the Control Room.
a. Navigate to the AARI interface by adding /aari after your assigned Control Room URL.
b. Navigate to Manage > Process > Global scheduler and click Edit.
c. Select the Scheduler user that you created previously and click Save.
The following message is displayed: Secure connection between AARI Web and
Enterprise Control Room established successfully
7. Create a team in AARI to grant the Validator user access to the documents awaiting validation.
a. Navigate to Manage > Team > Create new team.
b. Enter a team name.
c. Select the Shared request visibility.
d. In the Users field, add the Validator user that you created previously and click Save.
Next steps
If users will create learning instances that use the Google Document AI model and you did not purchase
Google Document AI licenses through Automation Anywhere, follow the steps to Configure key for
Google Document AI.
Log in to the Control Room as the Learning instance creator user and begin creating learning
instances: Using Document Automation
Related tasks
Installing additional licenses
Related reference
Automation 360 licenses
Procedure
Next steps
Create users and assign this custom role to them. For more information about the required users, refer to
Document Automation users.
The following table contains information about each type of user that interacts with Document Automation.
It includes a description of the tasks that each user performs, the required roles, permissions, and licenses,
and links to the pages most relevant to that user. Use this page to guide you in creating the users in the table
below: Create a user
Note: Some users require a custom role in Document Automation. This role grants users
permissions to check in and check out bots, manage packages, and validate documents. For
instructions on how to create a custom role, see Create a custom role for Document Automation.
User: Learning instance creator
Description: Can create, edit, and delete learning instances, upload documents for processing, and check in or check out learning instances.
Roles: AAE_IQ Bot Services, AAE_Basic, and aari-document-processor custom role
License: Bot Creator
Related links: Create a learning instance in Document Automation; Process documents in Document Automation; Publish the learning instance

User: Standard Forms model creator
Description: Creates an extraction model.
Roles: AAE_IQ Bot Services or AAE_IQ Bot Admin
License: Document Automation Standard Forms (Number of pages)
Related links: Create standard form in Document Automation

User: Document Automation Admin
Description: Deploys the AARI process and RPA bots to the unattended Bot Runner, manages AARI teams, and assigns validation tasks to a team
Roles: AAE_Admin, AAE_Pool Admin, AAE_Locker Admin, and AAE_Robotic Interface Manager
License: --
Related links: Set up your Document Automation environment; Create an AARI team and assign team roles to members; Publish the learning instance

User: [...]
Description: [...] instances in public mode to upload, process, and download documents. Note: This user is not associated with a real person.
Roles: [...] aari-document-processor custom role. Note: Provide the device credentials when you configure this user. You will connect the Unattended Bot Runner user to the device in one of the steps in Set up your Document Automation environment.
License: [...] Runner, Automation Anywhere Robotic Interface, and the product licenses necessary to upload documents to learning instances in production: IQ Bot pages, standard forms, or Document AI
Related links: Document processing product licenses

User: Validator
Description: Validates documents that were processed by a learning instance in public mode.
Roles: AAE_IQ Bot Validator and AAE_Robotic Interface User
License: Automation Anywhere Robotic Interface
Related links: Validate documents in Document Automation
Related tasks
Create a user
Create a role
Related reference
Automation 360 licenses
Prerequisites
• If you have not done so already, complete the steps to Set up your Document Automation
environment.
• Log in as the Admin user type. For more information about the user types, refer to Document
Automation users.
You will assign this role to a Credential Vault locker, which will allow the Learning instance creator
user to securely provide the Google Document AI key to the Extraction bot, allowing the learning instance
to send documents to Google Document AI for extraction.
Procedure
Create a custom role to access the Credential Vault locker that holds the Google Document AI key:
Assign the role to the Learning instance creator and Unattended Bot Runner user types.
Create a locker to store the key and provide access to other users.
14. In the Consumers tab, select the doc-ai-credential-role and click the right arrow to move
it to the Selected column.
15. Click Create locker.
Next steps
Related tasks
Installing additional licenses
Related reference
Automation 360 licenses
Prerequisites
• To create a learning instance, you must be a Learning instance creator user. See Document
Automation users.
• To create a learning instance that uses a Google Document AI model, you must create a credential and
locker to hold your security token. See Create credential and Create locker.
Note: When configuring the locker, you must add the aari-document-processor
custom role as a locker consumer.
• The default OCR is ABBYY FineReader Engine. Alternatively, Cloud Control Room users can create a
learning instance to process documents in Google Vision OCR.
Watch this video for the complete end-to-end process of creating a learning instance:
Procedure
1. From the Control Room home page, navigate to Manage > Learning Instances > Create Learning
Instance.
2. Enter a name and description for the learning instance.
Document Automation does not allow duplicate learning instance names, so the name you provide
must be unique.
3. Select the document type: Invoice, User-defined, Utility Bill, or Receipt.
Use the user-defined document type to process documents that are visually similar to invoices, such as
purchase orders and sales orders, which contain key-value pairs and a table structure. In this document
type, you create and configure all of the form and table fields.
4. Select the language.
Document Automation supports English, Dutch, French, German, Italian, Portuguese (Brazilian), and
Spanish.
Note: Extraction for French, German, and Spanish language documents is currently in
preview. Extraction results will improve in future releases.
We recommend that you open a sample document side by side with the Control Room window as you
configure the form and table fields.
Note:
aliases in the Custom aliases field. See the following video for a demonstration of creating a custom
alias: Animation of editing a field
Note: Custom fields are not available for learning instances that use the Google
Document AI provider.
Mouse over the menu icon to the right of a field to access the up/down arrows. Use the arrows to
rearrange the order of the fields for a more efficient manual validation. The order of the fields does not
impact extraction.
To learn more about the other field attributes, review the table in the next step.
10. To add a field, click Add a field and complete the following fields:
Option: Description
Field name: Enter a field name that begins with an alphabetical character (A-Z or a-z).
If the data in the field does not match the data type, the document is sent to the validation queue.
Validation rules: Depending on data type, create rules using patterns, formulas, lists, and statements such as starts or ends with.
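As a rough illustration of the field name rule, the following Python sketch checks only that a name begins with a letter (A-Z or a-z). The function name is hypothetical and not part of the product; the Control Room may enforce further naming rules that are not described here.

```python
import re


def is_valid_field_name(name: str) -> bool:
    """Return True if the name starts with A-Z or a-z (illustrative check only)."""
    # re.match anchors at the start of the string, so only the first
    # character is tested; the rest of the name is not inspected.
    return bool(re.match(r"[A-Za-z]", name))


for candidate in ("Invoice_Total", "2nd_line", ""):
    print(repr(candidate), is_valid_field_name(candidate))
```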
When a new learning instance is created, the Control Room creates a folder with the same name as the
learning instance in the Automation > Document Workspace folder. The folder contains two bots
(extraction and download), a process, and a form.
• Process: Manages the process using if/else scenarios through which Document Automation extracts
data from uploaded documents, assigns documents to users for validation, and downloads the
extracted data. To learn more, see About the AARI process in Document Automation
• Extraction bot: Extracts data from defined fields in the uploaded documents.
• Download bot: Downloads the extracted data to a specific folder on the device or shared network.
• Form: Defines the input parameters that are sent to the process. Input parameters include the learning
instance name, uploaded file, and output file path.
Next steps
Upload documents to the learning instance, fix validation errors, and verify the extracted data: Process
documents in Document Automation
When a learning instance is created, the user has the option to enable this feature to send feedback to the
learning instance based on user-provided changes in the Validator. In Document Automation, learning
instances running in production mode can continuously "learn" whenever a user resizes or relocates the
extraction region in the Validator.
Note: This feature is available only for Automation Anywhere pre-trained models.
The following graphic provides a visual overview of the process by which learning instances continuously
receive feedback from validation:
If the learning instance cannot extract the data, the system evaluates whether the document contains
an unfamiliar layout.
3. If the learning instance does not recognize the document layout (new layout), the document is sent for
manual validation where the user "teaches" the learning instance how to extract the data by setting the
extraction region.
4. The extracted values are downloaded to a file in the Success folder and the changes are collected in
a feedback file, which is sent to the feedback database.
Note:
• Feedback is only collected when the user changes the extraction region. If the user
manually inputs text, the system does not collect feedback.
• The feedback file only contains data on the field location to improve extraction
accuracy for subsequent documents.
If the learning instance recognizes the cluster, it retrieves previous feedback from the feedback
database and uses it to extract data.
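The feedback rules in the note above can be sketched in Python. This is a simplified model, not the product's implementation: the `FieldEdit` structure, the `(x, y, width, height)` region format, and the dictionary layout of a feedback record are all assumptions made for illustration. It shows that feedback is produced only when the extraction region changes, and that the record carries only the field location.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical region format: (x, y, width, height) in pixels.
Box = tuple[int, int, int, int]


@dataclass
class FieldEdit:
    field_name: str
    original_region: Box
    new_region: Box
    typed_value: Optional[str] = None  # text the validator typed manually, if any


def build_feedback(edits: list[FieldEdit]) -> list[dict]:
    """Collect feedback only for fields whose extraction region was changed."""
    feedback = []
    for edit in edits:
        if edit.new_region != edit.original_region:
            # Only the field location is recorded, never the typed text.
            feedback.append({"field": edit.field_name, "region": edit.new_region})
    return feedback


edits = [
    # Region resized by the validator: produces feedback.
    FieldEdit("invoice_number", (10, 10, 80, 20), (12, 10, 60, 20)),
    # Value typed manually, region untouched: produces no feedback.
    FieldEdit("invoice_total", (200, 300, 90, 20), (200, 300, 90, 20), typed_value="118.00"),
]
print(build_feedback(edits))
```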
As of Automation 360 v.27, you can train a learning instance to extract data from a cell that contains more
than one field.
For example, if a product description column also includes item number, you can outline the item number in
the Validation interface. When the learning instance processes subsequent documents, it will extract the item
number and ignore the product description.
Follow this process to configure a learning instance to extract specific values from a cell:
1. Create a learning instance using an Automation Anywhere pretrained model and select the option to
send validation feedback: Create a learning instance in Document Automation
2. Upload a sample document: Process documents in Document Automation
3. In the Validator, locate the field and redraw the box to only surround the values that you want to
extract.
4. After you click Submit, the information on the new extraction region is sent to the feedback database.
5. Upload more documents to test the accuracy of extraction. When you are satisfied with the results,
proceed to preparing the learning instance to run in production: Publish the learning instance
Document Automation extraction is based on object detection. During document processing, the extraction
engine identifies objects, or key-value pairs of the field and associated value. The engine creates a
"fingerprint" of the document, which stores the sequence of the objects and each object's location in the
document.
When a document is processed, if the engine recognizes the keys and their locations, the document is
classified and extracted based on that existing fingerprint. Otherwise, the engine saves a new fingerprint of
the keys and their locations.
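To make the fingerprint idea concrete, here is a minimal Python sketch. It is a conceptual illustration under stated assumptions, not the extraction engine's actual algorithm: the `KeyObject` structure, the normalized coordinates, the position tolerance, and the in-memory `store` list are all hypothetical. A document is classified against an existing fingerprint when its keys appear in the same sequence at approximately the same locations; otherwise a new fingerprint is saved.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KeyObject:
    key: str   # label of the key-value pair, for example "Total"
    x: float   # hypothetical normalized position (0.0 to 1.0)
    y: float


Fingerprint = tuple[KeyObject, ...]


def matches(doc: Fingerprint, known: Fingerprint, tolerance: float = 0.05) -> bool:
    """True if both fingerprints hold the same keys, in order, at nearby locations."""
    if len(doc) != len(known):
        return False
    return all(
        d.key == k.key and abs(d.x - k.x) <= tolerance and abs(d.y - k.y) <= tolerance
        for d, k in zip(doc, known)
    )


def classify(doc: Fingerprint, store: list[Fingerprint]) -> int:
    """Return the index of the matching fingerprint, saving a new one if none match."""
    for index, known in enumerate(store):
        if matches(doc, known):
            return index
    store.append(doc)
    return len(store) - 1


store: list[Fingerprint] = []
layout_a = (KeyObject("Invoice No", 0.10, 0.10), KeyObject("Total", 0.70, 0.90))
layout_a_shifted = (KeyObject("Invoice No", 0.12, 0.10), KeyObject("Total", 0.70, 0.92))
layout_b = (KeyObject("Account", 0.10, 0.20), KeyObject("Amount Due", 0.60, 0.80))

first = classify(layout_a, store)           # unknown layout, saved as a new fingerprint
second = classify(layout_a_shifted, store)  # recognized despite small shifts
third = classify(layout_b, store)           # different keys, saved as a new fingerprint
print(first, second, third, len(store))
```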
When a learning instance is in production, each incoming document creates a new request, which triggers
the extraction process. If a document requires manual validation, AARI creates a validation task and assigns it
to a validation team.
After the documents are processed, the Download bot downloads the extracted data to a file in the
Success folder.
1. Start: The AARI form that defines the inputs, such as the learning instance name, uploaded
documents, and output folder.
2. Bot Task ExtractionBot: Executes the RPA bot that processes input documents to extract data from
them.
3. Document Validator Task: Opens the Validation user interface where the user can manually validate
the fields in the document.
4. Documents require validation: The ExtractionBot tried to process the documents then sent the
documents for validation, the documents were successfully validated, and the DownloadBot
downloaded the extracted data to the Success subfolder.
5. Invalid documents: The previous process was interrupted at validation, where a user marked certain
documents as invalid. These documents were downloaded to the Invalid subfolder.
6. Reprocessed documents: The previous process was interrupted at validation because the user clicked
Reprocess. Documents were sent back to the ExtractionBot for reprocessing.
7. Documents are successfully processed: The ExtractionBot successfully processed the documents,
no documents required validation, and the DownloadBot downloaded the extracted data.
8. Processing failed: The ExtractionBot failed to process the documents (for example: invalid format or
could not read the file) and the file is saved to the Failed subfolder.
9. Deployment failed: The Control Room failed to deploy the ExtractionBot (for example: lost server
connection). The Control Room makes three attempts.
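The outcomes listed above can be summarized as a mapping from outcome to output subfolder. The following Python sketch is only an illustration: the `Outcome` names and the `destination` helper are hypothetical, while the Success, Invalid, and Failed subfolder names come from the process description. Reprocessed documents have no destination folder because they return to the ExtractionBot.

```python
from enum import Enum
from pathlib import PureWindowsPath
from typing import Optional


class Outcome(Enum):
    PROCESSED = "processed"   # extracted without needing validation
    VALIDATED = "validated"   # sent to validation and successfully validated
    INVALID = "invalid"       # marked as invalid by a validator
    REPROCESS = "reprocess"   # validator clicked Reprocess
    FAILED = "failed"         # ExtractionBot could not process the file


SUBFOLDER = {
    Outcome.PROCESSED: "Success",
    Outcome.VALIDATED: "Success",
    Outcome.INVALID: "Invalid",
    Outcome.FAILED: "Failed",
}


def destination(output_root: str, outcome: Outcome) -> Optional[PureWindowsPath]:
    """Return the subfolder for an outcome, or None if the document is re-extracted."""
    folder = SUBFOLDER.get(outcome)
    if folder is None:
        return None  # reprocessed documents go back to the ExtractionBot
    return PureWindowsPath(output_root) / folder


print(destination(r"C:\Output", Outcome.VALIDATED))
print(destination(r"C:\Output", Outcome.INVALID))
print(destination(r"C:\Output", Outcome.REPROCESS))
```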
Warning: We do not recommend deleting the process. If you unintentionally delete a process
associated with a learning instance, do the following steps to recover the process:
1. Navigate to Manage > Learning Instances and mouse over the actions menu to the right of
the learning instance.
2. Click Export.
3. Mouse over the actions menu to the right of the learning instance and click Delete.
4. Click Import at the top of the page.
5. Click Browse and select the .dw file you exported.
6. When prompted, click Overwrite to generate a new process for the learning instance.
Prerequisites
• If you have not done so already, Create a learning instance in Document Automation.
• Verify that your device is connected to the Control Room: Install Bot Agent and register device
• If the learning instance uses a Google Document AI model and you did not purchase Google
Document AI licenses through Automation Anywhere, you must provide your Google Document AI
credentials to the Extraction bot. See Configure key for Google Document AI
• If the learning instance uses an Automation Anywhere model, ensure that each file is 50 MB or less.
If the learning instance uses a Google Document AI model, ensure that each file is 20 MB or less, with
a maximum of 5 pages.
• Ensure that the sample documents are in one of the following supported document types:
• PDF
• JPG
• JPEG
• PNG
• TIF
• TIFF
• The default output format for the extracted data is CSV file. To change the output to JSON, see
Change output format from CSV to JSON.
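Before uploading, it can help to screen sample files against the prerequisites above. The following Python sketch is a hypothetical pre-flight check, not a product feature: the function name and model identifiers are invented for illustration, and it assumes 1 MB means 1024 × 1024 bytes. It checks the file extension and size limit only; the 5-page limit for Google Document AI models is not checked because that requires opening the file.

```python
from pathlib import Path

MB = 1024 * 1024  # assumption: the documented limits use binary megabytes

SUPPORTED_EXTENSIONS = {".pdf", ".jpg", ".jpeg", ".png", ".tif", ".tiff"}

# Hypothetical model identifiers mapped to the documented per-file size limits.
MAX_BYTES = {
    "automation_anywhere": 50 * MB,
    "google_document_ai": 20 * MB,
}


def check_file(name: str, size_bytes: int, model: str) -> list[str]:
    """Return a list of problems found; an empty list means the file passes."""
    problems = []
    extension = Path(name).suffix.lower()
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported file type: {extension or '(none)'}")
    if size_bytes > MAX_BYTES[model]:
        problems.append(f"file exceeds {MAX_BYTES[model] // MB} MB limit")
    return problems


print(check_file("invoice.PDF", 10 * MB, "google_document_ai"))
print(check_file("scan.bmp", 1 * MB, "automation_anywhere"))
print(check_file("big.tif", 30 * MB, "google_document_ai"))
```

In practice, the size would come from `Path(path).stat().st_size` for a real file on disk.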
Perform the following steps to upload sample invoices to the learning instance to test the data extraction
capabilities of the learning instance.
Procedure
b. In the Process Documents window, click Browse to select the files to upload.
c. In the Download data to field, enter the file path that will hold extracted data.
When the process runs, it creates the following three folders in the provided file path:
• Success : Contains the extracted data in the specified format (CSV or JSON).
• Invalid : Holds documents marked invalid.
• Failed : Holds documents that could not be processed.
You can provide an output folder path based on one of the following options:
• Option 1: The local device path if you have set up document processing and validation on
the same device.
This option is typically used when you are testing the learning instance.
• Option 2: The shared folder path if you have set up distributed validation on separate
devices.
This option is typically used for published learning instances. For example, \\10.239.192.60\Sharepath\Output.
If there is a value next to the Validate documents link, you must manually validate the document fields.
Otherwise, proceed to step 3.
Note: When documents are awaiting validation, if you edit the learning instance,
click Reprocess to reattempt extraction.
• To skip a document without correcting errors, click Skip to proceed to the next document
in the validation queue.
• To remove a document that cannot be processed, click Mark as Invalid.
d. After you make the necessary corrections, click Submit so that the document can finish
processing.
The next document in queue appears. When all the documents are corrected, the system
displays a message stating that no more tasks are available.
e. Close the tab to return to the Learning Instances page.
If the learning instance repeatedly cannot find a field or if characters are not correctly recognized (such as
the letter "l" extracted as the number "1"), you can try changing the OCR to Google Vision OCR.
Next steps
Build a bot that uploads documents from a source folder to the learning instance. Then, publish the learning
instance assets (process, form, and bots) to the public repository so that the learning instance can be used in
public mode to extract data from real documents, and validators can manually validate documents: Publish
the learning instance
Related concepts
Extracting data from check boxes
Data from successfully processed documents is available as a CSV file, by default. Change the output format
by editing the download data action before you begin uploading documents.
Prerequisites
If you have not done so already, Create a learning instance in Document Automation.
Procedure
1. Navigate to Automation > Private tab > Document Workspace and click the folder with the same
name as the learning instance to configure the assets.
2. Open the process.
You must update the output format for DownloadBot-ValidationSuccess and DownloadBot-
Success :
Next steps
Formula validation
Use basic arithmetic, comparative, logical, and functional operations.
Mathematical Operators
Comparative Operators
Logical Operators
Functional Operators
SUM
Syntax: SUM(n1,n2,....,nN)

SUB
Description: Subtraction: gives result of subtraction of one or more field/column/fixed-number values from the first specified value
Syntax: SUB(n1,n2,....,nN)
Example: SUB(SUB_TOTAL, TOTAL_DISCOUNT, ADJUSTMENTS) ==> equivalent to SUB_TOTAL minus TOTAL_DISCOUNT minus ADJUSTMENTS

MUL
Syntax: MUL(n1,n2,....,nN)

DIV
Description: Division: gives result of division of one or more field/column/fixed-number values from the first specified value
Syntax: DIV(n1,n2,....,nN)
Example: DIV(AMOUNT, SHIPPED_BOXES, UNIT_PRICE) ==> equivalent to AMOUNT divide-by SHIPPED_BOXES divide-by UNIT_PRICE

COLSUM
Syntax: COLSUM("<table-name>", "<expression to evaluate for each row of specified table-name>")
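The left-to-right behavior of SUB and DIV can be checked with a small Python model. This is a sketch of the documented semantics, not the product's formula engine. In particular, the `COLSUM` stand-in takes a list of row dictionaries and a Python callable, which is an assumption made for illustration; the real operator takes a table name and an expression string.

```python
from functools import reduce
import operator


def SUB(first: float, *rest: float) -> float:
    """Subtract each remaining value from the first, left to right."""
    return reduce(operator.sub, rest, first)


def DIV(first: float, *rest: float) -> float:
    """Divide the first value by each remaining value, left to right."""
    return reduce(operator.truediv, rest, first)


def COLSUM(rows: list[dict], expression) -> float:
    """Evaluate the expression for each table row and add up the results."""
    return sum(expression(row) for row in rows)


# SUB(SUB_TOTAL, TOTAL_DISCOUNT, ADJUSTMENTS) with sample values
print(SUB(100.0, 10.0, 5.0))

# DIV(AMOUNT, SHIPPED_BOXES, UNIT_PRICE) with sample values
print(DIV(120.0, 4.0, 5.0))

# Column sum of quantity times unit price over a hypothetical line-item table
line_items = [{"qty": 2, "price": 3.0}, {"qty": 1, "price": 4.0}]
print(COLSUM(line_items, lambda row: row["qty"] * row["price"]))
```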
Pattern validation
A pattern defines an acceptable format for date, number, or text fields. Reference the tables below for
common examples:
Text
Email
Pattern: ^([a-z0-9_\.-]+)@([\da-z\.-]+)\.([a-z\.]{2,6})$
This regular expression validates emails like john@ado.com. The Email field supports regular
expressions.
Phone Number
Pattern: ^(\(?\+?[0-9]*\)?)?[0-9_\- \(\)]*$
This regular expression validates phone numbers like (+64) 38 3235393. The Phone Number field
supports regular expressions.
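To try the email and phone patterns before adding them as validation rules, you can test them outside the product. The sketch below uses Python's `re` module as a stand-in; it is an assumption that the product's regular expression engine treats these patterns the same way, and the helper name `matches` is hypothetical.

```python
import re

# Patterns copied from the Text examples in this section
EMAIL_PATTERN = re.compile(r"^([a-z0-9_\.-]+)@([\da-z\.-]+)\.([a-z\.]{2,6})$")
PHONE_PATTERN = re.compile(r"^(\(?\+?[0-9]*\)?)?[0-9_\- \(\)]*$")


def matches(pattern, value):
    """Return True when the whole value satisfies the pattern."""
    return pattern.match(value) is not None


email_ok = matches(EMAIL_PATTERN, "john@ado.com")       # sample from the table
email_bad = matches(EMAIL_PATTERN, "john@ado")          # no top-level domain
phone_ok = matches(PHONE_PATTERN, "(+64) 38 3235393")   # sample from the table
phone_bad = matches(PHONE_PATTERN, "abc")               # letters are not allowed
```

Note that the email pattern accepts lowercase letters only, so an address containing uppercase characters does not match unless the pattern is adjusted.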
Number
Prerequisites
To perform this task, you must have one of the following roles:
Open the Document Automation and Automation 360 IQ Bot environments on the same device.
When you connect a learning instance that was created in Automation 360 IQ Bot, the system creates
learning instance assets (RPA bots, AARI process, and form) to enable the learning instance to start
processing documents in Document Automation.
Note:
• This feature is available only for Document Automation customers. It is not available in
Community Edition.
• The Automation 360 IQ Bot environment must be associated with the Control Room where
Document Automation is installed. In addition, Automation 360 IQ Bot must be in a version
that is compatible with the Control Room.
• You can connect one learning instance at a time.
A connected learning instance can extract data from check boxes if your device is configured with the
Microsoft Visual C++ 2019 Redistributable package: Extracting data from check boxes
Procedure
1. Log in to the Control Room as the Admin user. Navigate to Manage > Learning Instances. Click
Connect learning instance.
This opens the Connect Learning Instances from IQ Bot classic page, which contains a list of
Automation 360 IQ Bot learning instances.
2. Select the Automation 360 IQ Bot learning instance to connect. Move it from left to
right. Click Connect.
The newly-connected learning instance appears in the Learning instances table in private mode.
Note: The value in the Provider column indicates whether a learning instance was connected
from Automation 360 IQ Bot or created in Document Automation.
Next steps
Note: You must maintain the Automation 360 IQ Bot server because it stores the
configuration and training data for the connected learning instance.
• A connected learning instance can only be edited in Automation 360 IQ Bot. To edit the learning
instance after connecting it, click the name in the Learning instances table. You are redirected to
the learning instance in Automation 360 IQ Bot, which opens in a new window.
• To upload documents for processing, click Process Documents. Process documents in Document
Automation
A check box is a user interface object that allows you to make a binary choice - select or leave unselected.
Document Automation supports extracting data from check boxes or radio buttons marked with a tick,
cross, or dot. In addition to square check boxes and round radio buttons, Document Automation supports
triangle and rhombus outlines.
Document Automation supports the extraction of group check boxes from connected learning instances.
For more information on group check boxes, see Extract data from check box groups in Automation 360 IQ
Bot.
Note: To enable check box detection, you must install the Microsoft Visual C++ 2019
Redistributable package on the Bot Runner device where the documents are processed. Download
Microsoft Visual C++ Redistributable
The possible output values are Yes , No , or No Checkbox Found . The output format depends on the
format you selected: Change output format from CSV to JSON.
Note: Regardless of the document language, the output values are always in English.
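If you post-process the extracted data, the three check box output values can be mapped to a tri-state result. The following is a minimal sketch in Python; the function name `parse_checkbox_value` is hypothetical and is not part of Document Automation.

```python
from typing import Optional


def parse_checkbox_value(value: str) -> Optional[bool]:
    """Map a check box output value to True, False, or None (no check box)."""
    normalized = value.strip()
    if normalized == "Yes":
        return True
    if normalized == "No":
        return False
    if normalized == "No Checkbox Found":
        return None
    # Any other text is not one of the documented output values
    raise ValueError(f"Unexpected check box value: {value!r}")
```

Because the output values are always in English, the same mapping applies regardless of the document language.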
Prerequisites
For Bot Runner devices with a Windows 2012 server-based operating system:
1. Download and install the Windows Essential Media Pack from Microsoft.
2. Install the Media Foundation feature from the Windows Server Manager.
Procedure
b. Mouse over the vertical ellipses to the right of the process to open the actions menu and click
Check in process.
The Check in Process window opens with the bots and form auto-selected to also be checked
in.
c. In the Check in Process window, add your comment and click Check in.
The process appears in the same folder structure in the public workspace.
See these steps in a video:
2. Build the Extraction-Scheduler bot to automate uploading documents to the learning instance:
Build a bot to upload documents to Document Automation
3. Publish the Extraction-Scheduler bot.
a. Locate the Extraction-Scheduler bot in the private tab.
b. Mouse over the triple ellipses to the right of the bot to open the menu and click Check in Task
bot.
The Check in Task bot page opens with the bot auto-selected.
c. Add a comment and click Check in.
4. Log in as the Document Automation Admin user to deploy the process and bots to unattended Bot
Runner devices: Deploy the learning instance assets
Once the process and Extraction-Scheduler bot are deployed, documents in the specified folder are
uploaded to Document Automation, and Document Automation either extracts data from the documents or
sends the documents for validation.
Next steps
Navigate to Activity > In progress to see the status of the running bots.
Prerequisites
• Ensure that you have an adequate number of Bot Runners connected. For instructions on connecting a Bot
Runner, see step 5 of Set up your Document Automation environment.
Note: You need a dedicated Bot Runner for this bot and additional Bot Runners for the
extraction and download bots, depending on the average number of pages in the
documents. Review the explanation below for more guidance on how many Bot Runners are
needed.
• Decide on the scheduling interval for deploying this bot with a maximum interval of every thirty
minutes.
• Determine the average number of pages in the documents awaiting upload.
This bot is necessary for end-to-end automation of the document extraction process. When deployed, the
bot checks if there are enough Bot Runners to handle the documents, by comparing the number of files
already uploaded against a calculated value. The calculation considers the number of Bot Runners,
scheduling interval, and average number of pages in the documents.
Note: This step is important as it controls the number of documents being uploaded to
ensure smooth processing and avoid overloading the Bot Runners, which might cause a time-out
error.
Use this equation to determine how many Bot Runners are needed:
Number of Documents to Process per day * Average Number of Pages per document / 1440 = Number of Bot Runners
Note: If the equation result is not an integer, you must round up to the next whole number to
obtain an adequate number of Bot Runners. For example, if the equation result is 4.2, you will need
5 Bot Runners.
As an example, if you want to process 10,000 documents each day with an average of one page per
document, you need seven Bot Runners (10,000 * 1 / 1440 is approximately 6.94, which rounds up to 7).
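The Bot Runner equation and its round-up rule can be checked with a few lines of Python. This is an illustrative sketch only; the function name `bot_runners_needed` and the constant name `DIVISOR` are hypothetical.

```python
import math

# Divisor taken directly from the equation in this section
DIVISOR = 1440


def bot_runners_needed(documents_per_day, average_pages):
    """Apply the sizing equation and round up to the next whole number."""
    return math.ceil(documents_per_day * average_pages / DIVISOR)


# 10,000 single-page documents per day: 6.94... rounds up to 7
runners_for_example = bot_runners_needed(10_000, 1)

# A result of 4.2 (6,048 single-page documents per day) rounds up to 5
runners_for_fraction = bot_runners_needed(6_048, 1)
```

Rounding up rather than to the nearest integer ensures that a fractional result never leaves you with fewer Bot Runners than the equation calls for.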
Procedure
File_Count (Number, default value 0): Increments with each loop iteration to count the number of files
uploaded for processing.
Extraction_DeviceCount (Number): Number of Bot Runner devices connected to the Control Room. Enter the
number of connected Bot Runner devices.
Average_Pages: Average number of pages per document. Enter the approximate average number of pages in
the documents.
4. Insert a Loop action to iterate through all the documents in a specific file path.
a. Double-click or drag the Loop action.
b. Select the For each file in folder iterator.
c. Insert the SourcePath variable into the Folder path field.
d. Insert the FilesInFolder variable into the Assign file name and extension to this variable
field.
5. Insert an If action to compare the number of documents uploaded for processing to the number of
Bot Runner devices multiplied by a scheduling interval and divided by the average number of pages.
Note: You must include this equation to limit the number of files uploaded for
processing at one time to ensure that the Bot Runners are able to process the documents.
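The Loop and If logic in steps 4 and 5 can be sketched in Python as follows. This only illustrates the throttling calculation and is not the bot itself. The variable names follow File_Count, Extraction_DeviceCount, and Average_Pages; the `interval_minutes` parameter, the function name, and the choice to stop once the count reaches the limit are assumptions made for illustration.

```python
def select_files_to_upload(files, device_count, interval_minutes, average_pages):
    """Return the files to upload in one run, capped by the calculated limit."""
    # Limit = Bot Runner devices * scheduling interval / average pages per document
    limit = device_count * interval_minutes / average_pages
    file_count = 0  # plays the role of File_Count
    selected = []
    for name in files:  # plays the role of the "For each file in folder" loop
        if file_count >= limit:
            break  # enough files queued for this scheduling interval
        selected.append(name)
        file_count += 1
    return selected


# Hypothetical folder contents: 100 documents waiting for upload
waiting = [f"doc{i}.pdf" for i in range(100)]
batch = select_files_to_upload(waiting, device_count=2, interval_minutes=30, average_pages=3)
```

With two Bot Runner devices, a thirty-minute interval, and three pages per document on average, the limit is 20, so only the first 20 files are picked up in that run and the rest wait for the next scheduled run.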
The completed bot should resemble the one in the image below:
Next steps
Publish the process and bot. See step 3 of Publish the learning instance
Prerequisites
• The user who created the learning instance assets must check in the process, form, and bots to the
public repository. If you have not done so already, complete steps 1 and 2 of Publish the learning
instance.
• To deploy the learning instance assets, you must be an Admin user. See Document Automation users.
Procedure
Note: If you are scheduling the bot to run starting today, you must repeat the
steps two times to schedule the bot as follows:
• Schedule the bot with the desired start time for today, with an end time of
11:59pm on the same day.
• Schedule the bot to start tomorrow at 12:00am, with an end time of 11:59pm
on a future date.
Warning: If you do not follow this guidance, the bot will start
running each day at the time you provided in the Start time field,
instead of being scheduled to run for 24 hours of each day.
• When selecting a Bot Runner, deploy the Extraction-Scheduler bot to a different Bot
Runner than the one running the process.
Note: A Bot Runner can run only one bot at a time. Ensure that you use
separate Bot Runners for the process and the Extraction-Scheduler bot, so
that the Extraction-Scheduler bot does not interfere with the
Extraction and Download bots.
Once the process and Extraction-Scheduler bot are deployed, documents in the specified folder are
uploaded to Document Automation, and Document Automation either extracts data from the documents or
sends the documents for validation.
Next steps
Navigate to Activity > In progress to see the status of the running bots.
When you click the Validate documents link in the Learning instances page, the AARI Task Manager opens
in a new tab, with the first failed document in queue.
Note: An empty page with a notification message appears in the following scenarios:
• If you are not assigned to any validation teams, and thus do not have access to learning
instances.
• If there are no files that require validation in the learning instances to which you have access.
• If another user is working on the remaining file in the validation queue.
Note: You can search for specific documents (tasks) in Detail view only.
Table view: Use the Table view to see learning instance history. When you open a document from this
view, the history for the learning instance that is processing this document appears in a pane on the
left. Click the empty text field, then select the blue-bounded box in the document that contains the
data to extract. After you select a box, the data is highlighted in yellow and a green outline appears,
which you can resize or reposition to include all the data that you want to extract.
2. Filter the documents in your queue
The Validator opens with certain filters enabled by default to show only the documents from a specific
learning instance that need validation.
3. Your documents queue
The documents that match the parameters you specified in the filters pane appear here.
You can switch between documents in the queue without completing validation for the current
document.
4. Customize your view
Use the icons in the taskbar to hide the blue-bounded, system-identified regions, stack the document
pane above the fields pane, or change the zoom levels.
• Fields missing
• Tables missing
• Wrong values
Prerequisites
The Validator consists of two panels with the document on the left side and the fields with extracted
data on the right side.
In a window that is 1280 pixels or more, the panels are positioned side-by-side. Otherwise, the panel
containing the fields and extracted data shifts below the panel containing the document, as demonstrated in
this video:
Note: If a document contains tables with a large number of rows or data fields (>1000), there
might be a delay in the initial validation and the corresponding response time.
Procedure
a. Select Show fields that need validation from the drop-down list in the right panel.
b. Click the field to extract.
c. In the document image, click the blue-bounded box containing the correct data or draw a box
around the data you want to extract.
• To skip a document without correcting its errors, click Skip to proceed to the next
document in the validation queue.
• To remove a document that cannot be processed, click Mark as Invalid.
5. After you make the necessary corrections, click Submit so the document can be processed and the
extracted data downloaded.
The extracted values are saved to a file in the Success folder. The Validator shows the next document
in queue. When all documents are corrected, the system displays a message stating that there are no
more tasks available.
When you export a learning instance, a .dw file is downloaded to your desktop with the learning instance
configurations, including the language, provider, and fields.
To export a learning instance, navigate to Manage > Learning Instances and mouse over the vertical
ellipses to the right of the learning instance you want to export. Then, click Export.
Note: The process and bots associated with the learning instance are exported separately:
Export an AARI process | Export bots
First, import the process and bots associated with the learning instance: Import an AARI process. If you do
not import the process, Document Automation will create a new process when you import the learning
instance.
Note: Verify that the destination environment has the necessary licenses. See Automation 360
licenses.
1. Navigate to Manage > Learning Instances and click Import at the top of the page.
2. Click Browse to select the learning instance.
Note: If there is already a learning instance in this environment with the same ID
number, the imported learning instance overwrites the existing one.
3. Select whether to pair the learning instance with an existing process or to create a new process.
See the following video on the integration of Standard Forms in Document Automation and its enhanced
processing capabilities.
With enhanced accuracy, this feature enables you to extract data from densely structured forms including
those with handwritten input in more than 200 languages. For more information on the supported
languages for text and field extraction and detection, see Language support for Form Recognizer.
The following is an overview of the end-to-end process to create the Standard Forms data model, configure
the learning instance, and publish the learning instance to process documents autonomously, all within the
Document Automation interface:
Note: Ensure you complete the steps to set up your Document Automation environment,
which includes configuring users, roles, and devices, and connecting the Control Room with
Automation Anywhere Robotic Interface: Set up your Document Automation environment
7. Upload documents to the learning instance to test the model, fix validation errors, and verify the
extracted data.
Process documents
8. Build a bot that uploads documents from a source folder to Document Automation.
9. Move the Standard Forms learning instance across different Document Automation environments.
10. Check in the learning instance assets (process, form, and bots) to the public repository. Then, deploy
the process and bots to unattended Bot Runner devices to begin processing documents in real time.
11. Validate the processed documents for the Standard Forms learning instance.
Consider the following parameters to determine if Standard Forms is right for your use case:
Once you have confirmed that the documents are standard forms, you can then plan the type of standard
forms extraction service that fits your requirement.
Document Automation, you can use the Microsoft Form Recognizer service. Before processing, ensure the
following:
Note: The input documents can also contain tables that fit on a single page.
However, if the standard forms contain tables that span multiple pages, the
content extraction can fail.
The model creation tool is used to label and create models. Here, the training documents are uploaded
within the tool and are stored in the Blob storage of the third-party service. You must upload and label a
minimum of 5 training documents to create a model and get the best results.
Blob Storage
The blob storage is a storage unit for training documents and contains all the training-related information.
Extraction models
The trained models stored in the blob storage of the third-party service are fetched through APIs. Here, the
Model ID, which can be a GUID or a standard name, is used for creation.
Creation of LI
You can browse and select the model to create a Learning Instance. Once the model is selected, you can
upload a document for extraction to this Learning Instance. The Learning Instance calls the third-party
extraction service to get extraction results. The results either go to the output folder or the document
lands in the validator.
Extraction Engine
The extraction engine performs extraction based on the model ID and returns the results to the Learning
Instance.
Output results
Once the extraction is completed, the output results will have three components:
• extraction .csv
• auto detected table .csv
• response.json
Document Automation provides you with the option to get your own license key or purchase a license from
Automation Anywhere. For more information on licenses, see Automation 360 licenses.
Microsoft provides the Azure Form Recognizer service. This service enables you to build automated
data processing software that can identify and extract specific content from documents. You must
have a Microsoft Azure subscription to access Azure resources (OCR, Form Recognizer).
Alternatively, Automation Anywhere provides access to these resources through its subscription. Contact
your Customer Success Manager (CSM) or Partner Enablement Manager (PEM), who will help you set up the
services.
Request access for Microsoft Azure resources (OCR and Form Recognizer) (A-People login required)
• If you purchase the license from Automation Anywhere, no additional configurations or changes
are required.
• If you purchase the license from Microsoft Azure, review the following article to set up and
configure your Microsoft Azure Form Recognizer for extracting content from standard forms.
Setup Azure Form Recognizer and Storage (Cloud) (A-People login required)
Next steps
Connect Document Automation with the extraction service. See Configure Document Automation with
Microsoft Azure Form Recognizer.
Request
Request body:
Request
Request body:
Next steps
Log in as the Learning instance creator user to Create standard form in Document Automation.
Prerequisites
• Ensure your Control Room has the Document Automation Standard Forms (Number of pages) product
license
• You must have the AAE_IQ Bot Admin or AAE_IQ Bot Services roles to complete this task
• Identify between five and fifteen training documents
Procedure
1. In the Control Room, navigate to Manage > Learning Instances and click Create model.
The Projects window opens in a new tab.
2. Click Create project.
3. Enter a name for the project, click Browse to upload documents to train the extraction model, and
click Create.
4. Define the field tags:
a. Click Add new tag.
b. Enter a name for the tag. For example, Invoice no. or Invoice Date .
5. Define the table tags:
a. Click Configure table tag.
d. When you are done entering the column names, click Save.
6. Define the location of the data for each field:
a. For the form fields, highlight the location of the data to extract, then click the respective field in
the right panel.
b. For each row in the table, highlight the location of the data to extract, then click the respective
table field on the right.
c. Click Done, then open the next document and repeat the steps to define the locations of the
form and table fields.
7. Click the Train model icon, enter a name for the model, and click Train.
8. Click the Analyze icon, click Browse for file, and select a document to test the extraction model.
Next steps
Note: You cannot edit an extraction model after associating it with a learning instance.
Prerequisites
Procedure
1. From the Control Room home page, navigate to Manage > Learning Instances > Create Learning
Instance.
4. For model selection, click Browse and search for a model using name or description filter.
5. Select your model and click Next.
6. To proceed further, you must select at least one field:
• Table
• Forms
Note: Standard forms support multiple tables. However, Document Automation does
not display multiple tables separately. Instead, the table fields are displayed as
tablename:field name under the Tables tab.
Option Description
Field name Enter a field name that begins with an alphabetical character (A-Z or
a-z).
Option Description
If the data in the field does not match the data type, the document is
sent to the validation queue.
Option Description
Validation rules Depending on data type, create rules using patterns, formulas, lists,
and statements such as starts or ends with.
8. Click Create.
When a new learning instance is created, the Control Room creates a folder with the same name as the
learning instance in the Automation > Document Workspace folder. The folder contains two bots
(extraction and download), a process, and a form.
• Process: Manages the process using if/else scenarios through which Document Automation extracts
data from uploaded documents, assigns documents to users for validation, and downloads the
extracted data. To learn more, see About the AARI process in Document Automation
• Extraction bot: Extracts data from defined fields in the uploaded documents.
• Download bot: Downloads the extracted data to a specific folder on the device or shared network.
• Form: Defines the input parameters that are sent to the process. Input parameters include the learning
instance name, uploaded file, and output file path.
Next steps
Upload documents to the learning instance, fix validation errors, and verify the extracted data: Process
documents in Document Automation
• Ensure that the source and destination systems are running the same Document Automation version.
Learning instances from different Document Automation versions cannot be imported because their
database schema will not match.
• Back up your database before importing learning instances.
• Export the .dw file from the source Document Automation environment.
• Import the .dw file into the destination Document Automation environment.
Note: The incoming model of the source system must be available in the configuration of the
target system.
For detailed information on exporting/importing learning instances, see Export/Import learning instances.