Date post: | 08-Feb-2017 |
Category: | Technology |
Upload: | abbyy-usa |
ABBYY FlexiCapture 11 Feature Review and Recertification
Two Hour Overview
Preparation for FlexiCapture
Recertification Test
© ABBYY Confidential
Upgrade from previous versions
● FlexiCapture 10 network projects that were loaded on server can also be used in FlexiCapture 11.
● Finish processing of all batches in FC10
● Close all opened sessions via Administration and Monitoring Console
● Install FlexiCapture 11
● Upgrade database to necessary version on the Administration and Monitoring Console
● Open the project from Server on the Project Setup Station. It will be converted into FC11 format
● Check if the project and document definition settings are correct
● Continue working with the project.
ABBYY FlexiCapture 11. What Is New
E-mail Update. Import via POP3: SSL Encryption
● Cryptographic protocols are supported to provide communication security
● “Use SSL” option in the import profile settings
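As a rough protocol-level illustration of what a "Use SSL" switch changes, the sketch below uses Python's standard `poplib` module. It is not FlexiCapture code: the function names are hypothetical, and the only point shown is that the encrypted variant uses a different connection class and default port.

```python
import poplib


def pop3_connection_factory(use_ssl: bool):
    """Return the poplib connection class and default port for the chosen mode."""
    if use_ssl:
        # Encrypted POP3 (default port 995)
        return poplib.POP3_SSL, poplib.POP3_SSL_PORT
    # Plain POP3 (default port 110)
    return poplib.POP3, poplib.POP3_PORT


def fetch_message_count(host: str, user: str, password: str, use_ssl: bool = True) -> int:
    """Log in to a mailbox and return the number of waiting messages."""
    conn_class, port = pop3_connection_factory(use_ssl)
    conn = conn_class(host, port)
    try:
        conn.user(user)
        conn.pass_(password)
        count, _mailbox_size = conn.stat()
        return count
    finally:
        conn.quit()
```

`fetch_message_count` needs a reachable mail server, so only `pop3_connection_factory` can be checked offline.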
Import from E-mail POP3: Save Source E-mail
● A source e-mail with images can be attached to a batch
Import from E-mail Registration Parameters: Email Addressee, Email Date
● Batch registration parameters:
● Subject: fc_Predefined:EmailSubject
● Sender: fc_Predefined:EmailSender
● Recipient: NEW! fc_Predefined:EmailAddressee
● Date and Time: NEW! fc_Predefined:EmailDate
● Values are assigned automatically to any batch imported from e-mail
● No need to create them as parameters for the batch type
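The mapping from e-mail headers to the four registration parameters can be pictured with Python's standard `email` package. This is a sketch only: the function name and the sample message are made up, the date is rendered in ISO format as an assumption, and the key casing follows the `fc_Predefined` prefix of the first two parameters.

```python
from email import message_from_string
from email.utils import parsedate_to_datetime


def registration_parameters(raw_email: str) -> dict:
    """Map standard e-mail headers to the predefined registration parameter names."""
    msg = message_from_string(raw_email)
    return {
        "fc_Predefined:EmailSubject": msg["Subject"],
        "fc_Predefined:EmailSender": msg["From"],
        "fc_Predefined:EmailAddressee": msg["To"],
        # The output date format is an assumption made for this sketch
        "fc_Predefined:EmailDate": parsedate_to_datetime(msg["Date"]).isoformat(),
    }


# Hypothetical sample message used only for illustration
SAMPLE = (
    "From: supplier@example.com\n"
    "To: invoices@example.com\n"
    "Subject: Invoice 1001\n"
    "Date: Wed, 08 Feb 2017 10:30:00 +0000\n"
    "\n"
    "See attached scan.\n"
)
params = registration_parameters(SAMPLE)
```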
FlexiCapture Project Setup. Environmental variables (1)
● Parameters that can be shared between different settings and accessed from scripts within the project
● String – contains any text
– Can be used in export settings for data files and image files to specify a path or a file name (file name tag: <Env:Environmental_variable_Name>)
● ADO Connection String – contains a connection string to a database
– Allows easy database connection setup in rules, format or export settings of any document definition by selecting the variable name
● If the export paths or the DB connection change, modify the variables; all the definitions (rules, export destinations) and scripts will then use the updated settings
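The effect of the `<Env:...>` file name tag can be sketched as a simple substitution. The code below is an illustration, not the product's implementation; the function name, the variable name `ExportRoot` and the paths are hypothetical.

```python
import re

# Matches tags of the form <Env:VariableName>
ENV_TAG = re.compile(r"<Env:([^<>]+)>")


def resolve_env_tags(template: str, variables: dict) -> str:
    """Replace every <Env:Name> tag in a path template with the variable's value."""
    def replace(match):
        name = match.group(1)
        if name not in variables:
            raise KeyError(f"Undefined environment variable: {name}")
        return variables[name]

    return ENV_TAG.sub(replace, template)


# Hypothetical project-level variable shared by several export settings
variables = {"ExportRoot": r"D:\Export"}
export_path = resolve_env_tags(r"<Env:ExportRoot>\invoices\out.csv", variables)
```

Changing `ExportRoot` in one place changes every path built from it, which is the benefit the slide describes.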
Project Setup. Environmental variables (2)
● ADO connection string variable can be created
● in the project properties
● in the “Database check” rule settings
ABBYY FlexiCapture 11. Demo dataset setup
Project Properties
Rule Settings
Document Definition. Data Form. Improvements
● Data Form layout in the Document Editor is now the same on rich and web stations
● Data Form was redesigned to increase flexibility:
● All form elements can be aligned vertically and horizontally
● Group instances are displayed as a table now so they take less space on the form
● Descriptive text and field caption can be multi-line (by pressing Shift+Enter)
● and more…
Document Definition. Data Form. Groups of Fields (1)
● Groups of fields can be expanded/collapsed
Group Setup
Verification Window
Document Definition. Data Form. Show Field Picture, Tab Order
● Fields can be displayed with a picture (snapshot) to facilitate visual review of data
● Text, Barcode or Checkmark Group fields
● “Tab Order…” now lets you specify the order in which fields are visited with Tab, F4 and F8 in the document editor
Document Definition. Suggestion of Values at Typing
● When the operator starts typing a value in a blank field, the list of possible values is generated
● Available for FlexiLayout-based documents
● Search is done through the OCR cache for the page
● “Store the whole document text” should be enabled in the definition properties
ABBYY FlexiCapture 11. Demo -> StandardInvoice V11
Document Definition. Data Settings Enhancements
● “Data” tab was reorganized
● Extended content options – general properties of Date, Number, Amount of money
● New option – “Default value”
● New settings for automatic normalization of values
● “Cannot be blank” option was moved from “General” tab to “Data” tab
See the next slides for details
Document Definition. Default Field Value
● Empty value of a field can be substituted with the pre-defined value
● if the field is not filled in after Document Definition matching
● if the field is not filled in after assembly
● in case field region was deleted
Document Definition. Normalization of Field Values
● Normalization settings were added for fields of the Date, Number and Amount of money data types
● Field values can be recognized in various formats and automatically converted to the format required for export
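The idea of accepting several input formats and emitting one export format can be sketched for dates as follows. The format lists are assumptions chosen for the example; in the product they are configured in the field's data settings.

```python
from datetime import datetime

# Formats the sketch is willing to recognize (assumed for illustration)
RECOGNIZED_FORMATS = ["%d.%m.%Y", "%d/%m/%Y", "%Y-%m-%d"]
# Single format required for export (assumed for illustration)
EXPORT_FORMAT = "%Y-%m-%d"


def normalize_date(text: str):
    """Return the date in the export format, or None if no known format matches."""
    for fmt in RECOGNIZED_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).strftime(EXPORT_FORMAT)
        except ValueError:
            continue
    return None
```

A value that matches none of the formats returns `None`, which a real workflow would treat as a format error for the operator to resolve.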
Format for export
Possible formats to recognize
Document Definition. New Settings for Inverted Text Recognition
● Inverted text can be recognized based on the Inversion settings:
● Autodetect – NEW!
● Don’t invert
● Invert
Document Definition. Field Verification Threshold
● Percentage of highlighted characters after which a field will be sent to field verification
● Useful, if a field should be sent to verification only in case it contains many unconfident characters
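A minimal sketch of the threshold logic, assuming each character carries a flag that says whether it was highlighted as uncertain. The function name and the strict "greater than" comparison are assumptions; the source does not say whether the boundary value itself triggers verification.

```python
def needs_field_verification(uncertain_flags, threshold_percent):
    """Decide whether a field goes to verification.

    uncertain_flags: one boolean per character, True if the character is
    highlighted as uncertain. threshold_percent: the configured threshold.
    """
    if not uncertain_flags:
        return False
    uncertain_share = 100.0 * sum(uncertain_flags) / len(uncertain_flags)
    # Assumption: the field is sent only when the share exceeds the threshold
    return uncertain_share > threshold_percent
```

With one uncertain character out of four (25%), a threshold of 30 keeps the field out of verification, while a threshold of 20 sends it there.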
Document Definition. Rules: Conditions of Launch
● The “If this condition is true” option lets you specify a condition that triggers the rule
● A set of possible values can be added manually
● Launch conditions can be used for text fields, barcodes, and table columns
Document Definition. Rules: Chains
● Rule chains are groups of rules applied in a specific order
● Several rules can edit the same field
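Why the order matters when several rules edit the same field can be shown with a small sketch. The rule functions and the `Total` field name are hypothetical, and FlexiCapture rules are configured in the Document Definition rather than written this way.

```python
def strip_spaces(fields):
    """First rule: remove thousands separators written as spaces."""
    fields["Total"] = fields["Total"].replace(" ", "")


def comma_to_dot(fields):
    """Second rule: use a dot as the decimal separator."""
    fields["Total"] = fields["Total"].replace(",", ".")


def run_chain(rules, fields):
    """Apply the rules one after another, in the order given."""
    for rule in rules:
        rule(fields)
    return fields


result = run_chain([strip_spaces, comma_to_dot], {"Total": "1 234,50"})
```

Both rules modify the same field, and each one sees the value left by the previous rule.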
Document Definition. Data Sets
● A table of data that is stored within the FC project for quick and easy access
● Columns and their types
● Can be used for advanced DB check rules (DB lookup)
● Contains data from the external database table or database view
● Can be synchronized with the external source manually or automatically
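The behaviour described above (a local copy of an external table, refreshed on demand and used for lookups) can be sketched with an in-memory SQLite database standing in for the external source. The class, table and column names are hypothetical and do not reflect how FlexiCapture stores data sets internally.

```python
import sqlite3


class DataSet:
    """Local copy of an external table, keyed by one column."""

    def __init__(self, key_column, columns):
        self.key_column = key_column
        self.columns = columns
        self.rows = {}

    def synchronize(self, connection, table):
        """Reload all rows from the external table (names come from trusted config)."""
        column_list = ", ".join(self.columns)
        cursor = connection.execute(f"SELECT {column_list} FROM {table}")
        key_index = self.columns.index(self.key_column)
        self.rows = {row[key_index]: dict(zip(self.columns, row)) for row in cursor}

    def lookup(self, key):
        """Return the cached row for the key, or None if it is not in the copy."""
        return self.rows.get(key)


# Stand-in for the external database
external = sqlite3.connect(":memory:")
external.execute("CREATE TABLE vendors (VendorId TEXT, Name TEXT)")
external.execute("INSERT INTO vendors VALUES ('V001', 'Acme')")

vendors = DataSet("VendorId", ["VendorId", "Name"])
vendors.synchronize(external, "vendors")
```

Rows added to the external table after a synchronization are not visible to `lookup` until `synchronize` runs again, which is why the slide mentions manual or automatic synchronization.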
Document Definition. Database Check Rule: Source Type
● When creating a rule, it is now possible to indicate a Source Type for the database:
● Data Set means that the data will be taken from the internal database structure (see the previous slide)
● External Database (Simple) – a direct connection to the external database
● External Database (Variable) – a direct connection to the external database, with the connection string defined as a variable so that it can be reused in rules or export settings of any document definition
Document Definition. Export on Condition
● At the export step, the required destination can be selected based on the value of a particular field
● Enable the “Field condition” option and provide the conditions:
● Belongs to set (list of values)
● Does not belong to set (list of values)
● Is empty
● Is not empty
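The four condition types can be sketched as a small selection function. The condition identifiers, destination names and sample values are hypothetical; in the product the conditions are chosen in the export destination settings.

```python
def condition_holds(value, condition, value_set=()):
    """Evaluate one of the four field conditions against a field value."""
    is_empty = value is None or value == ""
    if condition == "belongs_to_set":
        return value in value_set
    if condition == "does_not_belong_to_set":
        return value not in value_set
    if condition == "is_empty":
        return is_empty
    if condition == "is_not_empty":
        return not is_empty
    raise ValueError(f"Unknown condition: {condition}")


def choose_destinations(field_value, destinations):
    """Return the names of the export destinations whose condition is met."""
    return [
        d["name"]
        for d in destinations
        if condition_holds(field_value, d["condition"], d.get("values", ()))
    ]


# Hypothetical destinations keyed on a country field
DESTINATIONS = [
    {"name": "eu_database", "condition": "belongs_to_set", "values": ("DE", "FR")},
    {"name": "other_csv", "condition": "does_not_belong_to_set", "values": ("DE", "FR")},
    {"name": "review_folder", "condition": "is_empty"},
]
```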
Document Definition. Export to SharePoint. Typified Columns
● Now it is possible to export fields of a particular type in typified columns in SharePoint
● The following column types are supported:
● Date and time
● Number
● Currency
● Boolean
Administration & Monitoring. Display of License Details
● The current license expiry date and the number of pages left are displayed on the main page of the Administration and Monitoring Console
Administration & Monitoring. Notifications by E-mail
E-mail notifications can be sent to
● Administrator and Monitoring operator, in case of
● Errors
● Requests for permissions
● License expiration, etc.
● Registered users (for FlexiCapture authentication only), in case of
● Successful registration by Administrator
● Password reset
Administration & Monitoring. Automatic Deletion of Hanging Sessions
For sessions opened via API:
● Station sessions left hanging after a station failure are deleted automatically
● License ticket is released
● The default inactivity period is 240 minutes
● It can be changed in the IIS settings on the Server level: InactiveSessionTimeoutMinutes parameter
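The cleanup rule can be sketched as a comparison between each session's last activity time and the timeout. Only the 240-minute default comes from the slide; the function name, the session identifiers and the strict comparison are assumptions.

```python
from datetime import datetime, timedelta

# Default from the slide; configurable via InactiveSessionTimeoutMinutes
INACTIVE_SESSION_TIMEOUT_MINUTES = 240


def expired_sessions(last_activity, now, timeout_minutes=INACTIVE_SESSION_TIMEOUT_MINUTES):
    """Return the IDs of sessions idle for longer than the timeout.

    last_activity maps a session ID to the datetime of its last activity.
    """
    limit = timedelta(minutes=timeout_minutes)
    return [sid for sid, seen in last_activity.items() if now - seen > limit]


now = datetime(2017, 2, 8, 12, 0)
sessions = {
    "station-a": now - timedelta(minutes=241),  # hung after a failure
    "station-b": now - timedelta(minutes=10),   # still active
}
to_delete = expired_sessions(sessions, now)
```

Deleting an expired session is what frees its license ticket for another station.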
Administration & Monitoring
● SQL Server 2014 and Oracle 12 support
● Logging of login and logout events on all stations
● Enhancement of notifications on the Administration and Monitoring Console
● Automatic termination of inactive sessions on operator stations
● Enhanced user group features on the Administration and Monitoring Console
FlexiLayout Studio. Main Window Improvements
Multiple Batches
Editable Properties
New Element – Line of Elements
Log Window
Match on Navigation
Objects Names
Autocorrection Tools
See details on the next slides
FlexiLayout Studio. New Element – Line of Elements
● Line Of Elements (LineGroup)
● Search of subelements of various types positioned in the same line
● Any text in between the objects is allowed
● Vertical shift is allowed
FlexiLayout Studio. New Languages in Date Element
● New languages are supported to search months in words in Date element:
FlexiLayout Studio. Multiple Batches
● A FlexiLayout project can contain multiple batches
● It allows independent work with images of different types or from different sources
● User can work with one batch at a time
ABBYY FlexiCapture 11. Demo
FlexiLayout Studio. Editable Properties Window
● Parameters of the FlexiLayout, blocks, and elements can also be corrected in the Properties window
FlexiLayout Studio. User OCR Dictionaries
● Quality of pre-recognition (OCR) on poor-quality documents can be improved with custom dictionaries
● Used in conjunction with built-in dictionaries
● May be given priority over the words in built-in dictionaries
● Based on file or regular expression
FlexiLayout Studio. General Improvements (1)
1. After a project is created, a properties window opens automatically
2. If the user opens an empty batch, a dialog box for adding pages is displayed
3. Images can be imported from another FLS project with information on their classes and alternative layouts
FlexiLayout Studio. General Improvements (2)
4. Batch window columns can be enabled/disabled
5. Log window is displayed with information about the processes and detailed statistics of matching (can be saved to a text file)
6. FlexiLayouts can be applied automatically when going to the next or previous page or document
7. User-defined Hot-keys
● Tools> Options> Shortcuts
ABBYY FlexiCapture for Invoices. Invoice Processing Project
● Special type of project
● Pre-defined document definition
ABBYY Technologies
● ABBYY develops a wide range of high-tech products and services that are based on document recognition and linguistic technologies (core technologies)
ABBYY FlexiCapture. Overview
CORE TECHNOLOGIES
Barcodes
OCR
FlexiCapture Standalone
● Small businesses
● Up to 50 K pages per month
● 1 user involved in processing
ABBYY FlexiCapture. System Architecture
ABBYY FlexiCapture Distributed. Distribution of Duties
● Large projects and enterprises
● More than 50 K pages per month
FlexiCapture Distributed. Components
● Operator Stations are responsible for document processing routines
● Scanning Station – creating batches and adding images
● Data Verification Station – verifying recognized symbols
● Verification Station – validating assembly errors, project level rules, processing of exceptions
● Automatic Stations
● Processing Station – adding images from the Hot Folder, recognition, exporting
● Servers ensure the operation of the entire system
● Application Server – main functioning
– DB Server – operational data storage
– File Storage – project files storage
● Processing Server – task distribution among processing stations
● License/Protection Server – license control, license key is installed on it
● Setup and monitoring tools:
● Administration and Monitoring Console – connection to database, distribution of rights to the operators, statistics on the system’s functioning
● Project Setup Station – creating and setting up projects
Installation: External Components
On CD:
● MS SQL Server 2005 Express
● Full SQL Server, for production usage
● “Mixed Mode” should be selected (authentication must be turned on)
● .Net Framework 4.0
● 4.5 if running under Windows Server 2012
● Required to install Application Server and Administration and Monitoring Console
● Crystal Reports Basic Runtime for Visual Studio 2008
● Required to print reports in the Administration and Monitoring Console
Windows component:
● Internet Information Server 6, 7, 7.5 or 8
● Required for running the Application Server and Administration and Monitoring Console
● a free Windows component
ABBYY FlexiCapture. System Installation.
IIS Settings for Web and Remote Stations – follow the Admin Guide!
Types of Protection
● USB dongle
● iKey Rainbow or WIBU dongles
● The driver is installed automatically
● Protection key can be transferred to another computer
● FlexiCapture will detect its dongle
– More than one USB dongle can be plugged into the same computer (for different software products)
● One protection key for the entire system (installed on Protection Server)
● Serial number
● Activation is required (on ABBYY activation server)
● Mainly used for trial versions
ABBYY FlexiCapture. Licensing.
FlexiCapture Authentication
● FlexiCapture authentication is enabled by installing a special module
● Module is not installed by default!
● FlexiCapture uses its own user accounts
● User names and encrypted passwords are stored in the FlexiCapture
● Accounts are created
● By Administrator
● By users (self-registration at a web page)
ABBYY FlexiCapture. User Authentication
What is an Operator’s Permission?
● A permission is a set of rights granted to an operator to
● Start a station of a specific type
● Complete the processing stage
● Work with batches of certain type
● Verify certain data fields
● When using a distributed installation, it is important to have flexible control over the operators, and to grant correct permissions
● Single operator may have multiple permissions
ABBYY FlexiCapture. User Permissions
System Components and Standard Permissions
Administrator
Senior Verification Operator
Verification Operator
Data Verification Operator
Monitoring Operator
Project Settings Editor
Scanning Operator
Processing Server
User Profile Setup
● User permissions are maintained by the Administrator using the Administration and Monitoring Console
Composition of Document Definitions
● To define the principles of identification and processing of a particular type of document, you need to create a Document Definition
● It contains
● Document identifiers
● Marking data fields
● Validation rules
● Verification options
● Document assembly settings
● Export settings
ABBYY FlexiCapture. Creating Document Definitions
Document Definition Editor
● Sample images for layout (blank for forms)
● Fields regions
● List of fields
● Toolbars
● Data Form
● Fields’ and Definition properties
The Document Definition Editor is accessed from the Project Setup Station.
Main Steps of Document Definition Creation
1. Creation Wizard
● Images to load into the definition editor (may be more than one image)
● Technology for recognition
● Fields marking tool
2. Fields creation
3. Recognition settings
4. Validation Rules, Data types, and Data Verification settings
5. Assembly and export settings
6. Definition testing
For more information on definition creation, see the other Document Definition Creation modules
Data Type Definition
● Data type is an important setting for increasing the recognition quality and automatic data normalization and validation
ABBYY FlexiCapture. Improving Recognition Quality
[Figure: Recognized Document, with Type of Content prompts: Text - ?, Date - ?, Currency - ?]
Creating a Document Definition for a Structured Document: Static Elements
● Black squares are the recommended type of anchor:
● Document Definitions are applied fast and reliably
● Linear distortions of scanned pages can be corrected
● If the form has no black squares, you can use other static elements to facilitate Document Definition matching
Creating a Document Definition for a Structured Document: Static Elements
If your document was designed without using black squares, you can use static elements of other types for definition matching
Anchor Text
Separator
Barcode
Creating a Document Definition for a Structured Document: Identifiers
● Ensure the matching of the correct Document Definition when several forms have similar (the same) anchors.
● Anchor text
● Barcode (recommended identifier)
Anchor Text: Identifier
● Anchor text can be used as an identifier
● Select reliable text of good quality and in a large font
● Specify the maximum allowed number of errors in the text to tolerate misprints or noise on the images
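One common way to model "maximum allowed number of errors" is edit distance between the recognized text and the expected anchor text. The sketch below takes that approach as an assumption; the source does not say how FlexiCapture counts errors, and the function names are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum insertions, deletions and substitutions."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def anchor_matches(recognized: str, anchor: str, max_errors: int) -> bool:
    """Accept the anchor when the recognized text is within the error budget."""
    return edit_distance(recognized.upper(), anchor.upper()) <= max_errors
```

A typical OCR confusion such as "INV0ICE" for "INVOICE" costs one error, so a budget of 1 still identifies the document, while unrelated text is rejected.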
Text Field: Marking Type
Important:
● Incorrect settings for “Marking type” and “cells” may adversely affect the quality of recognition!
ABBYY FlexiCapture. Recognition Fields Creation
Simple
Underlined
Simple comb
Text in frame
Comb in frame
Partitioned frame
Char box series
Gray boxes
Document Confidence Level
● The ratio of reliably recognized characters to the total number of characters in the document
● Percentage is also shown for each section/page
● By default, unreliably (uncertainly) recognized characters require data verification
● Depends on the field verification properties
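The ratio can be written out directly. In this sketch each page is given as a pair of counts (reliably recognized characters, total characters); the function names and the sample counts are hypothetical.

```python
def confidence_percent(reliable_chars: int, total_chars: int):
    """Share of reliably recognized characters, as a percentage."""
    if total_chars == 0:
        # Assumption: no characters means no meaningful confidence value
        return None
    return 100.0 * reliable_chars / total_chars


def document_confidence(pages):
    """Document-level confidence from per-page (reliable, total) counts."""
    reliable = sum(r for r, _ in pages)
    total = sum(t for _, t in pages)
    return confidence_percent(reliable, total)


# Two pages: 95 of 100 and 40 of 50 characters recognized reliably
pages = [(95, 100), (40, 50)]
per_page = [confidence_percent(r, t) for r, t in pages]
overall = document_confidence(pages)
```

The document value is computed from the pooled character counts, so it is not simply the average of the page percentages.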
ABBYY FlexiCapture. Document Verification
Work vs Test Batches
● Documents are processed in work batches (using the published versions of the document definitions)
● Work batches list is available at
● Project Setup Station
● Verification Station
● Test batches are used to test published or unpublished document definitions
● Available at Project Setup Station only
Note
● If work batches were created or edited on the Project Setup Station, these batches or changes can only be sent to the server queues from the Verification Station
ABBYY FlexiCapture. Project Setup Overview
Document Definitions List
● After creation the Document Definition can be edited
● To use a Document Definition for document recognition, you must publish it
● Unpublished document definitions can be tested on test batches in the project window
Applying Document Definitions
● You can set the order in which Document Definitions should be applied
● You can also use the first acceptable FlexiLayout (and stop applying the subsequent definitions)
● The order of application can be redefined in the batch type settings
Testing a Document Definition in the Document Definition Editor
● You can easily test your Document Definition settings without quitting the editor
You can test the quality of recognition and validation rules on the image displayed in the Document Definition Editor
Types of Rules
● Pre-defined rules
● Calculate Sum
● Check Sum
● Compare Fields
● Database Check
● Merge Fields
● Custom script rules
ABBYY FlexiCapture. Validation Rules
Script Rule
● Performs document validation and fields normalization based on the values of one or more fields
● Supported languages
● In the event of error
● set an error flag
● add error description
● specify focus field
● propose replacement variants
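The four outcomes listed above (error flag, description, focus field, replacement variants) can be pictured as the fields of a result object. The sketch below is in Python purely for illustration: the slide does not list the supported script languages, and `RuleResult`, `check_total` and the field names are hypothetical rather than part of the FlexiCapture scripting interface.

```python
from dataclasses import dataclass, field


@dataclass
class RuleResult:
    """What a script rule can report back when validation fails."""
    has_error: bool = False
    error_message: str = ""
    focus_field: str = ""
    suggestions: list = field(default_factory=list)


def check_total(fields: dict) -> RuleResult:
    """Validate that Total equals Net plus Tax (hypothetical invoice fields)."""
    result = RuleResult()
    expected = round(fields["Net"] + fields["Tax"], 2)
    if round(fields["Total"], 2) != expected:
        result.has_error = True                      # set an error flag
        result.error_message = "Total does not equal Net + Tax"
        result.focus_field = "Total"                 # where the operator should look
        result.suggestions.append(f"{expected:.2f}")  # proposed replacement
    return result


bad = check_total({"Net": 100.0, "Tax": 20.0, "Total": 125.0})
good = check_total({"Net": 100.0, "Tax": 20.0, "Total": 120.0})
```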
ABBYY FlexiCapture. Validation Rules. Advanced
Script Rule Settings
● The fields linked by the rule must be added to the list of Available fields
● Read only option must be unchecked if the field value is modified by the rule
● Alias Names can be used to shorten the fields’ names in script code
Note: Script rules violations are corrected in the Verification module.
“General” Field Properties
Export field value – whether the value should be exported
Read only – whether modification of the field by the verification operator is allowed
Show in data form – whether the field is shown in the verification window
Cannot be blank – whether a value must be entered (a format error appears otherwise)
Index field – whether the field can be used as an index (for export file naming, access from scripts, etc.)
Should be matched – whether the field should be found in the document (the error “Unmatched region” appears otherwise)
Dual Monitor Support for Verification
● You can undock the Image window and drag it to the second monitor.
● Layout> Image window > Undocked
Document structure checking
● Imported pages are classified and assembled into documents automatically!
● Operator checks assembly to make sure no pages have been mixed up or missed:
● Number and order of pages in a document
● Repetitions of some page or group of pages
● And so on
For full information about structure creation, see the Document Definition Creation module
For full information about structure verification tools, see the Assembly Verification module
Standard Data Types: Amount of Money, Address, Name, Code
Set of built-in dictionaries
How to correct a set of pages
● Switch to “Thumbnails view”
● Change the order of the pages
● Simply drag and drop the pages to the right place with the mouse (either within the same document or between different documents)
● Delete redundant pages
● Scan/add missing pages
● Create a new document
● Add to the current document
Data Verification Task Management
At this stage batches are always divided into tasks
● Step 1 – Get a task from the queue
● Step 2 – Data verification
● If an error occurs – send the task to the Senior Verification Operator as an exception (Task > Send Task to Exception)
● To cancel the task and return it to the queue – Reject task
● Step 3 – Sending the task to the next queue
After completion, the task is automatically forwarded to the next queue!
ABBYY FlexiCapture. Verification. Task Management
Verification Task Management
• To perform a task, select the desired queue and click Get Task button
• After finishing processing a task, click Close Task button to send it to the next stage
• To cancel a task either before or while doing it click Reject task button
• To save the temporary results without returning the task into the queue and resume it later click Postpone task button
• To send a task to the Exceptions queue if there is not enough information to verify and correct the data click Mark for sending to exception button
• To send a task to rescanning if the recognition result is too low click Mark for sending to rescanning button
Sending Documents to
• To send a document of a task to the Exceptions queue or to rescanning as a separate task, mark it: select the document(s), then select Mark Documents for > Sending to Exception\Rescanning
• To add more document(s) to one of the previous exception\rescanning tasks, select Add Documents to Marked…
• To return the document, select Unmark Documents
• View Marked Documents Groups…
Image Sources for Batch creation
● Scanning of paper documents
● Scanning Station
– Add images with a scanner using pre-defined settings (scanning batch type)
– Import images from folder using pre-defined settings (scanning batch type)
● Automatic import of image files - Hot Folder
● Automatic images loading
– FTP
– Folders
– Mailbox
ABBYY FlexiCapture. Import profiles
Hot Folder Feature
● Hot Folder is a storage of images to be checked by the system
● Images found in the folder are automatically imported into a FlexiCapture batch
● Several import profiles can be created (from different storages, for different batch types, etc.)
● After import, batches are automatically queued for recognition
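The polling behaviour of a hot folder can be sketched as one scan step that picks up image files not seen before. The extension list, function name and the "already imported" bookkeeping are assumptions for this example; the product's import profile handles these details itself.

```python
from pathlib import Path

# File types treated as images in this sketch (an assumption)
IMAGE_EXTENSIONS = {".tif", ".tiff", ".jpg", ".jpeg", ".png", ".pdf"}


def collect_new_images(hot_folder, already_imported):
    """Return image files in the folder that have not been imported yet.

    already_imported is a set of file names; it is updated in place so that
    the next scan skips the files returned now.
    """
    found = []
    for path in sorted(Path(hot_folder).iterdir()):
        is_image = path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS
        if is_image and path.name not in already_imported:
            found.append(path)
            already_imported.add(path.name)
    return found
```

Calling the function repeatedly on a timer gives the "check the folder, import what is new" loop; each returned file would then be added to a batch and queued for recognition.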
Import Settings
● Project Setup Station -> Image Import Profiles
● Hot folder Setup
● Scanning profile (testing settings only)
Created on the Project Setup Station by the Project Settings Editor or Administrator
Creating a Hot Folder Import Profile
● Hot Folder image source:
● FTP
● Folder
● Mailboxes
– MAPI
– POP3
Hot Folder Import Profile: Folder
● Local or LAN folder
● For LAN folder
1. Create a folder
– Give “Modify” permissions to the account used by the processing station
• Default account – Network Service (selected during installation)
2. When creating a Hot Folder Import Profile, specify the network path to your folder
Workflows
● Workflow defines steps of batch processing
● Simple mode
● Using standard verification stages
● Advanced mode
● Modification of standard stages
● Creation of custom stages
● Unattended mode
● No operator involvement is required (batches are sent from Recognition stage to Export directly)
● For Web Capture station mode
● For web station: batch creation and verification steps are done at Web Capture station only
ABBYY FlexiCapture. Batch Types
See more details in “Advanced Workflow” module
Processing Station Tasks
● Processing Stations work on tasks marked as acceptable in the station settings (Processing Server Monitor):
● Document import (processing of Hot Folders)
● Batch recognition
● Data export
● Verification support
● Custom processing
ABBYY FlexiCapture. Processing Server and Processing Stations
Administration and Monitoring Console Capabilities
● The Administration and Monitoring Console is used to
● configure the Application Server
● set up storage and database
● set user permissions
● control processing
● generate reports
ABBYY FlexiCapture. Administration. Monitoring
ABBYY FlexiCapture Web Stations
● FlexiCapture makes it possible to work with operator tasks in web interfaces
● Processing documents from any place without system installation
● Silverlight is used for web stations
● Supported stations
● Web Scanning Station
● Web Data Verification Station
● Web Verification Station
● Web Capture Station
● User registration module allows user self-registration
ABBYY FlexiCapture. Web Stations
Limiting the Number of Pages
● A license may limit the number of pages that can be processed
● The page counter can be either absolute (cannot be renewed) or renewable (renews monthly, quarterly, or, most typically, yearly on January 1st)
● Decrementing the page counter
– All the pages added to the batch are counted
• Therefore, all processed pages are counted, whether recognized or not.
• The only exception is separator pages which are automatically deleted from the batch
– The page counter decrements for both test batches and work batches
● Renewing the page counter
– A page counter can be renewed monthly, quarterly, or yearly (on the 1st calendar day of each month/quarter/year)
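The renewal dates follow from ordinary calendar arithmetic. The sketch below computes the next renewal date for each period; it assumes calendar quarters starting in January, April, July and October, and the function name and period labels are hypothetical.

```python
from datetime import date


def next_renewal(today: date, period: str) -> date:
    """First calendar day of the next month, quarter or year after today."""
    if period == "monthly":
        if today.month == 12:
            year, month = today.year + 1, 1
        else:
            year, month = today.year, today.month + 1
    elif period == "quarterly":
        # Assumption: quarters start in January, April, July and October
        quarter_start = 3 * ((today.month - 1) // 3) + 1
        next_start = quarter_start + 3
        if next_start > 12:
            year, month = today.year + 1, 1
        else:
            year, month = today.year, next_start
    elif period == "yearly":
        year, month = today.year + 1, 1
    else:
        raise ValueError(f"Unknown renewal period: {period}")
    return date(year, month, 1)
```

For 8 February 2017 this gives 1 March 2017 (monthly), 1 April 2017 (quarterly) and 1 January 2018 (yearly).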
Limiting the Number of Stations
● Station types are specified in licenses
● Standalone Installation
● “Operator.” Allows running an Operator Station.
● “Administrator.” Allows running an Administrator Station.
● Distributed Installation
● Scanning Station
● Data Verification Station
● Verification Station
● Project Setup Station
● Number of Administration and Monitoring Consoles is unlimited
● The number of running Processing Stations is limited by the number of CPU cores allowed
● Concurrent licensing of Scanning Stations and Verification Stations
● The number of Scanning Stations or Verification Stations in the FlexiCapture license is the total number of stations that can run simultaneously, both local and remote.
Creating a FlexiLayout
● Preparation:
1. Creating a FlexiLayout project
2. Creating a batch of sample images
3. Pre-recognition of images
4. Analyzing pre-recognition results
5. Creating FlexiLayout blocks (future fields to be captured in FlexiCapture)
● Layout description:
6. Creating identification elements
7. Creating header/footer elements for multi-page documents assembly
8. Creating elements to detect the required data positions
9. Setting source elements for blocks
● Exporting the FlexiLayout
10. Exporting project to a description file *.afl
ABBYY FlexiLayout Studio. FlexiLayout Creation Steps
Matching and debugging
Elements Types - Simple
● Static Text – single- or multi-line text with known contents
● Character String – single-line text with unknown contents
● Paragraph - single- or multi-line text with unknown contents
● Date – a line in date format
● Currency – a line in currency format
● Phone – a line in phone number format
● Barcode – a barcode
● Separator – a horizontal or vertical black line
● White Gap – an empty rectangle with almost no objects
● Object Collection – objects of pre-defined types
● Region – an area on the image, regardless of objects contained in it
ABBYY FlexiLayout Studio. Elements. Search By Format
Elements Types – Complex and Composite
● Complex
● Labeled Field – a field with a header
● Table – a simple two-dimensional table
● Composite
● Group – a combination of several elements, which reduces the number of hypotheses
● First Found – groups several elements that find the same object
● Repeating Group – a repeatable element group
● Header – beginning of the multi-page document
● Footer – end of the multi-page document
automatically when creating new project
Barcode
● Barcode type
● Several variants can be selected
● Orientation
● Several variants can be selected
Object Collection
● Type
● Type of searched objects
● Object size
● The object’s length and height in dots
● For each of the objects
Labeled Field
● Used to search for fields with keywords (labels) nearby
● Sub-elements:
● Label – field header
● Gap – space between label and value
● Field - field value
● Expand to Group
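The label, gap and field structure has a simple text analogue: find the label, skip a short run of separator characters, and take what follows as the value. The regular-expression sketch below is only that analogue; FlexiLayout Studio works with the positions of objects on the image, and the function name, separator set and sample line are assumptions.

```python
import re


def find_labeled_value(line: str, label: str, max_gap: int = 10):
    """Return the token that follows the label within max_gap separator characters."""
    pattern = re.compile(
        re.escape(label)                         # Label: the field header
        + r"[\s:.#-]{0," + str(max_gap) + r"}"   # Gap: spaces and punctuation
        + r"(\S+)",                              # Field: the value itself
        re.IGNORECASE,
    )
    match = pattern.search(line)
    return match.group(1) if match else None


# Hypothetical recognized text line
line = "Invoice No: 12345  Date 08.02.2017"
invoice_number = find_labeled_value(line, "Invoice No")
invoice_date = find_labeled_value(line, "Date")
```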
Blocks
● Block area can be set on the basis of
● Source file region
● Any area on the image which does not coincide with any of the element regions
● Such area can be described in the FlexiLayout Language in the Expression field
● Predefined variables that can be used in Expression field
● OutputRegion
● OutputTable
● OutputInstances
● IsNull
It depends on block type
ABBYY FlexiLayout Studio. FlexiLayout Language, Basic concepts.
Block Types
● Text – text data
● Table – table data
● Barcode – barcode
● Checkmark – checkmark
● Checkmark group – checkmark group, can include only checkmark blocks
● Picture – non-textual objects
● Non-Recognized – exclusion of areas from the recognition process
● Group – logical block grouping
● Repeating group – block group allowing blocks with repeating instances
Setting Blocks Locations
● Defining the location
● For layout – layout alternative indicating the image area
● Source element – area matching the element’s region
● Expression – an expression in the FlexiLayout language that defines the block location using the areas of found elements
● Has repeating instances – the block consists of multiple instances
● Instance sort order – the order in which group instances are merged into a block
FlexiLayout Content
● Identifiers of document type
● Objects for automatic multi-page assembly (first page and last page detection)
● List of Fields to be captured from the document (“Blocks” in terms of FLS)
● Elements that describe objects necessary to locate the fields
● Nearby objects – keywords, separators, white space, pictures, etc.
● Elements that describe desired fields’ data (by format, location, relations to other objects, etc.)
ABBYY FlexiLayout Studio. Overview
Approaches to Creating FlexiLayouts (1): Autocreation
● Autocreation (training on samples)
● Suitable for – detecting several fields whose locations vary little, or for almost fixed documents
● Advantages
– Easy to use
– Elements and blocks are created automatically
– The trained FlexiLayout project can be edited
● Method – a special training mode: regions of fields and keywords are drawn manually on several test documents
Autocreation Usage Cases
● FlexiLayout Autocreation helps to automatically create simple one-page flexible descriptions for
● Semi-structured documents with shifted fields (fixed-like structure)
● Semi-structured/unstructured documents with context-sensitive fields
● FlexiLayout Autocreation can be used as a first step of creating a complex FlexiLayout
● Helps to automatically create elements for fields with a predictable location and format
● After automatic creation, the FlexiLayout can be adjusted:
● New elements such as tables and additional blocks can be added
● Created elements and blocks can be deleted or edited
ABBYY FlexiLayout Studio. FlexiLayout Autocreation.
Blocks and Elements
Elements = «milestones» for template application
Blocks = future FlexiCapture fields
Table element creation steps
● The Table block
● Table name
● Column names and types
● The Table element
● Link with the corresponding Table block
● Each column: header and content
● Order of columns
● Header search parameters (top border)
● Footer search parameters (bottom border)
● Division into rows
● Search area and additional search conditions
● The Table block – location
ABBYY FlexiLayout Studio. Table Detection
Table Header
● The top border of the table body
● Has an auxiliary function, no data is extracted
● Headers contain column names
● Headers are searched according to column parameters
Table Footer
● The bottom border of the table body
● Has an auxiliary function, no data is extracted
● Search:
● By keywords and keyword combinations – Search by keyword
● An element of any type – Use found element as footer
Table Rows
● Automatic row detection:
● By black separators and white spaces in all columns
● Text line = table row
● By rows of the Master column:
● Multiple selection is possible
● Text alignment in fields
● Text line = table row
Note: the Master column should contain one line of text per row
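Row division by a Master column can be sketched as follows. This is a simplified illustration under stated assumptions, not the Studio's actual algorithm: each text line in the Master column is represented only by its top coordinate, and a row is assumed to run from one Master line to the next. The names split_rows_by_master and row_index are hypothetical.

```python
from bisect import bisect_right


def split_rows_by_master(master_tops, table_bottom):
    """Each Master-column text line starts a row; a row ends where the next begins.

    master_tops: top Y coordinates of the text lines found in the Master column
    table_bottom: Y coordinate of the bottom border of the table body
    Returns a list of (row_top, row_bottom) pairs.
    """
    tops = sorted(master_tops)
    return list(zip(tops, tops[1:] + [table_bottom]))


def row_index(rows, y):
    """Index of the row whose vertical span contains y, or None if outside the table."""
    starts = [top for top, _ in rows]
    i = bisect_right(starts, y) - 1
    if i < 0 or y >= rows[i][1]:
        return None
    return i


# Usage: three Master lines produce three rows; a text line from another
# column at y=175 falls into the second row (index 1).
rows = split_rows_by_master([100, 160, 220], table_bottom=300)
second_row = row_index(rows, 175)
```

The sketch also shows why the note above matters: if a Master cell contained two lines of text, the second line would start a spurious extra row.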
Multipage Documents: Document Assembly
● If there is a Header, it will be used to detect the first page
● If there is a required Footer, it will be used to detect the last page
● If there is no Header, but there is:
● A required Footer, then it will be used (the next pages will belong to the next document)
● An optional Footer:
- if found, it will be used
- if not found, the top page limit will be used (multi-page parameters)
● If there is neither a Header nor a Footer, the top page limit will be used (multi-page parameters)
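The assembly rules above can be sketched as a small function. This is a simplified model, not the product's implementation. Assumptions: each page is reduced to two flags (Header found, Footer found), required and optional Footers are treated the same way, and a document without a found Footer is closed by the next Header or by the page limit. The name assemble and its parameters are hypothetical.

```python
def assemble(pages, use_header, use_footer, max_pages):
    """Group page indices into documents.

    pages: list of (header_found, footer_found) booleans, one pair per page
    use_header / use_footer: whether the FlexiLayout defines a Header / Footer
    max_pages: the top page limit from the multi-page parameters
    """
    documents, current = [], []
    for index, (header_found, footer_found) in enumerate(pages):
        # A found Header marks the first page of a new document.
        if use_header and header_found and current:
            documents.append(current)
            current = []
        current.append(index)
        # A found Footer marks the last page; otherwise the page limit applies.
        if (use_footer and footer_found) or len(current) >= max_pages:
            documents.append(current)
            current = []
    if current:
        documents.append(current)
    return documents


# Usage: five pages with Headers on pages 0 and 3 and Footers on pages 2 and 4.
flags = [(True, False), (False, False), (False, True), (True, False), (False, True)]
docs = assemble(flags, use_header=True, use_footer=True, max_pages=10)
```

With neither a Header nor a Footer defined, only max_pages splits the stream, which matches the last rule in the list.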
ABBYY FlexiLayout Studio. Multipage Documents Overview.
Exporting a FlexiLayout
● After the FlexiLayout is created, the project is exported to a description file (*.afl)
● The description file is added to a Document Definition in the FlexiCapture project
FlexiCapture 11 Certification
● Must attend the ABBYY FlexiCapture Online Class
● 60 Multiple Choice Questions (60 Possible Points)
● 1 FlexiCapture Project (40 Possible Points)
● 1 Fixed Form Document Definition
● 1 FlexiLayout Document Definition
● Passing score is 80% or above
FlexiCapture 11 Recertification
● Must be already certified in FlexiCapture version 10 or earlier
● 60 Multiple Choice Questions (60 Possible Points)
● Passing score is 48/60 (80%) or above