The Read OCR/Barcodes action is used to queue up the documents in the active Record for OCR 

This is normally used as an Assignment Action with the trigger assigned to the doMgt_Agent user. That way the workflow stops here and waits for the OCR to be done before moving on. Since OCR can take some time it is best to ensure the OCR is done before moving on in workflow - especially if the next steps use the OCR data. 


Options


    • OCR Mode
      • Read Full Text Only
        • Performs OCR on the documents
      • Read Full Text and Barcodes
        • Performs OCR on and reads barcodes from the documents
      • Read Barcodes Only
        • Reads barcodes from the documents
      • Non-OCR Read for Office documents
        • Extracts plain text from office documents. This can help with searching but will not enable click indexing or highlighting.
      • Recognize Invoice Data
        • Performs OCR using machine learning which automatically finds all invoice-oriented data.
      • Recognize Receipt Data
        • Performs OCR using machine learning which automatically finds all receipt-oriented data.
      • Recognize Identity Data
        • Performs OCR using machine learning which automatically finds all identity-oriented data. This includes US driver's licenses and US passports.
      • Recognize Business Card Data
        • Performs OCR using machine learning which automatically finds all business card-oriented data.
      • Recognize W2 Data
        • Performs OCR using machine learning which automatically finds all W2-oriented data.
      • Recognize General Forms
        • Performs OCR using machine learning which automatically finds all name/value data from form documents. This will find labels on the forms and extract the values that go along with those labels.


    • Read Barcodes - This option is available with all the Machine Learning modes. Turning this option ON will also read all the barcodes while the Machine Learning is happening.



NOTE: Invoice Data, Receipt Data, Identity Data, Business Card Data, W2 Data, and General Forms all incur extra fees. Check with your sales rep for more information!


Advanced


    • Documents to OCR - Specify the Category / SubCategory / Name of the Documents that you wish to have OCR'd. All documents that match will be OCR'd. Leave blank for ALL.


    • First N Pages - You can tell the system to only perform the OCR operation on the first pages. Specify how many pages to process from the beginning here.


    • Last N Pages - You can tell the system to only perform the OCR operation on the last pages. Specify how many pages to process from the end here.


    • OCR Job Tag - (Enterprise Only) You can use this option to tag your OCR jobs with billing codes, etc. They are available in the OCR reporting.



Barcodes

Specify which barcodes to look for. You can choose any or all but BE AWARE that the more barcodes you enable, the slower the processing. The slowest barcodes to read are Data Matrix, PDF417, PDF417 Compact and QR Code. 


Barcode Technologies

Here are links to information about some of the more common barcode types:


Extracted Fields

The following OCR modes will extract data and place them into data fields with defined names:

Recognize Invoice Data

Master Values: Invoice_Master_Vendor, Invoice_Master_InvoiceNumber, Invoice_Master_InvoiceDate, Invoice_Master_Amount, Invoice_Master_DueDate, Invoice_Master_PONumber,Invoice_Master_VendorAddress, Invoice_Master_VendorAddressFirst, Invoice_Master_Terms


Other Values: Invoice_AmountDue, Invoice_BillingAddress, Invoice_BillingAddressRecipient, Invoice_CustomerAddress, Invoice_CustomerAddressRecipient, Invoice_CustomerId, Invoice_CustomerName, Invoice_DueDate, Invoice_InvoiceDate, Invoice_InvoiceId, Invoice_InvoiceTotal, Invoice_InvoiceTerms, Invoice_PreviousUnpaidBalance, Invoice_PurchaseOrder, Invoice_RemittanceAddress, Invoice_RemittanceAddressRecipient, Invoice_ServiceAddress, Invoice_ServiceAddressRecipient, Invoice_ServiceEndDate, Invoice_ServiceStartDate, Invoice_ShippingAddress, Invoice_ShippingAddressRecipient, Invoice_SubTotal, Invoice_TotalTax, Invoice_VendorAddress, Invoice_VendorAddressRecipient, Invoice_VendorName


Line Item Values: InvoiceLine_ProductCode, InvoiceLine_Description, InvoiceLine_UnitPrice, InvoiceLine_Quantity, InvoiceLine_Amount


* The AUTO-SPLIT action must be run on the invoice document(s) to apply these values to the Record data. Before then, the data is stored in the pages only!!


Recognize Receipt Data

Receipt Values: Receipt_MerchantName, Receipt_MerchantAddress, Receipt_MerchantPhoneNumber, Receipt_TransactionDate, Receipt_TransactionTime, Receipt_Subtotal, Receipt_Tax, Receipt_Total, Receipt_Tip


Recognize Identity Data (Licenses, Passports)

Receipt Values: ID_CountryRegion, ID_DateOfBirth, ID_DateOfExpiration, ID_DocumentNumber, ID_EyeColor, ID_FirstName, ID_LastName, ID_Nationality, ID_PlaceOfBirth, ID_PlaceOfIssue, ID_Sex, ID_Height, ID_Weight, ID_MachineReadableZone


* ID_MachineReadableZone = barcode data from ID


Recognize Business Cards

Receipt Values: BC_ContactName, BC_ContactName_#, BC_FirstName, BC_FirstName_#, BC_LastName, BC_LastName_#, BC_CompanyName, BC_CompanyName_#, BC_Department, BC_Department_#, BC_JobTitle, BC_JobTitle_#, BC_Email, BC_Email_#, BC_Website, BC_Website_#, BC_Address, BC_Address_#, BC_MobilePhone, BC_MobilePhone_#, BC_Fax, BC_Fax_#, BC_WorkPhone, BC_WorkPhone_#, BC_OtherPhone, BC_OtherPhone_#


* All fields have the possibility for multiple values. If you only care about the first or main value then you can use the singular version - i.e. "BC_FirstName". If you wish to get all the values you can check the BC_FirstName_# value by replacing the # with the number of the item you want - i.e. "BC_FirstName_1" for the first one, BC_FirstName_2" for the second and so on.


Recognize W2 Data

W2 Values: W2_AllocatedTips, W2_ControlNumber, W2_DependentCareBenefits, W2_Employee, W2_EmployeeSSN, W2_EmployeeAddress, W2_Employer, W2_EmployerID, W2_EmployerAddress, W2_FederalIncomeTaxWithheld, W2_MedicareTaxWithheld, W2_MedicareWagesAndTips, W2_SocialSecurityTaxWithheld, W2_SocialSecurityTips, W2_SocialSecurityWages, W2_TaxYear, W2_W2Copy, W2_W2FormVariant, W2_WagesTipsAndOtherCompensation


Recognize General Forms

General Forms will place values into the Record using "FORM_<fieldName>" syntax. So if a form field is labeled "NAME", the value will be placed in to a field Record field named "FORM_NAME". It is usually best to use the [FORM()] variable to extract these as it supports wildcards for name matching.





Advanced Tip

The OCR Engine is built to use specific record values as commands. If those values exist it will trigger different behaviors. Here is a break down of the special OCR codes:


    • @OCRFirstNPages - If this value is set to a number then the OCR engine will only OCR the first N pages of each document.
    • @OCRLastNPages - If this value is set to a number then the OCR engine will only OCR the last N pages of each document.
    • @OCRMode - This will instruct the OCR what type of OCR to perform:
      • Empty or record value does not exist = Full Text OCR and Barcodes
      • OCR = Full Text OCR only
      • Barcode = Barcode only
      • Simple = Content read only - no actual OCR. This will only work with text-based documents, DOCX documents, XLSX documents,m and PDF documents. This does NOT extract the location of the words so no highlighting or click indexing will work with this mode.
      • Receipt = Invoice Recognition Mode
      • Invoice = Invoice Recognition Mode
      • LineItems = Invoice Recognition Mode with Line Items