Search

Document Processing-FAQs

Last Updated: Sep 13, 2021

Articles

What kind of document file types can be processed by Doc Reader Node?
PDF Documents can be processed by the Doc Reader Node.
You can use the DOCX To PDF and Excel To PDF functions to convert the required documents to PDF and process them using Doc Reader Node.
How do I re-train a document that has already been approved?
To know more about Re-familiarization, click here.
I processed an Invoice, and all fields were correctly extracted. When I processed another Invoice of the same template, the fields were auto-populated in the table and the status is EXTRACTED_SUCCESSFULLY. But PO Number is blank, and I do not have an option to re-familiarize the document. How can this be corrected? I notice that in the earlier document label for PO Number is PO No: and in the current document it is PO N0:
You cannot re-familiarize a document with the status EXTRACTED_SUCCESSFULLY. Ensure that validations for the required fields are designed in the Post Processing task.
If the validations are not met, the status of the record is updated to MANUAL_INTERVENTION_VALIDATION_FAILED. Then you can re-familiarize the document. You can add all the possible labels for PO Number as Pseudonyms so that the field is extracted for any of those labels.
In this scenario, validation should be designed in the Post Processing task to check PO Number is Not Null. Both PO No: and PO N0: should be added to the Pseudonym List.
How can I check the results of the Post Processing task that was triggered by the Document processing task?
The Output tab of the Result of Execution window displays the Run ID of the Post Processing task. Note the Run ID and check the results on the Execution History page.
Do I have an option to not train the document if a wrong template was received?
Yes, you can skip and choose the option of Manual Review.
When I clicked on the Update Data button, the table that is populated in the right panel has unwanted rows in the end. How do I correct this?
This happens if the End of the table is not familiarized. Select the End of the table option.
When I clicked on the Update Data button, the table that is populated in the right panel does not display the required columns. How do I correct this?
You can familiarize the required columns from the left panel again and Click the Update Data button. To know more about Familiarizing Inline table, click here.
When I clicked on the Update Data button, the table that is populated in the right panel does not display the records correctly as the rows across the columns in the table are not aligned. Some columns have two or three lines of data, others have one. How do I handle this?
You can handle this by using the Row and Column Definition option.
What tables can be used as dictionary tables in the document functions that use Dictionary Item?
Jiffy tables and Datatables can be used as dictionary tables.
In the document functions that use Dictionary Item, in what order are the records in the table traversed for the search?
The records are traversed and picked for the search in the same order as they were added to the table. To know more about this, refer to the Sample in the Find Occurrence Of Dictionary Item function.
What image formats are supported in the Extract Image function?
The supported image formats in the Extract Image are TIFF, JPG, GIF, PNG, and JPEG.
Where are the output files saved after the functions are executed?
The path of output files is automatically assigned by the function itself. Check the value of the variable to get the path of the output files.
Navigate to the Step Data tab in the Result of Execution window. Click on the icon to view the value of the variable.
To know more about how to use Document Functions, click here.
What are the names of the output files, if any returned by the document functions?
The output file names are auto-assigned by the function.
I have multiple tables in the document and I want to extract a particular one. How can I do that?
You can use Data Capture Rule option to select the table.
I am getting the error “Please select reference label which is above the table headers” while using Data Capture Rule?
When extracting the table, table below the reference label set in Data Capture Rule is extracted. So you must always select the reference label above the table header.

See Also

Did you find what you were looking for?