Multi Invoice Document Process Using AI - Updated Version
Multi Invoice Document Process Using AI - Updated Version
AI/ML
Document types like invoices, receipts, ACORD forms, and others are often combined
into a single PDF file, with some documents spanning multiple pages. The process of
handling and extracting data from such PDFs is known as Multi-Invoice Document
Processing.
Digitization of Raw PDF: First, we will digitize a single raw PDF containing
multiple document types. Some of these document types may span multiple
pages.
The image below illustrates a single PDF that contains multiple invoices:
Intelligent/Keyword-Based Classification: The document will first be classified
using an intelligent or keyword-based classifier. If the confidence level for any
document type falls below a certain threshold, the document will be routed to the
action centre for further classification.
Classifier Training Approach: To improve accuracy, the classifier will be
trained with individual PDFs that contain only one document type—regardless of
whether the document spans a single page or multiple pages. This focused
approach will enhance the classifier’s ability to accurately process and categorize
documents.
The image below illustrates a single PDF containing multiple document
types, which has been routed to the action center for further classification:
Splitting the Raw PDF: Based on the classification results array and the
document bounds property, the raw PDF will be split into multiple PDFs using
activities from the PDF package.
The Extract PDF Page Range activity will be leveraged to separate the raw
PDF, which contains multiple document types, into individual, categorized PDFs.
You can set the range as shown in the snippet below. The current item type is
UiPath.DocumentProcessing.Contracts.Results.ClassificationResult, which is
provided for reference and may vary depending on your scenario:
Next, we can process the individual split PDFs using the Document
Understanding template provided by UiPath. This involves incorporating the
Machine Learning Extractor, which can be retrained and fine-tuned later to
improve model performance and accuracy using a classic approach.
Depending on the business scenario, you may also choose to adopt the modern
approach for enhanced flexibility and efficiency.