Processing PDF files embedding multiple invoices

Steven Beauchem 25 Reputation points
2024-11-01T14:24:38.7166667+00:00
  1. In my industry, we frequently deal with single invoices that span multiple pages (anywhere from 1 to 20 pages).
  2. We have built a Document Intelligence model that successfully extracts the desired data from this "single invoice" pattern.
  3. However, our business partners often send us PDF files that each contain several of these multi-page invoices.

What is the recommended Azure/Power Platform-based tooling / pipeline to decompose these multi-invoice PDF files so that they can be processed by a "single invoice" Document Intelligence model?

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

Accepted answer
  1. VasaviLankipalle-MSFT 17,646 Reputation points
    2024-11-06T02:57:59.89+00:00

    Hello @Steven Beauchem, I'm glad that you were able to resolve your issue, and thank you for posting your solution so that others experiencing the same thing can easily reference it. Since the Microsoft Q&A community has a policy that "The question author cannot accept their own answer. They can only accept answers by others", I'll repost your solution in case you'd like to "Accept" the answer.

    Issue: Processing PDF files embedding multiple invoices

    Solution: The right approach is to build a Logic Apps pipeline that uses a classification model to identify "multi-invoice" PDF files and runs them through a sub-flow that extracts the individual invoices into separate PDF files, which are then processed by the custom extraction model.
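
    The splitting step in that sub-flow can be illustrated with a minimal Python sketch. It is not the original poster's Logic Apps implementation. It assumes some earlier step (for example, a classification model) has already reported the page on which each invoice starts, and it only computes the page range for each output file. The function name `invoice_page_ranges` and its parameters are hypothetical.

```python
from typing import Iterable, List, Tuple


def invoice_page_ranges(first_pages: Iterable[int], total_pages: int) -> List[Tuple[int, int]]:
    """Return inclusive, 1-based (start, end) page ranges, one per invoice.

    first_pages: pages on which a new invoice begins (assumed to come from
                 an upstream classification step; hypothetical input shape).
    total_pages: number of pages in the combined PDF.
    """
    if total_pages < 1:
        raise ValueError("total_pages must be at least 1")

    # Page 1 always begins an invoice, even if the upstream step omitted it.
    starts = sorted(set(first_pages) | {1})
    if starts[0] < 1 or starts[-1] > total_pages:
        raise ValueError("start pages must lie within 1..total_pages")

    # Each invoice ends on the page before the next one starts;
    # the last invoice runs to the end of the file.
    ends = [s - 1 for s in starts[1:]] + [total_pages]
    return list(zip(starts, ends))


if __name__ == "__main__":
    # A 12-page file in which invoices start on pages 1, 4, and 9.
    for start, end in invoice_page_ranges([1, 4, 9], 12):
        print(f"invoice pages {start}-{end}")
```

    Each resulting range would then be written to its own PDF file (with a PDF library or a connector of your choice) and sent to the "single invoice" extraction model.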

    If you have any other questions or are still running into more issues, please let me know.

    Thank you again for your time and patience throughout this issue.

    Regards,
    Vasavi

    Please remember to "Accept Answer" if any answer/reply helped, so that others in the community facing similar issues can easily find the solution.

