How can I extract both images and tables as separate sections from a PDF with Azure?

Andrii Batutin 5 Reputation points
2024-01-29T10:38:23.87+00:00

Hi.

I need to get from PDF both

  • tables
  • images (like block diagrams, charts, schematics)

as separate sections

For images i need to get bounding box of the image

Also i use Azure Document Intelligence and it detect text that belongs to images, i need instead of that detect that there is a image on page, and not to have separate sections for text from images

Is there a way to do it with Azure? With Azure Vision?

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,714 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,902 questions
{count} vote

2 answers

Sort by: Most helpful
  1. Sedat SALMAN 14,065 Reputation points MVP
    2024-01-29T12:45:49.77+00:00

    Form Recognizer is a very powerful that can help you you can view and review the following article it will helo you how to build a solution for your question https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/extract-data-from-pdfs-using-form-recognizer-with-code-or/ba-p/2214299

    0 comments No comments

  2. Deleted

    This answer has been deleted due to a violation of our Code of Conduct. The answer was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.


    Comments have been turned off. Learn more

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.