Can Document Intelligence SDK extract the bytes out of DocumentFigure objects?

Asmond Noah 0 Reputation points
2024-12-02T06:35:59.58+00:00

I am building a system that processes a large amount of inconsistent and unstructured pdf documents. In this process, I need to extract the figure bytes and save it in Azure Blob. However the Doc Intelligence was only able to extract the bounding regions of said figures. Is there any way or built-in functions in the SDK which I can extract the bytes?

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,767 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,955 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Shikha Ghildiyal 770 Reputation points Microsoft Employee
    2024-12-02T06:54:28.24+00:00

    Hi @Asmond Noah

    Thanks for reaching out to Microsoft Q&A.

    Unfortunately, there is no option to extract the byes. Please find the reference document that talks about what all can be extracted-

    https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/concept/analyze-document-response?view=doc-intel-4.0.0

    Please do not forget to "Accept the answer” and “up-vote” wherever the information provided helps you, this can be beneficial to other community members.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.