How can I enable users to label PDF documents directly in my application without using Document Intelligence Studio?

Arun 0 Reputation points
2024-11-20T09:27:47.6766667+00:00

I want to implement the labeling process within my application, so users don’t need to visit Document Intelligence Studio. My goal is to generate the required <filename>.pdf.labels.json and fields.json files directly in my app. How can I achieve this?

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,746 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. David Singh 0 Reputation points
    2024-11-20T10:15:15.8266667+00:00

    You can build a custom interface that allows users to upload and view PDFs, select text or regions, and assign labels. Use tools like PDF.js to render PDFs and capture user annotations, such as bounding boxes or text selections. Save these annotations as JSON files (<filename>.pdf.labels.json and fields.json) that match the format required by Azure AI Document Intelligence. This way, users can label PDFs without leaving your app, and you can use these files for training or inference directly.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.