Azure AI document intelligence prebuilt-Layout model cannot extract "role" with docx files
I am using Azure Document Intelligence SDK (Version 1.0.0) with the prebuilt-layout model to extract paragraph roles from documents with docx type(it works well with PDFs). However, I am not receiving any roles for the paragraphs, even though the…
Azure document intelligence 3.1 to 4.0 output comparision
We have upgraded from 3.1 to 4.0 custom extraction model. We would like to compare results between older 3.1 and latest 4.0 version's output. Any solution for this?
Getting InternalServerError trying to make an api call to document intelligent (document v3.0) container
Error: - Azure.RequestFailedException: 'An unexpected error occurred. Status: 200 (OK) ErrorCode: InternalServerError Additional Information: AdditionInformation: InternalServerError: An unexpected error occurred. Details: HttpRequestException: Could not…
upload_blob and download_blob takes 2 seconds to execute which increases execution time of our API
This is how we are creating a pdf on azure blob storage and reading the file from Azure blob storage upload_blob and download_blob takes 2 seconds to execute which increases execution time of our API i tried async but it didnt reduced time. func tools…
Issues with Microsoft Syntex Document Processing Model: Incomplete Extraction for Multi-Page PDFs
I'm facing several challenges with the Microsoft Syntex document processing model, particularly when dealing with multi-page PDFs and large tables. I'd appreciate any insights or suggestions from fellow users or Microsoft experts who may have encountered…
Azure Document Intelligence Behavior and Interface Questions
A few questions regarding Azure Document Intelligence: To confirm, is https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/prebuilt/layout?view=doc-intel-4.0.0&tabs=rest%2Csample-code the only main documentation site? It would…
Issue labeling docs words in two paragraph
Good morning. I have a problem training a custom model using Document Intelligence. For example, I need to select the city, but it's split between two paragraphs. If I draw a region over "Buenos" only, it only allows me to select another…
System Message not working when working with Data Source in Azure AI Open Service
System Message not working when working with Data Source in Azure AI Open Service I have tried chat GPT4o-mini, GPT4o Turbo models. below is my use case, I have Search index, indexer, DataSource in Cognitive Search service, i create a deployment model in…
For a Batch Run with Figures Output how to download the Figures
For azure AI document intelligence, when I batch run with begin_analyze_batch_documents() using the python SDK, I would like to download the figures for each of the results in a batch, is there a way to do that? I see then method…
Document Intelligence Model Internal Server Error
We are using AP Automation solution and facing issues reading an invoice that is 476 pages and 918 KB. Azure supports 2000 pages, I also tested on another Document Intelligence instance and had same issue. I then split to first two pages and it processed…
AKS/Docker: I am processing PDF's as large as 5,000 to 10,000 pages for OCR. I'm trying to run the pages concurrently in groups of 50 but running into a situtation where the memory goes to 10GB for a 200MB file.
I am processing PDF's as large as 5,000 to 10,000 pages for OCR with 200 MB files. I'm trying to run the pages concurrently in groups of 50 but running into a situtation where the memory goes to 10GB for a 200MB file. I've tried cleaning up the…
Document Intelligence API Version v4.0 Container Enquiry
I am reaching out to inquire if Azure's Document Intelligence version v4.0 is available as a containerized Docker image for use. Additionally, I would like to know if there are any plans for its availability in the future if it's not yet released in this…
Can I move a custom extraction model to a subscription belonging to a different organization?
Hi, I need to train a custom extraction model to detect signatures. If I train it on a subscription within one organization, is it possible to move only the model to a subscription in another organization? Thank you for your help
azure document-intelligence or Form-Recognizer package
Dear all, I've just realized that for my existing system I am using form-recognizer package in Python. However, azure-ai-documentintelligence seems to be the right approach for the future. Do you know if there is any plan that the form-recognizer package…
Need some details about the Azure Custom Classifier
Hi all, So I have been using up azure custom classifier for a while now and i have few doubts regarding a bunch of things Training the model using API - Have been uploading documents using console but uploading and labelling bulk documents via console…
Is there an issue with the custom classifier's confidence scores? When testing documents, the confidence consistently returns as either 0.0 or 99.9.
I’ve been using Azure’s Custom Classification model from the Azure Document Intelligence Studio, and it was working successfully for my use case. My process involved training individual document classes and then uploading bulk documents to classify them…
Azure AI document intelligence - Format preserve
I have used Azure AI document intelligence Read, layout APIs and got the output as a extracted text + Spatial information. But I want to preserve the format of the input image using extracted text. Approach 1: It would be great help if you share…
How to batch create a searchable pdf using azure document intelligence python API
Hi All, I have stored some PDFs in Azure Blob storage and I am trying to batch OCR these documents while also creating a searchable PDF. Unfortunately at the moment I'm getting a file with the following extension "filename.pdf.ocr.json" which…
Document Intelligence Studio - 'Custom Model project could not be found' error when training labeled documents
Hello, I'm getting 'The Custom Model project could not be found' error when I tried to train a custom model after labeling PDF files in Document Intelligence Studio. What could be causing this? Could it be related to permissions? I currently have…
Document Intelligence Studio | deploy / import models with CI/CD
I am looking to deploy a Document Intelligence Studio resource and I wish to be able to import a pre-trained Custom Extraction Model into the newly deployed resource as part of an automated CI/CD pipeline. How would I do this? Thanks!