Why does the latest version of the Microsoft Document Intelligence API (2024-11-30) seem moderately worse than the previous version (the July preview)?
I have created a program that utilizes the JSON results generated using the document intelligence model API (using the receipt model) for business related purposes. For several months I was using the July preview version of the API and was very impressed…
A regression in OCR quality and ability to find fields in REST API 2024-11-30 (GA) for receipts/invoices?
We use Document intelligence to OCR and pull out relevant information from receipts and invoices. We are currently on the API 2024-07-31-preview but saw there was a new GA release in December (API 2024-11-30 (GA)) so we tested it out. There is a small…
Running a composed model with only one extraction algorithm
I have two document types that I want to create a composed model from. One of the document types is null/irrelevant/"other" data which I don't want extracted. The other document type contains relevant information that I want to run through an…
Could not build the model: Total number of pages in the dataset is 1013. The maximum number of pages allowed in a dataset is 1000
Error: (Depicted Below) Could not build the model: Total number of pages in the dataset is 1013. The maximum number of pages allowed in a dataset is 1000 Question: why this happing ?
How to fix request timeout issue with large pdfs in Document Intelligence API?
Hello guys, I am getting a request timeout issue with the document intelligence API request. The catch is the timeout doesn't occur when I extract the details using the Document Intelligence Studio. I am requesting to my custom extraction model. const…
Run Analysis Custom extraction model
Hello, when we train a custom extraction model we can hit the run analysis button in the document intelligence UI to generate the ocr file of the pdf, which we can then download in the azure blob storage manager. is there a possiblity to generate…
Azure Document Intelligence, The requested model was not found.
I'm not the one who created the training models, and I just get access to the azure portal as developer to work on a task In Document Intelligence Studio -> settings , when I select resource for Service I can see a list of models but when I switch…
Azure Document Intelligence : custom classification model training issue
Hi Team, @ Subject: Issue Training Azure Custom Classification Model via code We are facing an issue while training a custom classification model programmatically. Below is the detailed context of the setup and the issue we are encountering: Setup…
AI builder vs Document intelligence
I have recently been looking into the AI builder (specifically the models that can do form recognition like prebuilt models like invoices for example) and Document Intelligence (previously form recognizer), which also has prebuilt and custom models. My…
Why after training, "Test" only looked at 2 pages of my test document, instead of looking at the entire document?
In Azure AI Document Intelligence Studio, I am using Custom Extraction Model. As a trustee from a board, my goal is to be able to analyze Check Journals in PDF Format. The check journals are 10 to 20 pages in table format of check number, date, code,…
The compose model is not working in ai foundary portal
I build the model TestModel1 with field TotalAsset and build the another filed TestModel2 with field TotalLibilites when I compose the model (ComposeTestModel1AndTestModel2) of TestModel1 and TestModel2 then it trained on only one TotalLibilites …
Custom Neural model for Containers
when the custom neural model for containers will be available. At 3.1 it still isn't Grazie
Pricing of Composed Models
How does the pricing for composed models work? Are we paying for each classification performed and each extraction performed? What if I have a document type called "other" that has irrelevant information, and I don't want to run extraction on…
Unstable Table Layout with Merged Column Headers
Hello Everyone, We are currently using Azure Document Intelligence to extract table data from PDF documents. By default, we use the layout model for data extraction. However, we've encountered some issues with specific documents, particularly when: …
ResourceNotFoundError: (NotFound) Figures analyze result does not exist.
I'm trying to extract figure from a PDF document. Below is the code that I'm running: import os from dotenv import load_dotenv import base64 import json # Azure packages to import from azure.core.credentials import AzureKeyCredential from…
Can I get image related information on Azure's Document Intelligence Studio?
Out of the models supported by Document Intelligence Studio, is there a model that gives out image information? Is there an option of using Custom model along with the existing one?
Azure Document Intelligence unable to read XFA PDFs
I am running into an issue where the Azure AI Document Intelligence interface is unable to read certain PDF files and displays "To view the full contents of this document, you need a later version of the PDF viewer...." I believe this is…
Add subscript/superscript handling to Azure DocumentAI
Hi, I am using Azure DocAI layout model and I really like its high-quality results. One thing I am recently struggling with is the handling of subscript and superscript characters. I am using PDF input documents (not scneed), though I guess this doesn't…
Internal Server Error when calling Azure.AI.DocumentIntelligence.AnalyzeDocumentAsync with latest SDK
We have been successfully using the previous Azure.AI.FormRecognizer SDK v4.1 to call prebuilt models in Azure Document Intelligence. Since we now want to use some of the latest prebuilt models (like prebuilt-tax.us), we now have to upgrade to the newly…
OCR is reading watermark letters
We are using Form recognizer for extracting data from these types of ID's. however these ID's have a watermark (not visible on this sample image) which are getting picked. How do we avoid that from happening as it is impacting the accuracy.