PDF Extraction to SQL Server table
Hi Expert, in the blob structure, I am connecting to an Azure database, and when I mapped the JSON I am getting status, page, and bounding text columns. I want the table column headers and data to map to SQL Server table columns. Here is my code …
Is any AI module smart enough to tell me if a table spanning multiple pages is the same
Is any AI module smart enough to tell me if a table spanning multiple pages is part of a single table? I have a few use cases: A table spans to a new page and the table header also exists on the new page. A table spans to a new page but the header is…
Tables Extraction Using Custom Extraction Model (Merged Rows and Row Splitting Issues)
Hello, I am working on extracting tables from PDF documents using a custom extraction model. The documents vary: some contain tables, others have paragraphs, and one is a scanned PDF. On average, each PDF has about 16 pages, with tables typically found…
Confidence value for table fields
Hello. Is it possible to get a confidence value for table-type fields returned by the Form Recognizer analyze call? For now we can get confidence values for regular key/value fields only. Is it planned to add this feature in upcoming releases?
Document Intelligence Studio - Failed to access Blob container
We are facing this issue when loading projects in Document Intelligence Studio. The strange thing is that the error appears randomly, not every time. The CORS settings for the storage account are set according to the documentation.
I'm receiving a "401 Access Denied due to invalid subscription key or wrong API endpoint" message when trying to run the Form Recognizer Studio in a container?
I used the following link to learn how to run and configure containers for Azure AI Document Intelligence. Specifically, I created the requisite files and folders needed for the Custom container, then pulled and ran the images using docker compose as…
Process gets stuck at the "Training" stage
Hi, I'm trying to train a custom model for document classification in Azure AI Document Intelligence but the training process gets stuck at the "Training" stage. I've followed the doc and my data is correctly labeled and formatted. My Azure…
Extracting specific table from a document using AI module
A document, especially an earnings report, will have different tables in the document, e.g. https://report.adidas-group.com/2023/en/_assets/downloads/cons-financial-statements-adidas-ar23.pdf. I am only interested in a specific table, "Consolidated…
Azure Document Intelligence - General Internal Server Error
I am using custom classification in Azure Document Intelligence. While trying to train the model, I am facing "General Internal Server Error" for more than a week. If anyone is aware of the solution, please share the solution. Thanks
Run layout doesn't recognize text
When running layout, it doesn't recognize the 2 values below: How to fix this? I tried to resave the file as PDF. Any workaround? Thanks
Cannot share any version 2024-11-30 custom model projects (works for previous projects of previous version)
Hi, in the Azure Document Intelligence Studio, I can't share any projects with API version 2024-11-30, but it works fine with the previous version, as you can see: Any ideas on how to fix this?
form-recognizer studio: Document Intelligence Service endpoint configuration
NOTE: I did ask this question before, but the other post leads to a 404?! Please close the first post and not this one. Thanks There is little information online for the form-recognizer studio YML configuration. When we spin up the studio and the…
DocumentIntelligence: UnicodeDecodeError While Batch OCR Local PDFs
Hello, I have been trying to use Azure AI's DocumentIntelligence to OCR about 1,000 locally-stored PDF files. I mostly have been following guidance by @Anonymous on this question:…
The "PageBreak" in Azure AI Document Intelligence v4
This is a follow-up discussion on adding the real page number to the markdown output. With Azure AI Document Intelligence v4, the output contains "<!-- PageBreak -->" for each page; we can probably replace this flag with the real…
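The replacement idea in this post could be sketched roughly as follows. This is only an illustration, assuming the markdown output separates pages with the exact string "<!-- PageBreak -->" quoted above; the function name `number_pages` and the `<!-- Page N -->` label format are hypothetical choices, not part of the service output.

```python
PAGE_BREAK = "<!-- PageBreak -->"  # marker quoted in the post


def number_pages(markdown: str, first_page: int = 1) -> str:
    """Split on the page-break marker and prefix each page with its number.

    The "<!-- Page N -->" label is a hypothetical format chosen for this sketch.
    """
    pages = markdown.split(PAGE_BREAK)
    labelled = []
    for offset, page in enumerate(pages):
        # Strip surrounding whitespace left behind by removing the marker.
        labelled.append(f"<!-- Page {first_page + offset} -->\n{page.strip()}")
    return "\n\n".join(labelled)


if __name__ == "__main__":
    sample = "First page text\n<!-- PageBreak -->\nSecond page text"
    print(number_pages(sample))
```

The `first_page` parameter is there in case only a page range of the document was analyzed, so numbering can start from the real page rather than 1.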
TrainingHoursLimitNeural
Hi, my team and I are upgrading and training a bunch of new models with the 4.0 release, but we've hit a roadblock. We're on an S0 tier paid service but are getting this error message: TrainingHoursLimitNeural Account cannot train more than 10 hours of…
Custom generative extraction-Endpoint
Hi All, I am using the preview version of custom generative extraction with Azure AI Document Intelligence. There was an API available that used this endpoint as part of the 2024-07-31 release, but it is no longer in the documentation. Even…
I need your advice on how to extract information from a PDF on smart document and then use this information in Power Automate.
Hello, Thank you in advance for your answers. I have a document that is 60 to 150 pages long and I would like to extract information from only 5 pages (accounting ledger with account numbers). I used Document Intelligence on my document and I specified…
Inaccurate OCR value for date field
We are using custom models for the form recognizer service, and the most annoying issue we are facing is the date field. We tried different formats, and it would not read an accurate date from the date field. There were multiple issues while reading the…
Document Intelligence: Sharing a project using token does not work for version GA 4.0
Hi all, Up till now we were able to share projects between co-workers using the 'share' button. Since we updated the models to GA4.0 in document intelligence, this button is greyed out. What could be the reason for this? Is it expected this…
How to get multiple occurrences back from a document with a custom model, and how to avoid irrelevant data present at the position where the field was trained in the custom model
Hi Team, we are planning to move from the general document model to a custom model. We have 2 questions: 1) We trained the model, but if the same field occurs twice in a document, Document Intelligence returns only the first occurrence from the entire document. Can…