READ OCR Bounding Box Accuracy

Baruch Shadrouz 31 Reputation points
2021-03-08T15:59:38.083+00:00

I am using the basic Code for the READ OCR that Microsoft gives in their getting started guide. The only modification is that I am running that program with multiprocessing with 4 cores so I am making multiple calls to the API at the same time.

The READ OCR sometimes gives me a Bounding Box of a word that is either too large or too small for that word. You can see in the screenshot below that the word "40.00" was completely found but the bounding box is too small and does not cover the entire word. The same issue happens with the word "Current". There is cases where the bounding box is too large as well.
75542-image.png

Any help would be appreciated!

Azure Computer Vision
Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.
379 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,897 questions
{count} vote

6 answers

Sort by: Most helpful
  1. Eike Thies 0 Reputation points
    2024-09-11T12:11:14.76+00:00

    it is 2024 and this problem is still there. we have it on thousands of documents. what is even more strange is that in the document intelligence studio on hover you can see the correct values displayed. it also correctly highlights the words. but the results in the json via api or in the "result" tab in studio are slightly off

    grafik

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.