Azure OCR Line Concatenation Issue Leading to Incorrect Text Recognition
Hello,
We have consistently seen Azure OCR return JSON in which a single line of text combines words from two separate lines of the document. The behavior is unpredictable, so we can neither anticipate it nor reliably correct it downstream.
For example, an image taken from a PDF scan contains three lines of text in three different languages. On one occasion Azure OCR returned "FRACHTBRIEF SAMOCHODY" as a single line in the JSON; on another occasion it returned "CONSIGNEMENT SAMOCHODOWY".
Is there a way to configure how the lines are concatenated?
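To make the question concrete, here is a minimal sketch of the kind of client-side regrouping that could serve as a fallback if no such setting exists. It ignores the line grouping returned by the service and rebuilds lines from word-level bounding boxes. The input shape (a list of dicts with "text" and an eight-number "box" of corner coordinates) is a simplified, hypothetical stand-in for the real response, the function names are made up for illustration, and the sketch assumes roughly horizontal, unrotated text.

```python
from statistics import mean


def word_metrics(word):
    """Return (left edge, vertical center, height) for one word.

    Assumes word["box"] is [x1, y1, x2, y2, x3, y3, x4, y4]; this is a
    hypothetical, simplified shape, not the exact service schema.
    """
    box = word["box"]
    xs, ys = box[0::2], box[1::2]
    return min(xs), mean(ys), max(ys) - min(ys)


def regroup_lines(words, tolerance=0.5):
    """Rebuild text lines from words using only their geometry.

    A word joins the current line when its vertical center lies within
    `tolerance` times the line's average word height of the line's
    average vertical center; otherwise it starts a new line.
    """
    items = sorted(
        (word_metrics(w) + (w["text"],) for w in words),
        key=lambda item: item[1],  # top to bottom by vertical center
    )
    lines = []
    for left, center_y, height, text in items:
        if lines:
            current = lines[-1]
            line_center = mean(item[1] for item in current)
            line_height = mean(item[2] for item in current)
            if abs(center_y - line_center) <= tolerance * line_height:
                current.append((left, center_y, height, text))
                continue
        lines.append([(left, center_y, height, text)])
    # Within each line, order words left to right before joining.
    return [
        " ".join(item[3] for item in sorted(line, key=lambda item: item[0]))
        for line in lines
    ]


if __name__ == "__main__":
    # Made-up coordinates, deliberately out of reading order.
    sample = [
        {"text": "beta", "box": [120, 10, 200, 10, 200, 30, 120, 30]},
        {"text": "gamma", "box": [10, 42, 90, 42, 90, 62, 10, 62]},
        {"text": "alpha", "box": [10, 12, 100, 12, 100, 32, 10, 32]},
        {"text": "delta", "box": [110, 40, 190, 40, 190, 60, 110, 60]},
    ]
    print(regroup_lines(sample))
```

The `tolerance` value is a guess and would need tuning per document type. This does not change what the service returns; it only post-processes the word positions.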