OCR is reading watermark letters

Vikas Mathad 0 Reputation points
2023-06-09T11:02:37.01+00:00

IC Sample2

We are using Form Recognizer to extract data from these types of IDs. However, the IDs have a watermark (not visible in this sample image), and its letters are being picked up by the OCR. How do we prevent that from happening? It is hurting accuracy.

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

1 answer

  1. VasaviLankipalle-MSFT 18,391 Reputation points
    2023-06-09T22:10:07.9666667+00:00

    Hi @Vikas Mathad, thanks for using the Microsoft Q&A platform.

    Sorry for the inconvenience. Yes, this is a known issue. I don't think watermark removal is supported yet in Form Recognizer.

    My suggestion: if the watermark is not preventing the OCR from reading the values, it may not be necessary to remove it. However, if it is causing issues with data extraction, please explore external tools for removing it.
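As a rough illustration of what such external preprocessing might look like, here is a minimal sketch. It is not a Form Recognizer feature. It assumes the image has already been converted to grayscale (0 = black, 255 = white) and that the watermark is noticeably lighter than the printed text. The function name `suppress_watermark` and the default `threshold` value are hypothetical and would need tuning for real ID scans.

```python
# Sketch: suppress a faint watermark before OCR by whitening light pixels.
# Assumption: grayscale input (0 = black, 255 = white) where the watermark
# is lighter than the printed text. Names and the cutoff are hypothetical.

def suppress_watermark(gray, threshold=120):
    """Return a copy of `gray` with pixels lighter than `threshold` set to white.

    `gray` is a list of rows, each a list of ints in 0..255.
    Pixels at or below `threshold` (the darker text) are left unchanged.
    """
    return [
        [255 if pixel > threshold else pixel for pixel in row]
        for row in gray
    ]


# Toy 3x4 "image": dark text (30), faint watermark (180), white background (255).
sample = [
    [255, 30, 180, 255],
    [180, 30, 30, 180],
    [255, 180, 30, 255],
]
cleaned = suppress_watermark(sample)
```

With a real scan, the same idea would normally be applied through an image library's thresholding operation, and the cleaned image would then be sent to the service. If the watermark is as dark as the text, a simple cutoff like this will not separate them.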

    This feature is planned, but it does not look like it will be available any time soon, so we do not have a workaround at this moment. Thanks for understanding.

    I hope this helps.

    Regards,
    Vasavi

    - Please accept the answer and vote 'yes' if you found it helpful, to support the community. Thanks.

