@santoshkc Yes, I have had a chance to check the answers above. For the time being I will continue to use the preview version, and in the meantime I hope the newer model irons out whatever issues it is having, so that the transition is much smoother on my end when I eventually have to switch to the newer API version.
Why does the latest version of the Microsoft Document Intelligence API (2024-11-30) seem moderately worse than the previous version (the July preview)?
I have created a program that uses the JSON results returned by the Document Intelligence API (using the receipt model) for business-related purposes. For several months I was using the July preview version of the API and was very impressed with how much of the data it extracted correctly.
However, after updating my npm packages and making the changes needed to use the 2024-11-30 version (1.0.0, up from 1.0.0-beta.3), I notice that some of the information I get from the same PDF files is now different, and in a negative way (e.g. more letters/numbers read inaccurately, fewer fields populated in the resulting JSON files, etc.).
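To put numbers on the difference rather than eyeballing it, here is a minimal sketch of how two saved results for the same PDF could be compared. It assumes each saved JSON has a `documents` array whose entries carry a `fields` map with a `content` string per field; the interface below is a simplified stand-in for that shape, and `populatedFields` and `compareResults` are hypothetical helper names, not part of any SDK.

```typescript
// Assumed, simplified shape of a saved analyze result (only the parts used here).
interface FieldValue {
  content?: string;
  confidence?: number;
}
interface AnalyzeResultLike {
  documents?: { fields?: Record<string, FieldValue> }[];
}

interface FieldDiff {
  missingInNew: string[];   // populated in the old result, absent or empty in the new one
  changedContent: string[]; // populated in both, but the extracted text differs
}

// Collect "docIndex.fieldName" -> text for every field with non-empty content.
function populatedFields(result: AnalyzeResultLike): Record<string, string> {
  const out: Record<string, string> = {};
  (result.documents || []).forEach((doc, i) => {
    const fields = doc.fields || {};
    for (const name of Object.keys(fields)) {
      const field = fields[name];
      const text = ((field && field.content) || "").trim();
      if (text) out[i + "." + name] = text;
    }
  });
  return out;
}

// Compare the old (preview) result against the new (GA) result for the same file.
function compareResults(oldResult: AnalyzeResultLike, newResult: AnalyzeResultLike): FieldDiff {
  const oldFields = populatedFields(oldResult);
  const newFields = populatedFields(newResult);
  const diff: FieldDiff = { missingInNew: [], changedContent: [] };
  for (const key of Object.keys(oldFields)) {
    if (!Object.prototype.hasOwnProperty.call(newFields, key)) {
      diff.missingInNew.push(key);
    } else if (newFields[key] !== oldFields[key]) {
      diff.changedContent.push(key);
    }
  }
  return diff;
}
```

Running `compareResults(previewJson, gaJson)` over a folder of receipts would give a per-file list of fields that disappeared or changed, which makes the regression easier to report.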
Is this just because the API is currently sitting at version 1.0.0 and therefore needs some time to mature? Or are there other issues at play such as model overfitting - leading to degradation of performance - which were absent in previous versions of the API?
As of right now I have reverted to the July preview API version, along with the older npm package (1.0.0-beta.3), which still allows its use. However, this is only a temporary solution, since I am certain the preview version will be deprecated at some point.
Are there any others who have noticed a similar phenomenon? And if so, are there any ways to mitigate these issues?