Decent results using the Custom Classification model for invoices got worse after increasing the number of training documents

Patrick Gonzalez 0 Reputation points
2025-02-04T02:20:23.25+00:00

We initially trained a model to detect invoices using 10 documents. We saw decent results, but found that some obvious invoice documents (to a human) resulted in very low confidence levels. We then increased the number of training documents to 99 and the model began producing worse results in the other direction, meaning that documents that were obviously not invoices were classified as invoices with very high confidence. Has anyone else seen this behavior?

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,891 questions
0 comments No comments
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.