Training file: Preprocessing Summary: The provided data failed validation.

Hassan K 0 Reputation points
2025-03-03T18:39:49.1433333+00:00

I get training dataset validation errors on about one third of my training dataset. I looked at the flagged images and I do not agree with the status. The images are all RGB or RGBA files and open fine. They are all engineering documents and pictures of equipment. At the Ignite conference they said that if you need to disable checks, you should reach out to the support team. Here is the link to the video:
https://youtu.be/7LzNwsQVTZ8?t=1705

status : Training file: Preprocessing Summary: The provided data failed validation. Number of skipped multimodal examples exceed the maximum allowed 200 limit: not an image (889), image oversize (35), contains captcha (131), contains faces (6), contains people (1). Please visit our docs to learn how to resolve these issues, and try again.
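To see which skip reason dominates, the counts in the status message can be tallied with a short script. This is only a sketch: it assumes the message keeps the "reason (count)" layout shown above, and `parse_skip_counts` is a hypothetical helper name, not part of any Azure SDK.

```python
import re

# Status text copied from the failed validation run above.
STATUS = (
    "Number of skipped multimodal examples exceed the maximum allowed 200 limit: "
    "not an image (889), image oversize (35), contains captcha (131), "
    "contains faces (6), contains people (1)."
)


def parse_skip_counts(status: str) -> dict[str, int]:
    """Return {reason: count} for every 'reason (N)' pair after 'limit:'.

    Hypothetical helper; assumes the message layout shown in STATUS.
    """
    _, _, tail = status.partition("limit:")
    pairs = re.findall(r"([a-z ]+?)\s*\((\d+)\)", tail)
    return {reason.strip(): int(count) for reason, count in pairs}


counts = parse_skip_counts(STATUS)
total = sum(counts.values())

# Print reasons from most to least frequent, then the overall total.
for reason, count in sorted(counts.items(), key=lambda kv: -kv[1]):
    print(f"{reason:>18}: {count}")
print(f"{'total skipped':>18}: {total}")
```

On this message the total is 1062 skipped examples against the limit of 200, and "not an image" alone accounts for 889 of them. The other four reasons add up to 173, so the "not an image" rejections look like the ones to investigate first.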

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.