Hi @Josh Wilson
Greetings & Welcome to Microsoft Q&A forum! Thanks for posting your query!
The error "Failed due to data collecting error" when creating a Custom Trainable Classifier in Microsoft Purview could be caused by several factors. Here are some troubleshooting steps that might help you in resolving the issue:
Check Seed Data:
- Ensure that the positive and negative seed content items are correctly placed in separate folders and that each folder contains only the respective seed content. The folders should be dedicated to holding only the seed data.
Indexing Time:
- If you create a new SharePoint site and folder for your seed data, allow at least an hour for that location to be indexed before creating the trainable classifier that will use that seed data.
Permissions Check:
- Ensure that the account you are using to create the classifier has the necessary permissions to access the SharePoint sites and folders where your positive and negative samples are stored. This includes having read access to the content within these folders.
Data Source Configuration:
- Verify that the SharePoint site URLs provided are correct and accessible. Sometimes, a typo or incorrect URL can lead to data collection errors. Make sure that the SharePoint sites are properly indexed and that the content within them is searchable.
Network and Connectivity:
- Check for any network issues that might be affecting connectivity to the SharePoint sites. Ensure that there are no firewall or proxy settings blocking access.
Retry the Process:
- If the issue persists, try to recreate the classifier by re-collecting the seed data and following the creation steps again. In the event of intermittent issues, attempting a retry process can often resolve the problem.
For details, please refer: How to create a trainable classifier
For a similar issue, you might find it helpful to check out this thread link for additional insights: https://learn.microsoft.com/en-us/answers/questions/2111187/unable-to-create-custom-trainable-classifier-due-t
I hope this information helps. Please do let us know if you have any further queries.
Thank you.