Internal error while fine tuning Whisper

Pier Luigi Susini 0 Reputation points
2025-03-06T21:57:51.3066667+00:00

While trying to fine-tune Whisper Large V2 I get an internal error:
"Model training failed. An internal server error has occurred. Retry after a while. If the issue persists".

I've tried several times and in different regions, but the problem persists. I've also tried with the sample Azure data, but I still see the same problem.

Azure AI Speech
An Azure service that integrates speech processing into apps and services.

1 answer

  1. Prashanth Veeragoni 1,190 Reputation points Microsoft External Staff
    2025-03-07T04:59:50.01+00:00

    Hi Pier Luigi Susini,

    Welcome to the Microsoft Q&A forum. Thanks for posting your query.

    The "Internal server error" while fine-tuning Whisper Large V2 in Azure OpenAI Service can have several causes, including service availability, quota limits, or backend issues in Azure. Here’s a step-by-step troubleshooting guide:

    1. Check Azure OpenAI Service Status

    Sometimes, the error is due to temporary service issues. Check the Azure status page for OpenAI services:

    Visit Azure Status and check if there are any ongoing issues in your selected region.

    If there are reported issues, wait until Microsoft resolves them.

    2. Ensure Fine-Tuning is Enabled for Your Subscription

    Fine-tuning Whisper Large V2 requires explicit approval from Microsoft. You must check if fine-tuning is enabled for your Azure OpenAI resource.

    How to Check and Request Access:

    Go to Azure OpenAI Studio → Click on your OpenAI resource.

    Navigate to "Limits and Quotas" and check if fine-tuning is available for Whisper Large V2.

    If fine-tuning is not enabled, raise a support request to Microsoft via Azure Support.

    3. Check Quotas and Limits

    Even if fine-tuning is enabled, your subscription might have exceeded the allowed limits.

    How to Check Quota:

    Open Azure Portal → Go to your Azure OpenAI resource.

    Click on Usage + quotas.

    Look for the fine-tuning token limit (e.g., number of tokens allowed per month).

    If you're exceeding the limit, request an increase from Microsoft via Azure Support.

    4. Use a Different Region

    You've already tried different regions and still see the issue, but note that only a few regions fully support Whisper fine-tuning.

    Recommended Regions for Whisper Fine-Tuning:

    East US

    West Europe

    South Central US

    Try fine-tuning in one of these regions.

    5. Use Correct Data Format and Parameters

    Fine-tuning errors may also be caused by incorrect dataset formatting.

    Check Your Training Data

    Ensure your dataset follows Azure OpenAI’s Whisper fine-tuning format:

    The dataset must be in JSONL format.

    Each line must contain a prompt field and a completion field.

    The dataset should not exceed 100M tokens.
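    As a rough way to check the first two points before uploading, here is a minimal Python sketch. It assumes the JSONL layout described above (one JSON object per line with prompt and completion fields). The function names are hypothetical and not part of any Azure SDK, and the sketch does not count tokens.

```python
import json

# Field names taken from the format described above; adjust if your dataset differs.
REQUIRED_FIELDS = ("prompt", "completion")


def validate_jsonl_lines(lines):
    """Return a list of (line_number, problem) tuples; an empty list means no problems were found."""
    problems = []
    for number, raw in enumerate(lines, start=1):
        text = raw.strip()
        if not text:
            problems.append((number, "blank line"))
            continue
        try:
            record = json.loads(text)
        except json.JSONDecodeError as exc:
            problems.append((number, f"invalid JSON: {exc.msg}"))
            continue
        if not isinstance(record, dict):
            problems.append((number, "not a JSON object"))
            continue
        for field in REQUIRED_FIELDS:
            if field not in record:
                problems.append((number, f"missing field: {field}"))
    return problems


def validate_jsonl_file(path):
    """Run the same checks on a file on disk (hypothetical helper)."""
    with open(path, encoding="utf-8") as handle:
        return validate_jsonl_lines(handle)
```

    Calling validate_jsonl_file on your training file and printing the result should show which lines, if any, need fixing before you retry the job.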

    Verify Training Command

    Use the correct fine-tuning command:

    Code

    6. Retry After Some Time (Rate Limiting Issue)

    If too many users are fine-tuning at the same time, Azure might rate-limit your request.

    Solution: Wait for a few hours and retry.
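    If you submit the job from a script, the waiting can be automated. The sketch below is only an illustration of retrying with increasing delays. The submit_job callable is a hypothetical placeholder for however you start the training job, and the delay values are assumptions, not Azure recommendations.

```python
import time


def retry_with_backoff(submit_job, max_attempts=4, base_delay=60.0, sleep=time.sleep):
    """Call submit_job() until it succeeds or max_attempts is reached.

    submit_job is a hypothetical zero-argument callable that raises on failure.
    The wait doubles after each failed attempt: base_delay, 2x, 4x, ...
    """
    last_error = None
    for attempt in range(max_attempts):
        try:
            return submit_job()
        except Exception as exc:  # broad on purpose: this is only a sketch
            last_error = exc
            # Sleep only if another attempt will follow.
            if attempt < max_attempts - 1:
                sleep(base_delay * (2 ** attempt))
    raise last_error
```

    To match the "few hours" suggestion above, you could pass a larger base_delay, for example base_delay=3600 for a first wait of one hour.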

    Hope this helps. If you still face any issues, please share your training arguments so that I can dig deeper into the problem.

    Also, please refer to the link below for Whisper v2 model details and fine-tuning steps: https://huggingface.co/openai/whisper-large-v2

    If this answers your query, please click Accept Answer and select Yes for "Was this answer helpful".

    Thank you.
