Why are there differences between ASR performance of Whisper(HuggingFace) vs OpenAI Whisper difference?

Shamus Sim 0 Reputation points
2025-02-27T02:51:28.94+00:00

Hi,

In my experience of using Whisper via OpenAI Azure services and comparing ASR performance (for same dataset) with Whisper large-v3 via HuggingFace.

I have seen that OpenAI Azure Whisper performs consistently better across different metrics like WER and even semantic similarity scores like BERTScore.

Is there any reason for this? Perhaps Azure has the latest version of OpenAI that is more performant than its open-source counterpart Whisper large-v3.

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,732 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Manas Mohanty 945 Reputation points Microsoft Vendor
    2025-02-27T16:38:19.28+00:00

    Hi Shamus Sim

    It is showing version 1 of Whisper is Whisper Large v2 model.

    So, there is a model architecture difference for version 1 as mentioned in hugging face model card details

    Reference is model card details from Azure OpenAI foundry

    Screenshot 2025-02-27 215030

    Azure product team also does additional finetuning before hosting the models in Datacenter which might be reason behind the better results.

    https://github.com/openai/whisper/blob/main/model-card.md

    Hope that helps.

    Thank you.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.