How do i change the request limit for whisper model in azure

Harinath J 0 Reputation points
2025-01-23T09:45:50.6666667+00:00

Hi everyone! đź‘‹

I’m currently using the Whisper model in Azure OpenAI Service and running into issues with the default request limits. I need to increase the request limit to better handle my application’s workload.

Here’s what I’ve tried/checked so far:

  1. Looked at the Usage + Quotas section in the Azure Portal to view the current limits.
  2. Explored deployment settings, but I don’t see a clear way to adjust the limits directly.
  3. Researched Azure documentation, but I couldn’t find detailed steps for increasing the Whisper model’s request limit specifically.

Questions I have:

  1. How do I properly request a quota increase for the Whisper model in Azure OpenAI?
  2. Is there a way to adjust request limits or rate limits directly in the Azure Portal?
  3. Any tips for handling higher workloads with Whisper while waiting for a quota increase?

Thanks in advance for your help! 🙏Hi everyone! 👋

I’m currently using the Whisper model in Azure OpenAI Service and running into issues with the default request limits. I need to increase the request limit to better handle my application’s workload.

Here’s what I’ve tried/checked so far:

  1. Looked at the Usage + Quotas section in the Azure Portal to view the current limits.
  2. Explored deployment settings, but I don’t see a clear way to adjust the limits directly.
  3. Researched Azure documentation, but I couldn’t find detailed steps for increasing the Whisper model’s request limit specifically.

Questions I have:

  1. How do I properly request a quota increase for the Whisper model in Azure OpenAI?
  2. Is there a way to adjust request limits or rate limits directly in the Azure Portal?
  3. Any tips for handling higher workloads with Whisper while waiting for a quota increase?

Thanks in advance for your help! 🙏

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,885 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,591 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,076 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Pavankumar Purilla 2,930 Reputation points Microsoft Vendor
    2025-01-25T00:35:00.6+00:00

    Hi Harinath J,
    Greetings & Welcome to Microsoft Q&A forum! Thanks for posting your query!

    Is there a way to adjust request limits or rate limits directly in the Azure Portal?
    Yes, we can adjust request limits or rate limits directly in the Azure Portal.

    How do I properly request a quota increase for the Whisper model in Azure OpenAI?
    To request a quota, increase in Azure OpenAI Service for Whisper model usage, follow these steps:

    • Navigate to the Quotas page in Azure OpenAI Studio.
    • Locate and click the Request Quota Increase option. This action opens a form.
    • Fill out all the required details in the form. The form will look like the screenshot below:
      User's image
    • Request Quota Type: Select the appropriate quota type (e.g., Standard).
    • Region: Choose the region where your deployment is hosted.
    • Model: From the dropdown, select Whisper.
    • Quota Details: Enter the required quota amount, (Requests Per Minute) as your desired quota.
      User's image

    Once all the fields are completed, submit the request. Please note that due to overwhelming demand, quota increase requests are being accepted and will be filled in the order they are received. Priority will be given to customers who generate traffic that consumes the existing quota allocation, and your request may be denied if this condition isn't met.

    Any tips for handling higher workloads with Whisper while waiting for a quota increase?
    To handle higher workloads with the Whisper model while waiting for a quota increase, optimize usage by batching audio files to reduce the number of API calls and splitting longer audio into shorter segments if possible. Consider creating multiple Whisper deployments across different Azure regions or subscriptions to distribute the load.

    Hope this helps. Do let us know if you have any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.