Hi Harinath J,
Greetings & Welcome to Microsoft Q&A forum! Thanks for posting your query!
Is there a way to adjust request limits or rate limits directly in the Azure Portal?
Yes, we can adjust request limits or rate limits directly in the Azure Portal.
How do I properly request a quota increase for the Whisper model in Azure OpenAI?
To request a quota, increase in Azure OpenAI Service for Whisper model usage, follow these steps:
- Navigate to the Quotas page in Azure OpenAI Studio.
- Locate and click the Request Quota Increase option. This action opens a form.
- Fill out all the required details in the form. The form will look like the screenshot below:
- Request Quota Type: Select the appropriate quota type (e.g., Standard).
- Region: Choose the region where your deployment is hosted.
- Model: From the dropdown, select Whisper.
- Quota Details: Enter the required quota amount, (Requests Per Minute) as your desired quota.
Once all the fields are completed, submit the request. Please note that due to overwhelming demand, quota increase requests are being accepted and will be filled in the order they are received. Priority will be given to customers who generate traffic that consumes the existing quota allocation, and your request may be denied if this condition isn't met.
Any tips for handling higher workloads with Whisper while waiting for a quota increase?
To handle higher workloads with the Whisper model while waiting for a quota increase, optimize usage by batching audio files to reduce the number of API calls and splitting longer audio into shorter segments if possible. Consider creating multiple Whisper deployments across different Azure regions or subscriptions to distribute the load.
Hope this helps. Do let us know if you have any further queries.
If this answers your query, do click Accept Answer
and Yes
for was this answer helpful.